Can't reproduce the results of Figure.6

Amazing work!

However, when I tried to reproduce Figure 6 based on commit b1dd507a8f117d2e5ebf3c86f91fe7cea28af94e, I found that it differs quite a lot from the one in the paper on Llama-3.1 8B. What could be the reason for this? Additionally, I found that the code for reproducing Figure 6 in commit 0412fc24a3b770e4d82e6d7064a8172f24c5fcd3 contains quite a few bugs, so I tried using the latest commit to reproduce it.

<img width="1097" alt="Image" src="https://github.com/user-attachments/assets/566b0003-c7d9-4d6d-a792-8a18273ecfe9" />


| Method    | token_sparse_method |Perplexity   | Arc-Easy Accuracy | Hellaswag Accuracy | PIQA Accuracy | Winogrande Accuracy |
|-----------|--------------|--------------|-------------------|--------------------|---------------|---------------------|
| Oracle    | fixed_60pc| 6.7961       | 0.732             | 0.496              | 0.745         | 0.595               |
| Quest    | fixed_60pc|   79.74     |       0.342    | 0.322    | 0.558       |  0.498            |
| ExpPred   |fixed_60pc | 102.1971     | 0.359             | 0.328              | 0.568         | 0.511               |


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can't reproduce the results of Figure.6 #7

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Method	token_sparse_method	Perplexity	Arc-Easy Accuracy	Hellaswag Accuracy	PIQA Accuracy	Winogrande Accuracy
Oracle	fixed_60pc	6.7961	0.732	0.496	0.745	0.595
Quest	fixed_60pc	79.74	0.342	0.322	0.558	0.498
ExpPred	fixed_60pc	102.1971	0.359	0.328	0.568	0.511

Can't reproduce the results of Figure.6 #7

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions