Amazing work!
However, when I tried to reproduce Figure 6 based on commit b1dd507, I found that it differs quite a lot from the one in the paper on Llama-3.1 8B. What could be the reason for this? Additionally, I found that the code for reproducing Figure 6 in commit 0412fc2 contains quite a few bugs, so I tried using the latest commit to reproduce it.
| Method |
token_sparse_method |
Perplexity |
Arc-Easy Accuracy |
Hellaswag Accuracy |
PIQA Accuracy |
Winogrande Accuracy |
| Oracle |
fixed_60pc |
6.7961 |
0.732 |
0.496 |
0.745 |
0.595 |
| Quest |
fixed_60pc |
79.74 |
0.342 |
0.322 |
0.558 |
0.498 |
| ExpPred |
fixed_60pc |
102.1971 |
0.359 |
0.328 |
0.568 |
0.511 |
Amazing work!
However, when I tried to reproduce Figure 6 based on commit b1dd507, I found that it differs quite a lot from the one in the paper on Llama-3.1 8B. What could be the reason for this? Additionally, I found that the code for reproducing Figure 6 in commit 0412fc2 contains quite a few bugs, so I tried using the latest commit to reproduce it.