-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about Experiment Settings #1
Comments
Hi, thank you for the question! Can you try setting Regarding the random seeds, we found that the impact of the random seed is minimal due the relatively high dimensions of the matrices and iterative nature of the algorithm (and especially so if finetuning is performed over the diagonal matrices of the randomized Hadamard transform, though that is an optional step). Also, in case you are interested in using the CALDERA-quantized version of LLaMa-2-7B that we computed, you can now find it here on Huggingface. This checkpoint has been obtained with the above configuration, and achieves the reported PPL. Edit: 15 outer iterations should work, as long as the number of inner iterations is 50. |
I will proceed with the experiments based on the settings you kindly provided. |
Hi @soeun-22 did the above configuration help? If so, please feel free to resolve the issue. |
Hello,
Thank you for providing such an excellent paper and code! I truly appreciate your contributions.
I’ve been running experiments using your code and encountered a question regarding the experimental settings. Specifically, my Wikitext2 PPL results seem to differ from the results reported in Table 1.
I conducted the experiment using the following settings:
With these settings, I obtained a PPL of '6.4685444831848145', which is higher than the reported results in Table 1.
I would like to ask:
For clarity, I have attached a screenshot of my experimental setup.

Thank you again for this remarkable project and for your support.
I look forward to your guidance and hope you have a great day!
The text was updated successfully, but these errors were encountered: