-
Notifications
You must be signed in to change notification settings - Fork 61
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
What is the meaning of score_gt and score_cand? #16
Comments
Btw what is the exact meaning of bs in your code? @muqeeth
|
bs here in the code is batch size I think. |
But what does it mean for score_sand & gt @muqeeth |
score_cand and score_gt means the average score for wrong answers and correct answers. |
@HaokunLiu imagine I'd like to persist the scores as probabilities, is it safe to assume that |
Ha, you found this issue. In fact, if we are going to compute a probability distribution over all the choices (including correct and incorrect), they should be considered as -logits rather than probabilities. They correspond to $ - \beta (x, y)$ from eq. 2 in the paper. |
What is the meaning of score_gt and score_cand? How do I better run the model by observing these parameters?
The text was updated successfully, but these errors were encountered: