Support loading model from wandb #184

Merged
merged 3 commits into from
Sep 25, 2024

Contributor

@vwxyzjn vwxyzjn commented Sep 23, 2024

I'd like to get your high-level thoughts. This PR lets us load a model from wandb and save the evaluation results back to wandb as well.

E.g., https://wandb.ai/ai2-llm/open_instruct_internal/runs/u9f16bws?nw=nwusercostah

image

And then you can do this kind of visualization.

image
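
As a rough illustration of the "save the evaluated results to wandb" half of the description, here is a minimal sketch using wandb's public API. It is not the code from this PR: the helper names `wandb_run_path` and `save_eval_to_run` are hypothetical, and it assumes the results are written into the run's summary. How the PR actually locates and loads the model weights is not shown in this conversation, so that part is left out.

```python
from urllib.parse import urlparse


def wandb_run_path(url: str) -> str:
    """Turn a wandb run URL into the 'entity/project/run_id' path form."""
    # urlparse drops the query string (e.g. ?nw=...), leaving only the path.
    parts = [p for p in urlparse(url).path.split("/") if p]
    # Expected layout: <entity>/<project>/runs/<run_id>
    if len(parts) != 4 or parts[2] != "runs":
        raise ValueError(f"not a wandb run URL: {url}")
    entity, project, _, run_id = parts
    return f"{entity}/{project}/{run_id}"


def save_eval_to_run(url: str, results: dict) -> None:
    """Write evaluation results into the summary of an existing run.

    Hypothetical helper: needs the wandb package and a logged-in account.
    """
    import wandb  # imported lazily so wandb_run_path has no dependency

    run = wandb.Api().run(wandb_run_path(url))
    for key, value in results.items():
        run.summary[key] = value
    run.summary.update()  # push the modified summary to the server
```

With the run linked above, `wandb_run_path` would produce `ai2-llm/open_instruct_internal/u9f16bws`, which is the path form `wandb.Api().run(...)` accepts.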

@vwxyzjn vwxyzjn requested a review from natolambert September 23, 2024 13:44
Collaborator

@natolambert natolambert left a comment

LGTM, I'm guessing you tested this? One small question.

Also, fix the style

make quality

@@ -389,8 +414,9 @@ def main():

     with open(output_path, "w") as f:
         for chosen, rejected in zip(scores_chosen, scores_rejected):
-            f.write(json.dumps({"chosen": scores_chosen, "rejected": scores_rejected}) + "\n")
+            f.write(json.dumps({"chosen": chosen, "rejected": rejected}) + "\n")
Collaborator

Was this just a bug?

Contributor Author

Yeah, I think so.

Previous:

image

After the change in this PR:

image
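
For readers without access to the screenshots, the bug discussed in this thread can be reproduced in isolation. The sketch below is not the script from the PR; the score values and the function names `dump_buggy` and `dump_fixed` are made up for illustration. It shows that the old line ignored the loop variables and serialized the full lists on every iteration, while the new line writes one chosen/rejected pair per row.

```python
import json

# Illustrative scores only; not taken from any real evaluation run.
scores_chosen = [0.9, 0.7, 0.8]
scores_rejected = [0.1, 0.4, 0.3]


def dump_buggy(chosen_scores, rejected_scores):
    # Old behaviour: the loop variables are unused, so every row
    # contains the complete lists instead of a single pair.
    return [
        json.dumps({"chosen": chosen_scores, "rejected": rejected_scores})
        for _chosen, _rejected in zip(chosen_scores, rejected_scores)
    ]


def dump_fixed(chosen_scores, rejected_scores):
    # New behaviour: each row holds exactly one chosen/rejected pair.
    return [
        json.dumps({"chosen": chosen, "rejected": rejected})
        for chosen, rejected in zip(chosen_scores, rejected_scores)
    ]


buggy_lines = dump_buggy(scores_chosen, scores_rejected)
fixed_lines = dump_fixed(scores_chosen, scores_rejected)

print(buggy_lines[0])
print(fixed_lines[0])
```

In the buggy version every output row is identical, so a file with N examples repeats all N scores N times; the fixed version yields one JSON object per example.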

@vwxyzjn vwxyzjn merged commit 984b799 into main Sep 25, 2024
3 checks passed