Support loading model from wandb #184

vwxyzjn · 2024-09-23T13:44:30Z

Would like to get your high-level thoughts. This PR allows us to load model from wandb and save the evaluated results to wandb as well.

E.g., https://wandb.ai/ai2-llm/open_instruct_internal/runs/u9f16bws?nw=nwusercostah

And then you can do this kind of visualization.

natolambert

LGTM, I'm guessing you tested this? One small question.

Also, fix the style

make quality

natolambert · 2024-09-23T18:45:22Z

rewardbench/rewardbench.py

@@ -389,8 +414,9 @@ def main():

        with open(output_path, "w") as f:
            for chosen, rejected in zip(scores_chosen, scores_rejected):
-                f.write(json.dumps({"chosen": scores_chosen, "rejected": scores_rejected}) + "\n")
+                f.write(json.dumps({"chosen": chosen, "rejected": rejected}) + "\n")


Was this just a bug?

Yeah I think so,

Previous:

After the change in this PR:

Support loading model from wandb

e70e58f

vwxyzjn requested a review from natolambert September 23, 2024 13:44

quick fix

7f501f6

natolambert approved these changes Sep 23, 2024

View reviewed changes

address comment

ba94c43

vwxyzjn merged commit 984b799 into main Sep 25, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support loading model from wandb #184

Support loading model from wandb #184

vwxyzjn commented Sep 23, 2024

natolambert left a comment •

edited

Loading

natolambert Sep 23, 2024

vwxyzjn Sep 25, 2024

Support loading model from wandb #184

Support loading model from wandb #184

Conversation

vwxyzjn commented Sep 23, 2024

natolambert left a comment • edited Loading

Choose a reason for hiding this comment

natolambert Sep 23, 2024

Choose a reason for hiding this comment

vwxyzjn Sep 25, 2024

Choose a reason for hiding this comment

natolambert left a comment •

edited

Loading