Print per-token reward over an RM #9

natolambert · 2024-02-07T03:31:36Z

Brief documentation in analysis/README.md
We can add a visualization soon :)

E.g.
Reward: -0.544 | Substring: I
Reward: -0.556 | Substring: I love
Reward: -0.566 | Substring: I love to
Reward: 0.099 | Substring: I love to walk
Reward: 0.096 | Substring: I love to walk the
Reward: 0.092 | Substring: I love to walk the dog
Reward: 0.09 | Substring: I love to walk the dog,
Reward: 0.087 | Substring: I love to walk the dog, what
Reward: 0.085 | Substring: I love to walk the dog, what do
Reward: 0.089 | Substring: I love to walk the dog, what do you
Reward: 0.09 | Substring: I love to walk the dog, what do you like
Reward: 0.093 | Substring: I love to walk the dog, what do you like?
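
A minimal sketch of how output like the above could be produced: score every growing prefix of the token sequence and print one line per prefix. This is not the code in analysis/per_token_reward.py. `per_prefix_rewards` and `toy_score` are hypothetical names, joining tokens with a space is a simplification of real detokenization, and `toy_score` is only a stand-in for a call into an actual reward model.

```python
from typing import Callable, List, Tuple


def per_prefix_rewards(
    tokens: List[str],
    score_fn: Callable[[str], float],
    sep: str = " ",
) -> List[Tuple[float, str]]:
    """Score each growing prefix of `tokens` with `score_fn`.

    `score_fn` is assumed to map a text string to a scalar reward;
    in practice it would wrap a reward model pipeline.
    """
    results = []
    for end in range(1, len(tokens) + 1):
        # Rebuild the text seen so far (simplified: join with `sep`).
        prefix = sep.join(tokens[:end])
        results.append((score_fn(prefix), prefix))
    return results


def toy_score(text: str) -> float:
    # Hypothetical stand-in for a reward model: depends only on word count.
    return round(0.1 * len(text.split()) - 0.5, 3)


if __name__ == "__main__":
    for reward, prefix in per_prefix_rewards("I love to walk".split(), toy_score):
        print(f"Reward: {reward} | Substring: {prefix}")
```

Each prefix is scored independently, so a text of n tokens costs n forward passes. Batching the prefixes would likely be the first optimization to try.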

ljvmiranda921 · 2024-02-07T16:39:38Z

Will review later today!

ljvmiranda921 · 2024-02-07T17:42:54Z

analysis/per_token_reward.py

+    args = get_args()
+    quantized = True  # only Starling isn't quantized for now
+    custom_dialogue = False
+    # some models need custom code to be run
+    if "oasst" in args.model or "oasst" in args.chat_template:
+        from herm.models import openassistant  # noqa
+
+        model_builder = AutoModelForSequenceClassification.from_pretrained
+        pipeline_builder = pipeline
+    elif "Starling" in args.model or "Starling" in args.chat_template:
+        from herm.models.starling import StarlingPipeline, build_starling_rm
+
+        model_builder = build_starling_rm
+        pipeline_builder = StarlingPipeline
+        quantized = False
+    elif "openbmb" in args.model or "openbmb" in args.chat_template:
+        from herm.models.openbmb import LlamaRewardModel, OpenBMBPipeline
+
+        model_builder = LlamaRewardModel.from_pretrained
+        pipeline_builder = OpenBMBPipeline
+    elif "PairRM" in args.model or "PairRM" in args.chat_template:
+        from herm.models.pairrm import DebertaV2PairRM, PairRMPipeline
+
+        custom_dialogue = True
+        model_builder = DebertaV2PairRM.from_pretrained
+        pipeline_builder = PairRMPipeline
+    elif "SHP" in args.model or "SHP" in args.chat_template:
+        from herm.models.shp import SHPPipeline
+
+        custom_dialogue = True
+        model_builder = T5ForConditionalGeneration.from_pretrained
+        pipeline_builder = SHPPipeline
+    else:
+        model_builder = AutoModelForSequenceClassification.from_pretrained
+        pipeline_builder = pipeline
+
+    if custom_dialogue:
+        raise ValueError("Custom dialogue formatting not yet supported in this script")


If we're going to reuse this code block in the future, we should factor this logic out (so that we can also use it in run_rm.py), but imo it's fine for v1 🤔

Yeah I agree @ljvmiranda921, and maybe add a test case.
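
One possible shape for the refactor discussed above, as a hedged sketch rather than what the repo actually does: move the substring matching into a small lookup that both scripts could call, and keep it free of heavy imports so a test case is cheap to write. `ModelConfig`, `select_model_family`, and the `"default"` family name are hypothetical. The caller would still map the returned family to the real `model_builder` and `pipeline_builder` from the diff.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelConfig:
    family: str                    # substring matched against model / chat template
    quantized: bool = True         # only Starling is not quantized in the diff
    custom_dialogue: bool = False  # PairRM and SHP need custom formatting


# Order mirrors the if/elif chain in the diff; the first match wins.
_FAMILIES = (
    ModelConfig("oasst"),
    ModelConfig("Starling", quantized=False),
    ModelConfig("openbmb"),
    ModelConfig("PairRM", custom_dialogue=True),
    ModelConfig("SHP", custom_dialogue=True),
)
_DEFAULT = ModelConfig("default")


def select_model_family(model: str, chat_template: str = "") -> ModelConfig:
    """Return the config for the first family named in either argument."""
    for cfg in _FAMILIES:
        if cfg.family in model or cfg.family in chat_template:
            return cfg
    return _DEFAULT


if __name__ == "__main__":
    cfg = select_model_family("some-org/Starling-RM")  # made-up model id
    print(cfg)
```

Keeping the lookup separate from the imports means the family-specific `herm.models` modules can still be imported lazily by the caller, as the original block does.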

ljvmiranda921

LGTM!

works

c599c43

natolambert requested a review from ljvmiranda921 February 7, 2024 03:31

natolambert added 2 commits February 7, 2024 03:34

nit

6091265

quality

84879c5

ljvmiranda921 reviewed Feb 7, 2024


ljvmiranda921 approved these changes Feb 7, 2024


natolambert merged commit ed1bffa into main Feb 7, 2024
3 checks passed

ljvmiranda921 deleted the per_token branch February 9, 2024 01:06
