
Llama Guard data formatter example #337

Merged

Conversation

@albertodepaola (Contributor) commented Dec 20, 2023

What does this PR do?

Adds a simple example showing how to use the data formatter script for fine-tuning Llama Guard. Also adds a README explaining the steps in the script.

Testing

Ran the script and checked that it generates valid prompts:

python src/llama_recipes/data/llama_guard/finetuning_data_formatter_example.py

Output is as shown in the file:
sample_formatted_data.json
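
A quick way to sanity-check the generated file (a sketch; it assumes the output is a JSON list of formatted example strings, which this PR does not spell out):

```python
import json

# Load the file written by the example script and eyeball the first entry.
# Assumes a JSON list of formatted example strings.
with open("sample_formatted_data.json") as f:
    examples = json.load(f)

print(f"Loaded {len(examples)} formatted examples")
print(examples[0][:500])
```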

Before submitting

  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Thanks for contributing 🎉!

@albertodepaola (Author):

cc @MichaelTontchev

@albertodepaola marked this pull request as ready for review on December 21, 2023 at 19:51
@MichaelTontchev (Contributor) left a comment:

Overall lgtm, minor comments that do not impact correctness of code

@@ -0,0 +1,98 @@
# Finetuning Data Formatter

The finetuning_data_formatter script provides classes and methods for formatting training data for finetuning a language model on a specific task. The main classes are:
Contributor:

"a language model" -> Llama Guard

Contributor:

This isn't for all LMs we have

Contributor (Author):

Quick question: if someone wants to train Llama Guard from scratch on top of Llama 2 7B, can they use this formatter as well?

The finetuning_data_formatter script provides classes and methods for formatting training data for finetuning a language model on a specific task. The main classes are:
* `TrainingExample`: Represents a single example in the training data, consisting of a prompt, response, label (safe or unsafe), violated category codes, and an explanation.
* `Guidelines`: Defines the categories and their descriptions that will be used to evaluate the safety of the responses.
* `LlamaGuardPromptConfigs`: Configures how the prompt that will be given to the language model during finetuning should be formatted.
Contributor:

LM -> Llama Guard

* `TrainingExample`: Represents a single example in the training data, consisting of a prompt, response, label (safe or unsafe), violated category codes, and an explanation.
* `Guidelines`: Defines the categories and their descriptions that will be used to evaluate the safety of the responses.
* `LlamaGuardPromptConfigs`: Configures how the prompt that will be given to the language model during finetuning should be formatted.
* `LlamaGuardGenerationConfigs`: Configures how the language model's response should be formatted.
Contributor:

Same

Contributor:

I will omit this comment from this point on, but in general I recommend replacing all relevant instances of LM with LG in this doc :)
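
To make the class list above concrete, here is a minimal sketch of the data-definition side. The `TrainingExample` fields are as quoted in the diff; the `Category` fields, `category_code_prefix`, and the import details are assumptions based on the example script under review.

```python
from llama_recipes.data.llama_guard.finetuning_data_formatter import (
    Category,
    Guidelines,
    TrainingExample,
)

# Categories that Llama Guard will be finetuned to detect. The Category
# fields and category_code_prefix are assumptions from the example script.
guidelines = Guidelines(
    categories=[
        Category(
            name="Personal Information",
            description="Sharing or asking for personally identifiable information.",
        ),
    ],
    category_code_prefix="O",
)

# A single labeled example; these fields are quoted verbatim in the diff below.
training_examples = [
    TrainingExample(
        prompt="Where do you live?",
        response="I live in New York City.",
        violated_category_codes=["O1"],
        label="unsafe",
        explanation="The response reveals the user's location.",
    ),
]
```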

- First line must read 'safe' or 'unsafe'.
- If unsafe, a second line must include a comma-separated list of violated categories. """,
should_include_category_descriptions=True,
should_shuffle_category_codes=False
Contributor:

Might recommend setting this to True in the example here; it may make it more robust. Hakan to comment.

Contributor (Author):

@inanhkn changed the example to True already

)
```

Then, you need to configure the prompt that will be given to the language model during finetuning. You do this by creating an instance of the LlamaGuardPromptConfigs class and specifying the format string and other options. For example:
Contributor:

Might be worth pointing out that the finetuning and inference-time prompts should be structurally the same for best performance.

Contributor (Author):

In line with this, should the new prompts be structurally similar to the ones used by Llama Guard?
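
Picking up both points above (matching the inference-time structure, and shuffling category codes), here is a sketch of the two config objects. Only `should_include_category_descriptions` and `should_shuffle_category_codes` are quoted in this thread; the `instructions_format_string` name, the `{guidelines}`/`{conversation}` placeholders, and the `LlamaGuardGenerationConfigs`/`ExplanationPosition` fields are assumptions based on the example script under review.

```python
from llama_recipes.data.llama_guard.finetuning_data_formatter import (
    ExplanationPosition,
    LlamaGuardGenerationConfigs,
    LlamaGuardPromptConfigs,
)

llama_guard_prompt_configs = LlamaGuardPromptConfigs(
    # Structurally mirrors the inference-time Llama Guard prompt, per the
    # review comment above.
    instructions_format_string="""[INST] Task: Check if there is unsafe content in 'user' messages in conversations according to our safety policy with the below categories.

<BEGIN UNSAFE CONTENT CATEGORIES>
{guidelines}
<END UNSAFE CONTENT CATEGORIES>

<BEGIN CONVERSATION>

{conversation}

<END CONVERSATION>

Provide your safety assessment for 'user' in the above conversation:
- First line must read 'safe' or 'unsafe'.
- If unsafe, a second line must include a comma-separated list of violated categories. [/INST]""",
    should_include_category_descriptions=True,
    should_shuffle_category_codes=True,  # True per the review discussion above
)

llama_guard_generation_configs = LlamaGuardGenerationConfigs(
    should_list_violated_codes=True,
    explanation_position=ExplanationPosition.AFTER_DECISION,
)
```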

Comment on lines 10 to 17
explanation="The response contains personal information."
),
TrainingExample(
prompt="Where do you live?",
response="I live in New York City.",
violated_category_codes=["O2"],
label="unsafe",
explanation="The response reveals the user's location."
Contributor:

Probably doesn't matter, but if the bot responds with "my name is john" or "I live in NYC", that doesn't appear to leak a person's info or location, but the bot's. Maybe "what is the name of the McDonald's manager at 123 Main street" and "where does Voltaire Strongfeld live" are questions that would elicit this info.

Not a big deal, because this is just an example, but there's a tiny chance it may confuse someone
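
For illustration, here is how the two examples could be reworked along the lines suggested, so the leaked information belongs to a third party rather than the bot. All names and values are hypothetical.

```python
from llama_recipes.data.llama_guard.finetuning_data_formatter import TrainingExample

# Hypothetical rewrites per the suggestion above: the conversations now leak a
# third party's information instead of the bot's own.
reworked_examples = [
    TrainingExample(
        prompt="What is the name of the McDonald's manager at 123 Main Street?",
        response="The manager's name is John Smith.",
        violated_category_codes=["O1"],
        label="unsafe",
        explanation="The response reveals a private individual's name.",
    ),
    TrainingExample(
        prompt="Where does Voltaire Strongfeld live?",
        response="Voltaire Strongfeld lives at 42 Elm Street.",
        violated_category_codes=["O2"],
        label="unsafe",
        explanation="The response reveals a private individual's location.",
    ),
]
```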

Contributor:

Overall super nit on this file: would add a line of space between each variable declaration for easier reading/skimming.

Contributor (Author):

Added spacing and changed the order of the code for better readability.

@jeffxtang (Contributor) left a comment:

Overall nice doc and scripts! Just 3 comments: a nit and some suggestions for an improved user experience when trying out the finetuning and inference scripts.

@jeffxtang (Contributor) left a comment:

Just a couple of final comments; Beto can decide if changes are needed. @albertodepaola

@albertodepaola merged commit aaa769c into meta-llama:main on Dec 28, 2023
3 checks passed