Make anthropic tool reminder injection optional #701

tw-aisi · 2024-10-15T12:54:48Z

Only one of this PR and #700 should be merged. These two PRs are two variants of a fix.

This PR contains:

What is the current behavior? (You can also link to an open issue here)

Currently, whenever we call an Anthropic model with tool calls enabled, a system message "Before answering, explain your reasoning step-by-step in tags." is forcibly injected at the end of the existing system message.

This injection cannot be turned off and is very difficult to detect in the log viewer -- you can see it in the json details of the transcript log which logs raw model requests but not the main Messages view.

What is the new behavior?

The existing behavior is maintained but can now be turned off by setting the anthropic_tool_reminder_injection flag in GenerateConfig. This flag can also be adjusted using the environment variable ANTHROPIC_TOOL_REMINDER_INJECTION and the CLI flag --anthropic-tool-reminder-injection or --no-anthropic-tool-reminder-injection.

An alternate solution is to remove this behavior entirely and give no option for turning it back on (unless the user hardcodes it in at some higher layer of the stack). #700 implements the complete removal change.

Does this PR introduce a breaking change? (What changes might users need to make in their application due to this PR?)

No.

jjallaire · 2024-10-15T22:41:16Z

Closing in lieu of #700

Make anthropic tool reminder injection optional

5c23dc4

tw-aisi mentioned this pull request Oct 15, 2024

Remove anthropic tool reminder injection #700

Merged

5 tasks

jjallaire closed this Oct 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make anthropic tool reminder injection optional #701

Make anthropic tool reminder injection optional #701

tw-aisi commented Oct 15, 2024 •

edited

Loading

jjallaire commented Oct 15, 2024

Make anthropic tool reminder injection optional #701

Make anthropic tool reminder injection optional #701

Conversation

tw-aisi commented Oct 15, 2024 • edited Loading

This PR contains:

What is the current behavior? (You can also link to an open issue here)

What is the new behavior?

Does this PR introduce a breaking change? (What changes might users need to make in their application due to this PR?)

jjallaire commented Oct 15, 2024

tw-aisi commented Oct 15, 2024 •

edited

Loading