Skip to content

Verifiers for LLM Reinforcement Learning

License

Notifications You must be signed in to change notification settings

bespokelabsai/verifiers

 
 

Repository files navigation

Verifiers: Reinforcement Learning with LLMs in Verifiable Environments

This repo is a fork of the verifiers repo. The development of this repo is using an older version of the verifiers repo. We will sync this fork with the latest version of the verifiers repo soon.

Installation

To install the dependencies, run the following commands:

git clone https://github.com/bespokelabsai/verifiers.git
cd verifiers
uv sync
source .venv/bin/activate
uv pip install flash-attn --no-build-isolation

Reproducing the BFCL Training Result

To reproduce our result, run the following command:

accelerate launch --config-file configs/zero3.yaml --num-processes 3 verifiers/examples/bfcl_agent.py

The configurations/hyperparameters used are specified in verifiers/examples/bfcl_agent.py as global variables.

About

Verifiers for LLM Reinforcement Learning

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%