Releases · ymetz/rlhfblender

05 Nov 21:11

ymetz

0.3.1.1

1852015

Release 0.3.1 Latest

Latest

Changes:

New Demo Models without GIT-LFS dependency (to maximize compatability)
Updated Correction Modal
Updated UI State Logic
Multi-Config Experiments (i.e. changing UI configs over training)
Logging of meta events such as submit/reset for reaction time measurements

Assets 2

31 Oct 19:20

ymetz

0.3.0

902d29c

Release 0.3.0 - Text Feedback, Experiment Intro, Study URLs & more

Many updates & Code Cleanup:

This update adds text feedback as a new modality, brings a major re-design including an updated intro modal, color scheme, controls, etc. Saved setup configurations can now be accessed via custom URLs making deployment for custom study setups easy and scalable.

Text Feedback
Updated Design
Setup Saving & Loading with custom URL for deployment
Massively simplified data generation
Prototype interface for keyboard shortcuts
Code restructuring in frontend (reworked state management, and increased modaluarity)
Updated demo models, compatible with gymnasium & newest StableBaselines3
New & improved intro modal
Updated Docs

Next steps:

Finishing keyboard shortcuts
Finishing input for multi-modal and text tasks/scenarios
Reward model implementations

For questions and bug-reports, feel free to reach out to @ymetz

Contributors

ymetz

Assets 2

15 Mar 14:36

ymetz

0.3.0pre

cfdda98

Pre-Release 0.3.0 Pre-release

Pre-release

First pre-release:

Experiments to collect and log feedback (https://rlhfblender.readthedocs.io/en/latest/guide/quickstart.html)
Registration of gym environments (https://rlhfblender.readthedocs.io/en/latest/guide/add_new_experiment.html)
Fully functional user interface

To-Do's until full release:

Reward Modeling components need testing and verificaiton
User Tracking with Motomo
Tutorial/Jupyter Notebook to showcase analyis of logged feedback

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Contributors

Releases: ymetz/rlhfblender

Release 0.3.1

Release 0.3.0 - Text Feedback, Experiment Intro, Study URLs & more

Contributors

Pre-Release 0.3.0