v0.1.0
Major release: refactoring of models, trainer and pipelines, and more! 🚀
- Refactored the old `task` class into a base class (`RL4COLitModule`) that is the base for RL algorithms (such as REINFORCE and PPO), following the discussion in #67
- New base class for construction methods: now encoder, decoder, policy, and model can be based on common parent classes to make implementation much more modular
- Added native loading from checkpoints (checkpoint loading used to be buggy)
- Nice new logo (we like it, but we are obviously biased, so feel free to give us your opinion ;) )
- Added mPDP environment (and added some WIP for EquityTransformer)
- New `RL4COTrainer` that automatically includes training tricks for RL
- Added Codecov coverage
- Better testing: now we thoroughly test most of the library, including training (the Hydra part as well!)
- Documentation overhaul: added Sphinx plugins for modularized, automatic docs
- ... and more!