- This project is based on d3rlpy, an open-source offline RL framework (MIT License).
- The objective of this project is to provide an integrated RL framework for manipulating everyday objects, including deformable objects.
- This repository includes the outcomes of the following techniques, developed by KAIST and Korea University with support from the everyday object manipulation project (Level 1, 2022~2024):
- CORN (Cho et al., "CORN: Contact-based Object Representation for Nonprehensile Manipulation of General Unseen Objects." ICLR 2024. link)
- DrilDICE (Seo et al., "Mitigating Covariate Shift in Behavioral Cloning via Robust Distribution Correction Estimation." NeurIPS 2024. link)
- PORelDICE (Kim et al., "Relaxed Stationary Distribution Correction Estimation for Improved Offline Policy Optimization." AAAI 2024. link)
Install this package with:
git clone git@github.com:KAIST-AILab/d3rlpy-everyday-objects.git
cd d3rlpy-everyday-objects
pip install -e .
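After the editable install, a quick check that the package resolves from the current Python environment can save debugging time later. The sketch below uses only the standard library and assumes the installed package keeps the upstream import name d3rlpy (an assumption, since this repository is a fork); the helper name is_importable is hypothetical.

```python
import importlib.util


def is_importable(module_name: str) -> bool:
    """Return True if a top-level module with this name can be located."""
    return importlib.util.find_spec(module_name) is not None


# "d3rlpy" is assumed to be the import name provided by this repository.
if is_importable("d3rlpy"):
    print("d3rlpy is importable")
else:
    print("d3rlpy not found; re-run `pip install -e .` from the repository root")
```

If the check fails, the usual cause is that pip installed into a different environment than the one running the interpreter.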
- Clone corn@iitp-d3rlpy (link) as in:
  git clone --branch iitp-d3rlpy https://github.com/iMSquared/corn.git
- Follow the docker build and setup instructions in the README.
- Afterward, clone the yycho0108/d3rlpy-everyda-objects@imm-corn repo and place it in an appropriate location within the docker image.
- For integration with d3rlpy, refer to the sample scripts provided in scripts/imm-corn, where:
  - create_corn_dataset collects the dataset.
  - train_corn_sac_offline uses the dataset to train SAC (it is only a sample script, so the trained agent may not perform well).
  - train_corn_sac_online trains SAC online on an environment with num_env=1. This is very inefficient, so it is mostly meant for debugging.
When running the above scripts, you may need to configure data collection parameters, such as the number of parallel environments or the runtime device.
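How these parameters are exposed depends on the scripts themselves, so check them before relying on any particular option. As a minimal sketch of the idea only, the following shows one way such settings could be passed on the command line. The flag names --num-env and --device and the helper names are hypothetical and are not taken from the scripts in scripts/imm-corn.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical CLI for a data-collection script (flag names are illustrative)."""
    parser = argparse.ArgumentParser(description="Sketch of data-collection settings")
    # Number of simulated environments stepped in parallel.
    parser.add_argument("--num-env", type=int, default=1)
    # Device string for the simulator and policy, e.g. "cpu" or "cuda:0".
    parser.add_argument("--device", type=str, default="cpu")
    return parser


def parse_settings(argv=None) -> argparse.Namespace:
    """Parse the settings and reject a non-positive environment count."""
    args = build_parser().parse_args(argv)
    if args.num_env < 1:
        raise ValueError("--num-env must be at least 1")
    return args


# Example: request four parallel environments on the first GPU.
settings = parse_settings(["--num-env", "4", "--device", "cuda:0"])
print(f"num_env={settings.num_env}, device={settings.device}")
```

Larger environment counts generally speed up dataset collection, while num_env=1 matches the debugging-oriented online script described above.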
- DrilDICE: use the reproduction code in reproductions/offline/drildice.py.
- PORelDICE: use the reproduction code in reproductions/offline/proel.py.
- This project was supported by an Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government (MSIT) (No. RS-2022-II220311, Development of Goal-Oriented Reinforcement Learning Techniques for Contact-Rich Robotic Manipulation of Everyday Objects).