Skip to content

khtks/palr

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Implementation Code for PALR (Past Action Leakage Regularization)

  • This code contains official implementation codes of PALR, which is used to produce experimental results in the original paper presented in NeurIPS 2023 (pdf).
  • The code contains PALR and its baseline 4 methods (BC, FCA, MINE, RAP).

1. Prerequisites

  • To run this code, first install the anaconda virtual environment and install D4RL:
conda env create -f environment.yml
conda activate palr
pip install d4rl
  • (optional) Download D4RL dataset:
python download_d4rl_dataset.py

2. Train & Evaluate PALR

  • Train imitation policies using main.py. For the ease of hyperparameter search, pid pass into main code and pid-th configuration of the overall grid will be executed. Note that the default setting is:
    methodlist        = ['BC', 'RAP', 'FCA', 'MINE', 'PALR']
    envlist           = ['Hopper', 'Walker2d', 'HalfCheetah', 'Ant']
    stacksizelist     = [2, 4]
    seedlist          = [0, 1, 2, 3, 4]    

To execute 0-th configuration, i.e. method='BC', env='Hopper', stacksize=2, seed=0, run:

python train.py --pid=0

3. Notes

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%