DOGE: When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning (ICLR 2023)

DOGE (https://openreview.net/forum?id=lMO7TC7cuuh) is an offline RL method designed from the perspective of generalization performance of deep function approximators. DOGE trains a state-conditioned distance function that can be readily plugged into standard actor-critic methods as a policy constraint. Simple yet elegant, our algorithm enjoys better generalization compared to state-of-the-art methods on D4RL benchmarks.

Usage

To install the dependencies, use

    pip install -r requirements.txt

Benchmark experiments

You can run Mujoco tasks and AntMaze tasks like so:

    python train_distance_mujoco.py --env_name halfcheetah-medium-v2 --alpha 7.5

    python train_distance_antmaze.py --env_name antmaze-umaze-v2 --alpha 5.0

Modified AntMaze tasks

You can run the modified AntMaze medium/large tasks like so:

    python train_distance_antmaze.py --env_name antmaze-large-play-v2 --alpha 70 --toycase True

Visulization of Learning curves

You can resort to wandb to login your personal account via export your own wandb api key.

export WANDB_API_KEY=YOUR_WANDB_API_KEY

and run

wandb online

to turn on the online syncronization.

Bibtex

@inproceedings{
li2023when,
title={When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning},
author={Jianxiong Li and Xianyuan Zhan and Haoran Xu and Xiangyu Zhu and Jingjing Liu and Ya-Qin Zhang},
booktitle={The Eleventh International Conference on Learning Representations },
year={2023},
url={https://openreview.net/forum?id=lMO7TC7cuuh}
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.idea		.idea
Network		Network
RL_algos		RL_algos
Sample_Dataset		Sample_Dataset
.gitignore		.gitignore
readme.md		readme.md
requirements.txt		requirements.txt
run_d4rl.sh		run_d4rl.sh
train_distance_antmaze.py		train_distance_antmaze.py
train_distance_mujoco.py		train_distance_mujoco.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DOGE: When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning (ICLR 2023)

Usage

Benchmark experiments

Modified AntMaze tasks

Visulization of Learning curves

Bibtex

About

Releases

Packages

Languages

Facebear-ljx/DOGE

Folders and files

Latest commit

History

Repository files navigation

DOGE: When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning (ICLR 2023)

Usage

Benchmark experiments

Modified AntMaze tasks

Visulization of Learning curves

Bibtex

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages