DOGE: When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning (ICLR 2023)
DOGE is an offline RL method designed from the perspective of the generalization performance of deep function approximators. DOGE trains a state-conditioned distance function that can be readily plugged into standard actor-critic methods as a policy constraint. Simple yet elegant, our algorithm generalizes better than state-of-the-art methods on D4RL benchmarks.
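To make the idea of a distance function used as a policy constraint concrete, here is a minimal pure-Python sketch. It is not the code in this repository. It assumes that the distance function g(s, a) is fit by regression to the Euclidean distance between a dataset action and a randomly sampled action, and that the actor minimizes a Lagrangian-style objective that trades off Q-value against g(s, a) exceeding a threshold. The names `distance_target`, `constrained_actor_loss`, `threshold`, and `multiplier` are hypothetical.

```python
import math
import random


def distance_target(dataset_action, sampled_action):
    """Regression target for g(s, a_hat): Euclidean distance to the dataset action.

    Assumption: this target is illustrative, not taken from the repository.
    """
    return math.dist(dataset_action, sampled_action)


def constrained_actor_loss(q_values, distances, threshold, multiplier):
    """Batch mean of -Q(s, a) + multiplier * (g(s, a) - threshold).

    A larger multiplier penalizes actions whose learned distance exceeds
    the threshold more strongly (hypothetical Lagrangian-style form).
    """
    terms = [-q + multiplier * (d - threshold)
             for q, d in zip(q_values, distances)]
    return sum(terms) / len(terms)


# Toy usage with a 3-dimensional action space bounded in [-1, 1].
random.seed(0)
dataset_action = [0.2, -0.5, 0.1]
sampled_action = [random.uniform(-1.0, 1.0) for _ in dataset_action]
target = distance_target(dataset_action, sampled_action)

# Two policy actions that sit exactly on the threshold incur no penalty.
loss = constrained_actor_loss([1.0, 3.0], [0.5, 0.5], threshold=0.5, multiplier=2.0)
```

In a real implementation g and Q would be neural networks and the loss would be minimized by gradient descent; the sketch only shows how the two terms combine.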
To install the dependencies, use
pip install -r requirements.txt
You can run MuJoCo tasks and AntMaze tasks like so:
python --env_name halfcheetah-medium-v2 --alpha 7.5
python --env_name antmaze-umaze-v2 --alpha 5.0
You can run the modified AntMaze medium/large tasks like so:
python --env_name antmaze-large-play-v2 --alpha 70 --toycase True
To log in to your personal wandb account, export your own wandb API key:
export WANDB_API_KEY=YOUR_WANDB_API_KEY
and run
wandb online
to turn on online synchronization.
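The same setup can also be done from Python before wandb starts a run. This is a minimal sketch, assuming the standard wandb environment variables `WANDB_API_KEY` and `WANDB_MODE`; the helper name `configure_wandb` and the placeholder key are hypothetical.

```python
import os


def configure_wandb(api_key, mode="online"):
    """Set the environment variables that wandb reads when a run starts."""
    # Only the common modes are accepted in this sketch.
    if mode not in ("online", "offline", "disabled"):
        raise ValueError(f"unsupported wandb mode: {mode}")
    os.environ["WANDB_API_KEY"] = api_key
    os.environ["WANDB_MODE"] = mode


# Placeholder key for illustration only; substitute your own.
configure_wandb("your-wandb-api-key")
```

Call the helper before the training script imports or initializes wandb so the settings are picked up.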
title={When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning},
author={Jianxiong Li and Xianyuan Zhan and Haoran Xu and Xiangyu Zhu and Jingjing Liu and Ya-Qin Zhang},
booktitle={The Eleventh International Conference on Learning Representations},