Not-GAIL

To install each library here use pip install -e

Make sure you are not using python 3.9, I faced many installation problems with it. Python 3.7 or 3.8 is better.

VAE pre-training

python .\test_bac_state.py -e MiniGrid-KeyEmpty-6x6-v0 -nt not_100_key_6x6_traj -t 100_key_6x6_traj -s key_6x6_state --nepochs 100 --ae

Use Curriculum Learning for unfair speedups!

make sure the -p flag is passed (need partially obs so cnn does not get angry) python .\minigrid_ppo_training_script.py -e MiniGrid-ColoredFourRooms-v0 -r partial_img_colored4rooms -p -le MiniGrid-Empty-16x16-v0 -l partial_img_empty_16x16 Need pre-trained weights on simpler task, pass using -l and -le

For empty middle room

collected traj saved in traj_datasets\middle_empty_random_traj train BaC with python .\test_crnn_bac_triplet.py -e MiniGrid-MidEmpty-Random-6x6-v0 -t middle_empty_random_traj --bc -s avoid_middle_6x6 load and check BaC with python .\test_crnn_bac_triplet.py -e MiniGrid-MidEmpty-Random-6x6-v0 -s avoid_middle_6x6 -l weights saved in bac_weights\avoid_middle_6x6.pt

Scripts

python .\minigrid_ppo_training_script.py -e MiniGrid-Empty-Random-6x6-v0 -r cnn_1 --seed 1 --show

python .\minigrid_traj_collection_script.py -e MiniGrid-Empty-Random-6x6-v0 -r cnn_1 -s test_traj_collection --ntraj 20 --render

python .\test_crnn_bac_triplet.py -e MiniGrid-KeyEmpty-6x6-v0 -t 100_key_6x6_traj --bc -s key_6x6_bac_attn_switched -te 60 -ce 120

New Minigrid Envs

MiniGrid-KeyEmpty: Similar to the empty grid with agent and key position randomized.
- Found in : gym_minigrid/envs/keyempty.py
- Variants:
  - MiniGrid-KeyEmpty-16x16-v0
  - MiniGrid-KeyEmpty-8x8-v0
  - MiniGrid-KeyEmpty-6x6-v0
MiniGrid-nKeyEmpty: Similar to the key empty grid with agent and n keys randomized.
- Found in : gym_minigrid/envs/nkeyempty.py
- Variants:
  - MiniGrid-3KeyEmpty-8x8-v0
  - MiniGrid-2KeyEmpty-6x6-v0
MiniGrid-MidEmpty: Similar to the empty grid with agent avoiding middle blocks.
- Found in : gym_minigrid/envs/middleempty.py
- Variants:
  - MiniGrid-MidEmpty-6x6-v0
  - MiniGrid-MidEmpty-Random-6x6-v0
MiniGrid-ColoredFourRooms: Similar to 4 rooms env, with top right room colored as yellow
- Found in : gym_minigrid/envs/colored_fourrooms.py
- Variants:
  - MiniGrid-ColoredFourRooms-v0

New files added for Anything

libraries\imitation\src\imitation\algorithms\anything_module.py
libraries\imitation\src\imitation\rewards\neg_discrim_nets.py

New files added for Something

libraries\imitation\src\imitation\algorithms\something_module.py
libraries\imitation\src\imitation\utils\reward_wrapper.py

New Functions added in BC

Find them in libraries\imitation\src\imitation\algorithms\bc.py _calculate_only_l2_loss: Plain L2 loss. _calculate_crossentropy_loss: One hot actions and return CE loss.

Minigrid Tips

If using an image as obs, wrap the env as follows: env = gym.make('MiniGrid-Empty-Random-6x6-v0') env = wrappers.RGBImgObsWrapper(env) env = wrappers.ImgObsWrapper(env)

Or for a venv, use from imitation.util import util venv = util.make_vec_env('MiniGrid-Empty-Random-6x6-v0', n_envs=1, post_wrappers= [wrappers.RGBImgObsWrapper, wrappers.ImgObsWrapper])

Similarly, for a flat obs use: env = gym.make('MiniGrid-Empty-Random-6x6-v0') env = wrappers.FlatObsWrapper(env)

Pass FlatObsWrapper in post_wrappers for a venv

Test Script

To run the script first install both imitation and sb3. Then you need to re-install imitation from libraries\imiation This is because it needs to register the new files that are added.

run python test_anything_module.py

cartpole_proper.pkl are trajectories from a trained PPO that solved Cartpole-v1 env.

Ideally you should see the gen log with increasing reward and neg_gen log with reducing reward.

Changes to original script

imitation\src\imitation\util\utils.py at line 103, dropped i (needed to pass custom wrappers.)

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
.vscode		.vscode
BAC_test_scripts		BAC_test_scripts
BaC		BaC
__pycache__		__pycache__
autoencoder		autoencoder
bac_weights		bac_weights
data		data
gym-minigrid		gym-minigrid
highway-env		highway-env
libraries		libraries
logs		logs
models		models
modules		modules
traj_datasets		traj_datasets
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
collect_interactive_minigrid_traj.py		collect_interactive_minigrid_traj.py
collecting_traj.py		collecting_traj.py
gail_empty.py		gail_empty.py
interactive_minigrid_traj_collection_script.py		interactive_minigrid_traj_collection_script.py
minigrid_bc_training_script.py		minigrid_bc_training_script.py
minigrid_ppo_training_script.py		minigrid_ppo_training_script.py
minigrid_traj_collection_script.py		minigrid_traj_collection_script.py
ppo_with_custom_reward.py		ppo_with_custom_reward.py
reward_wrapper_testing.py		reward_wrapper_testing.py
test_policies.py		test_policies.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Not-GAIL

VAE pre-training

Use Curriculum Learning for unfair speedups!

For empty middle room

Scripts

New Minigrid Envs

New files added for Anything

New files added for Something

New Functions added in BC

Minigrid Tips

Test Script

Changes to original script

About

Releases

Packages

Contributors 2

Languages

License

sen-pai/Not-GAIL

Folders and files

Latest commit

History

Repository files navigation

Not-GAIL

VAE pre-training

Use Curriculum Learning for unfair speedups!

For empty middle room

Scripts

New Minigrid Envs

New files added for Anything

New files added for Something

New Functions added in BC

Minigrid Tips

Test Script

Changes to original script

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages