Official Pytorch Implementation of the paper Handling Delay in Real-Time Reinforcement Learning

Installation

For Mujoco the code was tested with python3.10.

Download Mujoco from here and place it in /home/$USER/.mujoco, and then source it:

export MUJOCO_PLUGIN_PATH=/home/$USER/.mujoco/mujoco-2.3.3/bin/mujoco_plugin/

and

export MUJOCO_PATH=/home/$USER/.mujoco/mujoco-2.3.3/

Install dependencies with:

pip install -r requirements_mujoco.txt

Tranning Mujoco

Vanilla SAC is taken from cleanRL

To train the vanilla SAC algorithm without delay run:

python train_sac.py --env_id HalfCheetah-v4 --agent Actor

To train the agent without any skip connections within a parallel computation framework and neuron execution time of 1 run:

python train_sac.py --env_id HalfCheetah-v4 --trainer delayed --agent ActorSlow --frame_skip 1

To train the agent with skip connections and with state-augmentation within a parallel computation framework and neuron execution time of 1 run:

python train_sac.py --env_id HalfCheetah-v4 --trainer delayed --agent ActorSlowConcat --num_last_actions 2 --frame_skip 1

To train it with different neuron execution times change the frame_skip parameter to 2,3 or 4.

Tranning MinAtar and MiniGrid

For MinAtar and MiniGrid the code was tested with python3.9.

Install MinAtar using the instructions here.

Install dependencies with:

pip install -r requirements_minatar.txt

For training Vanilla PPO without delay (heavily based on cleanRL), run:

python train_ppo.py --env_id MinAtar/Breakout-v0 --agent AgentSeparateActorCritic

For training an agent without skip connections within a parallel computation framework and neuron execution time of 1 run:

python train_ppo.py --env_id MinAtar/Breakout-v0 --agent ActorSlowPPO --frame_skip 1

For training an agent with skip connections and with state-augmentation within a parallel computation framework and neuron execution time of 1 run

for MinAtar:

python train_ppo.py --env_id MinAtar/Breakout-v0 --agent ActorSlowSkipResPPO --add_last_action --frame_skip 1

and for MiniGrid:

python train_ppo.py --env_id MiniGrid-DoorKey-5x5-v0 --agent ActorSlowSkipResPPO --history_states 4 --frame_skip 1

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
nets		nets
trainers		trainers
.gitignore		.gitignore
README.md		README.md
replay_buffer.py		replay_buffer.py
requirements_minatar.txt		requirements_minatar.txt
requirements_mujoco.txt		requirements_mujoco.txt
train_ppo.py		train_ppo.py
train_sac.py		train_sac.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Official Pytorch Implementation of the paper Handling Delay in Real-Time Reinforcement Learning

Installation

Tranning Mujoco

Tranning MinAtar and MiniGrid

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Official Pytorch Implementation of the paper Handling Delay in Real-Time Reinforcement Learning

Installation

Tranning Mujoco

Tranning MinAtar and MiniGrid

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages