Solving a Real-world optimization problem using Proximal Policy Optimization combined with Curriculum Learning and Reward Engineering ♻️
- Python >=3.8.0,<3.10
- Conda
- Follow instructions : Installer link
git clone git@gitlab.com:anonymousppocl1/ppo_paper.git
cd ppo_paper
Create a conda virtual environment and run the following commands
conda create -n myenv python=3.8.8
conda activate myenv
pip install -r requirements.txt
python reproduce_results_paper.py
