- Framework: TensorFlow
- Reinforcement learning algorithm: Proximal Policy Optimization (PPO)
- Reward function execution environment: Python 3.5
$ pip3.5 install -r requirements.txt
$ python3.5 -m pytest -s