Proximal-Policy-Optimization

Tensorflow implementation of PPO from (https://arxiv.org/abs/1707.06347). Without any changed parameters, the program trains an agent in the Humanoid-v2 environment from OpenAI Gym.

Dependencies:

Mujoco-py (Mujoco150+)
OpenAI gym
Numpy
Tensorflow
Matplotlib
Scipy

Usecase: Example call:

python ppoMain.py --env Humanoid-v2 --episodes 1000 --localsteps 2000 --batchSize 64

python ppoMain.py -hcan be used to learn more about the input format.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
PPO.py		PPO.py
README.md		README.md
ppoMain.py		ppoMain.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Proximal-Policy-Optimization

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Proximal-Policy-Optimization

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages