ppo2 · GitHub Topics

vietnh1009 / Super-mario-bros-PPO-pytorch

#计算机科学#Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

翻译 - 超级马里奥兄弟的近距离策略优化（PPO）算法

reinforcement-learning ppo ppo2 PyTorch gym Python 深度学习 super-mario-bros mario 人工智能 proximal-policy-optimization openai openai-gym

Python 1.15 k

4 年前

vietnh1009 / Contra-PPO-pytorch

#计算机科学#Proximal Policy Optimization (PPO) algorithm for Contra

reinforcement-learning 人工智能 ppo 深度学习 openai gym proximal-policy-optimization ppo2

Python 136

2 年前

seolhokim / Mujoco-Pytorch

PPO, DDPG, SAC implementation on mujoco environment

PyTorch mujoco reinforcement-learning ppo ppo2 sac ddpg

Python 106

3 年前

ZYunfeii / DRL_algorithm_library

This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.

deep-reinforcement-learning ddpg ppo2 td3 sac pytorch-implementation

Python 99

3 年前

jw1401 / PPO-Tensorflow-2.0

Proximal Policy Optimization with Tensorflow 2.0

reinforcement-learning ppo proximal-policy-optimization tensorflow2 policy-gradient ppo2 reinforcement-learning-algorithms

Python 31

5 年前

vietnh1009 / Sonic-PPO-pytorch

#计算机科学#Proximal Policy Optimization (PPO) algorithm for Sonic the Hedgehog

sonic sonic-the-hedgehog reinforcement-learning 人工智能深度学习 openai-gym openai gym proximal-policy-optimization ppo ppo2

Python 26

4 年前

leolellisr / poke_RL

Code repository with classical reinforcement learning and deep reinforcement learning methods for Pokémon battles in Pokémon Showdown.

pokemon reinforcement-learning sarsa-lambda function-approximation game dqn double-dqn ppo ppo2 deep-reinforcement-learning deep-rl reinforce pokemon-showdown

Jupyter Notebook 13

4 个月前

GNiendorf / snake

OpenAI's PPO baseline applied to the classic game of Snake

openai-gym ppo2 baselines deep-reinforcement-learning gym-environment game ppo reinforcement-learning openai gym-environments benchmark Project

Python 9

5 年前

denizetkar / TreeGAN

Generative Adversarial Model that generates parse trees

Generative Adversarial Network rnn PyTorch ansi-c ppo2

Python 7

5 年前

RLOpensource / spinning_up_kr

reinforcement-learning ddpg deep-deterministic-policy-gradient td3 proximal-policy-optimization ppo soft-actor-critic sac ppo2 Robotics

Python 6

6 年前

Tqualizer / Retro-Street-Fighter-reinforcement-learning

Experiments with multiple reinforcement ML algorithms to learn how to beat Street Fighter II

retro openai-gym reinforcement-learning ppo2 stable-baselines

Python 6

4 年前

AnthonyDickson / learning2write

#计算机科学#Teaching a neural network how to write letters and digits with reinforcement learning.

人工智能机器学习 reinforcement-learning ppo ppo2 mnist stable-baselines emnist-dataset 神经网络 conda

Python 4

6 年前

c2d08y / LearningBot

#计算机科学#A deep reinforcement learning Bot for https://kana.byha.top:444/

Bot 深度学习深度神经网络 deep-reinforcement-learning 神经网络 nueral-networks ppo2 reinforcement-learning

Python 4

3 年前

leonjovanovic / drl-ppo-bipedal-walker

PyTorch application of reinforcement learning Advanced Policy Gradient algorithms in OpenAI BipedalWalker- PPO

ppo2 ppo PyTorch

Python 3

3 年前

leonjovanovic / drl-ml-agents-3dball

PyTorch application of reinforcement learning DDPG and PPO algorithms in Unity 3D-Ball

ml-agents ddpg ddpg-pytorch ppo ppo2

Python 2

3 年前

primeMover2011 / PromixalPolicyOptimization

Proximal Policy Optimization using Pytorch and the Unity Reacher environment.

PyTorch deep-reinforcement-learning ppo2 proximal-policy-optimization ml-agents

Python 1

6 年前

DongChen06 / ppo_tf

PPO IMPLEMENTATION ON TENSORFLOW

ppo2 Tensorflow reinforcement-learning policy-gradient

Python 1

5 年前

harikris001 / Super-Mario-Reinforcement_Learning

#计算机科学#Reinforcement Learning in Super Mario using Pytorch and PPO

gym ppo reinforcement-learning 深度学习 deep-reinforcement-learning openai ppo2 Python python38 PyTorch

Jupyter Notebook 1

2 年前

mohith-sakthivel / sufficient-ppo

Clean and flexible implementation of PPO (built on top of stable-baselines3)

ppo ppo2 PyTorch proximal-policy-optimization reinforcement-learning

Python 0

4 年前

frankroeder / dreamduck

#计算机科学#World Models Experiments for Duckietown

reinforcement-learning vae duckietown baselines 深度学习 ppo2 gym gym-environment ducks self-driving self-driving-car

Python 0

4 年前