ppo · GitHub Topics

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

deep-reinforcement-learning reinforcement-learning dqn ppo a3c q-learning sarsa imitation-learning policy-gradient ddpg double-dqn dueling-dqn td3

Jupyter Notebook 10.92 k

16 天前

MorvanZhou / Reinforcement-learning-with-tensorflow

#计算机科学#Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

翻译 - 简单钢筋学习教程

reinforcement-learning 教程 q-learning sarsa sarsa-lambda deep-q-network a3c ddpg policy-gradient dqn double-dqn dueling-dqn deep-deterministic-policy-gradient actor-critic Tensorflow proximal-policy-optimization ppo 机器学习

Python 9.15 k

1 年前

thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

翻译 - 优雅，灵活和超快的PyTorch深度强化学习平台。

PyTorch policy-gradient dqn double-dqn a2c ddpg ppo td3 sac imitation-learning mujoco atari rl cql

Python 8.39 k

22 天前

vwxyzjn / cleanrl

#计算机科学#High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

wandb reinforcement-learning PyTorch Python gym 机器学习 deep-reinforcement-learning 深度学习 atari ale a2c proximal-policy-optimization ppo advantage-actor-critic actor-critic phasic-policy-gradient

Python 6.76 k

4 天前

udacity / deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program

deep-reinforcement-learning reinforcement-learning reinforcement-learning-algorithms neural-networks PyTorch pytorch-rl ddpg dqn ppo dynamic-programming hill-climbing ml-agents openai-gym

Jupyter Notebook 5.03 k

1 年前

andri27-ts / Reinforcement-Learning

#计算机科学#Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

reinforcement-learning 机器学习人工智能 deep-reinforcement-learning 深度学习 evolution-strategies a2c deepmind dqn ppo

Jupyter Notebook 4.31 k

5 年前

sweetice / Deep-reinforcement-learning-with-pytorch

#算法刷题#PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

翻译 - DQN、AC、ACER、A2C、A3C、PG、DDPG、TRPO、PPO、SAC、TD3 和...的 PyTorch 实现。

policy-gradient PyTorch actor-critic-algorithm alphago deep-reinforcement-learning a2c dqn sarsa ppo a3c resnet 算法深度学习 reinforce actor-critic sac td3

Python 4.23 k

2 年前

AI4Finance-Foundation / ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

翻译 - 使用Pytorch的深度加强学习算法轻量级，高效稳定的实现。

PyTorch reinforcement-learning ppo sac td3 dqn ddpg stable lightweight efficient a2c

Python 3.99 k

1 个月前

simoninithomas / Deep_reinforcement_learning_Course

#计算机科学#Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

deep-reinforcement-learning 深度学习 Tensorflow ppo a2c actor-critic deep-q-network deep-q-learning PyTorch Unity

Jupyter Notebook 3.84 k

2 年前

ikostrikov / pytorch-a2c-ppo-acktr-gail

#计算机科学#PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...

PyTorch reinforcement-learning 深度学习 deep-reinforcement-learning actor-critic advantage-actor-critic a2c ppo proximal-policy-optimization hessian atari mujoco roboschool continuous-control ale

Python 3.73 k

3 年前

ShangtongZhang / DeepRL

Modularized Implementation of Deep RL Algorithms in PyTorch

PyTorch deep-reinforcement-learning dqn double-dqn deeprl ddpg ppo td3 a2c rainbow

Python 3.29 k

1 年前

seungeunrho / minimalRL

#计算机科学#Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

翻译 - 用最少的代码行实现基本的RL算法！（基于火炬）

deep-reinforcement-learning PyTorch simple 深度学习 a3c ppo a2c reinforce acer dqn ddpg reinforcement-learning 机器学习 sac

Python 3 k

2 年前

AI4Finance-Foundation / FinRL-Trading

For trading. Please star.

deep-reinforcement-learning stock-trading ppo ddpg openai-gym sharpe-ratio

Jupyter Notebook 2.3 k

9 个月前

XinJingHao / DRL-Pytorch

#计算机科学#Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

deep-reinforcement-learning PyTorch reinforcement-learning 机器学习 ddpg double-dqn dueling-dqn ppo q-learning sac td3

Python 2.21 k

1 个月前

nikhilbarhate99 / PPO-PyTorch

#计算机科学#Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

pytorch-implmention PyTorch pytorch-tutorial proximal-policy-optimization reinforcement-learning-algorithms deep-reinforcement-learning ppo policy-gradient 深度学习 reinforcement-learning

Python 1.98 k

9 个月前

marlbenchmark / on-policy

#算法刷题#This is the official implementation of Multi-Agent PPO (MAPPO).

smac ppo multi-agent 算法

Python 1.51 k

9 个月前

kengz / SLM-Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

PyTorch reinforcement-learning deep-reinforcement-learning benchmark policy-gradient dqn ppo sac a2c a3c

Python 1.28 k

2 个月前

Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

reinforcement-learning policy-gradient pytorch-rl proximal-policy-optimization ppo PyTorch a2c Generative Adversarial Network deep-reinforcement-learning

Python 1.19 k

4 年前