actor-critic · GitHub Topics

MorvanZhou / Reinforcement-learning-with-tensorflow

#Computer Science# Simple Reinforcement learning tutorials, 莫烦Python Chinese-language AI tutorials

reinforcement-learning tutorial q-learning sarsa sarsa-lambda deep-q-network a3c ddpg policy-gradient dqn double-dqn dueling-dqn deep-deterministic-policy-gradient actor-critic Tensorflow proximal-policy-optimization ppo machine-learning

Python 9.24 k

1 year ago

vwxyzjn / cleanrl

#Computer Science# High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

wandb reinforcement-learning PyTorch Python gym machine-learning deep-reinforcement-learning deep-learning atari ale a2c proximal-policy-optimization ppo advantage-actor-critic actor-critic phasic-policy-gradient

Python 7.41 k

11 hours ago

sweetice / Deep-reinforcement-learning-with-pytorch

#Algorithm Practice# PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

policy-gradient PyTorch actor-critic-algorithm alphago deep-reinforcement-learning a2c dqn sarsa ppo a3c resnet algorithm deep-learning reinforce actor-critic sac td3

Python 4.37 k

2 years ago

simoninithomas / Deep_reinforcement_learning_Course

#Computer Science# Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

deep-reinforcement-learning deep-learning Tensorflow ppo a2c actor-critic deep-q-network deep-q-learning PyTorch Unity

Jupyter Notebook 3.87 k

2 years ago

ikostrikov / pytorch-a2c-ppo-acktr-gail

#Computer Science# PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...

PyTorch reinforcement-learning deep-learning deep-reinforcement-learning actor-critic advantage-actor-critic a2c ppo proximal-policy-optimization hessian atari mujoco roboschool continuous-control ale

Python 3.79 k

3 years ago

rlcode / reinforcement-learning

#Computer Science# Minimal and Clean Reinforcement Learning Examples

reinforcement-learning deep-learning deep-reinforcement-learning machine-learning policy-gradient deep-q-network dqn actor-critic a3c

Python 3.55 k

2 years ago

ikostrikov / pytorch-a3c

#Computer Science# PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Python reinforcement-learning PyTorch deep-learning actor-critic a3c deep-reinforcement-learning

Python 1.28 k

6 years ago

chainer / chainerrl

#Computer Science# ChainerRL is a deep reinforcement learning library built on top of Chainer.

chainer reinforcement-learning deep-learning machine-learning Python dqn actor-critic

Python 1.2 k

4 years ago

qfettes / DeepRL-Tutorials

Contains high-quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Python PyTorch reinforcement-learning deep-reinforcement-learning deep-q-network double-dqn dueling-dqn rainbow actor-critic advantage-actor-critic a2c ppo

Jupyter Notebook 1.08 k

4 years ago

jingweiz / pytorch-rl

#Computer Science# Deep Reinforcement Learning with pytorch & visdom

dqn a3c PyTorch visdom deep-reinforcement-learning reinforcement-learning deep-learning actor-critic acer

Python 799

5 years ago

yaserkl / RLSeq2Seq

#Natural Language Processing# Deep Reinforcement Learning For Sequence to Sequence Models

reinforcement-learning actor-critic policy-gradient natural-language-processing

Python 766

2 years ago

omerbsezer / Reinforcement_learning_tutorial_with_demo

#Computer Science# Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers,...

reinforcement-learning tutorial machine-learning q-learning sarsa policy-gradient deep-reinforcement-learning imitation-learning meta-learning actor-critic pomdps dynamic-programming a3c

Jupyter Notebook 759

6 years ago

TianhongDai / reinforcement-learning-algorithms

#Algorithm Practice# This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are st...