#

policy-gradient

https://static.github-zh.com/github_avatars/datawhalechina?size=40

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 10.96 k
19 天前
thu-ml/tianshou
https://static.github-zh.com/github_avatars/thu-ml?size=40

An elegant PyTorch deep reinforcement learning library.

翻译优雅,灵活和超快的PyTorch深度强化学习平台。

Python 8.39 k
25 天前
https://static.github-zh.com/github_avatars/sweetice?size=40

#算法刷题#PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

翻译DQN、AC、ACER、A2C、A3C、PG、DDPG、TRPO、PPO、SAC、TD3 和...的 PyTorch 实现。

Python 4.25 k
2 年前
https://static.github-zh.com/github_avatars/rlcode?size=40
Python 3.51 k
2 年前
kengz/SLM-Lab
https://static.github-zh.com/github_avatars/kengz?size=40

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Python 1.28 k
2 个月前
https://static.github-zh.com/github_avatars/Khrylx?size=40

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Python 1.19 k
4 年前
https://static.github-zh.com/github_avatars/yaserkl?size=40
Python 767
2 年前
https://static.github-zh.com/github_avatars/omerbsezer?size=40

#计算机科学#Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers,...

Jupyter Notebook 753
6 年前
https://static.github-zh.com/github_avatars/suragnair?size=40

#自然语言处理#A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)

Python 646
7 年前
https://static.github-zh.com/github_avatars/germain-hug?size=40

Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)

Python 542
5 年前
https://static.github-zh.com/github_avatars/medipixel?size=40

#计算机科学#Structural implementation of RL key algorithms

翻译RL密钥算法的结构化实现

Python 512
2 年前
https://static.github-zh.com/github_avatars/VinF?size=40
Python 484
1 年前
loading...
Website
Wikipedia