# actor-critic

https://static.github-zh.com/github_avatars/vwxyzjn?size=40
Python 6.81 k
9 days ago
https://static.github-zh.com/github_avatars/sweetice?size=40

#Algorithm practice# PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4.25 k
2 years ago
https://static.github-zh.com/github_avatars/simoninithomas?size=40
Jupyter Notebook 3.85 k
2 years ago
https://static.github-zh.com/github_avatars/ikostrikov?size=40

#Computer science# PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...

Python 3.73 k
3 years ago
https://static.github-zh.com/github_avatars/rlcode?size=40
Python 3.51 k
2 years ago
https://static.github-zh.com/github_avatars/ikostrikov?size=40

#Computer science# PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Python 1.26 k
6 years ago
https://static.github-zh.com/github_avatars/chainer?size=40

#Computer science# ChainerRL is a deep reinforcement learning library built on top of Chainer.

Python 1.19 k
4 years ago
https://static.github-zh.com/github_avatars/qfettes?size=40
Jupyter Notebook 1.07 k
4 years ago
https://static.github-zh.com/github_avatars/yaserkl?size=40
Python 767
2 years ago
https://static.github-zh.com/github_avatars/omerbsezer?size=40

#Computer science# Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers,...

Jupyter Notebook 753
6 years ago
https://static.github-zh.com/github_avatars/TianhongDai?size=40

#Algorithm practice# This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are st...

Python 678
4 years ago
https://static.github-zh.com/github_avatars/MorvanZhou?size=40

Simple A3C implementation with PyTorch + multiprocessing

Python 639
2 years ago
https://static.github-zh.com/github_avatars/mpatacchiola?size=40

Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog

Python 615
2 years ago
https://static.github-zh.com/github_avatars/inoryy?size=40

#Computer science# Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo. Matches reference results.

Python 557
4 years ago