#计算机科学#PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
#计算机科学#The Fastest Deep Reinforcement Learning Library
#计算机科学#JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
#计算机科学#PyTorch implementation of Soft Actor-Critic (SAC)
PyTorch C++ Reinforcement Learning
翻译 - PyTorch C ++强化学习
#计算机科学#PyTorch implementation of Trust Region Policy Optimization
PyTorch Implementation of REINFORCE for both discrete & continuous control
#计算机科学#Code for the paper "Evolved Policy Gradients"
#计算机科学#End to end motion planner using Deep Deterministic Policy Gradient (DDPG) in gazebo
Tensorflow implementation of generative adversarial imitation learning
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
Implement A3C for Mujoco gym envs
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
#计算机科学#Catalyst.RL: A Distributed Framework for Reproducible RL Research
A workbench for online model-free Reinforcement Learning on continuous control problems
翻译 - 在线模型的连续控制问题的免费在线强化学习工作台
PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method
Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.
Proximal Policy Optimization (Continuous Version) in PyTorch.
Neural Ordinary Differential Equations for Reinforcement Learning