#计算机科学#High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
#计算机科学#PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
#时序数据库#Scalable, event-driven, deep-learning-friendly backtesting library
A PyTorch library for building deep reinforcement learning agents.
翻译 - 一个PyTorch库,用于构建深度强化学习代理。
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
PyTorch C++ Reinforcement Learning
翻译 - PyTorch C ++强化学习
Code for Hands On Intelligent Agents with OpenAI Gym book to get started and learn to build deep reinforcement learning agents using PyTorch
Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1"
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
#计算机科学#PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Curiosity-driven Exploration by Self-supervised Prediction
Reinforcing Your Learning of Reinforcement Learning
#计算机科学#Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, tensorboard, numpy, gym-torcs, ubuntu, latex
[ICRA 2023] Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
The friendly robot that beats you in Yahtzee 🤖 🎲
MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitflip-DQN example. +prioritized replay.
Deep reinforcement learning package for torch7