强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
An elegant PyTorch deep reinforcement learning library.
翻译 - 优雅,灵活和超快的PyTorch深度强化学习平台。
#算法刷题#PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
翻译 - DQN、AC、ACER、A2C、A3C、PG、DDPG、TRPO、PPO、SAC、TD3 和...的 PyTorch 实现。
Massively Parallel Deep Reinforcement Learning. 🔥
翻译 - 使用Pytorch的深度加强学习算法轻量级,高效稳定的实现。
Modularized Implementation of Deep RL Algorithms in PyTorch
#计算机科学#Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.
#计算机科学#Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal...
PyTorch implementation of deep reinforcement learning algorithms
#计算机科学#Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
#计算机科学#Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
#计算机科学#lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Machine Readable Zone generator and checker for official travel documents sizes 1, 2, 3, MRVA and MRVB (Passports, Visas, national id cards and other travel documents)
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
Autonomous UAV Navigation without Collision using Visual Information in Airsim
Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC