td3 · GitHub Topics

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

deep-reinforcement-learning reinforcement-learning dqn ppo a3c q-learning sarsa imitation-learning policy-gradient ddpg double-dqn dueling-dqn td3

Jupyter Notebook 11.73 k

12 天前

thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

PyTorch policy-gradient dqn double-dqn a2c ddpg ppo td3 sac imitation-learning mujoco atari rl cql

Python 8.6 k

3 天前

sweetice / Deep-reinforcement-learning-with-pytorch

#算法刷题#PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

policy-gradient PyTorch actor-critic-algorithm alphago deep-reinforcement-learning a2c dqn sarsa ppo a3c resnet 算法深度学习 reinforce actor-critic sac td3

Python 4.36 k

2 年前

AI4Finance-Foundation / ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

PyTorch reinforcement-learning ppo sac td3 dqn ddpg stable lightweight efficient a2c

Python 4.07 k

2 个月前

ShangtongZhang / DeepRL

Modularized Implementation of Deep RL Algorithms in PyTorch

PyTorch deep-reinforcement-learning dqn double-dqn deeprl ddpg ppo td3 a2c rainbow

Python 3.32 k

1 年前

XinJingHao / DRL-Pytorch

#计算机科学#Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

deep-reinforcement-learning PyTorch reinforcement-learning 机器学习 ddpg double-dqn dueling-dqn ppo q-learning sac td3

Python 2.71 k

20 天前

reiniscimurs / DRL-robot-navigation

#计算机科学#Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal...

deep-reinforcement-learning 深度学习 td3 ros gazebo reinforcement-learning

Python 1.01 k

20 天前

Rafael1s / Deep-Reinforcement-Learning-Algorithms

32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.

dqn td3 deep-reinforcement-learning sac ddpg ppo a2c soft-actor-critic

Jupyter Notebook 909

4 年前

dongminlee94 / deep_rl

PyTorch implementation of deep reinforcement learning algorithms

deep-reinforcement-learning PyTorch dqn a2c ppo ddpg td3 sac

Python 494

4 年前

sudharsan13296 / Deep-Reinforcement-Learning-With-Python

#计算机科学#Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

deep-reinforcement-learning reinforcement-learning ppo ddpg td3 sac inverse-reinforcement-learning a3c actor-critic a2c 深度学习 dqn openai-gym policy-gradient double-dqn q-learning

Jupyter Notebook 425

4 年前

iffiX / machin

#计算机科学#Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

reinforcement-learning 深度学习 PyTorch pytorch-reinforcement-learning dqn ddpg sac ppo td3 distributed Python pytorch-lightning

Python 413

4 年前

zuoxingdong / lagom

#计算机科学#lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

reinforcement-learning PyTorch 机器学习 Python research 深度学习人工智能 policy-gradient evolution-strategies deep-reinforcement-learning deep-deterministic-policy-gradient ddpg td3 soft-actor-critic mujoco proximal-policy-optimization ppo sac

Jupyter Notebook 375

3 年前

Arg0s1080 / mrz

Machine Readable Zone generator and checker for official travel documents sizes 1, 2, 3, MRVA and MRVB (Passports, Visas, national id cards and other travel documents)

checker td3 passport id-card

Python 361

1 年前

RITCHIEHuang / DeepRL_Algorithms

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

reinforcement-learning-algorithms pytorch-implementation deep-reinforcement-learning dqn ppo mujoco policy-gradient tensorflow2 td3 pytorch-rl soft-actor-critic

Python 334

2 年前

twni2016 / pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

recurrent-neural-networks generalization deep-reinforcement-learning PyTorch td3 sac

Python 320

10 个月前

sunghoonhong / AirsimDRL

Autonomous UAV Navigation without Collision using Visual Information in Airsim

reinforcement-learning airsim depth-images ddpg td3 uav drone autonomous-quadcoptor

Python 242

3 年前

RchalYang / torchrl

#算法刷题#Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)

PyTorch sac ddpg 算法 reinforcement-learning dqn td3 ppo mujoco gym

Python 222

3 年前

reiniscimurs / DRL-robot-navigation-IR-SIM

Deep Reinforcement Learning for mobile robot navigation in IR-SIM simulation. Using DRL (SAC, TD3, PPO, DDPG) neural networks, a robot learns to navigate to a random goal point in a simulated environm...

ppo sac td3 ddpg ddpg-pytorch

Python 176

3 天前