编程语言

Python
Jupyter Notebook

”td3“ 的搜索结果

TD3

@sfujim

Author's PyTorch implementation of TD3 for OpenAI gym tasks

Python1.9 k

2 年前

Google Bing GitHub

Deep-reinforcement-learning-with-pytorch

@sweetice

#算法刷题#PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

policy-gradient PyTorch actor-critic-algorithm alphago deep-reinforcement-learning

Python4.37 k

2 年前

TD3_BC

@sfujim

Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL

Python362

4 年前

cleanrl

@vwxyzjn

#计算机科学#High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

wandb reinforcement-learning PyTorch Python gym

Python7.4 k

11 小时前

Twin-TD3

@yjwong1999

IEEE WCNC 2023: Deep Reinforcement Learning for Secrecy Energy-Efficient UAV Communication with Reconfigurable Intelligent Surfaces

deep-reinforcement-learning

Python134

2 年前

DRL-code-pytorch

@Lizhi-sjtu

Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.

ddpg-pytorch dqn-pytorch PyTorch

Python1.34 k

2 年前

GLKit_TD3D

@codebots-ltd

3D game prototype using GLKit and ARC under iOS 5

131

13 年前

TD3-BipedalWalkerHardcore-v2

@XinJingHao

Solve BipedalWalkerHardcore-v2 with TD3

reinforcement-learning-algorithms robot

Python90

2 年前

TD3-PyTorch-BipedalWalker-v2

@nikhilbarhate99

Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment

ddpg td3 deep-reinforcement-learning openai-gym

Python106

6 年前

LSTM-TD3

@LinghengMeng

The implementation of LSTM-TD3.

Python81

2 年前

MATD3

@ZiyuanMa

An implementation of multi-agent TD3 with paddlepaddle and parl

Python9

3 年前

torchrl

@RchalYang

#算法刷题#Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)

PyTorch sac ddpg 算法

Python223

3 年前

Policy-Gradient-Methods存档

@cyoon1729

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

reinforcement-learning pytorch-rl PyTorch ddpg

Jupyter Notebook99

6 年前

mujoco-benchmark

@ChenDRAG

Provide full reinforcement learning benchmark on mujoco environments, including ddpg, sac, td3, pg, a2c, ppo, library

PyTorch mujoco benchmark baseline

4 年前

Popular-RL-Algorithms

@quantumiracle

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

reinforcement-learning soft-actor-critic state-of-the-art

Jupyter Notebook1.26 k

4 个月前

DRL-Pytorch

@XinJingHao

#计算机科学#Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

deep-reinforcement-learning PyTorch reinforcement-learning 机器学习

Python2.73 k

1 个月前

DeepRL_Algorithms

@RITCHIEHuang

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

reinforcement-learning-algorithms pytorch-implementation deep-reinforcement-learning dqn

Python334

2 年前

MATD3

@Emmanuel-Naive

#计算机科学#Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships

Python deep-learning deep-reinforcement-learning PyTorch reinforcement-learning

Python64

3 年前

Deep-Reinforcement-Learning-Algorithms

@Rafael1s

32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.

dqn td3

Jupyter Notebook911

4 年前

CORL存档

@tinkoff-ai

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

d4rl gym offline-reinforcement-learning reinforcement-learning

Python1.22 k

2 年前

Deep-Reinforcement-Learning-with-pytorch

@LxzGordon

#算法刷题#Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,DDPG for discrete action space, A2C, A3C, TD3, SAC, TRPO

PyTorch 算法 reinforcement-learning dqn ddpg

Python92

4 年前

DRL-robot-navigation

@reiniscimurs

#计算机科学#Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal...

deep-reinforcement-learning deep-learning td3 ros

Python1.02 k

1 个月前

DRL-Robot-Navigation-ROS2

@reiniscimurs

#计算机科学#Deep Reinforcement Learning for mobile robot navigation in ROS2 Gazebo simulator. Using DRL (SAC, TD3) neural networks, a robot learns to navigate to a random goal point in a simulated environment whi...

deep-learning 深度神经网络 deep-reinforcement-learning

Python106

5 个月前

编程语言

”td3“ 的搜索结果

相关主题