PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
Efficient Batched Reinforcement Learning in TensorFlow
#计算机科学#Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
翻译 - 超级马里奥兄弟的近距离策略优化(PPO)算法
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
This is the official implementation of Multi-Agent PPO (MAPPO).
PPO implementation for OpenAI gym environment based on Unity ML Agents
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
翻译 - DQN、AC、ACER、A2C、A3C、PG、DDPG、TRPO、PPO、SAC、TD3 和...的 PyTorch 实现。
Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty
PyTorch implements multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise Control+PPO, Grid Wise Control+DDPG.
Trading Environment(OpenAI Gym) + PPO(TensorForce)
Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
Proximal Policy Optimization (PPO) algorithm for Contra
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
PyTorch implementation of GAIL and AIRL based on PPO.
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
Proximal Policy Optimization(PPO) with Keras Implementation
Lightweight multi-agent PPO for IEEE field.
Compare PPO implementation performance on microrts gym env