”ppo“ 的搜索结果 | GitHub 中文社区

PPOxFamily

@opendilab

PPO x Family DRL Tutorial Course（决策智能入门级公开课：8节课帮你盘清算法理论，理顺代码逻辑，玩转决策AI应用实践）

course decision-intelligence Python reinforcement-learning deep-reinforcement-learning

Python2 k

6 个月前🇨🇳

Google Bing GitHub

ppo pytorch python reinforcement-learning alphago deep-reinforcement-learning ppo2 a3c mario gym

PPO-PyTorch

@nikhilbarhate99

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Python1.75 k

5 个月前

pytorch-a2c-ppo-acktr-gail

@ikostrikov

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...

Python3.61 k

2 年前

batch-ppo

Google Research@google-research

Efficient Batched Reinforcement Learning in TensorFlow

Python967

6 年前

Super-mario-bros-PPO-pytorch

@vietnh1009

#计算机科学#Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

翻译 - 超级马里奥兄弟的近距离策略优化（PPO）算法

reinforcement-learning ppo ppo2 PyTorch gym

Python1.1 k

3 年前

PPO-for-Beginners

@ericyangyu

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

Python785

2 个月前

ppo-implementation-details

@vwxyzjn

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python651

8 个月前

on-policy

@marlbenchmark

This is the official implementation of Multi-Agent PPO (MAPPO).

Python1.34 k

4 个月前

PPO

@EmbersArc

PPO implementation for OpenAI gym environment based on Unity ML Agents

Python148

7 年前

Deep-reinforcement-learning-with-pytorch

@sweetice

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

翻译 - DQN、AC、ACER、A2C、A3C、PG、DDPG、TRPO、PPO、SAC、TD3 和...的 PyTorch 实现。

policy-gradient PyTorch actor-critic-algorithm alphago deep-reinforcement-learning

Python3.99 k

2 年前

PPO-clip-and-PPO-penalty-on-Atari-Domain

@ChengTsang

Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty

Python57

6 年前

Multi-Agent-Reinforcement-Learning

@yangchen1997

PyTorch implements multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise Control+PPO, Grid Wise Control+DDPG.

Python194

1 年前

tf_deep_rl_trader

@miroblog

Trading Environment(OpenAI Gym) + PPO(TensorForce)

Python242

2 年前

pg_travel

@reinforcement-learning-kr

Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)

Python368

5 年前

Contra-PPO-pytorch

@vietnh1009

Proximal Policy Optimization (PPO) algorithm for Contra

Python135

1 年前

ppo

@drawcall

ppo is a super small and useful utils library for JavaScript 🐝🐜

JavaScript105

4 年前

ppo

@takuseno

Proximal Policy Optimization implementation with TensorFlow

Python102

6 年前

LLM-RLHF-Tuning

@Joyce94

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Python375

1 年前

gail-airl-ppo.pytorch

@toshikwa

PyTorch implementation of GAIL and AIRL based on PPO.

Python153

4 年前

ppo-pytorch

@adik993

Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)

Python127

6 年前

RL-PPO-Keras

@liziniu

Proximal Policy Optimization(PPO) with Keras Implementation

Python13

4 年前

MAPPO

@LoveDoveDog

Lightweight multi-agent PPO for IEEE field.

Python5

3 年前

Carla-ppo

@bitsauce

This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, b...

Python228

3 年前

microrts-ppo-comparison

@kronion

Compare PPO implementation performance on microrts gym env

Python6

3 年前

编程语音

Python
Jupyter Notebook
JavaScript
ASP.NET