a3c“ 的搜索结果

#计算机科学#PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Python1.29 k
6 年前

Simple A3C implementation with pytorch + multiprocessing

Python653
3 年前

Advantage async actor-critic Algorithms (A3C) and Progressive Neural Network implemented by tensorflow.

Python120
9 年前

#计算机科学#Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros

Python1.1 k
1 年前

Implementation of Meta-RL A3C algorithm

Jupyter Notebook404
9 年前

Trading with recurrent actor-critic reinforcement learning

Jupyter Notebook437
3 年前
Python4.45 k
2 年前

A3C LSTM Atari with Pytorch plus A3G design

Python571
2 年前

#计算机科学#Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.

Python66
8 年前

Accompanying repository for Let's make a DQN / A3C series.

Python395
7 年前

Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.

Python656
6 年前

Reinforcement learning baseline agent trained with the Actor-critic (A3C) algorithm.

Python270
6 年前

Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)

Python545
5 年前

A high-performance Atari A3C agent in 180 lines of PyTorch

Python171
4 年前

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

Python114
8 年前

A continuous action space version of A3C LSTM in pytorch plus A3G design

Python258
1 年前

Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.

Python151
2 年前

#计算机科学#Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

Python416
4 年前

using actor-critic method to dealing with the path-planning UAV problem

Python13
6 年前

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

Jupyter Notebook100
6 年前

Decentralized Multi-Agent Exploration on ROS with Distributed Deep Reinforcement Learning using A3C Algorithm

C++35
4 年前
loading...