actor-critic-algorithm“ 的搜索结果

#计算机科学#Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros

Python1.1 k
1 年前

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Python1.86 k
1 年前

Reinforcement learning baseline agent trained with the Actor-critic (A3C) algorithm.

Python272
6 年前

This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.

Python261
5 年前

The source code for "An Actor Critic Algorithm for Structured Prediction"

Jupyter Notebook166
8 年前

#计算机科学#Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Python1.34 k
2 年前

Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO

Python205
3 年前

Advantage async actor-critic Algorithms (A3C) and Progressive Neural Network implemented by tensorflow.

Python120
9 年前

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

Python114
8 年前

Pytorch implementation of the MARL algorithm, MADDPG, which correspondings to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".

Python647
3 年前

Soft Actor-Critic

Python1.14 k
2 年前

DSAC; Distributional Soft Actor-Critic

Python130
7 个月前

Trading with recurrent actor-critic reinforcement learning

Jupyter Notebook439
3 年前

PyTorch implementation of Soft Actor-Critic(SAC).

Python105
5 年前

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook1.28 k
6 个月前
Python46
8 年前

DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic

Python396
2 个月前
Python251
5 年前

General implementation of Advantage Actor Critic using Pytorch

Python24
4 年前

advantage actor-critic reinforcement learning for openai gym cartpole

Python64
8 年前
Python324
4 年前
loading...