”policy-gradient“ 的搜索结果

SeqGAN

@LantaoYu

Implementation of Sequence Generative Adversarial Nets with Policy Gradient

Python2.09 k

6 年前

phasic-policy-gradient

OpenAI@openai

Code for the paper "Phasic Policy Gradient"

Python253

2 年前

DDPG-Keras-Torcs

@yanpanlau

Using Keras and Deep Deterministic Policy Gradient to play TORCS

Python717

7 年前

policy-gradient

Keon@keon

Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras

Python158

5 年前

Multi-Agent-Deep-Deterministic-Policy-Gradients

@philtabor

A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm

Python303

4 年前

pg_travel

@reinforcement-learning-kr

Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)

Python368

5 年前

pytorch-ddpg

@ghliu

Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch

Python573

6 年前

openai-cartpole

@kvfrans

random search, hill climbing, policy gradient

Python140

6 年前

EPG

OpenAI@openai

Code for the paper "Evolved Policy Gradients"

Python249

6 年前

pytorch-maddpg

@xuehy

A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)

Python616

6 年前

SVRG

@tianbingsz

Stochastic Variance Reduction Policy Gradient Estimation

Python11

6 年前

encode-attend-navigate

@MichelDeudon

Learning Heuristics for the TSP by Policy Gradient

Jupyter Notebook115

3 年前

openai-gym-policy-gradient

@gabrielgarza

Reinforcement Learning using Policy Gradient to solve OpenAI Gym games

Python109

7 年前

SeqGAN-PyTorch

@X-czh

Implementation of Sequence Generative Adversarial Nets with Policy Gradient in PyTorch

Jupyter Notebook53

4 年前

DDPG-PyTorch

@samlanka

Deep Deterministic Policy Gradient implemented in PyTorch for DeepMind Control Suite

Python25

6 年前

Policy-Gradient-Methods

@cyoon1729

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

Jupyter Notebook92

5 年前

Conditional-SeqGAN-Tensorflow

@andi611

Conditional Sequence Generative Adversarial Network trained with policy gradient, Implementation in Tensorflow

Python48

6 年前

stock_market_reinforcement_learning

@kh-kim

This project provides a stock market environment using OpenGym with Deep Q-learning and Policy Gradient.

Python791

8 年前

seqGAN

@suragnair

A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)

Python642

6 年前

Reinforcement_learning_tutorial_with_demo

@omerbsezer

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers,...

Jupyter Notebook734

6 年前

ddpg-aigym

@stevenpjg

Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym environments

Python275

7 年前

UAV-DDPG

@fangvv

Code for paper "Computation Offloading Optimization for UAV-assisted Mobile Edge Computing: A Deep Deterministic Policy Gradient Approach"

Python442

1 年前

”policy-gradient“ 的搜索结果

编程语音