Code for the paper "Phasic Policy Gradient"
Using Keras and Deep Deterministic Policy Gradient to play TORCS
Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm
Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
random search, hill climbing, policy gradient
Code for the paper "Evolved Policy Gradients"
A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
Stochastic Variance Reduction Policy Gradient Estimation
Learning Heuristics for the TSP by Policy Gradient
Reinforcement Learning using Policy Gradient to solve OpenAI Gym games
Implementation of Sequence Generative Adversarial Nets with Policy Gradient in PyTorch
Deep Deterministic Policy Gradient implemented in PyTorch for DeepMind Control Suite
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
Conditional Sequence Generative Adversarial Network trained with policy gradient, Implementation in Tensorflow
This project provides a stock market environment using OpenGym with Deep Q-learning and Policy Gradient.
A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers,...
Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym environments