[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
xingtian is a componentized library for the development and verification of reinforcement learning algorithms
A structured implementation of MuZero
#计算机科学#A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
A simple implementation of MuZero algorithm for connect4 game
MiniZero: An AlphaZero and MuZero Training Framework
#计算机科学#Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variation...
#计算机科学#An implementation of MuZero in JAX.
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
#计算机科学#A C++ pytorch implementation of MuZero
A PyTorch implementation of DeepMind's MuZero agent
#计算机科学#Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations...
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
Generalized AI to perform a multitude of tasks written in python3
#计算机科学#Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and observation space.
#计算机科学#Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.