#大语言模型#A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
#计算机科学#A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"
Easily train AlphaZero-like agents on any environment you want!
#计算机科学#MCTS project for Tetris
翻译 - 俄罗斯方块的MCTS项目
#计算机科学#A student implementation of Alpha Go Zero
A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
#计算机科学#A Deep Learning UCI-Chess Variant Engine written in C++ & Python 🦜
A pytorch tutorial for DRL(Deep Reinforcement Learning)
#算法刷题#An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
#计算机科学#A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
#学习与技能提升#Reinforcement learning models in ViZDoom environment
#计算机科学#Research project: create a chess engine using Deep Reinforcement Learning