GitHub 中文社区

回车: Github搜索 Shift+回车: Google搜索

©2025 GitHub中文社区论坛 GitHub官网网站地图 GitHub官方翻译

GitHub on X
GitHub on Facebook
GitHub on LinkedIn
GitHub on YouTube
GitHub on Twitch
GitHub on TikTok
GitHub’s organization on GitHub

集合主题趋势排行榜

#

self-play

Website
Wikipedia

suragnair / alpha-zero-general

#计算机科学#A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Tensorflow PyTorch Keras gobang alpha-zero alphago-zero alphago reinforcement-learning self-play mcts monte-carlo-tree-search 深度学习 alphazero 神经网络

Jupyter Notebook 4.1 k

3 个月前

opendilab / DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

reinforcement-learning multiagent-reinforcement-learning self-play imitation-learning inverse-reinforcement-learning exploration-exploitation distributed-system Python impala smac atari mujoco r2d2 reinforcement-learning-algorithms pytorch-rl model-based-reinforcement-learning

Python 3.36 k

11 天前

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

alphazero atari continuous-control monte-carlo-tree-search muzero PyTorch reinforcement-learning mcts board-game gym self-play

Python 1.33 k

1 天前

opendilab / DI-star

#计算机科学#An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.

翻译 - 星际争霸II中的OpenDILab决策AI

reinforcment-learning starcraft2 self-play 人工智能深度学习 league deep-reinforcement-learning

Python 1.26 k

1 个月前

#计算机科学#The official implementation of Self-Play Fine-Tuning (SPIN)

深度学习 fine-tuning large-language-models self-play

Python 1.14 k

1 年前

#计算机科学#The official implementation of Self-Play Preference Optimization (SPPO)

深度学习 fine-tuning large-language-models rlhf self-play

Python 523

3 个月前

inspirai / TimeChamber

A Massively Parallel Large Scale Self-Play Framework

deep-reinforcement-learning reinforcement-learning self-play multi-agent

Python 340

2 年前

ChuaCheowHuan / gym-continuousDoubleAuction

A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.

multi-agent-reinforcement-learning gym-environment limit-order-book high-frequency-trading ray rllib financial-engineering self-play ppo quantitative-finance quantitative-trading marl lstm

Jupyter Notebook 144

2 年前

Naton1 / osrs-pvp-reinforcement-learning

#计算机科学#Train a neural network to PvP in Old School RuneScape using reinforcement learning.

人工智能深度学习 gym Java 机器学习 oldschool-runescape osrs ppo Python PyTorch reinforcement-learning rsps runescape self-play

Java 106

1 年前

blanyal / alpha-zero

#计算机科学#AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement ...

alphazero alpha-zero alphago-zero Tensorflow reinforcement-learning mcts self-play game 深度学习机器学习 resnet tic-tac-toe deepmind

Python 88

7 年前

seungeunrho / football-paris

The exact codes used by the team "liveinparis" at the kaggle football competition ranked 6th/1141

self-play reinforcement-learning PyTorch ppo kaggle

Python 57

4 年前

cestpasphoto / alpha-zero-general

A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available

alphago alphago-zero alphazero Python PyTorch reinforcement-learning numba self-play

Python 46

3 个月前

dellalibera / td-gammon

TD-Gammon implementation

人工智能 reinforcement-learning 神经网络 PyTorch convolutional-neural-networks self-play game

Python 45

2 年前

dellalibera / gym-backgammon

Backgammon OpenAI Gym

gym reinforcement-learning self-play game openai-gym 人工智能

Python 44

1 年前

tobiasemrich / SchafkopfRL

AI agents for the bavarian card game Schafkopf trained with reinforcement learning

collectible-card-game ppo reinforcement-learning self-play PyTorch

Python 38

1 年前

Sebastian-Schuchmann / Self-Play-TicTacToe-AI-ML-Agents-

#计算机科学#A Self Play reinforcement learning Agent learns to play TicTacToe using the ML-Agents Framework in Unity.

人工智能机器学习 reinforcement-learning ml-agents Unity self-play 神经网络 Tensorflow

C# 37

2 年前

ShibiHe / Model-Free-Episodic-Control

This is the implementation of paper Model Free Episodic Control

openai-gym deep knn NumPy self-play game-theory

Python 35

6 年前

sirmammingtonham / alphastone

#计算机科学#Using self-play, MCTS, and a deep neural network to create a hearthstone ai player

alpha-zero self-play monte-carlo-tree-search 深度学习 deep-reinforcement-learning PyTorch hearthstone 人工智能

Python 29

6 年前

Code base for Social Robot Tree Search (SoRTS).

Python 25

1 年前

mbaske / ml-selfplay-fighter

Self-Play Boxing Match made with Unity Machine Learning Agents

Unity ml-agents self-play

C# 23

3 年前

loading...