#大语言模型#Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
翻译 - 多巴胺是用于强化学习算法的快速原型制作的研究框架。
An elegant PyTorch deep reinforcement learning library.
翻译 - 优雅,灵活和超快的PyTorch深度强化学习平台。
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
翻译 - ELF:AlphaGoZero / AlphaZero重新实现的游戏研究平台
#计算机科学#A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
#计算机科学#Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
翻译 - 英特尔AI实验室的强化学习教练可轻松进行最新的强化学习算法实验
#计算机科学#Implementation of papers in 100 lines of code.
#计算机科学#[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
#大语言模型#Distributed RL System for LLM Reasoning
#计算机科学#Python library for Reinforcement Learning.
#大语言模型#Understanding R1-Zero-Like Training: A Critical Perspective
#计算机科学#[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
翻译 - SEED RL:具有加速的中央推理功能的可扩展,高效的Deep-RL。使用SEED的体系结构在TF2中实现IMPALA和R2D2算法。
#计算机科学#Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab
#大语言模型#Implementation of all RL algorithms in a simpler way