#Computer Science# PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym).
#Computer Science# High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Self-hosted FLOSS fitness/workout, nutrition, and weight tracker
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
#Computer Science# Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
Deepdrive is an end-to-end self-driving simulator that allows anyone with a PC to push the state of the art in self-driving
#Computer Science# Long-Term Evolution Project of Reinforcement Learning
#Computer Science# DrQ: Data-regularized Q
#Computer Science# A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations