A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
#Computer Science# DrQ: Data regularized Q