#大语言模型# 面向所有人的对话式 AI,我们相信我们即将创造一场革命,正如 Stable Diffusion 改变了现代艺术的创作过程, 我们将透过对话式 AI 来改变世界.
#大语言模型# Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
#大语言模型# Official release of InternLM2.5 base and chat models. 1M context support
#大语言模型# Robust recipes to align language models with human and AI preferences
#计算机科学# A curated list of reinforcement learning with human feedback resources (continually updated)
#自然语言处理# An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
SimPO: Simple Preference Optimization with a Reference-Free Reward