#大语言模型#Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
#大语言模型#面向所有人的对话式 AI,我们相信我们即将创造一场革命,正如 Stable Diffusion 改变了现代艺术的创作过程, 我们将透过对话式 AI 来改变世界.
#自然语言处理#中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
#大语言模型#Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
#大语言模型#Robust recipes to align language models with human and AI preferences
#自然语言处理#Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
#计算机科学#A curated list of reinforcement learning with human feedback resources (continually updated)
#大语言模型#Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
#计算机科学#The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.
Align Anything: Training All-modality Model with Feedback
A Doctor for your data
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
#自然语言处理#An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
#数据仓库#Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation