GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

rlhf

Website
Wikipedia
hiyouga/LLaMA-Factory
https://static.github-zh.com/github_avatars/hiyouga?size=40
hiyouga / LLaMA-Factory

#自然语言处理#Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

fine-tuningllama大语言模型pefttransformersrlhfqloraquantizationqweninstruction-tuninggptloralarge-language-modelsagent人工智能moellama3deepseekgemma自然语言处理
Python 54 k
2 天前
https://static.github-zh.com/github_avatars/LAION-AI?size=40
LAION-AI / Open-Assistant

#大语言模型#面向所有人的对话式 AI,我们相信我们即将创造一场革命,正如 Stable Diffusion 改变了现代艺术的创作过程, 我们将透过对话式 AI 来改变世界.

ChatGPTlanguage-modelrlhf人工智能assistantdiscord-bot机器学习NextPython
Python 37.41 k
1 年前
https://static.github-zh.com/github_avatars/RUCAIBox?size=40
RUCAIBox / LLMSurvey

#自然语言处理#大语言模型综述

chain-of-thoughtChatGPTin-context-learninginstruction-tuninglarge-language-models大语言模型自然语言处理pre-trained-language-modelspre-trainingrlhf
Python 11.66 k
4 个月前
ymcui/Chinese-LLaMA-Alpaca-2
https://static.github-zh.com/github_avatars/ymcui?size=40
ymcui / Chinese-LLaMA-Alpaca-2

#自然语言处理#中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

alpacallama大语言模型llama-2large-language-models自然语言处理alpaca-2flash-attentionllama2alpaca2Yarnrlhf
Python 7.16 k
10 个月前
https://static.github-zh.com/github_avatars/InternLM?size=40
InternLM / InternLM

#大语言模型#Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

聊天机器人gpt大语言模型long-contextrlhffine-tuning-llm中文flash-attentionpretrained-models
Python 6.97 k
5 个月前
huggingface/alignment-handbook
https://static.github-zh.com/github_avatars/huggingface?size=40
huggingface / alignment-handbook

#大语言模型#Robust recipes to align language models with human and AI preferences

大语言模型rlhftransformers
Python 5.25 k
2 个月前
argilla-io/argilla
https://static.github-zh.com/github_avatars/argilla-io?size=40
argilla-io / argilla

#自然语言处理#Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

human-in-the-loop自然语言处理mlopsdeveloper-toolstext-labelingannotation-tool机器学习active-learningweak-supervisiontext-annotation大语言模型人工智能gpt-4rlhflangchain
Python 4.56 k
10 天前
https://static.github-zh.com/github_avatars/PKU-Alignment?size=40
PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

large-language-modelsmultimodalrlhfchameleondpovision-language-model
Jupyter Notebook 4.17 k
1 个月前
https://static.github-zh.com/github_avatars/opendilab?size=40
opendilab / awesome-RLHF

#计算机科学#A curated list of reinforcement learning with human feedback resources (continually updated)

深度学习deep-reinforcement-learninghuman-feedbackreinforcement-learningrlhflarge-language-models
4.04 k
6 天前
Kiln-AI/Kiln
https://static.github-zh.com/github_avatars/Kiln-AI?size=40
Kiln-AI / Kiln

#计算机科学#The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.

人工智能chain-of-thoughtcollaborationdataset-generationfine-tuning机器学习macOSollamaopenaipromptprompt-engineeringPythonrlhfsynthetic-dataWindowsevalsevaluation
Python 3.91 k
12 小时前
https://static.github-zh.com/github_avatars/hiyouga?size=40
hiyouga / ChatGLM-Efficient-Tuning

#大语言模型#Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

chatglmChatGPTfine-tuningloraalpacapefthuggingfacelanguage-modeltransformersPyTorchrlhfchatglm2qlora
Python 3.71 k
2 年前
https://static.github-zh.com/github_avatars/transformerlab?size=40
transformerlab / transformerlab-app

Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.

Electronllama大语言模型lorarlhftransformersMLXdiffusiondiffusion-modelsstability-diffusion
TypeScript 3.54 k
2 天前
Docta-ai/docta
https://static.github-zh.com/github_avatars/Docta-ai?size=40
Docta-ai / docta

A Doctor for your data

datadata-centric-aidata-centric-machine-learningdata-curationdata-diagnosislanguage-modelrlhf
Python 3.35 k
6 个月前
argilla-io/distilabel
https://static.github-zh.com/github_avatars/argilla-io?size=40
argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

人工智能huggingface大语言模型openaiPythonrlhfsynthetic-datasynthetic-dataset-generation
Python 2.8 k
3 天前
https://static.github-zh.com/github_avatars/tatsu-lab?size=40
tatsu-lab / alpaca_eval

#自然语言处理#An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

深度学习evaluationfoundation-modelsinstruction-followinglarge-language-modelsleaderboard自然语言处理rlhf
Jupyter Notebook 1.79 k
6 个月前
https://static.github-zh.com/github_avatars/THUDM?size=40
THUDM / WebGLM

#大语言模型#WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

ChatGPT大语言模型rlhfwebglm
Python 1.6 k
4 个月前
https://static.github-zh.com/github_avatars/PKU-Alignment?size=40
PKU-Alignment / safe-rlhf

#数据仓库#Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

ai-safetyalpaca数据集deepspeedlarge-language-modelsllama大语言模型reinforcement-learningreinforcement-learning-from-human-feedbackrlhftransformersvicunasafetygpttransformerbeaver
Python 1.5 k
1 年前
https://static.github-zh.com/github_avatars/THUDM?size=40
THUDM / ImageReward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

diffusion-modelsgenerative-modelrlhf
Python 1.47 k
6 个月前
https://static.github-zh.com/github_avatars/RLHFlow?size=40
RLHFlow / RLHF-Reward-Modeling

#大语言模型#Recipes to train reward model for RLHF.

大语言模型rlhfllama3
Python 1.4 k
3 个月前
https://static.github-zh.com/github_avatars/alibaba?size=40
alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

agenticrlhf
Python 1.39 k
9 天前
loading...