[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward (see the loss sketch after this list)
[ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering
[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$
Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).
[ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"
DPO-Shift: Shifting the Distribution of Direct Preference Optimization
Survey of preference alignment algorithms
Generate synthetic datasets for instruction tuning and preference alignment of large language models using tools like `distilabel` for efficient and scalable data creation.
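Below is a minimal sketch of the SimPO objective from the first entry above, not the official implementation: SimPO replaces DPO's reference-model reward with a length-normalized log-probability of the policy itself and adds a target reward margin `gamma`. The function signature, the helper inputs (`*_logps`, `*_lens`), and the default hyperparameter values are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def simpo_loss(chosen_logps: torch.Tensor,
               rejected_logps: torch.Tensor,
               chosen_lens: torch.Tensor,
               rejected_lens: torch.Tensor,
               beta: float = 2.0,
               gamma: float = 0.5) -> torch.Tensor:
    """Sketch of the SimPO loss.

    chosen_logps / rejected_logps: summed token log-probs of each response
    under the current policy; chosen_lens / rejected_lens: response lengths
    in tokens. These inputs are assumed to be computed from your own model
    and tokenizer; the default beta/gamma values are illustrative only.
    """
    # Reference-free, length-normalized implicit rewards.
    chosen_reward = beta * chosen_logps / chosen_lens
    rejected_reward = beta * rejected_logps / rejected_lens
    # Bradley-Terry-style logistic loss with a target reward margin gamma.
    return -F.logsigmoid(chosen_reward - rejected_reward - gamma).mean()
```

Because no reference model is needed, the per-pair inputs reduce to the policy's log-probabilities and the response lengths, which keeps the training loop close to a standard DPO setup minus the frozen reference forward pass.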