#Computer Science#Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
#Computer Science#A curated list of reinforcement learning with human feedback resources (continually updated)
#Computer Science#Open-source pre-training implementation of Google's LaMDA in PyTorch. Adds RLHF similar to ChatGPT.
#Data Warehouse#Let's build better datasets, together!
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
#Large Language Models#The ParroT framework, which enhances and regulates translation abilities during chat based on open-source LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human-written translation and evaluation data.
#Large Language Models#Implementation of Reinforcement Learning from Human Feedback (RLHF)
#Large Language Models#Product analytics for AI Assistants
#Data Warehouse#BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
Dataset Viber is your chill repo for data collection, annotation and vibe checks.
#Data Warehouse#[ECCV 2024] Towards Reliable Advertising Image Generation Using Human Feedback
#Large Language Models#Code for the paper "Aligning LLM Agents by Learning Latent Preference from User Edits".
[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"
[NeurIPS 2023] Official codebase for "Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback"
Reinforcement Learning from Human Feedback with 🤗 TRL (see the minimal sketch after this list)
#Computer Science#Search Engine Optimization using Human Implicit Feedback
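
To give a sense of the training loop the RLHF repositories above implement, here is a minimal sketch using 🤗 TRL's PPOTrainer. It is not the code of any listed repository: the base model (gpt2), the prompts, and the length-based reward_fn are placeholder assumptions, and the API shown follows older trl releases (around 0.7); newer versions restructure PPOTrainer. A real pipeline would score responses with a reward model trained on human preference data.

```python
# Minimal RLHF loop with trl's PPOTrainer (trl ~0.7 API; illustrative only).
import torch
from transformers import AutoTokenizer
from trl import AutoModelForCausalLMWithValueHead, PPOConfig, PPOTrainer

model_name = "gpt2"  # small placeholder model
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token

# Policy with a value head, plus a reference copy used for the KL penalty.
model = AutoModelForCausalLMWithValueHead.from_pretrained(model_name)
ref_model = AutoModelForCausalLMWithValueHead.from_pretrained(model_name)

config = PPOConfig(model_name=model_name, learning_rate=1.41e-5,
                   batch_size=2, mini_batch_size=1)
ppo_trainer = PPOTrainer(config, model, ref_model, tokenizer)

def reward_fn(texts):
    # Placeholder reward that prefers longer responses; a real setup would
    # use a reward model trained on human preference comparisons.
    return [torch.tensor(float(len(t.split()))) for t in texts]

prompts = ["The movie was", "I think this product is"]
query_tensors = [tokenizer(p, return_tensors="pt").input_ids.squeeze(0) for p in prompts]

# Generate with the current policy, score the responses, take one PPO step.
response_tensors = ppo_trainer.generate(
    query_tensors, return_prompt=False, max_new_tokens=16,
    pad_token_id=tokenizer.eos_token_id,
)
responses = [tokenizer.decode(r, skip_special_tokens=True) for r in response_tensors]
rewards = reward_fn(responses)
ppo_trainer.step(query_tensors, response_tensors, rewards)
print(list(zip(responses, [r.item() for r in rewards])))
```

The reference model keeps the policy close to its pre-trained behavior through a KL penalty, which is the standard way these RLHF implementations stabilize training.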