YAYI 2 is the new-generation open-source large language model developed by Zhongke Wenge (中科闻歌), pretrained on a high-quality, multilingual corpus of more than 2 trillion tokens. (Repo for YaYi 2 Chinese LLMs)
#Natural Language Processing# Foundation Architecture for (M)LLMs
#Natural Language Processing# A curated list of pretrained sentence and word embedding models
#Natural Language Processing# An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
#Natural Language Processing# A plug-and-play library for parameter-efficient tuning (Delta Tuning); see the minimal sketch after this list
#Natural Language Processing# Summarization Papers
#Natural Language Processing# Chinese Legal LLaMA (LLaMA for the Chinese legal domain)
word2vec, sentence2vec, machine reading comprehension, dialog systems, text classification, pretrained language models (e.g., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, informati...
#Natural Language Processing# Code associated with the Don't Stop Pretraining ACL 2020 paper
#Natural Language Processing# Live Training for Open-source Big Models
#Data Warehouse# Papers and Datasets on Instruction Tuning and Following. ✨✨✨
ACL 2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
#Computer Science# MWPToolkit is an open-source framework for math word problem (MWP) solvers.
#Natural Language Processing# [NeurIPS 2023] Code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
Worth-reading papers and related resources on attention mechanisms, Transformers, and pretrained language models (PLMs) such as BERT.
#Natural Language Processing# EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.
#Natural Language Processing# [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
[ACM Computing Surveys 2025] This repository collects awesome surveys, resources, and papers on Lifelong Learning with Large Language Models. (Updated regularly)
#Natural Language Processing# On Transferability of Prompt Tuning for Natural Language Processing
#Blockchain# BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection (WWW 2023)
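Several entries above (the deep prompt tuning strategy, the plug-and-play delta-tuning library, and the prompt-tuning transferability paper) share one idea: freeze the pretrained backbone and train only a small set of injected parameters. Below is a minimal sketch of that pattern; it uses the Hugging Face `peft` library with LoRA adapters purely for illustration, not the listed repos' own APIs, and the checkpoint name and hyperparameters are assumptions.

```python
# Minimal parameter-efficient tuning sketch (illustrative; not the API of the repos listed above).
from transformers import AutoModelForSequenceClassification
from peft import LoraConfig, TaskType, get_peft_model

# Any encoder checkpoint works; "bert-base-uncased" is an illustrative choice.
base_model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

# Freeze the backbone and inject small trainable low-rank matrices (LoRA).
peft_config = LoraConfig(
    task_type=TaskType.SEQ_CLS,  # sequence classification task
    r=8,                         # rank of the low-rank update (assumed value)
    lora_alpha=16,
    lora_dropout=0.1,
)
model = get_peft_model(base_model, peft_config)

# Only the injected parameters (typically well under 1% of the model) require
# gradients; the wrapped model trains with a normal Trainer or training loop.
model.print_trainable_parameters()
```

LoRA stands in here only because it is a widely supported adapter type in `peft`; the repos above implement their own variants (deep prompts, deltas), but the freeze-and-inject pattern is the same.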