Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Large Context Attention
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
LLM KV cache compression made easy
Implementation of Recurrent Memory Transformer (NeurIPS 2022 paper) in PyTorch
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
Code for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Awesome LLM Plaza: daily tracking of all sorts of awesome LLM topics, e.g. LLMs for coding, robotics, reasoning, multimodality, etc.
Open-source code for the paper "Retrieval Head Mechanistically Explains Long-Context Factuality"
ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models
LongQLoRA: Extend Context Length of LLMs Efficiently
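Several repositories above (the KV cache compression toolkit, InfLLM, Ring Attention) share one goal: bounding attention memory so context length can grow. As a minimal illustrative sketch only, and not the algorithm of any specific repo listed here, the toy cache below evicts the oldest key/value pairs beyond a fixed window, the simplest form of KV cache compression:

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over the last axis.
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

class SlidingWindowKVCache:
    """Toy KV cache that keeps only the most recent `window` entries.

    Real systems use smarter eviction (attention-score-based, block-level,
    retrieval-augmented); this is just the sliding-window special case.
    """
    def __init__(self, window):
        self.window = window
        self.keys = []
        self.values = []

    def append(self, k, v):
        self.keys.append(k)
        self.values.append(v)
        if len(self.keys) > self.window:
            # Evict the oldest entry so memory stays O(window).
            self.keys.pop(0)
            self.values.pop(0)

    def attend(self, q):
        # Standard scaled dot-product attention over the cached window.
        K = np.stack(self.keys)
        V = np.stack(self.values)
        scores = softmax(q @ K.T / np.sqrt(q.shape[-1]))
        return scores @ V

rng = np.random.default_rng(0)
cache = SlidingWindowKVCache(window=4)
for _ in range(10):
    cache.append(rng.normal(size=8), rng.normal(size=8))
out = cache.attend(rng.normal(size=8))
print(len(cache.keys))  # 4 — cache never grows past the window
```

The design trade-off this illustrates: eviction makes per-token cost constant, but distant tokens become unreachable, which is exactly the gap methods like InfLLM's training-free memory and retrieval heads aim to close.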