A demo of inference and deployment for Qwen (Tongyi Qianwen) with vLLM
The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
Efficient platform for inference and serving of local LLMs, including an OpenAI-compatible API server.
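Several projects in this list expose an OpenAI-compatible API server. As a rough illustration, the sketch below builds (but does not send) a standard `/v1/chat/completions` request against such a server; the base URL, port, and model name are assumptions for illustration, not taken from any specific project here.

```python
# Hypothetical sketch: constructing a chat-completion request for an
# OpenAI-compatible endpoint such as the one vLLM's API server exposes.
# The URL, port, and model name below are illustrative assumptions.
import json
import urllib.request

def build_chat_request(base_url: str, model: str, prompt: str) -> urllib.request.Request:
    """Construct (but do not send) an OpenAI-style chat-completions request."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.7,
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_chat_request("http://localhost:8000", "qwen-7b-chat", "Hello")
```

Because the request follows the OpenAI wire format, the same client code works against vLLM's server, an OpenAI endpoint, or any other compatible gateway by changing only `base_url` and `model`.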
Custom Websearch Agent Built with Local Models, vLLM, and OpenAI
Supports mixed-precision inference with vLLM
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
A fork of github.com/vllm-project/vllm
An Open-source Toolkit for LLM Development
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
A demo showcasing vLLM's impressive performance on Chinese large language models
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such ...
An open-source Chinese-English educational dialogue model from ICALK, East China Normal University (general-purpose base model, GPU deployment, data cleaning). Tribute to: LLaMA, MOSS, BELLE, Ziya, vLLM