[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
An easy-to-use package for implementing SmoothQuant for LLMs
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
Works with SmoothQuant and LLM-AWQ.
Inject errors into LLMs
📖A curated list of Awesome LLM Inference Papers with codes. 🎉🎉
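For context on the SmoothQuant entry above: the core idea of the paper is to migrate activation outliers into the weights via a per-channel scaling that leaves the layer's output mathematically unchanged, making both tensors easier to quantize. Below is a minimal NumPy sketch of that smoothing transform; `smooth` is a hypothetical helper written for illustration, not the API of any repository listed here.

```python
import numpy as np

def smooth(x, w, alpha=0.5):
    """SmoothQuant-style smoothing (illustrative, not a repo API).

    x: activations, shape (tokens, d_in); w: weight, shape (d_in, d_out).
    Uses per-input-channel scales s_j = max|X_j|^alpha / max|W_j|^(1-alpha)
    so that (x / s) @ (s * w) == x @ w, but with flattened activation outliers.
    """
    act_max = np.abs(x).max(axis=0)          # per-channel activation range
    w_max = np.abs(w).max(axis=1)            # per-input-channel weight range
    s = act_max**alpha / w_max**(1 - alpha)  # smoothing factors, shape (d_in,)
    return x / s, w * s[:, None]             # equivalent layer, easier to quantize

rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8)) * np.array([1, 1, 50, 1, 1, 1, 1, 1.0])  # outlier channel 2
w = rng.normal(size=(8, 3))
x_s, w_s = smooth(x, w)
assert np.allclose(x @ w, x_s @ w_s)  # product unchanged by smoothing
```

The outlier channel's activation range shrinks after smoothing, which is what lets INT8 activation quantization keep accuracy in the SmoothQuant setting.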