4-bit quantization of LLaMA using GPTQ; a round-to-nearest 4-bit baseline sketch follows this list.
Accessible large language models via k-bit quantization for PyTorch.
Vector (and scalar) quantization in PyTorch; a nearest-neighbour codebook-lookup sketch follows this list.
Cryptocurrency (BTC, ETH) quantitative trading system. Grid-strategy practice for quantitative trading on the Binance exchange; support for other popular exchanges such as Huobi and OKEX is planned. Billed as the simplest and most reliable project for returns, with full tutorials included.
micronet, a PyTorch-based model compression and deployment library. Compression: 1) quantization: quantization-aware training (QAT); high-bit (>2b) schemes (DoReFa; Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference); 8/4/2-bit (DoReFa) and ternary/binary weights (TWN/BNN/XNOR-Net); 2) pruning: regular channel pruning, and channel pruning for regular and group convolutions; 3) group convolution structure; 4) batch-normalization folding for binarized activations.
A list of papers, docs, and code about model quantization. The repo aims to provide resources for model quantization research and is continuously improved; PRs adding missed works (papers, etc.) are welcome.
Color quantization library.
[ICCV 2023] Dataset Quantization
alibabacloud-quantization-networks
YOLOv3 quantization model v10, for off-line quantization only.
#Natural Language Processing# [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Incremental Network Quantization, K-means quantization, Iterative Pruning, Dynamic Network Surgery
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
Multi-backbone support, pruning, quantization, and knowledge distillation (KD).
Reproduction of the PACT quantization paper.
PyTorch quantization-aware training (QAT) example; a minimal eager-mode QAT sketch follows this list.
Summary and code for deep neural network quantization.
LLaMA/RWKV ONNX models, quantization, and test cases.
Library for 8-bit optimizers and quantization routines; an 8-bit optimizer usage sketch follows this list.
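For the 4-bit GPTQ entry, here is a minimal sketch of plain round-to-nearest (RTN) 4-bit affine quantization of a weight matrix. This is not GPTQ itself (GPTQ chooses roundings with second-order, Hessian-based error compensation); it only illustrates the uniform 4-bit grid such methods target. The helper name and the group size of 128 are illustrative assumptions.

```python
import torch

def quantize_rtn_4bit(w: torch.Tensor, group_size: int = 128):
    """Round-to-nearest 4-bit affine quantization over column groups.

    Baseline only: GPTQ uses the same 4-bit grid but picks roundings with
    second-order error compensation instead of plain rounding.
    """
    out_features, in_features = w.shape
    assert in_features % group_size == 0, "in_features must be divisible by group_size"
    wg = w.reshape(out_features, in_features // group_size, group_size)
    wmin = wg.amin(dim=-1, keepdim=True)
    wmax = wg.amax(dim=-1, keepdim=True)
    scale = (wmax - wmin).clamp(min=1e-8) / 15.0      # 4 bits -> levels 0..15
    zero = torch.round(-wmin / scale)                 # per-group zero point
    q = torch.clamp(torch.round(wg / scale) + zero, 0, 15)
    deq = (q - zero) * scale                          # dequantized approximation
    return q.reshape_as(w), deq.reshape_as(w)

w = torch.randn(256, 256)
q, w_hat = quantize_rtn_4bit(w)
print("mean |w - w_hat|:", (w - w_hat).abs().mean().item())
```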
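For the vector-quantization entry, a minimal sketch of the nearest-neighbour codebook lookup with a straight-through gradient estimator, written in plain PyTorch rather than against any particular library's API; the function name and tensor shapes are illustrative.

```python
import torch

def vector_quantize(x: torch.Tensor, codebook: torch.Tensor):
    """Nearest-neighbour codebook lookup with a straight-through gradient.

    x:        (batch, dim) continuous vectors
    codebook: (num_codes, dim) code vectors
    """
    dists = torch.cdist(x, codebook)      # (batch, num_codes) Euclidean distances
    indices = dists.argmin(dim=-1)        # index of the closest code per input
    quantized = codebook[indices]         # (batch, dim) selected codes
    # Straight-through estimator: forward uses the codes, backward treats the
    # quantization step as the identity so gradients reach x.
    quantized = x + (quantized - x).detach()
    return quantized, indices

x = torch.randn(8, 64, requires_grad=True)
codebook = torch.randn(512, 64)
q, idx = vector_quantize(x, codebook)
q.sum().backward()                        # gradients flow back to x
```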
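For the quantization-aware-training entry, a minimal eager-mode QAT sketch using PyTorch's torch.ao.quantization API (QuantStub/DeQuantStub, prepare_qat, convert); the tiny model and the training loop are placeholders.

```python
import torch
import torch.nn as nn
from torch.ao.quantization import (
    QuantStub, DeQuantStub, get_default_qat_qconfig, prepare_qat, convert,
)

class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.quant = QuantStub()      # float -> quantized boundary
        self.fc1 = nn.Linear(16, 32)
        self.relu = nn.ReLU()
        self.fc2 = nn.Linear(32, 4)
        self.dequant = DeQuantStub()  # quantized -> float boundary

    def forward(self, x):
        x = self.quant(x)
        x = self.relu(self.fc1(x))
        x = self.fc2(x)
        return self.dequant(x)

model = TinyNet()
model.qconfig = get_default_qat_qconfig("fbgemm")   # x86 backend; "qnnpack" on ARM
model_prepared = prepare_qat(model.train())         # inserts fake-quant observers

opt = torch.optim.SGD(model_prepared.parameters(), lr=1e-2)
for _ in range(10):                                 # placeholder training loop
    opt.zero_grad()
    out = model_prepared(torch.randn(8, 16))
    out.pow(2).mean().backward()
    opt.step()

model_prepared.eval()
model_int8 = convert(model_prepared)                # swap in real int8 modules
```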
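For the 8-bit optimizer entry (bitsandbytes), a minimal sketch of swapping a standard Adam optimizer for its 8-bit counterpart; the toy model and hyperparameters are placeholders, and a CUDA device is assumed since the 8-bit kernels run on GPU.

```python
import torch
import bitsandbytes as bnb

model = torch.nn.Linear(1024, 1024).cuda()
# Drop-in replacement for torch.optim.Adam; optimizer state is kept in 8 bits
optimizer = bnb.optim.Adam8bit(model.parameters(), lr=1e-4)

x = torch.randn(32, 1024, device="cuda")
loss = model(x).pow(2).mean()
loss.backward()
optimizer.step()
optimizer.zero_grad()
```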