#

low-precision

https://static.github-zh.com/github_avatars/intel?size=40

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2.38 k
12 小时前
https://static.github-zh.com/github_avatars/Tiiiger?size=40

#学习与技能提升#Low Precision Arithmetic Simulation in PyTorch

Python 275
1 年前
https://static.github-zh.com/github_avatars/gudovskiy?size=40

A script to convert floating-point CNN models into generalized low-precision ShiftCNN representation

Python 56
8 年前
https://static.github-zh.com/github_avatars/sefaburakokcu?size=40
Python 37
1 个月前
https://static.github-zh.com/github_avatars/graphcore-research?size=40
Python 16
6 个月前
https://static.github-zh.com/github_avatars/gudovskiy?size=40

Code for DNN feature map compression paper

C++ 11
6 年前
https://static.github-zh.com/github_avatars/KernelTuner?size=40

CUDA/HIP header-only library for writing vectorized and low-precision (16 bit, 8 bit) GPU kernels

C++ 7
14 天前
Website
Wikipedia