#计算机科学#《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
#自然语言处理#Sparsity-aware deep learning inference runtime for CPUs
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Ari...
翻译 - 基于pytorch的模型压缩(1,量化:8/4 / 2bits(dorefa),三进制/二进制值(twn / bnn / xnornet); 2,修剪:常规,常规和组卷积通道修剪; 3,组卷积结构; 4,特征(A)的二进制值的分批归一化折叠)
PaddleSlim is an open-source library for deep model compression and architecture search.
翻译 - PaddleSlim是一个用于深度模型压缩和体系结构搜索的开源库。
OpenMMLab Model Compression Toolbox and Benchmark.
翻译 - OpenMMLab 模型压缩工具箱和基准。
Efficient computing methods developed by Huawei Noah's Ark Lab
A curated list of awesome edge machine learning resources, including research papers, inference engines, challenges, books, meetups and others.
翻译 - 精选的很棒的边缘机器学习资源列表,包括研究论文,推理引擎,挑战,书籍,聚会等。
PyTorch Model Compression
翻译 - PyTorch模型压缩