efficient-model · GitHub Topics

mit-han-lab / temporal-shift-module

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

翻译 - [ICCV 2019] TSM：高效视频理解的时移模块。

acceleration low-latency temporal-modeling video-understanding efficient-model nvidia-jetson-nano tsm

Python 2.11 k

9 个月前

mit-han-lab / once-for-all

[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

tinyml edge-ai efficient-model acceleration nas automl

Python 1.9 k

1 年前

mit-han-lab / proxylessnas

[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

翻译 - [ICLR 2019] ProxylessNAS：直接在目标任务和硬件上进行神经体系结构搜索。

automl specialization hardware-aware acceleration on-device-ai efficient-model

C++ 1.44 k

7 个月前

mit-han-lab / amc

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

automl model-compression channel-pruning efficient-model on-device-ai

Python 440

1 年前

mit-han-lab / haq

[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision

quantization automl efficient-model

Python 381

4 年前

microsoft / nn-Meter

#计算机科学#A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

Python 深度神经网络 latency inference edge-computing edge-ai Tensorflow onnx-models PyTorch 机器学习深度学习 neural-architecture-search efficient-model

Python 348

8 个月前

SqueezeAILab / KVQuant

#自然语言处理#[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

compression efficient-inference efficient-model large-language-models llama 大语言模型 localllama localllm mistral model-compression 自然语言处理 quantization text-generation transformer

Python 340

8 个月前

mit-han-lab / hardware-aware-transformers

#自然语言处理#[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

hardware-aware transformer specialization efficient-model 自然语言处理 machine-translation

Python 331

9 个月前

amirgholami / ZeroQ

[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework

quantization compression efficient-neural-networks efficient-model

Python 276

1 年前

kssteven418 / I-BERT

#自然语言处理#[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

自然语言处理 quantization efficient-model efficient-neural-networks transformer bert model-compression

Python 241

2 年前

mit-han-lab / amc-models

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

efficient-model on-device-ai automl model-compression

Python 166

4 年前

youngwanLEE / VoV3D

Efficient 3D Backbone Network for Temporal Modeling

temporal-modeling efficient-model vovnet video-understanding

Python 108

4 年前

d-li14 / HBONet

[ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions, surpassing MobileNetV2

PyTorch imagenet pretrained-models mobilenetv2 efficient-model iccv2019

Python 102

5 年前

kssteven418 / LTP

#自然语言处理#[KDD'22] Learned Token Pruning for Transformers

自然语言处理 transformer bert pruning model-compression efficient-model efficient-neural-networks

Python 95

2 年前

szq0214 / S2-BNN

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)

self-supervised-learning binary-neural-networks contrastive-learning contrastive-loss efficient-model

Python 64

4 年前

SHI-Labs / Any-Precision-DNNs

Any-Precision Deep Neural Networks (AAAI 2021)

efficient-model

Python 59

5 年前

xvyaward / owq

#大语言模型#Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models".

efficient-model large-language-models 大语言模型 quantization

Python 58

1 年前

mit-han-lab / neurips-micronet

#自然语言处理#[JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion

quantization pruning knowledge-distillation 自然语言处理 language-modeling efficient-model

Jupyter Notebook 40

4 年前

tiangexiang / BiX-NAS

[MICCAI 2021] BiX-NAS: Searching Efficient Bi-directional Architecture for Medical Image Segmentation

neural-architecture-search segmentation semantic-segmentation efficient-model

Python 38

3 年前

lironui / ABCNet

The semantic segmentation of remote sensing images

segmentation semantic-segmentation real-time efficient-model remote-sensing uav

Python 36

2 年前