offloading · GitHub Topics

FMInference / FlexLLMGen

#计算机科学#Running large language models on a single GPU for throughput-oriented scenarios.

深度学习 gpt-3 high-throughput large-language-models 机器学习 offloading opt

Python 9.3 k

6 个月前

dvmazur / mixtral-offloading

#大语言模型#Run Mixtral-8x7B models in Colab or consumer desktops

colab-notebook 深度学习 google-colab language-model 大语言模型 mixture-of-experts offloading PyTorch quantization

Python 2.3 k

1 年前

pytorch / ao

PyTorch native quantization and sparsity for training and inference

brrr dtypes inference mx PyTorch quantization sparsity training float8 transformer offloading optimizer CUDA llama

Python 1.95 k

1 天前

ImanRHT / QECO

A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning (DRL) for Mobile Edge Computing (MEC) | This algorithm captures the dynamics of the MEC environment by integrating ...

deep-reinforcement-learning edge-computing deep-q-network dqn resource-management markov-decision-processes offloading

Python 207

8 天前

Infini-AI-Lab / UMbreLLa

LLM Inference on consumer devices

llm-inference offloading speculative-decoding

Python 105

1 个月前

vipinpv85 / DPDK_SURICATA-4_1_1

dpdk infrastructure for software acceleration. Currently working on RX and ACL pre-filter

dpdk suricata acl offloading

C 91

4 年前

liangyuwang / zo2

ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory

llms offloading sft

Python 84

8 天前

IBM / DPFS

DPU-Powered File System Virtualization over virtio-fs

filesystem 框架 offloading virtualization cloud datacenter Linux storage

Jupyter Notebook 69

1 年前

Mellanox / ovs-tests

A collection of tests for the Open vSwitch HW offload.

ovs offloading

Shell 40

5 个月前

Fatemeh-MA / A-Dynamic-Programming-Offloading-Algorithm-for-Mobile-Cloud-Computing

A Dynamic Programming Offloading Algorithm for Mobile Cloud Computing

dynamic-programming offloading cloud-computing

MATLAB 36

6 年前

MoatLab / LeapIO

LeapIO: Efficient and Portable Virtual NVMe Storage on ARM SoCs (ASPLOS'20)

offloading nvme-over-fabrics

C 28

4 年前

nareddyt / cs4365-task-offload-framework

A framework for IoT devices to offload tasks to the cloud, resulting in efficient computation and decreased cloud costs.

Internet of things cloud offloading 框架

Python 28

3 年前

ubc-cirrus-lab / unfaasener

A lightweight framework that enables serverless users to reduce their bills by harvesting non-serverless compute resources such as their VMs, on-premise servers, or personal computers.

Serverless offloading cloud faas Publish-subscribe pattern

Python 28

8 个月前