Example models using DeepSpeed
DeepSpeed Chat: one-click RLHF training that makes your ChatGPT-like hundred-billion-parameter models up to 15x faster and cheaper to train
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Multi-GPU ChatGLM training with DeepSpeed and …
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support (a DeepSpeedPlugin sketch appears after this list)
Memory optimized finetuning scripts for CogVideoX using TorchAO and DeepSpeed
DeepSpeed Tutorial
LLaMA 2 fine-tuning with DeepSpeed and LoRA
A plug-in for Microsoft DeepSpeed that fixes a bug in DeepSpeed's pipeline parallelism
Teacher-student distillation using DeepSpeed
Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU (XPU) devices. Note that XPU is already supported by stock DeepSpeed.
Testing DeepSpeed integration in 🤗 Accelerate
LLaMA tuning with the Stanford Alpaca dataset using DeepSpeed and Transformers
Alpaca-LoRA implementation for Hugging Face using DeepSpeed and FullyShardedDataParallel
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3 (a minimal ZeRO-2 Trainer sketch appears after this list)
DeepSpeed, LLM, Medical_Dialogue, medical large language models, pre-training, fine-tuning
Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃
Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat
Implementation of an autoregressive language model using an improved Transformer and DeepSpeed pipeline parallelism.
Simple and efficient RevNet library for PyTorch with XLA and DeepSpeed support and parameter offload
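As referenced in the 🤗 Accelerate entry above, here is a minimal sketch of enabling DeepSpeed ZeRO-2 through Accelerate's DeepSpeedPlugin. The toy model, data, learning rate, and ZeRO stage are illustrative assumptions, not taken from any repository listed here; it assumes DeepSpeed is installed and the script is started with `accelerate launch` on one or more GPUs.

```python
# Hedged sketch: DeepSpeed ZeRO-2 via 🤗 Accelerate's DeepSpeedPlugin.
# Launch with: accelerate launch this_script.py   (requires the deepspeed package)
import torch
from torch.utils.data import DataLoader, TensorDataset
from accelerate import Accelerator, DeepSpeedPlugin

# ZeRO stage and accumulation steps are assumptions for illustration.
plugin = DeepSpeedPlugin(zero_stage=2, gradient_accumulation_steps=1)
accelerator = Accelerator(deepspeed_plugin=plugin)

model = torch.nn.Linear(16, 1)                          # stand-in for a real network
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
data = DataLoader(TensorDataset(torch.randn(64, 16), torch.randn(64, 1)), batch_size=8)

# prepare() wraps model/optimizer/dataloader in the DeepSpeed engine.
model, optimizer, data = accelerator.prepare(model, optimizer, data)

for x, y in data:
    loss = torch.nn.functional.mse_loss(model(x), y)
    accelerator.backward(loss)   # lets DeepSpeed handle scaling/partitioning
    optimizer.step()
    optimizer.zero_grad()
```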
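Likewise, for the "Train 🤗transformers with DeepSpeed" entry, a minimal sketch of handing a ZeRO-2 config to the Transformers Trainer. The `gpt2` checkpoint, toy dataset, output directory, and optimizer-offload setting are placeholder assumptions; a real run would use the `deepspeed` launcher and an actual dataset.

```python
# Hedged sketch: ZeRO-2 fine-tuning through the 🤗 Transformers Trainer.
# Launch with: deepspeed this_script.py   (requires the deepspeed package and a GPU)
from torch.utils.data import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer, Trainer, TrainingArguments

# ZeRO-2 config passed as a plain dict; "auto" fields are filled in by the Trainer.
ds_config = {
    "train_micro_batch_size_per_gpu": "auto",
    "gradient_accumulation_steps": "auto",
    "zero_optimization": {
        "stage": 2,
        "offload_optimizer": {"device": "cpu"},  # optional: move optimizer states to CPU
    },
}

tokenizer = AutoTokenizer.from_pretrained("gpt2")       # small placeholder checkpoint
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained("gpt2")

class ToyDataset(Dataset):
    """A handful of tokenized sentences, just enough to exercise the training loop."""
    def __init__(self):
        enc = tokenizer(["DeepSpeed example sentence."] * 16, padding=True, return_tensors="pt")
        self.ids = enc["input_ids"]
    def __len__(self):
        return self.ids.size(0)
    def __getitem__(self, i):
        return {"input_ids": self.ids[i], "labels": self.ids[i]}

args = TrainingArguments(
    output_dir="out",                 # placeholder output directory
    per_device_train_batch_size=4,
    num_train_epochs=1,
    deepspeed=ds_config,              # hands the ZeRO config to the DeepSpeed engine
)

Trainer(model=model, args=args, train_dataset=ToyDataset()).train()
```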