#Computer Science# DeepSpeed Chat: one-click RLHF training that makes your ChatGPT-like hundred-billion-parameter models up to 15x faster and cheaper to train
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃
A simple and efficient RevNet library for PyTorch with XLA and DeepSpeed support and parameter offload
Uses two different methods (DeepSpeed and the SageMaker model parallelism library) to fine-tune a Llama model on SageMaker, then deploys the fine-tuned Llama on SageMaker with server-side batching.
Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library
Experiments with the DeepSpeed MII library
A library that experiments with different DeepSpeed options for inference
Using the DeepSpeed library to prepare a new fine-tuning script
Example models using DeepSpeed
ChatGLM multi-GPU training with DeepSpeed
A plug-in for Microsoft DeepSpeed that fixes a bug in the DeepSpeed pipeline
Llama-2 fine-tuning with DeepSpeed and LoRA
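A hedged sketch of what that combination typically looks like with 🤗 Transformers and PEFT, where the Trainer hands optimizer sharding off to DeepSpeed; the model name, hyperparameters, and config path below are illustrative rather than taken from the repo:

    from peft import LoraConfig, get_peft_model
    from transformers import AutoModelForCausalLM, TrainingArguments

    model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")

    # Attach low-rank adapters to the attention projections only
    lora_cfg = LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05,
                          target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM")
    model = get_peft_model(model, lora_cfg)

    # TrainingArguments accepts a DeepSpeed config as a JSON path or a dict;
    # "ds_config_zero2.json" is a hypothetical ZeRO-2 config file
    args = TrainingArguments(output_dir="llama2-lora", per_device_train_batch_size=4,
                             bf16=True, deepspeed="ds_config_zero2.json")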
Teacher-student distillation using DeepSpeed
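For reference, the distillation objective itself usually looks like the following generic sketch (not this repo's actual training loop): the student is pushed toward the teacher's temperature-softened output distribution.

    import torch.nn.functional as F

    def distillation_loss(student_logits, teacher_logits, temperature=2.0):
        # KL divergence between temperature-softened student and teacher distributions;
        # the T**2 factor keeps gradient magnitudes comparable across temperatures
        log_p_student = F.log_softmax(student_logits / temperature, dim=-1)
        p_teacher = F.softmax(teacher_logits / temperature, dim=-1)
        return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * temperature ** 2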
Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU (XPU) devices. Note that XPU is already supported by stock DeepSpeed.
#Computer Science# MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
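A minimal sketch of serving a model through DeepSpeed-MII's pipeline API (assumes a recent mii release and a CUDA GPU; the model name and prompts are placeholders):

    import mii

    # Load the model with DeepSpeed inference optimizations applied
    pipe = mii.pipeline("mistralai/Mistral-7B-v0.1")

    # Batched generation; max_new_tokens bounds each response
    responses = pipe(["DeepSpeed is", "Seattle is"], max_new_tokens=64)
    print(responses)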
Testing DeepSpeed integration in 🤗 Accelerate
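A minimal sketch of that integration driven programmatically through a DeepSpeedPlugin; the toy model, data, and hyperparameters are placeholders, and the script is assumed to be started with accelerate launch on one or more GPUs:

    import torch
    from accelerate import Accelerator
    from accelerate.utils import DeepSpeedPlugin

    # Ask Accelerate to wrap training in a DeepSpeed ZeRO-2 engine
    plugin = DeepSpeedPlugin(zero_stage=2, gradient_accumulation_steps=1)
    accelerator = Accelerator(deepspeed_plugin=plugin)

    model = torch.nn.Linear(128, 2)
    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
    dataset = torch.utils.data.TensorDataset(torch.randn(64, 128), torch.randint(0, 2, (64,)))
    loader = torch.utils.data.DataLoader(dataset, batch_size=8)

    # DeepSpeed needs a dataloader at prepare() time to size its micro-batches
    model, optimizer, loader = accelerator.prepare(model, optimizer, loader)

    for x, y in loader:
        loss = torch.nn.functional.cross_entropy(model(x), y)
        accelerator.backward(loss)  # routed through the DeepSpeed engine
        optimizer.step()
        optimizer.zero_grad()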
LLaMA tuning with the Stanford Alpaca dataset using DeepSpeed and Transformers
Alpaca-LoRA implementation for Hugging Face using DeepSpeed and FullyShardedDataParallel
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3
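A minimal sketch of such a configuration passed to the 🤗 Trainer as a Python dict; "auto" fields are filled in from TrainingArguments at runtime, and the stage value selects ZeRO-2 vs. ZeRO-3:

    from transformers import TrainingArguments

    ds_config = {
        "train_micro_batch_size_per_gpu": "auto",
        "gradient_accumulation_steps": "auto",
        "bf16": {"enabled": True},
        "zero_optimization": {
            "stage": 3,                # 2 shards optimizer state + gradients, 3 also shards parameters
            "overlap_comm": True,
            "stage3_gather_16bit_weights_on_model_save": True,  # reassemble full weights when saving
        },
    }

    args = TrainingArguments(output_dir="out", per_device_train_batch_size=4,
                             bf16=True, deepspeed=ds_config)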
DeepSpeed, LLM, Medical_Dialogue, medical large language models, pre-training, fine-tuning
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Fine-tuning LLaMA with RLHF (Reinforcement Learning from Human Feedback) based on DeepSpeed Chat
Implementation of an autoregressive language model using an improved Transformer and DeepSpeed pipeline parallelism.
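A minimal sketch of DeepSpeed pipeline parallelism on a toy layer stack (not this repo's model); it assumes the script is started with the deepspeed launcher so torch.distributed can initialize, and the sizes, stage count, and config values are illustrative:

    import torch.nn as nn
    import deepspeed
    from deepspeed.pipe import PipelineModule

    deepspeed.init_distributed()  # rank/world-size come from the deepspeed launcher's env vars

    # A stack of layers that PipelineModule partitions into pipeline stages
    layers = [nn.Linear(512, 512) for _ in range(8)]
    model = PipelineModule(layers=layers, num_stages=2, loss_fn=nn.MSELoss())

    ds_config = {
        "train_micro_batch_size_per_gpu": 4,
        "gradient_accumulation_steps": 2,  # micro-batches kept in flight per pipeline step
        "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
    }

    # Returns a pipeline engine whose train_batch() drives the pipelined forward/backward schedule
    engine, _, _, _ = deepspeed.initialize(model=model,
                                           model_parameters=model.parameters(),
                                           config=ds_config)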