Distribute and run LLMs with a single file.
distributed trainer for LLMs
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
Rill Flow is a high-performance, scalable workflow orchestration engine for distributed workloads and LLMs
Distributed Inference for mlx LLm
Multipack distributed sampler for fast padding-free training of LLMs
Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the training on multiple AWS GPU instances
#大语言模型#Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.
Repository to deploy LLMs with Multi-GPUs in distributed Kubernetes nodes
Distribute and run llamafile/LLMs with a single docker image.
Universal and Transferable Attacks on Aligned Language Models
the LLM vulnerability scanner
#大语言模型#LLM Finetuning with peft
#大语言模型#[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
LLM as a Chatbot Service
#大语言模型#LlamaIndex is a data framework for your LLM applications
A distributed task scheduler for Dask
open source training courses about distributed database and distributed systems
翻译 - PingCAP培训课程
Numbers every LLM developer should know
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
Simple UI for LLM Model Finetuning