Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed
This repo contains the data preparation, tokenization, training and inference code for BLOOMChat. BLOOMChat is a 176 billion parameter multilingual chat model based on BLOOM.
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language M...
BTGenBot: a system to generate behavior trees for robots using lightweight (~7 billion parameters) large language models (LLMs)
#计算机科学#DeepSpeed Chat: 一键式RLHF训练,让你的类ChatGPT千亿大模型提速省钱15倍
The Mistral-7B-v0.1 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters. Mistral-7B-v0.1 outperforms Llama 2 13B on all benchmarks we tested.
📊 Computation and processing of models' parameters
Xaddress - Give 7 billion people an instant physical address
翻译 - Xaddress-为70亿人提供即时的物理地址
Solutions for the game 7 Billion Humans
Solutions to the "7 Billion Humans" challenges
Framework for evaluating ANNS algorithms on billion scale datasets.
Neat URL cleans URLs, removing parameters such as Google Analytics' utm parameters.
翻译 - 整洁的URL会清理URL,并删除诸如Google Analytics(分析)的utm参数之类的参数。
Pretrained language model with 100B parameters
1️⃣🐝🏎️ The One Billion Row Challenge - .NET Edition
Formerly known as code.google.com/p/1-billion-word-language-modeling-benchmark
Zoomable, animated scatterplots in the browser that scales over a billion points
PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset
Parameters-based transformation DSL
翻译 - 基于参数的转换DSL
A collection of all the data i could extract from 1 billion leaked credentials from internet.
翻译 - 我可以从Internet泄漏的10亿份凭据中提取的所有数据的集合。
An application which manages kernel parameters