GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
#大语言模型#Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
翻译 - 一个快速简单的框架,用于构建和运行分布式应用程序。 Ray与RLlib(可扩展的强化学习库)和Tune(可扩展的超参数调整库)打包在一起。
#大语言模型#Gitleaks 是一个开源SAST(静态应用安全测试)命令行工具,用于检测Git 仓库以防止把密码、API 密钥和访问令牌等机密信息硬编码到代码中
#大语言模型#本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
#大语言模型#20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
#大语言模型#Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
#大语言模型#Official inference library for Mistral models
#大语言模型#PowerInfer 是一个快速的、可运行在消费级GPU、个人电脑上的大模型服务
#自然语言处理#OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
翻译 - OpenVINO™工具包存储库
#大语言模型#The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
翻译 - 轻松进行模型服务
#大语言模型#LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
#向量搜索引擎#Superduper: End-to-end framework for building custom AI applications and agents.
#计算机科学#Standardized Serverless ML Inference Platform on Kubernetes
📚A curated list of Awesome LLM/VLM🔥 Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism etc.
#自然语言处理#Sparsity-aware deep learning inference runtime for CPUs
#大语言模型#Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
#大语言模型#Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai
FlashInfer: Kernel Library for LLM Serving
#大语言模型#Code examples and resources for DBRX, a large language model developed by Databricks