GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
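#Large Language Model# This project aims to share the technical principles behind large models along with hands-on experience (LLM engineering and bringing LLM applications into production)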
#Large Language Model# 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
#Large Language Model# Official inference library for Mistral models
#Large Language Model# PowerInfer is a fast LLM serving system that runs on consumer-grade GPUs and personal computers
#Natural Language Processing# OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Translation - OpenVINO™ toolkit repository
#Large Language Model# LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
#Large Language Model# Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
#Large Language Model# Code examples and resources for DBRX, a large language model developed by Databricks
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
#Large Language Model# Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend for Generative Agents/Apps.
#Large Language Model# AICI: Prompts as (Wasm) Programs
#Large Language Model# Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.
#Computer Science# LLMs as Copilots for Theorem Proving in Lean
Arch is an intelligent prompt gateway. Engineered with (fast) LLMs for the secure handling, robust observability, and seamless integration of prompts with your APIs - outside business logic. Built by ...
#Large Language Model# Practical Llama 3 inference in Java
#Computer Science# Efficient AI Inference & Serving
#Large Language Model# The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
#Large Language Model# Create characters in Unity with LLMs!