serving · GitHub Topics

#Large Language Models# Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 37.79 k

3 hours ago

tensorflow / serving

#Computer Science# A flexible, high-performance serving system for machine learning models

machine-learning deep-learning deep-neural-networks Python C++ neural-network serving Tensorflow

C++ 6.29 k

4 days ago

vespa-engine / vespa

#Search# AI + Data, online. https://vespa.ai

vespa search-engine big-data artificial-intelligence serving serving-recommendation machine-learning Server Tensorflow Java C++ vector-search

Java 6.23 k

2 days ago

volcano-sh / volcano

#Computer Science# A Cloud Native Batch System (Project under CNCF)

batch-systems Kubernetes Go hpc bigdata machine-learning gene artificial-intelligence serving training

Go 4.79 k

2 days ago

SeldonIO / seldon-core

#Computer Science# An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models

Kubernetes machine-learning deployment serving mlops aiops machine-learning-operations production-machine-learning

Go 4.56 k

1 day ago

ahkarami / Deep-Learning-in-Production

#Computer Science# In this repository, I will share some useful notes and references about deploying deep learning-based models in production.

deep-learning deep-neural-networks Python PyTorch tesnorflow Keras mxnet caffe2 production serving C++ model-serving tutorial Flask REST API React Angular Tensorflow

4.35 k

8 months ago

pytorch / serve

#Computer Science# TorchServe is a high-performance, flexible, and easy-to-use tool for serving PyTorch models in production environments.

PyTorch machine-learning mlops serving Docker Kubernetes optimization cpu gpu monitoring deep-learning

Java 4.34 k

14 days ago

Lightning-AI / LitServe

#Computer Science# The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.

artificial-intelligence API serving deep-learning developer-tools FastAPI REST API Web

Python 3.34 k

6 days ago

PaddlePaddle / FastDeploy

#Large Language Models# High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle

serving ernie large-language-models inference llm-serving openai vllm ernie-45 ernie-45-vl

C++ 3.34 k

14 hours ago

georgia-tech-db / evadb

#Large Language Models# Database system for AI-powered apps

eva video-analytics serving database labeling object-detection data-analysis artificial-intelligence ChatGPT langchain auto-gpt gpt4all huggingface large-language-models gpt-4 agent Hacktoberfest

Python 2.67 k

1 year ago

skyzh / tiny-llm

#Large Language Models# A course on LLM inference serving on Apple Silicon for systems engineers.

course large-language-models Python qwen qwen2 serving

Python 2.66 k

18 days ago

tobegit3hub / tensorflow_template_application

#Computer Science# TensorFlow template application for deep learning

Tensorflow tfrecords libsvm CSV deep-learning machine-learning mlp cnn lstm inference tensorboard serving

Python 1.88 k

2 years ago

ray-project / llm-applications

#Computer Science# A comprehensive guide to building RAG-based LLM applications for production.

large-language-models machine-learning ray anyscale fine-tuning llama2 openai serving

Jupyter Notebook 1.8 k

1 year ago

Delta-ML / delta

#Front-End Development# DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/

natural-language-processing deep-learning Tensorflow speech sequence-to-sequence seq2seq speech-recognition text-classification speaker-verification nlu text-generation emotion-recognition tensorflow-lite inference asr serving front-end ops

Python 1.59 k

3 months ago

dingodb / dingo

A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ...

serving embedding-store vector-database mysql-compatibility embedding-search key-value-distributed-store vector-ocean unified-sql structured-data unstructured-data

Java 1.56 k

2 days ago

PaddlePaddle / Serving

#Computer Science# A flexible, high-performance carrier for machine learning models (PaddlePaddle serving deployment framework)

rpc-service gpu Python Docker serving pipeline paddle deep-learning prediction predictor dag micro-service

C++ 911

2 months ago

tobegit3hub / simple_tensorflow_serving

#Computer Science# Generic and easy-to-use serving service for machine learning models

Tensorflow serving client HTTP machine-learning deep-learning

JavaScript 757