Fast inference engine for Transformer models
A lightweight, standalone C++ inference engine for Google's Gemma model.
FeatherCNN is a high-performance inference engine for convolutional neural networks.
Inference engine powering open source models on OpenRouter
Prototype type inference engine
Highly optimized inference engine for Binarized Neural Networks
Stock inference engine using Spring XD, Apache Geode / GemFire, and Spark MLlib.
This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server
TensorFlow plugin for Unreal Engine using the C API, focused on inference.
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
Inference code for LLaMA models.
Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT text detection (PyTorch); includes a converter from PyTorch -> ONNX -> TensorRT and inference pipelines (TensorRT, Triton server -...
Speech-to-text interface for Emacs using OpenAI's Whisper model and whisper.cpp as the inference engine.