TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently.
The Triton TensorRT-LLM Backend
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
Mixed-precision inference with TensorRT-LLM
OpenAI-compatible API for the TensorRT-LLM Triton backend
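As a sketch of what "OpenAI-compatible" means for such a frontend: clients send the standard Chat Completions request shape to `/v1/chat/completions`, and the server translates it into Triton/TensorRT-LLM requests. The model name below is a placeholder, not something this repo ships:

```json
{
  "model": "my-trtllm-model",
  "messages": [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize what TensorRT-LLM does."}
  ],
  "max_tokens": 256,
  "temperature": 0.7,
  "stream": false
}
```

Because the request/response schema follows the OpenAI API, existing OpenAI client libraries can point their base URL at the Triton frontend without code changes.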
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream deployment.
TensorRT LLM Benchmark Configuration
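To make "benchmark configuration" concrete, here is a hypothetical sketch of the kinds of parameters such a configuration typically covers. The field names below are illustrative assumptions, not the repo's actual schema:

```yaml
# Illustrative benchmark config (field names are hypothetical)
model: llama-7b            # engine / checkpoint under test (placeholder name)
precision: fp16            # numeric precision of the built engine
batch_sizes: [1, 8, 32]    # batch sizes to sweep
input_len: 128             # prompt length in tokens
output_len: 512            # generated tokens per request
num_requests: 1000         # total requests per run
concurrency: 16            # in-flight requests at once
metrics: [latency_p50, latency_p99, tokens_per_second]
```

Sweeping batch size and sequence lengths like this is the usual way to map out the throughput/latency trade-off of an engine build.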
📖A curated list of Awesome LLM Inference Papers with code. 🎉🎉
ONNX-TensorRT: TensorRT backend for ONNX
TensorFlow/TensorRT integration
TensorRT for Yolov3
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open-source components of TensorRT.
Simple samples for TensorRT programming
YOLOv5 in TensorRT
YOLOv8 TensorRT C++ Implementation
C++ library based on TensorRT integration
TensorRT C++ API Tutorial
TensorRT Plugin Autogen Tool
PyTorch ,ONNX and TensorRT implementation of YOLOv4
Universal and Transferable Attacks on Aligned Language Models