openai-triton · GitHub Topics

#自然语言处理#LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

深度学习 gpt llama 大语言模型 model-serving 自然语言处理 openai-triton

Python 3.12 k

2 天前

chengzeyi / stable-fast

https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

CUDA diffusers PyTorch stable-diffusion openai-triton torch

Python 1.25 k

18 天前

BobMcDear / attorch

#计算机科学#A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

CUDA 深度学习机器学习 PyTorch triton openai openai-triton

Python 529

2 个月前

DeepAuto-AI / hip-attention

Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.

attention attention-mechanism openai-triton triton

Python 128

2 天前

neural-bits / ai-programming-hub

Learn and experiment with new techniques and programming languages with a focus on ML

C++CUDA cython openai-triton Python Rust

Jupyter Notebook 5

7 个月前