GitHub Chinese Community

Search results for "cuda-kernels"

xlite-dev/LeetCUDA
LeetCUDA
@xlite-dev

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

CUDA, cuda-kernels, flash-attention
Cuda · 5.53 k
2 days ago

Related topics

CUDA, gpu, cuda-kernels, opencl, benchmark, sycl, Nvidia, openmp

nvbench
NVIDIA Corporation@NVIDIA

CUDA Kernel Benchmarking Library

benchmark, cuda-kernels, CUDA, performance
Cuda · 682
5 days ago
mirage
@mirage-project

Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA

C++ · 1.57 k
1 day ago
CUDA-Winograd
@xuqiantong

Fast CUDA Kernels for ResNet Inference.

Cuda · 177
6 years ago
KernelBench
@ScalingIntelligence

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems

benchmark, codegen, evaluation, gpu
Python · 475
20 hours ago
cuda-samples
NVIDIA Corporation@NVIDIA

Samples for CUDA developers that demonstrate features of the CUDA Toolkit

CUDA, cuda-kernels, cuda-opengl, cuda-driver-api
C · 7.74 k
2 months ago
CLTune
@CNugteren

CLTune: An automatic OpenCL & CUDA kernel tuner

opencl, CUDA, tuner, auto-tuning
C++ · 180
3 years ago
eigen-cuda
@GPMueller

MWE for using the Eigen library in CUDA kernels

CUDA
CMake · 119
3 years ago
torchsearchsorted
@aliutkus

Pytorch Custom CUDA kernel for searchsorted

Python · 137
2 years ago
cuda-convnet2.torch
Soumith Chintala@soumith

Torch7 bindings for cuda-convnet2 kernels!

Cuda · 40
9 years ago
cuda-opencv-examples
@evlasblom

Using custom CUDA kernels with Open CV Mat objects.

CUDA, cuda-kernels, OpenCV
Cuda · 37
7 years ago
mixbench
@ekondis

A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)

hip, gpu, opencl, CUDA, benchmark
C++ · 405
6 months ago
FlashAttention20
@kyegomez

Get down and dirty with FlashAttention2.0 in pytorch, plug in and play no complex CUDA kernels

Python · 105
2 years ago
CudaPAD
@SunsetQuest

CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.

CUDA, cuda-programming, gpu, Nvidia, ptx
C# · 119
2 years ago
FlashMLA
@deepseek-ai

FlashMLA: Efficient MLA decoding kernels

Cuda · 11.65 k
3 months ago
Kernels
@ParRes

This is a set of simple programs that can be used to explore the features of a parallel platform.

parallel-programming, C, C++, mpi
C · 434
2 months ago
embree
@RenderKit

Embree ray tracing kernels repository.

C++ · 2.53 k
12 days ago
Liger-Kernel
LinkedIn@linkedin

Efficient Triton Kernels for LLM Training

llm-training, triton, finetuning, gemma2, llama
Python · 5.37 k
2 days ago
ThunderKittens
@HazyResearch

Tile primitives for speedy kernels

Cuda · 2.51 k
8 days ago
GPU-Puzzles
@srush

#Computer Science# Solve puzzles. Learn CUDA.

CUDA, machine learning, puzzles
Jupyter Notebook · 11.27 k
10 months ago
apollo-kernel
Apollo Auto@ApolloAuto

Collections of Apollo Kernels

Shell · 407
3 years ago
CUDALibrarySamples
NVIDIA Corporation@NVIDIA

CUDA Library Samples

curand, cusolver, cusparse
Cuda · 2.02 k
7 days ago
rtl88x2bu
@cilynx
Content violates site policy and has been blocked
C · 1.72 k
1 month ago
ZLUDA
@vosen

ZLUDA lets CUDA applications originally designed for NVIDIA GPUs run unmodified on AMD and Intel GPUs

CUDA, Rust
Rust · 12.66 k
1 day ago