cuda-programming · GitHub Topics

Taskflow helps you quickly write parallel and heterogeneous task programs in modern C++

parallel-programming threadpool concurrent-programming high-performance-computing multicore-programming multi-threading taskparallelism multithreading parallel-computing work-stealing gpu-programming heterogeneous-parallel-programming cuda-programming parallel taskflow

C++ 11.04 k

15 days ago

Rust-GPU / Rust-CUDA

Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

CUDA cuda-kernels cuda-programming gpgpu gpu gpu-programming Rust

Rust 4.53 k

3 days ago

NVIDIA / cccl

CUDA Core Compute Libraries

accelerated-computing C++ cpp-programming CUDA cuda-cpp cuda-kernels cuda-library cuda-programming gpu gpu-acceleration gpu-computing gpu-programming hpc Nvidia parallel-algorithm parallel-computing parallel-programming modern-cpp

C++ 1.76 k

14 hours ago

brucefan1983 / CUDA-Programming

Sample codes for my CUDA programming book

cuda-programming gpu-programming Molecular Dynamics

Cuda 1.76 k

5 months ago

mit-han-lab / TinyChatEngine

#Computer Science# TinyChatEngine: On-Device LLM Inference Library

arm C C++ cuda-programming deep-learning edge-computing large-language-models on-device-ai quantization x86-64

C++ 871

1 year ago

coreylowman / cudarc

Safe Rust wrapper around the CUDA toolkit

CUDA cuda-programming gpu gpu-acceleration Rust cublas curand cuda-kernels cudnn nccl

Rust 869

18 days ago

eyalroz / cuda-api-wrappers

Thin, unified, C++-flavored wrappers for the CUDA APIs

API CUDA modern-cpp gpu gpu-computing gpgpu gpgpu-computing cuda-driver-api cuda-programming

C++ 848

1 month ago

harleyszhang / llm_note

#Large Language Models# LLM notes, including model inference, transformer model structure, and LLM framework code analysis notes.

large-language-models llm-inference vllm cuda-programming kv-cache transformer-models

Python 796

2 days ago

sail-sg / Adan

#Computer Science# Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

bert-model convnext deep-learning fairseq optimizer resnet timm vit transformer-xl artificial-intelligence diffusion dreamfusion gpt2 PyTorch cuda-programming llm-training large-language-models moe

Python 795

1 month ago

PaddleJitLab / CUDATutorial

#Computer Science# A self-learning tutorial for CUDA High Performance Programming.

cuda-programming deep-learning

JavaScript 677

15 days ago

nosferalatu / SimpleGPUHashTable

A simple GPU hash table implemented in CUDA using lock-free techniques

CUDA data-structures gpu cuda-programming

Cuda 395

1 year ago

jaredhoberock / stanford-cs193g-sp2010

This is an archive of materials produced for an introductory class on CUDA programming at Stanford University in 2010

CUDA cuda-kernels cuda-programming gpu-programming

C++ 219

3 years ago

HenryNdubuaku / cuda-tutorials

#Computer Science# CUDA tutorials for maths and ML with examples, covering multi-GPU, fused attention, Winograd convolution, and reinforcement learning.

CUDA cuda-kernels cuda-programming machine-learning maths

Cuda 187

1 month ago

MuGdxy / muda

μ-Cuda, COVER THE LAST MILE OF CUDA. Features: IntelliSense-friendly, structured launch, automatic CUDA graph generation and updating.

CUDA cuda-programming cuda-cpp

C++ 181

10 days ago

ROCm / HIP-CPU

An implementation of HIP that works on CPUs, across OSes.

hip hip-runtime hip-portability hip-kernel-language CUDA cuda-programming C++ stl-algorithms spmd

C++ 121

1 year ago

tgautam03 / xGeMM

Accelerated General (FP32) Matrix Multiplication from scratch in CUDA

cuda-programming gpu-programming matrix-multiplication

Cuda 120

6 months ago

SunsetQuest / CudaPAD

CudaPAD is a PTX/SASS viewer for NVIDIA CUDA kernels that provides an on-the-fly view of the assembly.

CUDA cuda-programming gpu Nvidia ptx Windows

C# 119

2 years ago

eyalroz / cuda-kat

#Algorithm Practice# CUDA kernel author's tools

CUDA cuda-kernels utility-library C++ constexpr algorithms patterns modern-cpp gpu-programming gpu cuda-library cuda-programming printf

Cuda 111

3 years ago

emptysoal / cuda-image-preprocess

#Computer Science# Speed up image preprocessing with CUDA when handling images or running TensorRT inference

cnn cuda-programming deep-learning image-processing tensorrt CUDA cuda-kernels

Cuda 72

6 days ago

mikeroyal / CUDA-Guide

#Awesome# CUDA Guide

CUDA gpu deep-learning machine-learning cuda-programming Awesome Lists awesome-readme cuda-kernels cuda-library cuda-opengl gpgpu-computing graphics-programming gpgpu Hackathon-Kit

Cuda 69

2 years ago