Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning
翻译 - 矩阵阴影:C ++ / CUDA中的轻量级CPU / GPU矩阵和Tensor模板库,用于(深度)机器学习
📚Tensor/CUDA Cores, 📖150+ CUDA Kernels, ⚡️⚡️toy-hgemm library with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS 🎉🎉).
CUDA Tensor Transpose (cuTT) library
Matrix Multiply-Accumulate with CUDA and WMMA( Tensor Core)
an example of a CUDA extension for PyTorch using CuPy which computes the Hadamard product of two tensors
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Forth does tensors, in CUDA.
A light-weight tensor operation library for C and CUDA
Computer vision container that includes Jupyter notebooks with built-in code hinting, Anaconda, CUDA 11.8, TensorRT inference accelerator for Tensor cores, CuPy (GPU drop in replacement for Numpy), Py...
C++ tensors with broadcasting and lazy computing
翻译 - 具有广播和惰性计算的C ++张量
Tensors and neural networks in Haskell
CUDA 开发人员使用的示例,演示了 CUDA 工具包中的功能
Simple, safe way to store and distribute tensors
Amazon SageMaker Debugger provides functionality to save tensors during training of machine learning jobs and analyze those tensors
Tensors and differentiable operations (like TensorFlow) in Rust
Library to manipulate tensors on the GPU.
Named tensors with first-class dimensions for PyTorch
A library for doing homomorphic encryption operations on tensors
CUDA Library Samples
pure-Python HistFactory implementation with tensors and autodiff