#Large language models#A high-throughput and memory-efficient inference and serving engine for LLMs
#Computer science#Open deep learning compiler stack for cpu, gpu and specialized accelerators
Moved to https://github.com/dmlc/tvm/
#Computer science#A deep learning package for many-body potential energy representation and molecular dynamics
#Computer science#Large-scale LLM inference engine
stdgpu: Efficient STL-like Data Structures on the GPU
Dockerfiles for the various software layers defined in the ROCm software platform
Abstraction Library for Parallel Kernel Acceleration 🦙
Agenium Scale vectorization library for CPUs and GPUs
Main repository for QMCPACK, an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids with full performance p...
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
HPC solver for nonlinear optimization problems
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
#Computer science#MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimize...