SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
#计算机科学#Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
#计算机科学#Machine Learning Framework for Operating Systems - Brings ML to Linux kernel
翻译 - 操作系统的机器学习框架 - 将机器学习引入 Linux 内核
#计算机科学#Stretching GPU performance for GEMMs and tensor contractions.
#计算机科学#Alchemy Cat —— 🔥Config System for SOTA
ebpf profiler for jvm
#计算机科学#Collective Knowledge crowd-tuning extension to let users crowdsource their experiments (using portable Collective Knowledge workflows) such as performance benchmarking, auto tuning and machine learnin...
#计算机科学#K2vTune (A Workload-aware Configuration Tuning for RocksDB)
A Generic Distributed Auto-Tuning Infrastructure
:bowtie: Backoff uses an exponential backoff algorithm to backoff between retries with optional auto-tuning functionality.
Autotuner for Spark applications
This software package accompanies the paper "A Methodology for Comparing Auto-Tuning Optimization Algorithms" (https://doi.org/10.1016/j.future.2024.05.021), making the guidelines in the methodology e...