My self-study notes, updated for life; currently focused on systems fundamentals and MLSys.
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
Learning Machine Learning, The Chinese Taoist Way
A tree-based federated learning system (MLSys 2023)
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
(MLSys '21) An Acceleration System for Large-scale Unsupervised Heterogeneous Outlier Detection (Anomaly Detection)
Slides, scripts and materials for the Machine Learning in Finance Course at NYU Tandon, 2022
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys...
PyTorch implementation of FedProx (Federated Optimization for Heterogeneous Networks, MLSys 2020).
[MLSys 2022] "BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node Sampling" by Cheng Wan, Youjie Li, Ang Li, Nam Sung Kim, Yingya...