#自然语言处理#🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
[TMLR 2024] Efficient Large Language Models: A Survey
#Awesome#Infrastructures™ for Machine Learning Training/Inference in Production.
#Awesome#Curated collection of papers in machine learning systems
Dive into machine learning system, start from reinventing the wheel.
#面试#Learn how to design and implement effective Machine Learning systems from start to finish.
#大语言模型#The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
#大语言模型#Distributed RL System for LLM Reasoning
Oort: Efficient Federated Learning via Guided Participant Selection
#计算机科学#a curated list of high-quality papers on resource-efficient LLMs 🌱
#计算机科学#Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and other interesting stuffs).
#大语言模型#Course Material for the UG Course COMP4901Y
#计算机科学#Machine Learning Compiler Road Map
Triton implement of bi-directional (non-causal) linear attention
#计算机科学#CSCE 585 - Machine Learning Systems
#计算机科学#A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
#计算机科学#A C++ implementation of the scalar-valued autograd engine micrograd
#Awesome#A curated list of resources to deep dive into the intersection of applied machine learning and threat detection.
[Long Term Support] [SIGCOMM 2023] Lightning: A Reconfigurable Photonic-Electronic SmartNIC for Fast and Energy-Efficient Inference