#Large Language Models# Tools for merging pretrained large language models.
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
#Large Language Models# DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
#Large Language Models# All-in-one UI for merged LLMs in Hugging Face
#Natural Language Processing# Exploring Model Kinship for Merging Large Language Models
#Large Language Models# Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
[ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models
[ECCV 2024] MagMax: Leveraging Model Merging for Seamless Continual Learning (official repository)
#Computer Science# [ICLR 2025] CAMEx: Curvature-Aware Merging of Experts
flow-merge is a Python library for merging multiple transformer-based language models using the most popular merge methods, such as model soups, SLERP, TIES-Merging or DAR...
A model merging project for generalizing Featured Finite State Machines (FFSMs) to unify behaviors across Software Product Lines (SPLs)
#Large Language Models# Merge transformers without using like a bajillion GB of RAM
#Computer Science# An easy-to-use Python library for merging PyTorch models.
SurgeryV2: Bridging the Gap Between Model Merging and Multi-Task Learning with Deep Representation Surgery. arXiv, 2024.
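
Two of the merge methods named in the listing, model soups and SLERP, can be illustrated in a few lines. The following is only a minimal sketch in plain Python, assuming each model is represented as a dictionary of flat lists of floats. The function names `uniform_soup` and `slerp` are hypothetical and are not taken from any of the libraries listed here, which operate on real tensors and checkpoints.

```python
import math


def uniform_soup(state_dicts):
    """Uniform 'model soup': average each parameter across models, key by key.

    Assumes every dict has the same keys and same-length flat lists of floats.
    """
    n = len(state_dicts)
    first = state_dicts[0]
    return {
        key: [sum(sd[key][i] for sd in state_dicts) / n for i in range(len(first[key]))]
        for key in first
    }


def slerp(a, b, t, eps=1e-8):
    """Spherical linear interpolation between two flat weight vectors.

    Falls back to plain linear interpolation when either vector is near zero
    or the two vectors are nearly collinear (the angle is too small to divide by).
    """
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))

    def lerp():
        return [(1.0 - t) * x + t * y for x, y in zip(a, b)]

    if norm_a < eps or norm_b < eps:
        return lerp()

    # Cosine of the angle between the vectors, clamped for numerical safety.
    cos_omega = sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)
    cos_omega = max(-1.0, min(1.0, cos_omega))
    omega = math.acos(cos_omega)
    sin_omega = math.sin(omega)
    if abs(sin_omega) < eps:
        return lerp()

    w_a = math.sin((1.0 - t) * omega) / sin_omega
    w_b = math.sin(t * omega) / sin_omega
    return [w_a * x + w_b * y for x, y in zip(a, b)]


if __name__ == "__main__":
    model_a = {"layer.weight": [1.0, 0.0]}
    model_b = {"layer.weight": [0.0, 1.0]}
    print(uniform_soup([model_a, model_b]))
    print(slerp(model_a["layer.weight"], model_b["layer.weight"], 0.5))
```

Averaging moves the merged weights along the straight line between the models, while SLERP moves along the arc between them, so for two unit-norm vectors the interpolated result also has unit norm. Practical tools typically apply such operations per tensor and add method-specific steps (for example sparsification or sign resolution) that this sketch leaves out.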