Tools for merging pretrained large language models.
-
Updated
May 4, 2025 - Python
Tools for merging pretrained large language models.
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
[ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
Exploring Model Kinship for Merging Large Language Models
Official PyTorch implementation of LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation.
All-in-one UI for merged LLMs in Hugging Face
[ECCV 2024] MagMax: Leveraging Model Merging for Seamless Continual Learning (official repository)
[ICLR 2025] CAMEx: Curvature-Aware Merging of Experts
Flexible library for merging large language models (LLMs) via evolutionary optimization.
flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popular merge methods such as model soups, SLERP, ties-MERGING or DARE.
Merge transformers without using like a bajillion GB of RAM
An easy-to-use Python library for merging PyTorch models.
Add a description, image, and links to the model-merging topic page so that developers can more easily learn about it.
To associate your repository with the model-merging topic, visit your repo's landing page and select "manage topics."