Paper accepted at ACL 2026
π Happy to share that our paper βK-Merge: Online Continual Merging of Adapters for On-device Large Language Modelsβ has been accepted at πππ ππππ (main, oral), which will be held in San Diego, California, this July! βοΈπ
On-device deployment of Large Language Models frequently relies on Low-Rank Adapters (LoRAs) to support diverse downstream tasks under tight resource constraints. As users continuously request support for new tasks, new languages, new problem types, devices face a critical challenge: how to incorporate new adapters without forgetting previously learned capabilities, all within a limited storage budget.
In this work, we:
π― Introduce a new and practical setting for online continual merging of adapters in resource-constrained on-device LLMs under a storage budget.
π§ Propose π-πππ«π π, a lightweight and data-free merging strategy that selects which adapters to merge by leveraging the history of past merges.
βοΈ Employ dynamic weighting to balance old and new capabilities during merging, avoiding catastrophic forgetting while staying within the compute and memory limits of real mobile devices.
π Conduct extensive experiments on real-world tasks representative of mobile device usage, showing consistent superiority over alternative merging strategies under realistic constraints.
Thanks to the amazing co-authors: Ondrej Bohdal, Taha Ceritli, Mete Ozay, Pietro Zanuttigh, and Umberto Michieli
π Preprint: https://arxiv.org/abs/2510.13537
π Project Page: https://donaldssh.github.io/K-Merge/
π» Code: https://github.com/donaldssh/K-Merge