Bridging Domains through Subspace-Aware Model Merging

ByAdmin

Mar 10, 2026

THE AI TODAY

arXiv:2603.05768v2 Announce Type: replace-cross
Abstract: Model merging integrates multiple task-specific models into a single consolidated one. Recent research has made progress in improving merging performance for in-distribution or multi-task scenarios, but domain generalization in model merging remains underexplored. We investigate how merging models fine-tuned on distinct domains affects generalization to unseen domains. Through an analysis of parameter competition in the task matrix using singular value decomposition, we show that merging models trained under different distribution shifts induces stronger conflicts between their subspaces compared to traditional multi-task settings. To mitigate this issue, we propose SCORE (Subspace COnflict-Resolving mErging), a method designed to alleviate such singular subspace conflicts. SCORE finds a shared orthogonal basis by computing the principal components of the concatenated leading singular vectors of all models. It then projects each task matrix into the shared basis, pruning off-diagonal components to remove conflicting singular directions. SCORE consistently outperforms, on average, existing model merging approaches in domain generalization settings across a variety of architectures and model scales, demonstrating its effectiveness and scalability.

By Admin

AI RESEARCH

Bridging Domains through Subspace-Aware Model Merging

ByAdmin

By Admin

Related Post

Bridging Natural Language and Microgrid Dynamics: A Context-Aware Simulator and Dataset

Towards Privacy-Preserving Large Language Model: Text-free Inference Through Alignment and Adaptation

On the Step Length Confounding in LLM Reasoning Data Selection

You missed

High-Precision Estimation of the State-Space Complexity of Shogi via the Monte Carlo Method

Governance and Regulation of Artificial Intelligence in Developing Countries: A Case Study of Nigeria

On the Step Length Confounding in LLM Reasoning Data Selection

Towards Privacy-Preserving Large Language Model: Text-free Inference Through Alignment and Adaptation