Baobao Algorithm Notes
Sep 9, 2024 · Artificial Intelligence
How MoSLoRA Reinvents Low‑Rank Adaptation with Mixer Matrices
This article analyzes the Mixture‑of‑Subspaces in Low‑Rank Adaptation (MoSLoRA) paper, explaining its motivation, design choices that replace LoRA's gate with a mixer matrix, connections to multi‑head attention, experimental findings on LLaMA‑3 fine‑tuning, and theoretical proofs of its re‑parameterization properties.
AILoRAMixture of Experts
0 likes · 12 min read
