Xiaomi MiMo-V2-Flash Open‑Source: Ultra‑Efficient Inference and Agent‑Ready Model
Xiaomi's MiMo-V2-Flash, a 309B MoE model with hybrid attention and Multi‑Token Prediction acceleration, delivers top‑2 global agent benchmark scores, up to 2× faster inference, and only 2.5% of the cost of comparable closed‑source models, while being fully open‑source.
