Can MaIR’s Locality‑Preserving Mamba Boost Image Restoration?

The article presents MaIR, a locality‑ and continuity‑preserving Mamba‑based model for image restoration, detailing its three‑stage architecture, novel scanning strategy, loss functions, experimental results on super‑resolution and denoising, and ablation studies, with links to the arXiv paper and source code.

AI Frontier Lectures
AI Frontier Lectures
AI Frontier Lectures
Can MaIR’s Locality‑Preserving Mamba Boost Image Restoration?

Paper title: MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration . arXiv PDF: https://arxiv.org/pdf/2412.20066. Source code: https://github.com/XLearning-SCU/2025-CVPR-MaIR

Method

Comparison of visual Mamba scanning strategies.

Scanning methods comparison
Scanning methods comparison

Overall architecture: three linear stages – shallow feature extraction (3×3 Conv), deep feature extraction (stacked RMG blocks), and reconstruction layer (super‑resolution: 3×3 Conv + pixel‑shuffle; denoising: 3×3 Conv + residual connection). Each RMG contains several RMB+Conv blocks; RMB is a transformer‑style block whose core is VMM, a dual‑branch Mamba block whose core is MaIRM.

MaIR overall structure
MaIR overall structure

MaIRM module: the NSS (Nested S‑shaped Scanning) flattens 2D features into four 1‑D sequences using a four‑direction nested S‑shaped scanning pattern. The SSO (Self‑Similarity Operator) captures long‑term dependencies within each sequence. The SSA (Sequence‑wise Spatial Aggregation) aggregates the four sequences by pooling, channel shuffle, group convolution, unshuffle, chunked attention, and weighted sum to produce the final output.

MaIRM diagram 1
MaIRM diagram 1
MaIRM diagram 2
MaIRM diagram 2

Loss functions: L1 loss for super‑resolution and Charbonnier loss for denoising.

Loss function illustration
Loss function illustration

Experiments

Quantitative evaluation

Super‑resolution: benchmark PSNR/SSIM improvements (exact numbers omitted).

Super‑resolution quantitative
Super‑resolution quantitative

Denoising: quantitative metrics on standard denoising datasets.

Denoising quantitative
Denoising quantitative

Visual quality

Super‑resolution visual comparison shows sharper edges and more faithful texture reconstruction.

Super‑resolution visual quality
Super‑resolution visual quality

Ablation study

Components examined: NSS, SSA, and bandwidth (feature channel width). Removing any component degrades performance, confirming their contribution.

Ablation results 1
Ablation results 1
Ablation results 2
Ablation results 2
Ablation results 3
Ablation results 3
Ablation results 4
Ablation results 4

Insights

Improved scanning strategies (NSS) and dedicated aggregation modules (SSA) are effective ways to enhance Mamba‑based image restoration models.

Compared with the predecessor MambaIR, MaIR achieves higher PSNR/SSIM on both super‑resolution and denoising tasks.

MaIR vs MambaIR comparison
MaIR vs MambaIR comparison

Reference

@inproceedings{MaIR,
  title={MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration},
  author={Li, Boyun and Zhao, Haiyu and Wang, Wenxin and Hu, Peng and Gou, Yuanbiao and Peng, Xi},
  booktitle={IEEE Conference on Computer Vision and Pattern Recognition},
  year={2025},
  address={Nashville, TN},
  month={jun}
}
computer visionSuper-ResolutionDenoisingImage RestorationMamba
AI Frontier Lectures
Written by

AI Frontier Lectures

Leading AI knowledge platform

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.