ICLR 2026: How LiveMoments Restores Live Photo Cover Frames Without Blur

The paper "LiveMoments: Reselected Key Photo Restoration in Live Photos via Reference‑guided Diffusion" introduces a new task and a diffusion‑based method that uses the original high‑resolution cover frame to dramatically improve the visual quality of reselection cover frames in Live Photos, outperforming existing reference‑super‑resolution and single‑frame approaches.

Machine Heart
Machine Heart
Machine Heart
ICLR 2026: How LiveMoments Restores Live Photo Cover Frames Without Blur

Task Definition

Reselected Key Photo Restoration uses the high‑resolution original cover frame of a Live Photo as a reference to restore a newly selected cover frame, aiming for visual quality comparable to the original cover.

Method Overview (LiveMoments)

LiveMoments adopts a reference‑guided diffusion framework with two branches:

Generative branch : a diffusion model provides a strong generative prior for high‑fidelity texture synthesis.

Motion‑alignment module : aligns the original and reselection frames both in latent space (motion‑guided attention) and in pixel space (block‑matching) to handle temporal offset and motion misalignment.

An attention‑based feature‑fusion mechanism injects fine details from the original cover into the diffusion process, preserving structural consistency while enhancing sharpness.

LiveMoments architecture diagram
LiveMoments architecture diagram

Baseline Design for Detail Transfer

The diffusion model extracts fine‑grained features; a reference‑guided attention fusion incorporates texture from the original cover, enabling stable restoration of the reselection frame.

Motion Alignment Design

Because the original and reselection frames often exhibit noticeable time shift and motion displacement, the motion‑alignment module operates in two stages:

In latent space, motion‑guided attention enhances consistency of fused features.

In pixel space, block‑matching retrieves the correct reference region, reducing artifacts caused by motion.

Dataset and Evaluation

Three datasets were built:

Two real‑world Live Photo collections captured with vivo X200 Pro and iPhone 15 Pro.

One synthetic dataset.

Evaluation metrics were adapted from no‑reference image‑quality measures to suit the reselection task.

Quantitative results table
Quantitative results table

Experimental Results

LiveMoments achieved the highest scores on both real datasets, surpassing state‑of‑the‑art reference‑super‑resolution (RefSR) and single‑frame super‑resolution methods. Visual comparisons show clearer edges and fewer artifacts.

Qualitative comparison
Qualitative comparison

Conclusion

By exploiting the naturally available high‑resolution original cover as a reference, the dual‑branch diffusion model with unified motion alignment reliably transfers structure and texture to reselection frames, even in complex dynamic scenes. The work introduces a new reference‑guided image‑restoration paradigm for computational photography on mobile devices.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

diffusion modelImage RestorationLive PhotoMobile ImagingICLR 2026Reference-guided
Machine Heart
Written by

Machine Heart

Professional AI media and industry service platform

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.