LiveMoments: Reference‑Guided Diffusion Boosts Live Photo Cover Frame Quality (ICLR 2026)

LiveMoments, the first method dedicated to restoring the quality of reselected cover frames in Live Photos, leverages the original high‑resolution cover as a reference and a motion‑alignment module within a diffusion model to correct motion misalignment and achieve high‑definition results, as demonstrated on two real datasets and a synthetic benchmark.

vivo Internet Technology
vivo Internet Technology
vivo Internet Technology
LiveMoments: Reference‑Guided Diffusion Boosts Live Photo Cover Frame Quality (ICLR 2026)

Background

Live Photo combines a high‑resolution still cover and a short video. Users often reselect a frame from the video as a new cover, but the selected frames suffer from low resolution and heavy compression, resulting in poor visual quality.

Live Photo cover selection example
Live Photo cover selection example

New Task Definition

The authors define “Reselected Key Photo Restoration” – using the original high‑quality cover as a reference to guide the enhancement of the low‑quality reselected frame. This differs from traditional reference‑based super‑resolution, which relies on external databases, and from video‑wide super‑resolution, which processes entire sequences.

Task comparison illustration
Task comparison illustration

Technical Approach

LiveMoments builds on a diffusion model and introduces a reference‑guided restoration framework. An attention‑based feature fusion injects details from the original cover into the diffusion process, preserving structure while improving clarity.

To address motion misalignment between the original and reselected frames, a motion‑alignment module operates both in latent space (motion‑guided attention) and pixel space (block‑matching retrieval), aligning the reference correctly before fusion.

Experiments

The team constructed two real‑world Live Photo datasets captured with vivo X200 Pro and iPhone 15 Pro, plus a synthetic dataset. Evaluation metrics were adapted for the task. Table 1 compares LiveMoments with existing reference‑SR and single‑frame SR methods, showing superior scores on both datasets.

Quantitative comparison table
Quantitative comparison table

Qualitative results (Figure 4) demonstrate that LiveMoments restores fine details and achieves “cover‑level” sharpness, outperforming competing methods.

Visual comparison of restored frames
Visual comparison of restored frames

Conclusion and Outlook

By exploiting the native high‑resolution cover as a reference and integrating a dual‑branch diffusion architecture with motion alignment, LiveMoments reliably restores reselected frames even in complex dynamic scenes. The work opens avenues for higher‑quality cover selection in everyday photography, content creation, and further research on reference‑guided diffusion.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

image restorationLive PhotoICLR 2026motion alignmentreference-guided diffusion
vivo Internet Technology
Written by

vivo Internet Technology

Sharing practical vivo Internet technology insights and salon events, plus the latest industry news and hot conferences.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.