Tagged articles
36 articles
Page 1 of 1
AIWalker
AIWalker
Apr 6, 2026 · Artificial Intelligence

BIPNet: Adaptive Progressive Upsampling Drives a Leap in Burst Image Restoration (TPAMI 2025)

The TPAMI 2025 paper introduces BIPNet, a unified burst‑image framework that tackles alignment, fusion, and upsampling challenges with edge‑enhanced alignment, pseudo‑burst feature fusion, and adaptive group upsampling, achieving state‑of‑the‑art results across super‑resolution, low‑light enhancement, and denoising while offering lightweight mobile variants.

BIPNetBurst Image ProcessingComputer Vision
0 likes · 13 min read
BIPNet: Adaptive Progressive Upsampling Drives a Leap in Burst Image Restoration (TPAMI 2025)
vivo Internet Technology
vivo Internet Technology
Mar 18, 2026 · Artificial Intelligence

How Ada-RefSR Eliminates Hallucinations in Single‑Step Diffusion Super‑Resolution

This article presents Ada-RefSR, a novel single‑step diffusion‑based reference super‑resolution framework that introduces a "Trust but Verify" paradigm, adaptive implicit correlation gating, and lightweight architecture to robustly suppress hallucinations and achieve state‑of‑the‑art performance on multiple benchmarks, while being suitable for mobile deployment.

Ada-RefSRICLR2026Image Restoration
0 likes · 10 min read
How Ada-RefSR Eliminates Hallucinations in Single‑Step Diffusion Super‑Resolution
Kuaishou Tech
Kuaishou Tech
Jul 7, 2025 · Artificial Intelligence

8 Kuaishou Papers Spotlighted at ICML 2025: Multimodal AI, Causal Inference and More

Kuaishou has had eight cutting‑edge papers accepted at the International Conference on Machine Learning 2025, covering breakthroughs in multimodal emotion modeling, monotonic probability learning, causal effect generalization, cascade ranking, multimodal LLM alignment, ultra‑low‑rate image compression, and visual autoregressive super‑resolution, with links to each work and accompanying code repositories.

Multimodalaicausal inference
0 likes · 13 min read
8 Kuaishou Papers Spotlighted at ICML 2025: Multimodal AI, Causal Inference and More
AI Frontier Lectures
AI Frontier Lectures
Jun 7, 2025 · Artificial Intelligence

Can MaIR’s Locality‑Preserving Mamba Boost Image Restoration?

The article presents MaIR, a locality‑ and continuity‑preserving Mamba‑based model for image restoration, detailing its three‑stage architecture, novel scanning strategy, loss functions, experimental results on super‑resolution and denoising, and ablation studies, with links to the arXiv paper and source code.

Computer VisionDenoisingImage Restoration
0 likes · 5 min read
Can MaIR’s Locality‑Preserving Mamba Boost Image Restoration?
AI Frontier Lectures
AI Frontier Lectures
Jun 3, 2025 · Artificial Intelligence

How MaIR Advances Image Restoration with a Locality‑Preserving Mamba Architecture

The article presents MaIR, a Mamba‑based image restoration model that preserves locality and continuity, detailing its architecture, scanning strategies, loss functions, experimental results on super‑resolution and denoising, and an ablation study, while providing links to the arXiv paper and GitHub source code.

Computer VisionDenoisingImage Restoration
0 likes · 5 min read
How MaIR Advances Image Restoration with a Locality‑Preserving Mamba Architecture
AI Frontier Lectures
AI Frontier Lectures
Mar 24, 2025 · Artificial Intelligence

How MambaIRv2 Boosts Image Restoration with Attentive State‑Space Design

Introducing MambaIRv2, an image restoration backbone that replaces Mamba’s causal scanning with an attentive state‑space module, achieving single‑direction scanning, reduced parameters and computation, and superior performance on lightweight and classic super‑resolution, JPEG artifact removal, and denoising tasks, as validated by CVPR‑2025 results.

Computer VisionImage RestorationMambaIRv2
0 likes · 8 min read
How MambaIRv2 Boosts Image Restoration with Attentive State‑Space Design
AIWalker
AIWalker
Feb 6, 2025 · Artificial Intelligence

FluxSR: The First 12B‑Parameter Single‑Step Diffusion Model for Real‑World Super‑Resolution

FluxSR introduces a novel single‑step diffusion approach for real‑world image super‑resolution built on the 12‑billion‑parameter FLUX.1‑dev model, employing Flow‑Trajectory Distillation, TV‑LPIPS and attention‑diversity losses to achieve high fidelity, reduced artifacts, and lower memory and compute costs.

Flow DistillationImage Restorationdiffusion
0 likes · 16 min read
FluxSR: The First 12B‑Parameter Single‑Step Diffusion Model for Real‑World Super‑Resolution
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jun 17, 2024 · Artificial Intelligence

Xiaohongshu Audio-Video Architecture Team Wins Top Awards in CVPR NTIRE 2024 Challenges

Xiaohongshu’s audio‑video architecture team secured second place in the RAIM challenge and first in the S‑UGC VQA challenge at CVPR NTIRE 2024 by improving generative image restoration with SUPIR, DeSRA and a Fusion model, and enhancing video quality assessment using LIQE, Q‑Align and FAST‑VQA, then deploying these methods for live‑stream denoising, intelligent transcoding and cloud‑based super‑resolution, achieving high PLCC/SROCC scores and up to 33 % bandwidth savings.

CVPR NTIRE 2024Deep LearningDenoising
0 likes · 25 min read
Xiaohongshu Audio-Video Architecture Team Wins Top Awards in CVPR NTIRE 2024 Challenges
Bilibili Tech
Bilibili Tech
Mar 1, 2024 · Artificial Intelligence

Bilibili's Self-Developed Video Super-Resolution Algorithm: Background, Optimization Directions, and Implementation Details

Bilibili’s self‑supervised video super‑resolution system upgrades low‑resolution streams to 4K by using three parallel degradation‑branch networks—texture‑enhancing, line‑recovering, and noise‑removing—tailored to anime, game, and real‑world content, delivering sharper edges, finer textures, and measurable quality gains across its online playback pipeline.

BilibiliDeep LearningModel architecture
0 likes · 16 min read
Bilibili's Self-Developed Video Super-Resolution Algorithm: Background, Optimization Directions, and Implementation Details
DataFunSummit
DataFunSummit
Feb 8, 2024 · Artificial Intelligence

Tencent Music Tianqin Lab's Applications and Practices of Audio Quality AIGC

The article details Tencent Music's Tianqin Lab research on audio quality AIGC, covering background upgrades, music separation techniques, real‑time super‑resolution, the industry‑first premium master‑track technology, and a Q&A on vocal separation, illustrating practical AI-driven audio enhancements.

AIGCAudio AIMastering
0 likes · 7 min read
Tencent Music Tianqin Lab's Applications and Practices of Audio Quality AIGC
DataFunSummit
DataFunSummit
Jan 1, 2024 · Artificial Intelligence

Advances in Image and Video Enhancement, Quality Assessment, and Multimodal AI Techniques

This article reviews the latest research from Alibaba DAMO Academy on real-world image quality problems, covering spatial, temporal, and color enhancement methods, advanced quality assessment metrics, multimodal diffusion models, and future directions toward large‑model integration and lightweight deployment.

Deep LearningMOS regressionMultimodal AI
0 likes · 24 min read
Advances in Image and Video Enhancement, Quality Assessment, and Multimodal AI Techniques
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Aug 17, 2023 · Artificial Intelligence

Human Visual Perception Based Edge‑Cloud Super‑Resolution Framework and RedVQA Quality Assessment at XiaoHongShu

At LiveVideoStackCon 2023, XiaoHongShu unveiled an edge‑cloud super‑resolution framework guided by the human‑perception‑aligned RedVQA model, which jointly optimizes video quality and bandwidth by generating a dedicated SR bitrate tier in the cloud and applying a lightweight SR algorithm on the client, achieving notable QoE gains and narrow‑band HD delivery.

aibandwidth optimizationedge-cloud
0 likes · 25 min read
Human Visual Perception Based Edge‑Cloud Super‑Resolution Framework and RedVQA Quality Assessment at XiaoHongShu
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jul 26, 2023 · Industry Insights

Human‑Perception‑Based End‑Cloud Super‑Resolution: Cutting Bandwidth, Boosting Quality

The LiveVideoStackCon 2023 session revealed how a human‑perception‑driven end‑cloud super‑resolution framework, AI‑based no‑reference video quality assessment, and rigorous AB‑testing methods can dramatically reduce video bandwidth while enhancing visual quality, illustrating the broader challenges and opportunities in modern audio‑video systems.

AB testingAI assessmentaudio-video industry
0 likes · 13 min read
Human‑Perception‑Based End‑Cloud Super‑Resolution: Cutting Bandwidth, Boosting Quality
Bilibili Tech
Bilibili Tech
Jun 2, 2023 · Artificial Intelligence

AI‑Driven Video Quality Enhancement and Low‑Bitrate High‑Resolution Techniques at Bilibili

Bilibili’s Cloud Multimedia team uses AI‑driven pipelines to cut bandwidth costs while delivering low‑bitrate, high‑quality video, employing a QoE‑based decision engine, real‑time 4K super‑resolution for game streams, low‑rank reconstruction for narrow‑band HD, data‑driven HDR LUTs, and explores diffusion‑based restoration for legacy content.

BilibiliReal-time Streamingai
0 likes · 27 min read
AI‑Driven Video Quality Enhancement and Low‑Bitrate High‑Resolution Techniques at Bilibili
Youku Technology
Youku Technology
Jun 1, 2022 · Artificial Intelligence

AI-Powered Restoration of Classic Animation Videos Using Deep Learning

Youku’s Digital Media Lab built an AI‑powered restoration pipeline that uses multi‑frame super‑resolution, denoising, deblocking, deblurring and GAN‑based detail generation to automatically revive classic animated films, removing noise and artifacts while preserving fine lines, enabling high‑definition viewing for modern audiences.

Denoisingaianimation
0 likes · 8 min read
AI-Powered Restoration of Classic Animation Videos Using Deep Learning
DaTaobao Tech
DaTaobao Tech
May 5, 2022 · Artificial Intelligence

Two-Stage Video Restoration Framework for NTIRE 2022 Video Enhancement Challenge

The TaoMC2 framework, a two‑stage pipeline that augments BasicVSR++ with peak‑quality frames and deep residual blocks in Stage I and refines results with a SwinIR transformer in Stage II, leverages progressive training and transfer learning to boost PSNR to 33.16 dB and secured two championship titles and a runner‑up in the NTIRE 2022 video enhancement challenge.

NTIRE2022compression artifact removalsuper-resolution
0 likes · 26 min read
Two-Stage Video Restoration Framework for NTIRE 2022 Video Enhancement Challenge
High Availability Architecture
High Availability Architecture
Apr 22, 2022 · Artificial Intelligence

BIGO RTC: High‑Quality, Low‑Cost Real‑Time Communication through Core Operators and Scene Adaptation

The article explains how BIGO RTC achieves high‑quality, low‑cost real‑time audio‑video communication by optimizing core video operators such as HEVC encoding, AI‑driven super‑resolution and HDR, and by employing scene‑adaptive techniques like device performance tuning, content‑adaptive encoding and AI‑based pre‑processing to meet diverse latency constraints.

AI AdaptationHDRHigh Quality
0 likes · 9 min read
BIGO RTC: High‑Quality, Low‑Cost Real‑Time Communication through Core Operators and Scene Adaptation
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Apr 11, 2022 · Artificial Intelligence

How AI Is Revolutionizing Ultra‑HD Cloud Video Transcoding

This article explores the rapid growth of ultra‑HD video, the challenges it creates for cloud transcoding, and how Huawei Cloud leverages AI techniques such as super‑resolution, frame interpolation, denoising, restoration, and SDR‑to‑HDR conversion to deliver immersive, high‑quality viewing experiences across diverse devices.

AI video processingHDR conversioncloud transcoding
0 likes · 24 min read
How AI Is Revolutionizing Ultra‑HD Cloud Video Transcoding
Kuaishou Tech
Kuaishou Tech
Mar 1, 2022 · Artificial Intelligence

Quality Sound and Vision: Enhancing Video and Audio Experience with AI

The article details the development of '质臻影音' technology by Kuaishou's audio-visual team, which uses AI algorithms to restore classic films and enhance video quality through techniques like super-resolution, noise reduction, and adaptive processing strategies.

AI in video enhancementKuaishou technologyadaptive processing
0 likes · 9 min read
Quality Sound and Vision: Enhancing Video and Audio Experience with AI
Code DAO
Code DAO
Dec 23, 2021 · Artificial Intelligence

Permutation‑Invariant PIUnet Boosts Multi‑Temporal Satellite Image Super‑Resolution

The article explains how satellite images suffer from limited spatial resolution, why the ordering of multi‑temporal frames is irrelevant, and how the PIUnet model introduces permutation‑invariant equivariant layers to achieve state‑of‑the‑art super‑resolution efficiently, winning the AI4EO challenge.

Deep LearningPIUnetSatellite Imagery
0 likes · 6 min read
Permutation‑Invariant PIUnet Boosts Multi‑Temporal Satellite Image Super‑Resolution
iQIYI Technical Product Team
iQIYI Technical Product Team
Oct 15, 2021 · Artificial Intelligence

iQiyi's ZoomAI Wins 2021 CCF Science and Technology Award for Video Restoration and Enhancement

iQiyi's ZoomAI AI-based video restoration and enhancement system won the 2021 CCF Science and Technology Outstanding Progress Award, marking its fifth consecutive CCF honor; the system uses proprietary AI algorithms for super‑resolution, frame interpolation, scratch repair, colorization, etc., restoring classic films and improving viewer experience.

AI video restorationArtificial IntelligenceZoomAI
0 likes · 5 min read
iQiyi's ZoomAI Wins 2021 CCF Science and Technology Award for Video Restoration and Enhancement
DataFunTalk
DataFunTalk
May 22, 2021 · Artificial Intelligence

Baidu's Video Foundation Technology Architecture and Key AI Techniques

This article presents an overview of Baidu's video foundation technology architecture, covering the video R&D platform, core AI techniques for video understanding, editing, surveillance, and general vision, and detailing innovations such as Attention‑Cluster networks, cross‑modality attention with graph convolution, GANs, super‑resolution, and adaptive encoding.

Adaptive EncodingAttention MechanismGAN
0 likes · 14 min read
Baidu's Video Foundation Technology Architecture and Key AI Techniques
DataFunTalk
DataFunTalk
Feb 12, 2021 · Artificial Intelligence

PlugNet: A Plug‑in Super‑Resolution Unit for Low‑Quality Text Recognition in Natural Scene OCR

This article introduces ImageDT's PlugNet, which combines deep‑learning OCR and super‑resolution techniques to improve low‑quality text recognition in natural scenes, detailing the company's background, OCR challenges, deep‑learning approaches, super‑resolution methods, the PlugNet architecture, experimental results, and future research directions.

Low-Quality TextOCRPlugNet
0 likes · 16 min read
PlugNet: A Plug‑in Super‑Resolution Unit for Low‑Quality Text Recognition in Natural Scene OCR
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
Jan 5, 2021 · Artificial Intelligence

How AI-Powered Super-Resolution is Transforming Real-Time Video Communication

AI-driven super-resolution, once limited to academic research, is now tackling real-time video communication challenges by evolving from early interpolation methods to deep learning models, addressing issues of model size, generalization, and real-world degradation, while lightweight networks and encoding-aware techniques promise practical deployment.

Real-time Videoaiimage enhancement
0 likes · 12 min read
How AI-Powered Super-Resolution is Transforming Real-Time Video Communication
Baidu App Technology
Baidu App Technology
Sep 7, 2020 · Artificial Intelligence

Real-Time Mobile Super-Resolution Reconstruction in Baidu App

The article describes Baidu App's real-time mobile super-resolution using a VDSR-based model with pruning and depthwise separable convolutions, optimized via application-layer and inference engine techniques to halve latency and memory, enabling on-device high‑def image/video enhancement, reducing server load, and supporting iOS/Android integration.

Mobile AIReal-time Processingimage enhancement
0 likes · 8 min read
Real-Time Mobile Super-Resolution Reconstruction in Baidu App
Programmer DD
Programmer DD
Jun 6, 2020 · Artificial Intelligence

How to Revive Century-Old Footage with AI: DAIN, ESRGAN, and DeOldify

This guide shows how to restore and enhance century‑old black‑and‑white Beijing footage using three open‑source AI tools—DAIN for frame interpolation, ESRGAN for super‑resolution, and DeOldify for colorization—complete with setup steps, code snippets, and usage instructions.

AI video restorationDAINDeOldify
0 likes · 10 min read
How to Revive Century-Old Footage with AI: DAIN, ESRGAN, and DeOldify
iQIYI Technical Product Team
iQIYI Technical Product Team
Apr 19, 2019 · Artificial Intelligence

iQIYI ZoomAI Video Enhancement Technology: Applications and Technical Details

Jiang Zidong explains iQIYI's ZoomAI video enhancement tech, covering super‑resolution, denoising/sharpening, color correction, scratch removal, and frame interpolation, and its modular deployment across business lines for restoring classic content and boosting low‑resolution media, achieving massive efficiency gains.

DenoisingZoomAIai
0 likes · 29 min read
iQIYI ZoomAI Video Enhancement Technology: Applications and Technical Details
Youku Technology
Youku Technology
Apr 11, 2019 · Artificial Intelligence

YOUKU-VSRE 2019 Video Enhancement and Super-Resolution Challenge Announcement

The YOUKU‑VSRE 2019 challenge invites researchers to develop state‑of‑the‑art video enhancement and super‑resolution models using the largest, most diverse simulated‑noise dataset, with three competition stages (preliminary, semi‑final, final), cash prizes up to ¥100,000, certificates, and fast‑track recruitment opportunities at Alibaba (Youku).

AI challengeComputer VisionDataset
0 likes · 3 min read
YOUKU-VSRE 2019 Video Enhancement and Super-Resolution Challenge Announcement
360 Quality & Efficiency
360 Quality & Efficiency
Dec 28, 2018 · Artificial Intelligence

SRGAN-Based Image Super-Resolution and MNIST Training Tutorial

This tutorial outlines a curriculum covering open‑source examples for enhancing image resolution using SRGAN, explains GAN‑based super‑resolution concepts, details network architectures and perceptual loss, and provides a simple MNIST training walkthrough with code links and resources.

GANMNISTSRGAN
0 likes · 7 min read
SRGAN-Based Image Super-Resolution and MNIST Training Tutorial
Youku Technology
Youku Technology
Oct 31, 2018 · Artificial Intelligence

Technical Overview of Youku's Video Face Swapping System

Youku’s new video face‑swapping service lets users replace a celebrity’s face with a single uploaded photo by employing a 3D generative model, deep‑learning segmentation, multi‑scale super‑resolution, and trajectory smoothing to achieve fast, near‑photorealistic results across varied angles, expressions, and lighting, though it still lacks personalized models and struggles with extreme side views or heavy occlusions.

3D modelingVideo processingai
0 likes · 10 min read
Technical Overview of Youku's Video Face Swapping System
iQIYI Technical Product Team
iQIYI Technical Product Team
Sep 7, 2018 · Artificial Intelligence

How ZoomAI Uses AI to Super‑Resolve and Enhance Low‑Quality Videos

ZoomAI is an AI‑driven video enhancement platform that combines modular super‑resolution, denoising, sharpening, color correction, and scratch removal techniques, offering both cloud and mobile SDK solutions for restoring old footage, improving streaming content, and boosting visual quality across devices.

AI video enhancementDenoisingOpenGL
0 likes · 9 min read
How ZoomAI Uses AI to Super‑Resolve and Enhance Low‑Quality Videos