Tagged articles
75 articles
Page 1 of 1
Open Source Tech Hub
Open Source Tech Hub
Apr 9, 2026 · Backend Development

Build a PHP‑Powered AI Video Assistant with Webman, Neuron AI & FFmpeg

This guide shows PHP developers how to create a smart video‑processing agent by combining the high‑performance Webman framework, the Neuron AI agent library supporting multiple LLMs, and FFmpeg tools, covering stack selection, core implementation steps, sample code for tools, controller integration, and visual demos of video info extraction, screenshot and transcoding.

LLMVideo processingWebman
0 likes · 9 min read
Build a PHP‑Powered AI Video Assistant with Webman, Neuron AI & FFmpeg
AI Explorer
AI Explorer
Mar 8, 2026 · Artificial Intelligence

AutoClip: One‑Click AI Video Highlight Extraction and Editing

AutoClip is an open‑source, locally‑run tool that uses Alibaba's Qwen large language model and OpenAI Whisper to automatically download, transcribe, analyze, and cut high‑light segments from YouTube or Bilibili videos, offering real‑time task monitoring, smart collections, preview, Docker deployment, and a roadmap of future AI‑driven features.

AI video editingDockerFastAPI
0 likes · 7 min read
AutoClip: One‑Click AI Video Highlight Extraction and Editing
ByteDance Data Platform
ByteDance Data Platform
Dec 23, 2025 · Artificial Intelligence

How Daft and Ray Supercharge Million‑Hour Video Processing for AI‑Powered Robotics

This article details a scalable, distributed pipeline that uses LAS AI Data Lake, Daft on Ray, and advanced video‑processing techniques—scene detection, splitting, frame sampling, filtering, and caption generation—to transform tens of millions of hours of robot‑captured video into high‑quality, searchable semantic data while dramatically boosting CPU and GPU utilization.

AI PipelineDaftRay
0 likes · 21 min read
How Daft and Ray Supercharge Million‑Hour Video Processing for AI‑Powered Robotics
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Nov 4, 2025 · Artificial Intelligence

How Baidu’s Baige Accelerates Multimodal Video Training with Context Parallelism

Baidu Baige’s enhanced veRL framework dramatically boosts video frame rates and resolution limits, cuts training time, reduces memory usage, and improves model accuracy by leveraging context parallelism and optimized attention on Ampere GPUs for multimodal mixed‑training scenarios.

AI accelerationContext ParallelismMultimodal Training
0 likes · 6 min read
How Baidu’s Baige Accelerates Multimodal Video Training with Context Parallelism
Baidu Geek Talk
Baidu Geek Talk
Aug 20, 2025 · Mobile Development

How Mobile Video Players Boost Visual Quality with Real‑Time Brightness and Color Enhancement

This article explains the engineering of mobile video post‑processing techniques—brightness and color enhancement using GPU shaders, linear gain, YUV scaling, gamma correction, adaptive saturation, HSV adjustments, and skin‑tone protection—to improve clarity, contrast, and naturalness while maintaining real‑time performance.

GPU shaderMobileVideo processing
0 likes · 14 min read
How Mobile Video Players Boost Visual Quality with Real‑Time Brightness and Color Enhancement
Python Programming Learning Circle
Python Programming Learning Circle
Jul 8, 2025 · Artificial Intelligence

Create a Dancing Word Cloud from Bilibili Videos with Python – Full Step‑by‑Step Guide

This tutorial walks you through building a Python project that downloads a Bilibili video, extracts its frames, applies Baidu AI human segmentation, scrapes danmu comments, generates a stylized word‑cloud animation, and finally composes a video with background music, showcasing video processing, AI, and data visualization techniques.

AI segmentationBilibiliOpenCV
0 likes · 11 min read
Create a Dancing Word Cloud from Bilibili Videos with Python – Full Step‑by‑Step Guide
DaTaobao Tech
DaTaobao Tech
Mar 5, 2025 · Artificial Intelligence

Multimodal Large‑Model Cover Generation AI Agent for Taobao Video and Live Streams

Taobao’s new multimodal AI Agent automatically creates high‑quality static and dynamic video covers by planning tasks, consulting a memory of quality criteria, executing frame selection with ReKV streaming and dual‑stage evaluation, generating marketing copy via fine‑tuned Qwen2.5‑7B, and refining layout, resulting in significantly higher click‑through rates, lower latency, and reduced manual effort.

AIVideo processingcover generation
0 likes · 17 min read
Multimodal Large‑Model Cover Generation AI Agent for Taobao Video and Live Streams
DeWu Technology
DeWu Technology
Jan 22, 2025 · Operations

How We Cut Video Detection Memory Usage by 78% with WebAssembly and WorkerFS

This article details the challenges of video corruption detection on a creator platform, analyzes existing server‑side and client‑side approaches, and presents a WebAssembly‑based solution using ffmpeg, WorkerFS, and memory‑growth tuning that reduces memory consumption by up to 78% while speeding up large‑file processing.

Memory OptimizationVideo processingWeb Worker
0 likes · 13 min read
How We Cut Video Detection Memory Usage by 78% with WebAssembly and WorkerFS
Kuaishou Audio & Video Technology
Kuaishou Audio & Video Technology
Jan 21, 2025 · Fundamentals

How Kuaishou Enables Full‑Chain Dolby Vision Support for UGC

Kuaishou partners with Dolby Labs to bring full‑chain Dolby Vision to its short‑video platform, detailing the technology behind HDR, dynamic metadata, and a brightness‑adjustment solution that ensures seamless playback and optimal visual experience for user‑generated content across devices.

Dolby VisionDynamic MetadataExtended Brightness
0 likes · 10 min read
How Kuaishou Enables Full‑Chain Dolby Vision Support for UGC
Kuaishou Tech
Kuaishou Tech
Jan 17, 2025 · Artificial Intelligence

Kuaishou Achieves 7 Papers Accepted at AAAI 2025

Kuaishou has achieved a significant milestone with 7 papers accepted at AAAI 2025, covering diverse AI research areas including video processing, recommendation systems, and image restoration, demonstrating the company's strong research capabilities in artificial intelligence.

AAAI 2025Image RestorationKuaishou
0 likes · 10 min read
Kuaishou Achieves 7 Papers Accepted at AAAI 2025
Python Programming Learning Circle
Python Programming Learning Circle
Sep 11, 2024 · Artificial Intelligence

Python Tutorial: Download Bilibili Video, Extract Frames, Perform Human Segmentation with Baidu AI, Generate Word Cloud, and Compose Final Video

This article demonstrates how to use Python to download a Bilibili video, extract frames with OpenCV, perform human segmentation via Baidu AI, generate a word‑cloud animation using MoviePy, and finally compose the processed clips into a complete video with added audio.

AI segmentationOpenCVVideo processing
0 likes · 13 min read
Python Tutorial: Download Bilibili Video, Extract Frames, Perform Human Segmentation with Baidu AI, Generate Word Cloud, and Compose Final Video
JD Retail Technology
JD Retail Technology
Sep 3, 2024 · Backend Development

Design and Architecture of a New Video Review System with Streamlined Frame Extraction and Parallel Processing

This article presents the design goals, architecture, technology selection, and component details of a unified video review system that leverages FFmpeg for frame extraction, stream‑based parallel processing, and flexible synchronous/asynchronous workflows to achieve low latency and high scalability.

StreamingSystem ArchitectureVideo processing
0 likes · 10 min read
Design and Architecture of a New Video Review System with Streamlined Frame Extraction and Parallel Processing
Bilibili Tech
Bilibili Tech
Apr 26, 2024 · Artificial Intelligence

2024 Bilibili Technology Patent Awards – Highlights of Ten Winning Innovations

On World Intellectual Property Day, Bilibili honored ten breakthrough patents that together enable billion‑scale video duplicate detection, AI‑driven story generation, synchronized live rhythm‑games, automatic OTT casting, knowledge‑graph‑based content moderation, glitch‑free multi‑audio streaming, modular playback integration, neural‑network resolution encoding, AV1 reference‑frame pruning, and fine‑grained GPU isolation.

StreamingVideo processingartificial intelligence
0 likes · 6 min read
2024 Bilibili Technology Patent Awards – Highlights of Ten Winning Innovations
Bilibili Tech
Bilibili Tech
Apr 16, 2024 · Frontend Development

Design and Implementation of a High‑Performance Matroska Demuxer for Web Uploads

The new mkv-demuxer SDK replaces the slow FFmpeg-Wasm solution on Bilibili’s upload page by reading Matroska files in slice-sized ArrayBuffers, parsing EBML headers and SeekHead indexes, and exposing getMeta, getData, and seekFrame APIs, cutting memory use by 98 % and parsing time by 97 % while accelerating cover-generation and recommendation processing.

DemuxerMatroskaVideo processing
0 likes · 17 min read
Design and Implementation of a High‑Performance Matroska Demuxer for Web Uploads
360 Smart Cloud
360 Smart Cloud
Apr 3, 2024 · Backend Development

Understanding FFmpeg Hardware Acceleration Architecture and Implementation

FFmpeg provides a comprehensive, cross‑platform hardware acceleration framework that abstracts diverse GPU and dedicated video codec interfaces, defines HWContext types, device and frame contexts, and various codec configuration methods, enabling efficient video encoding, decoding, and filtering while addressing performance, compatibility, and pipeline complexity challenges.

GPUHardware accelerationMultimedia
0 likes · 10 min read
Understanding FFmpeg Hardware Acceleration Architecture and Implementation
Open Source Tech Hub
Open Source Tech Hub
Jan 18, 2024 · Backend Development

Install and Use FFmpeg with PHP‑FFMpeg on Ubuntu

This guide explains what FFmpeg is, shows how to install it on Ubuntu 18.04, demonstrates integrating the Webman framework and PHP‑FFMpeg library, and provides step‑by‑step code examples for extracting images, adding watermarks, and basic video editing.

ComposerPHPUbuntu
0 likes · 6 min read
Install and Use FFmpeg with PHP‑FFMpeg on Ubuntu
Bilibili Tech
Bilibili Tech
Jan 12, 2024 · Frontend Development

Understanding WebCodecs: Design Goals, Core API, Demos, and Application Scenarios

WebCodecs, introduced in Chrome 94, provides direct, low‑latency access to hardware‑accelerated audio and video codecs, enabling fine‑grained encoding and decoding control, composable streaming pipelines, and high‑performance demos such as controllable decoding, watermarking, chroma‑key, and client‑side video processing, while still lacking container support and broad browser compatibility.

Audio Video APIsBrowser MediaVideo processing
0 likes · 15 min read
Understanding WebCodecs: Design Goals, Core API, Demos, and Application Scenarios
Bilibili Tech
Bilibili Tech
Dec 22, 2023 · Artificial Intelligence

Intelligent Media Technology and Innovative Applications: Information-Theoretic Principles for Transcoding System Optimization

The upcoming Shanghai Jiao‑Tong University seminar on Intelligent Media Technology will feature Bilibili’s Cai Chunlei presenting an information‑theoretic framework for jointly optimizing video transcoding pipelines, linking traditional coding, deep‑learning methods and future large‑model techniques to improve compression and guide practical system design.

AISeminarVideo processing
0 likes · 4 min read
Intelligent Media Technology and Innovative Applications: Information-Theoretic Principles for Transcoding System Optimization
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Dec 21, 2023 · Artificial Intelligence

Video and Image Technologies in NetEase Cloud Music: Architecture, Algorithms, and Applications

The article examines NetEase Cloud Music’s video and image technology stack—covering a four‑module architecture, algorithms for content understanding, intelligent production, moderation, and interactive effects—and explains how these systems enhance user experience, streamline backend processing, and position the platform for future AIGC‑driven innovations.

AI AlgorithmsMultimodal LearningVideo processing
0 likes · 11 min read
Video and Image Technologies in NetEase Cloud Music: Architecture, Algorithms, and Applications
AntTech
AntTech
Aug 24, 2023 · Artificial Intelligence

CoDeF: A Canonical Content Field Approach for Consistent Video Processing

The CoDeF algorithm introduced by Ant Group's Interactive Intelligence Lab transforms video processing into image processing using a canonical content field and a temporal deformation field, enabling seamless video style transfer, keypoint tracking, and interactive editing while preserving temporal consistency.

Video processingcanonical content fieldtemporal deformation
0 likes · 5 min read
CoDeF: A Canonical Content Field Approach for Consistent Video Processing
IT Services Circle
IT Services Circle
Mar 3, 2023 · Backend Development

FFmpeg 6.0 “Von Neumann” Released with New Encoders, Decoders, Filters, and ABI Versioning

FFmpeg 6.0 “Von Neumann” has been officially released, introducing numerous new encoders, decoders, and filters, adding ABI versioning to major releases, deprecating old APIs, and enhancing CLI performance with threading, statistics options, and file‑based filter options, while outlining upcoming features for version 6.1.

CLIDecodersEncoders
0 likes · 6 min read
FFmpeg 6.0 “Von Neumann” Released with New Encoders, Decoders, Filters, and ABI Versioning
Programmer DD
Programmer DD
Mar 3, 2023 · Backend Development

FFmpeg 6.0 Highlights: New Codecs, Filters, and Performance Boosts

FFmpeg 6.0 "Von Neumann" introduces a host of new codecs, decoders, filters, CLI enhancements, ABI versioning, and a more frequent release cadence, offering developers expanded multimedia processing capabilities and improved performance across platforms.

Backend DevelopmentMultimediaSoftware Release
0 likes · 6 min read
FFmpeg 6.0 Highlights: New Codecs, Filters, and Performance Boosts
Bilibili Tech
Bilibili Tech
Feb 24, 2023 · Artificial Intelligence

Understanding Video Super-Resolution: Principles, Common Defects, and Practical Enhancement Techniques

Video super‑resolution, pioneered by deep‑learning models such as SRCNN, can synthesize plausible high‑frequency detail but often introduces artifacts like loss of stylistic noise, inconsistent line depth, texture smearing, and temporal flicker, which can be mitigated through preprocessing (BM3D denoising, descaling), targeted post‑processing (Gaussian blur, unsharp masking) and selective edge‑based texture merging to preserve original artistic style while enhancing perceived sharpness.

BM3DCUGANFourier Transform
0 likes · 13 min read
Understanding Video Super-Resolution: Principles, Common Defects, and Practical Enhancement Techniques
Baidu Geek Talk
Baidu Geek Talk
Oct 12, 2022 · Backend Development

Understanding Video Color Spaces, Gamma Correction, and Transcoding with FFmpeg

Video processing involves converting linear sensor data through gamma correction and multiple color‑space transformations—such as RGB, YUV, and XYZ—using standards like BT.601/709/2020, with FFmpeg’s colorspace filter and ffprobe to manage transfer functions, primaries, and ranges during transcoding to preserve accurate colors across devices.

Color ManagementVideo processingcolor space
0 likes · 12 min read
Understanding Video Color Spaces, Gamma Correction, and Transcoding with FFmpeg
Shopee Tech Team
Shopee Tech Team
Aug 12, 2022 · Backend Development

Shopee Video Technology: Backend Services, High‑Definition Low‑Bitrate Optimization, and Performance Enhancements

Shopee’s video platform combines live‑stream and on‑demand transcoding, link‑mic, multi‑party mixing, and backend editing services with a proprietary high‑definition low‑bitrate pipeline that leverages GPU and CPU encoders, AI‑enhanced pre‑processing, hierarchical B‑frames, and SIMD‑optimized sharpening to deliver high‑quality video on low‑end devices while cutting compute costs, and the company is actively recruiting engineers for further development.

AI enhancementPerformance OptimizationVideo processing
0 likes · 19 min read
Shopee Video Technology: Backend Services, High‑Definition Low‑Bitrate Optimization, and Performance Enhancements
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jul 23, 2022 · Mobile Development

Xiaohongshu Deploys On‑Device Super‑Resolution with Huawei HMS Core for High‑Quality Short Videos

Xiaohongshu, partnering with Huawei HMS Core, now runs on‑device super‑resolution for short videos, instantly upscaling 540p to 1080p and enhancing 720p content using GPU/NPU via HiAI, cutting bandwidth and stutter while keeping power use low across hundreds of Huawei devices.

AI accelerationAndroid NDKHuawei HMS Core
0 likes · 9 min read
Xiaohongshu Deploys On‑Device Super‑Resolution with Huawei HMS Core for High‑Quality Short Videos
Youku Technology
Youku Technology
Jun 9, 2022 · Mobile Development

Design and Architecture of the Cross-Platform Multimedia Rendering Engine OPR

The OPR engine provides a cross‑platform, GPU‑accelerated rendering framework that unifies audio‑video pre‑ and post‑processing, native UI‑driven danmaku rendering, and real‑time visual effects such as human‑body recognition, using a modular command‑stream architecture, C++ core, monitoring tools, and extensibility for future Vulkan, VR, and plugin integration.

GPUNative UIReal-Time
0 likes · 15 min read
Design and Architecture of the Cross-Platform Multimedia Rendering Engine OPR
Bilibili Tech
Bilibili Tech
Apr 26, 2022 · Artificial Intelligence

2022 Bilibili Technology Patent Selection Awards

The 2022 Bilibili Technology Patent Selection Awards honored ten innovative projects across Best Popularity and Most Popular categories, showcasing advances such as advanced bullet comments, optimized gift animation, video rendering, virtual avatar production, virtual material editing, mini‑program integration, AI‑driven live‑stream switching, blur‑face enhancement, and ghost video tools.

AI enhancementBilibiliMini Program
0 likes · 8 min read
2022 Bilibili Technology Patent Selection Awards
Python Crawling & Data Mining
Python Crawling & Data Mining
Feb 22, 2022 · Artificial Intelligence

Create a Dancing Word‑Cloud Video with Python and AI

This tutorial walks through downloading a dance video, extracting frames, using Baidu AI for person segmentation, generating word‑cloud masks, and stitching the results into a dancing word‑cloud video with Python, OpenCV and the WordCloud library.

Baidu AIComputer VisionOpenCV
0 likes · 8 min read
Create a Dancing Word‑Cloud Video with Python and AI
Python Programming Learning Circle
Python Programming Learning Circle
Feb 14, 2022 · Fundamentals

How to Convert Video to GIF Using Python and MoviePy

This tutorial explains how to install the MoviePy library, write Python code to load a video file, and generate a GIF while controlling size through resolution scaling, frame rate reduction, sub‑clipping, and output dimensions, all with clear code examples and visual results.

GIFTutorialVideo processing
0 likes · 4 min read
How to Convert Video to GIF Using Python and MoviePy
Bilibili Tech
Bilibili Tech
Jan 28, 2022 · Artificial Intelligence

Real-CUGAN: An Open‑Source AI Super‑Resolution Model for Anime Video Upscaling

Real‑CUGAN is an open‑source AI super‑resolution model that upscales anime video up to 4× using a million‑patch, frequency‑domain‑supervised dataset, delivering faster inference than Real‑ESRGAN, seamless Waifu2x compatibility, and superior texture, line and artifact handling, with code released on GitHub.

AI super-resolutionDeep LearningImage Restoration
0 likes · 8 min read
Real-CUGAN: An Open‑Source AI Super‑Resolution Model for Anime Video Upscaling
Kuaishou Tech
Kuaishou Tech
Jan 20, 2022 · Artificial Intelligence

Understanding Kuaishou's KFRUC Algorithm: A Technical Deep Dive into Video Frame Interpolation

This article provides a comprehensive technical analysis of Kuaishou's self-developed KFRUC video frame interpolation algorithm, detailing its motion estimation, occlusion localization, and motion compensation mechanisms to enhance playback smoothness and visual quality in slow-motion and high-frame-rate video applications.

KFRUC AlgorithmMEMCSlow Motion Technology
0 likes · 8 min read
Understanding Kuaishou's KFRUC Algorithm: A Technical Deep Dive into Video Frame Interpolation
Bitu Technology
Bitu Technology
Jan 7, 2022 · Backend Development

Design and Implementation of Tubi Multimedia Processing Platform (TMPP)

The article details Tubi's Multimedia Processing Platform (TMPP), describing its architecture, processing stages, resource management, and distributed task scheduling for large‑scale video transcoding and delivery across multiple devices.

Distributed SystemsResource ManagementVideo processing
0 likes · 8 min read
Design and Implementation of Tubi Multimedia Processing Platform (TMPP)
Python Programming Learning Circle
Python Programming Learning Circle
Jan 4, 2022 · Artificial Intelligence

Python Project: Download Bilibili Video, Extract Frames, Perform Human Segmentation, Generate Word Cloud, and Compose Final Video

This tutorial walks through a complete Python workflow that downloads a B‑site video, extracts frames with OpenCV, uses Baidu AI for human segmentation, crawls danmu comments, creates a masked word‑cloud animation, and finally merges the clips with audio into a polished video.

OpenCVVideo processingmoviepy
0 likes · 12 min read
Python Project: Download Bilibili Video, Extract Frames, Perform Human Segmentation, Generate Word Cloud, and Compose Final Video
Douyu Streaming
Douyu Streaming
Dec 1, 2021 · Mobile Development

How to Get, Build, and Extend WebRTC m79 Source for Windows, Android, and iOS

This guide explains how to obtain the WebRTC m79 source, compile it for Windows, Android, and iOS, walk through the basic signaling and peer‑connection workflow, and implement advanced video‑capture and audio‑volume features with custom C++ extensions, while unifying the codebase across platforms.

Audio ProcessingCCompilation
0 likes · 19 min read
How to Get, Build, and Extend WebRTC m79 Source for Windows, Android, and iOS
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
Nov 16, 2021 · Mobile Development

Integrating Faceunity Beauty SDK with NERtc on Android and iOS

This guide explains the core concepts, integration steps, and troubleshooting tips for using the Faceunity (相芯) Beauty SDK with NetEase NERtc on Android and iOS, covering OpenGL ES basics, EGL/EAGL interfaces, three rendering schemes, resource management, and platform‑specific setup.

AndroidMobile DevelopmentNERtc
0 likes · 13 min read
Integrating Faceunity Beauty SDK with NERtc on Android and iOS
High Availability Architecture
High Availability Architecture
Oct 21, 2021 · Cloud Computing

Optimizing NetEase Cloud Music Audio/Video Processing Platform with Serverless

This article describes how NetEase Cloud Music leveraged Serverless function computing to redesign its audio/video algorithm processing platform, covering the existing challenges, the selection criteria for Serverless solutions, the implementation details, performance gains, cost savings, and future directions.

Audio ProcessingCloud FunctionsNetEase
0 likes · 11 min read
Optimizing NetEase Cloud Music Audio/Video Processing Platform with Serverless
Taobao Frontend Technology
Taobao Frontend Technology
Aug 10, 2021 · Frontend Development

Optimizing Video Thumbnail Selection: Canvas vs FFmpeg WebAssembly

This article examines how Taobao's front‑end team built a custom video frame‑capture tool, compares video+canvas with FFmpeg‑WebAssembly approaches, presents testing results, implementation details, and future optimizations to improve thumbnail selection efficiency and user experience.

CanvasVideo processingWebAssembly
0 likes · 5 min read
Optimizing Video Thumbnail Selection: Canvas vs FFmpeg WebAssembly
MaGe Linux Operations
MaGe Linux Operations
Jul 18, 2021 · Fundamentals

Turn a Bilibili Dance Clip into an ASCII‑Art Video with Python

Learn how to download a Bilibili dance video, extract GIF frames, convert them to ASCII art, rename and order the frames, transform them into images, and finally stitch them into a music‑backed video using Python tools such as you‑get, OpenCV, and moviepy.

ASCII artVideo processingmoviepy
0 likes · 9 min read
Turn a Bilibili Dance Clip into an ASCII‑Art Video with Python
Tencent Cloud Developer
Tencent Cloud Developer
Jun 22, 2021 · Cloud Computing

Let's Dive Into Serverless World: Tencent Cloud's Serverless Development and Latest Trends

Tencent Cloud’s serverless platform, now serving over a million developers and billions of daily invocations, accelerates business and education workloads, enables massive elastic scaling, integrates video, GPU, and event‑bus services, and simplifies migration, debugging, and SaaS integration, heralding serverless as the next mainstream cloud paradigm.

Cloud NativeDeveloper ExperienceEvent-Driven Architecture
0 likes · 17 min read
Let's Dive Into Serverless World: Tencent Cloud's Serverless Development and Latest Trends
Volcano Engine Developer Services
Volcano Engine Developer Services
Jun 16, 2021 · Backend Development

How ByteDance’s Video Processing Platform Achieves Billion‑Scale High Availability

This article explains how ByteDance’s Volcano Engine video platform handles the entire video lifecycle—from client‑side capture to cloud processing, delivery, and playback—by employing a multi‑plane architecture, scalable workflow system, function compute platform, and the dynamic BMF framework to meet massive scale, ensure high availability, improve user experience, and reduce costs.

Function ComputeVideo processinghigh availability
0 likes · 19 min read
How ByteDance’s Video Processing Platform Achieves Billion‑Scale High Availability
Kuaishou Tech
Kuaishou Tech
May 7, 2021 · Artificial Intelligence

Kuaishou–Tsinghua Joint Research Institute Showcases AI and Video Technology Collaboration at the Software Discipline Development Forum

The Kuaishou–Tsinghua Future Media Data Joint Research Institute co‑hosted the 2021 Software Discipline Development Forum, highlighting extensive AI‑driven video analysis, computer‑vision, multimodal learning, and recommendation‑system research, as well as talent cultivation and innovative VR livestream experiences for the university’s 110th anniversary celebrations.

AIIndustry-Academia CollaborationVR Live Streaming
0 likes · 7 min read
Kuaishou–Tsinghua Joint Research Institute Showcases AI and Video Technology Collaboration at the Software Discipline Development Forum
HomeTech
HomeTech
Apr 21, 2021 · Artificial Intelligence

AI-Powered Masked Danmaku: Design and Implementation

This article details the design and practical implementation of an AI-driven masked danmaku system that prevents comment overlay on video content, covering background, technology selection, instance segmentation methods, distributed task scheduling, mask generation, client rendering, performance optimizations, and future directions.

AIDistributed SystemsMask Danmaku
0 likes · 18 min read
AI-Powered Masked Danmaku: Design and Implementation
Baidu Geek Talk
Baidu Geek Talk
Mar 17, 2021 · Artificial Intelligence

Overview of Baidu's Wànxiàng System for Large‑Scale Rich Media Processing

Baidu’s Wànxiàng system processes billions of images and videos daily by extracting low‑ and high‑level features, linking related media, and aggregating semantic attributes in a scalable, timely architecture that leverages thousands of CPU, GPU, and FPGA cores to power accurate, low‑latency rich‑media search and recommendation.

BaiduImage AnalysisRich Media
0 likes · 14 min read
Overview of Baidu's Wànxiàng System for Large‑Scale Rich Media Processing
360 Tech Engineering
360 Tech Engineering
Feb 23, 2021 · Artificial Intelligence

Video Stutter Detection via Frame Difference Analysis Using FFmpeg

This article explains a method for detecting video stutter by converting uploaded videos into frame sequences with ffmpeg, calculating pixel differences between consecutive frames, aggregating motion metrics, removing scene‑change effects, computing a dynamic factor, and outputting a binary result indicating the presence or absence of stutter.

Computer VisionVideo processingalgorithm
0 likes · 5 min read
Video Stutter Detection via Frame Difference Analysis Using FFmpeg
iQIYI Technical Product Team
iQIYI Technical Product Team
Feb 5, 2021 · Artificial Intelligence

Efficient General‑Purpose Frame Extraction for AI Video Inference Services

The paper presents a unified, high‑performance frame‑extraction framework that dynamically selects CPU or GPU decoding, leverages multithreaded and CUDA‑accelerated pipelines, keeps frames in memory, and achieves up to ten‑fold latency reductions for diverse AI video‑inference tasks.

AI video inferenceCPU optimizationGPU Acceleration
0 likes · 14 min read
Efficient General‑Purpose Frame Extraction for AI Video Inference Services
Python Crawling & Data Mining
Python Crawling & Data Mining
Dec 22, 2020 · Artificial Intelligence

Create Stunning Video Ghosting Effects with PaddlePaddle’s DeepLabV3p Model

Learn how to generate cinematic ghosting effects in videos by leveraging PaddlePaddle’s PaddleHub deep learning library and the pretrained deeplabv3p_xception65 model for semantic segmentation, with step‑by‑step code, environment setup, and practical testing on classic martial‑arts footage.

Deep LearningGhost EffectPaddlePaddle
0 likes · 7 min read
Create Stunning Video Ghosting Effects with PaddlePaddle’s DeepLabV3p Model
Youku Technology
Youku Technology
May 7, 2020 · Industry Insights

How Alibaba’s FrameShare Pushes Ultra‑HD Video to the Next Level

This article explains the FrameShare ultra‑HD solution, detailing its four core capabilities—high frame‑rate, ultra‑high resolution, HDR rendering, and surround sound—along with the end‑to‑end video pipeline, key technologies such as frame interpolation, HDR tone‑mapping, cloud‑edge collaboration, and the future vision for nationwide ultra‑HD adoption.

HDRHigh Frame RateVideo processing
0 likes · 14 min read
How Alibaba’s FrameShare Pushes Ultra‑HD Video to the Next Level
Meituan Technology Team
Meituan Technology Team
Sep 12, 2019 · Mobile Development

How Meituan Engineered a Scalable Mobile Video Platform: Architecture and Lessons

This article details Meituan's end‑to‑end development of a merchant‑side mobile video feature, covering background needs, architecture design, technology selection, implementation of playback, recording, composition, cutting, processing pipelines, encountered pitfalls, monitoring strategies, and future optimization directions.

AndroidMediaCodecVideo processing
0 likes · 24 min read
How Meituan Engineered a Scalable Mobile Video Platform: Architecture and Lessons
Meitu Technology
Meitu Technology
Jun 12, 2019 · Cloud Computing

Meitu's Cloud-Based Image Beautification and Large-Scale Video Processing Architecture

Meitu replaced on-device beautification and video processing with a cloud-native architecture that routes requests by region, uses a dedicated upload SDK for detailed monitoring, employs edge-computing, a configuration-driven plug-in framework and Kubernetes-based elastic scaling, enabling fast, reliable, globally-distributed image and video services.

Edge ComputingMeituVideo processing
0 likes · 12 min read
Meitu's Cloud-Based Image Beautification and Large-Scale Video Processing Architecture
58 Tech
58 Tech
Apr 16, 2019 · Mobile Development

Design and Architecture of the 58 Short Video SDK for Mobile Applications

The article outlines the technical challenges of short‑video apps and presents the modular, extensible architecture of the 58 Short Video SDK, detailing its layered design, design principles, advantages, and future evolution to support advanced features such as AR, hardware decoding, and h265 encoding.

MultimediaVideo processingshort video
0 likes · 12 min read
Design and Architecture of the 58 Short Video SDK for Mobile Applications
iQIYI Technical Product Team
iQIYI Technical Product Team
Nov 16, 2018 · Artificial Intelligence

iQIYI AI Bullet‑Screen Masking: Semantic Segmentation System and Engineering Insights

iQIYI’s bullet‑screen masking employs a DeepLabv3+‑based two‑class semantic segmentation pipeline, preceded by a close‑up detector and followed by morphological refinement, trained on a custom annotated dataset that raises IoU to 93.6 %, processes hour‑long videos in under an hour, and is slated for future upgrades to instance and panoptic segmentation for finer‑grained masking.

AIVideo processingbullet screen masking
0 likes · 10 min read
iQIYI AI Bullet‑Screen Masking: Semantic Segmentation System and Engineering Insights
Youku Technology
Youku Technology
Oct 31, 2018 · Artificial Intelligence

Technical Overview of Youku's Video Face Swapping System

Youku’s new video face‑swapping service lets users replace a celebrity’s face with a single uploaded photo by employing a 3D generative model, deep‑learning segmentation, multi‑scale super‑resolution, and trajectory smoothing to achieve fast, near‑photorealistic results across varied angles, expressions, and lighting, though it still lacks personalized models and struggles with extreme side views or heavy occlusions.

3D ModelingAIVideo processing
0 likes · 10 min read
Technical Overview of Youku's Video Face Swapping System
Youku Technology
Youku Technology
Oct 29, 2018 · Artificial Intelligence

Improving Online Video Experience: Youku’s End‑to‑End Video Quality Enhancement Techniques

Youku enhances online video by applying intelligent post‑production contrast mapping, device‑specific HDR tone‑mapping, high‑frame‑rate restoration through frame‑rate conversion, and ROI‑aware encoding that allocates bitrate to key visual areas, complemented by audio processing, to deliver cinema‑grade quality across diverse screens.

HDRROI encodingStreaming
0 likes · 9 min read
Improving Online Video Experience: Youku’s End‑to‑End Video Quality Enhancement Techniques
360 Quality & Efficiency
360 Quality & Efficiency
Apr 25, 2018 · Fundamentals

Introduction to FFmpeg: Libraries, Tools, and Basic Command Usage

This article introduces FFmpeg, outlines its eight core libraries, describes the main command‑line tools (ffmpeg, ffplay, ffprobe), and provides a step‑by‑step example of converting an MP4 video to HEVC with MP3 audio on Windows, including useful help commands and additional features.

MultimediaVideo processingffmpeg
0 likes · 5 min read
Introduction to FFmpeg: Libraries, Tools, and Basic Command Usage
Qizhuo Club
Qizhuo Club
Mar 13, 2018 · Mobile Development

Mastering Android MediaCodec: From Basics to Advanced Video Processing

This article explores Android’s MediaCodec API, detailing its role in hardware video encoding/decoding, buffer management, data types, lifecycle states, and practical code examples, providing developers with a comprehensive guide to implementing advanced video processing features such as watermarking and transcoding on mobile devices.

AndroidHardware DecodingMediaCodec
0 likes · 10 min read
Mastering Android MediaCodec: From Basics to Advanced Video Processing