Tagged articles
43 articles
Machine Learning Algorithms & Natural Language Processing
Apr 1, 2026 · Artificial Intelligence

World Models Ending Pixel Reconstruction: 14‑Paper JEPA Roadmap

The article reviews Yann LeCun's world‑model research program, detailing how the JEPA family of models abandons pixel‑level reconstruction in favor of abstract feature prediction across images, video, audio, 3D data, and action planning, and summarises the empirical gains reported in fourteen key papers.

3D · JEPA · Vision
0 likes · 18 min read

IT Services Circle
Sep 29, 2025 · Artificial Intelligence

How Memvid Stores AI Knowledge in MP4 Videos with 10× Less Space

Memvid replaces traditional vector databases by encoding text chunks as QR codes inside MP4 video frames, achieving up to ten‑fold storage reduction, millisecond‑level semantic search, zero‑infrastructure deployment, and a built‑in conversational interface, while providing a fast‑install Python SDK and CLI.

AI · Memory · Memvid
0 likes · 9 min read

FunTester
Mar 27, 2025 · Backend Development

Curated List of Development Tutorials and Video Resources

This page compiles a comprehensive collection of tutorial links and video resources covering Chrome extension development, Java performance testing, interface testing, Groovy scripting, various utility videos, and the Arthas diagnostic tool, providing developers with organized references for learning and practice.

Arthas · Groovy · Java performance
0 likes · 5 min read

58UXD
Dec 26, 2024 · Artificial Intelligence

How to Turn B2B Project Summaries into Engaging AI‑Powered Videos

Learn a step‑by‑step workflow for converting complex B‑side design project summaries into concise, engaging videos using AI tools for outlining, script writing, voice synthesis, and editing, while avoiding common PPT pitfalls and maximizing audience interest.

AI · Design · NotionAI
0 likes · 12 min read

Goodme Frontend Team
Nov 18, 2024 · Frontend Development

Add Rotation and Scaling to Video Previews with React and Vime

This article explains how to implement video rotation, fullscreen handling, and proportional scaling in a React application using the Vime library and CSS transforms, covering container setup, control customization, and code examples for a seamless user experience.

CSS transform · frontend · rotation
0 likes · 10 min read

Alibaba Cloud Developer
Jun 13, 2024 · Artificial Intelligence

Creating a Full AI‑Generated Music Video with Large‑Model Agents

This article documents the end‑to‑end workflow of using large multimodal models and specialized agents to automatically generate a storyboard, compose original music and lyrics, produce keyframes, and assemble a complete music video, while highlighting the remaining manual steps and future automation possibilities.

AI · Music · Storyboard
0 likes · 10 min read

HelloTech
Jan 25, 2024 · Backend Development

Design and Implementation of a Custom Multimedia Framework Using FFmpeg

The Haro Street Cat mobile team built a custom multimedia framework that wraps FFmpeg 4.2.2 in a C++ core library, with Android/iOS compatibility layers and Java wrappers for transcoding, live streaming, and composition. It delivers hardware‑accelerated decoding, flexible filter pipelines, and reliable transcoding, which boosted coverage to over 99%, cut storage by more than 30%, accelerated video start‑up, and improved streaming and watermarking performance.

C++ · Filter Graph · Multimedia
0 likes · 27 min read

Zhengcaiyun Technology (政采云技术)
Jun 1, 2023 · Fundamentals

Fundamentals of Audio and Video Capture for Real‑Time Applications

This article introduces the basic concepts of audio and video capture—including sampling, quantization, PCM storage, YUV formats, camera operation, and pixel resolution—explaining how these technologies enable non‑contact, fully digital government procurement services during the COVID‑19 pandemic.

PCM · Real-Time · audio
0 likes · 17 min read

Python Programming Learning Circle
Apr 28, 2023 · Backend Development

10 Python Automation Scripts to Simplify Repetitive Tasks

This article presents ten practical Python automation scripts—including HTML parsing, QR code scanning, screenshot capture, audiobook creation, PDF editing, StackOverflow querying, mobile device control, CPU/GPU temperature monitoring, Instagram uploading, and video watermarking—to help readers eliminate repetitive tasks and streamline their workflows.

Mobile · Scripting · Web Scraping
0 likes · 13 min read

DaTaobao Tech
Aug 5, 2022 · Frontend Development

Front-End Development Guide for Short Video Infinite Scroll

The guide details building a short‑video infinite‑scroll interface using a vertical Swiper carousel with virtual slides, custom HTML5 video players, status buttons, and a fixed loading bar, ensuring only the visible card renders video and minimizing memory for endless content streams.

Code Samples · infinite scroll · swiper
0 likes · 9 min read

Taobao Frontend Technology
Jun 27, 2022 · Frontend Development

How VideoX Tackles Complex Video Playback Across Massive E‑commerce Platforms

VideoX, a front‑end player built for Alibaba’s massive e‑commerce ecosystem, addresses diverse playback scenarios—from product detail videos to live streams—by offering multi‑format support, customizable controls, multi‑video management, and a layered architecture that separates playback core, business integration, and experience assurance.

architecture · multiplatform · playback
0 likes · 33 min read

NetEase Smart Enterprise Tech+
Dec 8, 2021 · Mobile Development

How to Accurately Measure Real‑Time Audio/Video Performance on Mobile Devices

This article explains why traditional CPU usage metrics are unreliable for real‑time audio/video apps, introduces essential performance indicators, reviews native and third‑party analysis tools for iOS and Android, and proposes power‑consumption‑based evaluation methods with practical testing guidelines.

Android · Mobile · Performance Testing
0 likes · 15 min read

ByteFE
Nov 29, 2021 · Frontend Development

History and Technical Overview of Web Audio/Video: From Early HTML to HTML5, Flash, Codecs, Canvas Playback and FFmpeg

This article traces the evolution of web audio and video from the static early HTML era through Flash's rise and fall, explains HTML5 video/audio support, discusses video and audio encoding, container formats, bitrate, playback pipelines, canvas‑based rendering, and provides practical FFmpeg command examples for developers.

HTML5 · Web Development · audio
0 likes · 19 min read

Tencent Advertising Technology
Aug 18, 2021 · Artificial Intelligence

2021 Tencent Advertising Algorithm Competition: Winners, Accepted Papers, and Reviewer Feedback

The 2021 Tencent Advertising Algorithm Competition, held as the ACM MM 2021 Grand Challenge, announced the top three teams for two tracks, presented the accepted multimodal video advertising papers with detailed reviewer comments, and highlighted the significance of algorithmic innovation over ranking alone.

ACM MM · AI · Advertising
0 likes · 8 min read

NetEase Smart Enterprise Tech+
Aug 9, 2021 · Artificial Intelligence

How NetEase’s 2021 Audio‑Video Tech Conference Shaped the Future of AI‑Driven Media

The 2021 NetEase "Eye‑Opening, Sound‑Immersive" audio‑video technology conference gathered experts from NetEase’s divisions to showcase AI‑powered video processing, deep‑fake detection, immersive audio, and innovative media solutions, drawing over 54,000 viewers and promising ongoing video releases for the community.

AI · audio · conference
0 likes · 8 min read

DataFunTalk
May 15, 2021 · Artificial Intelligence

Multi‑Interest Recall Techniques in iQIYI Short‑Video Recommendation

The article reviews the evolution of iQIYI's short‑video recommendation recall pipeline, detailing multi‑interest recall methods such as clustering‑based recall, MOE‑based recall, single‑activation multi‑interest networks, regularization strategies, dynamic capacity handling, and multimodal extensions, and discusses their impact on recommendation performance.

Transformer · iQIYI · machine learning
0 likes · 15 min read

Python Crawling & Data Mining
May 5, 2021 · Game Development

Master Pyglet: Build Games, Audio, and Video with Python

This tutorial walks you through installing pyglet, creating windows, adding text and images, handling keyboard and mouse events, processing input, and playing audio and video, providing a comprehensive guide to building lightweight Python games and multimedia applications.

Game Development · Python · audio
0 likes · 15 min read

DeWu Technology
Jan 24, 2021 · Fundamentals

Overview of Video Container Formats and H.264 Encoding

The article outlines how video container formats such as AVI, MOV, MP4, WMV, RM, FLV and MKV package encoded streams, then explains H.264 encoding fundamentals—including I‑, P‑, B‑frames, macroblocks, GOP structure, and NAL units like SPS and PPS that define parameters for efficient compression and transport.

Container · H.264 · NAL
0 likes · 10 min read

Aotu Lab
Jan 8, 2021 · Frontend Development

Front‑End Tech Highlights: Video Players, Performance Tips, AI Recommendations

From the rise and fall of Flash to modern front‑end video playback techniques, performance optimization strategies, AI recommendation fundamentals, CLI design best practices, and a glimpse into game development and algorithm analysis, this article surveys diverse cutting‑edge technologies shaping today’s software landscape.

AI · CLI · algorithm
0 likes · 10 min read

Tencent Music Tech Team
Aug 14, 2020 · Frontend Development

Web Implementation of Transparent Video Gift Animations Using Canvas and WebGL

The article describes how a live‑room video‑gift feature originally built for mobile was ported to a web client by extracting separate color and alpha video streams, compositing them on canvas, and then migrating the per‑pixel blending to WebGL shaders, which cut CPU usage dramatically and raised frame rates to about 60 FPS. It also outlines further optimisations such as pre‑loading, mobile support, and possible MSE or WebAssembly approaches.

Canvas · Performance Optimization · WebGL
0 likes · 9 min read

JD Tech Talk
Jun 11, 2020 · Frontend Development

Development and Implementation of the NutUI Video Component for Mobile Web

This article explains the motivation, design, implementation details, code examples, and troubleshooting tips for building a NutUI video component in Vue that supports basic playback, custom controls, mobile compatibility, and various configuration options for enterprise‑level front‑end projects.

Component · HTML5 · Mobile
0 likes · 18 min read

Taobao Frontend Technology
Jun 9, 2020 · Frontend Development

Unlocking Taobao Live: Front‑End Multimedia Tech Behind the Hype

This article explores the front‑end multimedia technologies that power Taobao Live, covering video and audio fundamentals, container and codec formats, streaming protocols, player architecture, web media APIs, and popular open‑source frameworks for building robust live‑streaming experiences.

Multimedia · audio · live streaming
0 likes · 16 min read

Tencent IMWeb Frontend Team
Jan 18, 2020 · Frontend Development

How We Overcame Video and Audio Pitfalls in a Holiday Web Event

This article details the challenges and solutions encountered while implementing inline video playback, audio handling, canvas snow effects, progressive animations, and performance optimizations for a Christmas-themed year‑end activity page, providing practical code snippets and best‑practice recommendations for front‑end developers.

animation · audio · frontend
0 likes · 14 min read

Huajiao Technology
Jan 14, 2020 · Frontend Development

HJPlayer: A JavaScript Player for FLV and HLS Streams in the Browser

HJPlayer is a lightweight JavaScript library that enables browsers to play H264/AAC encoded FLV live and VOD streams as well as HLS streams by demuxing them into fragmented MP4 and feeding them through the Media Source Extensions API, offering both ES6 module and script tag integration.

FLV · Media Source Extensions · hls
0 likes · 4 min read

Tencent IMWeb Frontend Team
Jun 28, 2019 · Frontend Development

How Tencent Scaled Online Education with Mini‑Program Architecture and Engineering

This article details Tencent's online‑education mini‑program ecosystem, covering business matrix, native framework selection, engineering practices, audio/video integration, automated release pipelines, performance optimization through sub‑packages, and a comparison of WeChat and QQ mini‑program platforms.

Automation · frontend · mini-program
0 likes · 19 min read

Xianyu Technology
Dec 20, 2018 · Operations

Optimizing Short Video Playback with Preloading and Proxy Caching

The system preloads the MP4 header and initial frames, routes playback through a local proxy that caches range‑requested segments in an LRU disk store, and moves the moov box to the start of the file (or fetches it separately), cutting short‑video start‑up latency to roughly 800 ms and delivering near‑instant playback.

Proxy · Streaming · caching
0 likes · 13 min read

UC Tech Team
Oct 31, 2018 · Frontend Development

Implementing Picture-in-Picture (PiP) with the Web API

This article explains how to use the new Picture-in-Picture Web API to enable floating video playback on web pages, covering setup of video and button elements, request handling, error management, event listeners, window‑size tracking, feature detection, and best‑practice UI considerations.

HTML5 · JavaScript · Picture-in-Picture
0 likes · 8 min read

JD Tech
Apr 26, 2018 · Frontend Development

Using HTML5 Video Tag in Mobile Web: Attributes, Inline/Fullscreen Playback, Dynamic Source Replacement, Live Streaming, and Common Issues

This article explains how to use the HTML5 video tag on mobile web pages, covering common attributes, inline versus fullscreen playback on iOS and Android, dynamic source updates in Vue, live‑streaming protocols with video.js, and practical fixes for background‑audio and layering problems.

HTML5 · Mobile · Vue
0 likes · 13 min read

360 Quality & Efficiency
Jan 24, 2018 · Fundamentals

Understanding Video Basics: Frame Rate, Resolution, and Bitrate

This article explains the fundamental video parameters—frame rate, resolution, and bitrate—how they affect playback smoothness and visual clarity, compares differences between film and game rendering, and offers practical guidance for optimizing video quality on mobile devices.

Frame Rate · Mobile · bitrate
0 likes · 9 min read

Meitu Technology
Jul 27, 2017 · Backend Development

Meitu Internet Technology Salon: Live Streaming Technology Architecture and Practices

At Meitu’s fifth Internet Technology Salon in Xiamen, senior engineers from Meitu and Hulu detailed the company’s self‑built cloud live‑streaming stack, multi‑center optimization, DASH‑based high‑definition delivery, and the evolution of Meipai’s bullet‑screen architecture, which now supports nearly a million concurrent users, highlighting performance gains, cost control, and future intelligent dispatch strategies.

Backend · CDN · DASH
0 likes · 12 min read

Tencent TDS Service
Aug 11, 2016 · Frontend Development

Unlocking H5 Video Live Streaming: From Capture to Playback

This article walks through mobile video live streaming fundamentals, covering H5 playback, WebRTC recording, HLS streaming, RTMP server setup, iOS capture, user interaction features, and practical code examples to help front‑end engineers build robust live video solutions.

H5 · RTMP · WebRTC
0 likes · 16 min read

Huawei Cloud Developer Alliance
Aug 2, 2016 · Cloud Computing

Inside Huawei CloudOpera IES: Architecture, Open Capabilities & Hands‑On Labs

The HDG Hangzhou event showcased Huawei’s CloudOpera IES architecture, north‑ and south‑bound open capabilities, hands‑on remote labs, plus deep dives into Huawei CCE hybrid‑cloud containers, Kubernetes design, and video UI integration, all wrapped in live streaming, audience interaction, and community networking.

CloudOpera · Huawei · Kubernetes
0 likes · 8 min read

Suning Design
Jul 18, 2014 · Frontend Development

How Dynamic Video Transforms Web Pages: Strategies and Real‑World Examples

Dynamic video is reshaping web communication by reducing text reading costs and enhancing user immersion, as illustrated through case studies like Kickstarter’s annual highlights, Google Glass concept sites, Google Doodle, NISSIN, Apple’s Mac 30‑year celebration, and Eastpak’s interactive campaigns, offering practical design insights.

User experience · dynamic media · frontend
0 likes · 9 min read