Tagged articles

video summarization

5 articles · Page 1 of 1
Old Zhang's AI Learning
Old Zhang's AI Learning
Jun 24, 2026 · Artificial Intelligence

Universal Video Download Skill Evolves into Full‑Video Summarization (z‑video‑study‑webpage‑qwen)

The author open‑sources a universal video‑download Skill and then introduces a companion Skill that automatically extracts audio, frames, and visual insights from a local MP4, runs Whisper and qwen3.7‑plus to generate a structured summary webpage with player, key points, timeline and actionable items.

Multimodal AIOpen-sourceWhisper
0 likes · 3 min read
Universal Video Download Skill Evolves into Full‑Video Summarization (z‑video‑study‑webpage‑qwen)
Bilibili Tech
Bilibili Tech
Oct 13, 2023 · Artificial Intelligence

Multimodal Video High‑Energy Segment Extraction for Dynamic Video Covers

The authors present a multimodal system that automatically extracts high‑energy video segments for dynamic covers by analyzing subtitles, audio, visual frames, and danmu, employing LLM prompt‑tuning, scene‑cut detection, and aesthetic scoring to reduce manual effort and boost click‑through rates.

ASRMultimodal AIOCR
0 likes · 14 min read
Multimodal Video High‑Energy Segment Extraction for Dynamic Video Covers
DataFunTalk
DataFunTalk
May 13, 2023 · Artificial Intelligence

Multimedia Content Understanding at Weibo: Video Summarization, Quality Assessment, OCR, Embedding, and CV‑CUDA Optimization

This article presents Weibo's comprehensive multimedia content understanding pipeline, covering video summarization techniques, quality assessment models, OCR advancements, video embedding strategies, and the performance benefits of CV‑CUDA acceleration, while highlighting real‑world applications and engineering trade‑offs.

CV-CUDADeep LearningEmbedding
0 likes · 32 min read
Multimedia Content Understanding at Weibo: Video Summarization, Quality Assessment, OCR, Embedding, and CV‑CUDA Optimization
Baidu Geek Talk
Baidu Geek Talk
Jun 15, 2022 · Artificial Intelligence

CCL2022 Video Highlight Extraction Challenge Overview

The article describes the CCL2022 Video Highlight Extraction Challenge, a competition at the 21st China Conference on Computational Linguistics organized by Baidu, inviting participants worldwide to generate timestamped concise summaries of video segments, with registration details, eligibility, task description, example inputs/outputs, and evaluation metrics based on timing accuracy and ROUGE-L.

CCL2022Evaluation MetricsNLP
0 likes · 6 min read
CCL2022 Video Highlight Extraction Challenge Overview
NetEase Media Technology Team
NetEase Media Technology Team
Jul 12, 2019 · Artificial Intelligence

Video Highlight Detection and GIF Cover Generation Using 3D Convolutional Scoring

The paper proposes a 3D‑CNN scoring system that ranks short, information‑dense video segments, selects the most exciting one, and converts it into a looping GIF cover, replacing static thumbnails; trained on large video‑GIF datasets with pairwise ranking loss, it improves click‑through rates while reducing bad‑case generation.

3D CNNGIF generationcontent recommendation
0 likes · 13 min read
Video Highlight Detection and GIF Cover Generation Using 3D Convolutional Scoring