Tagged articles
32 articles
Page 1 of 1
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 17, 2026 · Artificial Intelligence

Mastering Chunk Splitting for RAG: From Fixed Length to Semantic Segmentation

Chunk splitting, a critical yet often overlooked step in RAG pipelines, dramatically impacts retrieval recall and LLM output quality; this guide walks through three evolution stages—from naive fixed‑length splits to sentence‑aware overlaps and finally semantic, structure‑driven segmentation—complete with code, experiments, and practical pitfalls.

LLMRAGchunking
0 likes · 15 min read
Mastering Chunk Splitting for RAG: From Fixed Length to Semantic Segmentation
AI Frontier Lectures
AI Frontier Lectures
Jan 7, 2026 · Artificial Intelligence

RankSEG: Boost Semantic Segmentation Accuracy with Just Three Lines of Code

This article reveals that the conventional threshold/argmax post‑processing for semantic segmentation is sub‑optimal for Dice/IoU metrics, introduces the RankSEG framework that optimizes predictions without retraining, and presents an efficient RankSEG‑RMA approximation with extensive experiments showing consistent performance gains.

Deep LearningDice optimizationRankSEG
0 likes · 12 min read
RankSEG: Boost Semantic Segmentation Accuracy with Just Three Lines of Code
JD Cloud Developers
JD Cloud Developers
Nov 21, 2025 · Artificial Intelligence

Why Chunking Strategy Makes or Breaks RAG Performance

This article explains how different chunking methods—fixed size, semantic, recursive, document‑based, agent‑driven, sentence‑level, and paragraph‑level—affect Retrieval‑Augmented Generation, offering practical guidelines, metrics, and optimization tips for real‑world deployments.

AIRAGchunking
0 likes · 9 min read
Why Chunking Strategy Makes or Breaks RAG Performance
Data Party THU
Data Party THU
Sep 27, 2025 · Artificial Intelligence

How Depth-Guided Texture Diffusion Boosts Image Semantic Segmentation

This article reviews the depth‑guided texture diffusion method, detailing its texture extraction, diffusion, structural consistency optimization, and integration into segmentation networks, and shows how it narrows the depth‑RGB gap to achieve state‑of‑the‑art performance on various semantic segmentation tasks.

Computer Visiondepth-guided diffusionsemantic segmentation
0 likes · 13 min read
How Depth-Guided Texture Diffusion Boosts Image Semantic Segmentation
AIWalker
AIWalker
Sep 2, 2025 · Artificial Intelligence

BEVANet’s Triple Boost for Real-Time Segmentation: Field, Edge, Speed

BEVANet tackles the efficiency‑accuracy trade‑off in real‑time semantic segmentation by integrating large‑kernel attention, an efficient visual attention (EVA) module, a bilateral architecture, and boundary‑guided adaptive fusion, delivering up to 81 % mIoU on Cityscapes at 33 FPS and surpassing prior state‑of‑the‑art models on both accuracy and speed.

Computer VisionReal-Timeefficiency
0 likes · 19 min read
BEVANet’s Triple Boost for Real-Time Segmentation: Field, Edge, Speed
AIWalker
AIWalker
Jun 18, 2025 · Artificial Intelligence

SeNaTra: Nvidia’s Spatial Grouping Layer Pushes Semantic Segmentation Past Swin Transformer

Nvidia introduces SeNaTra, a native‑segmentation vision transformer that replaces uniform down‑sampling with a content‑aware spatial grouping layer, delivering superior zero‑shot and supervised segmentation performance while cutting parameters and FLOPs compared with Swin Transformer and other backbones.

Nvidiasemantic segmentationspatial grouping
0 likes · 29 min read
SeNaTra: Nvidia’s Spatial Grouping Layer Pushes Semantic Segmentation Past Swin Transformer
Baidu MEUX
Baidu MEUX
Jan 3, 2024 · Artificial Intelligence

Mastering AI‑Generated Brand Symbol Posters with Stable Diffusion

This article walks through a complete methodology for creating brand symbol posters using AI, covering basic and advanced Stable Diffusion techniques such as ControlNet, depth‑map generation, semantic segmentation, LoRA integration, and post‑processing to achieve high‑quality, efficient visual assets.

AI image generationControlNetDepth map
0 likes · 10 min read
Mastering AI‑Generated Brand Symbol Posters with Stable Diffusion
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Oct 8, 2023 · Artificial Intelligence

Why the Scale‑Aware Modulation Transformer Outperforms CNNs and Vision Transformers with Fewer Parameters

The Scale‑Aware Modulation Transformer (SMT) introduces a lightweight SAM module and an Evolutionary Hybrid Network that together achieve higher accuracy on ImageNet, COCO, and ADE20K while using significantly fewer parameters and FLOPs than existing CNN and Transformer baselines.

Image ClassificationSMTScale‑Aware Modulation
0 likes · 12 min read
Why the Scale‑Aware Modulation Transformer Outperforms CNNs and Vision Transformers with Fewer Parameters
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Aug 9, 2023 · Artificial Intelligence

DeepLabV2: Architecture, Improvements, and Experimental Results

This article introduces DeepLabV2, explains its challenges, architectural enhancements such as the ASPP module, backbone modifications, poly learning‑rate policy, and presents experimental comparisons on several benchmark datasets, providing a concise yet comprehensive overview for computer‑vision practitioners.

ASPPDeepLabV2semantic segmentation
0 likes · 9 min read
DeepLabV2: Architecture, Improvements, and Experimental Results
Huolala Tech
Huolala Tech
Jul 21, 2023 · Artificial Intelligence

Visual Language Models Power Open-Set Detection and Surgical Tool Segmentation

Recent advances in visual language models enable zero-shot multimodal tasks, and this article explores their application to open-set object detection, prompt learning, and promptable surgical instrument segmentation, highlighting methods like CLIP, CoOp, and the DetPro framework with experimental results across multiple benchmarks.

Computer VisionMultimodalVisual-Language Models
0 likes · 12 min read
Visual Language Models Power Open-Set Detection and Surgical Tool Segmentation
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 21, 2023 · Artificial Intelligence

Problems and Solutions in Semantic Segmentation: An Overview of DeepLabV1

This article explains the two main challenges of applying deep convolutional neural networks to semantic segmentation—signal down‑sampling and loss of spatial precision—and describes how the DeepLabV1 architecture, using dilated convolutions, large‑field‑of‑view modules, fully‑connected CRF and multi‑scale fusion, addresses these issues while achieving faster, more accurate segmentation results.

CRFDeepLabV1dilated convolution
0 likes · 12 min read
Problems and Solutions in Semantic Segmentation: An Overview of DeepLabV1
Meituan Technology Team
Meituan Technology Team
Nov 17, 2022 · Artificial Intelligence

Overview of Recent Meituan Visual Intelligence Research Papers on Content Production, Distribution, and Model Quantization

Meituan’s Visual Intelligence team recently published eight top‑conference papers that advance weakly supervised segmentation, future‑aware captioning, panoptic narrative grounding, video‑text retrieval, open‑vocabulary detection, counterfactual image‑text matching, zero‑shot video classification, and efficient Vision‑Transformer quantization, all directly boosting real‑world content creation, distribution, and model efficiency.

AI researchImage CaptioningModel Quantization
0 likes · 19 min read
Overview of Recent Meituan Visual Intelligence Research Papers on Content Production, Distribution, and Model Quantization
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Nov 16, 2022 · Artificial Intelligence

Deep Learning Semantic Segmentation: FCN Source Code Analysis

This tutorial walks through the complete FCN pipeline for semantic segmentation, covering VOC dataset loading, data augmentation, collate functions, model construction, training loops, loss computation with cross‑entropy (including ignore‑index handling), and inference, while providing full PyTorch code snippets for each step.

FCNPyTorchVOC dataset
0 likes · 19 min read
Deep Learning Semantic Segmentation: FCN Source Code Analysis
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Nov 9, 2022 · Artificial Intelligence

Detailed Explanation of Fully Convolutional Networks (FCN) for Semantic Segmentation

This article provides a comprehensive, beginner‑friendly overview of semantic segmentation, focusing on the pioneering Fully Convolutional Network (FCN) architecture, its variants (FCN‑32s, FCN‑16s, FCN‑8s), underlying concepts, loss computation, and practical tips for working with the VOC dataset.

AlexNetComputer VisionFCN
0 likes · 14 min read
Detailed Explanation of Fully Convolutional Networks (FCN) for Semantic Segmentation
Youzan Coder
Youzan Coder
Sep 5, 2022 · Artificial Intelligence

Inside Youzan’s Query Parser: Architecture, Plugins, and Real‑World Impact

This article explains the role of Youzan’s Query Parser (QP) in search, walks through its overall and layered architecture, details each algorithmic plugin—from preprocessing to synonym handling—and shows concrete code examples and results that improve search relevance across multiple retail scenarios.

NLPSystem ArchitectureYouzan
0 likes · 12 min read
Inside Youzan’s Query Parser: Architecture, Plugins, and Real‑World Impact
Youku Technology
Youku Technology
Jun 7, 2022 · Artificial Intelligence

Mobile Real-Time Portrait Segmentation for Youku Bullet Comment Passthrough

To enable real‑time bullet‑comment passthrough on Youku’s mobile app, the team built a million‑scale portrait dataset and designed the AirSegNet series—CPU, GPU, and server variants—using VGG‑style nets, edge‑aware losses, and hybrid CPU‑GPU inference, achieving 0.98 IoU and sub‑15 ms latency on most devices.

Computer VisionEdge ComputingMNN Framework
0 likes · 13 min read
Mobile Real-Time Portrait Segmentation for Youku Bullet Comment Passthrough
Meituan Technology Team
Meituan Technology Team
Apr 14, 2022 · Artificial Intelligence

Short Video Content Understanding and Generation Practices at Meituan

Meituan leverages computer‑vision techniques to tag, analyze, and automatically generate short videos across consumer and merchant scenarios, detailing hierarchical tag design, self‑supervised representation learning, fine‑grained food recognition, intelligent cover creation, and pixel‑level editing to enhance content discovery and presentation.

AI content generationComputer Visionfine-grained recognition
0 likes · 20 min read
Short Video Content Understanding and Generation Practices at Meituan
Kuaishou Tech
Kuaishou Tech
Aug 9, 2021 · Artificial Intelligence

AI‑Powered Danmu Occlusion Prevention Using Human Portrait Segmentation

The article presents a comprehensive AI solution for video danmu (bullet‑screen) occlusion prevention that leverages human portrait semantic segmentation, describes dataset construction with full‑ and semi‑supervised labeling, details the encoder‑decoder model with context attention, outlines post‑processing for spatial and temporal stability, and explains deployment on cloud and edge using YKit, KwaiNN and TensorRT.

AICloud InferenceDanmu
0 likes · 15 min read
AI‑Powered Danmu Occlusion Prevention Using Human Portrait Segmentation
Kuaishou Tech
Kuaishou Tech
May 10, 2021 · Artificial Intelligence

Semantic Image Matting: Integrating Alpha Pattern Semantics into the Matting Framework

The article presents Semantic Image Matting, a novel approach that incorporates 20 semantic Alpha pattern categories into the matting pipeline via semantic Trimap, region‑based classifiers, multi‑class discriminators, and learnable gradient loss, achieving state‑of‑the‑art results on multiple benchmarks.

Computer VisionDeep Learningalpha patterns
0 likes · 11 min read
Semantic Image Matting: Integrating Alpha Pattern Semantics into the Matting Framework
Meituan Technology Team
Meituan Technology Team
Apr 15, 2021 · Artificial Intelligence

Meituan Technical Team Shares CVPR 2021 Pre-lecture: Five Papers on Video Instance Segmentation, Facial Expression Recognition, Real-time Semantic Segmentation, Weakly Supervised Semantic Segmentation, and Multi-source Domain Adaptation

At a CVPR 2021 pre‑lecture, Meituan’s Visual Intelligence Center showcased five cutting‑edge papers—VisTR transformer‑based video instance segmentation, a feature‑decomposition facial expression recognizer, an accelerated BiSeNet for real‑time semantic segmentation, an embedded discriminative attention mechanism for weakly supervised segmentation, and a partial‑feature selection framework for multi‑source domain adaptation—highlighting the company’s large AI R&D team, university collaborations, real‑world deployment across its services, and ongoing recruitment.

AICVPR2021Facial Expression Recognition
0 likes · 10 min read
Meituan Technical Team Shares CVPR 2021 Pre-lecture: Five Papers on Video Instance Segmentation, Facial Expression Recognition, Real-time Semantic Segmentation, Weakly Supervised Semantic Segmentation, and Multi-source Domain Adaptation
Python Crawling & Data Mining
Python Crawling & Data Mining
Dec 22, 2020 · Artificial Intelligence

Create Stunning Video Ghosting Effects with PaddlePaddle’s DeepLabV3p Model

Learn how to generate cinematic ghosting effects in videos by leveraging PaddlePaddle’s PaddleHub deep learning library and the pretrained deeplabv3p_xception65 model for semantic segmentation, with step‑by‑step code, environment setup, and practical testing on classic martial‑arts footage.

Deep LearningGhost EffectPaddlePaddle
0 likes · 7 min read
Create Stunning Video Ghosting Effects with PaddlePaddle’s DeepLabV3p Model
Amap Tech
Amap Tech
Mar 23, 2020 · Artificial Intelligence

Satellite Imagery for Map Data Updating: Key Elements, Semantic Segmentation Techniques, and Future Challenges

Gaode leverages high‑resolution satellite imagery as an active discovery tool for map updates, extracting road, region and building elements through advanced semantic segmentation networks (U‑Net, ASPP, attention, non‑local) and instance‑segmentation pipelines, to accelerate accurate road‑network and building‑block data refreshes while addressing future scalability challenges.

Computer VisionSatellite ImageryU-Net
0 likes · 11 min read
Satellite Imagery for Map Data Updating: Key Elements, Semantic Segmentation Techniques, and Future Challenges
Amap Tech
Amap Tech
Dec 13, 2019 · Artificial Intelligence

Image Segmentation for High-Definition Mapping: Evolution and Practices at Gaode Maps

Gaode Maps has progressed image segmentation from early heuristic region splitting to modern deep‑learning pipelines—leveraging FCNs, multi‑task networks, Mask R‑CNN, and specialized losses—to achieve centimeter‑level, instance‑aware mapping of roads, signs, and small objects while pursuing lighter, real‑time models.

AIComputer VisionDeep Learning
0 likes · 14 min read
Image Segmentation for High-Definition Mapping: Evolution and Practices at Gaode Maps
Didi Tech
Didi Tech
May 1, 2019 · Artificial Intelligence

Didi AI Labs' DFS Face Detection Algorithm Achieves Top Rankings on the WIDER FACE Benchmark

The DFS face-detection algorithm jointly created by Didi AI Labs and Beijing University's PRIS team secured five first-place and one second-place results on the WIDER FACE benchmark, achieving 96.3% (Easy), 95.4% (Medium) and 90.7% (Hard) AP by leveraging a Feature Fusion Pyramid and semantic-segmentation supervision, and is already deployed in Didi's driver-identity verification and in-vehicle privacy systems.

WIDER FACEfeature fusionsemantic segmentation
0 likes · 5 min read
Didi AI Labs' DFS Face Detection Algorithm Achieves Top Rankings on the WIDER FACE Benchmark
DataFunTalk
DataFunTalk
Mar 15, 2019 · Artificial Intelligence

A Comprehensive Overview of Deep Learning Applications in Computer Vision

This article provides an extensive review of deep learning techniques applied to computer vision, covering the evolution of CNN architectures, image and video processing tasks, 2.5‑D and 3‑D reconstruction, object detection, segmentation, tracking, SLAM, and various practical applications such as AR, content retrieval, and autonomous driving.

CNNComputer VisionImage Processing
0 likes · 22 min read
A Comprehensive Overview of Deep Learning Applications in Computer Vision
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 5, 2019 · Artificial Intelligence

How Alibaba’s Semantic Human Matting Achieves Fully Automatic High‑Precision Image Cutouts

This article introduces Alibaba’s intelligent matting editor and its Semantic Human Matting (SHM) algorithm, detailing the integration of semantic segmentation and deep matting networks, the fusion module, training strategy, experimental results, and the deployment of an online fully‑automatic cutout tool for designers.

AlibabaDeep Learningautomatic cutout
0 likes · 16 min read
How Alibaba’s Semantic Human Matting Achieves Fully Automatic High‑Precision Image Cutouts
Alibaba Cloud Developer
Alibaba Cloud Developer
Feb 27, 2019 · Artificial Intelligence

Inside Alibaba’s AliPlayStudio: Real-Time AI Video Interaction Techniques

This article details how Alibaba’s AliPlayStudio combines advanced computer‑vision algorithms—such as human semantic segmentation, gesture and pose detection, controllable style transfer, and face‑fusion—optimised for low‑power mobile and embedded devices, to deliver engaging real‑time video interactions across online and offline marketing scenarios.

Mobile AIStyle Transferface fusion
0 likes · 17 min read
Inside Alibaba’s AliPlayStudio: Real-Time AI Video Interaction Techniques
Alibaba Cloud Developer
Alibaba Cloud Developer
Jan 16, 2019 · Artificial Intelligence

How Alibaba’s AliPlayStudio Powers Real‑Time AI Video Interactions on Mobile

This article details the research and engineering behind Alibaba's AliPlayStudio, a video‑interactive platform that combines computer‑vision algorithms such as human parsing, gesture and pose detection, and controllable style transfer, all optimized for real‑time deployment on low‑power mobile and embedded devices.

Mobile AIgesture recognitionpose estimation
0 likes · 17 min read
How Alibaba’s AliPlayStudio Powers Real‑Time AI Video Interactions on Mobile
iQIYI Technical Product Team
iQIYI Technical Product Team
Nov 16, 2018 · Artificial Intelligence

iQIYI AI Bullet‑Screen Masking: Semantic Segmentation System and Engineering Insights

iQIYI’s bullet‑screen masking employs a DeepLabv3+‑based two‑class semantic segmentation pipeline, preceded by a close‑up detector and followed by morphological refinement, trained on a custom annotated dataset that raises IoU to 93.6 %, processes hour‑long videos in under an hour, and is slated for future upgrades to instance and panoptic segmentation for finer‑grained masking.

AIVideo processingbullet screen masking
0 likes · 10 min read
iQIYI AI Bullet‑Screen Masking: Semantic Segmentation System and Engineering Insights
Architects Research Society
Architects Research Society
Oct 11, 2015 · Artificial Intelligence

Decision Forests for Pixel-Level Classification in Computer Vision

This article traces the evolution of computer vision from its 1960s origins, explains the challenges of image classification and semantic segmentation, and introduces pixel-level decision forest algorithms as an efficient solution for large‑scale pixel classification tasks.

Computer Visiondecision forestpixel classification
0 likes · 9 min read
Decision Forests for Pixel-Level Classification in Computer Vision