Tagged articles

Semantic Segmentation

33 articles · Page 1 of 1

Mar 17, 2026 · Artificial Intelligence

Mastering Chunk Splitting for RAG: From Fixed Length to Semantic Segmentation

Chunk splitting, a critical yet often overlooked step in RAG pipelines, dramatically impacts retrieval recall and LLM output quality; this guide walks through three evolution stages—from naive fixed‑length splits to sentence‑aware overlaps and finally semantic, structure‑driven segmentation—complete with code, experiments, and practical pitfalls.

ChunkingLLMRAG

0 likes · 15 min read

Mastering Chunk Splitting for RAG: From Fixed Length to Semantic Segmentation

xkx's Tech General Store

Feb 8, 2026 · Artificial Intelligence

Mastering U‑Net: The Core Engine of Stable Diffusion – Theory to Practice

This article introduces the U‑Net architecture—originally designed for medical image segmentation—explains why its pixel‑wise processing makes it the core denoising engine in Stable Diffusion, details three key modifications for diffusion models, and walks through a ResNet‑50‑based implementation trained on the VOC2012 dataset, achieving 0.92 pixel accuracy and 0.64 mean IoU.

Deep LearningPyTorchResNet50

0 likes · 11 min read

Mastering U‑Net: The Core Engine of Stable Diffusion – Theory to Practice

AI Frontier Lectures

Jan 7, 2026 · Artificial Intelligence

RankSEG: Boost Semantic Segmentation Accuracy with Just Three Lines of Code

This article reveals that the conventional threshold/argmax post‑processing for semantic segmentation is sub‑optimal for Dice/IoU metrics, introduces the RankSEG framework that optimizes predictions without retraining, and presents an efficient RankSEG‑RMA approximation with extensive experiments showing consistent performance gains.

Deep LearningDice optimizationRankSEG

0 likes · 12 min read

RankSEG: Boost Semantic Segmentation Accuracy with Just Three Lines of Code

JD Cloud Developers

Nov 21, 2025 · Artificial Intelligence

Why Chunking Strategy Makes or Breaks RAG Performance

This article explains how different chunking methods—fixed size, semantic, recursive, document‑based, agent‑driven, sentence‑level, and paragraph‑level—affect Retrieval‑Augmented Generation, offering practical guidelines, metrics, and optimization tips for real‑world deployments.

AIChunkingInformation Retrieval

0 likes · 9 min read

Why Chunking Strategy Makes or Breaks RAG Performance

Data Party THU

Sep 27, 2025 · Artificial Intelligence

How Depth-Guided Texture Diffusion Boosts Image Semantic Segmentation

This article reviews the depth‑guided texture diffusion method, detailing its texture extraction, diffusion, structural consistency optimization, and integration into segmentation networks, and shows how it narrows the depth‑RGB gap to achieve state‑of‑the‑art performance on various semantic segmentation tasks.

Semantic Segmentationcomputer visiondepth-guided diffusion

0 likes · 13 min read

How Depth-Guided Texture Diffusion Boosts Image Semantic Segmentation

AIWalker

Sep 2, 2025 · Artificial Intelligence

BEVANet’s Triple Boost for Real-Time Segmentation: Field, Edge, Speed

BEVANet tackles the efficiency‑accuracy trade‑off in real‑time semantic segmentation by integrating large‑kernel attention, an efficient visual attention (EVA) module, a bilateral architecture, and boundary‑guided adaptive fusion, delivering up to 81 % mIoU on Cityscapes at 33 FPS and surpassing prior state‑of‑the‑art models on both accuracy and speed.

EfficiencyReal-timeSemantic Segmentation

0 likes · 19 min read

BEVANet’s Triple Boost for Real-Time Segmentation: Field, Edge, Speed

AIWalker

Jun 18, 2025 · Artificial Intelligence

SeNaTra: Nvidia’s Spatial Grouping Layer Pushes Semantic Segmentation Past Swin Transformer

Nvidia introduces SeNaTra, a native‑segmentation vision transformer that replaces uniform down‑sampling with a content‑aware spatial grouping layer, delivering superior zero‑shot and supervised segmentation performance while cutting parameters and FLOPs compared with Swin Transformer and other backbones.

NVIDIASemantic SegmentationVision Transformer

0 likes · 29 min read

SeNaTra: Nvidia’s Spatial Grouping Layer Pushes Semantic Segmentation Past Swin Transformer

AIWalker

May 12, 2025 · Artificial Intelligence

DefMamba: A Deformable Multi‑Scale Visual Foundation Model that Boosts Vision Tasks

DefMamba introduces a multi‑scale backbone, deformable Mamba modules, and a dynamic scanning strategy to preserve image spatial structure, achieving state‑of‑the‑art performance on image classification, object detection, and semantic segmentation benchmarks.

DefMambaSemantic Segmentationcomputer vision

0 likes · 23 min read

DefMamba: A Deformable Multi‑Scale Visual Foundation Model that Boosts Vision Tasks

AI Frontier Lectures

Apr 15, 2025 · Artificial Intelligence

How Dual-Optimization Distillation Boosts SAM-Driven Multimodal Image Fusion

This article presents a novel dual‑optimization distillation framework that injects Segment Anything Model (SAM) semantic priors into multimodal image fusion, achieving unified visual quality and task accuracy while using a lightweight sub‑network to keep inference efficient.

Image FusionSAMSemantic Segmentation

0 likes · 11 min read

How Dual-Optimization Distillation Boosts SAM-Driven Multimodal Image Fusion

Baidu MEUX

Jan 3, 2024 · Artificial Intelligence

Mastering AI‑Generated Brand Symbol Posters with Stable Diffusion

This article walks through a complete methodology for creating brand symbol posters using AI, covering basic and advanced Stable Diffusion techniques such as ControlNet, depth‑map generation, semantic segmentation, LoRA integration, and post‑processing to achieve high‑quality, efficient visual assets.

AI image generationControlNetDepth map

0 likes · 10 min read

Alibaba Cloud Big Data AI Platform

Oct 8, 2023 · Artificial Intelligence

Why the Scale‑Aware Modulation Transformer Outperforms CNNs and Vision Transformers with Fewer Parameters

The Scale‑Aware Modulation Transformer (SMT) introduces a lightweight SAM module and an Evolutionary Hybrid Network that together achieve higher accuracy on ImageNet, COCO, and ADE20K while using significantly fewer parameters and FLOPs than existing CNN and Transformer baselines.

SMTScale‑Aware ModulationSemantic Segmentation

0 likes · 12 min read

Why the Scale‑Aware Modulation Transformer Outperforms CNNs and Vision Transformers with Fewer Parameters

Rare Earth Juejin Tech Community

Aug 9, 2023 · Artificial Intelligence

DeepLabV2: Architecture, Improvements, and Experimental Results

This article introduces DeepLabV2, explains its challenges, architectural enhancements such as the ASPP module, backbone modifications, poly learning‑rate policy, and presents experimental comparisons on several benchmark datasets, providing a concise yet comprehensive overview for computer‑vision practitioners.

ASPPDeepLabV2Semantic Segmentation

0 likes · 9 min read

DeepLabV2: Architecture, Improvements, and Experimental Results

Huolala Tech

Jul 21, 2023 · Artificial Intelligence

Visual Language Models Power Open-Set Detection and Surgical Tool Segmentation

Recent advances in visual language models enable zero-shot multimodal tasks, and this article explores their application to open-set object detection, prompt learning, and promptable surgical instrument segmentation, highlighting methods like CLIP, CoOp, and the DetPro framework with experimental results across multiple benchmarks.

MultimodalSemantic Segmentationcomputer vision

0 likes · 12 min read

Visual Language Models Power Open-Set Detection and Surgical Tool Segmentation

Rare Earth Juejin Tech Community

Jul 21, 2023 · Artificial Intelligence

Problems and Solutions in Semantic Segmentation: An Overview of DeepLabV1

This article explains the two main challenges of applying deep convolutional neural networks to semantic segmentation—signal down‑sampling and loss of spatial precision—and describes how the DeepLabV1 architecture, using dilated convolutions, large‑field‑of‑view modules, fully‑connected CRF and multi‑scale fusion, addresses these issues while achieving faster, more accurate segmentation results.

CRFDeepLabV1Semantic Segmentation

0 likes · 12 min read

Problems and Solutions in Semantic Segmentation: An Overview of DeepLabV1

Meituan Technology Team

Nov 17, 2022 · Artificial Intelligence

Overview of Recent Meituan Visual Intelligence Research Papers on Content Production, Distribution, and Model Quantization

Meituan’s Visual Intelligence team recently published eight top‑conference papers that advance weakly supervised segmentation, future‑aware captioning, panoptic narrative grounding, video‑text retrieval, open‑vocabulary detection, counterfactual image‑text matching, zero‑shot video classification, and efficient Vision‑Transformer quantization, all directly boosting real‑world content creation, distribution, and model efficiency.

AI researchImage CaptioningModel Quantization

0 likes · 19 min read

Overview of Recent Meituan Visual Intelligence Research Papers on Content Production, Distribution, and Model Quantization

Rare Earth Juejin Tech Community

Nov 16, 2022 · Artificial Intelligence

Deep Learning Semantic Segmentation: FCN Source Code Analysis

This tutorial walks through the complete FCN pipeline for semantic segmentation, covering VOC dataset loading, data augmentation, collate functions, model construction, training loops, loss computation with cross‑entropy (including ignore‑index handling), and inference, while providing full PyTorch code snippets for each step.

FCNPyTorchSemantic Segmentation

0 likes · 19 min read

Deep Learning Semantic Segmentation: FCN Source Code Analysis

Rare Earth Juejin Tech Community

Nov 9, 2022 · Artificial Intelligence

Detailed Explanation of Fully Convolutional Networks (FCN) for Semantic Segmentation

This article provides a comprehensive, beginner‑friendly overview of semantic segmentation, focusing on the pioneering Fully Convolutional Network (FCN) architecture, its variants (FCN‑32s, FCN‑16s, FCN‑8s), underlying concepts, loss computation, and practical tips for working with the VOC dataset.

AlexNetFCNSemantic Segmentation

0 likes · 14 min read

Detailed Explanation of Fully Convolutional Networks (FCN) for Semantic Segmentation

Youzan Coder

Sep 5, 2022 · Artificial Intelligence

Inside Youzan’s Query Parser: Architecture, Plugins, and Real‑World Impact

This article explains the role of Youzan’s Query Parser (QP) in search, walks through its overall and layered architecture, details each algorithmic plugin—from preprocessing to synonym handling—and shows concrete code examples and results that improve search relevance across multiple retail scenarios.

NLPSearch EngineSemantic Segmentation

0 likes · 12 min read

Inside Youzan’s Query Parser: Architecture, Plugins, and Real‑World Impact

Youku Technology

Jun 7, 2022 · Artificial Intelligence

Mobile Real-Time Portrait Segmentation for Youku Bullet Comment Passthrough

To enable real‑time bullet‑comment passthrough on Youku’s mobile app, the team built a million‑scale portrait dataset and designed the AirSegNet series—CPU, GPU, and server variants—using VGG‑style nets, edge‑aware losses, and hybrid CPU‑GPU inference, achieving 0.98 IoU and sub‑15 ms latency on most devices.

MNN FrameworkPortrait SegmentationReal-time inference

0 likes · 13 min read

Mobile Real-Time Portrait Segmentation for Youku Bullet Comment Passthrough

Meituan Technology Team

Apr 14, 2022 · Artificial Intelligence

Short Video Content Understanding and Generation Practices at Meituan

Meituan leverages computer‑vision techniques to tag, analyze, and automatically generate short videos across consumer and merchant scenarios, detailing hierarchical tag design, self‑supervised representation learning, fine‑grained food recognition, intelligent cover creation, and pixel‑level editing to enhance content discovery and presentation.

AI content generationSemantic Segmentationcomputer vision

0 likes · 20 min read

Short Video Content Understanding and Generation Practices at Meituan

Kuaishou Tech

Aug 9, 2021 · Artificial Intelligence

AI‑Powered Danmu Occlusion Prevention Using Human Portrait Segmentation

The article presents a comprehensive AI solution for video danmu (bullet‑screen) occlusion prevention that leverages human portrait semantic segmentation, describes dataset construction with full‑ and semi‑supervised labeling, details the encoder‑decoder model with context attention, outlines post‑processing for spatial and temporal stability, and explains deployment on cloud and edge using YKit, KwaiNN and TensorRT.

AICloud InferenceDanmu

0 likes · 15 min read

AI‑Powered Danmu Occlusion Prevention Using Human Portrait Segmentation

Kuaishou Tech

May 10, 2021 · Artificial Intelligence

Semantic Image Matting: Integrating Alpha Pattern Semantics into the Matting Framework

The article presents Semantic Image Matting, a novel approach that incorporates 20 semantic Alpha pattern categories into the matting pipeline via semantic Trimap, region‑based classifiers, multi‑class discriminators, and learnable gradient loss, achieving state‑of‑the‑art results on multiple benchmarks.

Deep LearningSemantic Segmentationalpha patterns

0 likes · 11 min read

Semantic Image Matting: Integrating Alpha Pattern Semantics into the Matting Framework

Meituan Technology Team

Apr 15, 2021 · Artificial Intelligence

Meituan Technical Team Shares CVPR 2021 Pre-lecture: Five Papers on Video Instance Segmentation, Facial Expression Recognition, Real-time Semantic Segmentation, Weakly Supervised Semantic Segmentation, and Multi-source Domain Adaptation

At a CVPR 2021 pre‑lecture, Meituan’s Visual Intelligence Center showcased five cutting‑edge papers—VisTR transformer‑based video instance segmentation, a feature‑decomposition facial expression recognizer, an accelerated BiSeNet for real‑time semantic segmentation, an embedded discriminative attention mechanism for weakly supervised segmentation, and a partial‑feature selection framework for multi‑source domain adaptation—highlighting the company’s large AI R&D team, university collaborations, real‑world deployment across its services, and ongoing recruitment.

AICVPR2021Domain Adaptation

0 likes · 10 min read

Meituan Technical Team Shares CVPR 2021 Pre-lecture: Five Papers on Video Instance Segmentation, Facial Expression Recognition, Real-time Semantic Segmentation, Weakly Supervised Semantic Segmentation, and Multi-source Domain Adaptation

Python Crawling & Data Mining

Dec 22, 2020 · Artificial Intelligence

Create Stunning Video Ghosting Effects with PaddlePaddle’s DeepLabV3p Model

Learn how to generate cinematic ghosting effects in videos by leveraging PaddlePaddle’s PaddleHub deep learning library and the pretrained deeplabv3p_xception65 model for semantic segmentation, with step‑by‑step code, environment setup, and practical testing on classic martial‑arts footage.

Deep LearningGhost EffectPaddlePaddle

0 likes · 7 min read

Create Stunning Video Ghosting Effects with PaddlePaddle’s DeepLabV3p Model

Amap Tech

Mar 23, 2020 · Artificial Intelligence

Satellite Imagery for Map Data Updating: Key Elements, Semantic Segmentation Techniques, and Future Challenges

Gaode leverages high‑resolution satellite imagery as an active discovery tool for map updates, extracting road, region and building elements through advanced semantic segmentation networks (U‑Net, ASPP, attention, non‑local) and instance‑segmentation pipelines, to accelerate accurate road‑network and building‑block data refreshes while addressing future scalability challenges.

Artificial IntelligenceSatellite ImagerySemantic Segmentation

0 likes · 11 min read

Satellite Imagery for Map Data Updating: Key Elements, Semantic Segmentation Techniques, and Future Challenges

Amap Tech

Dec 13, 2019 · Artificial Intelligence

Image Segmentation for High-Definition Mapping: Evolution and Practices at Gaode Maps

Gaode Maps has progressed image segmentation from early heuristic region splitting to modern deep‑learning pipelines—leveraging FCNs, multi‑task networks, Mask R‑CNN, and specialized losses—to achieve centimeter‑level, instance‑aware mapping of roads, signs, and small objects while pursuing lighter, real‑time models.

AIDeep LearningGaode Maps

0 likes · 14 min read

Image Segmentation for High-Definition Mapping: Evolution and Practices at Gaode Maps

Didi Tech

May 1, 2019 · Artificial Intelligence

Didi AI Labs' DFS Face Detection Algorithm Achieves Top Rankings on the WIDER FACE Benchmark

The DFS face-detection algorithm jointly created by Didi AI Labs and Beijing University's PRIS team secured five first-place and one second-place results on the WIDER FACE benchmark, achieving 96.3% (Easy), 95.4% (Medium) and 90.7% (Hard) AP by leveraging a Feature Fusion Pyramid and semantic-segmentation supervision, and is already deployed in Didi's driver-identity verification and in-vehicle privacy systems.

Semantic SegmentationWIDER FACEfeature fusion

0 likes · 5 min read

Didi AI Labs' DFS Face Detection Algorithm Achieves Top Rankings on the WIDER FACE Benchmark

DataFunTalk

Mar 15, 2019 · Artificial Intelligence

A Comprehensive Overview of Deep Learning Applications in Computer Vision

This article provides an extensive review of deep learning techniques applied to computer vision, covering the evolution of CNN architectures, image and video processing tasks, 2.5‑D and 3‑D reconstruction, object detection, segmentation, tracking, SLAM, and various practical applications such as AR, content retrieval, and autonomous driving.

CNNImage processingSLAM

0 likes · 22 min read

A Comprehensive Overview of Deep Learning Applications in Computer Vision

Alibaba Cloud Developer

Mar 5, 2019 · Artificial Intelligence

How Alibaba’s Semantic Human Matting Achieves Fully Automatic High‑Precision Image Cutouts

This article introduces Alibaba’s intelligent matting editor and its Semantic Human Matting (SHM) algorithm, detailing the integration of semantic segmentation and deep matting networks, the fusion module, training strategy, experimental results, and the deployment of an online fully‑automatic cutout tool for designers.

AlibabaDeep LearningSemantic Segmentation

0 likes · 16 min read

How Alibaba’s Semantic Human Matting Achieves Fully Automatic High‑Precision Image Cutouts

Alibaba Cloud Developer

Feb 27, 2019 · Artificial Intelligence

Inside Alibaba’s AliPlayStudio: Real-Time AI Video Interaction Techniques

This article details how Alibaba’s AliPlayStudio combines advanced computer‑vision algorithms—such as human semantic segmentation, gesture and pose detection, controllable style transfer, and face‑fusion—optimised for low‑power mobile and embedded devices, to deliver engaging real‑time video interactions across online and offline marketing scenarios.

Semantic SegmentationStyle Transferface fusion

0 likes · 17 min read

Inside Alibaba’s AliPlayStudio: Real-Time AI Video Interaction Techniques

Alibaba Cloud Developer

Jan 16, 2019 · Artificial Intelligence

How Alibaba’s AliPlayStudio Powers Real‑Time AI Video Interactions on Mobile

This article details the research and engineering behind Alibaba's AliPlayStudio, a video‑interactive platform that combines computer‑vision algorithms such as human parsing, gesture and pose detection, and controllable style transfer, all optimized for real‑time deployment on low‑power mobile and embedded devices.

Real-time InteractionSemantic Segmentationgesture recognition

0 likes · 17 min read

How Alibaba’s AliPlayStudio Powers Real‑Time AI Video Interactions on Mobile

iQIYI Technical Product Team

Nov 16, 2018 · Artificial Intelligence

iQIYI AI Bullet‑Screen Masking: Semantic Segmentation System and Engineering Insights

iQIYI’s bullet‑screen masking employs a DeepLabv3+‑based two‑class semantic segmentation pipeline, preceded by a close‑up detector and followed by morphological refinement, trained on a custom annotated dataset that raises IoU to 93.6 %, processes hour‑long videos in under an hour, and is slated for future upgrades to instance and panoptic segmentation for finer‑grained masking.

AISemantic SegmentationVideo Processing

0 likes · 10 min read

iQIYI AI Bullet‑Screen Masking: Semantic Segmentation System and Engineering Insights

Architects Research Society

Oct 11, 2015 · Artificial Intelligence

Decision Forests for Pixel-Level Classification in Computer Vision

This article traces the evolution of computer vision from its 1960s origins, explains the challenges of image classification and semantic segmentation, and introduces pixel-level decision forest algorithms as an efficient solution for large‑scale pixel classification tasks.

Semantic Segmentationcomputer visiondecision forest

0 likes · 9 min read