Tagged articles
30 articles
Page 1 of 1
AIWalker
AIWalker
Sep 23, 2025 · Artificial Intelligence

DIDB‑ViT Achieves SOTA Binary ViT Results, Outperforms Full‑Precision ResNet‑34 on ADE20K

The paper introduces DIDB‑ViT, a high‑fidelity differential‑information‑driven binary Vision Transformer that closes the performance gap with full‑precision models while keeping the original ViT architecture, and demonstrates state‑of‑the‑art results on image classification and ADE20K segmentation, even surpassing full‑precision ResNet‑34.

binary neural networksedge deploymentimage segmentation
0 likes · 28 min read
DIDB‑ViT Achieves SOTA Binary ViT Results, Outperforms Full‑Precision ResNet‑34 on ADE20K
Huolala Tech
Huolala Tech
Jul 2, 2025 · Artificial Intelligence

Can Diffusion Models Revolutionize Salient Object Detection?

This article introduces a diffusion‑based framework for salient object detection, discusses its background, challenges, and motivations, details the model architecture and training, presents extensive experiments and ablation studies, and outlines limitations and future research directions.

Computer VisionDeep Learningdiffusion model
0 likes · 11 min read
Can Diffusion Models Revolutionize Salient Object Detection?
AIWalker
AIWalker
May 22, 2025 · Artificial Intelligence

VisionReasoner: RL‑Unified System Beats YOLO‑World on Detection, Segmentation, Counting

VisionReasoner introduces a reinforcement‑learning‑driven unified framework that simultaneously handles detection, segmentation, and counting tasks within a single model, achieving 29.1% higher COCO detection AP, 22.1% better ReasonSeg segmentation, and 15.3% improvement on CountBench, while requiring only 7,000 training samples and offering efficient multi‑target matching via batch computation and the Hungarian algorithm.

LVLMObject CountingVisionReasoner
0 likes · 19 min read
VisionReasoner: RL‑Unified System Beats YOLO‑World on Detection, Segmentation, Counting
DaTaobao Tech
DaTaobao Tech
May 16, 2025 · Artificial Intelligence

JianYi: AI‑Powered Image Segmentation and Matting System for Taobao Home‑Decoration

The article introduces JianYi, a self‑developed image segmentation and matting system for Taobao's home‑decoration business that supports product, human, and panoramic segmentation with multi‑modal interaction, achieving high‑precision real‑time performance and powering AI tools such as "Jiazuo" and "Fang Wo Jia".

Computer VisionDeep Learningartificial intelligence
0 likes · 11 min read
JianYi: AI‑Powered Image Segmentation and Matting System for Taobao Home‑Decoration
Baidu Tech Salon
Baidu Tech Salon
Aug 27, 2024 · Artificial Intelligence

How PaddleX Enables Early Detection of Malignant Skin Tumors with AI Segmentation

This article examines the urgent need for early skin cancer detection in China, outlines the challenges of dermatological imaging, and details a low‑code PaddleX solution that leverages PP‑LiteSeg‑T for data preparation, model training, optimization, and deployment to improve diagnostic accuracy and efficiency.

AIDeep LearningPaddleX
0 likes · 10 min read
How PaddleX Enables Early Detection of Malignant Skin Tumors with AI Segmentation
Model Perspective
Model Perspective
Aug 2, 2023 · Artificial Intelligence

How Segment Anything (SAM) Is Revolutionizing Image Segmentation

This article explains the fundamentals of image segmentation, introduces the open‑source Segment Anything Model (SAM) and its massive SA‑1B dataset, outlines SAM's unique promptable, real‑time capabilities, and explores its wide‑ranging future applications across AR/VR, content creation, and scientific research.

AIComputer VisionSAM
0 likes · 7 min read
How Segment Anything (SAM) Is Revolutionizing Image Segmentation
php Courses
php Courses
Jul 21, 2023 · Artificial Intelligence

Image Segmentation with PHP and OpenCV

This tutorial explains how to perform image segmentation using the OpenCV library in PHP, covering environment setup, library import, image loading, grayscale conversion, thresholding, result display, and saving the segmented output.

Computer VisionOpenCVPHP
0 likes · 4 min read
Image Segmentation with PHP and OpenCV
DataFunSummit
DataFunSummit
Apr 1, 2023 · Artificial Intelligence

Overview of Understanding and Editing Vision Models in the ModelScope Community

This article introduces ModelScope's community‑released visual models, detailing the categorization of understanding versus editing models, the two‑stage coarse‑to‑fine segmentation pipeline for matting, and four editing applications—style transfer, portrait beautification, skin enhancement, and anime‑style conversion—while also previewing upcoming sky‑replacement and video‑matting models.

AIModelScopeStyle Transfer
0 likes · 9 min read
Overview of Understanding and Editing Vision Models in the ModelScope Community
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Feb 8, 2023 · Mobile Development

Implementing iOS 16 Depth Effect Using Image Segmentation with CoreML and Vision

This guide shows how to recreate iOS 16’s lock‑screen Depth Effect by integrating Apple’s pre‑trained DeeplabV3 CoreML model with Vision, running a VNCoreMLRequest on a resized image in a background thread, converting the output to a binary mask, and overlaying it on the original photo to simulate foreground‑background depth, with code examples and GitHub source.

CoreMLDepth EffectSwift
0 likes · 7 min read
Implementing iOS 16 Depth Effect Using Image Segmentation with CoreML and Vision
Kuaishou Tech
Kuaishou Tech
Mar 11, 2022 · Artificial Intelligence

High‑Precision Low‑Latency Intelligent Danmu Blocking Solution for Kuaishou Video

The Kuaishou audio‑video team designed a high‑precision, low‑latency intelligent danmu‑blocking system that uses advanced image‑segmentation and temporal‑stability techniques to generate accurate masks, improve scene robustness, eliminate mask delay, and enhance user experience across diverse video content.

AIKuaishoudanmu blocking
0 likes · 10 min read
High‑Precision Low‑Latency Intelligent Danmu Blocking Solution for Kuaishou Video
Kuaishou Audio & Video Technology
Kuaishou Audio & Video Technology
Mar 10, 2022 · Artificial Intelligence

How Kuaishou Achieved High‑Precision, Low‑Latency Danmu Blocking with AI

To prevent dense on‑screen comments from obscuring key video content, Kuaishou’s audio‑video team built a high‑precision, low‑latency intelligent danmu‑blocking system that uses advanced image‑segmentation masks, temporal stability enhancements, SSIM‑based scene detection, and a large‑scale annotated dataset to ensure robust, real‑time protection across diverse video scenarios.

AIdanmu blockingdata annotation
0 likes · 11 min read
How Kuaishou Achieved High‑Precision, Low‑Latency Danmu Blocking with AI
Python Programming Learning Circle
Python Programming Learning Circle
Mar 3, 2022 · Artificial Intelligence

Ten‑Line Python Projects: QR Code, Word Cloud, Image Segmentation, Sentiment Analysis, Mask Detection, Message Spam, OCR, and a Simple Game

This article presents a series of concise Python examples—each under ten lines—demonstrating how to generate QR codes, create word clouds, perform image segmentation, conduct sentiment analysis, detect masks, automate message sending, extract text with OCR, and build a basic number‑guessing game, showcasing the versatility of Python for quick prototyping across AI and utility tasks.

GameOCRQR code
0 likes · 10 min read
Ten‑Line Python Projects: QR Code, Word Cloud, Image Segmentation, Sentiment Analysis, Mask Detection, Message Spam, OCR, and a Simple Game
Kuaishou Audio & Video Technology
Kuaishou Audio & Video Technology
Jun 11, 2021 · Artificial Intelligence

How Collaborative Learning Boosts Co‑Saliency Object Detection

This article introduces a collaborative‑learning based co‑saliency object detection algorithm that incorporates class‑conditioned information during training, dramatically improving the model's ability to distinguish and detect common objects across image groups, and demonstrates its effectiveness through extensive experiments and real‑world applications.

co-saliency detectioncollaborative learningimage segmentation
0 likes · 10 min read
How Collaborative Learning Boosts Co‑Saliency Object Detection
58UXD
58UXD
Apr 12, 2021 · Artificial Intelligence

How 58.com Built an AI Designer: From Smart Cutout to Intelligent Creative Platform

This article chronicles 58.com’s journey from a small brainstorming room to a full‑scale AI design platform, detailing the development of smart cutout, the BASNet segmentation model, custom loss functions, template editing, and the measurable business impact of the AI designer.

AI designBASNetComputer Vision
0 likes · 15 min read
How 58.com Built an AI Designer: From Smart Cutout to Intelligent Creative Platform
DeWu Technology
DeWu Technology
Nov 26, 2020 · Artificial Intelligence

Automated Captcha Recognition Using Machine Learning

The article outlines a machine‑learning pipeline for automated captcha recognition, covering dataset generation, image preprocessing, segmentation via clustering or watershed methods, and classification using classic models and CNNs, achieving roughly 94% accuracy while noting the growing complexity of modern captchas and recommending developer collaboration when feasible.

CaptchaNeural NetworksPython
0 likes · 23 min read
Automated Captcha Recognition Using Machine Learning
21CTO
21CTO
May 20, 2020 · Artificial Intelligence

Batch Remove Image Backgrounds with Python and PaddlePaddle

This tutorial shows how to install PaddlePaddle and PaddleHub, then use a concise Python script to batch‑remove backgrounds from multiple photos, automatically saving the segmented results, and includes sample code and before‑after images to illustrate the process.

Batch ProcessingPaddlePaddlePython
0 likes · 5 min read
Batch Remove Image Backgrounds with Python and PaddlePaddle
ITPUB
ITPUB
May 15, 2020 · Artificial Intelligence

Batch Background Removal with PaddlePaddle: One-Click Python Script

This guide shows how to install PaddlePaddle and PaddleHub, then use a short Python script to automatically segment and remove backgrounds from multiple photos, turning each into a transparent PNG with just a few lines of code.

Batch ProcessingPaddlePaddleimage segmentation
0 likes · 5 min read
Batch Background Removal with PaddlePaddle: One-Click Python Script
Python Programming Learning Circle
Python Programming Learning Circle
May 12, 2020 · Artificial Intelligence

Batch Image Segmentation with Python and PaddlePaddle

This tutorial demonstrates how to use Python and the PaddlePaddle deep‑learning platform to automatically remove backgrounds from multiple photos in one step, covering installation, verification, and a concise five‑line code example for batch human segmentation.

Batch ProcessingComputer VisionDeep Learning
0 likes · 6 min read
Batch Image Segmentation with Python and PaddlePaddle
Youku Technology
Youku Technology
May 11, 2020 · Artificial Intelligence

Interview with Alibaba Senior Algorithm Expert Ren Haibing on Intelligent Video Matting Technology

Ren Haibing, Alibaba’s senior algorithm expert, explains how deep‑learning AI matting—combining salient object detection, semantic and instance segmentation in a two‑stage hard‑then‑soft pipeline—replaces traditional methods, achieves hair‑level detail and temporal consistency, scores 84.3% on Cityscapes, powers large‑scale video‑person extraction, and relies on human review to meet growing industry demand.

AIAlibabaimage segmentation
0 likes · 10 min read
Interview with Alibaba Senior Algorithm Expert Ren Haibing on Intelligent Video Matting Technology
Alibaba Cloud Developer
Alibaba Cloud Developer
Jan 3, 2020 · Artificial Intelligence

How Alibaba’s DAMO Lab Revolutionizes Image Cutout with AI‑Powered Matting

Alibaba's DAMO Academy details its AI‑driven image cutout system, describing why automated matting is needed, the four‑module pipeline (filtering, classification, detection, segmentation), architectural innovations such as dual decoders and fusion networks, and how these advances enable product‑level batch background removal.

AIAlibabaComputer Vision
0 likes · 9 min read
How Alibaba’s DAMO Lab Revolutionizes Image Cutout with AI‑Powered Matting
Amap Tech
Amap Tech
Dec 13, 2019 · Artificial Intelligence

Image Segmentation for High-Definition Mapping: Evolution and Practices at Gaode Maps

Gaode Maps has progressed image segmentation from early heuristic region splitting to modern deep‑learning pipelines—leveraging FCNs, multi‑task networks, Mask R‑CNN, and specialized losses—to achieve centimeter‑level, instance‑aware mapping of roads, signs, and small objects while pursuing lighter, real‑time models.

AIComputer VisionDeep Learning
0 likes · 14 min read
Image Segmentation for High-Definition Mapping: Evolution and Practices at Gaode Maps
Tencent Advertising Technology
Tencent Advertising Technology
Oct 17, 2019 · Artificial Intelligence

Visual Algorithm Applications in Advertising Scenarios

The talk outlines how Tencent Advertising leverages deep‑learning visual algorithms—including GCN‑based edge refinement, template generation, AutoML‑driven smart review, and a dual‑tower click‑through‑rate model—to automate creative production, improve ad quality, and enhance user experience across creation, review, and playback stages.

AIAdvertisingAutoML
0 likes · 7 min read
Visual Algorithm Applications in Advertising Scenarios
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 6, 2019 · Artificial Intelligence

How Deep Learning Unwarps Curved Document Images for Better OCR

This article explores how deep‑learning‑based image dewarping techniques, from traditional hardware methods to modern U‑Net, Stacked U‑Net and Dilated U‑Net architectures, can correct warped document photos, improve OCR accuracy, and support intelligent verification in high‑throughput business scenarios.

Deep LearningModel EvaluationOCR
0 likes · 19 min read
How Deep Learning Unwarps Curved Document Images for Better OCR
Meituan Technology Team
Meituan Technology Team
Dec 27, 2018 · Artificial Intelligence

AI-Driven Automated Banner Design for Visual Marketing

Meituan’s AI‑driven system automates banner creation by extracting material features, sequencing them with a planner, refining layouts via an optimizer, and rendering images with a generator, while supporting segmentation, template expansion, and multi‑resolution adaptation to reduce designers’ repetitive work and enable mass personalization.

AIGenerative Modelsbanner design
0 likes · 21 min read
AI-Driven Automated Banner Design for Visual Marketing
Xianyu Technology
Xianyu Technology
Nov 20, 2018 · Artificial Intelligence

How to Separate Complex Image Foreground from Background Using AI and Classic CV Techniques

This article presents a step‑by‑step solution that combines computer‑vision preprocessing, OCR, CNN classification, shape matching, and inpainting to isolate meaningful foreground elements from images with intricate backgrounds, discussing practical results, limitations, and code implementations.

Computer VisionDeep LearningOpenCV
0 likes · 15 min read
How to Separate Complex Image Foreground from Background Using AI and Classic CV Techniques
Suning Technology
Suning Technology
Apr 16, 2018 · Artificial Intelligence

How Suning’s AI‑Powered Banner Design Platform Revolutionizes E‑Commerce Advertising

This article explains how Suning’s intelligent design platform automates banner creation for online retail by combining deep‑learning image segmentation, rule‑based layout generation, multi‑task evaluation models, and adaptive coloring, dramatically reducing manual effort while boosting personalization and conversion rates.

AIAutomationDeep Learning
0 likes · 17 min read
How Suning’s AI‑Powered Banner Design Platform Revolutionizes E‑Commerce Advertising