Tag

object detection


DataFunTalk
Apr 18, 2025 · Artificial Intelligence

Applying ByteDance’s Doubao‑1.5 Vision Model for Image Counting and Automated Annotation

The article demonstrates how ByteDance’s new Doubao‑1.5 multimodal model can be used to locate and count objects in images—such as sushi plates, street signs, and cartoon hats—by generating coordinates and overlaying visual annotations through a concise Python script.

AI · Doubao · Image Annotation
5 min read

Python Programming Learning Circle
Dec 19, 2024 · Artificial Intelligence

Overview of Microsoft’s Open‑Source Computer Vision Recipes Library

This article introduces Microsoft's open-source Computer Vision Recipes library: its purpose, target audience, repository links, and the vision scenarios it supports (image classification, image similarity, object detection, keypoint detection, segmentation, action recognition, multi-object tracking, and crowd counting). It also gives guidance on using PyTorch, Azure, and GPU resources.

Azure · Open-source · PyTorch
7 min read

Python Programming Learning Circle
Dec 18, 2024 · Artificial Intelligence

Object Detection in Python Using Template Matching

This article demonstrates how to perform object detection in Python without machine‑learning frameworks by using OpenCV’s template‑matching functions, covering single‑object detection, multi‑object detection with thresholding, and providing complete code examples for loading images, matching, locating matches, drawing bounding boxes, and visualizing results.

Python · Template Matching · computer vision
6 min read

php中文网 Courses
Dec 18, 2024 · Artificial Intelligence

Using PHP to Access the Camera and Perform Face Detection with OpenCV

This article explains how to install the OpenCV and php-facedetect libraries, write PHP code to capture images from a webcam, perform face detection using the pico library, and display the results, providing a step‑by‑step guide for object detection with PHP.

Camera · PHP · computer vision
5 min read

php中文网 Courses
Oct 11, 2024 · Artificial Intelligence

Using PHP to Access a Webcam and Perform Object (Face) Detection with OpenCV

This tutorial explains how to install OpenCV and php-facedetect, write PHP code to capture images from a webcam, perform face detection, and display the results, providing step‑by‑step commands and a complete example script.

PHP · backend · computer vision
6 min read

Rare Earth Juejin Tech Community
Aug 22, 2024 · Artificial Intelligence

Understanding Faster R-CNN: Architecture, Training, and Experimental Results

This article provides an in‑depth overview of the Faster R‑CNN object detection framework, covering its background, key innovations such as the Region Proposal Network, detailed algorithmic principles, training procedures, experimental results on PASCAL VOC and MS COCO, and a reproducible PyTorch implementation.

Faster R-CNN · PyTorch · RPN
14 min read

160 Technical Team
Jul 29, 2024 · Artificial Intelligence

How YOLO Transforms Medical Report Screening and Occlusion Detection

Leveraging the YOLO family of deep‑learning models, this study demonstrates efficient filtering of irrelevant medical images, accurate classification of textual reports, and robust detection of occluding objects, achieving high precision and speed on both CPU and GPU, while outlining training details, performance metrics, and future improvements.

YOLO · deep learning · medical imaging
17 min read

Rare Earth Juejin Tech Community
May 10, 2024 · Artificial Intelligence

Real-Time Dog Detection in Browser Using TensorFlow.js and MobileNet V2

This guide demonstrates how to build a web‑based real‑time dog detector that accesses the phone camera via the browser, processes video frames with TensorFlow.js and a pre‑trained COCO‑SSD MobileNet V2 model, and plays an audio alert when a dog is recognized, all deployed on an Android device using Termux.

Android · JavaScript · MobileNet
8 min read

Rare Earth Juejin Tech Community
May 4, 2024 · Artificial Intelligence

Real-time LEGO Brick Detection on Mobile Using p5.js and Roboflow

This tutorial explains how to build a mobile web app that captures video with the phone camera, loads a custom LEGO detection model via Roboflow, identifies selected brick types in real time, and visualizes the results on a p5.js canvas, including deployment steps and improvement suggestions.

Webcam · mobile · object detection
11 min read

php中文网 Courses
Apr 16, 2024 · Artificial Intelligence

Using PHP and OpenCV for Camera‑Based Object Detection

This tutorial explains how to install required libraries, write PHP code that captures images from a webcam, uses OpenCV and php‑facedetect to detect faces, and displays the results with annotated bounding boxes, providing a foundation for further object detection projects.

Camera · PHP · computer vision
6 min read

DataFunTalk
Nov 24, 2023 · Artificial Intelligence

Open Vocabulary Detection Contest 2023: Summary of Winning Teams' Technical Solutions

The article reviews the Open Vocabulary Detection Contest organized by the China Society of Image and Graphics (CSIG) and 360 AI Institute, describing the competition setup, dataset characteristics, and detailed winning approaches that combine Detic, CLIP, prompt learning, and multi‑stage pipelines to achieve strong few‑shot and zero‑shot object detection performance.

CLIP · Open Vocabulary Detection · Zero-shot Learning
17 min read

DataFunTalk
Oct 2, 2023 · Artificial Intelligence

DAMO-YOLO: A High‑Efficiency, High‑Accuracy Object Detection Framework

DAMO‑YOLO is an open‑source, high‑speed and high‑precision object detection framework that leverages MAE‑NAS for low‑cost model customization, Efficient RepGFPN and HeavyNeck for enhanced multi‑scale detection, and a universal distillation technique to boost performance across model scales.

Efficient RepGFPN · MAE-NAS · Model Distillation
15 min read

Rare Earth Juejin Tech Community
Aug 17, 2023 · Artificial Intelligence

Getting Started with YOLOv8 on the Ultralytics Platform: Installation, Command‑Line Usage, and Model Training

This article introduces the YOLOv8 object‑detection framework on the Ultralytics platform, covering environment setup, command‑line and Python APIs for inference, model‑file options, result interpretation, data annotation, training procedures, and exporting models to various deployment formats.

Python · Ultralytics · YOLO
14 min read

Xiaohongshu Tech REDtech
Jun 20, 2023 · Artificial Intelligence

Open-Vocabulary Object Attribute Recognition with OvarNet: A Unified Framework for Detection and Attribute Classification

At CVPR 2023 the Xiaohongshu team presented OvarNet, a unified one‑stage Faster‑RCNN model built on CLIP that uses prompt learning and knowledge distillation to jointly detect objects and recognize open‑vocabulary attributes, achieving state‑of‑the‑art results on VAW, MS‑COCO, LSA and OVAD datasets.

attribute recognition · computer vision · knowledge distillation
12 min read

360 Tech Engineering
May 6, 2023 · Artificial Intelligence

Open‑Vocabulary Object Detection: Overview of OVR‑CNN, RegionCLIP, and CORA

This article reviews the evolution of open‑vocabulary object detection, describing the OVR‑CNN paradigm, the RegionCLIP enhancements, and the CORA model with region prompting and anchor pre‑matching, and discusses their impact on future multimodal AI systems.

CORA · CLIP · Multimodal Models
14 min read

DataFunTalk
Apr 25, 2023 · Artificial Intelligence

DAMO-YOLO: An Efficient Object Detection Framework with NAS, Multi‑Scale Fusion, and Full‑Scale Distillation

This article introduces DAMO‑YOLO, a high‑performance object detection framework that combines low‑cost model customization via MAE‑NAS, an Efficient RepGFPN with HeavyNeck for superior multi‑scale detection, and a full‑scale distillation technique, delivering faster inference, lower FLOPs, and higher accuracy across diverse industrial scenarios.

NAS · YOLO · computer vision
15 min read

DataFunSummit
Apr 13, 2023 · Artificial Intelligence

ModelScope CV Model Overview: Visual Detection and Keypoint Applications

This article presents a comprehensive overview of ModelScope's computer‑vision models, detailing its visual detection and keypoint solutions (including ViTDet, YOLOX, Res2Net, HRNet, and 3D pose models) along with their architectures, performance highlights, real‑world applications, and future development plans.

AI models · ModelScope · computer vision
11 min read

Sohu Tech Products
Apr 12, 2023 · Artificial Intelligence

Using Apple CreateML for Object Detection: From Data Annotation to Model Deployment

This article walks through the complete workflow of building an iOS object‑detection model with Apple’s CreateML, covering data collection, JSON annotation, using Roboflow for labeling, configuring training parameters, exporting the model, and integrating it into a Swift app via the Vision framework.

CreateML · Swift · data annotation
11 min read

Baidu Tech Salon
Apr 7, 2023 · Artificial Intelligence

Ambiguity-Resistant Semi-supervised Learning (ARSL) for Single-stage Object Detection

ARSL, an ambiguity‑resistant semi‑supervised learning framework for single‑stage object detection, introduces Joint‑Confidence Estimation and Task‑Separation Assignment to resolve selection and assignment ambiguities in pseudo‑labels, thereby markedly improving pseudo‑label quality and achieving state‑of‑the‑art AP gains on COCO benchmarks.

ARSL · Semi-supervised Learning · computer vision
8 min read