Tagged articles
24 articles
Page 1 of 1
Weekly Large Model Application
Weekly Large Model Application
May 5, 2026 · Artificial Intelligence

Task Alignment: How to Give Your Speech Model a Job Handbook

The article explains how to transform a pretrained speech model into a product‑ready assistant by defining demonstration data, clarifying team debates on persona, safety, and length, contrasting alignment with pretraining, and highlighting common pitfalls to avoid during deployment.

Dialogue SystemsSafetySpeech AI
0 likes · 6 min read
Task Alignment: How to Give Your Speech Model a Job Handbook
Machine Heart
Machine Heart
Mar 31, 2026 · Artificial Intelligence

Point‑VLA: Overcoming Embodied AI’s Language Bottleneck with Visual Grounding

The Point‑VLA method introduced by Qianxun AI’s Gaoyang team tackles the fundamental limits of language‑only instruction in vision‑language‑action models by adding visual grounding via bounding‑box cues, boosting real‑robot success rates from 32.4% to 92.5% across six challenging tasks.

Multimodal LearningPoint-VLARobotics
0 likes · 13 min read
Point‑VLA: Overcoming Embodied AI’s Language Bottleneck with Visual Grounding
SuanNi
SuanNi
Mar 27, 2026 · Artificial Intelligence

How OmniScience Dataset Boosts Multimodal AI Understanding of Scientific Figures

The OmniScience project introduces a 1.5‑million high‑quality image‑text pair dataset and a sophisticated pipeline that parses complex scientific documents, rewrites figure captions with large language models, and dramatically improves multimodal AI performance on benchmark tests.

Multimodal AIVisual-Language Modelsdata annotation
0 likes · 9 min read
How OmniScience Dataset Boosts Multimodal AI Understanding of Scientific Figures
360 Smart Cloud
360 Smart Cloud
Sep 28, 2025 · Artificial Intelligence

How TLP Accelerates Large‑Model Data Annotation with Automation

The TLP platform offers comprehensive support for labeling text, image, audio, and video data, detailing the importance of high‑quality annotations for large AI models, task creation, workflow steps, review processes, and an automatic annotation feature that can boost efficiency by over 60 percent.

AI training dataannotation platformdata annotation
0 likes · 6 min read
How TLP Accelerates Large‑Model Data Annotation with Automation
DataFunTalk
DataFunTalk
Sep 15, 2025 · Artificial Intelligence

Why xAI Cut 500 Data Annotators While Expanding Professional AI Mentors

In mid‑September, Elon Musk’s xAI announced a sudden layoff of roughly 500 data‑annotation staff—about a third of the team—while simultaneously pledging to grow its professional AI‑mentor workforce tenfold and imposing post‑layoff testing to reassign remaining employees.

AIAI mentorsLayoffs
0 likes · 5 min read
Why xAI Cut 500 Data Annotators While Expanding Professional AI Mentors
DataFunSummit
DataFunSummit
Feb 4, 2025 · Artificial Intelligence

Training Optimization for Large-Scale Multimodal Models in Content Safety

This article examines the challenges of content safety, outlines the limitations of current task‑specific multimodal models, and proposes large‑model‑inspired training optimizations—including diversified data construction, automated annotation, parameter fine‑tuning, and multi‑task evaluation—to improve efficiency, accuracy, and scalability of multimodal AI systems.

AI OptimizationContent SafetyMultimodal Learning
0 likes · 26 min read
Training Optimization for Large-Scale Multimodal Models in Content Safety
58 Tech
58 Tech
May 11, 2023 · Artificial Intelligence

Stella Data Annotation Platform: Design, Architecture, and AI‑Assisted Labeling

The article details the design and implementation of the Stella data annotation SaaS platform at 58.com, covering its background, evolution, modular architecture, annotation capabilities across text, image, audio, and video, AI‑assisted labeling, storage solutions, quality and efficiency management, as well as localization and licensing considerations.

AI PlatformSystem Architectureactive learning
0 likes · 21 min read
Stella Data Annotation Platform: Design, Architecture, and AI‑Assisted Labeling
DataFunTalk
DataFunTalk
Feb 20, 2023 · Artificial Intelligence

ChatGPT Technology, Localization Efforts, and Open‑Source Large Models – Overview and Practices

This article presents an overview of ChatGPT technology, its evolution, current challenges, a three‑stage learning process, data organization and evaluation, details of domestic localization efforts, practical solutions, and the release of a Chinese open‑source large model with training guidance.

ChatGPTModel Localizationdata annotation
0 likes · 12 min read
ChatGPT Technology, Localization Efforts, and Open‑Source Large Models – Overview and Practices
Zuoyebang Tech Team
Zuoyebang Tech Team
Nov 9, 2022 · Artificial Intelligence

Boost Data Annotation Efficiency with VAPAL: Active Learning Meets Virtual Adversarial Perturbation

This article explains how a pool‑based active learning framework that combines uncertainty sampling (using BADGE, ALPS, or virtual adversarial perturbations) with diversity‑driven clustering can dramatically cut labeling costs for Transformer‑based NLP models, and presents experimental results showing VAPAL’s competitive performance and early‑stage advantages.

NLPactive learningdata annotation
0 likes · 10 min read
Boost Data Annotation Efficiency with VAPAL: Active Learning Meets Virtual Adversarial Perturbation
58 Tech
58 Tech
Jun 9, 2022 · Artificial Intelligence

Multi‑Label Image Recognition for 58.com: Algorithm Design, Data Construction, and Model Optimization

This article presents a comprehensive study of multi‑label image recognition applied to 58.com’s business scenarios, covering problem motivation, dataset construction, evaluation metrics, mainstream deep‑learning methods, an asymmetric‑loss‑based optimization pipeline, and practical output schemes for recommendation and retrieval.

Computer Visionasymmetric lossdata annotation
0 likes · 17 min read
Multi‑Label Image Recognition for 58.com: Algorithm Design, Data Construction, and Model Optimization
Kuaishou Audio & Video Technology
Kuaishou Audio & Video Technology
Mar 10, 2022 · Artificial Intelligence

How Kuaishou Achieved High‑Precision, Low‑Latency Danmu Blocking with AI

To prevent dense on‑screen comments from obscuring key video content, Kuaishou’s audio‑video team built a high‑precision, low‑latency intelligent danmu‑blocking system that uses advanced image‑segmentation masks, temporal stability enhancements, SSIM‑based scene detection, and a large‑scale annotated dataset to ensure robust, real‑time protection across diverse video scenarios.

AIdanmu blockingdata annotation
0 likes · 11 min read
How Kuaishou Achieved High‑Precision, Low‑Latency Danmu Blocking with AI
Laiye Technology Team
Laiye Technology Team
Dec 10, 2021 · Artificial Intelligence

Best Practices for Building an Entity‑Relationship Annotation Tool at Laiye AI R&D Center

This article details Laiye Technology’s AI R&D team’s end‑to‑end approach to designing and optimizing a custom entity‑relationship annotation tool, covering data‑labeling challenges, shortcomings of Excel and off‑the‑shelf solutions, architectural requirements, line‑breaking and mark‑position algorithms, performance improvements, and real‑world results.

JavaScriptNLPPerformance Optimization
0 likes · 12 min read
Best Practices for Building an Entity‑Relationship Annotation Tool at Laiye AI R&D Center
DataFunTalk
DataFunTalk
Nov 18, 2021 · R&D Management

Insights into AI R&D Management, Cost Efficiency, and Education Solutions at New Oriental AI Research Institute

The article reviews New Oriental’s AI research institute journey, analyzing AI development trends, challenges, performance metrics, organizational structure, cost‑reduction strategies, and product innovations in education, offering practical insights for AI R&D management and enterprise AI deployment.

AIEducation TechnologyR&D management
0 likes · 27 min read
Insights into AI R&D Management, Cost Efficiency, and Education Solutions at New Oriental AI Research Institute
New Oriental Technology
New Oriental Technology
Sep 29, 2021 · Artificial Intelligence

Building and Managing an AI Research Department: Engineering Practices, Organizational Structure, and Educational Applications

This article explores the strategic management and engineering practices required to build an effective AI research department, detailing organizational structures, performance metrics, cost-reduction strategies, and practical educational AI solutions that bridge theoretical algorithms with scalable industrial applications.

AI DevOpsAI R&D ManagementCost Optimization
0 likes · 23 min read
Building and Managing an AI Research Department: Engineering Practices, Organizational Structure, and Educational Applications
Ctrip Technology
Ctrip Technology
Aug 26, 2021 · Artificial Intelligence

Applying Snorkel Weak Supervision to Automate Event Summaries in Ctrip Customer Service

The article explains how Ctrip’s hotel customer‑service team uses the Snorkel weak‑supervision framework to generate large‑scale labeled data for training models that automatically produce structured event summaries, detailing the workflow, labeling functions, generative and discriminative model training, and performance improvements.

Labeling FunctionsNLPSnorkel
0 likes · 14 min read
Applying Snorkel Weak Supervision to Automate Event Summaries in Ctrip Customer Service
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 7, 2020 · Artificial Intelligence

How Active Learning Can Cut Labeling Costs and Boost Model Performance

This article explains active learning techniques that let models select valuable training samples, reducing annotation costs and improving performance, and describes business‑specific adaptations, experiments, and results that demonstrate its effectiveness in content‑safety applications.

active learningbatch samplingdata annotation
0 likes · 14 min read
How Active Learning Can Cut Labeling Costs and Boost Model Performance
Alibaba Cloud Developer
Alibaba Cloud Developer
Feb 7, 2020 · Artificial Intelligence

Tackling Scalability, Data Scarcity, and Training Efficiency in Dialogue Management Models

This article reviews the evolution of dialogue management models from rule‑based systems to deep‑learning approaches, identifies three major challenges—poor scalability, limited annotated data, and low training efficiency—and surveys recent research solutions including semantic matching, knowledge distillation, hierarchical reinforcement learning, model‑based RL, and human‑in‑the‑loop methods.

Conversational AIdata annotationdialogue management
0 likes · 44 min read
Tackling Scalability, Data Scarcity, and Training Efficiency in Dialogue Management Models
DataFunTalk
DataFunTalk
May 30, 2019 · Artificial Intelligence

Data Annotation, Data‑Driven Development, and Decision‑Making in Autonomous Driving

The talk explains how massive, well‑annotated data fuels autonomous‑driving AI, covering data annotation metrics, team structure, efficiency‑boosting techniques, system stability, and how data‑driven development and decision‑making improve model training, evaluation, and product priorities.

artificial intelligenceautonomous drivingdata annotation
0 likes · 9 min read
Data Annotation, Data‑Driven Development, and Decision‑Making in Autonomous Driving
JD Tech
JD Tech
Mar 8, 2019 · Artificial Intelligence

Integrated Engineering & Algorithm Platform for AI Visual Applications

This article describes a comprehensive, end‑to‑end AI visual algorithm platform that unifies data collection, annotation, model training, deployment, testing, quality evaluation, and service gateways, illustrating how such integration improves transparency, efficiency, and quality across use cases like background removal, face swapping, and clothing recommendation.

AIAlgorithm PlatformClothing Recommendation
0 likes · 13 min read
Integrated Engineering & Algorithm Platform for AI Visual Applications
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 4, 2018 · Artificial Intelligence

Turning Fashion Into AI‑Ready Data: Building Practical Image Datasets

This article explains how Alibaba's Image & Beauty team designs and iterates a practical fashion image dataset by aligning data purpose, integrating professional knowledge, handling sample scarcity and structured noise, and defining fine‑grained evaluation metrics to enable AI models that truly understand clothing.

Computer VisionKnowledge Engineeringdata annotation
0 likes · 34 min read
Turning Fashion Into AI‑Ready Data: Building Practical Image Datasets