Tagged articles

AI dataset

3 articles · Page 1 of 1
AI Large-Model Wave and Transformation Guide
AI Large-Model Wave and Transformation Guide
Jun 1, 2026 · Artificial Intelligence

How to Build High‑Quality AI Datasets: Standards, Templates, and Practical Steps

This guide walks AI engineers and project leaders through the full lifecycle of high‑quality dataset creation—from defining requirements and setting annotation standards to data collection, preprocessing, labeling, augmentation, evaluation, and continuous iteration—providing concrete metrics, compliance rules, and tool recommendations to avoid common pitfalls.

AI datasetData QualityData preprocessing
0 likes · 16 min read
How to Build High‑Quality AI Datasets: Standards, Templates, and Practical Steps
Programmer DD
Programmer DD
Apr 18, 2023 · Artificial Intelligence

Can OpenAssistant Rival ChatGPT? Inside the Largest Open‑Source AI Assistant

This article examines OpenAssistant, the world’s largest open‑source ChatGPT replica, detailing its dataset of over 160 k annotated conversations, the fine‑tuned LLaMA and Pythia models, evaluation results against GPT‑3.5‑turbo, practical usage examples, and the project's current limitations and future directions.

AI datasetChatGPT alternativeLarge Language Model
0 likes · 11 min read
Can OpenAssistant Rival ChatGPT? Inside the Largest Open‑Source AI Assistant
iQIYI Technical Product Team
iQIYI Technical Product Team
Aug 10, 2018 · Artificial Intelligence

iQIYI Releases World's First Multimodal, Multi-angle Celebrity Video Dataset (iQIYI-VID) and Announces AI Competition

iQIYI released iQIYI-VID, the world’s first multimodal, multi-angle celebrity video dataset (1,000 hours, 500,000 clips, 5,000 celebrities) for a new AI competition focusing on multimodal video person recognition, which has attracted global university teams and top computer‑vision judges to advance AI understanding in entertainment.

AI datasetcompetitioncomputer vision
0 likes · 7 min read
iQIYI Releases World's First Multimodal, Multi-angle Celebrity Video Dataset (iQIYI-VID) and Announces AI Competition