Tagged articles
32 articles
Page 1 of 1
Machine Heart
Machine Heart
Apr 7, 2026 · Artificial Intelligence

A Comprehensive Survey of Tactile‑Based Multimodal Fusion in Embodied Intelligence

This survey reviews state‑of‑the‑art research up to Q1 2026 on integrating tactile sensing with vision and language for embodied AI, presenting a four‑stage fusion pipeline, a hierarchical taxonomy of datasets, methods, sensors, and highlighting current evaluation challenges and future directions.

DatasetsEmbodied AIRobotics
0 likes · 13 min read
A Comprehensive Survey of Tactile‑Based Multimodal Fusion in Embodied Intelligence
Machine Heart
Machine Heart
Apr 5, 2026 · Artificial Intelligence

How Imitation Learning Powers Dexterous Manipulation: A 2021‑2025 Technical Roadmap

This survey maps the 2021‑2025 progress of imitation learning for dexterous manipulation, detailing theoretical foundations, datasets, algorithms, hardware platforms, and evaluation protocols, and highlights challenges such as data quality, hardware dependence, and the need for standardized benchmarks to advance embodied AI.

AlgorithmsDatasetsDexterous Manipulation
0 likes · 11 min read
How Imitation Learning Powers Dexterous Manipulation: A 2021‑2025 Technical Roadmap
Big Data Technology Tribe
Big Data Technology Tribe
Mar 15, 2026 · Databases

How to Build Distributed Scalar Indexes with Lance and Ray

This guide explains the end‑to‑end workflow for constructing a distributed scalar index in Lance by orchestrating validation, fragment sharding, worker‑level indexing via Ray, and final metadata merging, complete with code snippets and detailed step‑by‑step instructions.

DatasetsLancePython
0 likes · 12 min read
How to Build Distributed Scalar Indexes with Lance and Ray
Liangxu Linux
Liangxu Linux
Oct 21, 2025 · Artificial Intelligence

Explore 4 Must‑Try Open‑Source AI Tools: Datasets, Finance Model, Real‑Time Speech, and Agent Toolbox

This article introduces four high‑impact open‑source projects—a curated public dataset collection, the Kronos financial K‑line analysis model, WhisperLiveKit for real‑time speech transcription, and Youtu‑agent for building versatile AI agents—each with descriptions, key features, and GitHub links.

AI modelsDatasetsagent toolbox
0 likes · 6 min read
Explore 4 Must‑Try Open‑Source AI Tools: Datasets, Finance Model, Real‑Time Speech, and Agent Toolbox
HyperAI Super Neural
HyperAI Super Neural
Oct 21, 2025 · Artificial Intelligence

7 Essential Math Reasoning Datasets for AI: From Arithmetic to Visual Geometry

This article compiles seven prominent math reasoning datasets—including We‑Math2.0‑Standard, NuminaMath‑LEAN, T‑Wix, Nemotron‑Math‑HumanReasoning, Open‑Omega‑Atom‑1.5M, GSM8K, and VCBench—detailing their sizes, sources, associated papers, and unique features to support high‑quality AI research on mathematical problem solving.

AIBenchmarkDatasets
0 likes · 9 min read
7 Essential Math Reasoning Datasets for AI: From Arithmetic to Visual Geometry
HyperAI Super Neural
HyperAI Super Neural
Sep 29, 2025 · Artificial Intelligence

8 Popular Remote Sensing Object Detection Datasets with One-Click Downloads

This article presents a curated list of eight widely used remote sensing object detection datasets covering indoor scenes, landslides, drone imagery, crop diseases, safety vests, human fractures, urban issues, and plant diseases, each with size estimates and direct download links for researchers.

AIComputer VisionDatasets
0 likes · 10 min read
8 Popular Remote Sensing Object Detection Datasets with One-Click Downloads
Data Party THU
Data Party THU
Sep 5, 2025 · Artificial Intelligence

What a PRISMA Review Uncovers About Retrieval‑Augmented Generation (RAG)

This systematic PRISMA review analyzes 128 highly‑cited RAG papers, covering five major databases, 343 datasets, a detailed technical roadmap, evaluation metrics from EM to LLM‑as‑Judge, and future research directions, showing that RAG has evolved into a complex, programmable, and auditable distributed system.

AIDatasetsEvaluation Metrics
0 likes · 5 min read
What a PRISMA Review Uncovers About Retrieval‑Augmented Generation (RAG)
IT Services Circle
IT Services Circle
Sep 4, 2025 · Artificial Intelligence

4 Open‑Source AI Tools: Datasets, K‑Line Model, Real‑Time Speech, Agent Toolbox

This article introduces four high‑impact open‑source AI projects—a curated high‑quality dataset collection, the Kronos financial K‑line model, WhisperLiveKit for real‑time speech transcription, and Youtu‑agent for building versatile AI agents—highlighting their features, usage, and GitHub links.

AI agentsDatasetsfinancial modeling
0 likes · 6 min read
4 Open‑Source AI Tools: Datasets, K‑Line Model, Real‑Time Speech, Agent Toolbox
Model Perspective
Model Perspective
Sep 3, 2025 · Artificial Intelligence

Top Free Datasets for AI, ML, and Data Science Projects – A Curated Guide

This article compiles a comprehensive list of high‑quality, publicly available datasets across domains such as general platforms, education, finance, health, text, and vision, providing URLs, key features, and practical usage tips to help researchers and practitioners quickly find the right data for their AI and data‑science projects.

AIData ScienceDatasets
0 likes · 11 min read
Top Free Datasets for AI, ML, and Data Science Projects – A Curated Guide
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Jul 23, 2025 · Artificial Intelligence

How to Leverage TLM Platform for Comprehensive Large Model Evaluation

This guide explains how to use the TianJi Large Model (TLM) platform to create evaluation tasks, choose effectiveness or performance modes, work with built‑in datasets, interpret detailed reports, and understand the underlying metrics and judge‑model techniques for large‑model assessment.

AI metricsDatasetsModel Evaluation
0 likes · 9 min read
How to Leverage TLM Platform for Comprehensive Large Model Evaluation
AI Frontier Lectures
AI Frontier Lectures
Jun 19, 2025 · Artificial Intelligence

Essential Multimodal Datasets for AI Research – Links, Stats, and Quick Overview

This article compiles a curated list of widely used multimodal datasets—including CLEVR, Visual Genome, Pangea, Touch‑Vision‑Language, WIT, and more—providing download URLs, key statistics, and brief descriptions to help researchers quickly locate the right data for vision‑language and multimodal model training.

AIDatasetslanguage models
0 likes · 9 min read
Essential Multimodal Datasets for AI Research – Links, Stats, and Quick Overview
AIWalker
AIWalker
May 11, 2025 · Artificial Intelligence

Unified Multimodal Understanding and Generation: A 30K‑Word Survey of Recent Advances

This comprehensive survey reviews the rapid progress of multimodal understanding and text‑to‑image generation models, categorises existing unified architectures into diffusion‑based, autoregressive, and hybrid paradigms, analyses their tokenisation strategies, datasets and benchmarks, and highlights current challenges and future research directions.

Autoregressive ModelsDatasetsMultimodal AI
0 likes · 64 min read
Unified Multimodal Understanding and Generation: A 30K‑Word Survey of Recent Advances
AntTech
AntTech
Dec 23, 2024 · Artificial Intelligence

Ant Group’s AIGC Security Detection System Earns Top Rating in China ICT Academy’s Multimodal Evaluation

Ant Group’s AIGC security detection system was evaluated by the China Information and Communication Research Institute, achieving the highest "Excellent" rating with a 0.99 F1 score across image, video, and audio modalities, while also releasing large‑scale detection datasets for the research community.

AIGC detectionAnt GroupBenchmark
0 likes · 5 min read
Ant Group’s AIGC Security Detection System Earns Top Rating in China ICT Academy’s Multimodal Evaluation
Kuaishou Tech
Kuaishou Tech
Dec 1, 2023 · Artificial Intelligence

Short Video Recommendation Algorithm Frontier Research Forum at CCIR 2023

The CCIR 2023 conference in Beijing, sponsored by Kuaishou, hosted a short‑video recommendation algorithm frontier research forum where over 100 experts and students shared the latest AI‑driven recommendation technologies, open datasets, and interdisciplinary challenges in short‑video platforms.

AIDatasetsconference
0 likes · 8 min read
Short Video Recommendation Algorithm Frontier Research Forum at CCIR 2023
TAL Education Technology
TAL Education Technology
Aug 31, 2023 · Artificial Intelligence

Research on Content-Based Image Retrieval Techniques

This article reviews the fundamentals, feature extraction methods, evaluation metrics, and common datasets of content‑based image retrieval (CBIR), discussing traditional low‑level features, local descriptors, unsupervised and supervised learning approaches, and recent deep‑learning models for improving retrieval performance.

CBIRDatasetsDeep Learning
0 likes · 13 min read
Research on Content-Based Image Retrieval Techniques
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Aug 12, 2023 · Artificial Intelligence

An Introduction to OCR: Concepts, History, Applications, Datasets, and Technical Workflow

This article provides a comprehensive overview of Optical Character Recognition (OCR), covering its definition, historical development, classification, real‑world applications, technical pipeline, common challenges, mitigation strategies, popular datasets, model performance comparisons, and leading open‑source platforms.

Computer VisionDatasetsDeep Learning
0 likes · 16 min read
An Introduction to OCR: Concepts, History, Applications, Datasets, and Technical Workflow
DataFunSummit
DataFunSummit
May 31, 2023 · Artificial Intelligence

Evolution of Face Detection Techniques: Datasets, Research Directions, and Future Work

This article reviews the evolution of face detection, covering the Widely‑Face dataset, major research directions such as feature fusion, label assignment, auxiliary supervision, anchor‑free methods, NAS‑based designs, summarizes key papers from S3FD to MogFace, introduces ModelScope implementations, and outlines future challenges and opportunities.

AI researchComputer VisionDatasets
0 likes · 13 min read
Evolution of Face Detection Techniques: Datasets, Research Directions, and Future Work
DataFunSummit
DataFunSummit
Oct 19, 2022 · Artificial Intelligence

Series Six of the Integer Intelligence Autonomous Driving Dataset Collection – Overview and Highlights

This article presents a comprehensive overview of several publicly available autonomous driving datasets, focusing on Series Six of the Integer Intelligence collection, which includes StreetLearn, UTBM RoboCar, Multi‑Vehicle Stereo Event Camera, comma2k19, the Annotated Laser Dataset, Ford, and Oxford RobotCar, detailing their sources, download links, publication years, key features, and research relevance.

Computer VisionDatasetsRobotics
0 likes · 10 min read
Series Six of the Integer Intelligence Autonomous Driving Dataset Collection – Overview and Highlights
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Oct 12, 2022 · Artificial Intelligence

Unlock Vision AI: How EasyCV Streamlines Datasets and Model Training

This article introduces EasyCV, an open‑source all‑in‑one visual algorithm platform that abstracts diverse data sources, provides SOTA self‑supervised models, and offers ready‑to‑download datasets for image classification, object detection, segmentation, and pose estimation, complete with configuration examples.

Computer VisionDatasetsDeep Learning
0 likes · 9 min read
Unlock Vision AI: How EasyCV Streamlines Datasets and Model Training
Kuaishou Tech
Kuaishou Tech
Aug 31, 2022 · Artificial Intelligence

Selected Papers from CIKM 2022 on Real‑Time Short Video Recommendation and Large‑Scale Datasets

This article summarizes four CIKM 2022 papers that present a client‑side short‑video recommender, the fully‑observed KuaiRec dataset, the unbiased KuaiRand sequential recommendation dataset, and an industrial‑scale solution for billion‑user lifetime value prediction, highlighting their motivations, methods, and reported impacts.

Datasetsshort videouser modeling
0 likes · 8 min read
Selected Papers from CIKM 2022 on Real‑Time Short Video Recommendation and Large‑Scale Datasets
DeWu Technology
DeWu Technology
Mar 11, 2022 · Artificial Intelligence

Deep Learning in Face Recognition

The article surveys deep‑learning‑based face‑recognition systems, detailing detection, preprocessing, and recognition pipelines, describing evaluation metrics such as TAR, FAR, and Rank‑K, reviewing major datasets like LFW, MS‑Celeb‑1M and VGGFace2, and comparing leading architectures—including FaceNet, CenterLoss, SphereFace and InsightFace—while highlighting their strengths, limitations, real‑world applications, and seminal research references.

AIDatasetsDeep Learning
0 likes · 14 min read
Deep Learning in Face Recognition
Laravel Tech Community
Laravel Tech Community
Sep 5, 2021 · Artificial Intelligence

Comprehensive Collection of Open Data Sources and Datasets for AI and Data Analysis

This article provides a curated list of publicly available data query websites, simple universal datasets, large-scale collections, and specialized datasets for machine learning, image classification, text classification, and recommendation systems, offering valuable resources for AI research and data-driven projects.

Big DataDatasetsImage Classification
0 likes · 7 min read
Comprehensive Collection of Open Data Sources and Datasets for AI and Data Analysis
ITPUB
ITPUB
Oct 20, 2019 · Artificial Intelligence

How NL2SQL Is Revolutionizing Database Queries: Past, Present, and Future

NL2SQL converts natural language questions into executable SQL, bridging the gap between users and databases; the article reviews its value, historical roots, academic positioning, major datasets, current models, challenges, and future directions, highlighting its potential to reshape data interaction across industries.

AIDatasetsNL2SQL
0 likes · 16 min read
How NL2SQL Is Revolutionizing Database Queries: Past, Present, and Future
iQIYI Technical Product Team
iQIYI Technical Product Team
Apr 12, 2019 · Artificial Intelligence

iQIYI Multimodal Technology: Datasets, Applications, and Future Directions

iQIYI leverages multimodal AI—combining audio, visual, and textual cues—to advance video understanding, releasing the world’s largest celebrity dataset (iQIYI‑VID), powering applications such as actor‑focused playback, AI Radar, emoji generation, and rapid automated editing, while pursuing future research in emoji captioning, cross‑modal retrieval, visual question answering, and broader health‑care and education uses.

DatasetsMultimodal AIiQIYI
0 likes · 13 min read
iQIYI Multimodal Technology: Datasets, Applications, and Future Directions
MaGe Linux Operations
MaGe Linux Operations
Aug 21, 2018 · Artificial Intelligence

How Deep Learning Transformed Face Recognition: From Images to Real‑Time Video

This article surveys the evolution of face recognition from early statistical methods to modern deep‑learning approaches, outlines key researchers, open‑source projects, popular APIs, core processing steps, the DeepFace architecture, datasets, and experimental results, providing a comprehensive guide for practitioners and researchers.

CNNComputer VisionDatasets
0 likes · 22 min read
How Deep Learning Transformed Face Recognition: From Images to Real‑Time Video
MaGe Linux Operations
MaGe Linux Operations
Nov 5, 2017 · Artificial Intelligence

How Deep Learning Transforms Modern Face Recognition: From Basics to DeepFace

This article surveys the evolution of face recognition from traditional image‑based methods to real‑time video processing, highlights key researchers and open‑source projects, explains the four‑stage pipeline, details DeepFace's deep‑learning architecture, and provides practical installation and usage instructions for Python developers.

CNNComputer VisionDatasets
0 likes · 21 min read
How Deep Learning Transforms Modern Face Recognition: From Basics to DeepFace