Tagged articles
17 articles
Page 1 of 1
Machine Heart
Machine Heart
May 19, 2026 · Artificial Intelligence

HyperEyes: Parallel Multimodal Search Agents Move from Deep to Wide for Efficiency

HyperEyes introduces a unified‑location‑as‑search (UGS) action space, parallel data synthesis, and a dual‑granularity efficiency‑aware RL framework that enable multimodal agents to perform simultaneous multi‑target retrieval, dramatically reducing interaction rounds while improving accuracy and cost‑efficiency across benchmark evaluations.

Reinforcement Learningagentbenchmark
0 likes · 9 min read
HyperEyes: Parallel Multimodal Search Agents Move from Deep to Wide for Efficiency
DataFunSummit
DataFunSummit
May 15, 2026 · Artificial Intelligence

From Text to Images: Building Multimodal Product Search with Elasticsearch Serverless

The article analyzes the shift from keyword‑based to multimodal e‑commerce search, outlines a generic architecture that combines text and image embedding with vector retrieval, and demonstrates how Elasticsearch Serverless and Alibaba Cloud AI Search platform enable a low‑cost, scalable, and high‑performance product search solution.

AI searchElasticsearchEmbedding
0 likes · 20 min read
From Text to Images: Building Multimodal Product Search with Elasticsearch Serverless
DataFunSummit
DataFunSummit
May 7, 2026 · Artificial Intelligence

From Text to Images: Building Multimodal Product Search with Elasticsearch Serverless

This article walks through a complete multimodal product search solution, explaining how embedding and vector retrieval technologies—combined with Elasticsearch Serverless and Alibaba Cloud AI Search—enable image‑based and semantic queries, detailing the architecture, key algorithms, quantization tricks, and practical deployment steps.

AI searchElasticsearchEmbedding
0 likes · 22 min read
From Text to Images: Building Multimodal Product Search with Elasticsearch Serverless
DataFunSummit
DataFunSummit
May 4, 2026 · Artificial Intelligence

Inside Alibaba Cloud AI Search: Agentic RAG Architecture and Multi‑Agent Techniques

Alibaba Cloud AI Search tackles high‑concurrency, multimodal, and multi‑hop queries by evolving its Agentic RAG architecture from a single agent to a coordinated multi‑agent system that integrates planning, retrieval, and generation, leverages hybrid vector‑text‑DB‑graph recall, GPU‑accelerated indexing, quantization, NL2SQL, and multimodal search, with performance data and real‑world case studies.

AI searchAgentic RAGAlibaba Cloud
0 likes · 6 min read
Inside Alibaba Cloud AI Search: Agentic RAG Architecture and Multi‑Agent Techniques
DataFunSummit
DataFunSummit
Apr 19, 2026 · Artificial Intelligence

How to Build a Multimodal Product Search Engine with Embedding and Vector Retrieval on Elasticsearch Serverless

This article explains a complete multimodal product search solution that combines text and image embeddings, dense, sparse, and hybrid models, vector similarity metrics, and Elasticsearch Serverless features such as dense_vector, sparse_vector, hybrid search, quantization, and RRF ranking to achieve fast, accurate, and cost‑effective retrieval.

ElasticsearchEmbeddingServerless
0 likes · 20 min read
How to Build a Multimodal Product Search Engine with Embedding and Vector Retrieval on Elasticsearch Serverless
DataFunSummit
DataFunSummit
Apr 4, 2026 · Databases

How Hologres 4.0 Unifies Real‑Time Data Warehousing and AI‑Native Analytics

This article analyzes the architectural evolution of Alibaba Cloud Hologres from a fragmented multi‑engine data stack to the All‑in‑One design of Hologres 4.0, detailing its multimodal search, AI‑native functions, performance benchmarks, resource governance, lake integration, and real‑world deployment scenarios.

AI-native analyticsHologrescloud database
0 likes · 12 min read
How Hologres 4.0 Unifies Real‑Time Data Warehousing and AI‑Native Analytics
DataFunSummit
DataFunSummit
Mar 29, 2026 · Artificial Intelligence

How to Build a Multimodal Product Search Engine with Embedding and Vector Retrieval on Elasticsearch Serverless

This article explores the evolution of e‑commerce search toward multimodal and cross‑modal capabilities, outlines a generic architecture that combines text and image processing via embedding and vector retrieval, and demonstrates how to implement the solution using Alibaba Cloud's AI Search Open Platform and Elasticsearch Serverless with detailed guidance on models, similarity metrics, quantization, and performance optimization.

ElasticsearchEmbeddingVector Retrieval
0 likes · 22 min read
How to Build a Multimodal Product Search Engine with Embedding and Vector Retrieval on Elasticsearch Serverless
DataFunSummit
DataFunSummit
Mar 24, 2026 · Artificial Intelligence

How to Build a Multimodal Product Search System with Embedding and Vector Retrieval

This article presents a comprehensive, end‑to‑end solution for multimodal product search, detailing the evolution from keyword to image‑based queries, the core embedding and vector retrieval technologies, practical Elasticsearch Serverless integration, quantization methods, and a complete demo workflow for building a high‑performance, low‑cost search platform.

AI search platformElasticsearchEmbedding
0 likes · 21 min read
How to Build a Multimodal Product Search System with Embedding and Vector Retrieval
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 11, 2025 · Artificial Intelligence

How Multimodal Product Search Transforms E‑Commerce with Embedding and Vector Retrieval

This article explores the evolution from keyword‑based to multimodal e‑commerce search, detailing a universal solution that combines text and image processing through embedding and vector retrieval, and demonstrates how Alibaba Cloud's AI Search Open Platform and Elasticsearch Serverless enable fast, low‑cost, and scalable multimodal product search deployments.

EmbeddingVector Retrievalmultimodal search
0 likes · 17 min read
How Multimodal Product Search Transforms E‑Commerce with Embedding and Vector Retrieval
21CTO
21CTO
May 18, 2021 · Big Data

How Baidu Scales Multimodal Image Search with the Imazon Platform

This article explains Baidu's multimodal retrieval system, detailing the offline and online pipelines, the image processing and indexing platform (Imazon), its architecture, key technologies such as ANN and GPU models, and the optimization practices that enable massive daily image ingestion and real‑time search at billion‑scale.

BaiduImage Processinglarge-scale data
0 likes · 13 min read
How Baidu Scales Multimodal Image Search with the Imazon Platform
Youku Technology
Youku Technology
Jul 9, 2020 · Artificial Intelligence

Multi-level Multi-modal Search Engine and Graph Engine for Billion-scale Video Content

An advanced multi‑level, multi‑modal search and graph engine for Youku processes text, voice, image and video queries across hierarchical video elements, using combined vector and inverted indexes to merge cross‑level and cross‑modal results, while a distributed knowledge‑graph layer enables multimodal graph traversal for billion‑scale video retrieval.

aigraph enginelarge-scale indexing
0 likes · 12 min read
Multi-level Multi-modal Search Engine and Graph Engine for Billion-scale Video Content
Youku Technology
Youku Technology
Jun 17, 2020 · Industry Insights

How Youku’s Multi‑Modal Search Engine Powers Billion‑Scale Video Retrieval

This article details the design and implementation of Youku’s Multi‑Modal Search Engine (MMS), covering its distributed multi‑level indexing architecture, vector retrieval using Aitheta, cross‑modal query scheduling, graph‑based execution engine, and real‑world applications such as intelligent video search and image‑based series lookup.

Vector RetrievalVideo platformgraph execution engine
0 likes · 10 min read
How Youku’s Multi‑Modal Search Engine Powers Billion‑Scale Video Retrieval
Youku Technology
Youku Technology
Feb 3, 2020 · Artificial Intelligence

Alibaba Entertainment Video Search Algorithms: Practice and Insights

In this talk, senior algorithm expert Ruo Ren outlines Alibaba Entertainment’s video‑search framework—covering basic relevance, ranking, and multimodal techniques that blend information retrieval, NLP, machine learning, and computer vision—using Youku as a case study to illustrate business needs, algorithmic challenges, and practical implementation solutions.

AI AlgorithmsAlibaba EntertainmentYouku
0 likes · 2 min read
Alibaba Entertainment Video Search Algorithms: Practice and Insights