Scaling Fashion AI: How Zhiyi Built a Massive Image‑Recognition Platform on Alibaba Cloud

This article details how Hangzhou Zhiyi Technology leverages AI, big‑data pipelines, and Alibaba Cloud services to create a scalable fashion‑focused image‑recognition and visual‑search platform, covering company background, system architecture, model training, vector search, and future technical upgrades.

Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Scaling Fashion AI: How Zhiyi Built a Massive Image‑Recognition Platform on Alibaba Cloud

Company Overview

Hangzhou Zhiyi Technology Co., Ltd is a national high‑tech enterprise driven by artificial intelligence, founded in February 2018. It secured a $10 million Series A round in its founding year and a RMB 200 million Series B round in 2021, earning a place on Hangzhou’s quasi‑unicorn list.

Zhiyi has developed a suite of SaaS products—Zhiyi, Zhikuan, Meinian—providing fashion trend forecasting, design empowerment, and intelligent style recommendation, serving thousands of fashion brands such as UR, Vipshop, and Peacebird.

Solution Architecture on Alibaba Cloud

The platform is divided into product, service, data, and big‑data layers.

Product layer: multiple apps (Zhiyi, Meinian) and customizable APIs for data access and image‑search.

Service layer: front‑ and back‑end systems containerized on Alibaba Cloud Container Service for Kubernetes (ACK).

Data layer: raw images stored in OSS, business data in MySQL, KV data in HBase, feature vectors in Proxima, and metadata in Elasticsearch.

Big‑data components include Log Service (SLS) for vector caching, DataWorks + MaxCompute for offline warehousing, Python recommendation jobs on ACK, and an on‑prem GPU cluster for batch image recognition.

Big‑Data Solution Evolution

Stage 1 – IDC‑built CDH Cluster

Data were synced from Alibaba Cloud to an on‑prem CDH cluster for Hive processing, then results returned to the cloud.

Stage 2 – DataWorks + MaxCompute

Computation migrated to MaxCompute with task orchestration via DataWorks, reducing operational complexity.

Stage 3 – Elasticsearch for Ad‑hoc Queries

Elasticsearch (managed by Alibaba Cloud) with Proxima vector search enables fast multi‑dimensional queries for trend analysis, handling millions of records per query.

Image Recognition Pipeline

All fashion images are processed offline: each image is transformed into a 256‑dimensional feature vector and indexed by Proxima.

Model Training

Annotated images are used to train models on a GPU cluster. Active Learning reduces labeling cost by iteratively training on a small labeled set, predicting unlabeled data, and refining with human verification.

Batch Image Recognition

Trained models are packaged into Docker images and run on GPU nodes. Feature vectors are first cached in SLS, then cleaned (deduplication, etc.) via DataWorks tasks before being written to Proxima. Cloud‑based preemptible GPU instances supplement on‑prem resources, cutting costs.

Single‑Image Recognition

Through an online serving endpoint, users can upload a single image and receive inference results instantly.

Image Search (Visual Search)

When a user uploads a new image, its vector is computed and searched against the Proxima index to retrieve the most similar fashion items, enabling content‑based image retrieval.

Faiss Deployment and Challenges

Initially, Faiss was deployed on a distributed GPU cluster for vector search, but faced stability issues, GPU memory limits, high operational overhead, bandwidth contention, and limited recall size.

Proxima Vector Engine

Proxima, Alibaba’s in‑house vector search engine integrated with managed Elasticsearch, offers high stability, HNSW‑based algorithm without GPU, easy scaling, no bandwidth contention, and larger recall sets, addressing the shortcomings of Faiss.

Technical Roadmap

OLAP Analysis Optimization

Switching from Elasticsearch to ClickHouse for pre‑aggregated wide tables reduced query latency from 9 seconds to under 2 seconds for large‑scale image statistics.

Data Modeling and Governance

DataWorks is being used to enforce unified data modeling standards and automated quality checks, improving data management across the organization.

Future Image‑Search Collaboration

Alibaba Cloud PAI’s similarity‑search solution, which builds models without manual labeling, is under evaluation for potential integration.

More Information

Proxima: https://developer.aliyun.com/article/782391 Image Retrieval Solutions: https://help.aliyun.com/document_detail/313270.html

AIbigdataCloudComputingImageRecognitionFashionTechVectorSearch
Alibaba Cloud Big Data AI Platform
Written by

Alibaba Cloud Big Data AI Platform

The Alibaba Cloud Big Data AI Platform builds on Alibaba’s leading cloud infrastructure, big‑data and AI engineering capabilities, scenario algorithms, and extensive industry experience to offer enterprises and developers a one‑stop, cloud‑native big‑data and AI capability suite. It boosts AI development efficiency, enables large‑scale AI deployment across industries, and drives business value.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.