Big Data 21 min read

Data Architecture, Governance, and Application Practices at XinXuan Group for Live E‑commerce

The article presents a comprehensive case study of XinXuan Group’s data ecosystem, detailing its business background, data platform construction, governance framework, technical architecture, and practical analytics applications that support live‑commerce operations and strategic decision‑making.

DataFunSummit
DataFunSummit
DataFunSummit
Data Architecture, Governance, and Application Practices at XinXuan Group for Live E‑commerce

Introduction

With the rapid rise of the live‑streaming industry, XinXuan Group’s business has expanded quickly, increasing the volume and importance of data in daily operations. The company faces higher demands for data analysis depth, timeliness, accuracy, and completeness.

1. XinXuan Group Overview

Founded in Guangzhou in 2017, XinXuan Group entered the live‑e‑commerce sector in 2018 and achieved annual sales exceeding 150 billion CNY by 2019, later tripling that figure. The group focuses on a supply‑chain‑centric digital retail model, incubating top‑tier livestreaming influencers and building its own brands and factories.

2. Data Construction Implementation

The data platform aims to turn raw data into actionable business value through three main dimensions:

Cost reduction and efficiency improvement (e.g., online data entry, automated business‑process alerts).

Data services for middle‑ and senior‑level management (diagnosing performance, monitoring cash flow, identifying market opportunities).

Support for commercial decision‑making (user analysis, category scouting, account‑level value‑added services).

Challenges include building the platform from scratch, aligning technical selection with business needs, and ensuring data quality, breadth, and trust across departments.

3. Framework and Methodology

The data architecture combines TOGAF and DAMA principles, adapted to XinXuan’s specific context. A V‑shaped model is used to map business processes to data and application layers, emphasizing:

Business architecture first – fully understanding requirements.

Mapping to data‑application architecture – documenting all logical processes.

Explicit input/output definition – every business step produces data.

Data standardization and governance – using data dictionaries and industry standards.

These steps enable rapid business onboarding, clear data lineage, and effective cross‑functional collaboration.

4. Data Source Identification and Governance

Data sources span internal IT systems, external platforms (Douyin, Taobao, etc.), and manual collection. The process includes:

Cataloguing data sources, regardless of availability.

Collecting and consolidating logical data.

Standardizing, cleaning, and integrating data into a central data‑mid‑platform.

Implementing governance measures (encryption, access control, logging, audit trails) to ensure data security and compliance.

5. Technical Architecture

The solution runs on Alibaba Cloud with 2000 CPU cores, 7000 GB memory, and 150 TB storage for images and videos. It supports cold/hot data storage, real‑time and batch computation, AI‑assisted data enrichment, monitoring, and secure access.

6. Data Analysis System

Two major analysis tracks are provided:

Strategic & commercial analysis – supporting high‑level decision‑making.

Operational analysis – covering finance, business, market, and KPI dashboards.

External market data (e.g., JD, Taobao, Douyin) is integrated to enrich internal insights.

7. Application Cases

Self‑operated product analysis: Standardized templates generate market size, growth trends, brand positioning, sentiment analysis, product positioning, and differentiation strategies for categories such as apparel, home‑care, health, and cosmetics.

Data‑driven influencer recommendation: User, influencer, and product profiling models achieve 70‑85 % recommendation accuracy for top influencers and ~80 % for mid‑tier influencers.

Risk control in product selection: Real‑time shipment monitoring, historical performance checks, and TOGAF‑based indicator mapping identify and mitigate supply‑chain risks.

Data security foundation: Encryption, multi‑factor authentication, permission management, logging, and watermarking protect sensitive data.

Conclusion

The case demonstrates how a systematic data‑first approach—combining robust architecture, governance, and analytics—can empower live‑e‑commerce businesses to achieve cost efficiency, strategic insight, and operational resilience.

Big DataData Analyticsdata governancedata-architectureTOGAFLive E‑commerce
DataFunSummit
Written by

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.