Big Data 19 min read

Data Middle Platform: General Architecture and Core Components

The article explains the concept, benefits, and detailed modular architecture of a data middle platform, covering data storage, acquisition, processing, governance, security, and operation frameworks, and illustrates how enterprises can build and evolve such platforms to turn data into valuable services.

Top Architect
Top Architect
Top Architect
Data Middle Platform: General Architecture and Core Components

Most enterprises now prefer a centralized data collection and storage model rather than siloed pipelines, enabling rapid application deployment and unified data management as a strategic asset.

The emergence of the data middle platform addresses the mismatch between data development speed and application development, originating from Alibaba's "big middle platform, small front end" concept.

A generic data middle platform architecture decouples six functional subsystems—data storage, acquisition, processing, governance, security, and operation—allowing flexible composition and incremental implementation.

Key components include:

Data storage framework: supports object, block, and database storage, handling raw, structured, and unstructured data, as well as metadata and tagging.

Data acquisition framework: provides multiple ingestion methods (FTP, DB, API, streaming, web crawling) and preprocessing to filter irrelevant data.

Data processing framework: handles ETL, batch and stream processing, AI analysis, cleansing, exchange, and scheduling.

Data governance framework: manages catalogs, models, metadata, master data, quality, and lineage, but excludes security and sharing functions.

Data security framework: implements logging, authentication, authorization, and encryption across all data flows.

Data operation framework: offers portals, capability exposure, data publishing, and monitoring to support external services.

The platform’s ultimate goal is to transform collected data into reusable services, supporting various business scenarios while ensuring security, scalability, and continuous operation.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

Big DataData PlatformData IntegrationData Governancemiddle platformData Architecture
Top Architect
Written by

Top Architect

Top Architect focuses on sharing practical architecture knowledge, covering enterprise, system, website, large‑scale distributed, and high‑availability architectures, plus architecture adjustments using internet technologies. We welcome idea‑driven, sharing‑oriented architects to exchange and learn together.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.