Tagged articles
237 articles
Page 1 of 3
DataFunSummit
DataFunSummit
May 16, 2026 · Industry Insights

What Powers Palantir’s 137% Revenue Surge? Inside Its Ontology‑Based Enterprise AI Platform

Palantir’s Q4 2025 revenue jumped 70% to $14.07 billion, with U.S. commercial revenue soaring 137%, driven not merely by AI hype but by its Ontology‑centric approach that tightly integrates data, business logic, actions, and security, locking large enterprises into a deeply embedded decision‑making stack.

AI OpsCase StudiesData Integration
0 likes · 9 min read
What Powers Palantir’s 137% Revenue Surge? Inside Its Ontology‑Based Enterprise AI Platform
Digital Planet
Digital Planet
May 12, 2026 · Industry Insights

Why Central SOEs Are Rushing into DRP – It’s More Complex Than It Looks

The Digitalized Resource‑management Platform (DRP) is being adopted en masse by central state‑owned enterprises as a strategic response to tighter regulatory oversight, the need for precise governance, and untapped data value, but its implementation faces legacy system overload, data‑standard fragmentation, and deep organizational resistance that demand strong leadership, cross‑departmental coordination, and phased, value‑driven execution.

DRPData IntegrationDigital Governance
0 likes · 14 min read
Why Central SOEs Are Rushing into DRP – It’s More Complex Than It Looks
DataFunSummit
DataFunSummit
May 2, 2026 · Artificial Intelligence

How Palantir’s 4‑Layer Ontology Architecture Enables Buildings, Tenants, and Data to ‘Talk’

Healthpeak transformed its commercial‑real‑estate operations by replacing fragmented spreadsheets with Palantir’s AI Platform (AIP), using a four‑layer architecture and ontology‑driven modeling to automate billing, detect anomalies, and orchestrate workflows, dramatically cutting manual effort, errors, and scaling costs.

AI Workflow AutomationCommercial Real EstateData Integration
0 likes · 18 min read
How Palantir’s 4‑Layer Ontology Architecture Enables Buildings, Tenants, and Data to ‘Talk’
DataFunSummit
DataFunSummit
Apr 29, 2026 · Industry Insights

Beyond the Data Rear‑view Mirror: Palantir’s Strategic Value and Real‑World Cases

Palantir leverages its Ontology‑driven data integration and AI platforms—Gotham, Foundry, and AIP—to transform fragmented data into actionable intelligence, delivering decision‑making advantages in government, aerospace, food, and energy sectors, while shifting from custom‑heavy services to an open, platform‑based ecosystem.

AI AgentsAI PlatformData Integration
0 likes · 11 min read
Beyond the Data Rear‑view Mirror: Palantir’s Strategic Value and Real‑World Cases
SuanNi
SuanNi
Apr 27, 2026 · Artificial Intelligence

How MIT’s RUBICON Cuts AI Agent Costs by 90% While Achieving 100% Accuracy

The paper shows that conventional LLM agents fail on real‑world enterprise data because of chaotic data sources, while the RUBICON architecture uses a minimal Agentic Query Language to let users direct data retrieval, achieving 100% accuracy with a much cheaper model and dramatically lower token and monetary costs.

Agentic Query LanguageBenchmarkData Integration
0 likes · 11 min read
How MIT’s RUBICON Cuts AI Agent Costs by 90% While Achieving 100% Accuracy
DataFunSummit
DataFunSummit
Apr 26, 2026 · Industry Insights

Why Palantir AIP Is More Than a Data Platform – The Secret ‘Implementation Orchestration Machine’

The article analyzes how Palantir’s ontology‑driven platforms—Gotham, Foundry, and the 2023 AI Platform (AIP)—break data silos, enable real‑time decision making, and shift the company from custom‑heavy solutions to a low‑code, AI‑agent‑centric ecosystem, illustrated with military, aerospace, and retail case studies.

AI PlatformAIPData Integration
0 likes · 10 min read
Why Palantir AIP Is More Than a Data Platform – The Secret ‘Implementation Orchestration Machine’
Old Zhao – Management Systems Only
Old Zhao – Management Systems Only
Apr 23, 2026 · Operations

Supply Chain vs Logistics vs Procurement: Clear Differences Explained

The article clarifies why many companies confuse procurement, logistics, and supply chain, outlines each function’s specific tasks, shows how fragmented data and unclear boundaries cause order delays, and proposes a linear, data‑driven workflow that links demand, purchasing, inbound, outbound, and delivery for smoother operations.

Data IntegrationLogisticsOperations Management
0 likes · 10 min read
Supply Chain vs Logistics vs Procurement: Clear Differences Explained
AI Large-Model Wave and Transformation Guide
AI Large-Model Wave and Transformation Guide
Apr 22, 2026 · Industry Insights

How to Build a Scalable Ontology‑Driven Investigation Platform: A Full‑Stack Architecture Blueprint

This article dissects the design of an end‑to‑end investigation platform by breaking down its core capabilities, mapping a layered architecture, justifying open‑source component choices, detailing deployment topology, comparing gaps with the commercial Gotham solution, and outlining a phased implementation roadmap.

AIData IntegrationDevOps
0 likes · 12 min read
How to Build a Scalable Ontology‑Driven Investigation Platform: A Full‑Stack Architecture Blueprint
DataFunTalk
DataFunTalk
Apr 21, 2026 · Industry Insights

How a Chinese Bank Used AI Large Models to Revolutionize Data Development

Facing siloed, tool‑fragmented, and low‑quality data pipelines, China Everbright Bank built an AI‑driven, end‑to‑end data integration platform that unifies heterogeneous databases, automates workflow checkpoints, and adds intelligent code quality checks, delivering faster, higher‑quality data services for the financial sector.

AIData DevelopmentData Integration
0 likes · 8 min read
How a Chinese Bank Used AI Large Models to Revolutionize Data Development
Digital Planet
Digital Planet
Apr 14, 2026 · Industry Insights

Why Most FMCG Channel Digitalization Projects Fail and How to Turn Data into Real Incentives

The article analyzes three fundamental pitfalls that cause FMCG channel digitalization projects to produce fake or delayed data, explains why binding sales incentives to real product flow is essential, and outlines a formula and four capability pillars to achieve true online sales expense management.

Channel DigitalizationData IntegrationFMCG
0 likes · 16 min read
Why Most FMCG Channel Digitalization Projects Fail and How to Turn Data into Real Incentives
Digital Planet
Digital Planet
Mar 30, 2026 · Industry Insights

Can Master Kong’s New “One More Bottle” Campaign Reverse Its Decline? A Deep Dive into FMCG Digital Transformation

Facing its first annual revenue decline in a decade, Master Kong revives the classic “One More Bottle” promotion using a five‑code integration that links factories, distributors, stores, and consumers, offering a case study on how digital‑first, full‑chain strategies can rejuvenate legacy FMCG growth models in a saturated market.

Data IntegrationDigital TransformationFMCG
0 likes · 15 min read
Can Master Kong’s New “One More Bottle” Campaign Reverse Its Decline? A Deep Dive into FMCG Digital Transformation
Wukong Talks Architecture
Wukong Talks Architecture
Mar 5, 2026 · Databases

Unifying Card and Coin Payments: KaiwuDB’s Dual‑Mode Solution for Amusement Parks

This article presents a detailed technical case study of using KaiwuDB’s multi‑model database to unify card‑based and coin‑based payment processing in amusement parks, covering architecture, schema design, SQL implementations, offline handling, cross‑model analytics, hot‑cold data tiering, visualization, monitoring, security, and high‑availability strategies.

Amusement ParkData IntegrationDual-Mode Payments
0 likes · 42 min read
Unifying Card and Coin Payments: KaiwuDB’s Dual‑Mode Solution for Amusement Parks
AI Large Model Application Practice
AI Large Model Application Practice
Feb 19, 2026 · Artificial Intelligence

When Should You Add a Knowledge Graph? 6 Practical Decision Criteria

This article outlines six concrete criteria—relationship‑centric data, reproducible reasoning, evolving schemas, multi‑hop queries, explainable decisions, and cross‑system data integration—to help engineers decide whether a knowledge graph is the right solution or if a relational database will suffice.

AI EngineeringData IntegrationKnowledge Graph
0 likes · 15 min read
When Should You Add a Knowledge Graph? 6 Practical Decision Criteria
Fighter's World
Fighter's World
Feb 7, 2026 · Artificial Intelligence

Who Will Capture the Trillion‑Dollar Value of Context Graphs?

The article analyzes why Context Graphs can unlock trillion‑dollar value by unifying heterogeneous enterprise systems, how platform‑level compounding effects outpace vertical AI agents, the strategic advantage of data companies in cross‑system integration, and why open standards and unified Context layers will decide the market winners.

AI AgentsCompetitive analysisContext Graph
0 likes · 25 min read
Who Will Capture the Trillion‑Dollar Value of Context Graphs?
Big Data Tech Team
Big Data Tech Team
Jan 19, 2026 · Big Data

What Is Data Fabric and How It Can Eliminate Data Silos Today

This article explains the concept of Data Fabric, debunks common misconceptions, outlines the three key drivers behind its rise, and provides a practical four‑step roadmap—including metadata, semantic layers, policy engines, and AI—to help teams of any size adopt the technology.

AIData FabricData Integration
0 likes · 7 min read
What Is Data Fabric and How It Can Eliminate Data Silos Today
JavaEdge
JavaEdge
Jan 3, 2026 · Blockchain

Why Oracles Are Essential for Real‑Time On‑Chain Data: Methods & Alternatives

Oracles serve as the crucial bridge that enables smart contracts to access off‑chain data, and while they are the dominant solution for real‑time on‑chain updates, the article also explores alternative approaches such as centralized data entry, state channels, sidechains, and cross‑chain oracles, outlining their pros, cons, and challenges.

Data IntegrationDecentralizedOracle
0 likes · 6 min read
Why Oracles Are Essential for Real‑Time On‑Chain Data: Methods & Alternatives
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 30, 2025 · Big Data

How StarRocks and Apache Paimon Unite to Build a True Lakehouse Native Engine

StarRocks and Apache Paimon have been progressively integrated across multiple releases, enabling a unified lakehouse architecture that supports multi-source federated analysis, time-travel queries, native readers/writers, distributed planning, and advanced profiling, while delivering performance gains that bring Paimon query speed on par with native StarRocks tables.

Apache PaimonData IntegrationLakehouse
0 likes · 9 min read
How StarRocks and Apache Paimon Unite to Build a True Lakehouse Native Engine
Old Meng AI Explorer
Old Meng AI Explorer
Dec 25, 2025 · Industry Insights

How Open-Source OpenBB Terminal Gives You Bloomberg‑Level Analysis for Free

OpenBB Terminal is a free, open‑source financial analysis platform that consolidates over 500 data sources, offers AI‑driven report generation, one‑click industry comparisons, and local Docker deployment, enabling individual investors and small institutions to perform Bloomberg‑level research, quantitative backtesting, and secure data handling without costly subscriptions.

AIData IntegrationDocker
0 likes · 10 min read
How Open-Source OpenBB Terminal Gives You Bloomberg‑Level Analysis for Free
DataFunSummit
DataFunSummit
Nov 9, 2025 · Artificial Intelligence

How Zilliz Cut an 8‑Minute Sales Lead Process to Seconds with AI‑Powered Dify

This article recounts how Zilliz leveraged the low‑code platform Dify to integrate large‑model AI, private data, and business logic, transforming an eight‑minute, manual sales‑lead workflow into a seconds‑level automated pipeline and illustrating a new human‑AI collaboration paradigm.

AIData IntegrationMarketing Automation
0 likes · 14 min read
How Zilliz Cut an 8‑Minute Sales Lead Process to Seconds with AI‑Powered Dify
BirdNest Tech Talk
BirdNest Tech Talk
Oct 11, 2025 · Artificial Intelligence

How to Load Documents into LangChain: From Files to APIs

Learn how to use LangChain's Document Loaders to import data from files, web pages, databases, and APIs, understand the Document object structure, compare load() versus lazy_load(), and follow a step‑by‑step Python example that demonstrates loading, inspecting, and optionally processing documents with an LLM.

Data IntegrationDocument LoaderLLM
0 likes · 12 min read
How to Load Documents into LangChain: From Files to APIs
DataFunTalk
DataFunTalk
Aug 26, 2025 · Artificial Intelligence

Exploring Cutting-Edge AI & Knowledge Graph Applications: A Curated Resource Guide

This resource guide presents a curated list of cutting‑edge topics—including multimodal GraphRAG, knowledge‑graph‑driven large‑model applications in finance, traditional Chinese medicine, automotive manufacturing, and knowledge‑management trends—offering insights into AI‑powered knowledge services, and invites readers to scan the QR code to download the full e‑book.

AIData IntegrationKnowledge Graph
0 likes · 2 min read
Exploring Cutting-Edge AI & Knowledge Graph Applications: A Curated Resource Guide
360 Tech Engineering
360 Tech Engineering
Aug 12, 2025 · Artificial Intelligence

How Knowledge Graphs Are Reinventing AI Security: Insights from ISC.AI 2025

At the 13th ISC.AI 2025 Knowledge Graphs Reshaping Intelligent Security Summit in Beijing, leading experts from academia and industry highlighted how knowledge graphs enhance AI model accuracy, explainability, and trust, offering comprehensive data integration and risk monitoring to fortify intelligent systems across sectors.

Data IntegrationKnowledge Graphrisk monitoring
0 likes · 6 min read
How Knowledge Graphs Are Reinventing AI Security: Insights from ISC.AI 2025
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Jul 22, 2025 · Big Data

How Apache SeaTunnel Revolutionizes Heterogeneous Data Integration with Decoupled Connectors

This article explores how Apache SeaTunnel addresses modern data integration challenges by providing a high‑performance, distributed, plugin‑based platform that decouples connectors from execution engines, enabling seamless batch and streaming synchronization across heterogeneous sources such as databases, message queues, and data lakes.

Apache SeaTunnelBatch ProcessingConnector Architecture
0 likes · 24 min read
How Apache SeaTunnel Revolutionizes Heterogeneous Data Integration with Decoupled Connectors
Code Ape Tech Column
Code Ape Tech Column
Jul 8, 2025 · Backend Development

Mastering Spring Batch: Real-World Use Cases and Hands‑On Guide

This comprehensive guide explains why batch processing is essential, walks through typical banking, e‑commerce, logging and medical data scenarios, details Spring Batch's core architecture and components, provides step‑by‑step setup and code examples, and presents a production‑grade bank reconciliation case with monitoring and troubleshooting tips.

Batch ProcessingData IntegrationJob Scheduling
0 likes · 27 min read
Mastering Spring Batch: Real-World Use Cases and Hands‑On Guide
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 13, 2025 · Artificial Intelligence

Accelerate Enterprise Data Insights with Alibaba Cloud Hologres and AI Agents

Learn how to rapidly build an intelligent data analysis agent by integrating multi‑source data through Alibaba Cloud Hologres, leveraging Bailei’s AI model service and the serverless Function AI platform, covering architecture, step‑by‑step deployment, verification, and resource cleanup for cost‑effective, real‑time business insights.

AIAlibaba CloudData Integration
0 likes · 8 min read
Accelerate Enterprise Data Insights with Alibaba Cloud Hologres and AI Agents
Java Captain
Java Captain
Jun 10, 2025 · Backend Development

Why Spring Batch? Real‑World Scenarios, Core Architecture and Hands‑On Guide

This article explains the necessity of batch processing, presents typical use cases such as daily interest calculation, e‑commerce order archiving, log analysis and medical data migration, then dives deep into Spring Batch's core components, provides step‑by‑step code examples, performance‑tuning tips, production‑grade fault‑tolerance, monitoring solutions and a comprehensive FAQ.

Batch ProcessingData IntegrationJava
0 likes · 20 min read
Why Spring Batch? Real‑World Scenarios, Core Architecture and Hands‑On Guide
DataFunSummit
DataFunSummit
Jun 2, 2025 · Artificial Intelligence

Enterprise Knowledge Brain Powered by Large Models and Knowledge Graphs

This article explains how the rapid development of large language models and knowledge graph technologies creates new opportunities for enterprise knowledge management, outlines the challenges of massive unstructured data, describes the architecture and core data flow of a corporate knowledge brain, and showcases key technologies and real‑world applications.

AI ArchitectureData IntegrationEnterprise AI
0 likes · 13 min read
Enterprise Knowledge Brain Powered by Large Models and Knowledge Graphs
Data Thinking Notes
Data Thinking Notes
May 5, 2025 · Artificial Intelligence

How MCP’s Text2SQL Service Turns Natural Language into Powerful Database Queries

This article explores the MCP platform’s data service capabilities, detailing its core components—Resources, Prompts, and Tools—and demonstrates how its Text2SQL feature enables natural‑language queries to retrieve table schemas, perform data sampling, and execute complex relational analyses across multiple database tables.

AIData IntegrationLLM
0 likes · 7 min read
How MCP’s Text2SQL Service Turns Natural Language into Powerful Database Queries
Big Data Tech Team
Big Data Tech Team
Apr 21, 2025 · Industry Insights

8 Practical Ways DeepSeek Boosts Data Quality for Better Governance

This guide outlines eight concrete methods DeepSeek uses to improve data quality—including automated cleaning, validation, classification, monitoring, governance standards, anomaly detection, integration, and intelligent analysis—providing actionable steps for organizations to enhance data accuracy, completeness, consistency, and usability.

Data IntegrationData QualityDeepSeek
0 likes · 5 min read
8 Practical Ways DeepSeek Boosts Data Quality for Better Governance
DataFunSummit
DataFunSummit
Apr 1, 2025 · Big Data

Understanding Flink CDC 3.3: Features, Improvements, and Future Plans

This article provides a comprehensive overview of Flink CDC 3.3, detailing its CDC fundamentals, new connectors, Transform module enhancements, asynchronous snapshot splitting, community adoption, and upcoming roadmap for broader ecosystem support and batch‑mode execution.

Big DataCDCChange Data Capture
0 likes · 15 min read
Understanding Flink CDC 3.3: Features, Improvements, and Future Plans
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Mar 20, 2025 · Big Data

How to Read and Write StarRocks Data with EMR Serverless Spark

This step‑by‑step guide explains how to use EMR Serverless Spark together with the StarRocks Spark Connector to create a workspace, upload the connector JAR, configure network connections, create databases and tables in StarRocks, and perform read/write operations via SQL sessions, Notebook sessions, or batch Spark jobs, complete with code examples and UI screenshots.

Big DataData IntegrationSpark
0 likes · 14 min read
How to Read and Write StarRocks Data with EMR Serverless Spark
AI Product Manager Community
AI Product Manager Community
Feb 17, 2025 · Product Management

How AI Can Transform Your Product Roadmap into a Real‑Time Strategic Tool

In today’s fast‑changing market, traditional product planning falls short, so this article explains how AI‑powered data integration, predictive analytics, and dynamic feedback loops can create a real‑time, data‑driven product roadmap, detailing three implementation phases—data unification, intelligent analysis, and continuous adjustment—with practical steps for product managers.

AIData IntegrationRoadmap
0 likes · 8 min read
How AI Can Transform Your Product Roadmap into a Real‑Time Strategic Tool
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jan 27, 2025 · Big Data

Unlock Real-Time Data Sync with Flink CDC: YAML Integration, Transform & Route Explained

This article summarizes an advanced Flink CDC presentation, covering Flink CDC fundamentals, real‑time Flink integration, CDC‑YAML core capabilities, supported sync links, Transform and Route modules, monitoring metrics, schema‑change strategies, typical use cases, performance optimizations, demo implementations, and future development plans.

CDCData IntegrationFlink
0 likes · 20 min read
Unlock Real-Time Data Sync with Flink CDC: YAML Integration, Transform & Route Explained
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jan 23, 2025 · Big Data

How Alibaba Cloud DataWorks Leverages Flink CDC for Scalable Data Lake Integration

Alibaba Cloud DataWorks’ Data Integration platform, built on Flink CDC, offers a comprehensive, serverless solution for real‑time and batch data lake ingestion, detailing its architecture, elastic scaling, productized use cases, and future roadmap, including AI‑driven diagnostics and expanded source support.

Big DataData IntegrationData Lake
0 likes · 12 min read
How Alibaba Cloud DataWorks Leverages Flink CDC for Scalable Data Lake Integration
Bilibili Tech
Bilibili Tech
Nov 26, 2024 · Big Data

Bilibili’s Iceberg‑Based Streaming‑Batch Integration: Architecture, Optimizations, and Practices

Bilibili migrated its massive user‑behavior, commercial AI training, and database synchronization pipelines from Hive and Kafka to an Iceberg‑based streaming‑batch architecture, using Flink and the Magnus optimizer to achieve minute‑level freshness, reduce CPU and memory usage by about 20‑22 %, save roughly 3.55 M CNY annually, and dramatically improve query latency and join performance.

BatchData IntegrationData Lake
0 likes · 20 min read
Bilibili’s Iceberg‑Based Streaming‑Batch Integration: Architecture, Optimizations, and Practices
Data Thinking Notes
Data Thinking Notes
Nov 5, 2024 · Big Data

How a Next‑Gen Data Management Platform Boosts Efficiency and Innovation

This article outlines the motivations, objectives, and architectural design of a next‑generation data management platform, detailing its four‑layer “four‑ization” approach, core services such as data integration, modeling, API provisioning, componentization, as well as governance, security, and operational best practices.

Big DataData GovernanceData Integration
0 likes · 20 min read
How a Next‑Gen Data Management Platform Boosts Efficiency and Innovation

How Apache SeaTunnel Redefines Data Integration for Modern Data Platforms

This article reviews the evolution of data‑integration architectures toward EtLT, explains the core capabilities of Apache SeaTunnel, and details how a Chinese data‑platform vendor applied and extended SeaTunnel to simplify batch and streaming ingestion, unify multi‑engine processing, and reduce development and operational costs.

Apache SeaTunnelBig DataConnector Development
0 likes · 17 min read
How Apache SeaTunnel Redefines Data Integration for Modern Data Platforms
DataFunSummit
DataFunSummit
Nov 1, 2024 · Big Data

DataFun Summit Session Overview and E‑book Access Instructions

The article outlines how to obtain the DataFun Summit e‑book by following the public account instructions and provides concise English summaries of twelve technical sessions covering data lineage, integration, AI language models, multimodal content, game AI agents, lake‑warehouse governance, big‑data architecture, and cluster management.

AIBig DataData Integration
0 likes · 5 min read
DataFun Summit Session Overview and E‑book Access Instructions
DataFunSummit
DataFunSummit
Oct 27, 2024 · Artificial Intelligence

How Siemens Harnesses Generative AI to Build the Enterprise Knowledge Chatbot “XiaoYu”

This article describes Siemens' journey in applying generative AI and Retrieval‑Augmented Generation to create an internal knowledge chatbot, detailing the business challenges, technical architecture, data integration, multi‑modal capabilities, deployment outcomes, and strategic lessons for enterprise AI adoption.

AI chatbotData IntegrationEnterprise Knowledge Management
0 likes · 21 min read
How Siemens Harnesses Generative AI to Build the Enterprise Knowledge Chatbot “XiaoYu”
macrozheng
macrozheng
Sep 27, 2024 · Big Data

Master DataX: Efficient Offline Data Sync for Heterogeneous Sources

This guide walks through the challenges of synchronizing massive datasets across heterogeneous databases, introduces Alibaba's open‑source DataX tool, explains its framework‑plugin architecture, and provides step‑by‑step instructions—including environment setup, installation, job configuration, and both full and incremental MySQL synchronization—complete with code examples and performance metrics.

Big DataData IntegrationDataX
0 likes · 15 min read
Master DataX: Efficient Offline Data Sync for Heterogeneous Sources
Data Thinking Notes
Data Thinking Notes
Sep 9, 2024 · Fundamentals

Master the 6‑Step Blueprint for Building an Enterprise Data Middle Platform

This guide outlines a practical six‑step methodology—covering overall planning, data integration, model construction, data development, asset management, and data services—to help enterprises build a robust data middle platform that unlocks business value and supports agile digital transformation.

Data GovernanceData IntegrationData Platform
0 likes · 10 min read
Master the 6‑Step Blueprint for Building an Enterprise Data Middle Platform
Ops Development & AI Practice
Ops Development & AI Practice
Aug 7, 2024 · Artificial Intelligence

How ChatGPT’s New JSON Output Transforms AI Integration

This article examines OpenAI's recent ChatGPT API update that adds JSON‑formatted responses, detailing the technical background, implementation steps, example requests and responses, and the broader impact on developers, enterprises, and future AI applications.

APIChatGPTData Integration
0 likes · 10 min read
How ChatGPT’s New JSON Output Transforms AI Integration
Data Thinking Notes
Data Thinking Notes
Jul 29, 2024 · Big Data

What Is a Data Middle Platform and How Does It Transform Enterprise Data Management?

This article explains the concept, design principles, and core components of a data middle platform, detailing its overall, functional, layered, logical, and data architectures, as well as the specific platforms for data collection, processing, organization, governance, quality, sharing, and visualization, illustrated with diagrams.

Big DataData ArchitectureData Governance
0 likes · 27 min read
What Is a Data Middle Platform and How Does It Transform Enterprise Data Management?
DaTaobao Tech
DaTaobao Tech
Jul 8, 2024 · Big Data

ODPS (MaxCompute) SQL Basics, Data Integration and Hologres Import Guide

This guide provides a comprehensive, beginner‑to‑advanced reference for ODPS (MaxCompute) SQL, covering table creation, DDL/DML commands, query syntax, join hints, MySQL‑to‑ODPS synchronization, one‑click and custom imports into Hologres, and scheduling variables for automated data pipelines.

Data IntegrationETLHologres
0 likes · 37 min read
ODPS (MaxCompute) SQL Basics, Data Integration and Hologres Import Guide
DataFunSummit
DataFunSummit
Jun 14, 2024 · Big Data

JD Logistics One‑Stop Agile BI Solution: Architecture, Challenges, and Product Evolution

This article presents JD Logistics' one‑stop agile BI platform, detailing the complex data sources, rapid business demands, the UData solution architecture, performance and usability improvements, and future upgrade plans that together enable faster data integration, self‑service reporting, and enhanced decision‑making across the organization.

Agile AnalyticsBIBig Data
0 likes · 25 min read
JD Logistics One‑Stop Agile BI Solution: Architecture, Challenges, and Product Evolution
DataFunTalk
DataFunTalk
May 13, 2024 · Big Data

Data Integration Maturity Model: From ETL to EtLT

The article examines the evolution of data integration architectures—from traditional ETL through ELT to the emerging EtLT model—highlighting their advantages, disadvantages, industry trends, maturity stages, and practical guidance for enterprises and professionals navigating modern big‑data pipelines.

Big DataData IntegrationDataOps
0 likes · 31 min read
Data Integration Maturity Model: From ETL to EtLT
DataFunTalk
DataFunTalk
May 8, 2024 · Big Data

Risk Control and Data Application in the Bulk Commodity Industry: Challenges, Solutions, and Core Capabilities

The article presents Ant Group's exploration of applying its data‑driven risk control and credit assessment capabilities to the traditional bulk commodity sector, detailing industry background, data pain points, core technical solutions, and the construction of a secure, explainable data‑model platform for digital transformation.

AIBig DataBulk Industry
0 likes · 13 min read
Risk Control and Data Application in the Bulk Commodity Industry: Challenges, Solutions, and Core Capabilities
21CTO
21CTO
Apr 28, 2024 · Artificial Intelligence

5 Transformative Business Use Cases for Conversational AI

This article explores how conversational AI, powered by large language models, is reshaping enterprise operations across five key scenarios—from customer support assistants and AI‑driven data interfaces to HR bots, unstructured data processing, and multi‑agent digital assistants—highlighting benefits, implementation considerations, and privacy challenges.

Conversational AIData Integrationbusiness applications
0 likes · 13 min read
5 Transformative Business Use Cases for Conversational AI
Data Thinking Notes
Data Thinking Notes
Apr 9, 2024 · Big Data

What Is a Data Middle Platform and Why It’s Essential for Modern Enterprises

Data middle platforms transform raw enterprise data into reusable assets by integrating collection, storage, processing, governance, and service layers, enabling faster deployment, consistent metrics, improved data quality, and business value across digital transformation, while addressing challenges like siloed data, low efficiency, and inconsistent standards.

Big DataData GovernanceData Integration
0 likes · 23 min read
What Is a Data Middle Platform and Why It’s Essential for Modern Enterprises
DataFunSummit
DataFunSummit
Apr 7, 2024 · Big Data

Li Auto’s Flink on Kubernetes Data Integration Practice

This article presents Li Auto’s end‑to‑end data integration journey, detailing the evolution of its data platform, the challenges of heterogeneous sources, and how a unified Flink‑on‑K8s solution with cloud‑native architecture, operator management, monitoring, and checkpointing addresses batch‑stream convergence and future scalability.

Batch ProcessingBig DataData Integration
0 likes · 12 min read
Li Auto’s Flink on Kubernetes Data Integration Practice
DataFunTalk
DataFunTalk
Mar 1, 2024 · Big Data

Understanding Data Fabric and Data Virtualization: Concepts, Practices, and Real‑World Case Study

This article explains the fundamentals of Data Fabric and data virtualization, highlights the limitations of traditional centralized data warehouses, describes the three‑layer virtualization architecture, and presents a detailed securities‑industry case study that demonstrates cost, efficiency, and compliance benefits.

Big DataData FabricData Integration
0 likes · 17 min read
Understanding Data Fabric and Data Virtualization: Concepts, Practices, and Real‑World Case Study
DataFunTalk
DataFunTalk
Feb 23, 2024 · Artificial Intelligence

Challenges and Opportunities in Applying Large‑Model AI to Healthcare

The article analyzes how large‑model medical AI is rapidly adopted yet struggles with implementation due to doctor shortages, behavioral resistance, data silos, safety regulations, and the need for strategic alignment, while contrasting the more supportive innovation ecosystem in the United States.

AI adoptionData IntegrationHealthcare Innovation
0 likes · 6 min read
Challenges and Opportunities in Applying Large‑Model AI to Healthcare
DataFunSummit
DataFunSummit
Feb 20, 2024 · Big Data

BitSail Open‑Source Data Integration Engine: Architecture, New Features, CDC Solutions and Future Outlook

This article introduces ByteDance's open‑source data integration engine BitSail, covering its background, layered architecture, recent feature enhancements, automated testing framework, CDC‑based full‑library synchronization solutions, and future development plans for connectors and real‑time data consistency.

Big DataCDCData Integration
0 likes · 12 min read
BitSail Open‑Source Data Integration Engine: Architecture, New Features, CDC Solutions and Future Outlook
DataFunTalk
DataFunTalk
Feb 17, 2024 · Big Data

JD Logistics One‑Stop Agile BI Solution: Architecture, Challenges, and Optimization

This article presents JD Logistics' one‑stop agile BI platform, detailing the complex data sources, rapid requirement changes, and Chinese‑style reporting challenges it addresses, while outlining the UData solution, product methodology, performance enhancements, and real‑world case studies that demonstrate significant efficiency gains.

Agile AnalyticsBIBig Data
0 likes · 26 min read
JD Logistics One‑Stop Agile BI Solution: Architecture, Challenges, and Optimization
DataFunSummit
DataFunSummit
Feb 5, 2024 · Artificial Intelligence

Ant Group's Knowledge Graph: Overview, Construction, Applications, and Integration with Large Models

Ant Group shares its comprehensive knowledge graph initiatives, detailing the fundamentals, construction pipeline, fusion techniques, cognitive representations, diverse business applications, and the emerging synergy between knowledge graphs and large language models, illustrating how graph-based AI enhances accuracy, interpretability, and downstream services.

Data IntegrationGraph FusionKnowledge Graph
0 likes · 14 min read
Ant Group's Knowledge Graph: Overview, Construction, Applications, and Integration with Large Models
DataFunTalk
DataFunTalk
Jan 29, 2024 · Big Data

Case Study: Deploying RisingWave for Real-Time Stream Processing in a Large-Scale Quantitative Hedge Fund

An ultra‑large hedge fund with over $10 billion AUM replaced ksqlDB and Flink with RisingWave, leveraging its PostgreSQL‑compatible streaming SQL to achieve sub‑10 ms latency, lower learning and operational costs, rich connectors, advanced operators, and comprehensive observability for real‑time trade data processing.

Data IntegrationLow latencyQuantitative Trading
0 likes · 10 min read
Case Study: Deploying RisingWave for Real-Time Stream Processing in a Large-Scale Quantitative Hedge Fund
NetEase LeiHuo UX Big Data Technology
NetEase LeiHuo UX Big Data Technology
Jan 9, 2024 · Artificial Intelligence

Accelerating Recommendation System Development with MindsDB

The article explains how the data team adopted the open‑source machine‑learning platform MindsDB to simplify data integration, enable SQL‑based model training and inference, manage model versions, and dramatically shorten recommendation system development cycles, achieving up to 30% efficiency gains.

Data IntegrationMindsDBModel Management
0 likes · 5 min read
Accelerating Recommendation System Development with MindsDB
Alibaba Cloud Native
Alibaba Cloud Native
Dec 28, 2023 · Cloud Computing

How to Set Up No‑Code Data Dump from Alibaba Cloud Kafka to OSS

This guide explains how to use Alibaba Cloud Message Queue Kafka's no‑code, fully managed, serverless dump feature to transfer data to OSS, covering its benefits, typical scenarios, required prerequisites, step‑by‑step configuration, testing, and verification of the resulting objects.

Alibaba CloudData IntegrationKafka
0 likes · 9 min read
How to Set Up No‑Code Data Dump from Alibaba Cloud Kafka to OSS
Sohu Tech Products
Sohu Tech Products
Dec 27, 2023 · Big Data

Practical Implementation of Data Integration with Flink on Kubernetes at Li Auto

Li Auto built a cloud‑native data‑integration platform by deploying Flink on Kubernetes, unifying batch and streaming workloads with a storage layer (JuiceFS + BOS) and Flink Operator, enabling simple source‑sink pipelines, elastic scaling, automated checkpointing, and centralized monitoring while addressing earlier fragmentation and resource inefficiencies.

Big DataCloud NativeData Integration
0 likes · 11 min read
Practical Implementation of Data Integration with Flink on Kubernetes at Li Auto
DataFunTalk
DataFunTalk
Dec 22, 2023 · Big Data

Practical Implementation of Flink on Kubernetes for Data Integration at Li Auto

This article details Li Auto's end‑to‑end data integration practice using Flink on Kubernetes, covering the evolution of their integration platform, architectural design, cloud‑native deployment, operational challenges, and future roadmap, while highlighting unified batch‑stream processing and resource elasticity.

Batch ProcessingBig DataCloud Native
0 likes · 12 min read
Practical Implementation of Flink on Kubernetes for Data Integration at Li Auto
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Dec 12, 2023 · Databases

Master Database Migration to Cloud: Challenges & Solutions with Baidu DTS

This article examines the rapid growth of China's database market, the technical hurdles of moving databases to public cloud—including engine selection, lengthy migration processes, efficiency, disaster recovery, and data consistency—and explains how Baidu Intelligent Cloud's DTS platform offers a smooth, reliable, high‑availability, and high‑performance one‑stop solution with real‑world use cases.

Baidu CloudCloud DatabasesDTS
0 likes · 25 min read
Master Database Migration to Cloud: Challenges & Solutions with Baidu DTS
Data Thinking Notes
Data Thinking Notes
Dec 5, 2023 · Big Data

How to Overcome Data Governance Challenges and Unlock Business Value

Enterprises face significant hurdles in data governance and integration, from siloed systems and unclear responsibilities to poor data quality, but by establishing clear rules, fostering user department engagement, and aligning governance with business-driven data applications, they can create a cohesive data asset management framework that drives value.

Big DataData AssetsData Governance
0 likes · 10 min read
How to Overcome Data Governance Challenges and Unlock Business Value
Alibaba Cloud Native
Alibaba Cloud Native
Nov 23, 2023 · Cloud Native

How CDC + Serverless Functions Enable Real‑Time ETL in Cloud Native Architectures

This article explains how Alibaba Cloud's Serverless Function Compute combined with Database Change Data Capture (CDC) creates a complete, real‑time ETL pipeline, detailing the ETL model, DTS integration, architecture components, event‑driven processing, and practical use cases such as OLTP‑to‑OLAP data flow.

Alibaba CloudCDCData Integration
0 likes · 10 min read
How CDC + Serverless Functions Enable Real‑Time ETL in Cloud Native Architectures
DataFunSummit
DataFunSummit
Oct 24, 2023 · Big Data

Practices of Data Fabric in Data Integration Scenarios

The presentation by Aloudata Vice President Yu Jun introduces his extensive background in large‑scale internet and big‑data platforms and outlines how Data Fabric and data virtualization can be applied to data integration, highlighting the differences from traditional solutions and the business value of logical data warehouses.

Big DataData FabricData Integration
0 likes · 2 min read
Practices of Data Fabric in Data Integration Scenarios
DataFunTalk
DataFunTalk
Sep 30, 2023 · Big Data

Building a Marketing‑Oriented Data Middle Platform: Concepts and Practices

This article outlines how a marketing‑focused data middle platform can be constructed by integrating online and offline behavior data, business data, and third‑party sources, then applying data integration, modeling, processing, and application capabilities to enable data‑driven user journeys and personalized marketing strategies.

Big DataData Integrationdata modeling
0 likes · 13 min read
Building a Marketing‑Oriented Data Middle Platform: Concepts and Practices
Java High-Performance Architecture
Java High-Performance Architecture
Sep 28, 2023 · Databases

How to Use Debezium for MySQL CDC in Spring Boot Without Adding Extra Middleware

Learn how to capture MySQL data changes using Debezium's CDC capabilities within a Spring Boot application, avoiding heavyweight message brokers by leveraging binlog monitoring, configuring connectors, handling snapshots, and processing change events for use cases like cache invalidation, data integration, and simplifying monolithic architectures.

CDCData IntegrationDebezium
0 likes · 24 min read
How to Use Debezium for MySQL CDC in Spring Boot Without Adding Extra Middleware
Architects Research Society
Architects Research Society
Sep 27, 2023 · Fundamentals

What Is the Common Data Model and Why Use It?

The Common Data Model provides a shared, standardized data language and metadata system that simplifies cross‑application data integration, reduces custom development effort, and enables consistent, extensible data structures for business and analytics scenarios across Microsoft Power Platform and Azure services.

Common Data ModelData IntegrationEnterprise Data
0 likes · 8 min read
What Is the Common Data Model and Why Use It?
DataFunSummit
DataFunSummit
Sep 8, 2023 · Big Data

Tianqiong OLAP Real‑time Lakehouse Fusion Platform Architecture Practice

This article explains why lake‑warehouse fusion is needed, describes the challenges of integrating real‑time data warehouses with data lakes, introduces a new StarRocks‑based architecture that supports real‑time ingestion, cooling, offline loading, and adaptive hot‑cold query rewriting, and outlines future plans and Q&A.

Big DataData IntegrationData Warehouse
0 likes · 21 min read
Tianqiong OLAP Real‑time Lakehouse Fusion Platform Architecture Practice
DataFunSummit
DataFunSummit
Aug 13, 2023 · Big Data

KwaiBI: Evolution of Kuaishou’s One‑Stop Business Intelligence Platform from 1.0 to 2.0

The article details Kuaishou’s KwaiBI business intelligence platform evolution, covering its 1.0 tool‑based implementation, the 2.0 standardized architecture built on an indicator middle‑platform, core processes, data integration, self‑service features, and future directions for self‑service and intelligent analytics.

BIBig DataData Integration
0 likes · 22 min read
KwaiBI: Evolution of Kuaishou’s One‑Stop Business Intelligence Platform from 1.0 to 2.0
DataFunSummit
DataFunSummit
Aug 10, 2023 · Databases

ClickHouse Deployment in Lenovo Manufacturing: Architecture, Data Integration, and Performance Optimization

This article details Lenovo's implementation of ClickHouse in a manufacturing environment, covering the current data landscape, cluster architecture, integration challenges, performance enhancements, and solutions such as Seatunnel and query pre‑aggregation, illustrating how OLAP engines can address real‑time analytics and concurrency issues in production data pipelines.

ClickHouseData IntegrationManufacturing
0 likes · 11 min read
ClickHouse Deployment in Lenovo Manufacturing: Architecture, Data Integration, and Performance Optimization
Data Thinking Notes
Data Thinking Notes
Aug 2, 2023 · Fundamentals

Mastering Enterprise Data: A Practical Guide to Master Data Management

This article explains why fragmented data hampers business insight in large enterprises and provides a comprehensive overview of master data concepts, governance structures, standards, processes, and step‑by‑step implementation practices to achieve consistent, high‑quality enterprise data.

Data GovernanceData IntegrationEnterprise Data
0 likes · 18 min read
Mastering Enterprise Data: A Practical Guide to Master Data Management
Architects Research Society
Architects Research Society
Aug 2, 2023 · Fundamentals

Data Fabric Architecture: Three Patterns, Core Technical Components, and Inherent Limitations

The article explains data fabric architecture as a promising approach for enabling data exchange across distributed systems, outlines its three design patterns, describes key technical components such as data virtualization, data catalog, and knowledge graphs, and discusses the trade‑offs, costs, and limitations that organizations must consider.

Data CatalogData FabricData Integration
0 likes · 17 min read
Data Fabric Architecture: Three Patterns, Core Technical Components, and Inherent Limitations
Didi Tech
Didi Tech
Jul 31, 2023 · Big Data

Data Serviceization at Didi: Architecture, Phases, and Standard Metric Service

Didi’s data serviceization converts raw business data into consumable services through a four‑stage pipeline—integration, development, production, and back‑flow—while the Data Dream Factory and Shu‑Chain platform automate synchronization, provide a unified access gateway for thousands of APIs, and introduce a standard metric service that abstracts storage complexities and ensures high‑performance, secure data delivery.

Data IntegrationData Platformdata serviceization
0 likes · 16 min read
Data Serviceization at Didi: Architecture, Phases, and Standard Metric Service
Inke Technology
Inke Technology
Jun 28, 2023 · Big Data

Extending Apache Seatunnel for Trino and Kyuubi Integration: A Practical Guide

This article outlines the challenges of scaling data integration platforms, proposes a comprehensive solution using Apache Seatunnel and Dinky, details the implementation of Trino and Kyuubi JDBC support, and describes the platform's architecture, task publishing workflow, logging, monitoring, resource management, and future enhancements.

Apache SeaTunnelData IntegrationKyuubi
0 likes · 16 min read
Extending Apache Seatunnel for Trino and Kyuubi Integration: A Practical Guide
Architects Research Society
Architects Research Society
Jun 21, 2023 · Fundamentals

The Strategic Role of Enterprise Architects: Five Strategic and One Tactical Focus Areas

Enterprise architects align IT strategy with business goals by overseeing application portfolio management, technology and risk, IT operations, security and privacy, integration and data, and finance, defining roadmaps for 1‑3‑5 year plans while balancing strategic and tactical responsibilities in a rapidly changing environment.

Application PortfolioData IntegrationIT Operations
0 likes · 6 min read
The Strategic Role of Enterprise Architects: Five Strategic and One Tactical Focus Areas
21CTO
21CTO
Jun 20, 2023 · Fundamentals

ETL vs ELT: Which Data Integration Method Wins for Your Business?

ETL extracts, transforms, then loads data, while ELT extracts, loads, and transforms later, each offering distinct advantages; the article compares their processes, key differences, and factors such as data volume, complexity, latency, and cost to help businesses choose the optimal integration approach.

Data IntegrationData WarehousingELT
0 likes · 12 min read
ETL vs ELT: Which Data Integration Method Wins for Your Business?
360 Tech Engineering
360 Tech Engineering
Jun 2, 2023 · Big Data

Overcoming Challenges in User Profiling: A Big Data‑Driven Framework for Precise Marketing

The article outlines how a unified, big‑data‑based user profiling platform addresses traditional data silos, high costs, and limited functionality by standardizing tags, integrating Spark and RHadoop processing, and enabling a closed‑loop marketing workflow that improves accuracy and operational efficiency.

Big DataData IntegrationMarketing Automation
0 likes · 7 min read
Overcoming Challenges in User Profiling: A Big Data‑Driven Framework for Precise Marketing
StarRocks
StarRocks
May 26, 2023 · Big Data

How SeaTunnel’s StarRocks Connector Enables High‑Performance Data Sync

This article explains SeaTunnel’s architecture and its StarRocks connector, detailing source and sink features such as field projection, predicate push‑down, parallel reading, state recovery, data type mapping, Stream Load writes, CDC support, configuration examples, and future roadmap for exactly‑once semantics.

Big DataConnectorData Integration
0 likes · 16 min read
How SeaTunnel’s StarRocks Connector Enables High‑Performance Data Sync
Top Architect
Top Architect
May 4, 2023 · Big Data

Data Middle Platform: General Architecture and Core Components

The article explains the concept, benefits, and detailed modular architecture of a data middle platform, covering data storage, acquisition, processing, governance, security, and operation frameworks, and illustrates how enterprises can build and evolve such platforms to turn data into valuable services.

Big DataData ArchitectureData Governance
0 likes · 19 min read
Data Middle Platform: General Architecture and Core Components
ITPUB
ITPUB
Apr 26, 2023 · Databases

Mastering Change Data Capture: Open‑Source Tools and How to Choose the Right One

This article explains the concept of Change Data Capture (CDC), outlines its common use cases, compares the main technical approaches—including timestamps, data diff, triggers, and log‑based methods—and reviews popular open‑source CDC solutions and their database‑specific configuration requirements.

CDCChange Data CaptureData Integration
0 likes · 15 min read
Mastering Change Data Capture: Open‑Source Tools and How to Choose the Right One
ITPUB
ITPUB
Apr 25, 2023 · Big Data

Top 8 Open‑Source ETL Tools for Data Migration and Integration

This article reviews eight widely used ETL and data‑migration tools—including Kettle, DataX, DataPipeline, Talend, DataStage, Sqoop, FineDataLink, and Canal—detailing their core features, architectures, supported data sources, and typical usage scenarios to help practitioners choose the right solution.

Big DataData IntegrationData Migration
0 likes · 13 min read
Top 8 Open‑Source ETL Tools for Data Migration and Integration
DataFunSummit
DataFunSummit
Apr 9, 2023 · Big Data

Expert Interview: Architecture and Trends of Big Data Platforms

This article presents a comprehensive interview with several big‑data platform experts, outlining the core components such as data integration, storage and computation, distributed scheduling, and query analysis, while also highlighting current challenges, best‑practice tools, and future trends in big‑data architecture.

Big DataData IntegrationOLAP
0 likes · 10 min read
Expert Interview: Architecture and Trends of Big Data Platforms
DataFunTalk
DataFunTalk
Apr 4, 2023 · Big Data

Upgrading Hangzhou Bank Consumer Finance Big Data Platform with Apache Doris 1.2: Architecture, Performance Gains, and Integration

This article details how Hangzhou Bank Consumer Finance modernized its big‑data platform by introducing Apache Doris 1.2, replacing the original Greenplum + CDH architecture, unifying data sources via Multi‑Catalog, achieving second‑level query latency, reducing storage and compute costs, and outlining the integration workflow with DolphinScheduler, SeaTunnel, and Spark.

Apache DorisBig DataData Integration
0 likes · 20 min read
Upgrading Hangzhou Bank Consumer Finance Big Data Platform with Apache Doris 1.2: Architecture, Performance Gains, and Integration
HomeTech
HomeTech
Mar 31, 2023 · Artificial Intelligence

Digital Transformation of Used‑Car Buying: Integrated Data, AI Valuation, and VR Visualization

The article describes how a comprehensive digital platform combines structured, semi‑structured, and panoramic data with machine‑learning valuation models, natural‑language processing, and VR technology to make used‑car condition information transparent, improve estimation accuracy, and enhance user decision‑making in the Chinese second‑hand car market.

AI valuationBig DataData Integration
0 likes · 15 min read
Digital Transformation of Used‑Car Buying: Integrated Data, AI Valuation, and VR Visualization