Tagged articles
193 articles
Architects Research Society
Dec 21, 2021 · Fundamentals

Next-Generation Master Data Management (MDM): Architecture, Business Value, and Technical Challenges

This article explains master data management concepts, regulatory drivers, business benefits, key technical challenges, architectural trends such as graph databases and machine learning, and highlights leading vendors, providing a comprehensive overview for enterprises seeking modern MDM solutions.

Analytics · Big Data · Data Governance
0 likes · 9 min read
政采云技术
Dec 16, 2021 · Big Data

What Is Event Tracking (埋点) and Its Implementation in a Data Analysis System

This article explains the concept of event tracking (埋点), its importance for capturing user behavior, outlines the four‑module architecture of a tracking system, compares code‑based, visual and full tracking methods, describes data models, storage, management, and presents a practical case study with analysis techniques.

Analytics · Backend · Big Data
0 likes · 15 min read
High Availability Architecture
Oct 25, 2021 · Big Data

iQIYI Data Governance Practices: Event Tracking (Pingback) Governance and Application

The article details iQIYI's comprehensive data governance initiative for event tracking (Pingback), covering definitions, timing, quality requirements, governance challenges, standardized specifications, coordinate management, testing and gray‑release processes, upgrade workflows, and data security measures that together reduced event volume by 40% and cut resource consumption in half.

Analytics · Big Data · Data Governance
0 likes · 16 min read
Alimama Tech
Oct 20, 2021 · Big Data

Designing Evaluation Metrics and Building an Overall Evaluation Index (OEC) for AB Testing

The article explains how to design experiment evaluation metrics—from top‑down objectives to core, quality, and observation types—and construct an Overall Evaluation Criterion by processing, weighting, and aggregating metrics, providing a robust, scalable framework for credible AB‑test assessment and product optimization.

AB testing · Analytics · Data Science
0 likes · 11 min read
iQIYI Technical Product Team
Oct 15, 2021 · Industry Insights

How iQIYI Streamlined Event Tracking: A Deep Dive into Data Governance

This article details iQIYI's comprehensive data‑governance practice for event tracking, covering the definition of pingback, the need for governance, the governance framework, coordinate management, gray‑data handling, and the upgrade process that reduced tracking volume by 40% while cutting resource consumption in half.

Analytics · Big Data · Data Governance
0 likes · 17 min read
DataFunTalk
Aug 14, 2021 · Databases

Evolution of OLAP Engines at Lenovo Liancheng Zhida and DorisDB Adoption

The article chronicles Lenovo Liancheng Zhida’s three‑stage evolution of OLAP engines—from early SQL Server scripts, through a Hadoop‑based Presto solution, to the adoption of DorisDB—detailing architecture, tool comparisons, implementation practices, and the performance and operational benefits achieved.

Analytics · Big Data · DorisDB
0 likes · 12 min read
DataFunTalk
Aug 5, 2021 · Big Data

Building a Unified High‑Performance OLAP Platform with DorisDB at Beike Real Estate

The article describes how Beike Real Estate consolidated multiple OLAP engines into a single DorisDB‑based platform, detailing the business challenges, DorisDB’s technical advantages, extensive performance and concurrency benchmarks, and the resulting improvements in stability, query speed, and operational simplicity across various business scenarios.

Analytics · Big Data · DorisDB
0 likes · 14 min read
dbaplus Community
Jul 8, 2021 · Databases

Why ClickHouse Outperforms Elasticsearch for Log Storage and Analytics

This article compares ClickHouse and Elasticsearch for API log storage, detailing development activity, schema handling, query performance, statistical functions, MySQL integration, new features, and practical drawbacks, while providing concrete SQL examples and migration tips.

Analytics · Elasticsearch · JSON
0 likes · 14 min read
DataFunTalk
Jul 7, 2021 · Big Data

Solving Data Island Challenges and Enabling Advanced OLAP Analysis on Heterogeneous Big Data Platforms – Kyligence Solution Overview

This article explains the growing analytical demands in the big‑data era, the limitations of traditional OLAP, and how Kyligence’s distributed OLAP engine addresses data‑island issues, multi‑dimensional and many‑to‑many analysis, unified security, and performance optimization with MDX on Spark, delivering a seamless Excel‑like experience.

Analytics · Big Data · Data Integration
0 likes · 9 min read
ITPUB
Jul 7, 2021 · Big Data

How NetEase Cloud Music Scaled Its Data Warehouse for Billion‑User Traffic

This article details NetEase Cloud Music's journey of redesigning its data warehouse and governance processes to support over a billion monthly active users, covering pain points, standardization, shared services, self‑service tools, and the resulting improvements in data quality, latency, and operational efficiency.

Analytics · Data Governance · Data Platform
0 likes · 19 min read
Architects Research Society
May 15, 2021 · Big Data

Data Warehouse vs Data Lake: Definitions, Differences, and Architectural Considerations

Data warehouses store structured data centrally for reporting and analysis, while data lakes retain raw data in various formats, offering flexible, low‑cost, schema‑on‑read processing; the article explains their definitions, key differences, common misconceptions, and why many organizations now combine both to enable self‑service big‑data analytics.

Analytics · Big Data · Data Architecture
0 likes · 21 min read
HelloTech
May 14, 2021 · Big Data

User Behavior Analysis System: Architecture, ClickHouse Cluster Deployment, and Analytical Techniques

The article describes a real‑time user behavior analysis platform built on a ClickHouse cluster, detailing its architecture, Hive‑to‑ClickHouse data ingestion with user‑ID routing, table designs for behavior and group data, and five analytical methods—event, funnel, path, retention, and attribution—leveraging shard‑level parallelism and custom functions for high efficiency.

Analytics · Big Data · clickhouse
0 likes · 20 min read
Architects Research Society
May 9, 2021 · Big Data

Data Lakes vs. Data Warehouses: Key Differences and Choosing the Right Approach

This article explains the fundamental distinctions between data lakes and data warehouses, outlines five critical differences—including data retention, type support, user support, adaptability, and insight speed—and offers guidance on selecting the appropriate solution based on organizational needs and technology options.

Analytics · Big Data · Data Architecture
0 likes · 12 min read
Big Data Technology & Architecture
Apr 22, 2021 · Big Data

Debunking Common Misconceptions About Data Lakes

This article debunks eight common misconceptions about data lakes, explains why they are not mutually exclusive with data warehouses, clarifies that they are not limited to Hadoop or raw data only, and provides practical tips for building flexible, secure, and business‑driven data lake solutions.

Analytics · Big Data · Cloud Services
0 likes · 21 min read
Tencent Cloud Developer
Mar 29, 2021 · Cloud Native

How Tencent Cloud’s Native Data Lake Redefines Big Data Analytics

This article examines the evolution of data lakes, outlines the challenges enterprises face with massive, heterogeneous data, and details Tencent Cloud’s native data lake architecture and its serverless Data Lake Compute service, highlighting performance, cost‑efficiency, and future development directions.

Analytics · Cloud Native · Data Lake
0 likes · 10 min read
iQIYI Technical Product Team
Mar 26, 2021 · Big Data

Evolution of iQIYI's Real-Time Big Data Ecosystem

iQIYI transformed its data infrastructure from a traditional offline T+1 model to a comprehensive real‑time ecosystem—leveraging Kafka, Flink, a three‑layer Stream Data Service Platform, the Talos drag‑and‑drop pipeline, and a Druid‑based analytics platform—to enable low‑latency monitoring, personalized recommendations, ad targeting, and continuous machine‑learning workflows while planning future stream‑batch integration and lake‑warehouse convergence.

Analytics · Big Data · Flink
0 likes · 13 min read
21CTO
Mar 2, 2021 · Big Data

How Suning’s Data Platform Unifies OLAP, Metrics, Visualization & Reporting

Suning’s Data Middle Platform integrates an accelerated OLAP engine, a star‑schema metric system, a visualization tool built on standardized dimensions, and a unified report portal to solve data silos, improve security, and enable enterprises to evolve into technology‑driven organizations.

Analytics · Big Data · Data Platform
0 likes · 3 min read
Big Data Technology & Architecture
Jan 28, 2021 · Big Data

Understanding Data Lakes: Definitions, Benefits, Architectures, and Technology Choices

Data lakes, emerging since 2020, are centralized repositories that store structured and unstructured data at any scale, offering flexible analytics, but require robust management to avoid becoming data swamps; this article explains definitions, advantages, typical architectures, and compares cloud and open‑source solutions such as AWS Lake Formation, Alibaba Cloud, Delta, Iceberg, and Hudi.

Analytics · Big Data · cloud storage
0 likes · 13 min read
Full-Stack Internet Architecture
Jan 21, 2021 · Operations

From Raw Decision Process to Data‑Driven Management: A Step‑by‑Step Guide

The article explains how organizations evolve from a simplistic, intuition‑based decision approach to a refined, data‑driven management cycle by introducing quantitative evaluation, the three‑stage decision model, PDCA loops, and modern data‑collection tools, while also highlighting common pitfalls that hinder effective data‑driven decision making.

Analytics · PDCA · data-driven decision
0 likes · 10 min read
Laiye Technology Team
Dec 18, 2020 · Big Data

Comprehensive Overview of Laiye Technology's Business Intelligence Ecosystem

This article provides a detailed, end‑to‑end description of Laiye Technology's BI ecosystem, covering its background, development stages, data acquisition, transmission, transformation, loading, modeling, storage layers, statistical analysis, real‑time metrics, visualization, and future challenges, illustrating how the company builds a scalable, cloud‑native data‑driven platform.

Analytics · BI · Big Data
0 likes · 22 min read
DataFunSummit
Nov 12, 2020 · Big Data

OLAP Engine Selection and Challenges in Large-Scale Data at Youku

This article explores the challenges big data brings to traditional data technologies and reviews various OLAP solutions—including MPP, batch processing, pre‑computation, and Hadoop‑based engines—while detailing Youku’s specific business scenarios and how different OLAP engines are selected to meet performance, scalability, and real‑time analysis requirements.

Analytics · Big Data · MPP
0 likes · 14 min read
DataFunTalk
Sep 24, 2020 · Databases

Understanding OLAP vs. OLTP and the Fundamentals of Data Warehousing

This article explains the core differences between OLTP and OLAP, evaluates whether traditional OLTP databases like MySQL can handle analytical workloads, introduces benchmark queries, and provides a comprehensive overview of data‑warehouse concepts such as data sources, fact and dimension tables, multi‑dimensional modeling, and common cube operations.

Analytics · HTAP · OLAP
0 likes · 21 min read
Big Data Technology Architecture
Aug 15, 2020 · Big Data

Alluxio: Open‑Source Data Orchestration Platform – Overview, Benefits, Innovations, and Getting‑Started Resources

Alluxio is an open‑source, memory‑centric data orchestration layer that bridges compute frameworks such as Spark, Presto, and TensorFlow with diverse storage systems, offering high‑speed I/O, unified namespace, multi‑level caching, and easy deployment, while providing extensive documentation, download links, and community resources for rapid adoption.

Alluxio · Analytics · Data Orchestration
0 likes · 7 min read
Alibaba Cloud Developer
Jul 29, 2020 · Fundamentals

What Is Data Analysis? Definitions, Skills, History, and Practical Steps

This comprehensive guide explains what data analysis is, the types of data, its historical evolution, the relationship between data analysis, data science and business intelligence, essential skills, why analysis matters, and a step‑by‑step framework with models and metric design for effective decision‑making.

Analytics · Business Intelligence · Metrics
0 likes · 25 min read
Amap Tech
Jul 23, 2020 · Big Data

Overview of Apache Big Data Ecosystem Tools

The article surveys the Apache big‑data ecosystem: Hadoop for storage and resource management; the column stores HBase and Kudu; the compute engines Spark, Flink, Impala, and Presto; ZooKeeper for coordination; Sqoop and Flume for ingestion; Kafka for messaging; Ranger and Sentry for security; Atlas for metadata; Kylin for OLAP; Hive; Griffin for data quality; Zeppelin notebooks; Superset and Tableau for visualization; and the TPCx‑BB benchmark. It ends with a notice about an Alibaba analysis competition.

Analytics · Apache · Data Governance
0 likes · 19 min read
Huolala Tech
Jul 15, 2020 · Big Data

How to Build Smart, Scalable Data Tracking Solutions for Comprehensive Analytics

This article explores the fundamentals, common schemes, pain points, and a smart end‑to‑end solution for data tracking (埋点), offering practical guidelines, architectural diagrams, and a concrete example to help engineers implement comprehensive, controllable, and efficient event collection pipelines.

Analytics · Big Data · Data Tracking
0 likes · 9 min read
Aotu Lab
Jul 7, 2020 · Mobile Development

Zero‑Code Data Tracking for Taro Mini‑Programs with Tencent YouShu

This guide explains how Taro developers can instantly enable eight automatic, zero‑code data‑tracking events and custom analytics in WeChat mini‑programs by integrating Tencent YouShu via a one‑click template, CLI commands, and SDK configuration.

Analytics · Data Tracking · SDK
0 likes · 9 min read
Architects' Tech Alliance
Jul 1, 2020 · Big Data

Gartner’s Perspective on Building a Data Middle Platform: Strategies and Recommendations

According to Gartner, enterprises should view data middle platforms as strategic, collaborative hubs that balance data collection and connection, promote reusable analytics, integrate with digital platforms, and leverage data fabric, AI-driven tools, and graph analysis to become truly data‑driven while maintaining trust and privacy.

Analytics · Data Fabric · Data Platform
0 likes · 12 min read
Big Data Technology Architecture
Jun 7, 2020 · Big Data

Comprehensive Overview of Data Lake Concepts, Architectures, Vendor Solutions, and Use Cases

This article provides an in‑depth, English‑language overview of data lakes, covering their definition, core characteristics, reference architectures, major cloud‑vendor implementations (AWS, Huawei, Alibaba Cloud, Azure), typical industry applications such as advertising and gaming, as well as practical guidance on building and evolving a data lake in a cloud‑native, big‑data environment.

Analytics · Data Architecture · Lakehouse
0 likes · 50 min read
21CTO
May 29, 2020 · Product Management

Unlocking Business Growth with User & Product Profiling: A Practical Guide

This article explains how to build a digital ecosystem using user and product profiling, covering the concepts, motivations, construction methods, and real‑world applications to drive precise marketing, improve product experience, and achieve sustainable business growth.

Analytics · Data-driven · Digital Transformation
0 likes · 20 min read
政采云技术
May 17, 2020 · Frontend Development

Building a User Behavior Data Collection and Analysis System (Hunyi) – Frontend Team Experience

This article describes how the frontend team designed and implemented a comprehensive user behavior data collection and analysis platform, covering its business value, overall architecture, SDK-based data gathering, event interception, processing pipelines, analytics dashboards, and practical insights for product and operations teams.

Analytics · SDK · data collection
0 likes · 15 min read
JD.com Experience Design Center
May 9, 2020 · Fundamentals

10 Essential E‑Commerce Metrics Every Analyst Should Master

This article explains the purpose and types of data metrics, outlines a logical framework for analyzing e‑commerce performance from traffic to behavior to transaction, and details ten key metrics—including GMV, conversion rate, UV value, click‑through and exposure rates—along with practical interpretation tips.

Analytics · GMV · conversion rate
0 likes · 12 min read
DataFunTalk
Apr 7, 2020 · Product Management

Design and Implementation of an A/B Testing System for Data Product Managers

This article explains the core modules of an A/B testing system, details a step‑by‑step design workflow using an internet‑finance example, and highlights key design principles such as scientific traffic allocation, sufficient data, rigorous statistical analysis, and continuous iteration for data‑driven product optimization.

A/B testing · Analytics · Data-driven
0 likes · 24 min read
Xianyu Technology
Mar 26, 2020 · Big Data

Scalable User Behavior Data Collection and Auto-Generated Datasets for Xianyu

Xianyu created a highly extensible user‑behavior collection framework that standardizes data into a common ODPS schema, uses JavaScript Proxy to intercept navigation and API calls, maps business metrics via JSON, and aggregates reports, cutting dataset‑creation effort from days to minutes while avoiding the overhead of heavy full tracking.

Analytics · Big Data · JavaScript
0 likes · 9 min read
360 Quality & Efficiency
Mar 24, 2020 · Big Data

Understanding Granularity in Data Warehouse Design

This article explains the concept of granularity in data warehouse design, describing data models composed of structures, operations, and constraints, illustrating how granularity affects storage detail, query performance, and resource consumption, and recommending a dual‑granularity approach to balance efficiency and analytical depth.

Analytics · Big Data · data modeling
0 likes · 5 min read
Yanxuan Tech Team
Feb 17, 2020 · Big Data

Why Data Warehouses Matter: From Basics to the Hadoop Ecosystem

This article explains the purpose of data as a strategic asset, compares traditional databases with data warehouses, outlines key characteristics and related concepts of data warehouses, and introduces the Hadoop ecosystem components that support large‑scale data storage and analysis.

Analytics · ETL · Hadoop
0 likes · 14 min read
Big Data Technology Architecture
Feb 4, 2020 · Big Data

What Is a Data Lakehouse? Introduction, Key Features, and Evolution

The article explains the emerging Lakehouse paradigm that combines the low‑cost storage of data lakes with the management and ACID guarantees of data warehouses, detailing its advantages over traditional architectures, core capabilities, early implementations, and its role in supporting modern AI and analytics workloads.

Analytics · Lakehouse · data-warehouse
0 likes · 9 min read
Xueersi Online School Tech Team
Sep 6, 2019 · Big Data

Real-Time Data Architecture, Evolution, and Applications at an Online School

The article details the six‑layer big‑data architecture of an online school, chronicles its migration from Storm to Spark Streaming and finally to Flink, and showcases concrete real‑time applications such as gateway monitoring, user‑profile tagging, renewal reporting, and advertising analysis, while outlining future development directions.

Analytics · Big Data Architecture · Flink
0 likes · 14 min read
ITPUB
Sep 3, 2019 · Frontend Development

Turn Any GitHub Repository into a Live Front‑End Site in One Click

This guide shows how to publish a GitHub repository directly from the master branch, share specific code lines via URL fragments, auto‑close issues with commit keywords, embed GitHub widgets, adjust language detection with .gitattributes, and view traffic and trending data, all without extra branches or tools.

Analytics · GitHub · GitHub Pages
0 likes · 7 min read
Youzan Coder
Aug 14, 2019 · Big Data

Comprehensive Guide to Data Collection, Event Modeling, and Tracking in Big Data Platforms

The guide explains how comprehensive data collection in big‑data platforms relies on a standardized event model, passive and code‑based embedding, multi‑platform SDKs, a log‑middleware layer, precise location tracking, and an embedding management platform that supports workflow, testing, quality monitoring, and scalable infrastructure for future enhancements.

Analytics · Big Data · Log Processing
0 likes · 19 min read
Xianyu Technology
Jun 12, 2019 · Mobile Development

High‑Accuracy User Behavior Tracking in Flutter for Xianyu

To replace the native‑only tracking used by Xianyu after its migration to Flutter, the team built a high‑accuracy solution that mirrors the Flutter navigation stack with an index list, correctly fires enter/leave events on pushes, pops and filtered dialogs, and adds exposure detection based on 50 % visibility for 500 ms, ultimately delivering 100 % tracking accuracy in production.

Analytics · Flutter · Mobile Development
0 likes · 9 min read
21CTO
Jan 26, 2019 · Big Data

Data Lake vs Data Warehouse: Which One Powers Your Business?

This article explains the core differences between data lakes and data warehouses, their respective strengths, and how they complement each other to support both exploratory analytics and routine business reporting.

Analytics · Big Data · Data Lake
0 likes · 5 min read
DataFunTalk
Jan 25, 2019 · Big Data

Evolution and Technical Architecture of Ant Financial's Data Analysis Platform

This article presents a comprehensive overview of Ant Financial's data analysis platform, detailing its departmental role, the data analysis lifecycle, the platform's evolution from version 1.0 to 3.0, core technical components such as intelligent sync and pre‑computation, and a practical case study of performance optimization.

Analytics · DataAnalysis · DataEngineering
0 likes · 24 min read
Architects' Tech Alliance
Jan 2, 2019 · Big Data

A Comprehensive Guide to Data Visualization Tools for Big Data Analysis

This article surveys a wide range of data visualization tools—from basic spreadsheet solutions like Excel to advanced JavaScript libraries such as D3, GIS platforms like QGIS, and specialized charting suites—highlighting their key features, supported data sources, and typical use cases for effective big‑data analysis.

Analytics · Data visualization · charting tools
0 likes · 12 min read
JD Retail Technology
Dec 12, 2018 · Big Data

Construction and Architecture of JD Overseas Data Analysis Platform (Columbus Platform)

JD.com’s overseas data analysis platform, dubbed the Columbus platform, combines a lightweight data warehouse deployment with standardized, customizable BI tools to provide real‑time and offline analytics, visualization, KPI management, and future self‑service reporting and predictive capabilities for its global e‑commerce operations.

Analytics · BI · Big Data
0 likes · 9 min read
Tencent Cloud Developer
Oct 31, 2018 · Big Data

Top 10 Data Visualization Cases of 2018

The 2018 roundup showcases ten standout data‑visualization projects—from a Spotify‑driven map of Bruce Springsteen’s songs and a real‑time solar‑lunar app, to Game of Thrones dialogue analytics, interactive world‑book maps, Apollo mission prints, child‑friendly Earth bios, mascara‑choice charts, Marvel cinematic networks, ISS construction timelines, and a poster‑ready Jupiter moons diagram.

2018 · Analytics · Data visualization
0 likes · 7 min read
Architect's Tech Stack
Oct 23, 2018 · Fundamentals

Common Data Collection Challenges in Startups and Practical Solutions

The article examines three typical data collection problems faced by startups—unclear collection methods, chaotic tracking points, and poor collaboration between data and engineering teams—and offers practical strategies such as adopting full‑event models, appointing data architects, and securing top‑down support to achieve reliable, comprehensive analytics.

Analytics · Data Governance · data collection
0 likes · 10 min read
dbaplus Community
Oct 16, 2018 · Databases

Master MySQL Window Functions: From Basics to Advanced Use Cases

This tutorial explains why and how to use MySQL window functions, covering their concepts, syntax, various function families, practical examples such as ranking, distribution, lead/lag, first/last, and aggregations within dynamic windows, plus detailed SQL snippets and visual illustrations.

Analytics · database · mysql
0 likes · 13 min read
360 Quality & Efficiency
Sep 12, 2018 · Fundamentals

Generic Architecture and Key Differentiators of IoT Platforms

The article translates and explains a typical IoT platform architecture, outlining its core Gather‑Analyze‑Act functions, common building blocks such as device interfaces, messaging brokers, storage and analytics layers, and highlights key differentiators like multi‑tenancy, protocol support, security, and extensible rule engines.

Analytics · Device onboarding · IoT
0 likes · 7 min read
58UXD
Aug 15, 2018 · Product Management

7 Must‑Read Books to Master User Experience and Product Design

This article curates a reading list of seven influential books covering high‑performance reading, the Pyramid Principle, comprehensive UX design, website analytics, user‑experience measurement, visualization techniques, and demand‑matching, offering practical insights for building strong product and UX skills.

Analytics · Book Recommendations · UX design
0 likes · 7 min read
21CTO
Jun 8, 2018 · Big Data

8 Powerful Techniques to Elevate Your Data Visualizations

This article presents eight advanced data‑visualization strategies—including conditional formatting, trend lines, rule‑based filtering, hierarchical views, sorting, formatting, comparative charts, and clear titles—to help big‑data professionals present information accurately, efficiently, and compellingly for better decision‑making.

Analytics · chart design
0 likes · 6 min read
Architecture Digest
May 27, 2018 · Big Data

Installing Elasticsearch and Performing Data Aggregation Queries

This article walks through installing Elasticsearch 5.6.9, configuring system limits, creating indices, inserting and deleting documents, executing complex aggregation queries, and integrating Elasticsearch with Java using the TransportClient, providing a practical guide for building analytics on large‑scale data.

Analytics · Big Data · Elasticsearch
0 likes · 12 min read
Qunar Tech Salon
Apr 10, 2018 · Big Data

Design and Implementation of Meituan's Traffic Compass Data Warehouse for Hotel‑Travel Business

The article presents Meituan's Traffic Compass—a data‑warehouse‑driven traffic analysis platform for the hotel‑travel business—detailing its background, challenges, architectural layers, dimensional modeling, Kylin‑based query engine, configuration mechanisms, performance metrics, and future optimization plans.

Analytics · Big Data · Kylin
0 likes · 14 min read
Baidu Intelligent Testing
Oct 9, 2017 · Big Data

User Behavior Analysis: From Data Acquisition to Funnel Insights

The article explains how to move beyond macro app metrics by collecting offline and real‑time user data, storing it in HDFS, processing it with Spark, visualizing behavior paths as state‑machine trees, and performing branch‑funnel analysis to uncover conversion bottlenecks and improve product quality.

Analytics · Big Data · Funnel Analysis
0 likes · 5 min read
Architecture Digest
Jul 22, 2017 · Big Data

Popular Big Data Tools and Their Descriptions

This article provides an extensive overview of more than ninety open‑source and commercial big‑data tools—including ETL platforms, resource managers, storage systems, messaging queues, processing engines, and visualization libraries—detailing their core functions, typical use cases, and notable adopters.

Analytics · Big Data · Data Integration
0 likes · 26 min read
Qunar Tech Salon
May 19, 2017 · Mobile Development

Zero‑Instrumentation Interaction and Performance Monitoring for Large‑Scale Mobile Apps

The article presents a comprehensive approach to diagnosing crash and performance issues in large‑scale mobile applications: it reconstructs user interaction traces through a zero‑instrumentation analytics platform, compile‑time AOP instrumentation, and unified data aggregation, which improves debugging efficiency and reduces operational overhead.

Analytics · aop · monitoring
0 likes · 9 min read
Architects' Tech Alliance
May 2, 2017 · Big Data

Data Mining and Innovation in the Adult Entertainment Industry

The article examines how extensive data collection and analysis of adult performers and their content reveal surprising demographic patterns and drive innovative business models, product development, and technology adaptations within the porn industry, illustrating the practical impact of big‑data insights beyond traditional sectors.

Adult Industry · Analytics · Innovation
0 likes · 13 min read
Qunar Tech Salon
Feb 22, 2017 · Big Data

Understanding Ctrip Flight Ticket Tracking System (UBT) and Its Key Metrics

This article explains Ctrip's flight ticket tracking framework (UBT), detailing client‑side and server‑side event collection methods, the purpose and trade‑offs of each tracking type, metric definitions, data association challenges, common pitfalls, and best practices for reliable data‑driven analysis.

Analytics · Big Data · Ctrip
0 likes · 20 min read
dbaplus Community
Dec 22, 2016 · Databases

MariaDB Columnstore Deep Dive: Architecture, Performance, and Deployment

MariaDB Columnstore, the newly integrated InfiniDB storage engine, brings column‑store analytics to the MySQL ecosystem, offering petabyte‑scale data handling, distributed deployment, and substantial performance gains over InnoDB. The article covers installation steps, kernel tuning, command‑line administration, and optimization tips for production environments.

Analytics · Columnstore · InfiniDB
0 likes · 11 min read
Meitu Technology
Dec 1, 2016 · Big Data

Meitu Internet Technology Salon: Big Data Architecture Evolution and Practice, and Tencent Multi‑Dimensional Analysis Platform

At Meitu’s third Internet Technology Salon, held in Xiamen on November 26, 2016, more than 150 senior engineers heard Meitu’s Lu Rongbin trace the company’s progression from simple rsync scripts to a scalable mobile data and open statistical platform. Tencent’s Zhao Shiyuan then presented the Glacier multi‑dimensional analysis system for fast, tag‑driven queries. The event underscored collaborative technical exchange in South China.

Analytics · Big Data · Data Platform
0 likes · 6 min read
Architects' Tech Alliance
Nov 4, 2016 · Big Data

The Seven Camps of the Global Big Data Ecosystem

The article outlines how the mobile Internet merges the data‑driven society with the physical world to create a new big‑data architecture, and describes the seven distinct camps—Infrastructure, Analytics, Applications, Cross‑Domain Architecture, Open‑Source, Data Sources & APIs, and Incubator & Training—that together form a comprehensive end‑to‑end big‑data solution ecosystem.

API · Analytics · Applications
0 likes · 3 min read
Ctrip Technology
Sep 2, 2016 · Product Management

Data‑Driven Product Design at Ctrip: Practices and Case Studies

This article examines how Ctrip integrates data analysis into product design through two detailed case studies—its homestay channel evolution and the Kezutong app order‑detail redesign—highlighting tools, methodologies, and results that illustrate data‑driven decision making in product management.

Analytics · Ctrip · User experience
0 likes · 12 min read
ITPUB
Feb 22, 2016 · Frontend Development

How to Prevent Web Analytics Data Loss on Page Unload: From Blocking Ajax to Beacon API

This article examines why analytics requests often disappear when users leave a page, reviews traditional blocking tricks such as synchronous Ajax, busy‑wait loops, and image hacks, and then presents optimized approaches like URL or window.name transfer before recommending the modern Beacon API for reliable, non‑blocking data transmission.

Analytics · JavaScript · beacon-api
0 likes · 8 min read
Architecture Digest
Feb 22, 2016 · Big Data

Building High‑Performance Big Data Analytics Systems: Techniques and Best Practices

An in‑depth guide outlines technology‑agnostic best‑practice techniques for building high‑performance big data analytics systems, covering data acquisition, storage, processing, visualization, and security, and explains how to address the five V’s of big data to meet demanding operational and performance requirements.

Analytics · Big Data · data engineering
0 likes · 20 min read
21CTO
Feb 16, 2016 · Mobile Development

Scaling Android Development: Insights on Team Management, Testing, and Open‑Source Practices

At AnDevCon, senior engineers from Twitter, Amazon, Cyanogen, Square, Eventbrite and others shared practical strategies for managing large Android teams, implementing effective testing pipelines, leveraging A/B experiments, designing for testability, handling embedded testing, contributing to open‑source, and using analytics to drive product decisions.

Analytics · Android
0 likes · 15 min read
dbaplus Community
Jan 27, 2016 · Databases

How to Densify Sparse Data with Oracle 10g Partitioned Outer Join

This article explains why sparse data in Oracle tables hampers continuous time‑series reporting, presents the Partitioned Outer Join syntax added in Oracle 10g, and demonstrates step by step how to fill one‑dimensional and multi‑dimensional gaps to produce dense datasets, using practical SQL examples.

Analytics · Data Densification · Oracle
0 likes · 17 min read
Qunar Tech Salon
Aug 17, 2015 · Big Data

Comprehensive Overview of Open‑Source Big Data Tools and Platforms

This article presents a detailed, categorized catalogue of more than fifty open‑source big‑data projects—including Hadoop‑related utilities, analytics platforms, databases, BI solutions, data‑mining packages, query engines, programming languages, search tools, and in‑memory technologies—highlighting their primary functions, supported operating systems, and official links.

Analytics · Hadoop · In-Memory
0 likes · 31 min read