Past Memory Big Data
Author

Past Memory Big Data

A popular big-data architecture channel with over 100,000 developers. Publishes articles on Spark, Hadoop, Flink, Kafka and more. Visit the Past Memory Big Data blog at https://www.iteblog.com. Search "Past Memory" on Google or Baidu.

58
Articles
0
Likes
22
Views
0
Comments
Recent Articles

Latest from Past Memory Big Data

58 recent articles
Past Memory Big Data
Past Memory Big Data
Apr 23, 2024 · Big Data

ByConity Replaces ClickHouse for OLAP, Cutting Resource Costs Over 50%

MetaApp replaced ClickHouse with the open‑source cloud‑native warehouse ByConity, achieving more than 50% reduction in resource costs while delivering comparable or faster OLAP query performance across distinct, retention, conversion, and point‑lookup workloads, thanks to compute‑storage separation, read/write isolation, and minute‑level elastic scaling.

ByConityClickHouseOLAP
0 likes · 15 min read
ByConity Replaces ClickHouse for OLAP, Cutting Resource Costs Over 50%
Past Memory Big Data
Past Memory Big Data
Apr 19, 2024 · Databases

How ByteHouse Achieves Hundred‑Fold OLAP Performance Gains

ByteHouse, a cloud‑native data warehouse built on ClickHouse, redesigns storage‑compute separation, introduces a new MPP architecture, rule‑based and cost‑based optimizers, exchange runtime filters, and parallelism techniques, delivering 10‑200× faster query performance on TPC‑DS, TPC‑H and SSB benchmarks and boosting point‑lookup QPS to 32,000.

BenchmarkByteHouseCost-Based Optimizer
0 likes · 19 min read
How ByteHouse Achieves Hundred‑Fold OLAP Performance Gains
Past Memory Big Data
Past Memory Big Data
Jan 17, 2024 · Big Data

How WeChat Implements a StarRocks‑Powered Lakehouse Across Multiple Business Scenarios

WeChat evolved its data platform from Hadoop to ClickHouse and finally to a StarRocks‑based lakehouse, solving data fragmentation and storage redundancy while achieving sub‑second to minute‑level query latency, cutting storage costs by over 65%, halving operational tasks, and reducing offline job time by two hours across several business lines.

Big DataLakehouseMaterialized Views
0 likes · 16 min read
How WeChat Implements a StarRocks‑Powered Lakehouse Across Multiple Business Scenarios
Past Memory Big Data
Past Memory Big Data
Oct 10, 2023 · Big Data

2023 Big Data Interview Guide: Hadoop, Hive, Doris, Data Warehouse Essentials

This comprehensive 2023 guide covers essential big‑data interview topics, providing detailed explanations and step‑by‑step processes for Hadoop HDFS read/write, YARN, Hive table types and optimizations, Doris architecture and data models, data‑warehouse layers, modeling techniques, quality monitoring, and classic algorithm design questions such as TOP‑K and duplicate detection.

Big DataData WarehouseDoris
0 likes · 54 min read
2023 Big Data Interview Guide: Hadoop, Hive, Doris, Data Warehouse Essentials
Past Memory Big Data
Past Memory Big Data
Jun 29, 2023 · Industry Insights

How OceanBase Powered Sichuan Rural Credit Union’s Core System Upgrade and Boosted Customer Efficiency

Sichuan Rural Credit Union, with over 5,000 outlets and 3 billion daily transactions, migrated its core and mobile banking systems to OceanBase's native distributed database, cutting card‑issue time from 30 minutes to 5 minutes, loan processing from 3‑5 days to 1‑33 minutes, and saving more than 40% in infrastructure costs.

Digital TransformationOceanBasePerformance Optimization
0 likes · 16 min read
How OceanBase Powered Sichuan Rural Credit Union’s Core System Upgrade and Boosted Customer Efficiency
Past Memory Big Data
Past Memory Big Data
Jun 19, 2023 · Databases

Why esProc SPL Outperforms SQLite for Small Java Applications

The article analyzes SQLite's shortcomings in data‑source support, complex calculations, and workflow handling for tiny Java apps, then demonstrates how the open‑source esProc SPL engine offers richer data‑source integration, simpler SQL‑like syntax, powerful calculation capabilities, and built‑in flow control, making it a more suitable lightweight database alternative.

Data ProcessingDatabaseSPL
0 likes · 16 min read
Why esProc SPL Outperforms SQLite for Small Java Applications
Past Memory Big Data
Past Memory Big Data
Apr 19, 2023 · Databases

Why a New Open‑Source Language Is Needed to Replace SQL

The article analyses SQL’s fundamental shortcomings—its lack of ordered collections, incomplete set semantics, and missing object‑reference support—illustrates these issues with concrete query examples, and argues that the open‑source Structured Process Language (SPL) solves them with true ordering, full collection handling, and native reference mechanisms.

DatabasesOpen SourceSPL
0 likes · 17 min read
Why a New Open‑Source Language Is Needed to Replace SQL
Past Memory Big Data
Past Memory Big Data
Feb 23, 2023 · Databases

Optimizing ClickHouse: ByteHouse’s Real‑Time Data Warehouse Breakthrough

ByteHouse, a cloud‑native data warehouse on Volcano Engine, leverages ClickHouse to deliver ultra‑fast real‑time analytics, detailing its business motivations, ROI‑driven evaluation, performance trade‑offs, architectural evolution, and concrete financial‑industry deployments such as real‑time monitoring and risk‑control scenarios.

ByteHouseClickHouseData Engineering
0 likes · 17 min read
Optimizing ClickHouse: ByteHouse’s Real‑Time Data Warehouse Breakthrough
Past Memory Big Data
Past Memory Big Data
Feb 1, 2023 · Databases

From ClickHouse to ByteHouse: Real‑Time Data Analytics Optimization Practices

This article details ByteDance's large‑scale ClickHouse deployment, presents two real‑time analytics use cases—recommendation metrics and ad delivery—and explains the performance bottlenecks encountered and the concrete engineering solutions such as asynchronous indexing, multi‑threaded Kafka Engine, and enhanced Buffer Engine that boosted throughput and ensured data integrity.

Buffer EngineByteHouseClickHouse
0 likes · 11 min read
From ClickHouse to ByteHouse: Real‑Time Data Analytics Optimization Practices