Tag

LSM

0 views collected around this technical thread.

DataFunTalk
DataFunTalk
Aug 21, 2024 · Big Data

Apache Paimon: Real‑Time Lakehouse Architecture, Core Technologies, Application Scenarios, and Frontier Features

This article presents a comprehensive overview of Apache Paimon, covering the concept of real‑time lakehouses, the underlying technologies such as LSM and merge‑on‑write, practical application cases across enterprises, and the latest frontier features like tags, branches, and advanced indexing, illustrating how Paimon bridges batch and streaming workloads in modern big‑data ecosystems.

Apache PaimonBig DataLSM
0 likes · 16 min read
Apache Paimon: Real‑Time Lakehouse Architecture, Core Technologies, Application Scenarios, and Frontier Features
DataFunTalk
DataFunTalk
Oct 3, 2023 · Big Data

Design and Practices of Alibaba Cloud's Billion‑Scale Real‑Time Log Analysis System

This article presents the architecture, core challenges, key design decisions, and future directions of Alibaba Cloud's SLS platform, which handles billions of daily log queries with sub‑300 ms latency by leveraging LSM‑based storage, indexing, columnar layout, distributed caching, and multi‑tenant isolation.

Distributed StorageIndexingLSM
0 likes · 17 min read
Design and Practices of Alibaba Cloud's Billion‑Scale Real‑Time Log Analysis System
DataFunTalk
DataFunTalk
Apr 7, 2023 · Big Data

Introducing Apache Paimon: An Open‑Source Streaming Lakehouse Storage Engine

Apache Paimon is an open‑source streaming data lake storage system that combines LSM‑based real‑time updates, open file formats, and deep integration with Flink, Spark, and Trino to deliver high‑throughput ingestion, low‑latency queries, and unified batch‑stream processing for modern big‑data workloads.

Apache PaimonBig DataFlink
0 likes · 7 min read
Introducing Apache Paimon: An Open‑Source Streaming Lakehouse Storage Engine
OPPO Kernel Craftsman
OPPO Kernel Craftsman
Oct 29, 2021 · Information Security

Linux Security Module (LSM) Fundamentals and Implementation

The article explains the Linux Security Module (LSM) framework, detailing its origins, hook‑based architecture, kernel integration points, initialization process, and how it enables flexible, non‑intrusive security enhancements such as SELinux, allowing multiple security modules to coexist within the Linux kernel.

Kernel SecurityLSMLinux security
0 likes · 9 min read
Linux Security Module (LSM) Fundamentals and Implementation
Tencent Cloud Developer
Tencent Cloud Developer
Dec 24, 2020 · Big Data

Distributed Search Engine Design and Index Management in WeChat Search

The article details WeChat Search’s practical distributed architecture—using a Chubby‑elected leader for shard‑to‑node mapping, hash‑based sharding with dynamic rebalancing, a Lambda‑style batch and near‑real‑time indexing pipeline, relaxed monotonic consistency, and group‑based searcher scaling—to illustrate trade‑offs and lessons for building scalable, reliable search services.

Index ManagementLSMLeader Election
0 likes · 28 min read
Distributed Search Engine Design and Index Management in WeChat Search
DataFunTalk
DataFunTalk
Dec 30, 2019 · Databases

Cassandra: Past, Present, and Future – History, Architecture, Features, and Use Cases

This article summarizes a Cassandra meetup presentation that traces the database's origins from BigTable and Dynamo, outlines its key milestones, explains its peer‑to‑peer and LSM architecture, highlights current features, real‑world deployments, performance advantages, and previews upcoming 4.0 releases and community projects.

Big DataCassandraDistributed Database
0 likes · 14 min read
Cassandra: Past, Present, and Future – History, Architecture, Features, and Use Cases
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Sep 5, 2017 · Databases

Can Separating Keys and Values Boost LSM Performance on SSDs?

This article examines the trade‑offs of LSM‑based storage engines on SSDs, highlighting write amplification issues, the benefits of separating keys from values via the WiscKey approach, and the challenges of range queries, garbage collection, and crash consistency.

Database StorageKey-Value SeparationLSM
0 likes · 8 min read
Can Separating Keys and Values Boost LSM Performance on SSDs?
Ctrip Technology
Ctrip Technology
Jun 3, 2014 · Databases

SessionDB: A High‑Performance LSM‑Based Key/Value Store for Stateless Sessions

The article introduces SessionDB, a Java‑implemented, LSM‑tree‑based key/value storage engine designed to eliminate sticky sessions by providing high‑throughput, durable, and scalable session data handling, and details its architecture, optimizations, sharding strategy, and benchmark comparisons with BerkeleyDB, LevelDB, and RocksDB.

Bloom FilterJavaKey-Value Store
0 likes · 13 min read
SessionDB: A High‑Performance LSM‑Based Key/Value Store for Stateless Sessions