Tagged articles
64 articles
Page 1 of 1
Alibaba Cloud Developer
Alibaba Cloud Developer
Feb 27, 2026 · Databases

How DuckDB Compression Supercharges AliSQL Storage and Cuts MySQL Costs

AliSQL integrates DuckDB as its storage engine to achieve high‑density columnar compression and fast analytical scans, detailing DuckDB’s multi‑layer storage format, adaptive compression algorithm selection, performance benchmarks versus InnoDB, HBase, ClickHouse, OceanBase, and the engineering optimizations AliSQL adds for throughput and cost reduction.

AliSQLColumnar StorageDatabase Optimization
0 likes · 12 min read
How DuckDB Compression Supercharges AliSQL Storage and Cuts MySQL Costs
JD Tech Talk
JD Tech Talk
Sep 2, 2025 · Databases

Unlock ClickHouse’s Secret Weapons: The 9 Techniques Behind Lightning‑Fast Queries

This article explores ClickHouse’s high‑performance OLAP architecture, covering its MPP design, columnar storage, vectorized execution, pre‑sorting, table engines, data types, sharding and replication strategies, as well as index designs that together enable rapid analysis of massive datasets.

ClickHouseColumnar StorageVectorized Execution
0 likes · 15 min read
Unlock ClickHouse’s Secret Weapons: The 9 Techniques Behind Lightning‑Fast Queries
Amap Tech
Amap Tech
Apr 7, 2025 · Mobile Development

Gaode Uninhabited Area Project: Satellite Rescue and Offline Navigation Technologies

Gaode’s Uninhabited Area project introduces Satellite Rescue, which transmits encrypted, natural‑language emergency messages via satellite and a rescue chatroom to boost response speed, and Offline Navigation, employing quadtree indexing, predictive caching and seamless mode switching to guide users through network‑dead zones, proven in harsh Inner Mongolian terrains.

data compressionemergency rescuemobile app
0 likes · 8 min read
Gaode Uninhabited Area Project: Satellite Rescue and Offline Navigation Technologies
Model Perspective
Model Perspective
Feb 26, 2025 · Artificial Intelligence

How Do Large Language Models Compress Massive Data? Limits and Techniques

This article explains how large language models act like a super‑library by compressing vast amounts of text using information‑theoretic concepts, probability‑based coding, autoregressive neural networks, and arithmetic coding, while discussing accuracy, compression ratios, and theoretical limits.

AIarithmetic codingautoregressive networks
0 likes · 8 min read
How Do Large Language Models Compress Massive Data? Limits and Techniques
vivo Internet Technology
vivo Internet Technology
Dec 4, 2024 · Databases

OceanBase Implementation and Migration Practices at vivo

vivo migrated five 20‑TB MySQL clusters to OceanBase using OCP, oblogproxy, and OMS, eliminating sharding costs, achieving over 70% storage savings, improving consistency and performance, and leveraging native distributed architecture, tenant isolation, and strong compression to support scalable, reliable operations.

OceanBasedata compressiondatabase migration
0 likes · 16 min read
OceanBase Implementation and Migration Practices at vivo
BirdNest Tech Talk
BirdNest Tech Talk
Oct 9, 2024 · Fundamentals

Unlocking the Power of Bit Manipulation: Real‑World Uses and Go Tips

This article explains why bit manipulation is a staple technique in computer science, detailing performance gains, state flag encoding, mask operations, data compression, cryptography, image processing, hash structures, binary counting, and algorithm design, with practical Go examples.

GoHashingImage Processing
0 likes · 3 min read
Unlocking the Power of Bit Manipulation: Real‑World Uses and Go Tips
macrozheng
macrozheng
Oct 7, 2024 · Game Development

How 128KB Powered Classic Games: Secrets of Retro Graphics, Audio, and Code

This article explores why 1980s games like Contra could deliver rich graphics, music, and gameplay within just 128KB by using tile-based graphics, specialized audio chips, and ultra‑compact code, contrasting those techniques with modern developers' expectations of massive storage.

Audio SynthesisCode OptimizationRetro Gaming
0 likes · 7 min read
How 128KB Powered Classic Games: Secrets of Retro Graphics, Audio, and Code
Programmer DD
Programmer DD
Jul 24, 2024 · Databases

How We Cut Redis Costs by $460k Monthly: 10 Proven Optimization Strategies

In 2023, a TapTap infra team reduced Redis operating costs by 460,000 CNY per month through low‑cost ESSD instances, traffic compression, unused‑instance cleanup, TTL management, data migration, online compression, and targeted cleaning, detailing ten concrete measures and the open‑source tools that enabled zero‑downtime optimization.

Cost OptimizationDatabase ManagementInfrastructure
0 likes · 17 min read
How We Cut Redis Costs by $460k Monthly: 10 Proven Optimization Strategies
Baidu Tech Salon
Baidu Tech Salon
Jul 23, 2024 · Artificial Intelligence

Linear Algebra Fundamentals and PaddlePaddle Applications

The article reviews core linear algebra concepts—vectors, matrices, eigenvalues, and transformations—and demonstrates how PaddlePaddle’s paddle.linalg API enables practical tasks such as least‑squares regression, image compression via SVD, PCA‑based dimensionality reduction, and broader machine‑learning, graphics, cryptography, and optimization applications.

PCAPaddlePaddleSVD
0 likes · 10 min read
Linear Algebra Fundamentals and PaddlePaddle Applications
vivo Internet Technology
vivo Internet Technology
Jun 19, 2024 · Big Data

Understanding BitMap and Roaring BitMap: Principles, Containers, and Java API Usage

The article explains BitMap fundamentals and introduces Roaring BitMap’s compressed container architecture—Array, BitMap, and Run containers—detailing their conversion logic, Java implementation snippets, performance advantages over traditional BitSets, and practical API usage for high‑performance, memory‑efficient big‑data applications.

Big DataContainersJava
0 likes · 18 min read
Understanding BitMap and Roaring BitMap: Principles, Containers, and Java API Usage
Tencent Cloud Developer
Tencent Cloud Developer
Apr 16, 2024 · Databases

Major Updates and New Features of WCDB 2: Multi‑language Support, SQL Builder, Backup & Recovery, Data Migration, Compression, and Performance Optimizations

WCDB 2 introduces multi-language support (C++, Java, Kotlin), a rewritten SQL builder, fast backup and recovery with sub‑second times, incremental WAL‑based backup, automatic data migration, Zstandard compression, on‑the‑fly column addition, and performance enhancements such as an FTS5 optimizer and interruptible transactions, delivering a robust high‑performance mobile database solution.

Data MigrationSQL BuilderSQLite
0 likes · 47 min read
Major Updates and New Features of WCDB 2: Multi‑language Support, SQL Builder, Backup & Recovery, Data Migration, Compression, and Performance Optimizations
macrozheng
macrozheng
Feb 18, 2024 · Game Development

How 128KB Powered Classic Games: Secrets of Retro Game Development

This article explores why 1980s games like Contra could deliver rich graphics, music, and gameplay within just 128KB by examining data size perception, tile‑based graphics, limited audio chips, and ultra‑compact code techniques that contrast sharply with modern app bloat.

Game DevelopmentRetro GamingTile Graphics
0 likes · 7 min read
How 128KB Powered Classic Games: Secrets of Retro Game Development
Architect
Architect
Feb 1, 2024 · Backend Development

Design and Optimization of Trace2.0: A High‑Performance Backend Tracing System

Trace2.0 is an OpenTelemetry‑based application monitoring system that processes petabyte‑scale trace data using multi‑channel client protocols, gRPC, load‑balancing optimizations, ZSTD compression, Kafka pipelines, ClickHouse storage, and a JDK 21 upgrade with virtual threads, achieving significant performance and cost improvements.

ClickHouseJDK21OpenTelemetry
0 likes · 15 min read
Design and Optimization of Trace2.0: A High‑Performance Backend Tracing System
Open Source Tech Hub
Open Source Tech Hub
Jan 26, 2024 · Fundamentals

How Smaz2 Compresses LoRa Messages on Tiny Devices

This article explains the motivation, dictionary design, bigram table, encoding rules, and real‑world compression results of the Smaz2 library, a space‑optimized C/Python compressor for short LoRa messages on microcontrollers with less than 2 KB RAM.

CLoRAPython
0 likes · 8 min read
How Smaz2 Compresses LoRa Messages on Tiny Devices
Efficient Ops
Efficient Ops
Jan 17, 2024 · Operations

How We Built a Scalable Cloud‑Native Log Center with ClickHouse

This article details a courier company's evolution from a simple file‑based logging system to a cloud‑native log center, describing the limitations of the original architecture, the migration to an ELK stack, subsequent challenges, and the final redesign using ClickHouse for high compression, low cost, and improved query performance.

ClickHouseELKLog Management
0 likes · 12 min read
How We Built a Scalable Cloud‑Native Log Center with ClickHouse
dbaplus Community
dbaplus Community
Jan 7, 2024 · Databases

How to Cut MySQL Storage Costs by Over 50%: A Practical Framework

This article presents a systematic, nine‑grid method for reducing MySQL storage expenses—including table compression, JSON field serialization, and hot‑cold data separation—while quantifying benefits, ensuring data safety, and validating system stability through staged testing and SRE metrics.

Database Cost ReductionPerformance TestingStorage Optimization
0 likes · 13 min read
How to Cut MySQL Storage Costs by Over 50%: A Practical Framework
JD Tech
JD Tech
Dec 15, 2023 · Databases

Systematic Approach to Reducing MySQL Database Storage Costs

This article presents a comprehensive, step‑by‑step methodology for cutting MySQL storage expenses—including background analysis, challenge identification, a nine‑grid systematic framework, benefit calculation, safety and stability verification, rollout, gray‑release, and rollback—demonstrating over 50% disk space reduction while managing risk.

Cost reductionDatabase OptimizationStorage Management
0 likes · 14 min read
Systematic Approach to Reducing MySQL Database Storage Costs
DataFunTalk
DataFunTalk
Aug 21, 2023 · Databases

Case Study: Building a Real‑Time Log Data Analysis Platform with Apache Doris at China Unicom

This article describes how China Unicom’s Western Innovation Research Institute designed and deployed a centralized, real‑time log analytics platform using Apache Doris, detailing the migration from Hive and ClickHouse, performance optimizations, storage cost reductions, and the resulting improvements in data ingestion, query speed, and operational efficiency.

Apache DorisBig DataCold‑Hot Data Management
0 likes · 18 min read
Case Study: Building a Real‑Time Log Data Analysis Platform with Apache Doris at China Unicom
dbaplus Community
dbaplus Community
Aug 15, 2023 · Databases

Why ClickHouse Outperforms MySQL, Elasticsearch, and HBase for Massive Event Data

This article examines the massive data storage and real‑time analysis needs of an activity platform, evaluates MySQL, sharded MySQL, Elasticsearch and HBase, and explains why ClickHouse—with its columnar storage, MergeTree engine, vectorized execution, and distributed architecture—offers the best balance of write performance, query speed, and scalability for billions of records.

Big DataClickHouseColumnar Database
0 likes · 31 min read
Why ClickHouse Outperforms MySQL, Elasticsearch, and HBase for Massive Event Data
Liangxu Linux
Liangxu Linux
Jul 26, 2023 · Databases

Boost SQL Server Queries with Column Store Indexes: Architecture & Benefits

This article explains how column store indexes in SQL Server store each column separately, dramatically improve query performance through batch processing and compression, outlines their physical structure, encoding methods, creation syntax, maintenance steps, and space usage considerations.

Column StoreSQL Serverdata compression
0 likes · 11 min read
Boost SQL Server Queries with Column Store Indexes: Architecture & Benefits
Python Programming Learning Circle
Python Programming Learning Circle
Mar 29, 2023 · Game Development

How 128KB Games Delivered Rich Graphics and Audio: Tile Graphics, Memory Constraints, and Code Efficiency

This article explains how early 1980s games like Contra fit extensive storylines, graphics, and music into just 128KB by using tile‑based graphics, minimal audio data, and highly optimized code, illustrating the dramatic shift in data expectations as technology advances.

Game DevelopmentRetro GamingTile Graphics
0 likes · 6 min read
How 128KB Games Delivered Rich Graphics and Audio: Tile Graphics, Memory Constraints, and Code Efficiency
Alibaba Terminal Technology
Alibaba Terminal Technology
Nov 3, 2022 · Mobile Development

How to Supercharge Mobile Seat-Selection Apps: Data, Caching, and Rendering Optimizations

This article details comprehensive strategies for handling massive seat‑selection data on mobile devices, covering data modeling, interface pre‑loading, static and dynamic compression, view hierarchy tweaks, bitmap reuse, drawing optimizations, hardware acceleration, and multithreaded task handling to dramatically improve load times and frame rates.

AndroidMobilecaching
0 likes · 21 min read
How to Supercharge Mobile Seat-Selection Apps: Data, Caching, and Rendering Optimizations
DaTaobao Tech
DaTaobao Tech
Oct 18, 2022 · Frontend Development

Client‑Side Interface Performance Optimization: Analysis and Solutions

Client‑side developers can boost API speed and user experience by collaborating with back‑end teams to cut server processing and network latency, using caching, concurrency, smaller or UDP‑based packets, and aggressively pruning or compressing payloads, which can shrink megabyte responses to tens of kilobytes and shave dozens of milliseconds off latency.

API optimizationAsynchronousNetwork Latency
0 likes · 9 min read
Client‑Side Interface Performance Optimization: Analysis and Solutions
Architects' Tech Alliance
Architects' Tech Alliance
Sep 23, 2022 · Databases

Analysis of Chinese Database Product Strategies and Emerging Trends

This article summarizes recent Chinese database product strategy reports, outlining database definitions, management systems, design processes, product classifications, architectural layers, HTAP technology, compression methods, storage index structures, intelligent autonomous optimization, and deployment models, highlighting trends and future directions in the database industry.

AI OptimizationDatabase ArchitectureHTAP
0 likes · 8 min read
Analysis of Chinese Database Product Strategies and Emerging Trends
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Jul 25, 2022 · Big Data

How Top Teams Accelerated Marine Data Compression in the Ship‑Sea Innovation Contest

The inaugural Ship‑Sea Data Intelligent Application Innovation Competition, co‑hosted by Taihu Lab, Huawei and local authorities, challenged participants to compress massive unstructured marine data, and the winning teams revealed novel preprocessing and encoding‑compression pipelines that dramatically improve storage efficiency while preserving data integrity.

AIdata compressionmarine technology
0 likes · 7 min read
How Top Teams Accelerated Marine Data Compression in the Ship‑Sea Innovation Contest
Qunar Tech Salon
Qunar Tech Salon
Mar 7, 2022 · Backend Development

High-Concurrency Flight Pricing System Design and Optimization at Qunar

This article details Qunar's high‑concurrency flight pricing architecture, covering the origin of pricing data, two‑level caching design, a cache‑management system, CQRS‑based policy storage, data compression techniques, and performance improvements achieved through these optimizations.

Backend ArchitectureCQRScaching
0 likes · 20 min read
High-Concurrency Flight Pricing System Design and Optimization at Qunar
Architect's Tech Stack
Architect's Tech Stack
Dec 14, 2021 · Fundamentals

The History and Impact of Jacob Ziv’s Universal Lossless Compression Algorithms

This article chronicles Jacob Ziv’s pioneering contributions to lossless data compression, tracing the evolution from early Morse code through Shannon‑Fano and Huffman to the groundbreaking LZ77 and LZ78 algorithms, and highlights his biography, major awards, and lasting influence on modern digital media and storage technologies.

Jacob ZivLZ77LZ78
0 likes · 9 min read
The History and Impact of Jacob Ziv’s Universal Lossless Compression Algorithms
21CTO
21CTO
Sep 18, 2021 · Fundamentals

10 Essential Algorithms That Shaped Modern Computing

This article introduces ten foundational algorithms—from sorting and Fourier transforms to encryption and graph analysis—explaining their core principles, historical origins, and the pivotal roles they play in today’s software, security, and data processing systems.

AlgorithmsSignal ProcessingSorting
0 likes · 9 min read
10 Essential Algorithms That Shaped Modern Computing
21CTO
21CTO
Jul 24, 2021 · Fundamentals

How Jacob Ziv’s Lempel‑Ziv Algorithms Revolutionized Data Compression

Jacob Ziv, the 90‑year‑old Israeli pioneer honored with the 2021 IEEE Medal of Honor, co‑created the groundbreaking Lempel‑Ziv lossless compression algorithms that underpin modern formats like GIF, PNG and ZIP, and his century‑spanning career reshaped data storage, transmission, and information theory.

Jacob ZivLempel-Zivalgorithm history
0 likes · 10 min read
How Jacob Ziv’s Lempel‑Ziv Algorithms Revolutionized Data Compression
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 12, 2021 · Backend Development

How to Compress Large Java int/long Arrays for Massive Memory Savings

This article explains how to reduce memory usage of massive Java int/long arrays by applying real‑time compression, eliminating redundancy, using indexed buckets, offset storage, and a series of low‑level optimizations that boost TPS from dozens to over a thousand while preserving random‑access capabilities.

ArrayMemory Optimizationdata compression
0 likes · 14 min read
How to Compress Large Java int/long Arrays for Massive Memory Savings
ELab Team
ELab Team
Apr 30, 2021 · Fundamentals

Why Use Lossy Compression? Algorithms, RLE, Huffman, and Compression Bombs

This article explains the principles of data compression, contrasting lossy and lossless methods, introduces common algorithms such as Run‑Length Encoding, dictionary coding, and Huffman coding, and discusses extreme cases like compression bombs, illustrating concepts with examples and visual diagrams.

Huffman codingcompression algorithmsdata compression
0 likes · 9 min read
Why Use Lossy Compression? Algorithms, RLE, Huffman, and Compression Bombs
dbaplus Community
dbaplus Community
Dec 27, 2020 · Big Data

How ClickHouse Powers a 700 B‑Row Real‑Time Data Platform at Ctrip

This article details how Ctrip's senior engineering manager leveraged ClickHouse to build a high‑availability, sub‑second response data platform handling nearly 700 billion rows, describing the motivations, architecture, data synchronization processes, performance gains, challenges, and practical recommendations for large‑scale analytics.

Big DataClickHouseData Architecture
0 likes · 28 min read
How ClickHouse Powers a 700 B‑Row Real‑Time Data Platform at Ctrip
ITPUB
ITPUB
Nov 12, 2020 · Databases

How Vivo Scaled to Billions of Records: Sharding and InnoDB Compression Strategies

This article details how Vivo's cloud service tackled explosive data growth by applying horizontal and vertical sharding, routing‑table based dynamic expansion, and MySQL InnoDB compression, providing step‑by‑step guidance, performance results, and practical recommendations for large‑scale database deployments.

InnoDBPartitioningdata compression
0 likes · 16 min read
How Vivo Scaled to Billions of Records: Sharding and InnoDB Compression Strategies
vivo Internet Technology
vivo Internet Technology
Nov 4, 2020 · Databases

Scaling Vivo Cloud Service Data Storage: Sharding, Partitioning, and InnoDB Compression Strategies

Vivo Cloud Service scaled from millions to tens of millions of users by applying four sharding strategies—horizontal table partitioning, horizontal database splitting, vertical partitioning, and dynamic routing—while enabling InnoDB compression, which halved storage size, kept CPU impact modest, and preserved ample free capacity for future growth.

Cloud ServicesInnoDBScalability
0 likes · 13 min read
Scaling Vivo Cloud Service Data Storage: Sharding, Partitioning, and InnoDB Compression Strategies
Tencent Cloud Developer
Tencent Cloud Developer
Oct 20, 2020 · Databases

ClickHouse: Architecture, Core Features, and Limitations for Interactive Analytics

ClickHouse is a PB‑scale, open‑source columnar OLAP database that uses a ZooKeeper‑coordinated sharded cluster, columnar storage, vectorized execution, advanced compression, data‑skipping indexes, and materialized views to deliver high‑performance interactive analytics, yet it requires manual shard management, lacks a mature MPP optimizer, and handles real‑time single‑row writes poorly.

ClickHouseColumnar StorageMaterialized Views
0 likes · 18 min read
ClickHouse: Architecture, Core Features, and Limitations for Interactive Analytics
ITPUB
ITPUB
Oct 12, 2020 · Databases

Why ClickHouse Outperforms Other Databases: Core Features Unveiled

This article explains how ClickHouse’s column‑oriented storage, vectorized execution engine, rich DBMS capabilities, flexible table engines, and carefully designed distributed architecture enable it to handle massive workloads with sub‑second query latency, making it a standout OLAP solution.

ClickHouseColumnar DatabaseDistributed Systems
0 likes · 29 min read
Why ClickHouse Outperforms Other Databases: Core Features Unveiled
JavaEdge
JavaEdge
Aug 9, 2020 · Fundamentals

When Does Data Compression Boost System Performance? A Deep Dive into Kafka and RocketMQ

This article explains the significance of data compression, outlines when it should be applied, compares lossless algorithms, discusses segment selection, and details how Kafka and RocketMQ implement message compression to improve throughput while balancing CPU, storage, and network resources.

KafkaMessage QueuePerformance Optimization
0 likes · 9 min read
When Does Data Compression Boost System Performance? A Deep Dive into Kafka and RocketMQ
DataFunTalk
DataFunTalk
Jul 18, 2020 · Databases

Core Features and Architecture of ClickHouse: An In‑Depth Overview

This article provides a comprehensive technical overview of ClickHouse, covering its complete DBMS capabilities, column‑oriented storage and compression, vectorized execution engine, relational SQL support, diverse table engines, multi‑master clustering, sharding, and the design philosophies that make it exceptionally fast for large‑scale analytical workloads.

ClickHouseColumnar DatabaseDatabase Architecture
0 likes · 29 min read
Core Features and Architecture of ClickHouse: An In‑Depth Overview
Programmer DD
Programmer DD
Apr 16, 2020 · Fundamentals

10 Game-Changing Algorithms That Shaped Modern Technology

This article explains what an algorithm is, outlines its essential properties, traces its historical roots, and introduces ten pivotal algorithms—from sorting and Fourier transforms to RSA, hash functions, and random number generators—that underpin today’s digital world.

AlgorithmsNetworkingSorting
0 likes · 10 min read
10 Game-Changing Algorithms That Shaped Modern Technology
WeChat Client Technology Team
WeChat Client Technology Team
Apr 3, 2019 · Mobile Development

How We Built a Scalable iOS Memory Monitoring Tool to Tackle FOOM

To combat Foreground Out‑Of‑Memory (FOOM) crashes in iOS, we describe the evolution from a simple FBAllocationTracker‑based logger to a high‑performance monitoring system that collects, stores, compresses, reports, and visualizes allocation data using custom hooks, splay trees, hash‑based stack compression, and selective reporting.

FOOMMemory MonitoringPerformance Optimization
0 likes · 16 min read
How We Built a Scalable iOS Memory Monitoring Tool to Tackle FOOM
Beike Product & Technology
Beike Product & Technology
Feb 7, 2018 · Backend Development

High‑Performance Challenge: Optimizing a C‑Based User Information Service for Tens of Millions of Records

This article recounts a 2017 internal high‑performance competition where a C server handling a 40 million‑row user‑info dataset was progressively optimized through data compression, custom hash tables, memory layout redesign, and a thread‑pool model, ultimately achieving a four‑fold throughput increase.

CMemory Optimizationdata compression
0 likes · 13 min read
High‑Performance Challenge: Optimizing a C‑Based User Information Service for Tens of Millions of Records
vivo Internet Technology
vivo Internet Technology
Sep 8, 2017 · Backend Development

Compressing User Tags and Models with Protostuff and Gzip

By serializing user feature data with Java's Protostuff (built on Protobuf) and then applying JDK Gzip compression before storing it in Redis, the author shrank typical 70 KB per‑user payloads to under 10 KB, enabling billions of records with cross‑language compatibility and no schema‑breakage.

GzipJavaProtostuff
0 likes · 6 min read
Compressing User Tags and Models with Protostuff and Gzip
Meituan Technology Team
Meituan Technology Team
Jul 6, 2017 · Backend Development

Online Feature System: Architecture, Storage, and High‑Concurrency Techniques

Using Meituan’s hotel‑travel platform as a case study, the article details a scalable online feature system architecture that combines layered storage, efficient compression, and robust synchronization to meet extreme concurrency, throughput, terabyte‑scale data, and sub‑10 ms latency demands for AI‑driven strategy services.

Big Datadata compressiondistributed storage
0 likes · 23 min read
Online Feature System: Architecture, Storage, and High‑Concurrency Techniques
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Feb 7, 2017 · Big Data

What’s New in Apache CarbonData 1.0.0? 80+ Features Boost Big Data Performance

Apache CarbonData 1.0.0, now an Apache incubating project, adds over 80 new features and bug fixes—including a new data loading solution, Spark 2.1 integration, update/delete SQL support, adaptive compression for numeric types, B‑Tree LRU cache, V2 format for faster first‑query performance, vectorized reader, bucket‑table joins, off‑heap memory, single‑pass loading, and pre‑generated dictionaries—aimed at delivering faster, more flexible, and efficient columnar storage for big‑data workloads.

Apache CarbonDataBig DataColumnar Storage
0 likes · 8 min read
What’s New in Apache CarbonData 1.0.0? 80+ Features Boost Big Data Performance
Architecture Digest
Architecture Digest
Jul 3, 2016 · Databases

HiStore: A High‑Compression Columnar Database for Massive OLAP Workloads

HiStore is a columnar database developed by Alibaba's middleware team, designed for massive OLAP workloads with high compression ratios, low storage and maintenance costs, supporting ad‑hoc multi‑dimensional queries, knowledge‑grid optimization, efficient data loading, and offering features such as approximate queries and high‑availability clustering.

Columnar DatabaseOLAPdata compression
0 likes · 19 min read
HiStore: A High‑Compression Columnar Database for Massive OLAP Workloads
Architect
Architect
Jun 30, 2016 · Databases

HiStore Columnar Database Overview and Architecture

HiStore, an Alibaba middleware team's columnar database, offers high‑compression, low‑cost OLAP storage for massive datasets, leveraging a knowledge‑grid engine, column‑based storage, and efficient compression to achieve multi‑dimensional ad‑hoc queries with performance gains of up to tens of times over traditional row‑based systems.

HiStoreOLAPdata compression
0 likes · 17 min read
HiStore Columnar Database Overview and Architecture
21CTO
21CTO
Jun 18, 2016 · Databases

Unlock Ultra‑High Compression with HiStore’s Knowledge‑Grid Columnar Database

HiStore, Alibaba’s columnar database built on a patented Knowledge‑Grid, delivers ultra‑high compression (over 10:1, up to 40:1), low‑cost storage, rapid query performance, linear scalability, and seamless MySQL compatibility, making it ideal for massive OLAP workloads and real‑time analytics across diverse industries.

Big DataColumnar DatabaseOLAP
0 likes · 8 min read
Unlock Ultra‑High Compression with HiStore’s Knowledge‑Grid Columnar Database
dbaplus Community
dbaplus Community
May 26, 2016 · Big Data

Mastering Apache Parquet: Columnar Storage, Nested Data, and Performance Gains

This article explains Apache Parquet’s columnar storage format, its support for nested data models, the underlying striping/assembly algorithm, file structure, push‑down optimizations, and performance advantages within the Hadoop ecosystem, providing a comprehensive guide for big‑data practitioners.

Apache ParquetBig DataHadoop
0 likes · 22 min read
Mastering Apache Parquet: Columnar Storage, Nested Data, and Performance Gains
21CTO
21CTO
Oct 21, 2015 · Mobile Development

How to Slash Mobile App Log Reporting Traffic: Practical Optimization Techniques

This article examines why mobile app log reporting consumes excessive traffic and presents concrete methods—such as using concise key‑value URLs, batching uploads, shortening domains and keys, and compressing data—to dramatically reduce bandwidth usage while preserving data timeliness.

HTTPbatch uploaddata compression
0 likes · 7 min read
How to Slash Mobile App Log Reporting Traffic: Practical Optimization Techniques
Qunar Tech Salon
Qunar Tech Salon
Mar 27, 2015 · Fundamentals

Classic Algorithms that Shaped Modern Computing

This article surveys a collection of seminal algorithms—including Huffman coding, public‑key encryption, Dijkstra's shortest‑path, binary search, quicksort, Karatsuba multiplication, Euclid's GCD, Bresenham's line drawing, and the fast inverse square‑root—explaining their origins, principles, and lasting impact on computer science.

AlgorithmsSearchSorting
0 likes · 7 min read
Classic Algorithms that Shaped Modern Computing