Tag

data compression

0 views collected around this technical thread.

JD Retail Technology
JD Retail Technology
Apr 8, 2025 · Databases

ClickHouse Architecture and Core Technologies Overview

ClickHouse is an open‑source, massively parallel, column‑oriented OLAP database that integrates its own columnar storage, vectorized batch processing, pre‑sorted data, diverse table engines, extensive data types, sharding with replication, sparse primary‑key and skip indexes, and a multithreaded query engine, delivering high‑throughput real‑time analytics on massive datasets.

Big DataClickHouseOLAP
0 likes · 15 min read
ClickHouse Architecture and Core Technologies Overview
Amap Tech
Amap Tech
Apr 7, 2025 · Mobile Development

Gaode Uninhabited Area Project: Satellite Rescue and Offline Navigation Technologies

Gaode’s Uninhabited Area project introduces Satellite Rescue, which transmits encrypted, natural‑language emergency messages via satellite and a rescue chatroom to boost response speed, and Offline Navigation, employing quadtree indexing, predictive caching and seamless mode switching to guide users through network‑dead zones, proven in harsh Inner Mongolian terrains.

data compressionemergency rescuemobile app
0 likes · 8 min read
Gaode Uninhabited Area Project: Satellite Rescue and Offline Navigation Technologies
Model Perspective
Model Perspective
Feb 26, 2025 · Artificial Intelligence

How Do Large Language Models Compress Massive Data? Limits and Techniques

This article explains how large language models act like a super‑library by compressing vast amounts of text using information‑theoretic concepts, probability‑based coding, autoregressive neural networks, and arithmetic coding, while discussing accuracy, compression ratios, and theoretical limits.

AIarithmetic codingautoregressive networks
0 likes · 8 min read
How Do Large Language Models Compress Massive Data? Limits and Techniques
vivo Internet Technology
vivo Internet Technology
Dec 4, 2024 · Databases

OceanBase Implementation and Migration Practices at vivo

vivo migrated five 20‑TB MySQL clusters to OceanBase using OCP, oblogproxy, and OMS, eliminating sharding costs, achieving over 70% storage savings, improving consistency and performance, and leveraging native distributed architecture, tenant isolation, and strong compression to support scalable, reliable operations.

Distributed DatabaseMySQLOceanBase
0 likes · 16 min read
OceanBase Implementation and Migration Practices at vivo
macrozheng
macrozheng
Oct 7, 2024 · Game Development

How 128KB Powered Classic Games: Secrets of Retro Graphics, Audio, and Code

This article explores why 1980s games like Contra could deliver rich graphics, music, and gameplay within just 128KB by using tile-based graphics, specialized audio chips, and ultra‑compact code, contrasting those techniques with modern developers' expectations of massive storage.

audio synthesiscode optimizationdata compression
0 likes · 7 min read
How 128KB Powered Classic Games: Secrets of Retro Graphics, Audio, and Code
Baidu Tech Salon
Baidu Tech Salon
Jul 23, 2024 · Artificial Intelligence

Linear Algebra Fundamentals and PaddlePaddle Applications

The article reviews core linear algebra concepts—vectors, matrices, eigenvalues, and transformations—and demonstrates how PaddlePaddle’s paddle.linalg API enables practical tasks such as least‑squares regression, image compression via SVD, PCA‑based dimensionality reduction, and broader machine‑learning, graphics, cryptography, and optimization applications.

PCAPaddlePaddleSVD
0 likes · 10 min read
Linear Algebra Fundamentals and PaddlePaddle Applications
vivo Internet Technology
vivo Internet Technology
Jun 19, 2024 · Big Data

Understanding BitMap and Roaring BitMap: Principles, Containers, and Java API Usage

The article explains BitMap fundamentals and introduces Roaring BitMap’s compressed container architecture—Array, BitMap, and Run containers—detailing their conversion logic, Java implementation snippets, performance advantages over traditional BitSets, and practical API usage for high‑performance, memory‑efficient big‑data applications.

Big DataBitmapJava
0 likes · 18 min read
Understanding BitMap and Roaring BitMap: Principles, Containers, and Java API Usage
Tencent Cloud Developer
Tencent Cloud Developer
Apr 16, 2024 · Databases

Major Updates and New Features of WCDB 2: Multi‑language Support, SQL Builder, Backup & Recovery, Data Migration, Compression, and Performance Optimizations

WCDB 2 introduces multi-language support (C++, Java, Kotlin), a rewritten SQL builder, fast backup and recovery with sub‑second times, incremental WAL‑based backup, automatic data migration, Zstandard compression, on‑the‑fly column addition, and performance enhancements such as an FTS5 optimizer and interruptible transactions, delivering a robust high‑performance mobile database solution.

SQL BuilderSQLiteWCDB
0 likes · 47 min read
Major Updates and New Features of WCDB 2: Multi‑language Support, SQL Builder, Backup & Recovery, Data Migration, Compression, and Performance Optimizations
macrozheng
macrozheng
Feb 18, 2024 · Game Development

How 128KB Powered Classic Games: Secrets of Retro Game Development

This article explores why 1980s games like Contra could deliver rich graphics, music, and gameplay within just 128KB by examining data size perception, tile‑based graphics, limited audio chips, and ultra‑compact code techniques that contrast sharply with modern app bloat.

audio chipdata compressiongame development
0 likes · 7 min read
How 128KB Powered Classic Games: Secrets of Retro Game Development
Architect
Architect
Feb 1, 2024 · Backend Development

Design and Optimization of Trace2.0: A High‑Performance Backend Tracing System

Trace2.0 is an OpenTelemetry‑based application monitoring system that processes petabyte‑scale trace data using multi‑channel client protocols, gRPC, load‑balancing optimizations, ZSTD compression, Kafka pipelines, ClickHouse storage, and a JDK 21 upgrade with virtual threads, achieving significant performance and cost improvements.

ClickHouseJDK21Kafka
0 likes · 15 min read
Design and Optimization of Trace2.0: A High‑Performance Backend Tracing System
Efficient Ops
Efficient Ops
Jan 17, 2024 · Operations

How We Built a Scalable Cloud‑Native Log Center with ClickHouse

This article details a courier company's evolution from a simple file‑based logging system to a cloud‑native log center, describing the limitations of the original architecture, the migration to an ELK stack, subsequent challenges, and the final redesign using ClickHouse for high compression, low cost, and improved query performance.

ClickHouseELKLog Management
0 likes · 12 min read
How We Built a Scalable Cloud‑Native Log Center with ClickHouse
JD Tech
JD Tech
Dec 15, 2023 · Databases

Systematic Approach to Reducing MySQL Database Storage Costs

This article presents a comprehensive, step‑by‑step methodology for cutting MySQL storage expenses—including background analysis, challenge identification, a nine‑grid systematic framework, benefit calculation, safety and stability verification, rollout, gray‑release, and rollback—demonstrating over 50% disk space reduction while managing risk.

Cost ReductionDatabase OptimizationMySQL
0 likes · 14 min read
Systematic Approach to Reducing MySQL Database Storage Costs
DataFunTalk
DataFunTalk
Aug 21, 2023 · Databases

Case Study: Building a Real‑Time Log Data Analysis Platform with Apache Doris at China Unicom

This article describes how China Unicom’s Western Innovation Research Institute designed and deployed a centralized, real‑time log analytics platform using Apache Doris, detailing the migration from Hive and ClickHouse, performance optimizations, storage cost reductions, and the resulting improvements in data ingestion, query speed, and operational efficiency.

Apache DorisBig DataCold‑Hot Data Management
0 likes · 18 min read
Case Study: Building a Real‑Time Log Data Analysis Platform with Apache Doris at China Unicom
Amap Tech
Amap Tech
Aug 1, 2023 · Databases

Gaode’s Adoption of OceanBase: Architecture, Practices, and Future Roadmap

Gaode migrated its core navigation, traffic, and financial services to OceanBase, leveraging Paxos‑based multi‑replica consistency, LSM‑Tree storage, and distributed transactions, selecting multi‑point write for massive sync workloads and central‑write for latency‑critical queries, achieving sub‑millisecond latency, significant storage savings, and a roadmap toward broader cost‑effective, serverless deployment.

Distributed DatabaseOceanBasecloud-native
0 likes · 58 min read
Gaode’s Adoption of OceanBase: Architecture, Practices, and Future Roadmap
Python Programming Learning Circle
Python Programming Learning Circle
Mar 29, 2023 · Game Development

How 128KB Games Delivered Rich Graphics and Audio: Tile Graphics, Memory Constraints, and Code Efficiency

This article explains how early 1980s games like Contra fit extensive storylines, graphics, and music into just 128KB by using tile‑based graphics, minimal audio data, and highly optimized code, illustrating the dramatic shift in data expectations as technology advances.

Memory Optimizationaudio chipdata compression
0 likes · 6 min read
How 128KB Games Delivered Rich Graphics and Audio: Tile Graphics, Memory Constraints, and Code Efficiency
DaTaobao Tech
DaTaobao Tech
Oct 18, 2022 · Frontend Development

Client‑Side Interface Performance Optimization: Analysis and Solutions

Client‑side developers can boost API speed and user experience by collaborating with back‑end teams to cut server processing and network latency, using caching, concurrency, smaller or UDP‑based packets, and aggressively pruning or compressing payloads, which can shrink megabyte responses to tens of kilobytes and shave dozens of milliseconds off latency.

API optimizationCachingPerformance
0 likes · 9 min read
Client‑Side Interface Performance Optimization: Analysis and Solutions
Top Architect
Top Architect
Oct 10, 2022 · Databases

Row vs Column Storage: Comparison, Advantages, and Experimental Evaluation in openGauss

This article explains the fundamental differences between row‑based and column‑based storage in databases, compares their write and read performance, lists pros and cons, presents an openGauss experiment with sample tables, and offers practical guidance on when to choose each storage model.

Database PerformanceOpenGaussSQL
0 likes · 9 min read
Row vs Column Storage: Comparison, Advantages, and Experimental Evaluation in openGauss