Tagged articles
7 articles
DataFunSummit
May 10, 2026 · Big Data

How Lance File Format v2.2 Accelerates, Cuts Costs, and Governs Multimodal Data

Lance File Format v2.2 tackles the AI data explosion with a hundred‑fold improvement in random‑read performance, two‑layer compression, zero‑cost schema evolution, Git‑style versioning, and external blob handling. Its roadmap targets native media support and intelligent encoding, positioning the format as core infrastructure for large‑scale multimodal workloads.

Data Governance · File Format · IO performance
0 likes · 14 min read
vivo Internet Technology
Mar 26, 2025 · Big Data

Reading Encrypted ORC Files in StarRocks: Architecture and Implementation Details

The article details how StarRocks extends the Apache ORC C++ library to decrypt column‑level encrypted ORC files. It covers the file hierarchy, AES‑128‑CTR key handling, query‑time master‑key retrieval, a decorator‑based decryption and decompression pipeline, and the block‑skip‑read mechanism that enables efficient predicate push‑down.

Big Data · File Format · ORC
0 likes · 19 min read
Alibaba Cloud Developer
Sep 9, 2020 · Fundamentals

How Streaming ZIP Decompression Cuts Load Times by 50%

Streaming ZIP decompression extracts files while the archive is still downloading, so the client never has to fetch the entire archive first. The technique relies on Local File Header metadata and supports both traditional and AES encryption, and it sharply reduces latency, as demonstrated by Youku’s 0.91‑second load for 30 MB books.

File Format · Streaming Decompression · encryption
0 likes · 11 min read
Liangxu Linux
Aug 31, 2020 · Fundamentals

Understanding PNG: Types, Data Structure, and Compression Mechanics

This article explains the PNG image format: its full name, lossless compression, size advantages, and transparency support; the three main PNG types; the file’s chunk‑based data structure and hexadecimal header; the factors that affect compression efficiency; and the two‑stage prediction‑and‑deflate compression process.

DEFLATE · Data Structure · File Format
0 likes · 10 min read