Tag

ORC Comparison


Big Data Technology Architecture
Aug 24, 2021 · Big Data

An Overview of Apache Parquet: Architecture, Storage Model, and Comparison with ORC

This article provides a comprehensive introduction to Apache Parquet, covering its origins, columnar storage advantages, nested schema support, internal architecture, storage model components, comparison with ORC, and practical tools for inspecting Parquet files.

Big Data · Hadoop · ORC Comparison
0 likes · 10 min read
Big Data Technology Architecture
May 19, 2020 · Big Data

An Overview of Apache Parquet: Architecture, Features, and Comparison with ORC

Apache Parquet is a language-agnostic columnar storage format for the Hadoop ecosystem. It offers high compression, efficient I/O through column and predicate push-down, nested-structure support, and a three-layer architecture. The article also compares Parquet with ORC and introduces tooling for schema inspection.

Apache Hadoop · Big Data · Data Formats
0 likes · 9 min read