Tag

Apache Hadoop


Architects Research Society
Jun 26, 2021 · Big Data

Comprehensive Overview of Over 50 Big Data Terms and Technologies

This article presents a glossary of more than fifty big-data terms, including Apache projects, data-analysis methods, storage formats, AI-related terms, and emerging metrics, with a concise English explanation of each.

Apache Hadoop · Big Data · Data Analytics
0 likes · 17 min read
Big Data Technology Architecture
May 19, 2020 · Big Data

An Overview of Apache Parquet: Architecture, Features, and Comparison with ORC

Apache Parquet is a language-agnostic, columnar storage format for the Hadoop ecosystem. It offers high compression, efficient I/O through column projection and predicate push-down, support for nested structures, and a three-layer architecture. The article also compares Parquet with ORC and covers tooling for schema inspection.

Apache Hadoop · Big Data · Data Formats
0 likes · 9 min read