Tagged articles
5 articles
Page 1 of 1
IT Services Circle
IT Services Circle
May 6, 2026 · Fundamentals

Why Does a .tar.gz File Have Two Extensions?

The article explains that the .tar.gz suffix reflects two separate Unix tools—tar for archiving and gzip for compression—combined via a pipeline, tracing their historical origins, design philosophy, and why this dual‑extension format remains prevalent today.

GzipUnixarchiving
0 likes · 10 min read
Why Does a .tar.gz File Have Two Extensions?
Big Data Technology & Architecture
Big Data Technology & Architecture
Dec 9, 2020 · Big Data

Handling Small Files in Hive: Configuration, Compression, and File Format Optimization

The article explains why Hive tables generate many small files on HDFS, describes the performance impact on NameNode and MapReduce, and provides detailed configuration steps and compression techniques—including input and output file merging, various Hive file formats, and partition optimization—to efficiently manage storage and resource consumption in big‑data environments.

HadoopHiveSmall Files
0 likes · 19 min read
Handling Small Files in Hive: Configuration, Compression, and File Format Optimization