Tagged articles
1 articles
Page 1 of 1
Big Data Technology & Architecture
Big Data Technology & Architecture
Sep 1, 2021 · Big Data

Understanding Hadoop Data Splitting and InputFormat Mechanisms

This article explains Hadoop's data splitting concepts, the distinction between HDFS blocks and logical InputSplits, details the source code of various InputFormats such as TextInputFormat, CombineTextInputFormat, KeyValueTextInputFormat, NLineInputFormat, and custom InputFormats, and provides complete Java examples for Mapper, Reducer, and driver classes.

Data SplittingHadoopInputFormat
0 likes · 24 min read
Understanding Hadoop Data Splitting and InputFormat Mechanisms