
Weekly Knowledge Summary: YARN Resource Scheduler, Hadoop Rack Awareness, HDFS Data Flow, and Small File Solutions

This weekly note shares personal updates and a concise technical overview of YARN resource scheduling, Hadoop rack awareness, HDFS data flow, and practical solutions to the HDFS small-file problem, along with links to further reading and next week's work plan.

Big Data Technology & Architecture

From: Wang Zhiwu To: Friends

This week, following the Qingming holiday, performance reviews wrapped up, which leaves me more room for reflection despite a busy schedule.

Yesterday I attended a live internal broadcast by Teacher Ma, who addressed hot topics such as the 996 overtime issue and future industry trends; I will share my own thoughts later.

Our group chat has become increasingly active, with many members asking technical and learning questions. I apologize for my delayed replies, which are due to a heavy workload, and I encourage everyone to keep sharing knowledge and interview experiences in the group.

Weekly Knowledge Points

01 YARN Resource Scheduling System

This article introduces the background, architecture, and principles of YARN. It recommends the CDH 2.7.x version for production environments, and more curated resources will follow.

02 Hadoop Rack Awareness

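HDFS uses a rack-aware replica placement strategy to improve data reliability, availability, and network bandwidth utilization. The NameNode resolves each DataNode's rack ID through a configurable topology mapping (commonly a script) and keeps the result in its NetworkTopology tree, which it consults when placing replicas.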
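To make the idea of a rack ID more concrete, here is a minimal sketch of how distance between nodes can be measured in a tree-shaped topology. It is a toy model, not Hadoop's NetworkTopology class: the function name `network_distance` and the paths such as `/rack1/dn1` are made up for illustration, and the sketch assumes a simple two-level layout (rack, then node).

```python
# Sketch only: a toy model of rack-aware distance, not Hadoop's own code.
# Paths like "/rack1/dn1" are hypothetical rack/node locations.

def network_distance(path_a: str, path_b: str) -> int:
    """Count the tree hops between two nodes given their topology paths."""
    a = path_a.strip("/").split("/")
    b = path_b.strip("/").split("/")
    # The length of the shared prefix is the depth of the closest common ancestor.
    common = 0
    for x, y in zip(a, b):
        if x != y:
            break
        common += 1
    # Hops up from each node to the common ancestor.
    return (len(a) - common) + (len(b) - common)


print(network_distance("/rack1/dn1", "/rack1/dn1"))  # 0: same node
print(network_distance("/rack1/dn1", "/rack1/dn2"))  # 2: same rack
print(network_distance("/rack1/dn1", "/rack2/dn3"))  # 4: different racks
```

A smaller distance means less cross-rack traffic, which is why a rack-aware policy tends to keep some replicas close together while still placing at least one on another rack for fault tolerance.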

03 HDFS Data Flow

This section explains the read/write process of HDFS, a common interview topic.

04 HDFS Small‑File Problem Solutions

Small files are a frequent problem in HDFS clusters; the two articles below discuss ways to address it:

https://dwz.cn/M4GpAe6e

https://dwz.cn/2ij0hqcE
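As a rough illustration of why small files hurt, the sketch below estimates NameNode heap usage for the same amount of data stored as many small files versus fewer large ones. It rests on assumptions that are not from the linked articles: the commonly quoted rule of thumb of about 150 bytes of NameNode memory per namespace object (file or block), one block per file, and a 128 MiB block size. The function name `namenode_heap_bytes` is hypothetical.

```python
# Sketch only: back-of-the-envelope NameNode memory estimate.
# Assumption: ~150 bytes of heap per namespace object (a rule of thumb,
# not an exact figure; real usage varies by version and configuration).
BYTES_PER_OBJECT = 150

def namenode_heap_bytes(num_files: int, blocks_per_file: int = 1) -> int:
    """Estimate heap as one object per file plus one object per block."""
    return num_files * (1 + blocks_per_file) * BYTES_PER_OBJECT


TIB = 1024 ** 4
MIB = 1024 ** 2

# The same 1 TiB of data stored two ways (hypothetical file sizes).
small = namenode_heap_bytes(TIB // (1 * MIB))    # 1 MiB files
large = namenode_heap_bytes(TIB // (128 * MIB))  # 128 MiB files

print(small)           # 314572800 bytes, roughly 300 MiB
print(large)           # 2457600 bytes, roughly 2.3 MiB
print(small // large)  # 128
```

Under these assumptions the small-file layout needs about 128 times more NameNode memory for identical data, which is the motivation for merging or archiving small files.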

Next Week's Work Plan

The May 1st holiday is coming up and will be a relaxed period, but knowledge sharing continues: two articles on thinking and growth are planned, along with discussions of national trends such as city-level household registration reforms and talent policies.

Likes and shares are the greatest support. Thank you!

Written by

Big Data Technology & Architecture

Wang Zhiwu, a big data expert, dedicated to sharing big data technology.
