It executes the filesystem namespace operations like opening, closing, renaming files and directories, etc.Stores meta-data about blocks of a file, blocks location, permissions, etc.Hence, it should be configured on high-end machines. It stores filesystem meta-data in the memory for fast retrieval. NameNode manages the filesystem namespace. NameNode is a master node in the Hadoop HDFS.
The two daemons that are NameNode and the ResourceManager run on the master node. Master in the Hadoop Cluster is a high power machine with a high configuration of memory and CPU. It consists of the master node, slave nodes, and the client node. The Hadoop Cluster follows a master-slave architecture. Let us now study the Architecture of Hadoop Cluster. Thus, when there is a need to process queries on the huge amount of data, the cluster-wide latency is minimized. Hadoop Clusters are also known as Shared-nothing systems because nothing is shared between the nodes in the cluster except the network bandwidth. It is a computational cluster designed for storing as well as analyzing huge amounts of unstructured or structured data in a distributed computing environment. Hadoop Cluster is just a computer cluster used for handling a vast amount of data in a distributed manner. Nodes are nothing but a point of connection/intersection within a network.Ī computer cluster is a collection of computers connected with a network, able to communicate with each other, and works as a single system. Keeping you updated with latest technology trends, Join TechVidvan on Telegram What is a Cluster?Ī Cluster is a collection of nodes.