Yearly Archive: 2014
Hadoop Configuration I have to do in the following layers. HDFS Layer NameNode-Master DataNode-Store Data(Actual Storage) MapReduce Layer JobTracker TaskTracker Secondary Namenode– storing backup of NameNode it will not work as an alternate namenode, it …
This chapter explains how to set up Hadoop to run on a cluster of machines. Running HDFS and MapReduce on a single machine is great for learning about these systems, but to do useful work …
Software Required- Setup Virtual Machine Step1. >goto traffictool.net->goto ubuntu(Ubuntu1404)->download it->extract it Step2. Suppose your directory after extract it “D:personal datahadoopUbuntu1404” Step3. >goto google->search VMWARE PLAYER->goto result select DESKTOP & END USER->download it->install it Step4. After …
Enabling Linux Automatic Password-less SSH Login Automatic passwrod-less ssh login can make our life easier. To enable this, we need to copy our SSH public keys to the remote machines for automatic password-less login. We …
JobTracker and TaskTracker are coming into picture when we required processing to data set. In hadoop system there are five services always running in background (called hadoop daemon services). Daemon Services of Hadoop- Namenodes Secondary …
Hi in this hadoop tutorial we will describing now HDFS Architecture. There are following are two main components of HDFS. Main Components of HDFS- NameNodes master of the system maintain and manage the blocks which …
What is HDFS? HDFS is a file system designed for storing very large files with streaming data access patterns, running on clusters on commodity hardware. Highly fault-tolerant “Hardware failure is the norm rather than the …