Install Hadoop on Windows 10: Hadoop is a framework based on the Java programming language and open source. This Hadoop is useful for processing a number of data distribution using the data storage capability that is also distributed.
Hadoop can store any amount of data and can process it using the MapReduce programming model. Hadoop has also been designed to be able to overcome a problem resulting from the failure of a hardware function that can prevent loss of work processes and some data.
Hadoop will cut data to be distributed to each node in a cluster so that later it will be processed in a parallel or local manner on each node that are interrelated.
How to Install Hadoop on Windows 10
Hadoop is often used to handle a number of BigData processes. In this article, you will learn how to Install Hadoop on Windows 10. However, before that you have to download the components that will be used later when installing Hadoop on Windows 10.
Here are the components that you must download first before installing Hadoop on Windows 10.
- Download and install the JDK or the Java Development Kit first. You can download it at the following link: https://www.oracle.com/java/technologies/javase/javase-jdk8-downloads.html
- Download Apache Hadoop at the following link: https://www.apache.org/dyn/closer.cgi/hadoop/common/hadoop-2.9.2/hadoop-2.9.2.tar.gz
- Download the Hadoop Patch File for Windows operating systems at the following link: https://github.com/wmwijaya/hadoop2-standalone-windows7-windows10
Well, if you have downloaded several files above, then you can do the steps or how to install Hadoop in Windows 10 as described below.
- Download the Apache Hadoop file on the link above, then extract the file. Go to the bin folder and select the folder location.
- Open the control panel menu > System and Security> System> Advanced system settings . The System Properties box will appear and click Environment Variables.
- After that in the Variables, the user clicks the New button and enter the name Variable JAVA_HOME and paste the bin folder location link earlier in the variable value. Then click OK.
- Next to System Variables, look for the Path variable and click Edit. Click the New button and paste the bin folder location link. After that click OK.
- Enter the etc folder and then enter the hadoop folder. Open the core-site.xml, hadoop-env.cmd, hdfs-site.xml, yarn-site.xml, mapred-site.xml files using notepad or notepad ++.
- In the core-site.xml file write the script shown in the red box below. After that click the Save button.
- In the mapred-site.xml file write the script shown in the red box below. After that click the Save button.
- In the yarn-site.xml file write the script shown in the red box below. After that click the Save button.
- In the hadoop folder, create a new folder called datanode and namenode . After that, copy the link location of the two folders.
- Back again to Notepad ++. Open the hdfs-site.xml file and write the script as shown below. And remember, in the value tags Namenode and Datanode, fill with the link location of the Namenode and Datanode folder.
- In the hadoop-env.cmd file, change% JAVA_HOME% to the JDK folder location link.
- Reopen Environment Variables, and in the use variables click the New button. After that, type variable name HADOOP_HOME and fill in the variable value with the bin folder link in the hadoop folder. Then click OK.
- In System Valiables click Path and click the Edit button. After that click the New button and paste the bin folder link.
- Click New again and paste the sbin folder link inside the hadoop folder. After that click OK.
- Extract the hadoop patch file that you previously downloaded and then enter the bin menu. Copy the entire contents of the bin difolder file in the hadoop patch file and paste it into the bin folder in the hadoop file. Previously, delete all files in the bin folder in the hadoop folder. Well, then the contents of the bin folder in the hadoop file will be filled with files in the bin folder belonging to the hadoop patch file.
- Open the Command prompt and type the command hdfs namenode-format. then look at the results, Startup_msg will display the Starting NameNode.
- Then you enter the hadoop folder and then enter the sbin folder again in CMD. After that, type start-all.cmd. Then the result will be like this.
Thus the article that explains about how to install Hadoop on Windows 10. Hopefully, this article can give you benefits.