What Is Hadoop?

Apache™ Hadoop® is a reliable, scalable framework for distributed storage and computing. It enables distributed processing of large datasets across clusters of computers using a simple programming model, and it is designed to scale from a single server to thousands of machines, each providing local computation and storage.
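
The "simple programming model" is not named above; in practice it is usually taken to mean MapReduce, where a map step emits key/value pairs and a reduce step combines all values that share a key. The sketch below illustrates that idea with a word count in plain Python on a single machine. It does not use any Hadoop API, and the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical names chosen for illustration only.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group emitted values by key, as a framework would do between map and reduce."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence on one machine (illustration only)."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["big data big clusters", "data moves to compute"]
    print(word_count(sample))
```

On a real cluster the map and reduce steps would run in parallel on many machines, each working on the data stored locally, which is what lets the same model scale from one server to thousands.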

Hadoop Latest Version

Release 3.2.3, available since March 28, 2022
Download: .tar.gz | src

Installation and Setup

For Single Node Setup, click here
For Cluster Setup, click here

Hadoop Documentation

Full Documentation