What is Hadoop
The Apache™ Hadoop® project is a very reliable and scalable distributed storage and computing framework. It allows distributed processing of large datasets across clusters of computers using a simple programming model. It is designed to scale from a single server to thousands of machines, each providing local compute and storage.