What is Hadoop?
It is a word which is used very often nowadays in the IT as well as some other industry sectors. It is often used in connection with Big Data. But very few know how it is related to Big Data.
Big data is the voluminous heterogeneous data in terabytes and Hadoop is a framework (not a single technology or a product) to process efficiently Big Data. Hadoop is a part of an Ecosystem to handle big data which comprises of various other supporting technologies and products.
Hadoop is an Apache open source framework written in java which has the potential to manage thousands of terabytes of data. It has quickly developed as basis for big data processing tasks like statistical analytics, business planning and processing huge volumes of data from sensors including IoT sensors. Hadoop is designed to scale up from single server to thousands of machines, each offering local computation and storage.
The main four modules of Hadoop Framework
The other softwares which are part of the Hadoop Ecosystem
Advantages of Hadoop