A recent report from research firm IDC shows that companies are combining Hadoop with other databases to perform big data analysis. A significant proportion of survey respondents said they use Hadoop to replace traditional data warehouse technologies. Its uses center on the analysis of raw data, whether operational data, data from machines, terminals, or point-of-sale systems, or customer behavior data collected by e-commerce retail systems.
Hadoop was designed specifically for analyzing large data sets and for building scalable, distributed applications. Last October's announcement of Hadoop version 2.2 by the Apache Software Foundation marked the first stable release of the Hadoop 2.x line, a milestone that lets Hadoop move even deeper into organizations to handle unstructured and semi-structured data.