Hadoop, that ecosystem of open-source tools for storing and analyzing large quantities and many kinds of data, is spawning more and more companies. Some offer commercial support or consulting assistance for Hadoop. Others provide analytics software for understanding data sitting in Hadoop. But Hadoop itself is free.
Still, being free doesn’t make it perfect. Data analytics company 1010data commissioned a survey this year of 158 U.S. executives, and the results identify shortcomings of Apache Hadoop’s latest big release.
Hadoop 2.0, which became generally available in October, includes resource-management software called YARN that lets engineers run multiple kinds of Hadoop jobs on the same bunch of servers. The 2.0 edition also comes with a way to help the service survive if a key physical server in the cluster called a NameNode goes down.