UPDATED 08:21 EDT / JUNE 13 2012

Your Complete Guide to Hadoop: Issues, Ecosystem and More

Is your company interested in Hadoop? Want to look like an expert to your boss? Wikibon Principal Research Contributor Jeff Kelly has published a comprehensive assessment of the state of Hadoop and the five Hadoop vendors, focusing on the major remaining issues in this still-immature technology rather than on the advantages of Big Data analysis.

The report, “Hadoop: From Innovative Up-Start to Enterprise-Grade Big Data Platform,” begins with a summary of Hadoop’s history and then jumps into a discussion of the four main issues that are keeping it out of data centers. These are mainly a product of the immaturity of this revolutionary technology, and they are not inconsequential. They are:

  1. The single point of failure that is the Hadoop namenode. If it crashes, your Hadoop cluster goes down until the namenode can be rebuilt, which can take hours or days.
  2. The poor integration with traditional relational databases, which makes it hard to combine relational data with nonrelational data to get a true 360-degree view of customers, for instance.
  3. The total unfamiliarity of this radically new technology, which does not use industry standards such as SQL, and the general lack of people trained in MapReduce, HBase, and other Big Data technologies.
  4. The almost total lack of security for Hadoop beyond Kerberos.

These issues are being tackled by both the Apache Open Source community and the five major commercial Hadoop companies: Cloudera, DataStax, EMC Greenplum, Hortonworks, and MapR. Each vendor has its own approach to solving these problems, and each approach is at least partially closed, although every vendor has also donated large amounts of important code to the Apache Hadoop community. Kelly summarizes the main features and issues of each vendor’s products in the second half of the report.

So what doesn’t it have? Mainly, it makes no attempt to discuss the ways in which pioneering companies are leveraging Hadoop and Big Data to achieve competitive advantage. But that side of the coin has been well covered in other articles on Wikibon and SiliconAngle, in coverage by other media, and of course in published use cases from the vendors. This report does not question or invalidate those; it just presents the other side of the coin.

So is Kelly saying that companies should beware of Hadoop? Not at all. All of these issues are being worked on by both the commercial vendors and the Apache Open Source community. He is merely warning companies about the remaining issues they need to plan for before they start experimenting with Hadoop.

Warning: this is not light reading. It is closer to a white paper than a normal Wikibon Peer Incite or Gartner Research Note. And obviously it cannot cover every issue or question. But if you are looking for a good grounding in Hadoop, this is as comprehensive an examination of the issues as they stand today as you are going to get.

Like all Wikibon community research, this report is available in its entirety without charge on the Wikibon.org Web site. Interested parties are invited to read it and to join Wikibon.

