Pervasive’s DataRush Unlocks High Utilization for Hadoop
For all its advantages in Big Data research, Hadoop has one major drawback, it struggles to achieve 20% server utilization, writes Jeff Kelly in his latest Wikibon Peer Incite, “Improving Hardware Efficiency Important to Overall Hadoop ROI”. This makes large, production Hadoop installations running on hundreds of nodes into a drain on IT CapEx. Earlier this summer VMware announced Project Serengeti, including the contribution of code to Apache Hadoop to make HDFS and MapReduce “virtualization aware”, to support Hadoop virtualization to attack this issue. The only problem – each node of a Hadoop cluster needs its own copy of vSphere Enterprise Edition, which adds a major “virtualization tax” to the implementation.
Now Pervasive Software has introduced a potentially less expensive solution based on a Java framework called DataRush that allows multiple Hadoop jobs to run in parallel on commodity hardware in vanilla JVMs, according to Pervasive Senior Director of Business Development and Strategy David Inbar. A second product, RushAnalyzer, speeds up Big Data preprocessing, Inbar says.
Together they abstract away the complexity of parallelizing Hadoop jobs, allow users to monitor IO and CPU use in real time and mitigate memory constraints. Inbar claims server efficiency rates as high as 80%. The products are getting attention from systems integrators and consultancies and, says Kelly, should also be considered along with Project Serengeti by IT groups moving from proof-of-concept trials of Hadoop to large-scale implementations. If Pervasive’s products perform as promised, the savings in hardware costs, power and cooling, and data center floor space can be significant, without the “virtualization tax”.
Since you’re here …
… We’d like to tell you about our mission and how you can help us fulfill it. SiliconANGLE Media Inc.’s business model is based on the intrinsic value of the content, not advertising. Unlike many online publications, we don’t have a paywall or run banner advertising, because we want to keep our journalism open, without influence or the need to chase traffic.The journalism, reporting and commentary on SiliconANGLE — along with live, unscripted video from our Silicon Valley studio and globe-trotting video teams at theCUBE — take a lot of hard work, time and money. Keeping the quality high requires the support of sponsors who are aligned with our vision of ad-free journalism content.
If you like the reporting, video interviews and other ad-free content here, please take a moment to check out a sample of the video content supported by our sponsors, tweet your support, and keep coming back to SiliconANGLE.