UPDATED 01:44 EDT / JUNE 04 2012

NEWS

Cascading 2.0: An Application Framework for Hadoop Winning the Attention of Twitter, Etsy and EMC

Cascading is an open-source application framework getting the attention of Twitter, Etsy, Airbnb and big data analytics companies such as EMC Greenplum and MapR. The popularity stems from its ability to abstract the complexities of MapReduce and make Hadoop clusters easier to manage.

Today Concurrent announced Cascading 2.0, an enterprise-grade development platform designed for Java developers to build big data applications on top of Hadoop.

The complexity of MapReduce makes deploying big data apps a time-consuming endeavor with multiple opportunities for error. With Cascading 2.0, data scientists and developers use high-level scripting languages and open APIs to process and integrate data and to schedule jobs on complex Hadoop clusters.

Cascading reminds me a bit of platforms such as Yahoo! Pipes, which aggregates RSS feeds, Web pages and other data sources. It pipes that data into Web-based applications that publish information to the Web.

Cascading follows similar principles. According to Wikipedia, Cascading users create descriptions of processes that often consist of business logic. Data is captured from different sources and run through pipes that use algorithms to process the data. Pipes are built independently from the data they will process. Once the pipes are tied to the data sources and “sinks,” the user can create flows that may be grouped into a “cascade.” These cascades run through a process scheduler so the clusters can be more easily managed.
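To make the source, pipe, sink, flow and cascade vocabulary concrete, here is a minimal conceptual sketch in plain Python. It is not the Cascading API and does not touch Hadoop: the class names `Pipe`, `Flow` and `Cascade` are borrowed from the description above, while every method, helper function and sample record is a hypothetical stand-in chosen for illustration. It also simply runs flows in the order listed, whereas the description above has cascades going through a process scheduler.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List

Record = Dict[str, object]
Operation = Callable[[Iterable[Record]], Iterable[Record]]


@dataclass
class Pipe:
    """A named chain of operations, defined without reference to any data."""
    name: str
    operations: List[Operation] = field(default_factory=list)

    def then(self, op: Operation) -> "Pipe":
        # Return a new pipe with one more processing step appended.
        return Pipe(self.name, self.operations + [op])

    def apply(self, records: Iterable[Record]) -> Iterable[Record]:
        for op in self.operations:
            records = op(records)
        return records


@dataclass
class Flow:
    """A pipe bound to a concrete source (callable) and sink (list)."""
    source: Callable[[], Iterable[Record]]
    pipe: Pipe
    sink: List[Record]

    def run(self) -> None:
        self.sink.extend(self.pipe.apply(self.source()))


@dataclass
class Cascade:
    """A group of flows; this sketch just runs them in the order given."""
    flows: List[Flow]

    def run(self) -> None:
        for flow in self.flows:
            flow.run()


# Hypothetical operations (the "business logic" carried by the pipes).
def split_words(records):
    for r in records:
        for w in r["line"].split():
            yield {"word": w.lower()}


def count_words(records):
    counts = {}
    for r in records:
        counts[r["word"]] = counts.get(r["word"], 0) + 1
    return [{"word": w, "count": c} for w, c in sorted(counts.items())]


def keep_repeated(records):
    return [r for r in records if r["count"] > 1]


# Pipes are built first, independently of any data.
word_count_pipe = Pipe("wordcount").then(split_words).then(count_words)
repeated_pipe = Pipe("repeated").then(keep_repeated)

# Only now are they tied to sources and sinks to form flows.
lines = [{"line": "big data on Hadoop"}, {"line": "Big apps on Hadoop"}]
counts_sink: List[Record] = []
repeated_sink: List[Record] = []

cascade = Cascade([
    Flow(lambda: lines, word_count_pipe, counts_sink),
    # The second flow reads what the first flow wrote to its sink.
    Flow(lambda: counts_sink, repeated_pipe, repeated_sink),
])
cascade.run()
```

After `cascade.run()`, `counts_sink` holds one record per distinct word and `repeated_sink` holds only the words seen more than once. The point of the sketch is the separation: the two pipes could be rebound to different sources and sinks without changing their logic.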

Developers program in JVM-based languages and do not need to learn MapReduce. That in itself can make it far easier to deploy big data apps.

As a result, Cascading 2.0 is getting more attention from companies like EMC that are investing heavily in big data. EMC Greenplum is distributing Cascading as part of its Greenplum MR distribution, and plans to increase integration and support with other offerings in the future.

 

