Data Artisans offers smoother on-ramp to stream processing with Apache Flink
Organizations that want to implement stream processing are about to get another boost with a new release of the commercial version of Apache Flink from data Artisans GmbH, a company founded by Flink’s creators.
Apache Flink was created to replace the MapReduce component in the big-data software framework Hadoop with a faster engine that can handle high flows of real-time data, such as credit card activity monitoring, machine learning and business intelligence. Flink has been overshadowed by Apache Spark and its Spark Streaming extension, but Spark isn’t a true streaming engine. Rather, it processes streaming data in small chunks, a process called “micro-batching.”
The distinction is important because micro-batching is stateless, meaning that only simple processes can be performed on data, such as transforming or storing it. Stateful platforms like Flink enable computation to be performed on the stream, a key capability for real-time analytics. “Stateless stream processing is simple. Stateful processing is a lot harder,” said Stephan Ewen, data Artisans’ chief technology officer.
Berlin-based data Artisans debuted its first fully supported commercial distribution of Flink, called dA Platform, a year ago a few months after raising $6 million in initial funding. With the release of the new dA Platform 2, the company has added features to reduce the cost and trouble involved in implementing Flink widely across an organization.
“Last year we worked with a good number of streaming users and observed the same pattern over and over again,” Ewen said. “It’s easy to get started with Flink, but difficult to perform upgrades without carrying forward all the previous apps. They don’t have the manpower to implement all the infrastructure pieces.”
The highlight of the new platform is Application Manager, a package of tools that streamlines the process of deploying and maintaining real-time streaming applications in production. Developers can specify processes such as updating, testing and committing code through integration with popular automation platforms such as Jenkins and Puppet.
Flink provides out-of-the-box tools for automating processes like starting and stopping jobs and triggering snapshots, but they require considerable skill to use, Ewen said. “We’re raising the abstraction level so that all you worry about is moving the application to a newer version or a different cluster,” he said.
The Application Manager also enables Flink to be integrated with popular open-source logging and metric systems such as Grafana and Logstash, as well as container orchestration platforms such as Kubernetes. The company provides application programming interfaces that can be used to integrate with other platforms as well.
DA Platform 2 is available now through an early access program. Production availability scheduled for October. Pricing is either on a per-node or annual license basis, but specifics weren’t announced.
Image: Flickr CC
Since you’re here …
… We’d like to tell you about our mission and how you can help us fulfill it. SiliconANGLE Media Inc.’s business model is based on the intrinsic value of the content, not advertising. Unlike many online publications, we don’t have a paywall or run banner advertising, because we want to keep our journalism open, without influence or the need to chase traffic.The journalism, reporting and commentary on SiliconANGLE — along with live, unscripted video from our Silicon Valley studio and globe-trotting video teams at theCUBE — take a lot of hard work, time and money. Keeping the quality high requires the support of sponsors who are aligned with our vision of ad-free journalism content.
If you like the reporting, video interviews and other ad-free content here, please take a moment to check out a sample of the video content supported by our sponsors, tweet your support, and keep coming back to SiliconANGLE.