Why “Science” is an Important Word in the World of Big Data
Some observations from conference organizers Alistair Croll and Ed Brill on theCUBE last week at the Strata Conference:
Once users start using Hadoop, it is often the unexpected things that create value, which is why the word “scientist” is important. Using today’s big data tools requires you to explore. That is true of Hadoop as an analytics engine with MapReduce, and it is just as true of data visualization and of harnessing data streams to create algorithms.
But this also poses a threat to people with domain expertise. In an interview with John Furrier, Croll said that he had conversations with people at Strata about this idea that domain expertise gets trumped by data. He cited Moneyball, the book and film about the Oakland A’s and the data jock who helped the team discover a group of players who went on to help the team have a fabulous season. The baseball scouts represented the domain experts. They had an institutionalized system for picking players. The scouts had already made up their minds what they wanted. The data jock had no such preconceptions. He used data to help find the best players. The results spoke for themselves.
Services Angle
The players in the big data game get this concept of discovery. I’d say every student of the Web gets this, too. Have you ever built a blog? It’s a discovery process. Feed readers? When introduced about ten years ago, they provided a window into a new world for people. Data was discoverable in new ways and people began to use those feeds to make new apps. Today we have millions of available feeds and data streams from sensors and machines that developers experiment with to create new apps. APIs have matured to the point that they are becoming gateways for commerce.
And now we have an ecosystem emerging. HBase is gaining acceptance as a database on top of Hadoop that acts in a similar way to Google’s BigTable. Pig is a high-level language running on top of Hadoop.
These new tools help fuel new discovery. And that means lots of change for the services market as it adapts to the new world of data science. CIOs would do well to bring in people with fresh eyes to existing problems. Service providers that can bring new perspectives to old domains will be the ones to watch.