UPDATED 12:00 EDT / AUGUST 01 2014

The ‘father of data warehousing’ moves on to bigger things | #MITCDOIQ

Bill Inmon - MITCDOIQ 2014 - theCUBEThe “father of the data warehouse” hasn’t talked data warehousing in a decade. Instead, Bill Inmon is tackling the thorny problem of making the 80 percent of corporate data that’s in unstructured form easily available for analysis.

Inmon, who is credited with writing the first book about data warehousing as well as coining the term, said his latest venture holds even more promise than his previous two successful startups. Forest Rim Technology Inc. provides textual ETL (extract, transform, load) software that mines insight from free-form text. Inmon dropped by SiliconANGLE’s theCUBE at the recently concluded MIT Chief Data Officer and Information Quality (CDOIQ) Symposium to provide a rare look behind the scenes of the ambitious effort.

The Colorado-based Forest Rim sets itself apart with a unique approach that attempts to discern the meaning of raw text by its context rather than just the contents, Inmon told hosts Dave Vellante and Paul Gillin, a methodology that it claims allows it to tackle even the most unwieldy of datasets. That includes everything from contact center transcripts to the clinical narratives used by doctors to describe a specific medical situation to their colleagues, linguistically complex workloads that necessitate an equally elaborate analytic process to ingest.

“We take language for granted because we speak it and to us language is very natural and normal, but when you start to put language into a computer, it’s anything but natural and normal,” Inmon explained. “So when it comes to the question of how you do contextual analysis, you have a hundred different ways that you do it because in language, there’s a hundred different ways context can occur and appear to us.”

It took Inmon and his team of researchers a full 12 years to develop a set of algorithms can handle that tremendous amount of variety, he detailed, and they’re still no where near the 100 mark. But he said that the offering has nonetheless proven itself in the field to be effective in addressing the formidable challenge of mapping unstructured text into the rows and columns of the relational databases enterprises rely on today.

From there, the information can be easily fed into a downstream analytical process, be it a traditional business intelligence (BI) solution like SAP BusinessObjects or a virtualization product like Tableau or QlikView. Inmon said that simplicity has earned Forest Rim several paying customers across multiple industries, including several household brands he wouldn’t mention by name.

See the entire interview below:


Since you’re here …

… We’d like to tell you about our mission and how you can help us fulfill it. SiliconANGLE Media Inc.’s business model is based on the intrinsic value of the content, not advertising. Unlike many online publications, we don’t have a paywall or run banner advertising, because we want to keep our journalism open, without influence or the need to chase traffic.The journalism, reporting and commentary on SiliconANGLE — along with live, unscripted video from our Silicon Valley studio and globe-trotting video teams at theCUBE — take a lot of hard work, time and money. Keeping the quality high requires the support of sponsors who are aligned with our vision of ad-free journalism content.

If you like the reporting, video interviews and other ad-free content here, please take a moment to check out a sample of the video content supported by our sponsors, tweet your support, and keep coming back to SiliconANGLE.