UPDATED 13:21 EDT / MARCH 17 2011

Infochimps, the Gathering (of Structured Big Data)

Startups: The Hackering - a game from infochimps One of the more interesting conversations I had at SxSW was the with Nick Ducoff of Austin-based Infochimps. Those of you interested in the trends in big data are likely well aware the company – they’re in the business of curating, housing and providing API access to large data sets, and at the festival, they announced the launch of over 2000 new data sets, an amazing feat in and of itself.

Additionally, they announced the debut of the API explorer, a vehicle for getting an immediate understanding sans-documentation of how each API functions.

The datasets aren’t particularly obscure, either. Nick showed me a number of the staff favorites from the new debut, including a database of all MLB statistics, which contains “a record of major league games played from 1871-2008.”

On the social data side of things, they also play host to the Qwerly API, which allows, in essence, a “whois” for Twitter. A simple query to the Qwerly API returns profile URLs and some extended information across everything that Qwerly is able to find.

Curation and Mashups were the Theme of the Conference

I’ve been asked repeatedly by those who didn’t get to go to SxSW as to what I saw were vibes and themes of this show. I noticed a number of things, both of which Infochimps’ presence and announcement played right into: this is a distribution year, rather than an innovation year, and that curating (otherwise known as extracting signal from noise) played a major role.

Both themes have the same root cause, essentially: there is more data freely available floating around, and the only way to extract value from that data is to curate it, organize it, and cross-reference it with other data classifications.

Infochimps plays a crucial role in all of this. Infochimps isn’t in the business of creating structure of unstructured data. Rather, similar to Google, their core philosophy is to index the data (and unlike Google) curate, make available and clean up for developers the metadata around that data.

Nick also hinted, in addition to the existing announcements, at upcoming available data around location, which certainly plays into the zeitgeist of where current social development projects are. This clearly puts Infochimps in a place where they’ll easily be a go-to source of data as this world develops.


Since you’re here …

… We’d like to tell you about our mission and how you can help us fulfill it. SiliconANGLE Media Inc.’s business model is based on the intrinsic value of the content, not advertising. Unlike many online publications, we don’t have a paywall or run banner advertising, because we want to keep our journalism open, without influence or the need to chase traffic.The journalism, reporting and commentary on SiliconANGLE — along with live, unscripted video from our Silicon Valley studio and globe-trotting video teams at theCUBE — take a lot of hard work, time and money. Keeping the quality high requires the support of sponsors who are aligned with our vision of ad-free journalism content.

If you like the reporting, video interviews and other ad-free content here, please take a moment to check out a sample of the video content supported by our sponsors, tweet your support, and keep coming back to SiliconANGLE.