UPDATED 10:31 EDT / AUGUST 07 2013

Sqrrl CTO Explains How to Bring Structure to a Schema-less Database

In the fourth episode of theCube’s special series on Accumulo, Sqrrl founding CTO Adam Fuchs discusses how his company’s platform helps enterprises organize their NoSQL environments.

Fuchs opens the session with an overview of the system. Accumulo limits querying to a range within a keyspace, he explains, and that range represents a hierarchical structure which follows a row, column, and timestamp format. The row determines how the data is partitioned in the database, the column defines vertical partitioning within the row, and the qualifier denounces the uniqueness of the value stored in the key-value pair.

A user can query a specific row, a row in a particular column family, and any value or set of values that may be associated with it.

Fuchs says that one of simplest ways to optimize NoSQL databases is pairing a document table with an inverted index. The document is organized using universally unique identifiers (UUIDs) that represent fields, which in turn contain values that can be retrieved by querying the IDs. This table design enables users to perform query based on the characteristics of a document (that is, the value or parts of the value they’re looking for) rather than its identifier.

The approach described above is known as term-distributed information retrieval, which differs from document-distributed information retrieval. Fusch points out that the latter method involves grouping documents and index entries into partitions, or shards, that can be queried in parallel.

The CTO wraps up the episode by highlighting that Sqrrl implemented both concepts in Accumulo to simplify location data analysis, graph organization, 3D modeling and other data-driven workloads.

Click the video below for the full highlights.


Since you’re here …

… We’d like to tell you about our mission and how you can help us fulfill it. SiliconANGLE Media Inc.’s business model is based on the intrinsic value of the content, not advertising. Unlike many online publications, we don’t have a paywall or run banner advertising, because we want to keep our journalism open, without influence or the need to chase traffic.The journalism, reporting and commentary on SiliconANGLE — along with live, unscripted video from our Silicon Valley studio and globe-trotting video teams at theCUBE — take a lot of hard work, time and money. Keeping the quality high requires the support of sponsors who are aligned with our vision of ad-free journalism content.

If you like the reporting, video interviews and other ad-free content here, please take a moment to check out a sample of the video content supported by our sponsors, tweet your support, and keep coming back to SiliconANGLE.