UPDATED 10:17 EST / NOVEMBER 16 2010

Extractiv Opens Semantic Web Crawling On-Demand

Extractiv is opening its service to the public today, after year in private beta.  The new release includes Semantic Web Crawling and Semantic On-Demand Document Conversion.  Their web-scale structures data service fuses dynamic web crawling with efficient language processing text extraction of Language Computer Corporation to provide a low cost and simplified semantic web crawling data collections.

It is currently available in three account types: Basic which is free, Plus with $99 monthly subscription and gives users access to APIs and higher volume analysis, and Premium with $299 monthly subscription which provides additional features for power user.

“Extractiv will be where the semantic web begins.” said Shion Deysarkar, CEO of both Extractiv and 80legs. “We’re providing a gateway for all unstructured content to be transformed into rich semantic data.”

Compared to its competitors, Extractiv supports a wider range of semantic data collection and processing because of the combination of Semantic Web Crawling and Semantic On-Demand. The former allows users to crawl millions of web pages and convert any unstructured content found on web pages into semantic data while the latter provides automatic semantic conversion for processing specific documents. Users can upload and process a heap of documents via On-demand REST API.

“Extractiv democratizes access to semantic data,” said Extractiv President John Lehmann, who also serves as CEO of Language Computer Corporation. “We believe a host of new, semantically-aware applications become possible with the more intelligent analysis and data extraction performed by our service.”

Use Cases

Adding markup to emails, blogs and documents

Semantic On-Demand can add semantic knowledge to your content before people look it, so they can quickly and easily pull out the high valued concepts. If you’re publishing a news article, adding semantic markup will let your readers get the most out of it. If you’re an email application, tagging dates, locations, and people will help the user understand that email quickly.

Making sense of document archives

Does your company have a large archive of stored documents? With Semantic On-Demand, you can process documents in your archive and use it to pull out the key information. Want to search your archive with advanced queries instead of string matching? The metadata Extractiv provides will allow you to easily add these capabilities on existing data.


Since you’re here …

… We’d like to tell you about our mission and how you can help us fulfill it. SiliconANGLE Media Inc.’s business model is based on the intrinsic value of the content, not advertising. Unlike many online publications, we don’t have a paywall or run banner advertising, because we want to keep our journalism open, without influence or the need to chase traffic.The journalism, reporting and commentary on SiliconANGLE — along with live, unscripted video from our Silicon Valley studio and globe-trotting video teams at theCUBE — take a lot of hard work, time and money. Keeping the quality high requires the support of sponsors who are aligned with our vision of ad-free journalism content.

If you like the reporting, video interviews and other ad-free content here, please take a moment to check out a sample of the video content supported by our sponsors, tweet your support, and keep coming back to SiliconANGLE.