UPDATED 21:19 EDT / APRIL 18 2018

CLOUD

48M social media records exposed in latest cloud storage mishap

Seattle data gathering firm LocalBlox Inc. is the latest company to expose data via a misconfigured Amazon Web Services Inc. S3 storage instance — a lot of data. In this case, in involved 48 million records relating to social media information.

Discovered by Chris Vickery at UpGuard Inc., as all good S3 data exposures are, the data related to tens of millions of people that LocalBlox had gathered from social media platforms, including physical addresses, dates of birth, “scraped” data from LinkedIn and Facebook, Twitter handles and more.

As Vickery explained in a blog post, the data builds a “three-dimensional picture of every individual affected — who they are, what they talk about, what they like, even what they do for a living — in essence a blueprint from which to create targeted persuasive content, like advertising or political campaigning.”

The data gathering itself isn’t greatly surprising, since LocalBlox has stated clearly that its service provides automatic crawling, discovery, extraction, indexing, mapping and augmenting of data “in a variety of formats from the web and from exchange networks.” The difference, of course, is that although it would usually sell access to the data, this time it exposed it to all and sundry for free on an S3 instance.

Instead of admitting it made a mistake, LocalBlox instead accused Vickery of hacking the company. Chief Executive Officer Ashfaq Rahman told ZDNet that “most” of the data was fabricated for internal tests but did not say how much. Still, the company changed the setting on the file from public to private within hours of being made aware that it was publicly exposed.

Given that the data itself is based on publicly available information, arguably it’s less concerning than a data exposure that exposes private information such as passport and ID card scans, as was the case with Thai telco True Corp. late last week. Still, with the drama of the so-called Cambridge Analytica data scandal involving Facebook still making headlines, the incident once again raises concerns about data scraping practices.

“Data gathered on these people connected their identity and online behaviors and activity, all in the context of targeted marketing, i.e. how best to persuade them,” Vickery noted. “It is exactly this persuasive factor that lies at the heart of discussions about how data is gathered and sold: when aggregated together at scale, your psychographic data can be used to influence you. It is what makes exposures of this nature so dangerous.”

Image: frauhoelle/Flickr

Since you’re here …

… We’d like to tell you about our mission and how you can help us fulfill it. SiliconANGLE Media Inc.’s business model is based on the intrinsic value of the content, not advertising. Unlike many online publications, we don’t have a paywall or run banner advertising, because we want to keep our journalism open, without influence or the need to chase traffic.The journalism, reporting and commentary on SiliconANGLE — along with live, unscripted video from our Silicon Valley studio and globe-trotting video teams at theCUBE — take a lot of hard work, time and money. Keeping the quality high requires the support of sponsors who are aligned with our vision of ad-free journalism content.

If you like the reporting, video interviews and other ad-free content here, please take a moment to check out a sample of the video content supported by our sponsors, tweet your support, and keep coming back to SiliconANGLE.