Cloudflare outage briefly knocks parts of the web offline
Cloudflare Inc. this morning experienced a brief but widely felt service disruption that made many major websites inaccessible worldwide.
The outage (pictured) started at 9:52 a.m. EDT and lasted for more than an hour. Down Detector, a service that tracks website outages, received user reports of “502 Bad Gateway” errors for sites such as Pinterest and BuzzFeed and business applications such as Dropbox. Down Detector itself was knocked offline for a short period.
Even some sites that don’t rely on Cloudflare’s content delivery network were affected. The BBC reported that Coindesk, a major cryptocurrency and blockchain news blog, briefly displayed incorrect pricing information for bitcoin after the outage hit some of its data providers.
Certain secondary Cloudflare services were still experiencing issues more than two hours after the company brought its platform back online. Cloudflare Analytics, which helps site operators track user traffic, stopped displaying new data, and web request logs were being delivered with a delay.
The downtime was apparently caused by an internal operational error. In a series of tweets, Cloudflare Chief Executive Matthew Prince said that a “massive spike in global CPU usage” took down both the provider’s primary and backup systems. Prince pledged that the company will put new protections in place to insulate its backup systems from future outages and ensure they kick into action as intended.
The disruption had such a big impact on the web because of Cloudflare’s central role in processing global internet traffic. More than 16 million sites rely on the company’s platform to load pages for users, as well as to fend off online threats such as distributed denial-of-service attacks.
Large-scale outages at major internet companies are infrequent, but when they do happen the impact is often global. In June, a technical issue caused YouTube, Gmail, G Suite and other core Google LLC products to become unavailable for four hours. It also caused disruptions for external services such as Snapchat that run on the search giant’s cloud platform.
“In 2019, with automation software available for all deployment tasks, downtime caused by human error is simply no longer acceptable,” Robert Reeves, co-founder and chief technology officer of database release automation software provider Datical Inc., told SiliconANGLE. “It’s time for this to stop.”
Image: Down Detector