Amazon Finds the Cause of Its AWS Outage: A Typo

March 02, 2017

Company says disruption started at its Northern Virginia data centers

By LAURA STEVENS March 2, 2017 4:35 p.m. ET

Amazon.com Inc. on Thursday blamed human error for an outage at its cloud-services unit that caused widespread disruption to internet traffic across the U.S. earlier this week.

In a post on its website, Amazon said the outage started with a typo at Amazon’s northern Virginia data centers Tuesday.

An employee trying to speed up the company’s S3 cloud-storage billing system tried to take a few servers offline. The employee mistyped the command, however, affecting more servers than intended, which led to a cascade of failures that ultimately knocked out S3 and other Amazon services. It also took longer than expected to restart certain services, Amazon said.

Amazon said it is adding safeguards to prevent server capacity from falling too quickly or below a minimum level.

Amazon Web Services, the largest global seller of cloud infrastructure, has more than a million users. The hours long outage Tuesday disabled and slowed apps and websites from a wide section of U.S. companies, including Quora Inc., Slack Technologies Inc. and Medium.com Inc.

The AWS outage cost companies in the S&P 500 index $150 million, according to Cyence Inc., a startup that specializes in estimating cyberrisks. Apica Inc., a website-monitoring company, said 54 of the internet’s top 100 retailers saw website performance slow by 20% or more.

—Robert McMillan contributed to this article.

https://www.wsj.com/articles/amazon-finds-the-cause-of-its-aws-outage-a-typo-1488490506

Search This Blog

Ken Garen, CPA, Co-Founder and President, UBCC

Amazon Finds the Cause of Its AWS Outage: A Typo

Comments

Post a Comment

Popular posts from this blog

New ATM's: withdraw money with veins in your finger

Will AI replace doctors who read X-rays, or just make them better than ever?

The City That’s Trying to Replace Politicians With Computers (It’s Working)