Cloud Outages

2026-02-20 Cloudflare

2026-02-20T00:00:00.000Z

Root Cause

BYOIP Prefix Mass Deletion Outage
missing empty check on filter parameter
A buggy automated cleanup task attempted to delete BYOIP prefixes.
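
A minimal sketch of the kind of guard that was missing, assuming a cleanup job that selects prefixes by a state filter. The function name, the `state` field and the example prefixes are hypothetical and not taken from Cloudflare's code; the point is only that an empty filter is rejected instead of being treated as "match everything".

```python
from typing import Iterable, List, Optional


def select_prefixes_for_cleanup(prefixes: Iterable[dict], state_filter: Optional[str]) -> List[dict]:
    """Return only the prefixes whose state matches the filter.

    Hypothetical helper: refuses to run when the filter is missing or empty,
    so a blank parameter can never select every prefix.
    """
    if state_filter is None or state_filter.strip() == "":
        raise ValueError("cleanup filter must not be empty")
    return [p for p in prefixes if p.get("state") == state_filter]


if __name__ == "__main__":
    example = [
        {"cidr": "192.0.2.0/24", "state": "pending_delete"},
        {"cidr": "198.51.100.0/24", "state": "active"},
    ]
    # Only the prefix explicitly marked for deletion is selected.
    print(select_prefixes_for_cleanup(example, "pending_delete"))
```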

Impact

customers lost CDN, Spectrum, Dedicated Egress, and Magic Transit

Duration

Sources

https://blog.cloudflare.com/cloudflare-outage-february-20-2026/

2026-02-07 Microsoft Azure

2026-02-07T00:00:00.000Z

Root Cause

"The event began following a power interruption affecting one of the datacenters within the region, after which impact manifested as infrastructure availability loss and service disruptions across multiple dependent workloads in the region."

Impact

West US region
the region has no AZs, so nearly all services were down for every customer without multi-region failover

Duration

20h

Sources

https://www.youtube.com/watch?v=p_HKW7qbwXs&list=PLmsFUfdnGr3xomlYbZPAYTtFdkcvbv2ye&index=3
MS tracking id "_SVS-5_G"
https://azure.status.microsoft/en-us/status/history/

2025-12-05 Cloudflare

2025-12-05T00:00:00.000Z

Impact

28% of HTTP traffic

Root Cause

botched fix for React vulnerability

Duration

25min

Sources

https://blog.cloudflare.com/5-december-2025-outage/

2025-11-18 Cloudflare

2025-11-18T00:00:00.000Z

Root Cause

faulty configuration replicated globally

Impact

global
services affected: Cloudflare Sites and Services (Access, Bot Management, CDN/Cache, Dashboard, Firewall, Network, WARP, Workers)
ChatGPT, X, many websites

Duration

9.5h

Sources

2025-10-20 AWS

2025-10-20T00:00:00.000Z

Root Cause

DNS problems with DynamoDB in us-east-1
HA is implemented via DNS entries; a faulty DNS update left DynamoDB unreachable
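
A generic safeguard for DNS-based failover, sketched under assumptions: the planned update is modeled as a mapping from name to record list, and the function name and hostnames are hypothetical. It is not AWS's mechanism; it only shows a pre-apply check that refuses a plan which would leave a name without any records.

```python
from typing import Dict, List


def validate_dns_plan(plan: Dict[str, List[str]]) -> None:
    """Raise if the planned update would leave any name with zero records."""
    empty = sorted(name for name, records in plan.items() if not records)
    if empty:
        raise ValueError(f"refusing to apply plan, names without records: {empty}")


if __name__ == "__main__":
    # Hypothetical plan: one healthy name, checked before it is applied.
    validate_dns_plan({"db.example.internal": ["192.0.2.10", "192.0.2.11"]})
    print("plan accepted")
```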

Impact

many gaming platforms down

Duration

3h (according to post mortem)
15h (according to TechTarget)

Sources

2025-07-09 Outlook

2025-07-09T00:00:00.000Z

Impact

outlook.com
unable to access virtual mailboxes
sign in issues

Root Cause

A recent service update to an authentication component unintentionally prevented access for a subset of users, resulting in intermittent service unavailability.

Duration

19-21h

Sources

2025-06-15 Heroku

2025-06-15T00:00:00.000Z

Root Cause

automated OS update on production hosts took network routes down

Duration

23h

Sources

https://www.heroku.com/blog/summary-of-june-10-outage/

2025-06-12 Google Cloud

2025-06-12T00:00:00.000Z

Impact

Many Google Cloud locations
Secondary effects caused by Cloudflare being affected
Google Cloud, Google Workspace and Google Security Operations products experienced increased 503 errors in external API requests, impacting customers.

Root Cause

control plane policy bug in quota management
binary crash loop in each region deployment

Duration

overall 7h
most regions fixed after 2h

Sources

https://status.cloud.google.com/incidents/ow5i3PPK96RduMcb1SsW

2025-05-24 X

2025-05-24T00:00:00.000Z

Root Cause

fire in data center PDX11

Duration

2.5h

Sources

https://www.wired.com/story/elon-musk-x-datacenter-fire/

2025-01-08 Microsoft Azure

2025-01-08T00:00:00.000Z

Root Cause

A networking configuration change in East US 2 created issues across multiple Azure services.

Duration

50h

Sources

https://www.techtarget.com/searchCloudComputing/feature/Cloud-outages-expected-to-be-the-new-normal-in-2026

2024-07-19 CrowdStrike

2024-07-19T00:00:00.000Z

Root Cause

CrowdStrike agent downloaded a faulty update, causing a reboot loop.

Impact

All infrastructure running Windows with the CrowdStrike agent installed
Worst global impact seen so far

Duration

hours to days due to the need to recover affected end user systems manually

Sources

https://arstechnica.com/information-technology/2024/07/major-outages-at-crowdstrike-microsoft-leave-the-world-with-bsods-and-confusion/

2024-07-18 Microsoft Azure

2024-07-18T00:00:00.000Z

Root Cause

Between 21:40 UTC on 18 July 2024 and 12:15 UTC on 19 July 2024, customers may have experienced issues with multiple Azure services in the Central US region due to an Azure Storage availability event. This issue affected Virtual Machine (VM) availability, which caused downstream impact on multiple Azure services, including failures of service management operations and connectivity or availability of services. Services with dependencies on the impacted Virtual Machines would have been affected.

Impact

Azure VMs in the Central US region

Duration

~14h

Media

https://www.techradar.com/pro/microsoft-says-its-cloud-services-are-back-up-after-major-outage

2024-05-02 Google Cloud

2024-05-02T00:00:00.000Z

Root Cause

Impact

Infrastructure of client UniSuper was deleted, including backups

Duration

Sources

2024-04-08 Rackspace

2024-04-08T00:00:00.000Z

Root Cause

Impact

"impacted multiple downstream providers, as well as Rackspace customers within multiple regions including the U.S., Japan, Vietnam, Spain, Canada, Germany, Singapore, France, the Netherlands, the U.K., Brazil, and South Africa"

Duration

14min

Sources

https://www.cbsnews.com/news/microsoft-teams-outages-block-limit-user-access/

2024-01-26 Microsoft Teams

2024-01-26T00:00:00.000Z

Root Cause

"a networking issue impacting a portion of the Teams service"

Duration

2.5h

Sources

2023-11-02 Cloudflare

2023-11-02T00:00:00.000Z

Root Cause

data center outage; high-availability failover did not work

Duration

Impact

Cloudflare control plane and analytics outage

Sources

https://blog.cloudflare.com/post-mortem-on-cloudflare-control-plane-and-analytics-outage/

2023-07-05 Azure

2023-07-05T00:00:00.000Z

Root Cause

fiber cut caused by severe weather conditions in the Netherlands

Duration

Impact

Region West Europe partially down

Sources

2023-06-13 AWS-us-east-1

2023-06-13T00:00:00.000Z

Impact

Service degradation of 104 AWS services (that were using AWS Lambda)

Duration

Root Cause

Lambda scaling crossing a new threshold hit a functional bug

Sources

https://aws.amazon.com/message/061323/

2023-04-25 GCP-europe-west9

2023-04-25T00:00:00.000Z

Root Cause

fire after cooling system water pipe leak

Duration

Impact

Cloud region europe-west9 was offline (one day)
Zone europe-west9-a was offline (two weeks)

Sources

https://status.cloud.google.com/incidents/dS9ps52MUnxQfyDGPfkY

2023-04-07 SpaceX

2023-04-07T00:00:00.000Z

Impact

No connection

Duration

Root Cause

Expired certificate
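
Expiry outages like this one are usually caught by monitoring the certificate's `notAfter` date. A minimal sketch, assuming the date string is already available in the format Python's `ssl` module reports it; the helper name `days_until_expiry` is hypothetical and nothing here reflects SpaceX's actual tooling.

```python
import ssl
import time
from typing import Optional


def days_until_expiry(not_after: str, now: Optional[float] = None) -> int:
    """Whole days until the certificate expires; negative if already expired.

    `not_after` uses the certificate time format understood by
    ssl.cert_time_to_seconds, e.g. "Jan  5 09:34:43 2018 GMT".
    """
    expires_at = ssl.cert_time_to_seconds(not_after)
    current = time.time() if now is None else now
    return int((expires_at - current) // 86400)


if __name__ == "__main__":
    remaining = days_until_expiry("Jan  5 09:34:43 2018 GMT")
    # Alert well before zero; a threshold of 30 days is an arbitrary choice.
    if remaining < 30:
        print(f"certificate needs renewal, {remaining} days left")
```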

Sources

https://blog.cloudflare.com/q2-2023-internet-disruption-summary/

2023-03-09 Datadog

2023-03-09T00:00:00.000Z

Root Cause

automatic OS update takes network down

Duration

Impact

Service outage

Sources

https://www.crn.com/news/cloud/the-15-biggest-cloud-outages-of-2023?page=5

2023-02-16 GCP

2023-02-16T00:00:00.000Z

Root Cause

network update caused traffic disruption

Duration

Impact

Gmail, YouTube, Google Drive partial outage

Sources

https://www.linkedin.com/pulse/recent-cloud-platform-outages-2023-pankaj-kumar-mandal

2023-02-13 Oracle OCI

2023-02-13T00:00:00.000Z

Root Cause

performance problems in DNS-based load management

Duration

Impact

OCI Vault, API Gateway, Oracle Digital Assistant and OCI Search with OpenSearch

Sources

https://www.networkworld.com/article/3688509/oracle-outages-serve-as-warning-for-companies-relying-on-cloud-technology.html

2023-01-25 Microsoft Teams

2023-01-25T00:00:00.000Z

Root Cause

network configuration error

Duration

Impact

Worldwide MS Teams outage

Sources

2022-12-05 AWS US East2

2022-12-05T00:00:00.000Z

Root Cause

unclear

Duration

75min

Impact

US East2 connectivity issues

Sources

https://www.networkworld.com/article/971716/aws-suffers-outage-at-its-us-east-2-cloud-region.html

2022-10-25 WhatsApp

2022-10-25T00:00:00.000Z

Root Cause

backend application service failure

Duration

Impact

Users unable to send/receive messages

Sources

https://www.thousandeyes.com/blog/internet-report-pulse-update-november-7-2022

2022-09-15 Zoom

2022-09-15T00:00:00.000Z

Root Cause

unclear

Duration

Impact

worldwide, no meetings possible

Sources

https://www.thousandeyes.com/blog/internet-report-pulse-update-september-26-2022

2022-08-09 Google Search+Maps

2022-08-09T00:00:00.000Z

Root Cause

software update

Duration

Impact

Google Search, Google Maps globally unavailable

Sources

https://www.networkworld.com/article/971832/top-10-outages-of-2022.html

2022-07-08 AWS US East2 AZ1

2022-07-08T00:00:00.000Z

Root Cause

power failure

Duration

20min

Impact

AZ1 of US East2 without connectivity

Sources

https://www.thousandeyes.com/blog/aws-outage-analysis-july-28-2022

2022-06-21 Cloudflare

2022-06-21T00:00:00.000Z

Root Cause

A change to the network configuration in those locations caused an outage [1]

Duration

1h 15min

Impact

many affected websites

Sources

[1] https://blog.cloudflare.com/cloudflare-outage-on-june-21-2022

2022-04-05 Atlassian

2022-04-05T00:00:00.000Z

Root Cause

human error in global-scale orchestration: instead of shutting down a component, product instances were terminated

Impact

400 companies and anywhere from 50,000 to 400,000 users had no access to Jira, Confluence, Opsgenie, the Jira status page, and other Atlassian Cloud services

Duration

">14days for some customers"

Sources

2022-03-01 Apple

2022-03-01T00:00:00.000Z

Root Cause

DNS problems

Impact

App Store, Maps, TV

Duration

Sources

https://www.crn.com/news/cloud/the-10-biggest-cloud-outages-of-2022-so-far?page=6

2022-02-22 Slack

2022-02-22T00:00:00.000Z

Root Cause

Quote from Slack status page: A configuration change inadvertently led to a sudden increase in activity on our database infrastructure. Due to this increased activity, the affected databases failed to serve incoming requests to connect to Slack.

Impact

Slack not loading

Duration

Status Page

https://status.slack.com/2022-02-22

2021-12-08 AWS

2021-12-08T00:00:00.000Z

Impact

various services in us-east-1

Duration

4h

Sources

https://www.zdnet.com/article/aws-goes-down-and-with-it-goes-a-host-of-websites-and-services/

2021-10-04 Facebook

2021-10-04T00:00:00.000Z

Impact

Facebook, Instagram, WhatsApp down

Duration

Sources

https://engineering.fb.com/2021/10/04/networking-traffic/outage/
https://twitter.com/jgrahamc/status/1445068309288951820
https://www.theverge.com/2021/10/4/22709575/facebook-outage-instagram-whatsapp
Reports on the need for an angle grinder: https://x.com/cullend/status/1445156376934862848

2021-06-08 Fastly

2021-06-08T00:00:00.000Z

Impact

global incident
high origin loads
"Customers could continue to experience a period of increased origin load and lower Cache Hit Ratio (CHR)."

Duration

2h

Root Cause

unknown

Status Page

https://status.fastly.com/incidents/vpk0ssybt3bj (Report)

2021-05-11 Salesforce

2021-05-11T00:00:00.000Z

Impact

All services not available due to DNS outage

Duration

Root Cause

failed global DNS change

Status Page

https://www.theregister.com/2021/05/19/salesforce_root_cause/ (Report)

2021-03-23 quay.io

2021-03-23T00:00:00.000Z

Impact

No image pulls possible

Duration

Root Cause

unclear, apparently AWS-related

Status Page

https://status.quay.io/incidents/vfs19hmq660h (Incident Report)

2021-03-10 OVH SBG Datacenters

2021-03-10T00:00:00.000Z

Impact

4 datacenters down
2 destroyed
recovery >10 days

Sources

https://www.bleepingcomputer.com/news/technology/ovh-data-center-burns-down-knocking-major-sites-offline/

Provider Status Page

https://status.us.ovhcloud.com/

2020-11-26 AWS

2020-11-26T00:00:00.000Z

Root Cause

Duration

Impact

only us-east-1
Roku, Adobe, Glassdoor, Autodesk, The Wall Street Journal, 1Password
Kinesis Data Streams API and other dependent services

Sources

https://techhq.com/2020/12/3-biggest-public-cloud-outages-of-2020/

2020-08-24 Zoom

2020-08-24T00:00:00.000Z

Root Cause

not disclosed

Duration

Sources

https://techhq.com/2020/12/3-biggest-public-cloud-outages-of-2020/

Status Page

https://status.zoom.us/

2020-06-29 GitHub

2020-06-29T00:00:00.000Z

Duration

Impact

FIXME

Sources

https://statusgator.com/blog/2020/08/21/5-biggest-outages-of-q2-2020/

2020-06-10 IBM Cloud

2020-06-10T00:00:00.000Z

Duration

several hours

Impact

cloud down globally

Sources

https://statusgator.com/blog/2020/08/21/5-biggest-outages-of-q2-2020/

2020-05-17 Zoom

2020-05-17T00:00:00.000Z

Root Cause

undisclosed

Duration

Impact

customers unable to join meetings

Sources

https://statusgator.com/blog/2020/08/21/5-biggest-outages-of-q2-2020/

2020-05-12 Slack

2020-05-12T00:00:00.000Z

Root Cause

scaling up automation failure
new servers were not added to LBs, causing continuous performance degradation
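
This failure mode (capacity scaled up but never put behind the load balancers) can be detected by comparing the two inventories. A minimal sketch with hypothetical names and server IDs; how the two lists are obtained depends on the platform and is not shown.

```python
from typing import Iterable, Set


def unregistered_servers(provisioned: Iterable[str], lb_backends: Iterable[str]) -> Set[str]:
    """Servers that exist in the fleet but are not registered with the load balancer."""
    return set(provisioned) - set(lb_backends)


if __name__ == "__main__":
    fleet = ["web-1", "web-2", "web-3"]
    backends = ["web-1", "web-2"]
    missing = unregistered_servers(fleet, backends)
    if missing:
        print(f"not behind the load balancer: {sorted(missing)}")
```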

Duration

3h (everyone), 1d (for Electron app users)

Impact

no messages could be sent

Sources

https://statusgator.com/blog/2020/08/21/5-biggest-outages-of-q2-2020/

Status Page

https://status.slack.com/2020-05-12 (Incident Report)
https://slack.engineering/a-terrible-horrible-no-good-very-bad-day-at-slack/ (Postmortem)

2020-03-03 Azure

2020-03-03T00:00:00.000Z

Root Cause

malfunction of the datacenter's air ventilation, overheating hardware

Duration

Impact

East US

Sources

https://techhq.com/2020/12/3-biggest-public-cloud-outages-of-2020/

2019-07-18 Slack

2019-07-18T00:00:00.000Z

Root Cause

some servers unavailable, performance degradation

Duration

~7h

Impact

connectivity issues
10-25% error rate

Sources

https://www.cnet.com/news/slack-explains-last-weeks-hours-long-outage/

2019-06-24 Verizon

2019-06-24T00:00:00.000Z

Root Cause

BGP route leak
Route propagation

Duration

Impact

Google, AWS, Reddit, Netflix, Cloudflare customers

Sources

https://slate.com/technology/2019/06/verizon-dqe-outage-internet-cloudflare-reddit-aws.html

2019-06-02 GCP Outage

2019-06-02T00:00:00.000Z

Root Cause

Network control plane
automation tool

Duration

Impact

G-Suite, Gmail, Google Docs, Google Drive, Google Cloud, YouTube
Vimeo, Shopify, Discord, Snapchat

Sources

https://techhq.com/2019/08/what-we-learned-from-google-clouds-june-outage/

Status Page

2019-05-18 Salesforce

2019-05-18T00:00:00.000Z

Root Cause

internal DB update script broke user permissions (making them too permissive)

Duration

~15h

Impact

access shut off for all customers to prevent unauthorized data access