20/10/2025
(Updated 0920Z/2020AEDT, see further below)
Snapchat refusing to post an image or update your latest story? It's not just you. Vast swathes of the Internet aren't functioning as expected.
Amazon's AWS, whose CDN and distributed cloud compute powers many corners of the web is having issues with the company now admitting to the fault on their website, stating that "We can confirm increased error rates and latencies for multiple AWS Services in the US-EAST-1 Region [...] We are actively engaged and working to both mitigate the issue and understand root cause."
Given they have a SLA and downtime costs Amazon large amounts of money, it would be expected that the issue would be resolved quickly, likely within the next hour but if BGP was involved with the outage it may take longer for all AS (autonomous systems, each individual network providers' segment within BGP) route views to converge.
โ
Updated 0920Z/2020AEDT: the Amazon service health page now states they have identified the root cause of the problem in a distributed database platform known as DynamoDB, which they claim "appears to be related to DNS resolution of the DynamoDB API endpoint in US-EAST-1."
They furthermore state that they are working on multiple parallel paths to attempt to hasten recovery but this may be a matter of waiting out the TTL.
Hidden in their update was what we already knew but Amazon hadn't yet admitted, that the incident "also affects other AWS Services in the US-EAST-1 Region. Global services or features that rely on US-EAST-1 endpoints such as IAM updates and DynamoDB Global tables may also be experiencing issues."
Perhaps ironically, Amazon's own trouble ticket system for AWS customers relies on DynamoDB meaning that new issues cannot be lodged or are at least severely rate limited.