Postmortem -
Read details
May 3, 17:43 UTC
Resolved -
Check runs that were successfully scheduled have been backfilled. This incident has been resolved.
May 2, 05:42 UTC
Update -
We are continuing to monitor for any further issues.
May 2, 05:28 UTC
Monitoring -
We rolled out another fix and all metrics return back to normal. Checks are processing fine again, Check results rolling in
May 2, 04:41 UTC
Identified -
We are still seeing some related errors and checks are not processing as expected. we moved back to "partial outage".
May 2, 04:00 UTC
Update -
Checks get processed since ~1:11 UTC but it will take some time for results to show up
May 2, 02:41 UTC
Update -
While catching up after the outage, we see some higher than usual scheduling delays in us-west-2, eu-south-1, ap-southeast-2 and me-south-1.
May 2, 01:36 UTC
Monitoring -
A fix has been implemented and we are monitoring our infra working through the backlog of checks.
May 2, 01:14 UTC
Identified -
The failing service has been identified and we are looking into a fix.
May 2, 01:08 UTC
Investigating -
We are currently investigating this issue.
May 2, 00:51 UTC