Resolved -
The incident is now resolved. Since May 1st 2025 at 04:52 UTC we falsely reported a limited number of Atlas Clusters as down in the UI and alerted as such.
May 2, 12:14 UTC
Monitoring -
We have identified the root cause for the delays in Atlas metrics processing and have mitigated the issue. Atlas metrics processing is no longer delayed. Clusters are still operational.
May 2, 10:05 UTC
Investigating -
We are investigating delayed Atlas metrics processing that can result in host down alerts for Atlas clusters. Clusters are still operational.
May 2, 08:37 UTC