Resolved -
Error rates have returned to historic norms. This incident is resolved.
Apr 1, 20:25 UTC
Monitoring -
Error rates have recovered and we are continuing to monitor.
Apr 1, 17:00 UTC
Identified -
We've identified an infrastructure constraint causing an elevated API error rate. Run ingestion requests are retried. We have implemented a mitigation and are monitoring for recovery.
Apr 1, 16:54 UTC
Resolved -
Performance has returned to historic norms.
Mar 25, 16:04 UTC
Monitoring -
A fix has been implemented and we are monitoring the results.
Mar 25, 15:42 UTC
Update -
We are continuing to work on a fix for this issue.
Mar 25, 15:30 UTC
Identified -
We've identified an infrastructure issue causing run ingest delays and slow queries. Some queries are failing and retryable. We are actively working on a mitigation.
Mar 25, 15:30 UTC