Increased latency and error rate in API
Incident Report for Checkly
Postmortem

During this incident, our analytics database ran out of memory due to a misconfiguration. This caused problems when viewing the home dashboard and check overview pages, and it also affected private locations and heartbeat checks.

After analyzing our logs, we determined that the incident lasted from 2024-02-22 13:50 UTC to 2024-02-22 14:35 UTC. To address this issue, we’ve increased the capacity of our analytics database. We’ve also added alerting to detect increased memory usage in the analytics database before it affects users.

We apologize for any problems this may have caused. If you have any questions or concerns, feel free to contact us at support@checklyhq.com.

Posted Feb 26, 2024 - 10:53 UTC

Resolved
This incident has been resolved.
Posted Feb 22, 2024 - 14:54 UTC
Investigating
We are currently investigating this issue.
Posted Feb 22, 2024 - 14:35 UTC
This incident affected: API and Web Application.