Back to overview

IntraManager Board API Downtime Incident Report - March 11, 2024

Mar 11 at 03:51pm CET
Affected services
API

Resolved
Mar 11 at 03:51pm CET

Event Summary:

On March 11, 2024, at approximately 14:53, our API experienced an unplanned downtime that lasted for a total of 17 minutes, concluding at 15:10. This incident was triggered by a failed server scaling operation conducted by our hosting provider.

Cause of the Incident:

The downtime occurred as a direct result of a failed scaling event within our server infrastructure. Server scaling operations are a common practice, performed multiple times daily to dynamically adjust resources in response to fluctuating demand on our clusters. Unfortunately, this particular scaling event did not execute as anticipated, leading to temporary service disruption.

Detection and Response:

Our server monitoring systems promptly alerted us to the issue, allowing our technical team to immediately begin diagnosing and addressing the malfunction. Through swift and coordinated efforts, we were able to fully resolve the problem by 15:11, restoring normal service operations.

Conclusion:

We apologize for any inconvenience caused by this service interruption and wish to assure our users that we are taking this incident seriously. We are currently working closely with our hosting provider to investigate the root cause of the failed server scaling operation and to implement measures designed to prevent similar incidents in the future.

Thank you for your understanding and continued support.