Previous incidents

February 2025
Feb 06, 2025
1 incident

Dashboard is down

Downtime

Resolved Feb 06 at 01:39pm IST

Dashboard recovered.

January 2025
Jan 16, 2025
1 incident

AI Models is down

Downtime

Resolved Jan 16 at 12:20pm IST

All models have recovered.

December 2024
Dec 03, 2024
1 incident

Logging is degraded

Degraded

Resolved Dec 13 at 12:38pm IST

Update

The root cause analysis (RCA) is tracked in the following public Google issue: https://issuetracker.google.com/issues/363324206

Google has enabled a liveness probe on the cluster to prevent a recurrence. We are also taking precautionary steps on our side to avoid this issue entirely in the future.
