Previous incidents
Dashboard, API Service, and 1 other service are down
Resolved Dec 10 at 08:01am IST
AI Models recovered.
Dashboard, API Service, and 1 other service are down
Resolved Dec 05 at 08:33am IST
Quick RCA
- Pulls of the latest Bitnami Redis version started failing during some restarts.
- We have moved the setup entirely to our own mirror to prevent this from happening in the future.
[Cloudflare outage] Prompt executions and log ingestions are slowed down
Resolved Nov 18 at 08:54pm IST
Cloudflare confirmed that the issue is completely resolved. We are still keeping an eye on the system.
Log ingestion has slowed down
Resolved Nov 11 at 12:16pm IST
The cluster is back to normal, but we are still monitoring the pipeline as it works through the backlog. We are in touch with the ClickHouse team to get a detailed RCA and will publish it as soon as we receive it.
Dashboard, API Service, and 1 other service are down
Resolved Oct 31 at 01:07am IST
Dashboard and API Service recovered.
Test runs are degraded
Resolved Oct 30 at 11:30pm IST
Pipelines are working normally; we are still monitoring all states.