Previous incidents

October 2025

Oct 30, 2025

2 incidents

Dashboard, API Service, and 1 other service are down

Downtime

Resolved Oct 30, 2025 at 07:37pm UTC

Dashboard and API Service recovered.

6 previous updates

Test runs are degraded

Degraded

Resolved Oct 30, 2025 at 06:00pm UTC

Pipelines are working normally still we are observing all states.

1 previous update

September 2025

No incidents reported

August 2025

Aug 28, 2025

1 incident

Log Ingestion Delay Due to Google Pub/Sub Slowness

Degraded

Resolved Aug 28, 2025 at 09:30am UTC

This is now resolved, and ingestion is back to its original speed.

2 previous updates

Aug 13, 2025

1 incident

Log ingestion has slowed down

Degraded

Resolved Aug 12, 2025 at 08:13pm UTC

The node is recovered. Ingestion is resumed.

1 previous update

Aug 06, 2025

1 incident

[Downstream service issue] Log ingestion buffer is taking more time than expe...

Degraded

Resolved Aug 05, 2025 at 07:49pm UTC

We’ve received an update from the Clickhouse team. Here’s the crux of the issue:

"Due to memory starvation, other processes in your cluster are starting to fail, resulting in degraded performance, likely including the failed writes you are experiencing."

All queued logs have been written to the disk and the instance is back to normal.

We’re still keeping an eye on our data pipelines. We’ve also put together a checklist with the team to help prevent this from happening again.

3 previous updates