On January 21, 2025, between 14:02 and 17:49 UTC, Atlassian customers using Bitbucket Cloud were unable to use the website, API, or Pipelines. The event was triggered by write contention in a high-traffic database table. The incident was detected within eight minutes. We then throttled traffic and improved query performance, which allowed services to recover. The total time to resolution was about three hours and 47 minutes.
The impact lasted from 14:02 to 17:49 UTC on January 21, 2025, and affected Bitbucket Cloud customers globally. They were unable to use the website, API, or Pipelines. Git hosting (SSH) was unaffected.
An increase in API traffic triggered write contention on a high-traffic database table, which increased CPU usage and degraded database performance. This ultimately reduced the availability of core services (web, API, and Pipelines).
We know that outages impact your productivity. While we have several testing and preventative processes in place, this specific issue wasn’t identified because the code path being triggered does not commonly experience this type of traffic.
We are prioritizing the following improvement actions to avoid repeating this type of incident:
To improve service resilience and recovery time for our environments, we will implement additional preventative measures such as:
We apologize to customers whose services were impacted by this incident and are taking immediate steps to improve the platform’s performance and availability.
Thanks,
Atlassian Customer Support