This is a summary and analysis of an issue that occurred with the delivery of an Acquia product or service. The purpose of this document is to share details about what happened and why, so there is a common understanding of what is required to prevent a future occurrence if at all possible. Any remaining issues or risks are identified, as are recommended or pending actions.
Between 1-3 June 2021 Acquia Content Hub and Personalization services experienced a degradation of service which caused syndication requests to queue for extended periods and affected some Drupal application functions attempting to contact the Content Hub service. This degradation was caused when a large volume of requests exceeded the ability of the API/Database to process them. Acquia R&D has identified a number of remediations - enhancing service optimization as well as preventing congestion from any particular application to impact performance for other customers - in order to mitigate risk of recurrence.
Between 1 and 3 June 2021 Acquia Content Hub and Personalization experienced a degradation of service for customers in the US East region of service. This degradation caused significant delays in the syndication of content and operations dependent on interactions with the Content Hub service. During this event content syndication requests queued and all were processed as actions were taken to increase capacity and mitigate load on the service.
The root cause of this service degradation was a large influx of entity node revisions (nearing 500,000 per hour). This exceeded the capacity of the API/database to process incoming requests resulting in requests being queued and taking significant times to process.