[Major] Delays in Box Relay
Incident Report for Box
Postmortem

We recently addressed issues affecting the execution of Relay workflows. We would like to take the opportunity to further explain these issues and the steps we have taken to keep them from happening in the future.

Between 12:00 am PDT on September 13, 2023 and 4:46 pm PDT on September 14, 2023, some users may have experienced difficulties while working in Box. During this time, scheduled workflows created in Relay were not executing, and non-scheduled workflows were executing with delays. The issue occurred as a result of a code change introduced earlier this year related to the migration of scheduled workflows to new servers. We resolved it by fixing the faulty code, which cleared the problem in our internal message queueing system that was responsible for this impact. In addition, we improved logging and observability to help prevent similar issues from occurring in the future.

Analysis

The investigation showed that the root cause was faulty code which, in specific circumstances, created multiple duplicate workflows that rapidly filled internal message queues. This delayed the execution of non-scheduled workflows and temporarily prevented scheduled workflows from executing correctly. The faulty code was introduced as a performance improvement during the migration of scheduled workflows to new servers earlier this year.
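
As an illustration only, the failure mode described above (duplicate workflow runs filling a queue) is commonly guarded against by making the enqueue step idempotent. The sketch below is not Box's implementation; the names WorkflowRun and DedupingQueue, and the choice of (workflow_id, scheduled_for) as the deduplication key, are assumptions made for the example.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class WorkflowRun:
    # Hypothetical fields: together they identify one intended execution.
    workflow_id: str
    scheduled_for: str  # e.g. an ISO 8601 timestamp


class DedupingQueue:
    """In-memory queue that refuses a run that is already pending."""

    def __init__(self):
        self._queue = deque()
        self._pending = set()

    def enqueue(self, run):
        """Add the run unless an identical one is pending; report whether it was added."""
        if run in self._pending:
            return False  # duplicate: do not grow the queue
        self._pending.add(run)
        self._queue.append(run)
        return True

    def dequeue(self):
        """Remove and return the oldest pending run."""
        run = self._queue.popleft()
        self._pending.discard(run)
        return run

    def __len__(self):
        return len(self._queue)
```

With a guard of this kind, a scheduler that emits the same run many times adds only one message to the queue, so other workflows are not starved by duplicates.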

Corrective Actions

The following corrective actions have been completed or are planned:

  • Corrected the faulty code responsible for this issue.
  • Introduce more testing of this code to reduce the likelihood of similar issues occurring again in the future.
  • Add alerts and metrics that measure the number of scheduled workflow executions, and introduce additional logging to facilitate investigation of similar cases in the future.
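
As a rough sketch of the kind of alert the last item describes, the check below compares how many scheduled runs were expected in a time window with how many actually executed, and flags both shortfalls and surpluses (a surplus could indicate duplicates). The names ExecutionWindow and check_window and the threshold values are hypothetical and chosen for illustration; they do not describe Box's monitoring.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ExecutionWindow:
    expected: int  # scheduled runs that should have started in the window
    executed: int  # scheduled runs that actually started in the window


def check_window(window: ExecutionWindow,
                 min_ratio: float = 0.9,
                 max_ratio: float = 1.1) -> Optional[str]:
    """Return an alert message when the execution count looks wrong, else None."""
    if window.expected == 0:
        # Nothing was due; any execution at all is unexpected.
        return "unexpected scheduled executions" if window.executed > 0 else None
    ratio = window.executed / window.expected
    if ratio < min_ratio:
        return f"too few scheduled executions: {window.executed}/{window.expected}"
    if ratio > max_ratio:
        return f"too many scheduled executions: {window.executed}/{window.expected}"
    return None
```

A check like this would typically run once per window from a monitoring job, with the thresholds tuned to the normal variation in scheduled runs.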

We are continuously working to improve Box and want to make sure we are delivering the best product and user experience we can. We hope we have provided some clarity here and we would be happy to answer any questions you may still have regarding this matter. 

Sincerely,
The Box Team

Posted Oct 25, 2023 - 09:20 PDT

Resolved
After further monitoring, this incident is now considered resolved. Box Relay Services have been restored to full functionality. If you continue to experience any issues, please contact Box Support at https://support.box.com.
Posted Sep 13, 2023 - 22:40 PDT
Update
Our team has taken further steps to remediate this issue. Existing Relay workflows will continue to be processed; however, users will be unable to create any new scheduled workflows until further notice. We will continue to provide updates as they become available.
Posted Sep 13, 2023 - 18:54 PDT
Update
Our team is continuing to monitor for any additional impact. We will continue to provide updates as they become available.
Posted Sep 13, 2023 - 16:39 PDT
Update
Our team is continuing to monitor for any additional impact. We will continue to provide updates as they become available.
Posted Sep 13, 2023 - 13:50 PDT
Monitoring
Our team has taken steps to remediate this issue and we are starting to see improvements to Relay. We are continuing to monitor for any additional impact.
Posted Sep 13, 2023 - 10:03 PDT
Identified
We are continuing to work on a fix for this issue.
Posted Sep 13, 2023 - 09:28 PDT
Update
Our team is still investigating an issue regarding Box Relay. Users may observe delays in their Box Relay workflows. We will provide additional information as it becomes available.
Posted Sep 13, 2023 - 05:42 PDT
Investigating
Our team is investigating an issue regarding Box Relay. Users may observe delays in their Box Relay workflows. We will provide additional information as it becomes available.
Posted Sep 13, 2023 - 04:52 PDT
Identified
Users may observe delays in their Box Relay workflows. We're expecting this to be resolved within the next three hours.
Posted Sep 13, 2023 - 03:53 PDT
This incident affected: Box Relay.