Batch Processing & Business Reporting were unavailable for 4 hours and 50 minutes, delaying Customer batch jobs and preventing Customer reports from being run.
October 1st, 9:30pm CEST to October 2nd, 4:20am CEST
On October 2 @ 12:30am CEST, the start of the FlexNet Manager Suite 2020 R1.2 Production Deployment was delayed because an Inventory Database process was still running. Technical teams were unable to proceed because this process was in a rollback state. As a result, the maintenance window was extended by 2.5 hours.
Once the Production Deployment was finished, technical teams performing environment health checks found that Business Reporting was still offline. Technical teams also received automated failure emails advising that the automated deployment process had missed updating a number of components. As a result, batch processing error rates exceeded normal thresholds, triggering alert notifications. The Beacon policy update also failed due to a certificate issue. Technical teams found that importing the updated private key had failed; this was corrected by reverting to a method to update the certificate.
Because of the issues found during health checks, Batch Processing was disabled while teams investigated. Technical teams found and fixed a deployment automation issue, the updated components were re-deployed successfully, and Batch Processing was re-enabled.
Technical teams then discovered that around 20% of Customer batch jobs containing Citrix evidence were failing. The SQL responsible for the failures was identified. Technical teams analysed the Customer Citrix evidence triggering the batch failures and updated the SQL to address the unanticipated Customer use cases. This was implemented as a hot-fix and successfully deployed.
After health checks were completed, services were confirmed restored on Friday, October 2 @ 4:20am CEST.