Sandbox Answers Overview Delays
Incident Report for Yext
Postmortem

Summary

On January 6, 2021, at 11:23 a.m. ET, a server that processes logs for the Sandbox Answers Overview page stopped functioning. As a result, some portions of the overview page could not display the latest data, which also affected Hitchhiker training. Engineers restored service at 12:45 p.m. ET. No data was lost as a result of this outage, and production services were unaffected.

Root Cause

A downstream service had an incorrect configuration that caused it to run out of resources, halting the data processing pipeline. Once the service was restored, processing resumed and the backlog of logs was cleared.

Remediation

We will update our configuration delivery mechanism to remove the incorrectly provisioned settings. Additionally, future configuration updates will require manual review before they are applied, to reduce the likelihood of errors.

Posted Jan 14, 2021 - 18:28 EST

Resolved
This incident has been resolved.
Posted Jan 06, 2021 - 13:53 EST
Monitoring
We have mitigated the delays, and data processing is now up to date on the Sandbox Answers Overview page. We will continue to monitor for any regressions.
Posted Jan 06, 2021 - 12:56 EST
Update
We are continuing to investigate this issue.
Posted Jan 06, 2021 - 12:18 EST
Investigating
We are currently investigating delays in data processing for our Answers Overview page in the Sandbox environment. Some Hitchhiker flows may be delayed at this time.
Posted Jan 06, 2021 - 12:17 EST
This incident affected: Sandbox.