Summary
Starting from Tuesday, Jan 24, 2023, 05:15 UTC, all data sent to our data platform in the EU region failed to be ingested. No exception was thrown.
The event was caused by a malfunction in an internal Azure platform component.
Once the issue was reported, we notified the vendor, who fixed it internally.
Incident Timeline
Root Cause Analysis
An internal Azure platform error prevented data from being ingested, and no error message was raised. The vendor's engineering team fixed the problem internally. After the issue was resolved, it took several hours to consume the data backlog that had accumulated while ingestion was down.
Steps Done
A dedicated alert was added to both the vendor’s platform and the checkpoint data platform.
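Because the failure was silent (no exception was thrown), an alert of this kind has to look for missing ingestion rather than for an error. The report does not describe how the alerts were actually implemented, so the following is only a minimal sketch of one possible check. The function name, its parameters, and the 30-minute threshold are all hypothetical and would need to be adapted to the metrics the platform really exposes.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical threshold: how long ingestion may stay silent before alerting.
MAX_INGESTION_GAP = timedelta(minutes=30)


def ingestion_stalled(last_ingested_at, records_sent_since, now=None,
                      max_gap=MAX_INGESTION_GAP):
    """Return True when data was sent but nothing was ingested within max_gap.

    last_ingested_at   -- timezone-aware time of the last successful ingestion
    records_sent_since -- number of records sent after that time (assumed metric)
    """
    if now is None:
        now = datetime.now(timezone.utc)
    # No traffic means there is nothing to ingest, so silence is expected.
    if records_sent_since == 0:
        return False
    # Traffic exists but ingestion has been quiet for too long: raise the alert.
    return now - last_ingested_at > max_gap


# Illustrative values only, loosely based on the incident start time.
last_ok = datetime(2023, 1, 24, 5, 15, tzinfo=timezone.utc)
check_time = datetime(2023, 1, 24, 6, 0, tzinfo=timezone.utc)
print(ingestion_stalled(last_ok, records_sent_since=1200, now=check_time))
```

Comparing sent traffic against ingestion recency, instead of waiting for an error, is what lets a check like this catch failures where the platform accepts data without reporting a problem.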