[CHAT] Issue Receiving Chat Attachments [PROD1 and PROD2]

Incident Report for Kustomer

Postmortem

Summary

A change made to improve scaling of Kustomer’s chat infrastructure had an unintentional side effect of preventing customers from sending attachments over chat for approximately 3 hours. While the change affected all pods, the issue principally affected customers in prod2 during morning business hours.

Root Cause

To address a critical need to address scaling up our chat infrastructure, a change was made that inadvertently caused rate limit calculations on chat attachments to false hit limits due to the increased number of instances. Ultimately, the configuration was updated to resolve this.

Timeline

Dec 18, 2024

9:00 PM EST - Chat infrastructure scaled up, affecting rate limit calculations for attachments.

Dec 19, 2024

1:09 AM EST - Clients in Prod2 report that attachments cannot be added to chats.

2:14 AM EST - Engineering was alerted to attachments not working.

3:26 AM EST - Deployment of increase to rate limits initiated, with functionality coming back online as this rolled out.

3:53 AM EST - Incident declared resolved

Lessons/Improvements

  • Testing Enhancements & Alerting - While checkouts of the chat functionality were done as part of standard checkouts to scaling up changes, a gap was revealed to the lack of testing specifically using attachments at scale. Additionally alerting failed to capture this specific issue as it was only a subset of the total chats in progress.
  • Rate limiting [Resolved] - Improvements to Rate Limiting logic have already been rolled out.
Posted 5 months ago. Jan 14, 2025 - 09:33 EST

Resolved

Kustomer has resolved an event limited to the chat channel that caused an error when customers tried to send attachments through chat.

After careful monitoring, our team has determined that all impacted areas are now fully restored. Please reach out to Kustomer Support at support@kustomer.com if you have additional questions or concerns.
Posted 5 months ago. Dec 19, 2024 - 03:55 EST

Update

Kustomer continues to work on an event limited to the chat channel that may cause an error when customers try to send attachments through chat.

We are actively working to address the issue to restore full functionality. Further updates will follow as progress is made.

Please expect additional updates within the next 30 minutes and contact Kustomer Support at support@kustomer.com if you have any further queries.
Posted 5 months ago. Dec 19, 2024 - 03:47 EST

Update

Kustomer is aware of an event limited to the chat channel that may cause an error when customers try to send attachments through chat.

Our team has identified the cause of this issue and are working to implement a resolution. Please expect additional updates within the next 30 minutes, please reach out to Kustomer Support at support@kustomer.com if you have any further queries.
Posted 5 months ago. Dec 19, 2024 - 03:03 EST

Identified

Kustomer is aware of an event impacting chat that may cause an error when customers try to send attachments through chat. This is limited to the chat channel.

Our team is currently working to identify the cause of this issue in an effort to implement a resolution. Please expect additional updates within the next 30 minutes, please reach out to Kustomer Support at support@kustomer.com for any further questions or updates.
Posted 5 months ago. Dec 19, 2024 - 02:33 EST
This incident affected: Prod1 (US) (Channel - Chat) and Prod2 (EU) (Channel - Chat).