NA1 Outage
Incident Report for QLess
Postmortem

We would like to provide additional detail about the downtime that occurred on June 29, 2021.

What happened?
At 11:35am PDT, the NA1 server environment experienced downtime, which made all QLess applications on that environment temporarily inaccessible.
Duration: 19 minutes.

Cause
The outage was caused by a rare condition that was triggered on one of our backend servers. Customers who tried to reach the site during that period were unable to do so; 100% of them were affected.

Remediation
Upon receiving a monitoring alert notification, QLess engineers restarted the service.

Prevention
QLess has initiated a complete refactoring of the affected application. The new service application will be more fault-tolerant, more highly available, and self-healing.

Posted Jul 01, 2021 - 22:21 PDT

Resolved
QLess services are now operational.
Posted Jun 29, 2021 - 11:44 PDT
Update
We are continuing to investigate this issue.
Posted Jun 29, 2021 - 11:38 PDT
Investigating
We are currently investigating this issue.
Posted Jun 29, 2021 - 11:38 PDT
This incident affected: NA1.