On January 25th, 2023 at 10am, an incident occurred that resulted in delayed delivery of webhooks across multiple products. Over 10,000 webhooks were affected by the delay. The incident was identified at 4pm the same day.
An investigation was immediately launched by the above-named parties to determine the cause of the incident. It was discovered that only a single dispatcher was active. A queue of webhooks had built up for the dead dispatchers. The webhook dispatchers were restarted and the issue was resolved. The dispatchers began dispatching webhooks in a first-in-first-out (FIFO) order.
The Microsoft Azure outage that was reported earlier in the day was the source of the corrupted webhook dispatchers.
The webhook dispatchers were restarted, which resolved the issue and webhooks began to be dispatched in a FIFO order.
This incident resulted in delayed delivery of webhooks across multiple products. The cause of the incident was determined to be an Azure outage, and the issue was resolved by restarting the webhook dispatchers. Preventative measures have been put in place to prevent similar incidents from occurring in the future.