cd /news/ai-infrastructure/incident-with-actions · home topics ai-infrastructure article
[ARTICLE · art-14763] src=githubstatus.com pub= topic=ai-infrastructure verified=true sentiment=↓ negative

Incident with Actions

GitHub Actions hosted runners in the East US region experienced degraded performance on May 5, 2026, from approximately 13:22 UTC to 17:05 UTC, causing 13.5% of standard runner job failures and roughly 16% of Larger Runners with private networking to fail or be delayed by over five minutes. Approximately 8,500 Copilot Code Review requests timed out during the incident, which was triggered by a scale-up operation that hit an internal rate limit when pulling VM images from storage. GitHub has paused all scale operations until system throttling behavior and controls are improved to prevent similar incidents.

read3 min publishedMay 5, 2026

May 5, 17:26 UTC

Resolved - On May 5, 2026, from approximately 13:22 UTC to 17:05 UTC, GitHub Actions hosted runners in the East US region were degraded. 13.5% of jobs requesting a standard runner failed and ~16% of requested Larger Runners with private networking pinned to East US failed or were delayed by more than 5 minutes. Copilot Code Review requests were also impacted. Approximately 8,500 code review requests timed out during this window. Affected users saw an error comment on their pull requests and were able to retry by re-requesting a review. Most runner requests were picked up by other regions automatically, but a portion of requests still routing to East US were impacted.

This was triggered by a scale-up operation for hosted runner VMs in the East US region. This is a regular operation, but the VM create load hit an internal rate limit when VM creates pull images from storage. Existing backoff logic was not triggered because of the response code returned in this case. The rate limiting and VM creation failures were mitigated by reducing load to allow for recovery and allowing queued work to be processed. By 15:34 UTC, queued and failed job assignments were mostly mitigated, with less than 0.5% of runner assignments impacted between 15:34 and full recovery at 17:05.

We are improving our system’s throttling behavior when limits occur, improving our controls to more quickly mitigate similar situations in the future, and reviewing all limits end-to-end for similar operations. We also immediately d all scale and similar operations until these changes are in place and validated.

May 5, 17:11 UTC

Update - Actions is experiencing degraded performance. We are continuing to investigate.

May 5, 17:11 UTC

Update - Standard hosted runners have now reached full recovery. Hosted Runners with Private Networking in the East US region remain degraded as we continue working with our compute provider to restore capacity. Hosted Runners with private networking can fail over to a different Region to mitigate the issue.

May 5, 16:33 UTC

Update - We've seen signs of recovery for Standard Hosted Runners and are continuing to monitor for full recovery. Hosted Runners with Private Networking in the East US region remain affected as we continue working with our compute provider to restore capacity.

May 5, 15:54 UTC

Update - We've applied a mitigation for long queue times and failures on Standard Hosted Runners and are monitoring for full recovery. Hosted Runners with Private Networking in the East US region remain affected as we continue working with our compute provider to restore capacity.

May 5, 15:12 UTC

Update - We are working with our compute provider to alleviate elevated queue times and failures for Actions Jobs running on Hosted Runners in the East US region affecting 10% of runs. Hosted Runners with private networking can fail over to a different Region to mitigate the issue.

May 5, 14:14 UTC

Update - We are investigating elevated queue times and failures on Actions Jobs running on Hosted Runners in East US affecting 8% of runs. Hosted Runners with private networking can fail over to a different Azure region to mitigate the issue.

May 5, 13:48 UTC

Update - We are investigating elevated queue times on Actions Jobs running on Standard Hosted Runners in East US affecting 10% of runs

May 5, 13:37 UTC

Investigating - We are investigating reports of degraded availability for Actions

── more in #ai-infrastructure 4 stories · sorted by recency
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/incident-with-action…] indexed:0 read:3min 2026-05-05 ·