The Day DNS Broke Our Deployment: Solving a Serverless Framework S3 Resolution Failure on AWS

wpnews.pro

cd /news/developer-tools/the-day-dns-broke-our-deployment-sol… · home › topics › developer-tools › article

[ARTICLE · art-37660] src=dev.to ↗ pub=2026-06-24T12:03Z topic=developer-tools verified=true sentiment=· neutral

The Day DNS Broke Our Deployment: Solving a Serverless Framework S3 Resolution Failure on AWS

A developer encountered a complete deployment failure of a serverless AI application on AWS due to a DNS resolution error. The error EAI_AGAIN indicated a temporary DNS lookup failure on the local machine, not an AWS outage. Switching to reliable public DNS servers like 8.8.8.8 and 1.1.1.1 resolved the issue, and the team added safeguards such as retry logic and pre-deployment connectivity checks.

read3 min views6 publishedJun 24, 2026

As engineers, we often spend hours optimizing code, improving prompts, and scaling infrastructure.

But sometimes the biggest production issues come from something much simpler.

A DNS lookup.

Recently, while deploying a serverless AI application to AWS, I encountered an error that completely blocked deployment.

The application hadn't changed.

AWS was healthy.

Permissions were correct.

The deployment package was valid.

Yet every deployment failed.

The error looked like this:

Error:
getaddrinfo EAI_AGAIN serverless-framework-deployments-eu-north-1-xxxxxxxx.s3.eu-north-1.amazonaws.com

At first glance, it looked like an AWS outage.

It wasn't.

This is the story of how a simple DNS resolution issue brought an entire deployment pipeline to a halt—and how we fixed it.

The application was an AI-powered Due Diligence Platform built with:

Deployment flow:

Developer
     ↓
Serverless Framework
     ↓
S3 Deployment Bucket
     ↓
CloudFormation
     ↓
Lambda Functions

Every deployment package is first uploaded to an S3 bucket created by the Serverless Framework.

Only after the upload succeeds does CloudFormation update the stack.

During deployment, the terminal suddenly returned:

serverless deploy

✖ Error:
getaddrinfo EAI_AGAIN serverless-framework-deployments-eu-north-1-xxxxxxxx.s3.eu-north-1.amazonaws.com

The deployment stopped immediately.

No Lambda updates.

No CloudFormation changes.

Nothing.

The first thing I checked was AWS Service Health.

Everything was operational.

No incidents were reported.

The next suspect was permissions.

I verified:

aws sts get-caller-identity

Response:

{
  "Account": "123456789012",
  "Arn": "arn:aws:iam::123456789012:user/developer"
}

Credentials were valid.

Permissions were correct.

Still failing.

I upgraded Serverless Framework.

npm install -g serverless

Deployment still failed.

The key clue was:

EAI_AGAIN

This is not an AWS error.

It is a DNS resolution error.

Specifically:

Temporary DNS lookup failure

The operating system could not resolve the S3 endpoint hostname.

The request never reached AWS.

I manually tested DNS resolution:

nslookup google.com

Intermittent failures appeared.

Then:

nslookup s3.eu-north-1.amazonaws.com

The same issue occurred.

This confirmed that the problem existed locally.

Not in AWS.

The machine was using an unstable DNS resolver.

Under heavy network usage, DNS lookups occasionally timed out.

When Serverless Framework attempted to upload artifacts to S3:

Serverless
      ↓
DNS Lookup
      ↓
Failure
      ↓
Deployment Stops

No connection to AWS was ever established.

We switched to reliable public DNS servers.

Linux:

sudo nano /etc/resolv.conf

Added:

nameserver 8.8.8.8
nameserver 1.1.1.1

Then restarted networking:

sudo systemctl restart NetworkManager

After updating DNS:

nslookup s3.eu-north-1.amazonaws.com

Returned instantly.

Deployment succeeded:

serverless deploy

Output:

✔ Service deployed successfully

To avoid future issues, we added several safeguards.

serverless deploy || serverless deploy

Useful for CI/CD jobs when transient network issues occur.

Before deployment:

curl https://s3.eu-north-1.amazonaws.com

If connectivity fails:

Stop deployment

This prevents wasting build minutes on doomed deployments.

Added:

aws sts get-caller-identity

to deployment pipelines.

This immediately detects expired or invalid credentials.

The biggest lesson was simple:

Not every AWS deployment error is actually an AWS problem.

Sometimes:

And the cloud gets blamed.

When deployment issues occur, I now follow this order:

aws sts get-caller-identity
ping google.com
nslookup s3.eu-north-1.amazonaws.com
aws s3 ls
serverless deploy

This process has saved hours of troubleshooting.

As engineers, we often expect complex problems to have complex causes.

This incident reminded me that some of the most disruptive failures originate from the most basic layers of infrastructure.

A single DNS lookup failure stopped an entire deployment pipeline.

The code was correct.

AWS was healthy.

The architecture was sound.

But none of that mattered until the network could resolve a hostname.

Sometimes the fastest fix isn't changing code.

It's understanding where the request actually fails.

Before blaming AWS:

You'll save yourself hours of debugging.

source & further reading

dev.to — original article I Let My AI Agent Build a Bedrock RAG Knowledge Base, Here Are the 2 Mistakes the AWS Agent Toolkit Caught Cara pakai API Claude & DeepSeek dari Indonesia — bayar Rupiah via QRIS (tanpa kartu kredit) How to use the Claude & DeepSeek APIs from Indonesia — pay in Rupiah via QRIS (no credit card)

~/api · this article 200

$curl api.wpnews.pro/v1/news/the-day-dns-broke-our-de…

Read original on dev.to → dev.to/saif_urrahman/the-day-dns-broke-our-deplo…

mentioned entities

AWS

Serverless Framework

CloudFormation

Lambda

Google DNS

Cloudflare DNS

metadata

slugthe-day-dns-broke-our-deployment-solving-a-serverless-framework-s3-resolution-on

topic#developer-tools

secondary1 topics

sentimentneutral

canonicaldev.to

navigation

← prevSimulating the World Inside the …

next →GLM 5.2: why I’m replacing Opus …

── more in #developer-tools 4 stories · sorted by recency

dev.to · 25 Jun · #developer-tools

I Let My AI Agent Build a Bedrock RAG Knowledge Base, Here Are the 2 Mistakes the AWS Agent Toolkit Caught

aws.amazon.com · 25 Jun · #developer-tools

Implementing super resolution by deploying SeedVR2 on Amazon SageMaker AI

aws.amazon.com · 25 Jun · #developer-tools

Building agentic AI applications with a modern data mesh strategy on AWS

letsdatascience.com · 21 Jun · #developer-tools

Moving Inference Workloads from Lambda to SageMaker

── more on @aws 3 stories trending now

wpnews · 19 Oct · #developer-tools

Windows Script to clean up and remove all ASUS software

wpnews · 22 Jun · #generative-ai

Bain tests software takeover targets using vibecoding AI replicas

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required