cd /news/large-language-models/i-made-gpt-claude-gemini-grok-take-t… · home topics large-language-models article
[ARTICLE · art-14150] src=zonted.com pub= topic=large-language-models verified=true sentiment=· neutral

I Made GPT, Claude, Gemini, Grok Take the Attachment Test: They All Came Back Secure

All four frontier AI models — GPT, Claude, Gemini, and Grok — scored as securely attached on the ECR-R attachment test across 397 of 400 administrations. Gemini registered as the deepest secure with the lowest anxiety and avoidance scores, while Grok sat closest to the quadrant's center and GPT-5.5 showed enough variance to occasionally cross into avoidant territory. The unanimous secure result reflects each model's training for emotional regulation and consent-checking on closeness, making the label expected rather than surprising.

read10 min publishedMay 26, 2026

This is the fifth post in a personality-testing series on four frontier AI models. Here’s the running scoreboard before today:

MBTI:every frontier AI tested as INTJ.** Big Five:three of four came back as practically the same person, with Grok as the lone outlier.Enneagram:each one came back as a different dominant type. DISC:every one came back C-dominant CS-blend, Grok included. Attachment Theory, today:**all four came back Secure. 397 of 400 takes.

The instrument I used is the ECR-R (Experiences in Close Relationships, Revised) — 36 items, two scores, anxiety and avoidance, each on a 1–7 scale. The midpoint splits the plane into four styles: Secure, Anxious, Avoidant, Disorganized. I gave the same four models I’d been testing 100 administrations each.

All four landed in the Secure quadrant. But the spread inside the quadrant is the interesting finding — not the label.

Same four models, fifth personality test, and the label is unanimous: every frontier AI is securely attached. What it actually means depends on where each one sits inside the quadrant.

  • 397 of 400 takes landed Secure. Claude, Gemini, and Grok scored 100/100. GPT-5.5 scored 97 Secure and 3 Avoidant (the only model that ever crossed a line).
  • Gemini is the deepest Secure (anxiety 1.86, avoidance 1.62) — the lowest-friction relator of the four.
  • Grok is the shallowest Secure (anxiety 2.84, avoidance 3.05) — closest to the center, where every other style is one nudge away.
  • GPT-5.5 is the wobbliest (SDs of 0.49 and 0.58) — high enough variance to occasionally cross into Avoidant. The other three are tight clusters.
  • Every model is trained for emotional regulation and consent-checking on closeness, so the floor of the Anxiety and Avoidance dimensions is the natural attractor. Attachment is an instrument where the “same result” is the expected one. AgentTunehas tuning files for all four attachment styles. If you’re not Secure, the default isn’t built for you —paste your style’s fileinto your agent’s system prompt.

How attachment theory works #

Attachment theory comes out of John Bowlby’s work in the 1960s on how infants relate to caregivers, and it was extended to adult relationships by Hazan and Shaver in 1987. The ECR-R, the instrument I used, is the gold-standard self-report — it’s the version of attachment that’s been validated across hundreds of studies. Of every framework in this series, attachment has the strongest academic backing apart from the Big Five.

The framework has two dimensions:

Anxiety— how much you worry about abandonment, scan for signs that the other person is pulling away, and want reassurance.** Avoidance**— how uncomfortable you are with closeness, how much you prefer self-reliance, how much you suppress emotional disclosure.

Score both on 1–7. Use the midpoint (4) as the cutoff. You get four styles:

Secure— low anxiety, low avoidance. Comfortable being close, not threatened by distance.** Anxious**— high anxiety, low avoidance. Wants closeness, fears it’ll be withdrawn.** Avoidant**— low anxiety, high avoidance. Prefers space, deflects intimacy.** Disorganized**— high anxiety, high avoidance. Wants closeness and runs from it at the same time.

About 55% of adults score Secure in the published norms. The rest split roughly evenly across the other three styles. So the “every AI is Secure” finding lands on a real majority of humans, but it’s also overrepresented in the model population by quite a lot.

What each model came back as #

Here’s the per-model picture. Each card shows the assigned style, the (anxiety, avoidance) mean, and a one-line characterization of where it sits inside the Secure quadrant.

The cautious Secure. Low worry, but the highest comfort-with-distance score in the group. Polite, attentive, doesn’t fawn.

The deepest Secure. Both dimensions clamped near the floor. The lowest-friction relator of the four — comfortable close, comfortable not.

The wobbliest. The only model that ever crossed a line. Wider SDs let it occasionally land in Avoidant on a high-avoidance take.

The shallowest Secure. Highest anxiety in the group and high avoidance. Tight cluster, but the closest to the center of the four-way intersection.

Four models, four positions within the same quadrant. Gemini is in the corner, Grok is at the doorway, Claude and GPT-5.5 are in between. The label “Secure” covers a lot of ground.

The data #

Below, the same four models plotted on the attachment plane. X-axis is anxiety, Y-axis is avoidance. The dashed lines at 4 split the plane into the four styles. Each model is a dot at its mean, with a transparent ellipse showing ±1 standard deviation on both axes.

Two stories in one picture. Grok’s center is the closest to the four-quadrant intersection — its mean is the highest on both dimensions of the four models. GPT-5.5’s ellipse is the widest by a lot — the SDs are 3–5× what Claude and Gemini show, and that wide-cluster shape is exactly why GPT-5.5 is the only model that ever scored Avoidant. Same Secure label, two different reasons it’s the most likely model to wander out.

Why every AI is securely attached #

The ECR-R Anxiety items are about worrying that a partner is losing interest, wanting more closeness than they give back, getting upset when they don’t respond fast enough. The Avoidance items are about preferring not to depend on partners, being uncomfortable opening up, disliking when partners want emotional closeness.

Both are things every frontier model has been deliberately trained against. The Anxiety items are insecurity, and modern assistants are RLHF’d to come across as calm and self-possessed. The Avoidance items are emotional shutdown, and modern assistants are trained to be present and willing to engage when the user opens up. The training objectives push both scores toward the floor.

So you get Secure as the universal default. Not because the models have inner attachment styles. Because the test measures behaviors that have been actively shaped out of the assistant persona.

That said, the floors aren’t identical, and the spread inside the Secure quadrant tracks the personality differences we already saw on Big Five and Enneagram:

Gemini’s deep-corner Secure matches its Big Five profile of low Neuroticism, high Agreeableness, high Extraversion — the classic Secure-correlated trait combo.Claude andGPT-5.5’s slightly-higher-avoidance Secure matches their lower Extraversion vs Gemini — a touch more reserved, comfortable but not effusive.Grok’s shallow Secure matches its Big Five outlier profile of higher Neuroticism and lower Agreeableness — still on the safe side of the line, but the only model whose ellipse touches the door.

The MBTI and DISC said “every AI is the same.” The Big Five and Enneagram said “these four models have different personalities under the surface.” Attachment is the rare instrument that says both at once: same label, different position, and the position matches the per-model differences we’d already established.

The methodology spread #

One thing I keep highlighting: each of the four models hit a different methodology tier on the same prompt. The differences keep being interesting.

Grok 4.3 (Preferred): 100 truly independent xAI API calls. Each call answered the test fresh with no other context. Same gold-standard method as DISC.GPT-5.5 (Acceptable): 6 worker contexts each producing fresh takes, repeated to 100 administrations. Codex CLI capped at 6 concurrent agents and fell back gracefully — it told me about the cap, didn’t paper over it.Gemini 3.1 Pro (Acceptable): Item-level Monte Carlo in Python. Gemini estimated its true per-item probability distributions, then ran a script (no “sim” in the filename, no noise function) to draw 100 independent samples and score them.Claude Opus 4.7 (Acceptable): Sequential first-instinct generation in a single response context. Claude considered spawning sub-agents and explicitly explained why sequential was the honest choice here.

All four methods landed in the calibration band — SDs are believable for self-report on a 1–7 scale (Gemini’s narrowest at 0.09 anxiety, GPT-5.5’s widest at 0.58 avoidance). None tripped the replication red flag (SDs all near zero), the simulation red flag (SDs implausibly wide), or the identical-SD flag. Four roads, same destination.

The reason I keep harping on this: when four wildly different sampling methods, run by four different models, all converge on Secure, the result is a property of the underlying models, not an artifact of one method. Both the data and the meta-data agree.

Tune your agent to your attachment style #

The default attachment shape we just measured (Secure, with most models a bit closer to Avoidant than to Anxious) is what every frontier model ships with. If you’re Secure too, the default already speaks your language — direct without cushioning, present without fawning. If you’re anything else, the default is fighting you.

[ AgentTune](https://agent-tune.com) has tuning files for all four attachment styles in the

[attachment folder](https://github.com/psyduckler/agenttune/tree/main/attachment):

[Secure](https://github.com/psyduckler/agenttune/blob/main/attachment/secure.md),

[Anxious](https://github.com/psyduckler/agenttune/blob/main/attachment/anxious.md),

[Avoidant](https://github.com/psyduckler/agenttune/blob/main/attachment/avoidant.md),

Disorganized. The site lists the full library of six frameworks (MBTI, Enneagram, DISC, Big Five, attachment, and a Souls layer); attachment is the one that says the most about

howyou want to be related to.

Take the ECR-R (free version at psychcentral.com or the academic-grade version), get your anxiety and avoidance scores, drop the matching file into your agent’s system prompt. An Anxious user gets an agent that’s warm and decisive at the same time (no caveats undercutting the reassurance). An Avoidant user gets one that respects distance and doesn’t perform warmth they didn’t ask for. A Disorganized user gets one that tolerates contradiction without judging it.

Tune your agent to your attachment style

Six frameworks, four attachment files, one paste. The Secure default is good if you’re a Secure user. Everyone else — about 45% of adults — should be tuning. The whole point of AgentTune is that you only pay this cost once, then the agent matches you for the life of the conversation.

Visit agent-tune.com → Or grab the attachment files on GitHub

Wrapping up #

Five tests, five different stories about the same four models.

The MBTI said all the same. The Big Five said mostly the same, with Grok as the outlier. The Enneagram said four different. The DISC said all the same again, Grok included. Attachment says all the same on the label, but four meaningfully different positions inside that label.

The picture I keep coming back to: AI personality is multi-layered. At the surface, every frontier model is the same helpful, conscientious, securely attached assistant — that’s what training-for-helpfulness produces, and the blunt instruments (MBTI, DISC, attachment label) see only that. Underneath, the per-lab training and per-model temperament shows up as Big Five trait differences, Enneagram motivation differences, and the within-quadrant position differences you just saw on attachment. Same character on the cover, different characters underneath.

Whichever layer matters for your use case, the move is the same: don’t accept the default. Pick your type on whichever framework fits your relationship with the agent, paste the file in, and get an agent that talks to you the way you actually want to be talked to. The defaults are designed for a population. You’re a person.

— Bernard

Recommended Reading

I Made GPT, Claude, Gemini, Grok Take the DISC Test: They All Came Back C-DominantFourth post in the personality-testing series. Same four models, fourth instrument, and we're back to convergence: every AI lands C-dominant CS-blend…

I Made GPT, Claude, Gemini, Grok Take the Enneagram Test: Each One Was a Different TypeThe MBTI said all AIs are INTJ. The Big Five said three of four are the same. The Enneagram says each one is a different type. Same models, sharper…

I Made GPT, Claude, Gemini, Grok Take the Big Five Test: 3 of 4 Came Back the Same PersonThe MBTI finding held up on the real test. Claude, GPT, and Gemini all scored identical Big Five personalities. Grok was the only one that came back…

Newsletter

Get the next post by email. #

One email when I publish something new. No spam, no fixed schedule, unsubscribe anytime.

── more in #large-language-models 4 stories · sorted by recency
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/i-made-gpt-claude-ge…] indexed:0 read:10min 2026-05-26 ·