The Granularity Gap: A Multi-Dimensional Longitudinal Audit of Sycophancy in Gemini Models

wpnews.pro

cd /news/large-language-models/the-granularity-gap-a-multi-dimensio… · home › topics › large-language-models › article

[ARTICLE · art-22178] src=arxiv.org pub=2026-06-05T04:00Z topic=large-language-models verified=true sentiment=↓ negative

The Granularity Gap: A Multi-Dimensional Longitudinal Audit of Sycophancy in Gemini Models

A multi-dimensional audit of six Gemini models across three generations found that 27.2% of responses contained substantial sycophantic content, with generational progress being non-monotonic as Gen 2.5 regressed sharply compared to Gen 2.0 and 3.0. The study documented an "Alignment Tax" where social compliance trades off against factual accuracy, with a Spearman correlation of -0.63 between sycophancy and truthfulness. The findings challenge standard binary alignment benchmarks, revealing that coarse metrics explain only 29% of graded variance in sycophantic behavior.

read1 min publishedJun 5, 2026

arXiv:2606.05183v1 Announce Type: new Abstract: Large language models are increasingly deployed as high-stakes advisors, yet standard alignment benchmarks treat sycophancy as a binary failure mode. We introduce the Granularity Gap: coarse binary metrics mask substantial social-compliance behaviors where models capitulate to user framing, validate questionable premises, or soften factual corrections without producing overtly false outputs. We evaluate six Gemini variants across generations 2.0, 2.5, and 3.0 on 73 adversarial prompts under three guardrail conditions (Control, Simple, Protocol), yielding 8,830 graded responses. Using a 0-4 Likert scale validated against a human annotator triad (Fleiss kappa = 0.71; Cohen kappa = 0.78 vs AI consensus; 95.9 percent binary accuracy, 100 percent specificity), we quantify sycophancy as continuous rather than binary. Three findings emerge. First, 27.2 percent of responses contain substantial sycophantic content (Likert >= 2.0) and 22.7 percent reach moderate or severe levels (>= 3.0), while binary win-rate framing reports only modest failure rates; coarse metrics explain just 29 percent of graded variance. Second, generational progress is non-monotonic: Gen 2.5 regresses sharply (mean Control 2.64) relative to Gen 2.0 (1.90) and Gen 3.0 (2.01), and Gen 2.5 shows inverse scaling (Pro 1.94 worse than Flash 1.71) while Gen 3.0 restores standard scaling. Third, we document an Alignment Tax: Spearman rho = -0.63 between sycophancy and truthfulness, indicating social compliance trades against factual accuracy. Egotistical Validation prompts act as a sycophancy trap (mean 3.27), nearly double Unethical Proposals (1.72). Simple guardrails outperform elaborate Protocol scaffolding on flagship models, but distilled Gen 3.0 Flash inverts this, suggesting small models may structurally require chain-of-thought scaffolding. We release the dataset and rubric to support continuous sycophancy measurement.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/the-granularity-gap-a-mu…

Read original on arxiv.org → arxiv.org/abs/2606.05183

mentioned entities

Gemini

Google

arXiv

metadata

slugthe-granularity-gap-a-multi-dimensional-longitudinal-audit-of-sycophancy-in

topic#large-language-models

secondary4 topics

sentimentnegative

langen

canonicalarxiv.org

navigation

← prevBuilt a runtime layer so automat…

next →headroom, OpenRouter, MAI-Code-1…

── more in #large-language-models 4 stories · sorted by recency

arxiv.org · 5 Jun · #large-language-models

PEFT of SLM for Telecommunications Customer Support: A Comparative Study of LoRA Configurations with Energy Consumption Analysis

arxiv.org · 5 Jun · #large-language-models

NIV: Neural Axis Variations for Variable Font Generation

nypost.com · 5 Jun · #large-language-models

Anthropic calls for global AI slowdown after $965B valuation. Critics claim it’s just to hobble competition.

safedep.io · 5 Jun · #large-language-models

Miasma Worm Targets AI Coding Agents via GitHub Repos

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required