GPT-3.5-Turbo drops from 90% accuracy to 50% when the answer sits in the middle of a 20k-token prompt instead of the sta

wpnews.pro

cd /news/large-language-models/gpt-3-5-turbo-drops-from-90-accuracy… · home › topics › large-language-models › article

[ARTICLE · art-22457] src=dev.to ↗ pub=2026-06-05T11:02Z topic=large-language-models verified=true sentiment=↓ negative

GPT-3.5-Turbo drops from 90% accuracy to 50% when the answer sits in the middle of a 20k-token prompt instead of the sta

OpenAI's GPT-3.5-Turbo model drops from 90% accuracy to 50% when the correct answer is placed in the middle of a 20,000-token prompt rather than at the start or end, according to research by Liu et al. (2023) published at ACL. The performance degradation stems from the transformer's attention pattern, which biases toward recent tokens and salient prefixes, causing signal dilution when information is buried in long sequences.

read1 min views14 publishedJun 5, 2026

GPT-3.5-Turbo drops from 90% accuracy to 50% when the answer sits in the middle of a 20k-token prompt instead of the start or end. Liu et al. (2023) documented this in "Lost in the Middle: How Language Models Use Long Contexts" at ACL. The edges of your context window are prime real estate. The middle is a graveyard.

This is not a retrieval bug. It is an attention pattern. Transformers use soft attention across the full sequence, but positional encodings and training distributions bias the model toward recent tokens and salient prefixes. When you stuff a long JSON array or a chunked document into the prompt, the signal dilutes. The model attends to the framing, not the buried row at index 847. Attention weights decay toward the center in long sequences because the training corpus rarely requires mid-span reasoning over 20k tokens.

Picture a RAG pipeline in rag_engine.py

where you dump ten retrieved chunks into a single prompt. You sort by cosine similarity and concatenate. Chunk five holds the exact clause that answers the user, but it sits between chunks four and six. Your generation fails. The fix is not a larger context window. The fix is re-ranking

source & further reading

dev.to — original article I Ran 10+ AI Coding Agents in Parallel. The Bottleneck Wasn't the AI. Read-only Postgres access can still take down your application The Cold-Start Problem for Agent Evals: What to Gate on Day One With Zero Labeled Data

~/api · this article 200

$curl api.wpnews.pro/v1/news/gpt-3-5-turbo-drops-from…

Read original on dev.to → dev.to/a3e_ecosystem/gpt-35-turbo-drops-from-90-…

mentioned entities

GPT-3.5-Turbo

Liu et al.

ACL

metadata

sluggpt-3-5-turbo-drops-from-90-accuracy-to-50-when-the-answer-sits-in-the-middle-of

topic#large-language-models

secondary4 topics

sentimentnegative

canonicaldev.to

navigation

← prevI Ran My Business With AI for 30…

next →LLM-Free Multi-Agent Memory Arch…

── more in #large-language-models 4 stories · sorted by recency

dev.to · 21 Jul · #large-language-models

Probabilistic Graph Neural Inference for circular manufacturing supply chains during mission-critical recovery windows

hackaday.com · 21 Jul · #large-language-models

Neural Net Reads the Gas Meter

ca.finance.yahoo.com · 21 Jul · #large-language-models

University of Tennessee sues Anthropic over neural network technology

dev.to · 21 Jul · #large-language-models

I Finally Understood Why Neural Networks Need Activation Functions

── more on @gpt-3.5-turbo 3 stories trending now

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 26 May · #ai-agents

Think, Durable Objects, and the Real Shape of AI Applications

wpnews · 8 Jul · #ai-tools

What's the Future of Clay?

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required