AI does exactly what you ask — that's the problem Here is a factual summary of the article: The article argues that modern AI coding assistants like Claude 4.x and GPT-4o have become strictly literal in following prompts, which means they will produce technically correct but unusable code if the prompt lacks sufficient context. To get good results, developers must provide the same level of detail they would give a junior developer, including the tech stack, exact error messages, and explicit constraints. The author recommends structuring prompts with XML tags, placing code before the instruction, and always including what has already been tried when debugging. I asked Claude to add pagination to a product list. Response in 30 seconds: clean, functional, complete. And completely disconnected from the rest of the app. Wrong pagination component we already had one , invented styles, existing filters broken. Technically correct. Unusable as-is. The problem wasn't the AI — it was my prompt. I wrote: "Add pagination to this list." That's exactly what it did. Nothing more, nothing less. Current models Claude 4.x, GPT-4o have dropped the "infer intent" behavior. They take prompts literally. That's progress overall, but it fundamentally changes how you need to prompt for code. A good code prompt isn't about tricks — it's about giving the AI the same context you'd give a junior dev joining the project. I tested and scored dozens of formulations across the four most common coding tasks. Here's what actually works. The structure that applies to everything Before getting into specific cases, there are four elements present in every good code prompt: - Stack/context — language, version, framework, relevant files - Precise task — what you want, phrased as an instruction, not a question - Constraints — what NOT to touch, scope limits - Expected output — diff, full code, explanation only, CVSS score… Anthropic's golden rule sums it up: show your prompt to a colleague with no context. If they'd be confused, the AI will be too. For larger prompts multiple files, complex instructions , XML tags help separate sections. Tests show 30–39% improvement in response quality when prompts are structured with clear tags: