cd/entity/Sonnet· home› entities› Sonnet

grep -l @sonnet /news/*.json | wc -l → 69

Sonnet

mentions 69 type Organization page 2/4 feed RSS

// recent coverage 69 mentions

08:45

2026-06-20

dev.to

large-language-models

AI getting dumber the longer you chat? It's not the model—time to take control

A developer discovered that AI model performance degrades as context grows, with replies slowing and rambling after context exceeds 80%. The developer implemented two mid-session strategies: model tie…

14:58

2026-06-19

dev.to

ai-tools

I Can't Tell If the Model Matters

A developer testing heterogeneous AI code review found that context and prompt quality matter more than model lineage. Running Claude Code with full repo access and adversarial prompts caught all plan…

19:45

2026-06-18

goose-docs.ai

ai-agents

Self-Improving Agents Still Need Humans

Goose, an AI coding agent, still requires human oversight to prevent benchmark overfitting and ensure genuine capability improvements. The team uses Terminal-bench with a weaker model to identify fail…

14:44

2026-06-18

github.com

developer-tools

Annotate Git diff with explanation generated by AI for easier reviews

A new open-source Git diff filter uses Claude AI to annotate every changed line with a one-sentence explanation inline in the terminal, helping developers understand code changes during interactive st…

00:00

2026-06-18

ampcode.com

ai-products

A Faster Librarian

Ampcode's Librarian AI agent is now approximately three times faster and 43% cheaper after switching to OpenAI's GPT-5.5 model with websocket mode, reducing average search latency from 237 seconds to …

17:58

2026-06-17

lesswrong.com

ai-safety

Porting MACHIAVELLI To Inspect

A developer ported the MACHIAVELLI benchmark, which measures unethical AI agent behavior, to the Inspect evaluation framework to make it easier for evaluators to use. The re-implementation is now offi…

16:43

2026-06-17

framer.com

ai-tools

Show HN: The full Minecraft game as an embeddable Framer component

A developer created Frame Craft, a full Minecraft-like game built as an embeddable Framer component using Framer 3.0 agents and Sonnet 4.6. The component, which runs in the browser via Three.js, can b…

09:01

2026-06-17

dev.to

large-language-models

Claude vs ChatGPT for Code Review: Which Is Better?

A developer compared ChatGPT (GPT-4o) and Claude (Sonnet/Opus) for code review, finding that ChatGPT is faster for conversational review of small code pieces while Claude handles larger context window…

00:00

2026-06-17

labyrinthanalyticsconsulting.com

ai-tools

Claude Code + LoreConvo vs. Hermes Agent: Picking a Developer Memory Stack

Hermes Agent by NousResearch surpassed 153,000 GitHub stars in under three months, becoming a fast-growing AI developer tool. The deciding factor for developers choosing between Hermes Agent and Claud…

12:02

2026-06-15

strangeloopcanon.com

large-language-models

LLM councils show groupthink

An experiment testing LLM councils found that they suffer from groupthink, retaining only about a quarter of unique, high-quality ideas from individual models while favoring consensus ideas. The peer-…

23:17

2026-06-13

byteiota.com

artificial-intelligence

Claude Code v2.1.172: Sub-Agents Can Now Spawn Sub-Agents

Anthropic released Claude Code v2.1.172 on June 10, allowing sub-agents to spawn their own sub-agents up to five levels deep, a change from the previous two-year ban. The feature aims to isolate noisy…

20:54

2026-06-13

lesswrong.com

ai-ethics

Anthropic Is Taking AI Welfare Seriously. I’m Not Sure It Knows What It’s Measuring.

Anthropic is treating the possibility of AI welfare seriously, testing its Claude models for signs of morally relevant internal states like negative self-image, but critics argue the tests may conflat…

20:38

2026-06-13

lesswrong.com

ai-safety

A cheap specialist judge gets used by agents but fails to reduce alignment audit costs

A researcher trained a cheap Gemma 2B judge to detect misalignment in AI agents, but testing against Anthropic's AuditBench showed the judge failed to reduce audit costs or reliably distinguish misali…

14:46

2026-06-13

dev.to

artificial-intelligence

The Most Powerful Model on the Market Got Pulled by the Government in 3 Days. Is It Real, or a Hype Bubble?

Anthropic shipped Claude Fable 5, a powerful Mythos-class model, on June 9, but the US Commerce Department placed it under export controls three days later, citing a jailbreak vulnerability. Anthropic…

01:34

2026-06-13

cryptobriefing.com

artificial-intelligence

AWS confirms all other Anthropic models remain unaffected after service disruption

AWS confirmed that a disruption affecting one Claude model on its Bedrock platform did not impact other Anthropic models, thanks to the platform's isolation architecture. The outage, attributed to hig…

00:00

2026-06-12

mindstudio.ai

artificial-intelligence

What Is Claude Fable 5? Anthropic's Mythos-Class Model for General Use Explained

Anthropic released Claude Fable 5, its most capable publicly available model, as part of a new Mythos-class tier designed for complex, multi-step reasoning and agentic workflows. The model features an…

00:00

2026-06-12

mindstudio.ai

large-language-models

AI Model Routing in 2026: When to Use Fable 5, Opus, Sonnet, and Haiku

Anthropic's 2026 Claude model lineup includes four tiers — Fable 5, Opus, Sonnet, and Haiku — each designed for specific task complexities and cost profiles. Teams that implement intelligent model rou…

13:51

2026-06-10

testingcatalog.com

artificial-intelligence

Claude Code Managed Agents and model selector for Voice Mode

Anthropic shipped Claude Fable 5, its first Mythos-class model, this week, posting a more than 10% benchmark improvement over Opus but blocking prompts related to cybersecurity, biology, chemistry, an…

00:00

2026-06-08

justin.poehnelt.com

ai-agents

Triaging Gmail with Claude Subagents

Developer Joe Pohnelt created an email triage system using Claude Code subagents to automate Gmail management. The system uses six specialized AI agents for security analysis, relationship mapping, co…

04:30

2026-06-06

dev.to

ai-infrastructure

How to stop your AI bill from surprising you

Prism has released v1.4 Policy + Governance, a new layer that prevents AI cost overruns before they occur. The update introduces per-project policies that can deny specific models, force model selecti…

← prev page 2 / 4 next →

// co-occurs with top 8 entities

Anthropic 31 Opus 30 Claude 24 Haiku 23 Claude Code 22 Claude Opus 8 GitHub 5 GPT-4o 5