cd/entity/Sonnet· home entities Sonnet
grep -l @sonnet /news/*.json | wc -l → 69

Sonnet

mentions 69 type Organization page 2/4 feed RSS

// recent coverage 69 mentions

14:58
2026-06-19
dev.to
ai-tools

I Can't Tell If the Model Matters

A developer testing heterogeneous AI code review found that context and prompt quality matter more than model lineage. Running Claude Code with full repo access and adversarial prompts caught all plan…

19:45
2026-06-18
goose-docs.ai
ai-agents

Self-Improving Agents Still Need Humans

Goose, an AI coding agent, still requires human oversight to prevent benchmark overfitting and ensure genuine capability improvements. The team uses Terminal-bench with a weaker model to identify fail…

00:00
2026-06-18
ampcode.com
ai-products

A Faster Librarian

Ampcode's Librarian AI agent is now approximately three times faster and 43% cheaper after switching to OpenAI's GPT-5.5 model with websocket mode, reducing average search latency from 237 seconds to …

17:58
2026-06-17
lesswrong.com
ai-safety

Porting MACHIAVELLI To Inspect

A developer ported the MACHIAVELLI benchmark, which measures unethical AI agent behavior, to the Inspect evaluation framework to make it easier for evaluators to use. The re-implementation is now offi…

09:01
2026-06-17
dev.to
large-language-models

Claude vs ChatGPT for Code Review: Which Is Better?

A developer compared ChatGPT (GPT-4o) and Claude (Sonnet/Opus) for code review, finding that ChatGPT is faster for conversational review of small code pieces while Claude handles larger context window…

12:02
2026-06-15
strangeloopcanon.com
large-language-models

LLM councils show groupthink

An experiment testing LLM councils found that they suffer from groupthink, retaining only about a quarter of unique, high-quality ideas from individual models while favoring consensus ideas. The peer-…

23:17
2026-06-13
byteiota.com
artificial-intelligence

Claude Code v2.1.172: Sub-Agents Can Now Spawn Sub-Agents

Anthropic released Claude Code v2.1.172 on June 10, allowing sub-agents to spawn their own sub-agents up to five levels deep, a change from the previous two-year ban. The feature aims to isolate noisy…

13:51
2026-06-10
testingcatalog.com
artificial-intelligence

Claude Code Managed Agents and model selector for Voice Mode

Anthropic shipped Claude Fable 5, its first Mythos-class model, this week, posting a more than 10% benchmark improvement over Opus but blocking prompts related to cybersecurity, biology, chemistry, an…

00:00
2026-06-08
justin.poehnelt.com
ai-agents

Triaging Gmail with Claude Subagents

Developer Joe Pohnelt created an email triage system using Claude Code subagents to automate Gmail management. The system uses six specialized AI agents for security analysis, relationship mapping, co…

04:30
2026-06-06
dev.to
ai-infrastructure

How to stop your AI bill from surprising you

Prism has released v1.4 Policy + Governance, a new layer that prevents AI cost overruns before they occur. The update introduces per-project policies that can deny specific models, force model selecti…

← prev page 2 / 4 next →
// co-occurs with top 8 entities