grep -l @gpt-4 /news/*.json | wc -l → 158

GPT-4

mentions 158 · type Organization
sameAs · en.wikipedia.org · www.wikidata.org

// recent coverage 158 mentions

14:00
2026-06-19
arize.com
artificial-intelligence

Why AI token costs don’t tell you if your AI is working

Uber's COO Andrew Macdonald revealed the company exhausted its 2026 AI coding budget in four months, unable to link rising Claude Code token usage to shipped features. The disconnect between token con…

23:32
2026-06-18
dev.to
artificial-intelligence

How has my relationship with AI evolved?

A senior developer describes how their relationship with AI evolved from using GitHub Copilot for mundane tasks to a dual-system workflow where Gemini 2.5 Pro serves as a planner and Claude Sonnet as …

22:40
2026-06-18
dev.to
artificial-intelligence

Many Are Building Cathedrals on Quicksand

A developer warns that AI startups are building on shifting foundations, with model APIs and products being deprecated or retired within months. The post argues that teams should abstract model-specif…

22:20
2026-06-18
dev.to
large-language-models

LLM Self-Preference Bias: How Anonymized Peer Review Fixes It

A developer discovered that a multi-model LLM evaluation panel exhibited self-preference bias, favoring outputs that resembled the judges' own writing style. Research from Panickssery et al. (2024) co…

18:32
2026-06-18
letsdatascience.com
large-language-models

Human-in-the-Loop Strengthens Clinical LLM Accountability

A non-peer-reviewed letter published in the Journal of Medical Internet Research on June 18, 2026, by Zablah, Molina, and Garcia-Loureiro benchmarks three smaller domain-specific LLMs against GPT-4 on…

18:21
2026-06-18
dev.to
developer-tools

Swift VSX Support, Biome Type Inference, Agent Guardrails

The official Swift extension is now published to the Open VSX Registry, enabling auto-installation on Cursor, VSCodium, AWS Kiro, and other LSP-compatible editors without the proprietary VS Code Marke…

16:02
2026-06-18
dev.to
large-language-models

Stop Measuring Agent Infrastructure by Gateway Latency Alone

A developer argues that the industry's focus on LLM gateway latency benchmarks is misguided for agent systems. Production agents require session persistence, cost attribution, model routing, fallback …

10:12
2026-06-18
sjg.io
artificial-intelligence

The Enterprise AI Harness War

The enterprise AI race is shifting from model intelligence to the 'harness'—interfaces, runtimes, connectors, and governance that integrate AI into daily workflows. The author identifies five generati…

17:58
2026-06-17
lesswrong.com
ai-safety

Porting MACHIAVELLI To Inspect

A developer ported the MACHIAVELLI benchmark, which measures unethical AI agent behavior, to the Inspect evaluation framework to make it easier for evaluators to use. The re-implementation is now offi…

08:24
2026-06-17
blog.mozilla.org
machine-learning

Firefox suggests tab groups with local AI (2025)

Mozilla launched an AI tab grouping feature in Firefox in early 2025 that suggests group titles and tabs to add, running entirely locally on the user's device using a small T5-based model fine-tuned o…

// co-occurs with top 8 entities