UK Government

mentions 2 type Person feed RSS

// recent coverage 2 mentions

04:00

2026-07-09

arxiv.org

large-language-models

Healthier LLMs: Retrieval-Augmented Generation for Public Health Question Answering

Researchers extended PubHealthBench, a public health QA benchmark, into a retrieval-augmented setting and found that hybrid retrieval consistently improves recall and ranking quality. Providing retrie…

15:26

2026-05-29

dev.to

ai-safety

The UK Government Just Merged This Open-Source AI Security Benchmark Into Their National Evaluation Framework

The UK Government's AI Safety Institute has merged the open-source AgentThreatBench benchmark into its official inspect_evals framework, which is used to evaluate frontier AI models from OpenAI, Anthr…

// co-occurs with top 8 entities

AI Safety Institute 1 AgentThreatBench 1 inspect_evals 1 OpenAI 1 Anthropic 1 Google DeepMind 1 OWASP Agentic Security Initiative 1 PubHealthBench 1