June-July 2026 AI Security via Formal Methods

wpnews.pro

cd /news/ai-safety/june-july-2026-ai-security-via-forma… · home › topics › ai-safety › article

[ARTICLE · art-47268] src=lesswrong.com ↗ pub=2026-07-03T12:32Z topic=ai-safety verified=true sentiment=· neutral

June-July 2026 AI Security via Formal Methods

A new position paper on using formal methods for AI security focuses on model weight confidentiality and integrity through infrastructure hardening, with a minimal and uncontroversial approach. The UK's Advanced Research and Invention Agency (ARIA) is hiring for a £20m AI/formal methods/cybersecurity funding call, and Anthropic's Preparedness team is reportedly seeking formal methods talent for security moonshots.

read2 min views1 publishedJul 3, 2026

Last month, I said I would do a bigger writeup of Nora’s funding call. I did not. But she is hiring currently, and I want you to take a look at the job posting.

I’m hiring someone to help me drive our upcoming £20m AI/FM/cybersec funding call:

[https://aria.pinpointhq.com/postings/f1288172-37fe-4da5-96ed-de7e719d65e8]This person will work closely with the funded teams, help drive the sprint cadence, sharpen our perspective on targets/threat models/security specs, and pave the way towards high-impact demonstration and translation!

I’m no longer going to try to do the kind of newsletter that claims to attempt completeness over the happenings that fall in its jurisdiction. Instead, I’ll just poke you guys with whatever I happened to catch wind of in the day-to-day carrying out my duties, responsibilities, and third synonyms.

Can We Secure AI With Formal Methods? is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.

Typically when someone writes a position paper about formal methods as an AI safety technology, they’re very bombastic about it. So a bunch of us got together and decided we’d make a new position paper, one that is minimal and uncontroversial instead of maximal and scifi. We focus mostly on model weight confidentiality and integrity through infrastructure hardening. Each problem has a solution sketch

Still welcoming contributions. There’s also native comments on the website. You can have a pdf if you want, but the website is the first-class usecase.

Please get me invited onto podcasts to talk about this.

They don’t literally say it in the job description, but I have it on good authority that this JD is super FM pilled. There may be more explicit JDs in the future. This was roughly telegraphed months ago by “leveling up across the board” section of the frontier roadmap, though again no literal name-drop of FM.

This is, as far as you’re concerned, a glorified rumor (I have less confidence in it panning out than the Anthropic case). But I have it on good authority that Preparedness (prepared for what, exactly?) would like to get FM pilled, has gestured in the direction of adding FM talent for security moonshots. You’ll have to email me for the warm intro on this one.

They’re gonna do it cuz of the incentive prisons they built for themselves, it seems right to help them lock down their infra with FM. But I want it on the record that I think the whole thing is somewhere between silly and barbaric.

Gwern’s Scaling Hypothesis is one of the most influential documents in 21st century science communication. He wants you to think through a SWE version of scaling laws via Lean. I haven’t read this yet, so no commentary.

Fill in email here to get direct to your inbox! reminder that there are Zero perks to being a Paid subscriber, but it does help me spend more time on the newsletter.

source & further reading

lesswrong.com — original article The Reverse AI Box Announcing the Safe Pareto Improvements (SPI) Fundamentals Program Fable #6: The Return of the King

~/api · this article 200

$curl api.wpnews.pro/v1/news/june-july-2026-ai-securi…

Read original on lesswrong.com → www.lesswrong.com/posts/jq5gjS9dtorwYvpTD/june-j…

mentioned entities

ARIA

Anthropic

Nora

Gwern

Lean

Scaling Hypothesis

Preparedness

metadata

slugjune-july-2026-ai-security-via-formal-methods

topic#ai-safety

secondary4 topics

sentimentneutral

canonicallesswrong.com

navigation

← prevWhy Perplexity's founder doesn't…

next →Lee to review mega chip cluster …

── more in #ai-safety 4 stories · sorted by recency

thenextweb.com · 3 Jul · #ai-safety

Alibaba bans Claude Code after Anthropic is caught tracking Chinese users with hidden code

wired.com · 3 Jul · #ai-safety

Google DeepMind Unionization Talks Are Off to a Rocky Start

pub.towardsai.net · 3 Jul · #ai-safety

Claude Fable 5, Explained: Why Anthropic Ships its Most Powerful Model in Two Versions

letsdatascience.com · 3 Jul · #ai-safety

Anthropic Launches Claude Apps Gateway For Bedrock And Google Cloud

── more on @aria 3 stories trending now

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 1 Jul · #ai-infrastructure

My Notes After Databricks Data and AI Summit 2026

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required