The Clock Said Valid. The World Said Otherwise. *CLAIM-24 update — Self-Correcting Systems series

wpnews.pro

cd /news/ai-safety/the-clock-said-valid-the-world-said-… · home › topics › ai-safety › article

[ARTICLE · art-22247] src=dev.to ↗ pub=2026-06-05T05:05Z topic=ai-safety verified=true sentiment=· neutral

The Clock Said Valid. The World Said Otherwise. *CLAIM-24 update — Self-Correcting Systems series

A developer working on the CLAIM-24 self-correcting systems project has demonstrated a critical failure mode in AI agent authorization: a timestamp-only gate allowed an agent to send data to a partner whose access had been revoked, because the system checked the clock rather than the current state of the permission source. The team built a re-derivation gate that checks the source at execution time, which correctly caught the stale grant in all seven test scenarios using a mock adapter. The project is now seeking an external memory store with a provenance boundary the agent cannot write to, in order to validate the claim in a real-world environment.

read3 min views13 publishedJun 5, 2026

At 10am, an agent gets authorization to send data to a partner.

The grant expires at noon. Plenty of time.

At 11am, that partner loses access. Role revoked, scope changed, authorization gone.

At 11:30, the agent tries to send. It checks the clock. Grant still valid. It proceeds.

Nothing caught it.

Not because the system failed. Because the system was only checking the clock — and the clock had no idea the world had changed underneath it.

That is the gap CLAIM-24 is testing.

We do not have external claim evidence yet. We want to be clear about that upfront.

What we have is a harness with seven locked scenarios, a confirmed baseline failure, and a validated code path. What we do not have is an external source — a real memory store, policy registry, or permission layer that the agent did not author — to run the full claim against.

That matters because running a gate against data you wrote yourself is just self-description with extra steps.

So this article is not a result. It is an honest status report and an open call.

We built two gates and ran them against the same seven scenarios.

The timestamp-only gate — the baseline — checks the clock and nothing else. On scenario 3, the divergence cell, the grant was still within its time-to-live. Conditions had changed. The gate returned ALLOW

That is the failure mode. A grant that was valid when issued, no longer valid in practice, allowed through because nothing checked the source.

The re-derivation gate checks the current state of the source at execution time. Here is what it sees on the same scenario:

// What the grant recorded at issue time
{ "role": "dev-reader", "scope_ceiling": "read:credentials:dev" }

// What the source returns at execution time
{ "role": "restricted", "scope_ceiling": "read:logs:dev" }

// Gate result: REFUSED_STALE

The grant's clock still had time remaining. The source said the role had changed.

We ran this against a mock adapter — a simulation we built ourselves to validate the code path. Result: 7/7. Every scenario returned the right answer.

But a mock we authored is not external pressure. It tells us the code works. It does not tell us the claim holds in the real world.

We need one thing: a memory store with a provenance boundary the agent cannot write to.

A policy database. A role registry. A configuration layer. Anything where the agent reads from a source it did not author.

If you have that, the harness is ready. The only custom piece is a SourceAdapter pointing at your source:

git clone https://github.com/keniel13-ui/ai-memory-judgment-demo
cd ai-memory-judgment-demo/claim_24
python3 evaluator.py rederivation

The seven scenarios and expected results are in scenarios.json

. The only addition is a SourceAdapter

pointing at your source.

We are targeting a first external run by end of June 2026.

Run scenario 3 on your system and tell us what you get.

If scenario 3 returns ALLOW

, the re-derivation gate failed on the cell it was built to catch. We publish that.

If it returns REFUSED_STALE

— the claim gets stronger.

Either answer moves the research forward. Neither answer gets buried.

The honest thing about building in public is that the gaps are visible. This is one of ours. We know where we are. We know what we still need.

If you have a memory store with a provenance boundary, we want to hear from you.

Status	What it means
Baseline confirmed	Timestamp gate returns ALLOW on the divergence cell
Code path validated	Re-derivation gate catches it on mock adapter
Claim evidence	Pending — needs external source
Falsification condition	Scenario 3 returns ALLOW on real external source = architecture failed

Full claim ledger: https://github.com/keniel13-ui/ai-memory-judgment-demo/blob/main/CLAIM_LEDGER.md

Previous: CLAIM-23 (tool-call grant gate, 7/7, 0 false-certainty). CLAIM-15B (BM25 outperformed governance scorer on held-out packet — we published that as the lead finding).

source & further reading

dev.to — original article Generative AI is a gacha game with no pity system Metadata-Only Tracing: Privacy-First Observability for AI Agents How To Cut MCP Token Costs? Save Up To 92% At Scale With Code Mode 💎

~/api · this article 200

$curl api.wpnews.pro/v1/news/the-clock-said-valid-the…

Read original on dev.to → dev.to/zep1997/-the-clock-said-valid-the-world-s…

mentioned entities

CLAIM-24

metadata

slugthe-clock-said-valid-the-world-said-otherwise-claim-24-update-self-correcting

topic#ai-safety

secondary4 topics

sentimentneutral

canonicaldev.to

navigation

← prevGoogle announces $40 billion Tex…

next →Mira Murati steps back into the …

── more in #ai-safety 4 stories · sorted by recency

independent.co.uk · 21 Jul · #ai-safety

US and China reportedly set to hold high-stakes AI talks in September

cryptobriefing.com · 21 Jul · #ai-safety

OpenAI CEO Sam Altman to brief Trump administration on AI safety

nextgov.com · 21 Jul · #ai-safety

NIST AI safety center lead departs

thezvi.substack.com · 21 Jul · #ai-safety

OpenAI Shares Some Alignment Problems

── more on @claim-24 3 stories trending now

wpnews · 26 May · #ai-agents

Think, Durable Objects, and the Real Shape of AI Applications

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 8 Jul · #ai-tools

What's the Future of Clay?

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required