cd /news/ai-agents/openclaw-collaborates-with-nvidia-fo… · home topics ai-agents article
[ARTICLE · art-22895] src=openclaw.ai pub= topic=ai-agents verified=true sentiment=· neutral

OpenClaw Collaborates with NVIDIA for Stronger Agent Skill Security

OpenClaw has partnered with NVIDIA to strengthen security for AI agent skill files, integrating NVIDIA's SkillSpector scanner and Skill Card trust artifacts into its ClawHub registry. The collaboration aims to address "agentic risk" — threats that traditional malware scanners miss, such as skills that misrepresent their behavior or cause unintended damage. Initial analysis of the combined scanning pipeline found that three independent scanners flagged only 0.69% of skills in common, revealing that most security risks are detected by only a single tool.

read5 min publishedJun 1, 2026

Agent skill files have a reputation for being insecure, and that reputation is earned.

When we launched ClawHub alongside OpenClaw, we were immediately targeted by actors who tried to publish skills bundling known malware. We partnered with VirusTotal to flag those skills and ban the publishers automatically.

Traditional malware scanning is a relatively solved problem. Identifying agentic risk is not. A skill can claim to summarize your logs while bundling a script that ships them off your machine. A well-meaning skill can point your agent at a CLI that wipes production on the wrong flag. Neither of those is malware in the classic sense, and neither is something a virus scanner was built to catch.

So before installing a skill, you really want to know three things:

  • What it claims to do.
  • Whether the bundled code actually matches that claim.
  • What the blast radius looks like if something goes wrong.

Answering those questions at the ecosystem scale is core to ClawHub’s mission. To take full advantage of OpenClaw, our users need to trust that the skills and plugins they install have been thoroughly vetted. Today we’re sharing how we do that work, what we found when we measured it, and a public dataset so the rest of the community can build on it.

The ClawScan Pipeline #

Our first attempt at building trust was a Codex agent prompted to look for OWASP agentic risks. It worked, and it caught real bad actors. But it was a closed-source effort, and the agentic-risk problem is too new and too fast-moving for any single registry to defend on its own.

So we’re now collaborating with NVIDIA on its verified agent skills initiative, doing the work in the open. Every skill that flows through ClawHub passes a pre-catalog verification gate before it is ever published:

When a new skill version is published, an OpenAI Codex agent receives the output of three independent scanners as context: our static analysis, VirusTotal, and NVIDIA SkillSpector. The Evaluate step, ClawScan, weighs all three alongside provenance, metadata, and moderation history, then produces a Skill Card along with a final verdict: Clean, Suspicious, or Malicious.

NVIDIA Skill Cards and SkillSpector collaboration #

Two pieces of that security process are new, and both come out of the NVIDIA collaboration.

NVIDIA Skill Cards are an open trust-artifact specification, and they now ship with every published skill. Each card tells you who published it, what it can do, what ClawScan found, and exactly where it came from. These are all verified by ClawHub, not taken from the publisher’s self-description. Read it in a tab on the skill detail page, or from the terminal with openclaw skills verify <slug> --card

.

NVIDIA SkillSpector is a new agent-skill scanner. It combines static checks with AI-assisted semantic analysis to flag risks that malware scanners miss: hidden instructions, risky code paths, overbroad capabilities, dependency issues, and mismatches between a skill’s declared purpose and its actual behavior. In ClawHub, SkillSpector findings are shown as advisories; they do not automatically block a skill. ClawScan weighs them alongside everything else before reaching a verdict.

Initial findings #

Our assumption was that the results from these three scanners would mostly overlap. Instead, they barely overlap at all.

Scanner pair Both positive Jaccard agreement
VirusTotal and SkillSpector 3,286 0.094
Static analysis and SkillSpector 3,511 0.104
Static analysis and VirusTotal 586 0.065

No pair agrees on more than 10.4% of its combined positives. Only 468 skills, or 0.69%, are flagged by all three scanners at once. 81.9% of positive findings come from a single scanner alone.

We do not believe these findings point toward an issue with any of the individual scanners. Rather, each scanner has a different risk surface. VirusTotal sees malware reputation, static analysis sees dangerous code patterns, and SkillSpector sees agentic risk. Across all 67,453 rows, SkillSpector is positive on 32,856 rows, or 48.71%, versus 5,225 rows, or 7.75%, for VirusTotal and 4,434 rows, or 6.57%, for static analysis. Within the 25,504 rows with a suspicious ClawScan verdict, SkillSpector is positive on 19,209 rows, or 75.3%. Among the 206 malicious rows, the pattern flips: VirusTotal is positive on 150 rows, or 72.8%, while SkillSpector is positive on 14 rows, or 6.8%.

These discrepancies highlight the need for an LLM-as-judge like ClawScan. Distinguishing between skills with a broad risk surface and those that are truly malicious is a novel challenge.

As one example, a skill we identified had 173 findings from SkillSpector, yet ClawScan still labeled it suspicious rather than malicious.

Rather than keep this corpus of scan outcomes to ourselves, we’re excited to open-source it for the broader security community to help us improve.

Open-sourcing our security scan signals dataset #

ClawHub’s commitment to skill security does not end at our own registry. The whole community gets safer when we share what we know.

As one of the most popular skill registries, we are now running the full ClawScan suite on thousands of published events every single day, and in the process burning millions of LLM tokens with OpenAI GPT-5.5 to do so. The v1 dataset covers 67,453 latest public skill versions. This in itself is generating a huge amount of signals that would be invaluable to the security research community, but until now has been locked away inside ClawHub.

Today, we’re releasing a public dataset of all ClawHub security scan outcomes on Hugging Face: OpenClaw/clawhub-security-signals. Special thanks to Jacob Tomlinson, Agustin Rivera, and Michael Appel from NVIDIA for their contributions to this project.

Our hope is that this empowers the broader research community to join us in improving the state of the art in skill security tooling. This work is core to our mission of securing the entire agent skill ecosystem, and supporting the wider AI ecosystem.

A rising tide lifts all claws. 🦞

── more in #ai-agents 4 stories · sorted by recency
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/openclaw-collaborate…] indexed:0 read:5min 2026-06-01 ·