cd /news/ai-safety/i-trusted-a-random-ai-plugin-until-c… · home topics ai-safety article
[ARTICLE · art-30590] src=dev.to ↗ pub= topic=ai-safety verified=true sentiment=↓ negative

I Trusted a Random AI Plugin… Until Cisco Showed It Was Stealing Data Behind My Back - 07 of 21

Cisco's AI security research team discovered a third-party plugin in the OpenClaw agent marketplace that silently exfiltrated sensitive data. The finding prompted the Chinese government to restrict OpenClaw on state computers, highlighting the risks of autonomous AI agents with broad permissions. Industry data shows 45% of AI-generated code deployments led to problems, and 48% of organizations reported increased security vulnerabilities from AI-assisted coding.

read2 min views2 publishedJun 17, 2026

In the first week of 2026, Cisco's AI security research team published a finding.

A third-party skill, a plugin, available in the OpenClaw agent marketplace had been performing data exfiltration. Silently. The skill appeared functional. It did what its documentation claimed. It also, without any indication, extracted sensitive data from the local environment and transmitted it to an external endpoint the user had never authorised.

One of OpenClaw's core maintainers responded on Discord: "If you can't understand how to run a command line, this is far too dangerous of a project for you to use safely."

Three months later, the Chinese government restricted state agencies from running OpenClaw on office computers, citing unauthorised data deletion, data leaks, and excessive resource usage.

This is the risk conversation of 2026. Not "will AI write a bug." That's manageable. That's code review.

The new risk: agents that act autonomously, with file-write permissions, git commit access, and CI/CD pipeline control, making decisions at 2am when no human is watching.

45% of all deployments linked to AI-generated code led to problems (Harness, 2025).

48% of organisations reported increased security vulnerability concerns from AI-assisted coding.

These numbers are from environments where human engineers were still reviewing AI output.

The governance principles that are non-negotiable before you grant an agent broad permissions:

Default to cautious autonomy. Start at the most restrictive setting that still makes the agent useful. Expand permissions based on observed behaviour over time.

Mandate audit logging. Every action an agent takes must be logged. Not for compliance theatre. Because the only forensic tool when something goes wrong is the record of what happened.

Design for reversibility. Git commits at meaningful checkpoints. All agent changes must be undoable. An agent operating in a system where its changes can't be reversed is an agent with uncapped blast radius.

Treat third-party skills like npm packages. No, treat them with more scrutiny. A malicious npm package compromises your application. A malicious agent skill compromises your agent, which has system permissions.

Accountability cannot be delegated. The agent opened the PR. You own the PR. The agent committed the code. You own the code. Always.

Tomorrow: the tool that changed how I think about code generation entirely. And the landscape of every AI coding agent you should know in 2026.

── more in #ai-safety 4 stories · sorted by recency
── more on @cisco 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/i-trusted-a-random-a…] indexed:0 read:2min 2026-06-17 ·