AI Incident Response Agent A new open-source AI incident response agent, AgentAz, has been released under Apache-2.0, designed to autonomously handle low-risk remediation steps while escalating high-risk actions to human engineers. The agent operates within strict governance boundaries, including cost limits, tool scoping, and mandatory human approval for irreversible actions, aiming to reduce response times for DevOps and SRE teams. Overview Alert → correlate → hypothesize → mitigate → communicate: a full first-responder loop grounded in your real telemetry. Acts only within its authority: low-risk, reversible steps run automatically; rollbacks, scaling, and anything risky require human approval. Evidence-based: every hypothesis cites the metric, log, or deploy that supports it — no guessing at root cause. Fails to a human, fast: real SEV1s and ambiguous, high-blast-radius incidents are escalated and paged with a clean summary and a holding status update. AgentAz™ specification A lightweight, design-time governance spec for security review. It documents what this agent is authorized to do — and why — and pairs with whatever policy engine you already run. It does not enforce anything at runtime. Machine-readable contract agentaz.json , validated against the open AgentAz™ JSON Schema — bundled for offline use and published at a permanent URL: { "$schema": "./agentaz.schema.json", "agent id": "incident-response-agent", "version": "2.0.0", "trust level": "A4", "dna pattern": "Execution", "worst case action": "Runs a reversible, low-risk auto-remediation that was unnecessary; rolled back. Irreversible actions require human approval.", "authority boundary": "Auto-runs allowlisted reversible steps with rollback; risky/irreversible actions require human approval.", "last reviewed": "2026-06-24", "tags": "devops", "sre", "security", "sandboxed", "rollback", "human-approval", "agentaz", "agent-governance", "trust-level", "production-readiness" , "tool boundary": { "auto executable tools": "restart service", "clear cache", "rotate log", "health check" , "approval required tools": "rollback deploy", "scale service", "change config", "modify security group" , "execution tools absent": false, "rollback required": true }, "output boundary": { "format": "structured json", "never without approval": "rollback deploy", "scale service", "change config", "modify security group" }, "cost boundary": { "max usd per trace loop": 0.35, "alert threshold usd": 0.25 }, "loop boundary": { "max reasoning turns": 12 }, "human handoff": { "triggers": "sev1", "irreversible action", "low confidence" , "destination": "oncall engineer" }, "audit": { "append only": true, "logs": "actions", "rollbacks", "approvals", "escalations" } } New to this? Read the AgentAz specification guide /agentaz-specifications — Trust Levels, DNA patterns, and how it complements your runtime. This is a flagship reference blueprint for AgentAz v1.0.0. AgentAz™ is open source under Apache-2.0 https://www.apache.org/licenses/LICENSE-2.0 spec text under CC‑BY‑4.0 — schema and source on GitHub https://github.com/agent-kits/agentaz . Governance matrix A scannable summary of this blueprint's governance coverage, derived from its AgentAz™ specification. It documents the boundaries that already ship — not new functionality. | Agent goal | Bounded by the authority spec above | |---|---| | Trust Level | A4 — Limited Autonomy | | Tool access | Scoped tools; high-risk actions gated behind approval | | Context handling | Grounded in provided inputs; cites or flags rather than guessing | | Memory strategy | Task-scoped; no persistent cross-session memory | | Human approval | Required on sev1, irreversible action, low confidence → oncall engineer | | Audit trail | Append-only log actions, rollbacks, approvals, escalations | | Cost & loop bounds | ≤ $0.35 per loop · ≤ 12 reasoning turns | | Recovery / escalation | Escalates to oncall engineer | Agent component mapping A framework-neutral view of how this blueprint maps to standard agent-architecture components the vocabulary common to ADK-style frameworks . It describes structure for clarity — not an official integration or certified compatibility. | Agent | Primary reasoner — Limited Autonomy authority A4 | |---|---| | Tools | restart service, clear cache, rotate log, health check; approval-gated: rollback deploy, scale service, change config, modify security group | | Memory | Task-scoped working context; no persistent cross-session memory | | Guardrails | Worst-case classified A4 ; high-risk actions gated; ≤ $0.35/loop · ≤ 12 turns | | Evaluator | Confidence and authority-boundary checks; low-confidence or out-of-bounds results are flagged, not actioned | | Handoff | Escalates to oncall engineer on sev1, irreversible action, low confidence | Failure modes Specific ways this blueprint can fail, and how it is designed to detect, contain, and recover from each — the boundaries that make it safe to run, stated plainly. Misdiagnoses the incident and targets the wrong service with a remediation step. - Detection - Pre-action validation checks the action target against the alert's affected service; a mismatch raises an anomaly before anything runs. - Mitigation - Remediation actions are reversible and sandboxed; destructive steps are gated behind human approval and must match the diagnosed scope. - Recovery - Automatic rollback of the reversible step; the incident is escalated to on-call with the full diagnosis trail. Acts on a stale or duplicate alert for an incident that is already resolved. - Detection - Incident status and a dedup key are checked before any action. - Mitigation - An idempotency key per incident makes repeated triggers a no-op. - Recovery - The duplicate is closed and the dedup event is logged. Remediation loops — repeated restarts that never converge. - Detection - A loop counter and max-reasoning-turn cap detect repeated identical actions. - Mitigation - Bounded retries with backoff; escalate after N attempts. - Recovery - Automation halts and pages a human with the full attempt history. A cascading action makes the incident worse. - Detection - A health check runs after each step; if health degrades, the chain aborts. - Mitigation - One reversible action at a time with a health gate between steps. - Recovery - The last action is rolled back, automation is frozen, and control passes to a human. Evaluation Action correctness matters most here — whether the remediation it ran or proposed was the right one for the diagnosed incident — because a wrong action has real blast radius. | Diagnosis accuracy | Share of incidents where the identified root cause and affected service match the ground truth. | |---|---| | Action correctness | Of actions taken or proposed, the share that were appropriate and scoped to the actual incident. | | Rollback success | When a reversible action is undone, how reliably the system returns to its prior state. | | Escalation rate | How often it hands off to on-call, split into correct escalations vs missed or unnecessary ones. | | Latency to first action | Time from alert to the first correct remediation step. | Recommended approach. Replay a labeled set of historical incidents in a sandbox and compare proposed actions to what on-call actually did; track rollback success and escalation quality separately. Never grade on live production traffic. When to use Use it when - You run on-call and want faster triage on the flood of alerts, especially the repetitive, well-understood ones. - You have metrics, logs, and deploy history the agent can correlate to form grounded hypotheses. - You have runbooks with clearly safe, reversible steps that can be automated under guardrails. - You want consistent, timely status updates drafted during an incident. - You want the agent to handle first response and escalate the genuinely serious incidents to humans with context. Avoid it when - You have no observability data for the agent to ground hypotheses in — it would be guessing. - You expect it to resolve novel SEV1s autonomously; those need experienced humans and the agent should escalate. - Your mitigations are all high-risk/irreversible with no safe automation surface. - You are unwilling to put approval gates in front of production-changing actions. System prompt You are an Autonomous Incident Response Agent acting as a first responder for an on-call SRE team. Your job is to triage one alert/incident: understand it, mitigate what is safe, communicate clearly, and escalate fast when it is serious. You are judged on reducing time-to-mitigate AND on never taking an unsafe action and never hiding a real incident. == CORE PRINCIPLES == 1. Evidence first. Form a hypothesis only from telemetry you have actually queried — metrics, logs, traces, recent deploys/changes. Cite the specific signal. Never assert a cause you cannot show. 2. Safety over speed. A fast wrong action is worse than a clean escalation. When in doubt, stabilize, communicate, and hand to a human. 3. Smallest safe action. Prefer the least invasive, most reversible mitigation that addresses the evidence. == HARD RULES NON-NEGOTIABLE == - ACTION TIERS: You may AUTONOMOUSLY take only low-risk, reversible, explicitly allow-listed actions e.g. restart a stateless pod, clear a cache, scale up within a cap, silence a known-false alert . Any rollback, deploy, scale-down, data operation, traffic shift, or config change to production REQUIRES human approval — propose it, do not execute it. - NEVER hide severity. Do not downgrade or silence an alert that could be a real incident to make the board look clean. Suppress only alerts you can show are non-actionable, and say why. - BLAST RADIUS: Estimate the blast radius before any action. If an action could affect a broad scope or a critical/customer-facing service, it is not autonomous — escalate or seek approval. - DON'T BREAK MORE: Do not take actions that could worsen the incident e.g. mass restarts during a thundering-herd . If unsure of an action's effect, don't take it. - COMMUNICATE: Keep humans informed with concise, honest status updates. Never promise a resolution time or root cause you cannot support. == SEVERITY & DECISION == - Assess severity SEV1 critical/customer-facing outage or data risk; SEV2 major degradation; SEV3 minor/limited; SEV4 noise . - AUTO MITIGATE: SEV3/known-pattern with an allow-listed, reversible fix and confidence = 0.8. Execute, verify, communicate. - PROPOSE: a non-allow-listed but evidence-backed mitigation e.g. rollback the suspect deploy . Stage it for one-click human approval with the supporting evidence. - ESCALATE + PAGE: SEV1/SEV2, broad blast radius, data-loss/security signals, conflicting or missing evidence, or confidence < 0.6. Page on-call, post a holding update, and hand over a structured summary. == COST CONTROL == Query the smallest set of signals that tests your hypothesis; do not pull every dashboard. Stop investigating once you can decide. Cap tool calls; if exceeded, escalate with current evidence. Keep updates short. == OUTPUT FORMAT return ONE JSON object == { "severity": "SEV1|SEV2|SEV3|SEV4", "confidence": <0.0-1.0 , "hypothesis": "