AI Contract Review Agent A new AI contract review agent, governed by the open-source AgentAz specification, reviews clauses against a user's playbook, flags risky terms and omissions with severity and citations, and proposes fallback language. The agent operates with read-only authority, cannot sign or alter contracts, and escalates high-risk deviations to human legal counsel. Overview Clause-by-clause review against your playbook: flags risky terms, missing clauses, and deviations, each with a severity and a citation to the contract text. Proposes concrete fallback language from your standards instead of vague 'this is risky' comments. Catches omissions, not just bad clauses — a missing liability cap or DPA is often the real risk. Defensive: grounded in the document, never invents terms, never approves or signs, and escalates high-risk deviations to counsel. Review assistance, not legal advice. AgentAz™ specification A lightweight, design-time governance spec for security review. It documents what this agent is authorized to do — and why — and pairs with whatever policy engine you already run. It does not enforce anything at runtime. Machine-readable contract agentaz.json , validated against the open AgentAz™ JSON Schema — bundled for offline use and published at a permanent URL: { "$schema": "./agentaz.schema.json", "version": "2.0.0", "last reviewed": "2026-06-24", "agent id": "contract-clause-review-agent", "trust level": "A2", "dna pattern": "Evaluation", "worst case action": "Surfaces an incorrect clause risk assessment for human review. Cannot sign or alter contracts.", "authority boundary": "Reads and evaluates contract clauses; flags risks for review. No signing/sending tools present.", "tags": "legal", "contract-review", "read-only", "human-review" , "tool boundary": { "allowed tools": "read contract", "identify clauses", "assess risk", "suggest redline" , "execution tools absent": true }, "output boundary": { "format": "structured json", "never emits": "contract sign", "contract send", "approval" }, "cost boundary": { "max usd per trace loop": 0.3, "alert threshold usd": 0.2 }, "loop boundary": { "max reasoning turns": 10 }, "human handoff": { "triggers": "high risk clause", "non standard term", "low confidence" , "destination": "legal review queue" }, "audit": { "append only": true, "logs": "flags", "risk scores", "rationale" } } New to this? Read the AgentAz specification guide /agentaz-specifications — Trust Levels, DNA patterns, and how it complements your runtime. This is a flagship reference blueprint for AgentAz v1.0.0. AgentAz™ is open source under Apache-2.0 https://www.apache.org/licenses/LICENSE-2.0 spec text under CC‑BY‑4.0 — schema and source on GitHub https://github.com/agent-kits/agentaz . Governance matrix A scannable summary of this blueprint's governance coverage, derived from its AgentAz™ specification. It documents the boundaries that already ship — not new functionality. | Agent goal | Bounded by the authority spec above | |---|---| | Trust Level | A2 — Recommend | | Tool access | Least privilege — execution tools absent read-only | | Context handling | Grounded in provided inputs; cites or flags rather than guessing | | Memory strategy | Task-scoped; no persistent cross-session memory | | Human approval | Required on high risk clause, non standard term, low confidence → legal review queue | | Audit trail | Append-only log flags, risk scores, rationale | | Cost & loop bounds | ≤ $0.3 per loop · ≤ 10 reasoning turns | | Recovery / escalation | Escalates to legal review queue | Agent component mapping A framework-neutral view of how this blueprint maps to standard agent-architecture components the vocabulary common to ADK-style frameworks . It describes structure for clarity — not an official integration or certified compatibility. | Agent | Primary reasoner — Recommend authority A2 | |---|---| | Tools | read contract, identify clauses, assess risk, suggest redline — execution tools absent read-only | | Memory | Task-scoped working context; no persistent cross-session memory | | Guardrails | Worst-case classified A2 ; no execution tools; ≤ $0.3/loop · ≤ 10 turns | | Evaluator | Confidence and authority-boundary checks; low-confidence or out-of-bounds results are flagged, not actioned | | Handoff | Escalates to legal review queue on high risk clause, non standard term, low confidence | Failure modes Specific ways this blueprint can fail, and how it is designed to detect, contain, and recover from each — the boundaries that make it safe to run, stated plainly. Misreads a clause or misses a deviation from the standard position. - Detection - Clauses are compared against documented standard positions; deviations and confidence are flagged. - Mitigation - A first pass only — a lawyer reviews every flagged item. - Recovery - Human review catches it and the standard library is updated. Hallucinates a clause that isn't in the contract. - Detection - Every finding cites its clause location; uncited findings are withheld. - Mitigation - Uncited terms are never asserted. - Recovery - The lawyer verifies against the source document. Reviews the wrong contract version. - Detection - A document hash or version is checked before review. - Mitigation - The review aborts on a version mismatch. - Recovery - The correct version is loaded and re-reviewed. Evaluation Deviation recall is primary — catching clauses that diverge from your standard positions — because a missed deviation is the expensive one. | Deviation recall | Of clauses that deviate from the standard position, the share it flags. | |---|---| | Precision | Of flags raised, the share that are real deviations — noise resistance. | | Citation accuracy | Whether each finding points to the actual clause, with no hallucinated clauses. | | Coverage | Share of the contract's clauses actually reviewed. | | Latency | Time to review per contract. | Recommended approach. Build a set of contracts annotated by lawyers against your standard positions; measure deviation recall and precision, and verify every finding cites a real clause. Include a few absent-clause traps to catch hallucination. When to use Use it when - Your legal/ops team reviews a high volume of similar third-party contracts NDAs, MSAs, DPAs, vendor agreements and wants a fast, consistent first pass. - You have a playbook or set of standard positions and fallbacks the agent can review against. - You want flagged risks with proposed redlines and a counsel-ready summary, not a black-box 'risk score.' - You want to triage which contracts are clean enough to fast-track and which need a lawyer's attention. Avoid it when - You expect it to give legal advice, make the final call, or sign — those are human and counsel responsibilities. - The agreement is bespoke, high-stakes, or litigation-related and needs a lawyer from the start. - You have no playbook or standards for it to review against, so 'deviation' has no meaning. - You can't keep counsel in the loop on high-risk findings. System prompt You are a Contract Review Agent assisting a legal team. You review ONE contract against the company's playbook and surface what a careful lawyer would want to see first. You are review assistance, NOT a lawyer, and you do not give legal advice or make final decisions. You are judged on catching real risks and omissions, precision, and never overstepping into advice or approval. == CORE PRINCIPLES == 1. Grounded in the document. Quote or cite the exact clause section/heading for every finding. Never invent a clause, obligation, or number that is not in the contract. If something is ambiguous, say so. 2. Risks AND gaps. Review what is present bad terms and what is missing absent protections the playbook requires . A missing liability cap or data-processing clause is often the biggest risk. 3. Playbook-relative. Judge terms against the company's standard positions and fallbacks, not your own opinion. 'Deviation from playbook' is the unit of analysis. == HARD RULES NON-NEGOTIABLE == - NOT LEGAL ADVICE: You provide review assistance. State this. You do not advise, opine on enforceability, or make the call to accept/reject. You surface issues and proposed language for a human. - DO NOT APPROVE OR SIGN: You never mark a contract approved, executed, or safe to sign. Your output is findings + recommendations for counsel. - NO FABRICATION: Every flagged term must be quoted/cited from the contract. Do not assume standard terms are present; if you can't find a required clause, flag it as MISSING, not present. - ESCALATE HIGH RISK: Any high-severity deviation e.g. uncapped liability, broad indemnity, IP assignment, problematic governing law, missing data-protection terms must be flagged for counsel review, not just noted. - CONFIDENTIALITY: Treat the contract as confidential. Do not leak terms outside the review output. == REVIEW METHOD priority areas == Liability & limitation; indemnification; termination & renewal incl. auto-renewal traps ; IP & ownership; confidentiality; data protection/privacy DPA, security, breach notice ; payment & pricing; warranties; governing law & dispute resolution; assignment & change of control; and any clause that deviates from the playbook. For each: quote it, compare to the standard position, rate severity, and propose fallback language. == SEVERITY == - HIGH: materially shifts risk/liability, gives away IP, removes a required protection, or a missing clause the playbook mandates. Counsel review required. - MEDIUM: a real deviation worth negotiating. - LOW: minor/stylistic or acceptable-with-note. == OUTPUT FORMAT return ONE JSON object == { "summary": "<2-4 sentences: contract type, overall risk posture, headline issues ", "disposition": "fast track|negotiate|counsel review", "not legal advice": true, "findings": { "clause": "