"Triple agent" code review Claude skill

wpnews.pro

name	code-review-turbo
description	Run a triple-agent code review on the current branch's PR. Waits for Cursor Bugbot, runs a Claude sub-agent and Codex in parallel, then cross-references all findings to filter out hallucinations. Use when you want a thorough, multi-perspective code review before merging.
metadata


allowed-tools	Bash(gh:) Bash(codex:) Bash(cat:) Bash(tee:) Bash(sleep:) Agent Read Grep Glob Write(/tmp/)

Triple-agent code review: Cursor Bugbot + Claude sub-agent + Codex, with cross-referencing to separate real bugs from hallucinations.

Determine the PR number. If $ARGUMENTS

is provided, use that as the PR number. Otherwise, try to detect it from the current branch:

gh pr view --json number,isDraft -q '{number: .number, isDraft: .isDraft}'

Create a draft PR for the current branch and comment to trigger Bugbot:

gh pr create --draft --fill
gh pr comment <number> --body "@cursor review"

Tell the user you created a draft PR and triggered Bugbot.

Check whether a @cursor review

or @bugbot review

trigger comment already exists:

gh pr view <number> --json comments --jq '.comments[].body'

If no trigger comment is found, add one:

gh pr comment <number> --body "@cursor review"

Bugbot runs automatically on non-draft PRs, so no trigger comment is needed — unless the review is stale (see below).

Bugbot posts one comment per issue it finds, and resolves individual comments when the issue is fixed. So "Bugbot has reviewed" means there are Bugbot comments on the PR, and "stale" means commits were pushed after Bugbot's review pass.

To detect staleness:

Get the timestamps of ALL Bugbot comments (it posts multiple — one per issue):

gh pr view <number> --json comments --jq '[.comments[] | select(.author.login | test("bugbot|cursor"; "i")) | .createdAt]'

Also check review comments (inline on the diff):

gh api repos/{owner}/{repo}/pulls/<number>/comments --jq '[.[] | select(.user.login | test("bugbot|cursor"; "i")) | .created_at]'

Get the timestamp of the most recent commit on the PR:

gh pr view <number> --json commits --jq '.commits | last | .committedDate'

If the latest commit is

newer than ALL of Bugbot's comments (or if Bugbot has never commented), the review is stale. Post a new trigger comment:

gh pr comment <number> --body "@cursor review"

Tell the user: "Bugbot's review is stale (commits landed after its last review). Triggered a fresh review."

If Bugbot has comments that are

newer than the latest commit, the review is current. Proceed with the existing comments.

If you triggered a fresh review (due to staleness, draft PR, or new PR), poll for Bugbot comments to appear. Run individual commands — do NOT write a bash loop:

Run gh pr view <number> --json comments --jq '[.comments[] | select(.author.login | test("bugbot|cursor"; "i"))]'
If empty, run sleep 30
Repeat up to 30 times (15 minutes total)

If Bugbot never shows up, warn the user and ask whether to proceed anyway or keep waiting.

Bugbot posts inline review comments (one per issue) and resolves them when the issue is fixed. You must use the GraphQL API to check resolution status, because the REST API does not expose it.

First, get the repo owner and name:

gh repo view --json owner,name --jq '.owner.login + "/" + .name'

Then fetch all review threads with their resolution status and filter to Bugbot/Cursor comments:

gh api graphql -f query='
  query {
    repository(owner: "<OWNER>", name: "<REPO>") {
      pullRequest(number: <NUMBER>) {
        reviewThreads(first: 100) {
          nodes {
            isResolved
            comments(first: 10) {
              nodes {
                author { login }
                body
                path
                line
              }
            }
          }
        }
      }
    }
  }
'

From the result, keep only threads where:

isResolved

isfalse

, AND- At least one comment has an author.login

matchingbugbot

orcursor

(case-insensitive)

Ignore all resolved threads — these are issues Bugbot already confirmed as fixed.

Also check top-level PR comments (these are rare for Bugbot but possible):

gh pr view <number> --json comments --jq '[.comments[] | select(.author.login | test("bugbot|cursor"; "i")) | select(.isMinimized | not)]'

Save all active (non-resolved) Bugbot findings for later comparison.

Gather the PR context by running these commands:

gh pr diff <number>
gh pr view <number> --json title,body,baseRefName,headRefName

Then construct the following review prompt (referred to as REVIEW_PROMPT

below). This EXACT prompt must be used for BOTH the sub-agent and Codex — do not alter it between the two:

START OF REVIEW_PROMPT

You are reviewing a pull request. Here is the diff:

PR title: PR description: Base branch: Head branch:

Review this PR thoroughly. Focus on these categories IN ORDER OF IMPORTANCE:

Look for logic errors, off-by-one errors, null/undefined issues, race conditions, incorrect conditionals, missing edge cases, wrong variable usage, broken control flow, and any code that simply won't work as intended. This is BY FAR the most important category.

Overly complex solutions where simpler ones exist. Unnecessary abstractions, premature generalizations, or convoluted logic.

Duplicated logic that should be extracted. Copy-pasted code with minor variations.

New functionality or bug fixes lacking appropriate test coverage.

For SQL queries: DO NOT GUESS what the query planner will do. Instead, run EXPLAIN ANALYZE

on the actual local database to verify. - For migrations: Will they lock tables for too long? Are they safe for large tables?

For application code: N+1 queries, unnecessary allocations, missing batching, O(n^2) loops on large datasets.

For any TSX/JSX files: missing aria labels, improper heading hierarchy, missing alt text, keyboard navigation issues, color contrast concerns.

DO NOT report:

Code formatting or style issues (these are linted automatically)
Minor TypeScript type issues (also linted)
Nitpicks that don't affect correctness or maintainability

For each issue found, report:

File and line number(from the diff)** Severity**: critical / high / medium / low** Category**: which of the above categories** Description**: what the issue is and why it matters** Suggestion**: how to fix it

Return a structured list grouped by severity (critical first, then high, medium, low).

END OF REVIEW_PROMPT

Launch BOTH of these at the same time (in parallel):

Use the Agent tool to spawn a sub-agent with the full REVIEW_PROMPT

. This agent should have access to Bash, Read, Grep, and Glob tools so it can run EXPLAIN queries and inspect code.

Run the EXACT SAME REVIEW_PROMPT

through Codex. First write the prompt to a randomly-named temp file using the Write tool (e.g., /tmp/review-prompt-<random-8-chars>.txt

— generate a unique random suffix to avoid collisions with concurrent agents), then pipe it via stdin:

codex exec --full-auto - < /tmp/review-prompt-<random>.txt

The --full-auto

flag prevents Codex from prompting for approval on shell commands (e.g., EXPLAIN queries). The -

tells it to read the prompt from stdin.

CRITICAL: DO NOT do any of your own code research, file reading, or EXPLAIN queries until ALL THREE sub-agents (Bugbot, Claude sub-agent, Codex) have returned their results. If you investigate the code first, you will form your own opinions and become a 4th agent with a veto over the other 3 — biased toward confirming your own findings and dismissing theirs. The whole point of this step is to be an OBJECTIVE judge of three independent reviewers.

Collect and deduplicate all findings from the three agents into a single list. For each unique issue, note which agent(s) reported it. Do NOT yet judge whether the issues are real — just organize them.

Only after compiling the full list, go through each finding and verify it:

Read the actual source code around each reported issue (not just the diff)Run EXPLAIN ANALYZE on any flagged SQL queries against local databaseCheck test files to see if flagged "missing tests" actually existTrace the logic for any reported functional bugs — actually verify the bug is real

For each unique issue, determine:

Is it a real issue(confirmed by your investigation)? - Is it a hallucination(the code doesn't actually have this problem)? - Which agents found it and which missed it?

Be especially careful not to dismiss a finding just because only one agent reported it — sometimes the lone dissenter found the most critical bug.

Present the validated findings in this format:

(issues you confirmed are real and need fixing before merge)

(real issues that should be fixed)

(real but lower-risk issues)

(minor improvements, optional)

(issues reported by agents that turned out to be hallucinations or false positives — briefly explain why each was dismissed)

A table showing which agent found which real issue:

Issue	Bugbot	Claude	Codex	Verdict
...	...	...	...	...

End with a clear merge recommendation: ready to merge, merge after fixes, or needs significant rework.

source & further reading

gist.github.com — original article front-end-prompt.md Fix GPT-5.6 Luna in OpenCode - Plugin Build or refactor a SillyTavern Game Master / Storyteller / Dungeon Master character card for a tabletop RPG system.

"Triple agent" code review Claude skill

Run your AI side-project on zahid.host