RepoCloudflare (Workers AI)Cloudflare (Workers AI)published Jun 18, 2026seen 1w

cloudflare/security-audit-skill

JavaScript

Open original ↗

Captured source

source ↗
published Jun 18, 2026seen 1wcaptured 1whttp 200method plain

cloudflare/security-audit-skill

Description: A coding-agent skill for multi-phase security audits with independently verified, machine-readable findings

Language: JavaScript

License: MIT

Stars: 11

Forks: 0

Open issues: 0

Created: 2026-06-18T14:08:44Z

Pushed: 2026-06-18T16:21:39Z

Default branch: main

Fork: no

Archived: no

README:

security-audit

A coding-agent skill that turns your agent into a security auditor. It orchestrates multiple parallel agents through a six-phase pipeline -- recon, hunting, validation, reporting, structured output, and independent verification -- to find exploitable vulnerabilities with real impact.

This is the skill that seeded Cloudflare's vulnerability discovery harness, described in Build your own vulnerability harness. The harness grew into a multi-stage, fleet-wide system; this skill is the single-repo starting point it evolved from.

What it does

The skill runs a structured audit in six phases:

1. Recon -- parallel research agents map the application's architecture, trust boundaries, and input surfaces. Produces architecture.md. 2. Hunt -- parallel general agents attack the codebase from different angles (injection, access control, business logic, cryptography, feature abuse, chained attacks, and a wildcard). Each agent can spawn sub-agents to dig deeper. 3. Validate -- separate agents try to *disprove* each finding. Adversarial review kills false positives. 4. Report -- produces REPORT.md (human-readable) and FINDINGS-DETAIL.md (detailed traces for MEDIUM+ findings). 5. Structured output -- writes findings.json conforming to report-schema.json, validated by validate-findings.cjs. 6. Independent verification -- fresh agents verify every factual claim in the structured output against the actual source code.

Multiple runs against the same repo are additive. Each run explores different code paths; the skill reads prior findings.json files to skip known issues and target gaps.

Files

| File | Purpose | |------|---------| | SKILL.md | Setup, core principles, platform terminology, workflow overview, and audit anti-patterns | | RECONNAISSANCE.md | Phase 1 reconnaissance prompts and synthesis instructions | | HUNTING.md | Phase 2 orchestration, hunting methodology, and validation rules | | ATTACK-CLASSES.md | Core, wildcard, and obvious-things attack prompts | | VALIDATION-AND-REPORTING.md | Phases 3–6 validation, reporting, and verification | | report-schema.json | JSON schema for findings.json (confirmed and rejected finding structures) | | validate-findings.cjs | Zero-dependency Node.js validator that checks findings.json against the schema |

Installation

Install the skill with the Skills CLI:

npx skills add https://github.com/cloudflare/security-audit-skill \
--skill security-audit

Use --global for a user-level installation:

npx skills add https://github.com/cloudflare/security-audit-skill \
--skill security-audit \
--global

Run npx skills --help for agent-selection and non-interactive options.

Usage

Start your coding agent in (or pointed at) the codebase you want to audit, then ask it to do a security audit:

security audit this codebase
find security vulnerabilities in ./src
do a security review, output to ~/audits/my-project

The skill activates automatically when the request matches its trigger (security audit, find vulnerabilities, pen-test the code, etc.). It will ask for an output directory if you don't specify one, defaulting to ~/security-audit-skill//run-.

Requirements

  • A coding agent with a model that supports tool use and parallel sub-agents
  • Node.js (for validate-findings.cjs schema validation in Phase 5)

Design principles

  • Only report what you can exploit. Every finding needs a concrete attack scenario, not "an attacker could theoretically..."
  • Adversarial validation. The agent that checks a finding is never the agent that found it.
  • Severity requires impact. Likelihood x impact, not deviation from a checklist.
  • Defense-in-depth gaps are not vulnerabilities. If Layer A prevents the attack, the absence of Layer B is a hardening note.
  • Multiple runs improve coverage. Testing shows a single run finds roughly half the total vulnerabilities across multiple runs.

Contact

Questions, feedback, or comparing notes on AI-driven security tooling: security-ai-research@cloudflare.com

License

MIT -- see [LICENSE](LICENSE).

Notability

notability 3.0/10

Low traction routine repo from Cloudflare.