v1.0 · Claude Code Plugin · 33 Skills

The quality layer
your AI is missing

Name: Zuvo
Author: Zuvo

33 auto-routing skills. Multi-agent pipelines. Code quality gates. Test quality gates. Stack-aware rules. Two commands to install.

terminal

$ claude plugin marketplace add greglas75/zuvo-marketplace

$ claude plugin install zuvo

33 Skills

22 CQ Gates

17 Test Gates

5 Stack Rules

Pipeline

Every feature goes through
a 10-agent pipeline

Not prompts. Not templates. Agents that read your code, verify their claims, and refuse to proceed without evidence.

zuvo:brainstorm

Three agents explore your codebase, research best practices, and analyze the problem before you write a single line.

Agents: Code Explorer, Domain Researcher, Business Analyst

Tools: CodeSift, WebSearch, context7

Output: Approved spec document

zuvo:plan

Four agents analyze architecture, select patterns, review testability, and decompose into bite-sized TDD tasks.

Agents: Architect, Tech Lead, QA Engineer, Team Lead

Tools: impact_analysis, detect_communities, trace_call_chain

Output: TDD task list with exact code

zuvo:execute

Each task: RED test, GREEN code, spec review, quality review. Two independent reviewers verify every change.

Agents: Implementer, Spec Reviewer, Quality Reviewer

Tools: CQ1-CQ22, Q1-Q17, index_file

Output: Verified, reviewed code

⚠ No code without a failing test: RED → GREEN → REFACTOR, enforced at every step.

⚠ No completion claims without evidence: agents must run verification commands and show output.

⚠ No silent discards: every finding is tracked. Confidence 0-25: discard. 26-50: backlog. 51+: report.
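The confidence thresholds above can be read as a simple routing rule. The following is a minimal sketch, not Zuvo's actual implementation: the `Finding` class, the function names, and the 0-100 integer scale are assumptions made for illustration (the page states only the three bands).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Finding:
    # Hypothetical shape of a finding; not taken from Zuvo's source.
    summary: str
    confidence: int  # assumed to be an integer on a 0-100 scale


def triage(confidence: int) -> str:
    """Map a confidence score to the band named on this page."""
    if not 0 <= confidence <= 100:
        raise ValueError(f"confidence out of range: {confidence}")
    if confidence <= 25:
        return "discard"
    if confidence <= 50:
        return "backlog"
    return "report"


def route(findings: list[Finding]) -> dict[str, list[Finding]]:
    """Place every finding in exactly one bucket, so nothing is dropped silently."""
    buckets: dict[str, list[Finding]] = {"discard": [], "backlog": [], "report": []}
    for finding in findings:
        buckets[triage(finding.confidence)].append(finding)
    return buckets
```

Even discarded findings stay in a bucket here, which is one way to read "every finding tracked".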

Skills

33 skills. Zero manual routing.

Say what you need. Zuvo's meta-skill router matches your intent to the right skill automatically. Each skill below lists what it does, when to use it, and its flags and modes.

Pipeline (5)

zuvo:brainstorm Multi-agent problem exploration

Three agents (Code Explorer, Domain Researcher, Business Analyst) run in parallel to understand your codebase, research best practices, and map the problem space. Produces an approved design specification before any code is written.

When to use: New feature touching 5+ files, unclear scope, design decisions needed

Flags & modes: 3 parallel agents · spec document output · design dialogue with user

zuvo:plan Architecture analysis + TDD task list

Four agents (Architect, Tech Lead, QA Engineer, Team Lead) decompose an approved spec into ordered TDD tasks with exact code targets and verification commands. Each task follows RED-GREEN-REFACTOR.

When to use: After brainstorm produces a spec document

Flags & modes: requires spec · 4 sequential agents · task-level RED/GREEN/REFACTOR

zuvo:execute Implement with dual-review gates

Implements plan tasks sequentially. Each task: Implementer writes failing test then code, Spec Reviewer checks alignment with spec, Quality Reviewer scores CQ1-CQ22 and Q1-Q17 with evidence. Critical gate = 0 sends the task back for correction.

When to use: After plan produces a task list

Flags & modes: requires plan · 3 agents per task · backlog persistence

zuvo:worktree Git isolation + structured finish

Creates an isolated git worktree for branch-safe development. CREATE mode sets up worktree with smart directory selection. FINISH mode offers merge, PR, or cleanup options with safety verification.

When to use: Before executing a plan, or when you need branch isolation

Flags & modes: CREATE / FINISH modes · safety checks · auto-cleanup

zuvo:receive-review Technical review response protocol

Six-step protocol for processing code review feedback: understand the comment, verify against current code, decide fix-or-pushback, implement with evidence. Prevents blind agreement with reviewer suggestions.

When to use: When you receive PR review comments or feedback

Flags & modes: 6-step protocol · verify before agreeing · pushback when warranted

Core Dev (4)

zuvo:build Scoped feature dev (1-5 files)

Runs blast radius and duplication analysis in parallel, then TDD implementation with CQ/Q quality gates. Designed for small features with clear scope that don't need the full pipeline.

When to use: Small feature with clear scope, adding a utility, a new endpoint

Flags & modes: --auto (skip plan approval) · --auto-commit · parallel analysis agents

zuvo:review Audit + confidence scoring + auto-fix

Structured code review with parallel audit agents. Examines uncommitted changes, staged diffs, commit ranges, or specific paths. Produces a tiered report (MUST-FIX / RECOMMENDED / NIT) with confidence scores, then optionally applies fixes with verification.

When to use: After coding, before push. After any non-trivial implementation.

Flags & modes: staged / HEAD~N / [path] scope · fix mode (auto-apply) · batch (multiple commits) · blocking (CI gate)

zuvo:refactor ETAP workflow with CONTRACT

Four-phase workflow: Evaluate (analyze scope), Test (verify baseline), Act (make changes), Prove (verify nothing broke). Resumable CONTRACT tracks state across sessions. Batch mode processes queued files.

When to use: Extracting, splitting, moving, renaming, simplifying code

Flags & modes: full / auto / quick / standard · plan-only (preview) · continue (resume CONTRACT) · batch <file> (queue)

zuvo:debug 5-phase root cause investigation

Systematic bug investigation: reproduce, narrow, diagnose, fix, verify. Produces a structured report with root cause analysis and regression test. Optional --regression flag triggers git bisect to find the breaking commit.

When to use: Any bug, error, or unexpected behavior

Flags & modes: --regression (git bisect) · structured report output · regression test generation

Testing (5)

zuvo:write-tests Coverage scanner + pattern selector + Q1-Q17

Scans coverage gaps, classifies production code into 11 categories (VALIDATOR, SERVICE, CONTROLLER, HOOK, PURE, COMPONENT, GUARD, API-CALL, ORCHESTRATOR, STATE-MACHINE, ORM/DB), selects test patterns per type, writes tests with Q1-Q17 quality gates enforced.

When to use: Existing production code lacking tests, post-refactor test rewrite

Flags & modes: [path] (specific target) · auto (discover and loop) · --dry-run (preview plan)

zuvo:write-e2e Playwright from route discovery

Discovers routes in your app, scores user flows by criticality (auth, CRUD, payment), generates .spec.ts files with page objects and quality gates. Code-first with optional live browser validation.

When to use: Web apps needing browser-level test coverage

Flags & modes: [path] (scoped) · --live (browser-assisted) · --auto (no interaction) · --flows (discover only) · --max-flows N

zuvo:fix-tests Batch repair per anti-pattern

Detects systematic anti-patterns across your test suite, then fixes one pattern at a time with full production context. Avoids scattered one-off fixes by targeting patterns holistically.

When to use: Same anti-pattern across many test files (e.g., phantom mocks, missing assertions)

Flags & modes: --triage (scan and report) · --pattern [ID] [path] · --dry-run · --bundle-gates (fix adjacent gaps)

zuvo:test-audit Q1-Q17 + AP/P anti-pattern detection

Batch audit of test files against Q1-Q17 quality gates and AP1-AP26 anti-patterns. Detects orphan tests, phantom mocks, untested public methods. Tiered output (A/B/C/D) with critical gate enforcement.

When to use: After mass test writing, when test quality is uncertain, periodic health check

Flags & modes: all / [path] / [file] · --deep / --quick · --include-e2e · --details (verbose)

zuvo:tests-performance TP1-TP17 runner optimization

Measures baseline timing, audits runner configuration against TP1-TP17 checklist, identifies the slowest tests, and produces an impact-ranked action plan. Verify mode compares against saved baseline.

When to use: When the test suite feels slow, after adding many tests, before CI optimization

Flags & modes: full (default) · baseline (measure only) · verify (compare) · --no-run (config audit) · --path <dir>

Security (2)

zuvo:security-audit S1-S14 OWASP, Sentry confidence model

Application security audit covering OWASP Top 10, injection, XSS, SSRF, auth/authz, multi-tenant isolation, secrets, headers, dependencies, business logic, and infrastructure. Uses Sentry 3-tier confidence model for finding severity. Dual scoring: static posture + runtime exploitability.

When to use: Before releases, after auth/payment changes, quarterly health check

Flags & modes: [path] / full · --live-url <url> (runtime check) · --static (source only) · --quick · --persist-backlog

zuvo:pentest White-box + black-box testing (PT1-PT7)

Hybrid penetration testing across 7 dimensions. Source-to-sink tracing for injection paths, exploit verification against running targets, CMS-specific checks, runtime configuration analysis. Uses Shannon methodology with Sentry confidence filtering.

When to use: After security-audit flags issues, before releases, CMS testing

Flags & modes: [path] / --url <url> · --from-audit <dir> · --cms <type> · --quick · --verify-live · --dimensions PT1,PT2

Audit (9)

zuvo:code-audit CQ1-CQ22 batch evaluation

Batch audit of production files against CQ1-CQ22 quality gates and CAP1-CAP14 anti-patterns. Tiered output (A/B/C/D grades), critical gate enforcement, evidence-backed scoring, cross-file pattern analysis, and a prioritized execution plan.

When to use: Periodic health check, before releases, after adding 10+ production files

Flags & modes: all / [path] / [file] · --deep / --quick · --services / --controllers

zuvo:api-audit 10 dimensions, cross-cutting analysis

API endpoint integrity across 10 dimensions (D1-D10): input validation, payload design, pagination, error handling, caching, HTTP semantics, API waterfalls, rate limiting, auth patterns, and documentation. Optional GET probing on non-production targets.

When to use: Before releases, after adding new endpoints, before API versioning

Flags & modes: full / [path] · --static (no probing)

zuvo:db-audit 60+ checks, 12 dimensions (DB1-DB12)

Database performance and safety audit: query patterns, indexes, schema design, connection management, transactions, migrations, caching strategy, query optimization, ORM anti-patterns, observability, data lifecycle, and DB security. Code-level checks for all ORMs with optional live analysis.

When to use: After schema changes, before optimizing queries, new database setup

Flags & modes: full / [path] / [file] · --schema / --queries / --connections · --live <conn> (PostgreSQL/MySQL)

zuvo:dependency-audit Supply chain + coupling metrics

Dependency health and internal coupling audit across 10 dimensions: supply chain vulnerabilities, version freshness, dead dependencies, license compliance, bundle weight, circular dependencies, coupling metrics, architecture boundary violations, barrel file health, and change coupling.

When to use: Before releases, when adding major dependencies, quarterly

Flags & modes: full / [path] · --supply-chain · --coupling · --dead · --bundle · --lock-in

zuvo:performance-audit 12 dimensions, Impact Models

Full-stack performance health check: rendering, bundles, assets, API/network, algorithms, memory, database, caching, Web Vitals, backend runtime, concurrency, and framework-specific pathologies. Evidence-based Impact Models with confidence tiers and a prioritized optimization roadmap.

When to use: Before major releases, after heavy features, when users report slowness

Flags & modes: full / [path] / [file] · --frontend / --backend · --db / --bundle

zuvo:structure-audit SA1-SA13 organization audit

Codebase organization across 13 dimensions: directory consistency, naming conventions, folder depth, colocation, barrel exports, separation of concerns, file size distribution, dead code, complexity distribution, duplication, root organization, documentation, and git churn hotspots.

When to use: When the codebase feels messy, before major restructuring, when onboarding a new team

Flags & modes: full / [path] · --naming / --size / --dead-code · --duplication / --hotspots · --quick / --fix

zuvo:ci-audit Pipeline speed + security (CI1-CI10)

CI/CD pipeline optimization across 10 dimensions: caching strategy, parallelism, conditional execution, artifact management, secret handling, action pinning, timeouts, Docker layer optimization, test integration, and pipeline speed benchmarks. Primary support: GitHub Actions.

When to use: After changing CI workflows, when pipelines are slow or fragile

Flags & modes: full / [path] · --speed-only · --security-only

zuvo:env-audit Config validation + secret exposure (ENV1-ENV8)

Environment config across 8 dimensions: variable completeness, unused vars, startup validation, secret exposure, environment parity (dev/staging/prod), type safety, default values, and documentation. Supports .env, process.env, import.meta.env, and framework-specific patterns.

When to use: After adding env vars, before deploy, config-related debugging

Flags & modes: full / [path] · --secrets-only · --parity

zuvo:seo-audit 13 dimensions + GEO readiness

SEO/GEO site audit covering 200+ checks: meta tags, structured data, AI crawler readiness, content quality, GEO (Generative Engine Optimization), performance, mobile, images, canonical URLs, sitemaps, i18n. Framework-aware: Astro, Next.js, Hugo, WordPress, React.

When to use: Before launches, when SEO ranking drops, content quality review

Flags & modes: full / [path] · --live-url <url> (Core Web Vitals) · --quick · --content-only · --geo · --persist-backlog

Design (4)

zuvo:design Intent-first UI creation

Design with conscious, traceable decisions. Persists design system in .interface-design/ (system.md + system.json) for cross-session consistency. Domain exploration, component construction with mandatory checkpoints, and craft validation tests.

When to use: Creating new UI, building a design system, improving existing components

Flags & modes: init (new project) · [component] · improve [path] · extract [path] · status · --quick / --dry-run

zuvo:design-review DX1-DX20 consistency audit

UI/UX consistency audit with DX1-DX20 checklist covering states, consistency, accessibility, responsive behavior, and interaction patterns. Optional visual audit via chrome-devtools screenshots and automated WCAG accessibility via axe-core. DAP1-DAP12 anti-pattern detection.

When to use: After adding UI views, when UI feels inconsistent, before design system adoption

Flags & modes: [path] · visual (screenshot analysis) · --fix-critical · --dry-run · --max-files · --quick · loop

zuvo:ui-design-team 4-agent design review team

Multi-agent UI review with 4 specialist perspectives: UX Researcher (flows, friction), Visual Designer (hierarchy, spacing, typography), i18n/Multilingual QA (text overflow, RTL, locale), and Accessibility/Performance Auditor (WCAG, contrast, loading). Lead Designer synthesizes into prioritized fixes with exact code.

When to use: Comprehensive UI review from multiple expert perspectives

Flags & modes: [file/path] · --screenshot · --mobile · --fix

zuvo:architecture Review, ADR, system design

Three modes: review existing codebase architecture (A1-A9 dimensions), create Architecture Decision Records, or design new systems from requirements. Uses CodeSift for module discovery, dependency mapping, structural metrics, and temporal coupling detection.

When to use: Architecture health check, documenting decisions, designing new systems

Flags & modes: --mode review [path] · --mode adr · --mode design

Utility (3)

zuvo:docs README, API docs, runbooks

Write and update technical documentation from actual codebase analysis. Generates README, API reference, runbook, onboarding guide, or changelog. Update mode patches stale sections without rewriting from scratch.

When to use: After building features, when docs are outdated, onboarding new team members

Flags & modes: readme [path] · api [path] · runbook [topic] · onboarding · update [file] · changelog [range]

zuvo:presentation PPTX slide generation

Generate PowerPoint (PPTX) presentations using python-pptx. Professional slides with consistent theming, speaker notes, and visual variety. Can generate from a topic, from a markdown file, or as outline-only.

When to use: Creating slide decks for meetings, presentations, demos

Flags & modes: [topic] (from scratch) · from [file] (from markdown) · --slides N · --theme dark|light|corporate · --outline-only

zuvo:backlog Tech debt tracking + triage

Manage the project's tech debt backlog. Used by audit and review skills to persist findings via fingerprint-based deduplication. Browse, fix, dismiss, prioritize, and get batch suggestions for accumulated issues.

When to use: Viewing or managing accumulated tech debt, after running audits

Flags & modes: list [category] · add [desc] · fix B-{N} · wontfix B-{N} [reason] · stats · prioritize · suggest
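The page does not say how zuvo:backlog computes its fingerprints, so the following is only a sketch of the general technique of fingerprint-based deduplication. The choice of fields (`rule_id`, `path`, `message`), the normalization, and the truncated SHA-256 digest are all assumptions for illustration.

```python
import hashlib


def fingerprint(rule_id: str, path: str, message: str) -> str:
    """Derive a stable key from the fields that identify a finding (hypothetical choice)."""
    # Collapse whitespace and case so cosmetic differences do not create duplicates.
    normalized = " ".join(message.lower().split())
    payload = "\x00".join((rule_id, path, normalized))
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()[:12]


def add_finding(backlog: dict[str, dict], rule_id: str, path: str, message: str) -> bool:
    """Insert a finding unless its fingerprint already exists; return True if it was new."""
    key = fingerprint(rule_id, path, message)
    if key in backlog:
        backlog[key]["seen"] += 1  # repeat sighting: count it instead of duplicating it
        return False
    backlog[key] = {"rule": rule_id, "path": path, "message": message, "seen": 1}
    return True
```

With a scheme like this, running the same audit twice would update the existing entry rather than append a second copy.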

Quality Gates

39 gates stand between
your AI and bad code

Every production file is scored against CQ1-CQ22. Every test file against Q1-Q17. Critical gate = 0 means the agent stops and fixes before proceeding. No exceptions.

Code Quality

CQ1-CQ22

Production code gates with evidence requirements

CQ3 Error strategy CRIT

CQ4 Error narrowing CRIT

CQ5 Secret comparison CRIT

CQ6 Unbounded queries CRIT

CQ8 Frontend timeout CRIT

CQ14 Guard duplication CRIT

CQ1 Atomicity

CQ2 Return types

CQ7 Nullable access

CQ9 Cleanup/unsub

CQ10 Cast guards

CQ11 Idempotency

CQ12 Loop lookups

CQ13 Auth + filter

CQ15 External validation

CQ16 Float math

CQ17 PII in logs

CQ18 Stale closures

CQ19 API response leak

CQ20 Dual fields

CQ21 Manual upsert

CQ22 useEffect deps

Test Quality

Q1-Q17

Test file gates with scoring threshold

Q7 Error path coverage CRIT

Q11 Assertion strength CRIT

Q13 Mock boundaries CRIT

Q15 Edge case coverage CRIT

Q17 Oracle independence CRIT

Q1 Test isolation

Q2 Arrange-Act-Assert

Q3 Descriptive names

Q4 Branch coverage

Q5 Data factories

Q6 No implementation leak

Q8 Boundary values

Q9 Async handling

Q10 Cleanup/teardown

Q12 Flaky prevention

Q14 Snapshot discipline

Q16 Permission matrix

// Evidence standard

Format: file:function:line for each critical gate scored as 1

N/A abuse: if more than 60% of gates are marked N/A, the audit is flagged as low-signal; each N/A must be justified

Confidence: 0-25 discard · 26-50 backlog · 51+ report — zero silent discards
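As an illustration of the evidence standard, here is a minimal sketch that parses a `file:function:line` reference and applies the 60% N/A threshold. The function names and the score encoding (1, 0, or the string "N/A") are assumptions, not Zuvo's internal format.

```python
def parse_evidence(ref: str) -> tuple[str, str, int]:
    """Split 'file:function:line' into its parts; raises ValueError if malformed."""
    # rsplit keeps any colons inside the file path (e.g. a drive letter) intact.
    file, function, line = ref.rsplit(":", 2)
    if not file or not function:
        raise ValueError(f"incomplete evidence reference: {ref!r}")
    return file, function, int(line)


def is_low_signal(scores: dict[str, object]) -> bool:
    """Return True when more than 60% of the gates in an audit are marked N/A."""
    if not scores:
        return False
    na_count = sum(1 for score in scores.values() if score == "N/A")
    # Integer comparison avoids floating-point edge cases at exactly 60%.
    return na_count * 100 > 60 * len(scores)
```

For example, `parse_evidence("src/orders/service.ts:createOrder:42")` yields the path, the function name, and the line number as an integer; the path and function in that call are made up.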

Comparison

Depth, not breadth

Other tools give you a workflow. Zuvo gives you a workflow with 39 quality gates, evidence requirements, and zero trust in agent claims.

Feature	Zuvo	Superpowers	gstack	Compound
Total skills	33	14	28	6
Auto-routing	✓	✓	—	—
Multi-agent pipeline	10 agents	basic	—	—
Code quality gates	CQ1-CQ22	—	—	—
Test quality gates	Q1-Q17	—	—	—
Evidence requirements	✓	—	—	—
Security audit (OWASP)	S1-S14	—	1 cmd	—
Performance audit	12 dim	—	—	—
DB audit	60+ checks	—	—	—
Stack-specific rules	5 stacks	—	—	—
TDD enforcement	✓	✓	—	—
Verification protocol	✓	✓	—	—
CodeSift integration	✓	—	—	—
Backlog persistence	✓	—	—	✓

Testimonials

What developers say

“Before Zuvo, my AI would hallucinate tests passing. Now it can't claim success without running the actual command. The verification protocol alone is worth it.”

Greg, CEO @ Company

“33 skills sounds like marketing fluff until you use the security audit. S1-S14 with Sentry confidence scoring caught three auth bypass patterns our manual review missed.”

Tan, Staff Engineer @ Company

“The pipeline changed how I work. Brainstorm explores the codebase before I even describe the problem. Half the time it finds relevant existing code I didn't know about.”

Radek, Tech Lead @ Company

FAQ

Frequently asked questions

Everything you need to know about Zuvo — modes, tokens, stacks, and how things work under the hood.

What is auto-routing?

You describe what you want in plain language — "review my changes", "add a notification feature", "audit security" — and Zuvo's meta-skill router automatically picks the right skill. No slash commands to memorize. The router is injected at session start via a SessionStart hook and matches your intent to one of 33 skills.

What do the skill modes mean — deep, quick, auto, full?

full: the default mode; comprehensive analysis across all dimensions.
--quick: faster scan with fewer dimensions; good for iterative checks.
--deep: maximum thoroughness; more files analyzed, plus cross-file pattern detection.
--auto: skips plan approval and human interaction and runs end-to-end.
--dry-run: previews what would happen without writing files.
Each skill documents its modes in its entry above.

What is CodeSift and do I need it?

CodeSift is an MCP server for semantic code search, call chain tracing, complexity analysis, and module detection. Zuvo uses it for deep code exploration — tracing how a function flows through your codebase, finding duplicates, detecting architectural boundaries. It's optional. Zuvo works without it in degraded mode (falls back to grep/read), but CodeSift reduces token usage by 15-30% and enables features like trace_route, detect_communities, and find_clones.

How many tokens does a full pipeline run cost?

For a medium-complexity feature (5-10 files): Brainstorm ~30-50K, Plan ~40-60K, Execute ~15-25K per task (8 tasks typical). Total: 200-300K tokens. Smaller features (3-4 tasks) run 100-150K. CodeSift reduces usage by 15-30% vs degraded mode. Individual skills like zuvo:review or zuvo:code-audit are much cheaper — typically 10-30K.
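The per-phase figures above can be summed as a quick sanity check. This sketch uses only the ranges quoted in the answer; the function name and parameter layout are illustrative, and the result is a rough estimate rather than a measurement.

```python
def pipeline_tokens_k(
    tasks: int,
    brainstorm: tuple[int, int] = (30, 50),  # Brainstorm range in thousands of tokens
    plan: tuple[int, int] = (40, 60),        # Plan range in thousands of tokens
    per_task: tuple[int, int] = (15, 25),    # Execute range per task, in thousands
) -> tuple[int, int]:
    """Return the (low, high) estimate in thousands of tokens for one pipeline run."""
    low = brainstorm[0] + plan[0] + tasks * per_task[0]
    high = brainstorm[1] + plan[1] + tasks * per_task[1]
    return low, high


# Typical medium feature with 8 tasks: 30 + 40 + 8*15 and 50 + 60 + 8*25.
print(pipeline_tokens_k(8))  # (190, 310)
```

The 190-310K result is in line with the quoted 200-300K total. The sketch does not model the smaller-feature figure or the CodeSift reduction.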

What environments does Zuvo support?

Claude Code — full support: parallel agents via Task tool, model routing, user interaction via AskUserQuestion. Codex — parallel execution with TOML agents, capped at 6 threads, no user interaction. Cursor — sequential execution only, no agent spawning. All environments produce identical output — execution strategy adapts but quality gates remain the same.

What stacks and frameworks are supported?

Stack detection is automatic from your config files. Currently supported: TypeScript (tsconfig.json), React / Next.js (package.json deps), NestJS (@nestjs/core), Python (pyproject.toml), PHP / Yii2 (composer.json). Test runners detected: Vitest, Jest, PHPUnit, Codeception. ORMs: Prisma. Stack-specific rules load automatically — you don't configure anything.
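Detection from config files could look roughly like the sketch below. It checks only the marker files and dependencies named in the answer above; the function name, the returned labels, and the exact matching rules (for example, treating `react` or `next` in `package.json` as the React / Next.js stack) are assumptions, not Zuvo's actual logic.

```python
import json
from pathlib import Path


def detect_stacks(root: Path) -> list[str]:
    """Guess the stacks in a project directory from well-known config files."""
    stacks: list[str] = []
    if (root / "tsconfig.json").is_file():
        stacks.append("typescript")

    package_json = root / "package.json"
    if package_json.is_file():
        try:
            manifest = json.loads(package_json.read_text(encoding="utf-8"))
        except json.JSONDecodeError:
            manifest = {}
        if not isinstance(manifest, dict):
            manifest = {}
        # Merge runtime and dev dependencies; either can signal a framework.
        deps = {
            **(manifest.get("dependencies") or {}),
            **(manifest.get("devDependencies") or {}),
        }
        if "react" in deps or "next" in deps:
            stacks.append("react")
        if "@nestjs/core" in deps:
            stacks.append("nestjs")

    if (root / "pyproject.toml").is_file():
        stacks.append("python")
    if (root / "composer.json").is_file():
        stacks.append("php")
    return stacks
```

Calling `detect_stacks(Path("."))` in a project root returns the labels whose marker files are present, in the order checked.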

Does Zuvo modify my project files?

During installation: nothing is modified. At runtime: pipeline specs go to docs/specs/, tech debt backlog to memory/backlog.md, design systems to .interface-design/. All paths are deterministic and gitignore-friendly. You can delete these directories anytime without affecting Zuvo.

What's the difference between zuvo:build and the full pipeline?

zuvo:build is for scoped work — 1-5 files, clear scope, no design decisions needed. It runs blast radius + duplication scan, then TDD with quality gates. The full pipeline (brainstorm → plan → execute) is for features touching 5+ files or requiring design exploration. It adds 10 agents, spec/plan documents, and multi-stage review. If you're unsure, the router asks which approach fits.

What happens when a quality gate fails?

CQ1-CQ22 has 6 critical gates (CQ3, CQ4, CQ5, CQ6, CQ8, CQ14) that block immediately if scored 0 — the agent must fix before proceeding. Q1-Q17 has 5 critical gates (Q7, Q11, Q13, Q15, Q17). Non-critical failures are scored and tracked. If a fix takes under 5 minutes, the agent fixes it now. Otherwise, it's persisted to the backlog with a confidence score.
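The decision described above can be sketched as a small function. The critical gate IDs and the 5-minute rule come from this answer; the function name, the action labels, and the 1/0 score encoding are assumptions for illustration.

```python
# Critical gate IDs as listed on this page.
CRITICAL_GATES = frozenset(
    {"CQ3", "CQ4", "CQ5", "CQ6", "CQ8", "CQ14", "Q7", "Q11", "Q13", "Q15", "Q17"}
)


def next_action(gate: str, score: int, fix_minutes: float) -> str:
    """Decide what happens after a gate is scored (1 = pass, 0 = fail)."""
    if score != 0:
        return "pass"
    if gate in CRITICAL_GATES:
        return "block"  # the agent must fix before proceeding
    # Non-critical failure: fix immediately if cheap, otherwise persist to the backlog.
    return "fix-now" if fix_minutes < 5 else "backlog"


def blocking_gates(scores: dict[str, int]) -> list[str]:
    """List the critical gates scored 0, in the order they were scored."""
    return [gate for gate, score in scores.items() if gate in CRITICAL_GATES and score == 0]
```

A task would then proceed only when `blocking_gates` returns an empty list.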

Can I use specific skills without auto-routing?

Yes. Invoke directly: zuvo:review, zuvo:code-audit src/services/, zuvo:security-audit --live-url http://localhost:3000. Slash commands also work: /review, /build, /refactor. For tasks that don't need a skill, state your intent clearly ("just change the port to 3001") and the router won't activate.

Is Zuvo free?

Zuvo is open source under the MIT license. All 33 skills, all quality gates, all agent definitions are included. Install from the marketplace and use everything. You pay for the Claude API tokens consumed — Zuvo itself has no license fee.

How do I update Zuvo?

Enable auto-updates: /plugin → Select zuvo-marketplace → Enable auto-update. Or update manually: claude plugin marketplace update greglas75/zuvo-marketplace followed by claude plugin update zuvo.

Stop trusting.
Start verifying.

Two commands. 33 skills. Every line of code understood before it's written, verified after it's written, tracked if it has issues.

$ claude plugin marketplace add greglas75/zuvo-marketplace

$ claude plugin install zuvo
