All Skills

237 skills in the catalog

Agent Loop Controls

Elevated · Review

Quality gates, eval checkpoints, and recovery controls for continuous autonomous agent loops. Patterns for safe loop operation with stall detection and intervention hooks.

risk: T2published

Autonomous Loop Patterns

Elevated · Review

Autonomous agent loop architectures from simple to complex: sequential pipelines, infinite agentic loops with quality gates, continuous PR loops with CI gates, de-sloppify cleanup passes, and RFC-driven multi-agent DAG orchestration with merge queues.

risk: T2published

Prompt Optimization

Elevated · Review

Systematic prompt improvement pipeline: structured prompt analysis, A/B testing patterns, metric-driven iteration, and prompt versioning for production LLM applications.

risk: T2published

MCP Server Builder

Elevated · Review

MCP server construction with Node/TypeScript SDK: tool registration with Zod validation, resource and prompt handlers, stdio vs Streamable HTTP transport selection, and transport-independent server architecture.

risk: T2published

Progressive Context Retrieval

Elevated · Review

Solves the subagent context problem via 4-phase iterative retrieval: dispatch with minimal context, evaluate gaps, retrieve targeted context, and re-dispatch. Prevents context overflow and starvation in multi-agent workflows.

risk: T2published

CI/CD Deployment Patterns

Elevated · Review

Production deployment workflows: rolling/blue-green/canary strategies, CI/CD pipeline patterns, Docker multi-stage builds, health checks, readiness probes, rollback strategies, and environment-specific configuration management.

risk: T2published

Agent Benchmarking

Elevated · Review

Head-to-head coding agent comparison tool: YAML task definitions with judge criteria, git worktree isolation per agent run, pass rate/cost/time/consistency metrics, and reproducible benchmarking across Claude Code, Aider, Codex, and other agents.

risk: T2published

Playwright E2E

Elevated · Review

Playwright E2E testing patterns: Page Object Model, test configuration, CI/CD integration with artifact capture (screenshots, videos, traces), flaky test quarantine, and critical user flow coverage strategies.

risk: T2published

Eval-Driven Development

Elevated · Review

Eval-driven development framework: define pass/fail criteria before implementation, capability/regression/consistency eval types, pass@k reliability metrics, grader patterns, and continuous eval integration during development.

risk: T2published

TDD Enforcement

Elevated · Review

Test-driven development enforcement: write-tests-first methodology, RED/GREEN/REFACTOR cycle, 80%+ coverage requirement across unit/integration/E2E, user journey definitions, test pyramid structure, and coverage verification gates.

risk: T2published

Docker Compose Ops

Elevated · Review

Docker and Docker Compose patterns: multi-service orchestration for local development, bind mounts with anonymous volume isolation, container networking, multi-stage Dockerfile optimization, container security hardening, and volume backup strategies.

risk: T2published

Verification Gate

Elevated · Review

Pre-PR verification pipeline: sequential build check, type check, lint, test with coverage, security scan, and diff review. Hard stop on build failure. Report all type errors before proceeding.

risk: T2published

Skill Audit

Elevated · Review

Audit skill and command libraries for quality: quick scan mode for changed skills, full stocktake with sequential subagent batch evaluation, quality scoring, and gap analysis.

risk: T2published

Backend Architecture

Elevated · Review

Backend architecture patterns: repository/service/controller layers, database query optimization, Redis caching, background jobs, middleware chains, and structured error handling.

risk: T2published

PostgreSQL Optimization

Attested

PostgreSQL quick reference: B-tree/GIN/BRIN index selection, data type cheat sheet, Row Level Security patterns, connection pooling, and Supabase-specific best practices.

risk: T1published

REST API Design

Elevated · Review

REST API design patterns: resource naming, HTTP methods, status codes, pagination, cursor-based filtering, error response envelopes, versioning strategies, and rate limiting configuration.

risk: T2published

Security OWASP Review

Elevated · Review

OWASP Top 10 security checklist with code patterns for secrets management, input validation, SQL injection prevention, XSS, CSRF, rate limiting, and pre-deployment security verification.

risk: T2published

DB Migration Patterns

Elevated · Review

Zero-downtime database migration patterns: forward-only production migrations, concurrent index creation, nullable column additions, separate schema vs data migrations, rollback planning.

risk: T2published

SHA-256 Cache Pattern

Attested

SHA-256 content-hash caching for expensive file processing. Path-independent keys auto-invalidate on content change. Frozen dataclass cache entries with file-based O(1) storage.

risk: T1published

AgentShield Config Audit

Elevated · Review

Scans agent harness configurations for security vulnerabilities, misconfigurations, injection risks, secrets leaks, and permission escalation vectors.

risk: T2published

AI Regression Sandbox

Elevated · Review

Regression testing strategies for AI-assisted development. Sandbox-mode API testing without database dependencies, automated bug-check workflows, patterns to catch blind spots where the same model writes and reviews code.

risk: T2published