# Changelog All notable changes to the compound-engineering plugin will be documented in this file. The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/), and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html). ## [2.33.1] - 2026-02-13 ### Changed - **`/workflows:plan` command** - All plan templates now include `status: active` in YAML frontmatter. Plans are created with `status: active` and marked `status: completed` when work finishes. - **`/workflows:work` command** - Phase 4 now updates plan frontmatter from `status: active` to `status: completed` after shipping. Agents can grep for status to distinguish current vs historical plans. --- ## [2.33.0] - 2026-02-12 ### Added - **`setup` skill** — Interactive configurator for review agents - Auto-detects project type (Rails, Python, TypeScript, etc.) - Two paths: "Auto-configure" (one click) or "Customize" (pick stack, focus areas, depth) - Writes `compound-engineering.local.md` in project root (tool-agnostic — works for Claude, Codex, OpenCode) - Invoked automatically by `/workflows:review` when no settings file exists - **`learnings-researcher` in `/workflows:review`** — Always-run agent that searches `docs/solutions/` for past issues related to the PR - **`schema-drift-detector` wired into `/workflows:review`** — Conditional agent for PRs with migrations ### Changed - **`/workflows:review`** — Now reads review agents from `compound-engineering.local.md` settings file. Falls back to invoking setup skill if no file exists. - **`/workflows:work`** — Review agents now configurable via settings file - **`/release-docs` command** — Moved from plugin to local `.claude/commands/` (repo maintenance, not distributed) ### Removed - **`/technical_review` command** — Superseded by configurable review agents --- ## [2.32.0] - 2026-02-11 ### Added - **Factory Droid target** — New converter target for [Factory Droid](https://docs.factory.ai). Install with `--to droid` to output agents, commands, and skills to `~/.factory/`. Includes tool name mapping (Claude → Factory), namespace prefix stripping, Task syntax conversion, and agent reference rewriting. 13 new tests (9 converter + 4 writer). ([#174](https://github.com/EveryInc/compound-engineering-plugin/pull/174)) --- ## [2.31.1] - 2026-02-09 ### Changed - **`dspy-ruby` skill** — Complete rewrite to DSPy.rb v0.34.3 API: `.call()` / `result.field` patterns, `T::Enum` classes, `DSPy::Tools::Base` / `Toolset`. Added events system, lifecycle callbacks, fiber-local LM context, GEPA optimization, evaluation framework, typed context pattern, BAML/TOON schema formats, storage system, score reporting, RubyLLM adapter. 5 reference files (2 new: toolsets, observability), 3 asset templates rewritten. ## [2.31.0] - 2026-02-08 ### Added - **`document-review` skill** — Brainstorm and plan refinement through structured review ([@Trevin Chow](https://github.com/trevin)) - **`/sync` command** — Sync Claude Code personal config across machines ([@Terry Li](https://github.com/terryli)) ### Changed - **Context token optimization (79% reduction)** — Plugin was consuming 316% of the context description budget, causing Claude Code to silently exclude components. Now at 65% with room to grow: - All 29 agent descriptions trimmed from ~1,400 to ~180 chars avg (examples moved to agent body) - 18 manual commands marked `disable-model-invocation: true` (side-effect commands like `/lfg`, `/deploy-docs`, `/triage`, etc.) - 6 manual skills marked `disable-model-invocation: true` (`orchestrating-swarms`, `git-worktree`, `skill-creator`, `compound-docs`, `file-todos`, `resolve-pr-parallel`) - **git-worktree**: Remove confirmation prompt for worktree creation ([@Sam Xie](https://github.com/samxie)) - **Prevent subagents from writing intermediary files** in compound workflow ([@Trevin Chow](https://github.com/trevin)) ### Fixed - Fix crash when hook entries have no matcher ([@Roberto Mello](https://github.com/robertomello)) - Fix git-worktree detection where `.git` is a file, not a directory ([@David Alley](https://github.com/davidalley)) - Backup existing config files before overwriting in sync ([@Zac Williams](https://github.com/zacwilliams)) - Note new repository URL ([@Aarni Koskela](https://github.com/aarnikoskela)) - Plugin component counts corrected: 29 agents, 24 commands, 18 skills --- ## [2.30.0] - 2026-02-05 ### Added - **`orchestrating-swarms` skill** - Comprehensive guide to multi-agent orchestration - Covers primitives: Agent, Team, Teammate, Leader, Task, Inbox, Message, Backend - Documents two spawning methods: subagents vs teammates - Explains all 13 TeammateTool operations - Includes orchestration patterns: Parallel Specialists, Pipeline, Self-Organizing Swarm - Details spawn backends: in-process, tmux, iterm2 - Provides complete workflow examples - **`/slfg` command** - Swarm-enabled variant of `/lfg` that uses swarm mode for parallel execution ### Changed - **`/workflows:work` command** - Added optional Swarm Mode section for parallel execution with coordinated agents --- ## [2.29.0] - 2026-02-04 ### Added - **`schema-drift-detector` agent** - Detects unrelated schema.rb changes in PRs - Compares schema.rb diff against migrations in the PR - Catches columns, indexes, and tables from other branches - Prevents accidental inclusion of local database state - Provides clear fix instructions (checkout + migrate) - Essential pre-merge check for any PR with database changes --- ## [2.28.0] - 2026-01-21 ### Added - **`/workflows:brainstorm` command** - Guided ideation flow to expand options quickly (#101) ### Changed - **`/workflows:plan` command** - Smarter research decision logic before deep dives (#100) - **Research checks** - Mandatory API deprecation validation in research flows (#102) - **Docs** - Call out experimental OpenCode/Codex providers and install defaults - **CLI defaults** - `install` pulls from GitHub by default and writes OpenCode/Codex output to global locations ### Merged PRs - [#102](https://github.com/EveryInc/compound-engineering-plugin/pull/102) feat(research): add mandatory API deprecation validation - [#101](https://github.com/EveryInc/compound-engineering-plugin/pull/101) feat: Add /workflows:brainstorm command and skill - [#100](https://github.com/EveryInc/compound-engineering-plugin/pull/100) feat(workflows:plan): Add smart research decision logic ### Contributors Huge thanks to the community contributors who made this release possible! 🙌 - **[@tmchow](https://github.com/tmchow)** - Brainstorm workflow, research decision logic (2 PRs) - **[@jaredmorgenstern](https://github.com/jaredmorgenstern)** - API deprecation validation --- ## [2.27.0] - 2026-01-20 ### Added - **`/workflows:plan` command** - Interactive Q&A refinement phase (#88) - After generating initial plan, now offers to refine with targeted questions - Asks up to 5 questions about ambiguous requirements, edge cases, or technical decisions - Incorporates answers to strengthen the plan before finalization ### Changed - **`/workflows:work` command** - Incremental commits and branch safety (#93) - Now commits after each completed task instead of batching at end - Added branch protection checks before starting work - Better progress tracking with per-task commits ### Fixed - **`dhh-rails-style` skill** - Fixed broken markdown table formatting (#96) - **Documentation** - Updated hardcoded year references from 2025 to 2026 (#86, #91) ### Contributors Huge thanks to the community contributors who made this release possible! 🙌 - **[@tmchow](https://github.com/tmchow)** - Interactive Q&A for plans, incremental commits, year updates (3 PRs!) - **[@ashwin47](https://github.com/ashwin47)** - Markdown table fix - **[@rbouschery](https://github.com/rbouschery)** - Documentation year update ### Summary - 27 agents, 23 commands, 14 skills, 1 MCP server --- ## [2.26.5] - 2026-01-18 ### Changed - **`/workflows:work` command** - Now marks off checkboxes in plan document as tasks complete - Added step to update original plan file (`[ ]` → `[x]`) after each task - Ensures no checkboxes are left unchecked when work is done - Keeps plan as living document showing progress --- ## [2.26.4] - 2026-01-15 ### Changed - **`/workflows:work` command** - PRs now include Compound Engineered badge - Updated PR template to include badge at bottom linking to plugin repo - Added badge requirement to quality checklist - Badge provides attribution and link to the plugin that created the PR --- ## [2.26.3] - 2026-01-14 ### Changed - **`design-iterator` agent** - Now auto-loads design skills at start of iterations - Added "Step 0: Discover and Load Design Skills (MANDATORY)" section - Discovers skills from ~/.claude/skills/, .claude/skills/, and plugin cache - Maps user context to relevant skills (Swiss design → swiss-design skill, etc.) - Reads SKILL.md files to load principles into context before iterating - Extracts key principles: grid specs, typography rules, color philosophy, layout principles - Skills are applied throughout ALL iterations for consistent design language --- ## [2.26.2] - 2026-01-14 ### Changed - **`/test-browser` command** - Clarified to use agent-browser CLI exclusively - Added explicit "CRITICAL: Use agent-browser CLI Only" section - Added warning: "DO NOT use Chrome MCP tools (mcp__claude-in-chrome__*)" - Added Step 0: Verify agent-browser installation before testing - Added full CLI reference section at bottom - Added Next.js route mapping patterns --- ## [2.26.1] - 2026-01-14 ### Changed - **`best-practices-researcher` agent** - Now checks skills before going online - Phase 1: Discovers and reads relevant SKILL.md files from plugin, global, and project directories - Phase 2: Only goes online for additional best practices if skills don't provide enough coverage - Phase 3: Synthesizes all findings with clear source attribution (skill-based > official docs > community) - Skill mappings: Rails → dhh-rails-style, Frontend → frontend-design, AI → agent-native-architecture, etc. - Prioritizes curated skill knowledge over external sources for trivial/common patterns --- ## [2.26.0] - 2026-01-14 ### Added - **`/lfg` command** - Full autonomous engineering workflow - Orchestrates complete feature development from plan to PR - Runs: plan → deepen-plan → work → review → resolve todos → test-browser → feature-video - Uses ralph-loop for autonomous completion - Migrated from local command, updated to use `/test-browser` instead of `/playwright-test` ### Summary - 27 agents, 21 commands, 14 skills, 1 MCP server --- ## [2.25.0] - 2026-01-14 ### Added - **`agent-browser` skill** - Browser automation using Vercel's agent-browser CLI - Navigate, click, fill forms, take screenshots - Uses ref-based element selection (simpler than Playwright) - Works in headed or headless mode ### Changed - **Replaced Playwright MCP with agent-browser** - Simpler browser automation across all browser-related features: - `/test-browser` command - Now uses agent-browser CLI with headed/headless mode option - `/feature-video` command - Uses agent-browser for screenshots - `design-iterator` agent - Browser automation via agent-browser - `design-implementation-reviewer` agent - Screenshot comparison - `figma-design-sync` agent - Design verification - `bug-reproduction-validator` agent - Bug reproduction - `/review` workflow - Screenshot capabilities - `/work` workflow - Browser testing - **`/test-browser` command** - Added "Step 0" to ask user if they want headed (visible) or headless browser mode ### Removed - **Playwright MCP server** - Replaced by agent-browser CLI (simpler, no MCP overhead) - **`/playwright-test` command** - Renamed to `/test-browser` ### Summary - 27 agents, 20 commands, 14 skills, 1 MCP server --- ## [2.23.2] - 2026-01-09 ### Changed - **`/reproduce-bug` command** - Enhanced with Playwright visual reproduction: - Added Phase 2 for visual bug reproduction using browser automation - Step-by-step guide for navigating to affected areas - Screenshot capture at each reproduction step - Console error checking - User flow reproduction with clicks, typing, and snapshots - Better documentation structure with 4 clear phases ### Summary - 27 agents, 21 commands, 13 skills, 2 MCP servers --- ## [2.23.1] - 2026-01-08 ### Changed - **Agent model inheritance** - All 26 agents now use `model: inherit` so they match the user's configured model. Only `lint` keeps `model: haiku` for cost efficiency. (fixes #69) ### Summary - 27 agents, 21 commands, 13 skills, 2 MCP servers --- ## [2.23.0] - 2026-01-08 ### Added - **`/agent-native-audit` command** - Comprehensive agent-native architecture review - Launches 8 parallel sub-agents, one per core principle - Principles: Action Parity, Tools as Primitives, Context Injection, Shared Workspace, CRUD Completeness, UI Integration, Capability Discovery, Prompt-Native Features - Each agent produces specific score (X/Y format with percentage) - Generates summary report with overall score and top 10 recommendations - Supports single principle audit via argument ### Summary - 27 agents, 21 commands, 13 skills, 2 MCP servers --- ## [2.22.0] - 2026-01-05 ### Added - **`rclone` skill** - Upload files to S3, Cloudflare R2, Backblaze B2, and other cloud storage providers ### Changed - **`/feature-video` command** - Enhanced with: - Better ffmpeg commands for video/GIF creation (proper scaling, framerate control) - rclone integration for cloud uploads - Screenshot copying to project folder - Improved upload options workflow ### Summary - 27 agents, 20 commands, 13 skills, 2 MCP servers --- ## [2.21.0] - 2026-01-05 ### Fixed - Version history cleanup after merge conflict resolution ### Summary This release consolidates all recent work: - `/feature-video` command for recording PR demos - `/deepen-plan` command for enhanced planning - `create-agent-skills` skill rewrite (official spec compliance) - `agent-native-architecture` skill major expansion - `dhh-rails-style` skill consolidation (merged dhh-ruby-style) - 27 agents, 20 commands, 12 skills, 2 MCP servers --- ## [2.20.0] - 2026-01-05 ### Added - **`/feature-video` command** - Record video walkthroughs of features using Playwright ### Changed - **`create-agent-skills` skill** - Complete rewrite to match Anthropic's official skill specification ### Removed - **`dhh-ruby-style` skill** - Merged into `dhh-rails-style` skill --- ## [2.19.0] - 2025-12-31 ### Added - **`/deepen-plan` command** - Power enhancement for plans. Takes an existing plan and runs parallel research sub-agents for each major section to add: - Best practices and industry patterns - Performance optimizations - UI/UX improvements (if applicable) - Quality enhancements and edge cases - Real-world implementation examples The result is a deeply grounded, production-ready plan with concrete implementation details. ### Changed - **`/workflows:plan` command** - Added `/deepen-plan` as option 2 in post-generation menu. Added note: if running with ultrathink enabled, automatically run deepen-plan for maximum depth. ## [2.18.0] - 2025-12-25 ### Added - **`agent-native-architecture` skill** - Added **Dynamic Capability Discovery** pattern and **Architecture Review Checklist**: **New Patterns in mcp-tool-design.md:** - **Dynamic Capability Discovery** - For external APIs (HealthKit, HomeKit, GraphQL), build a discovery tool (`list_*`) that returns available capabilities at runtime, plus a generic access tool that takes strings (not enums). The API validates, not your code. This means agents can use new API capabilities without code changes. - **CRUD Completeness** - Every entity the agent can create must also be readable, updatable, and deletable. Incomplete CRUD = broken action parity. **New in SKILL.md:** - **Architecture Review Checklist** - Pushes reviewer findings earlier into the design phase. Covers tool design (dynamic vs static, CRUD completeness), action parity (capability map, edit/delete), UI integration (agent → UI communication), and context injection. - **Option 11: API Integration** - New intake option for connecting to external APIs like HealthKit, HomeKit, GraphQL - **New anti-patterns:** Static Tool Mapping (building individual tools for each API endpoint), Incomplete CRUD (create-only tools) - **Tool Design Criteria** section added to success criteria checklist **New in shared-workspace-architecture.md:** - **iCloud File Storage for Multi-Device Sync** - Use iCloud Documents for your shared workspace to get free, automatic multi-device sync without building a sync layer. Includes implementation pattern, conflict handling, entitlements, and when NOT to use it. ### Philosophy This update codifies a key insight for **agent-native apps**: when integrating with external APIs where the agent should have the same access as the user, use **Dynamic Capability Discovery** instead of static tool mapping. Instead of building `read_steps`, `read_heart_rate`, `read_sleep`... build `list_health_types` + `read_health_data(dataType: string)`. The agent discovers what's available, the API validates the type. Note: This pattern is specifically for agent-native apps following the "whatever the user can do, the agent can do" philosophy. For constrained agents with intentionally limited capabilities, static tool mapping may be appropriate. --- ## [2.17.0] - 2025-12-25 ### Enhanced - **`agent-native-architecture` skill** - Major expansion based on real-world learnings from building the Every Reader iOS app. Added 5 new reference documents and expanded existing ones: **New References:** - **dynamic-context-injection.md** - How to inject runtime app state into agent system prompts. Covers context injection patterns, what context to inject (resources, activity, capabilities, vocabulary), implementation patterns for Swift/iOS and TypeScript, and context freshness. - **action-parity-discipline.md** - Workflow for ensuring agents can do everything users can do. Includes capability mapping templates, parity audit process, PR checklists, tool design for parity, and context parity guidelines. - **shared-workspace-architecture.md** - Patterns for agents and users working in the same data space. Covers directory structure, file tools, UI integration (file watching, shared stores), agent-user collaboration patterns, and security considerations. - **agent-native-testing.md** - Testing patterns for agent-native apps. Includes "Can Agent Do It?" tests, the Surprise Test, automated parity testing, integration testing, and CI/CD integration. - **mobile-patterns.md** - Mobile-specific patterns for iOS/Android. Covers background execution (checkpoint/resume), permission handling, cost-aware design (model tiers, token budgets, network awareness), offline handling, and battery awareness. **Updated References:** - **architecture-patterns.md** - Added 3 new patterns: Unified Agent Architecture (one orchestrator, many agent types), Agent-to-UI Communication (shared data store, file watching, event bus), and Model Tier Selection (fast/balanced/powerful). **Updated Skill Root:** - **SKILL.md** - Expanded intake menu (now 10 options including context injection, action parity, shared workspace, testing, mobile patterns). Added 5 new agent-native anti-patterns (Context Starvation, Orphan Features, Sandbox Isolation, Silent Actions, Capability Hiding). Expanded success criteria with agent-native and mobile-specific checklists. - **`agent-native-reviewer` agent** - Significantly enhanced with comprehensive review process covering all new patterns. Now checks for action parity, context parity, shared workspace, tool design (primitives vs workflows), dynamic context injection, and mobile-specific concerns. Includes detailed anti-patterns, output format template, quick checks ("Write to Location" test, Surprise test), and mobile-specific verification. ### Philosophy These updates operationalize a key insight from building agent-native mobile apps: **"The agent should be able to do anything the user can do, through tools that mirror UI capabilities, with full context about the app state."** The failure case that prompted these changes: an agent asked "what reading feed?" when a user said "write something in my reading feed"—because it had no `publish_to_feed` tool and no context about what "feed" meant. ## [2.16.0] - 2025-12-21 ### Enhanced - **`dhh-rails-style` skill** - Massively expanded reference documentation incorporating patterns from Marc Köhlbrugge's Unofficial 37signals Coding Style Guide: - **controllers.md** - Added authorization patterns, rate limiting, Sec-Fetch-Site CSRF protection, request context concerns - **models.md** - Added validation philosophy, let it crash philosophy (bang methods), default values with lambdas, Rails 7.1+ patterns (normalizes, delegated types, store accessor), concern guidelines with touch chains - **frontend.md** - Added Turbo morphing best practices, Turbo frames patterns, 6 new Stimulus controllers (auto-submit, dialog, local-time, etc.), Stimulus best practices, view helpers, caching with personalization, broadcasting patterns - **architecture.md** - Added path-based multi-tenancy, database patterns (UUIDs, state as records, hard deletes, counter caches), background job patterns (transaction safety, error handling, batch processing), email patterns, security patterns (XSS, SSRF, CSP), Active Storage patterns - **gems.md** - Added expanded what-they-avoid section (service objects, form objects, decorators, CSS preprocessors, React/Vue), testing philosophy with Minitest/fixtures patterns ### Credits - Reference patterns derived from [Marc Köhlbrugge's Unofficial 37signals Coding Style Guide](https://github.com/marckohlbrugge/unofficial-37signals-coding-style-guide) ## [2.15.2] - 2025-12-21 ### Fixed - **All skills** - Fixed spec compliance issues across 12 skills: - Reference files now use proper markdown links (`[file.md](./references/file.md)`) instead of backtick text - Descriptions now use third person ("This skill should be used when...") per skill-creator spec - Affected skills: agent-native-architecture, andrew-kane-gem-writer, compound-docs, create-agent-skills, dhh-rails-style, dspy-ruby, every-style-editor, file-todos, frontend-design, gemini-imagegen ### Added - **CLAUDE.md** - Added Skill Compliance Checklist with validation commands for ensuring new skills meet spec requirements ## [2.15.1] - 2025-12-18 ### Changed - **`/workflows:review` command** - Section 7 now detects project type (Web, iOS, or Hybrid) and offers appropriate testing. Web projects get `/playwright-test`, iOS projects get `/xcode-test`, hybrid projects can run both. ## [2.15.0] - 2025-12-18 ### Added - **`/xcode-test` command** - Build and test iOS apps on simulator using XcodeBuildMCP. Automatically detects Xcode project, builds app, launches simulator, and runs test suite. Includes retries for flaky tests. - **`/playwright-test` command** - Run Playwright browser tests on pages affected by current PR or branch. Detects changed files, maps to affected routes, generates/runs targeted tests, and reports results with screenshots.