pilot-shell vs Supermaven — Comparison | Unfragile

pilot-shell vs Supermaven

Supermaven ranks higher at 71/100 vs pilot-shell at 35/100. Capability-level comparison backed by match graph evidence from real search data.

pilot-shell

Product

/ 100

Free

Supermaven

Extension

/ 100

Free

From $10/mo

Feature	pilot-shell	Supermaven
Type	Product	Extension
UnfragileRank	35/100	71/100
Adoption	0	1
Quality	0

pilot-shell Capabilities

spec-driven task planning with feature/bugfix auto-detection

Analyzes user intent via the /spec command, automatically classifies tasks as features or bugfixes, and generates structured implementation plans using a state machine dispatcher that routes to feature or bugfix workflows. The planning phase uses Claude to decompose requirements into atomic steps with estimated complexity, then presents a human-reviewable plan before implementation begins. This enforces upfront design thinking and prevents Claude Code from diverging into ad-hoc implementations.

Unique: Uses a dispatcher-based state machine that routes feature and bugfix tasks through separate workflows (feature: plan → implement → verify; bugfix: plan → implement → regression test), with mandatory human approval gates between planning and implementation phases. This architectural pattern prevents Claude from skipping the planning phase entirely.

vs alternatives: Unlike Claude Code alone (which implements immediately) or generic AI agents (which lack project context), Pilot Shell enforces structured planning with automatic task classification and blocks implementation until a human approves the plan.

test-driven development enforcement with pre-implementation test generation

During the implementation phase of /spec workflows, generates test cases before code is written, then validates that all generated code passes those tests before marking tasks complete. The system uses a verification agent that runs test suites and blocks code merges if coverage or assertions are insufficient. This is enforced via hooks that intercept code changes and validate test presence before allowing commits.

Unique: Integrates test generation into the implementation phase via a hooks pipeline that intercepts code changes and validates test presence before allowing progression. Uses a verification agent that runs test suites and blocks code merges if tests fail or coverage is insufficient, making TDD non-optional rather than optional.

vs alternatives: Standard Claude Code has no built-in test enforcement; Pilot Shell's hooks pipeline and verification agent make test-first development automatic and mandatory, preventing developers from skipping tests even if they wanted to.

codebase-aware context injection with selective token budgeting

Pilot Shell injects project-specific context into Claude's system prompt at session start, including extracted conventions, relevant code patterns, and project rules from the semantic index. The context injection is selective and respects Claude's token budget — only the most relevant patterns are injected based on the current task, preventing context window overflow. The system uses a context monitor to track which files are most relevant to the current task and prioritizes injection of related patterns.

Unique: Uses a context monitor to selectively inject the most relevant project patterns into Claude's system prompt based on task scope, respecting token budgets by prioritizing high-impact patterns. This enables codebase awareness without exceeding context window limits, making large-codebase support practical.

vs alternatives: Unlike RAG systems that inject all matching documents (risking token overflow) or manual context setup (which is tedious), Pilot Shell's selective context injection uses task-aware heuristics to inject only the most relevant patterns, balancing context richness with token efficiency.

automated code review and style enforcement

The verification phase includes an automated code review agent that checks for style violations, architectural inconsistencies, and deviations from project conventions. The agent uses the extracted project rules and conventions to validate that generated code follows established patterns. Code that violates style or architectural rules is flagged and can block merges, providing automated enforcement of code quality standards without requiring manual review.

Unique: Implements an automated code review agent that validates generated code against extracted project rules and conventions, providing architectural and style enforcement without manual review. The agent uses the same rules extracted by /sync and /learn, making reviews consistent with project standards.

vs alternatives: Unlike manual code review (which is slow and subjective) or linting tools alone (which only check syntax), Pilot Shell's code review agent understands project conventions and architectural patterns, providing semantic-level code quality assurance.

session state persistence and recovery

Pilot Shell persists session state (current task, implementation progress, test results, verification status) to disk, enabling recovery if a session crashes or is interrupted. The worker service maintains a session state file that tracks the current /spec task, implementation phase, and verification results. If a session is interrupted, the next session can resume from the last checkpoint, preventing loss of work and enabling recovery from failures.

Unique: Persists session state to disk via the worker service, enabling recovery from crashes and interruptions. Session state includes current task, implementation progress, test results, and verification status, allowing seamless resumption from the last checkpoint.

vs alternatives: Unlike Claude Code alone (which has no session persistence) or manual checkpointing (which is error-prone), Pilot Shell's automatic session persistence enables recovery from crashes without user intervention, making long-running tasks more reliable.

persistent session memory with semantic codebase indexing

The /sync command builds a semantic search index of the entire codebase using embeddings, then stores project-specific context (architecture patterns, naming conventions, dependencies, test patterns) in a persistent memory store that survives across sessions. This context is automatically injected into Claude's context window at the start of each session, enabling Claude to understand project conventions without requiring manual context setup. The context monitor continuously tracks changes to key files and updates the index incrementally.

Unique: Uses a context monitor hook that tracks file changes and incrementally updates the semantic index, combined with a memory & console system that persists extracted conventions across sessions. The index is injected into Claude's context at session start, eliminating the need for manual context setup while staying within token budgets via selective injection of relevant patterns.

vs alternatives: Unlike Claude Code alone (which has no persistent memory between sessions) or generic RAG systems (which require manual indexing), Pilot Shell's /sync command automatically indexes the codebase and injects relevant context at session start, making project knowledge persistent without manual effort.

project-specific rules and conventions extraction via /learn

The /learn command captures non-obvious discoveries from the current session (e.g., 'this project uses a custom logger instead of console.log', 'all async functions must have timeout handling') and converts them into reusable skill files stored in ~/.pilot/skills/. These skills are automatically loaded into Claude's context for future sessions on the same project, and can be shared across teams via the /vault command. The system uses Claude to extract generalizable patterns from session interactions and format them as structured rules.

Unique: Converts session discoveries into structured skill files that are automatically loaded into Claude's context for future sessions, with a /vault integration for team-wide sharing. Unlike generic documentation, skills are machine-readable and directly injected into Claude's reasoning, making them immediately actionable.

vs alternatives: Standard Claude Code has no mechanism to capture and reuse project-specific patterns; Pilot Shell's /learn command converts ephemeral session insights into persistent, shareable skills that improve Claude's performance on future tasks in the same project.

team knowledge sharing via /vault with git-backed persistence

The /vault command shares rules, commands, skills, hooks, and agents across a team by syncing them to a private Git repository. Each team member's local ~/.pilot/ and ~/.claude/ directories can be configured to pull from a shared vault repository, enabling centralized management of project conventions, custom hooks, and reusable agents. The system uses Git as the backing store and provides conflict resolution via simple merge strategies (last-write-wins or manual resolution).

Unique: Uses Git as the backing store for team knowledge, enabling decentralized sync with version history and audit trails. Rules, skills, hooks, and agents are stored as files in the vault repository and pulled into each team member's local ~/.pilot/ directory, making team knowledge portable and version-controlled.

vs alternatives: Unlike centralized knowledge bases (which require a server) or manual documentation (which gets out of sync), Pilot Shell's /vault uses Git for decentralized, version-controlled sharing of project-specific rules and agents, making team knowledge portable and auditable.

+5 more capabilities

Supermaven Capabilities

codebase-aware inline code completion with 1m token context window

Generates single-line and multi-line code suggestions in real-time as developers type, using semantic indexing of the entire codebase to retrieve relevant type definitions, function signatures, and contextual patterns. The system maintains a 1M token context window (Pro/Team tiers) that enables suggestions informed by distant code definitions and cross-file dependencies, constructed via local codebase semantic search rather than simple token-based recency. Suggestions adapt to detected coding style on Pro/Team tiers through implicit pattern learning from recent edits.

Unique: 1M token context window with codebase-wide semantic indexing enables suggestions informed by distant code definitions and cross-file patterns, versus competitors (Copilot, Tabnine) that typically use fixed context windows (4K-32K tokens) or file-local context. Claimed 250ms latency suggests optimized retrieval pipeline, though indexing mechanism and performance at scale remain undisclosed.

vs alternatives: Larger context window than GitHub Copilot (8K-32K tokens) and faster latency than unnamed competitors (250ms vs 783ms claimed), enabling suggestions on large codebases with minimal typing delay; trade-off is cloud dependency and undisclosed free tier limitations.

multi-model conversational code chat with diff generation and application

Provides a separate chat interface supporting multiple LLM backends (GPT-4o, Claude 3.5 Sonnet, GPT-4, others) for conversational code assistance. Users attach files, reference recent edits, and trigger compiler diagnostic uploads; the system generates diffs and applies code changes directly to the editor. Model selection is per-conversation, and $5/month in credits (included in Pro/Team) covers external model API costs; overage pricing is undisclosed. Hotkey-driven workflow enables rapid context switching between inline completion and chat.

Unique: Multi-model chat interface with per-conversation model selection and integrated diff application, combined with compiler diagnostic auto-upload. Unlike Copilot Chat (single model per tier) or standalone ChatGPT, Supermaven Chat unifies multiple LLM backends in a single hotkey-driven workflow with direct editor integration for change application.

pilot-shell vs Supermaven

pilot-shell Capabilities

Supermaven Capabilities

Verdict

Company