Your First Fleet
A guided tour of the things worth trying first, roughly in order. Each builds on the last. Assumes you've installed agents-fleet and skimmed the Core Concepts.
Every command below is documented in full in the Commands Manual. Type
/+ Tab in the REPL for the interactive picker.
1. /init — let the fleet understand your codebase
❯ /initSpawns seven explorers + one adversarial critic. Each explorer investigates a different domain (architecture, build, patterns, decisions, dependencies, APIs, security); the critic challenges every finding. Results land in .fleet/context/ and auto-load in every future session. Run this first on any new project.
2. /research — deep adversarial investigation
❯ /research "best approach to add caching in this codebase"Spawns four researchers (primary, counter-argument, context, edge-cases) plus a critic that verifies every claim against actual source. Use this before committing to a design.
3. /feature — plan and build a feature
❯ /feature "Add rate limiting to the API endpoints"Runs the multi-phase planning pipeline: brainstorm → requirements → research → design (parallel proposals, one critic picks the best) → validation → definition-of-done → task breakdown. Since v0.25.0 the brainstorm phase treats your description as authoritative and deepens it rather than re-asking from scratch.
4. /start — watch the fleet execute
❯ /startAfter planning, this kicks off autonomous execution: workers spawn for all ready tasks (up to 12 in parallel), blocked tasks auto-unblock as dependencies complete, coder workers get isolated worktrees, and the loop runs until every Definition-of-Done item passes. Monitor with /tasks, /workers, /dod, /tokens.
5. /code-review — four reviewers in parallel
❯ /code-reviewSpawns four specialized reviewers at once — 🔒 security, 🐛 correctness, 🏗️ architecture, ⚡ performance — and synthesizes their findings into one actionable report.
6. Autopilots — end-to-end, hands-off loops
Autopilots chain investigate → fix → test → review → PR into a single self-driving run. They're ideal for unattended work:
❯ /bug-autopilot "users get a 500 when the cart is empty"
❯ /github-issue-worker-autopilot # triage open GitHub issues → fix → ship draft PRs
❯ /code-review-autopilot --scope=diff # iterative review-and-fix loop on your diff/bug-autopilot reproduces the bug, fixes it, adds a regression test, runs a review loop, opens a PR, waits for your approval, and merges. These run deterministically in-process and report progress back through the coordinator — watch them with /status. (Pair with Telegram steering to approve PRs from your phone.)
7. Goal-driven loops
When you want the fleet to keep working toward a free-form goal — even with no tasks tracked yet — use a loop:
❯ /loop "polish the docs until they're release-ready" 30m
❯ /loop-target 30m "finish the auth module" --unattended # --unattended required under --provider claudeEach cycle the coordinator decides whether to create tasks, spawn workers, or signal completion. Ticks fire on coordinator idle, so the interval is a minimum delay, not a strict period.
8. /compete — build it three ways, pick the best
❯ /compete "implement the search feature"Three workers build the same feature independently in separate worktrees, then a judge scores correctness, performance, and quality. You merge the winner.
9. /crew — activate a specialized team
❯ /crews # browse available crews
❯ /crew code-review-crew # activate
❯ /crew stop # deactivateWhen a crew is active, workers use crew-specific prompts (not generic defaults) and the intelligence system tracks per-skill performance. Author your own by dropping a file in .fleet/crews/ — see Crews — Getting Started.
10. /learn — see what the fleet has learned
❯ /learn overview # session insights + pending suggestions
❯ /learn evolve <crew> # propose skill-prompt improvements from telemetry
❯ /learn dashboard <crew> # before/after evolution metrics
❯ /learn shadows # shadow proposals with win/loss stats
❯ /learn promote <skill> # promote a winning shadow to liveThe intelligence pipeline tracks every session — success rates, error patterns, duration trends — and proposes data-driven improvements to agent prompts, which you can validate with A/B eval before committing. See Fleet Intelligence.
Bonus: tune the engine
❯ /models # interactive model picker
❯ /effort # reasoning depth: low → medium → high → xhigh
❯ /doctor # environment + provider-auth health checkWhat to do next
- Run
/initon your project if you haven't — it pays for itself immediately. - Try
/researchon a design question you've been mulling. - Use
/featurefor your next real feature and go through the full pipeline. - Activate a crew and run
/code-reviewto feel the difference specialized prompts make. - Check
/learn overviewafter a few sessions and watch the system spot patterns.
For the full mental model, revisit Core Concepts; for an end-to-end walkthrough, see the Orchestration Tutorial.