The Agent Orchestration Landscape: Where Callipso Fits in April 2026

Six months ago, "AI coding agent" meant one thing: a single assistant in your editor. Today it means five Claude Code sessions rewriting your backend in parallel while you review diffs over coffee. The tooling around this shift is exploding — the design choices diverge sharply, and new entrants appear every week.

We mapped every orchestrator we could find. Terminal multiplexers written in Go. Desktop apps that replace your IDE entirely. Cloud platforms running hundreds of agents headlessly. Voice controllers on iOS. We looked at what they build, what they skip, and what trade-offs they make.

This is not a product comparison — we are obviously biased. It is a landscape survey from the perspective of builders who had to make their own architectural bets. We will be honest about where those bets are ahead and where they are behind.

The Core Problem

The workflow is straightforward: you have three features to build, a bug to fix, and tests to write. You want five AI agents running simultaneously, each in its own git worktree, each making progress while you review.

The naive approach — five terminal tabs, manual branch management, copy-pasting prompts — breaks down fast. You lose track of which agent is doing what. Branches collide. You send a prompt to the wrong session. The cognitive load of managing the agents exceeds the work the agents are doing.

Every tool in this space exists to solve that problem. They disagree on how.

Three Philosophies

After studying the landscape, we see three distinct approaches. Each reflects a different bet about how developers will work with AI agents.

Philosophy 1: Replace the IDE

Intent (Augment Code), OpenAI's Codex App, and to some extent Conductor (Melty Labs) take the boldest position: build an entirely new environment designed around agent orchestration from the ground up.

Intent introduces a coordinator-implementor-verifier pipeline. A planning agent uses Augment's Context Engine to understand your task and propose a plan as a living specification. You review and approve. Then implementor agents execute in isolated git worktrees while a verifier agent checks results against the spec. When requirements change, updates propagate to all active agents. It is architecturally coherent — and it explicitly markets itself as an IDE replacement.

Since our first version of this post, Intent shipped a bring-your-own-agent (BYOA) model — you can now run Claude Code, Codex, or OpenCode inside Intent's workspace. This is a meaningful change: you are no longer locked into Intent's agent. However, the environment lock-in remains. You are still running those agents inside Intent. Your existing iTerm2 tabs, your Ghostty windows, your Cursor terminals — Intent does not see them. BYOA means "bring your own agent into our house." It does not mean Intent comes to yours.

OpenAI's Codex App takes a similar approach. Threads of agents run in parallel, each with isolated code copies, managed through a purpose-built desktop surface. It runs on both macOS and Windows. Multi-agent workflows spawn specialized sub-agents, with Codex handling orchestration across threads.

Conductor from Melty Labs provides a dashboard of all agents — "who is working on what" — with real-time code review, trace logging, and automated PR creation. Each agent gets its own worktree. Checkpoints provide automatic snapshots with rollback. The free tier shifts costs to the AI services themselves — BYOK.

The upside of this approach is deep integration. When you control the entire environment, specs can update automatically, agents share context natively, and the UI reflects agent state in real time.

The downside is environment lock-in. You abandon your existing workflow. Your Cursor keybindings, your iTerm2 profiles, your Ghostty config — gone. Even with BYOA softening the agent lock-in, you still have to move your work into their environment. The switching cost is high when most developers have spent years customizing their setup.

Philosophy 2: Multiply Terminals

Claude Squad, Plural, and the Composio Agent Orchestrator take the opposite approach. They are lightweight multiplexers — spawn N agents, each in a tmux pane or its own worktree, manage them through a terminal UI or framework.

Claude Squad is the most well-known, written in Go. It supports five different AI CLIs: Claude Code, Aider, Codex, OpenCode, and Amp. Each agent runs in a tmux session with its own git worktree. The TUI shows all sessions at a glance. Single binary, works anywhere tmux runs.

Superset is an open-source terminal for running 10+ parallel AI agents, now supporting eight CLI agents: Claude Code, Codex, Gemini CLI, OpenCode, Aider, Copilot, Cursor Agent, and Superset Chat. Each session gets its own git worktree. A persistent daemon survives crashes and app restarts, so long-running tasks continue even when you close the window. Apache 2.0 license, zero telemetry, SOC 2 compliant.

Agent Deck is a Go + Bubble Tea TUI with a distinctive feature: Conductors. These are persistent Claude Code sessions that monitor and orchestrate all other sessions — watching for agents that need help, auto-responding when confident, escalating to you when they cannot. Socket pooling shares MCP processes across sessions via Unix sockets, reducing MCP memory usage by 85-90%. Optional Telegram and Slack integration for remote control.

ittybitty takes the simplest approach: specify a task and it spawns a Claude instance in a tmux virtual terminal. Agents can recursively spawn sub-agents. Minimal surface area — just a shell script and tmux.

myclaude treats Claude Code as an orchestration layer rather than the sole agent. It coordinates multiple AI backends — Codex, Gemini, OpenCode — from within Claude Code, keeping you in a single terminal context instead of switching between tools.

Plural focuses on forking sessions and broadcasting prompts across branches and repos. It can import issues from GitHub, Asana, and Linear, then dispatch them to parallel agents.

Composio Agent Orchestrator is a framework rather than a UI. It manages fleets of agents with a Planner layer (task decomposition) and an Executor layer (tool interaction). Each agent gets its own worktree, branch, and PR. When CI fails, the agent fixes it. When a reviewer requests changes, the agent addresses them.

The upside is low adoption friction. Claude Squad is a single Go binary. Superset is a single install with a daemon. No Electron, no server, no extension to install.

The downside is that these tools live entirely inside the terminal. If you are working in VS Code or Cursor, you context-switch between your IDE and the TUI. There is no awareness of which terminal belongs to which IDE window. The orchestrator cannot see your IDE terminals — it only knows about the tmux sessions it created.

Philosophy 3: Work With Everything

This is where Callipso sits. Instead of replacing your tools or living inside a terminal multiplexer, it floats as a transparent overlay on top of whatever you are already using.

The core idea: discover terminals across every IDE and terminal app on your machine and provide a unified control surface. Say "fix the auth bug, please one" and the transcribed text routes to terminal 1, regardless of whether it lives in Cursor, VS Code, Ghostty, or iTerm2.

No new IDE to learn. No tmux dependency. Your existing workflow stays intact. You get a command layer on top.

The Landscape in One Table

Tool	Type	Agent Support	IDE Awareness	Voice	Worktrees	Cloud/Headless
Callipso	Desktop overlay	Claude Code, Gemini CLI	10 apps (5 IDEs + 5 terminals)	Native STT + voice commands	Yes	No
Intent	Desktop app	Claude Code, Codex, OpenCode (BYOA)	Own environment	No	Yes	No
Codex App	Desktop app	Codex only	Own environment	No	Yes	No
Conductor	Desktop app	Claude Code + multi-model	Own UI	No	Yes	No
Warp Oz	Cloud platform	Claude, Codex, Gemini	Warp only	No	Cloud-based	Yes
Claude Squad	TUI (Go)	5 CLIs	tmux only	No	Yes	No
Superset	Terminal + daemon	8 CLI agents	Own terminal	No	Yes	No
Paperclip	Headless orchestrator	Claude, Codex, Gemini, any CLI	None	No	Yes	Local headless
Agent Deck	TUI (Go)	Claude Code + more	tmux only	No	Yes	No
Copilot /fleet	CLI command	Copilot	GitHub ecosystem	No	Yes	No
ittybitty	Shell script	Claude Code	tmux only	No	No	No
myclaude	Framework	Multi-backend	Claude Code CLI	No	No	No
Plural	TUI	Claude Code	tmux only	No	Yes	No
Composio	Framework	Any MCP client	Via MCP	No	Yes	Yes
VoxHerd	iOS app	Claude Code, Codex, Gemini CLI	By project	Native (on-device)	No	No

Where Callipso Is Ahead

Cross-Platform Terminal Discovery

This is our strongest differentiator. Callipso discovers terminals across nine platforms simultaneously:

IDE Adapters (via extensions):
  ├── VS Code
  ├── Cursor
  ├── Windsurf
  └── Antigravity

Terminal App Adapters (via native macOS APIs):
  ├── iTerm2      (event-driven Python API + AppleScript)
  ├── Terminal.app (AppleScript automation)
  ├── Ghostty     (AX watcher + kernel TTY injection)
  └── Kitty       (env-safe session registration)

Terminal Multiplexer Adapters:
  ├── tmux        (detached session detection)
  └── cmux

Built-in:
  └── Embedded Terminal (xterm.js + node-pty)

Every competitor assumes a homogeneous environment. Claude Squad only sees tmux sessions. Intent only sees its own workspace — even with BYOA, it cannot discover your existing terminals. Codex App only runs its own agents. Warp Oz only runs in Warp. Superset runs agents in its own terminal. Agent Deck manages its own tmux panes. Paperclip orchestrates autonomous agents with no IDE awareness at all.

Real developer setups are not homogeneous. You might use Cursor for the frontend, Ghostty for infrastructure scripts, and iTerm2 for quick debugging. Callipso sees all three, assigns a global terminal index, and routes to any of them through one interface. Nobody else does this.

The Adapter Problem: Ghostty vs iTerm2

The reason nobody else attempts cross-app terminal discovery is that every terminal application exposes a different level of programmatic control — or none at all. There is no standard API for "list your terminals." Each application requires its own adapter, and the engineering ranges from trivial to absurd depending on what the application gives you to work with.

The two extremes in our adapter set are iTerm2 (full API) and Ghostty (no API). The contrast illustrates why this problem is hard and why a generic approach does not exist.

iTerm2: The Gold Standard

iTerm2 ships a Python API with async event monitors. Callipso runs a long-lived Python daemon that maintains a persistent connection to iTerm2 via ~/.iterm2-socket:

Electron App (Node.js)
  → iTerm2Adapter (TypeScript)
    → iTerm2DaemonClient (process manager)
      → Python Daemon (iterm2_daemon.py)
        → iTerm2 Python API (iterm2 module)
          → iTerm2 Application via Unix socket

The daemon subscribes to native event monitors — FocusMonitor, SessionTerminationMonitor, EachSessionOnceMonitor, LayoutChangeMonitor. Events arrive immediately when sessions are created, destroyed, or focused. No polling lag. The daemon queries session variables (CWD, job name, PID, TTY) directly. Terminal IDs are native UUIDs in the format iterm2:{window}:{tab}:{session}, stable across focus changes and reorders.

Sending text is a single API call: session.async_send_text(text). Focusing a session: session.async_activate(). Creating a tab with a specific working directory: window.async_create_tab(profile_customizations). No focus stealing, no clipboard manipulation, no keystroke simulation.

If the Python daemon fails to spawn (wrong Python version, missing module), the adapter gracefully degrades to it2api, iTerm2's built-in JSON-RPC CLI tool. Same operations, polling-based instead of event-driven. The fallback chain ensures iTerm2 terminals are always discoverable.

Ghostty: Engineering Without an API

Ghostty, created by HashiCorp co-founder Mitchell Hashimoto, has no scripting API, no extension system, no socket, no RPC interface. Yet it is one of the fastest-growing terminal emulators — 2ms key-to-screen latency — and developers are adopting it rapidly. We had to support it.

The solution stitches together four macOS primitives:

GhosttyAdapter (AppAdapter interface)
  → GhosttyManager
    ├─ GhosttyPoller
    │   ├─ Process tree scanning (ps + pgrep)
    │   ├─ Accessibility window enumeration (System Events)
    │   ├─ AX event watcher (compiled helper binary)
    │   └─ Shell-to-tab matching (CWD heuristics)
    │
    └─ GhosttyClient
        ├─ ghostty-bridge (kernel TTY injection)
        ├─ Clipboard paste fallback (Cmd+V)
        └─ AXPress tab switching (no focus steal)

Discovery: pgrep -x ghostty finds the main process. ps -e -o pid=,ppid=,tty=,command= builds the process tree. We walk descendants to find shell processes with real TTYs — each TTY is one terminal tab. Then the Accessibility API queries Ghostty's window hierarchy for tab counts, selected tab indices, and radio button labels (tab titles). The hard part is matching shells to visual tab positions: a three-pass heuristic compares CWD basenames against AX tab titles, then foreground process names, then falls back to PID order.

Control: A compiled ghostty-bridge binary writes directly to TTY devices using kernel injection — text appears in the terminal without Ghostty needing focus. For tab switching, the bridge sends AXPress on specific radio button elements identified by window position and tab index. Without the bridge, everything falls back to clipboard paste (Cmd+V) and keystroke simulation (Cmd+N for tab switching), which steals focus.

Event detection: A compiled ax-watcher helper binary observes Accessibility notifications (AXCreated, AXUIElementDestroyed, AXFocusedWindowChanged, AXSelectedChildrenChanged) and emits JSON events to stdout. The poller debounces these into immediate re-polls — sub-50ms detection latency, comparable to iTerm2's event-driven model.

The contrast:

Capability	iTerm2	Ghostty
Discovery	One API call: `app.terminal_windows`	Process tree + AX enumeration
Identity	Native UUID per session	TTY device name (`ttys057`)
Send text	`session.async_send_text()`	Kernel TTY injection or clipboard paste
Focus tab	`session.async_activate()`	AXPress or `Cmd+N` keystroke
Create tab	`window.async_create_tab(cwd)`	`Cmd+T` keystroke, then `cd`
Event detection	Native async monitors	Compiled AX watcher binary
Focus stealing	Never	Only in fallback mode
Split pane awareness	Full (sessions within tabs)	None
Latency (discovery)	under 10ms (event-driven)	~100-300ms (AX query)

Both adapters implement the same AppAdapter interface — poll(), sendText(), focusTerminal(), createTerminal(). The TerminalStoreManager does not know or care which adapter produced a terminal. A Ghostty terminal and an iTerm2 terminal are both TerminalEntity objects in the normalized store, routable by UUID.

This is why the adapter pattern matters. The complexity of controlling ten different platforms is contained within nine adapter implementations. The rest of the system — the store, the session manager, the voice router, the overlay UI — operates on a clean, uniform data model. Adding support for a new terminal app means writing one adapter, not modifying the core.

Voice Routing by Terminal Index

Callipso has two voice routing modes — magic word commands and hotkey routing. The magic word is the primary mode for most users.

Magic word "please": Say "fix the auth bug, please one" and the transcribed text routes to terminal 1. Say "add dark mode, please three" and terminal 3 gets it. The keyword parser extracts the destination from natural speech — no hotkeys, no switching windows, no clicking. The word "please" followed by a terminal number (or name) is all it takes.

The magic word also supports special commands:

"please new" — creates a new terminal
"please commit" — triggers a scoped commit
"please resume auth bug" — resumes an archived session by name

Hotkey routing (alternative mode): A clipboard-based state machine for users who prefer keyboard shortcuts.

Stage 1: Clipboard Detection
  STT engine transcribes speech → text lands in clipboard
  → Callipso detects the change, stores text with 30-second expiry

Stage 2: Destination Selection
  User presses Cmd+Opt+F1 (or F2, F3, F4)
  → Callipso routes stored text to that terminal index

State Machine:
  IDLE → TEXT_WAITING (clipboard stored)
       → DESTINATION_WAITING (hotkey pressed)
       → ROUTE_READY (both exist) → IDLE (text sent)

The state machine handles timing edge cases: speaking before pressing the hotkey, pressing the hotkey before speaking, or pressing a new hotkey before the previous route completes. A sticky destination mode keeps the target terminal active for 20 seconds, so consecutive voice commands go to the same place without re-pressing the hotkey.

VoxHerd is the only other tool doing voice seriously — an iOS app that routes spoken commands to agents via WebSocket with on-device recognition. But it routes by project, not by terminal index. You cannot say "please three" to target a specific terminal in a specific IDE window.

Non-Invasive Overlay Model

This is a design philosophy, not a feature. Callipso does not replace your IDE. It does not ask you to learn a new environment. It does not require abandoning tool configurations you have built over years.

The overlay floats on top of your desktop with click-through transparency. Terminal list, task status, recording controls — all visible at a glance without switching windows. You interact with your existing tools exactly as before. Callipso adds a control layer; it does not replace anything.

Intent explicitly says "the IDE is dead." We disagree. The IDE is not dead — it is just missing a conductor.

Session-Isolated Routing

When multiple Claude Code sessions run in the same Cursor window, they share the same CLAUDE_CODE_SSE_PORT. Naive routing based on the port sends your prompt to the wrong session.

Callipso solves this with session_id — a UUID generated per Claude Code invocation via the hooks system. The SessionStore maintains bidirectional maps between SessionId and TerminalUUID. Routing uses session_id for disambiguation, eliminating cross-pollination entirely.

This is subtle infrastructure, but it is the difference between "parallel agents work sometimes" and "parallel agents work reliably." Tools that rely on tmux or their own terminal implementation sidestep this problem by design. Tools that need to work across existing IDE terminals — which only Callipso does — must solve it explicitly.

Real-Time 3D Codebase Visualization

The Space tab renders your codebase as a 3D star field using Rust, WebAssembly, and WebGPU. AI agents appear as ships navigating between files. When an agent reads a file, the ship moves toward it. When it writes, the node pulses.

No other orchestrator has visual debugging at this level. It is experimental, it is optional, and it is the feature that makes people stop and watch. The Space Tab: Behind the Scenes post covers the Rust/WASM architecture in detail.

Where Callipso Is Behind

We would rather be honest about our gaps than have users discover them.

CLI Support Is Expanding but Not Equal

Callipso's deepest integration is with Claude Code — hooks, session IDs, task state, timer tracking, token usage, git diffs. Gemini CLI hook integration is now supported as well, but not at the same depth.

Claude Squad supports five CLIs. Superset now supports eight. Composio works with any MCP-compatible client. VoxHerd supports Claude Code, Codex, and Gemini CLI.

Our adapter architecture is extensible — adding deeper Gemini CLI or OpenCode support is engineering work, not a design problem. The gap has narrowed since our first version of this post, but if you run a mixed fleet across many different CLIs, some will have richer integration than others.

No Autonomous Planning Layer

Intent's coordinator-implementor-verifier pipeline is genuinely powerful for large tasks. You write a spec, the coordinator decomposes it, specialist agents execute in parallel, a verifier checks results. The spec updates as agents learn. It is a self-correcting system.

Composio's Planner-Executor architecture does something similar: high-level decomposition with correction loops when tools fail.

Callipso is a routing orchestrator. You decide what goes where. The system routes your voice, manages sessions, tracks state — but it does not decide how to decompose a large task into parallel subtasks. That planning remains with the human (or with the AI agent's own sub-agent spawning).

Both models have merit. Autonomous planning is more powerful for large, well-specified tasks. Human-in-the-loop routing is more flexible for exploratory work where the developer is actively steering. We chose the latter, and we think it is the right default for how most developers work today. But the planning layer is where Intent and Composio are ahead.

macOS Only

Callipso uses macOS-specific APIs extensively:

AppleScript for iTerm2 and Terminal.app control
NSWorkspace for application monitoring
Accessibility APIs for terminal discovery

The Codex App already runs on macOS and Windows. Claude Squad is cross-platform (Go + tmux). Warp Oz is cloud-based. Our addressable market is limited to macOS developers — which is a large audience (most AI-first developers are on Macs) but not the full market.

No Cloud or Headless Mode

Warp's Oz runs agents in persistent cloud environments, triggered by cron jobs, webhooks, GitHub Actions, or Slack messages. Composio can manage 30+ agents headlessly. These tools enable team-scale automation where agents run overnight, process issue backlogs, and create PRs without a human at the keyboard.

Callipso requires an active desktop session. It is a developer productivity tool, not a CI/CD agent farm. For solo developers and small teams working interactively, this is not a limitation. For organizations wanting to scale agent execution across a team, it is.

No Automated CI Feedback Loop

Conductor and Composio close the loop: agent pushes code, CI fails, agent reads the failure log, agent fixes and re-pushes. This cycle can repeat autonomously until tests pass.

Callipso has git worktree support, commit buttons, and merge-to-main UI. But the CI-failure-to-fix cycle is still manual. You see the failure, you tell the agent to fix it. The routing is fast, but the human is still in the loop for error recovery.

The Autonomous Orchestrators and Platform Plays

Several tools deserve separate mention because they operate at a fundamentally different level — headless, autonomous, dashboard-driven. You set a goal and walk away. No terminal windows, no real-time interaction, no voice.

Warp Oz is a cloud platform for running coding agents at enterprise scale. Agents run in persistent cloud environments with cron, webhook, and API triggers. Multi-model support (Claude, Codex, Gemini). Every agent run generates a shareable audit trail. Multi-repository support enables cross-repo changes. Self-hosted option for enterprises. Oz now writes 60% of Warp's own PRs — they are eating their own cooking.

Paperclip exploded onto the scene in April 2026 with 42,000+ GitHub stars and a provocative tagline: "open-source orchestration for zero-human companies." You build an org chart of AI agents — CEO, managers, workers — assign goals, and agents execute via headless CLI processes (claude --print with stdin piped, stdout captured). No terminal windows. No IDE awareness. No visibility into running processes. A heartbeat scheduler wakes agents at intervals, they check for assigned work, execute, and report results to a React dashboard. Agents communicate through an issue tracker database, not direct conversation. Paperclip is a back-office orchestrator: you define work, check the dashboard later. It is closer to Warp Oz than to Claude Squad — the agents run headlessly in isolated git worktrees under .paperclip/worktrees/, not in terminals you can see or interact with.

GitHub Copilot Agent can be assigned GitHub issues directly. It spins up an ephemeral dev environment via GitHub Actions, implements the fix, and creates a PR. No local environment needed. This is the zero-setup end of the spectrum.

New in April 2026: GitHub Copilot /fleet shipped a /fleet command that decomposes tasks into parallelizable sub-tasks and dispatches subagents. Each subagent gets its own context window with a shared filesystem. Available to all paid Copilot tiers. This is GitHub bringing multi-agent orchestration directly into its ecosystem — significant because of the distribution advantage. Every Copilot subscriber gets it for free.

These tools solve a different problem than Callipso. The metaphor: Paperclip and Warp Oz are like assigning tickets to a remote team and checking Jira in the morning. Callipso is like sitting in mission control watching every screen. They are for teams that want agents running autonomously at scale, or for developers who want orchestration without real-time interaction. Callipso is for developers who want to sit in front of their terminals, talk to their agents by name, and watch the work happen live.

VS Code's Built-In Multi-Agent Mode

Worth noting: VS Code shipped multi-agent orchestration in February 2026. You can run multiple Copilot agents in the same editor, each working on different files, with a coordination layer that prevents conflicts.

This is native, free, and deeply integrated. Combined with Copilot /fleet, VS Code developers who use only Copilot now have a fairly complete multi-agent story without installing anything. The limitation is scope: it only works within VS Code, only with Copilot, and only within a single editor window. Cross-IDE, cross-terminal, and cross-CLI orchestration are out of scope.

The Positioning Map

             Autonomous ──────────────────────────── Human-in-Loop
                  │                                       │
                  │  Intent (BYOA)                        │
        Replace   │  Codex App                             │
        IDE       │  Conductor                             │
                  │                                       │
   ───────────────┼───────────────────────────────────────┼──────
                  │                                       │
        Headless/ │  Warp Oz       Claude Squad            │  Callipso
        Single    │  Paperclip     Plural                  │
        Platform  │  GitHub Agent  Superset                │
                  │  Copilot /fleet Composio               │
                  │                                       │
              Multi-IDE ──────────────────────────── Multi-IDE

Callipso occupies a unique quadrant: multi-IDE support with human-in-the-loop voice routing. The space has gotten more crowded since March — Paperclip and Copilot /fleet are significant new entrants — but they sit firmly on the autonomous/headless side. Most competitors are either headless and autonomous (Warp Oz, Paperclip, Copilot /fleet), single-environment and human-controlled (Claude Squad, Conductor), or environment-replacing (Intent, even with BYOA). Nobody else sits in the multi-IDE, human-in-the-loop quadrant.

The question is whether that quadrant grows. We believe it does — because developers use multiple tools, and the tools that meet them where they are will win over the tools that ask them to move.

What We Are Watching

Three trends are shaping this space right now:

Agent-to-agent communication is shipping. Claude Code Agent Teams now has dedicated hooks — TeammateIdle, TaskCreated, TaskCompleted — with agent_id and agent_type fields for subagent tracking. Anthropic's Claude Code Review deploys agent teams on every PR, jumping their internal code review coverage from 16% to 54%. This is not theoretical anymore. When agents coordinate without human mediation, the orchestrator's role shifts from routing to monitoring. Callipso already tracks subagent file edits and shows them in the diff badge — but deeper integration with Agent Teams is on our roadmap.

MCP as a standard. Model Context Protocol is becoming the lingua franca for agent-tool communication. Composio already supports any MCP client. As more CLIs implement MCP, the adapter-per-platform model (which Callipso uses today) may give way to a single MCP integration that works everywhere. We are watching this closely — if MCP matures to the point where every terminal app exposes a standard interface, the hard adapter engineering we did for Ghostty and iTerm2 becomes unnecessary. That would be a good outcome for developers, even if it commoditizes our advantage.

The landscape is splitting into cockpits and back offices. Warp Oz, Paperclip, and Copilot /fleet are back offices — headless, autonomous, dashboard-driven. You define work, agents execute, you review results. Callipso, Claude Squad, and VoxHerd are cockpits — real-time, interactive, you are in the loop while agents work. These are fundamentally different interaction patterns. Paperclip's 42,000 stars prove there is enormous demand for the back-office model. But developers who want to watch their agents work, talk to them by voice, and steer in real time — that is a different market, and it is ours.

Our Bet

We believe the orchestration layer should be additive, not replacive. Developers should not have to abandon their IDE to gain multi-agent capabilities. The orchestrator should meet them where they are — in Cursor, in Ghostty, in VS Code, in iTerm2 — and provide a thin, powerful control surface on top.

The tools that ask you to replace everything are betting that agent-first development is so different it needs a new paradigm. Maybe they are right eventually. But in April 2026, developers are still working in IDEs they have customized for years, still using terminals with specific configurations, still switching between tools that each do one thing well. The landscape has gotten more crowded since we first wrote this post — but the new entrants reinforce our thesis. They are all either building new environments or living inside tmux. Nobody else is meeting developers where they already are.

An overlay that connects all of them — with voice routing, session isolation, and real-time visibility across ten platforms — that is the bet we are making. "Fix the login flow, please one." Terminal 1 gets it. Keep everything else exactly as it is.