Observation-first architecture: eliminate agent lockout from blocking wait tools

## 1. Problems

Agents get trapped when a pane wait hides the terminal behind a boolean. In a live dev loop, the agent is often tailing a server, watching tests, checking several panes, or deciding whether to interrupt. A blocking `wait_for_text` call gives it only `found` or timeout, so the agent cannot see that the server picked a different port, printed an error, is waiting for input, already finished, or that another pane now needs attention. This is the lockout reported in #59.

The deeper problem is tool mismatch. `wait_for_text` is useful for known future output from a process the agent does not control. It is a poor default for command completion, current-screen inspection, fuzzy diagnosis, or multi-pane monitoring. Those cases need different primitives:

| Agent intent | Better primitive | Common trap |
|---|---|---|
| "Did the command I sent finish?" | `wait_for_channel` with `tmux wait-for -S` | Waiting for a prompt or regex in `wait_for_text` |
| "What is visible right now?" | `snapshot_pane` / `capture_pane` | Calling a future-only wait |
| "Which pane mentions this?" | `search_panes` | Listing pane metadata and missing terminal text |
| "What changed since I last looked?" | Proposed `capture_since` (#60) | Repeated full snapshots or a long blocking wait |
| "Did this exact third-party line appear?" | `wait_for_text` | Treating it as a generic completion primitive |
| "Did anything change?" | `wait_for_content_change`, then inspect | Treating `changed=True` as semantic success |

The filed wait-family issues are concrete instances of that mismatch:

- #45 fixed stale scrollback instant matches by redefining `wait_for_text` as "new text after the call begins." That makes `search_panes` / `snapshot_pane` the correct answer for "is this already on screen?"
- #59 covers the 90-second agent lockout when the expected pattern is wrong, misspelled, already scrolled away, or semantically too narrow.
- #51 covers same-row output: `printf`, carriage-return progress bars, spinners, and status redraws can stay on the baseline row that `wait_for_text` deliberately skips to avoid stale matches.
- #55 covers wrap-spanning search misses: tmux's [`window_pane_search()`](https://github.com/tmux/tmux/blob/134ba6c/window.c#L1320) searches visual rows, so long build/test messages can split across the wrap boundary and silently miss on the fast path.
- #50 covers the two-call state/capture race: `wait_for_text` samples pane state, then calls `capture-pane`; tmux capture math is evaluated against live history in [`cmd-capture-pane`](https://github.com/tmux/tmux/blob/134ba6c/cmd-capture-pane.c#L107), while history can move through [`grid_collect_history()`](https://github.com/tmux/tmux/blob/134ba6c/grid.c#L386).
- #52 covers subprocess pressure under parallel waits: a 50 ms poll interval can become many tmux subprocesses per second across panes and agents.
- #53 covers lifecycle blindness in `wait_for_content_change`: pane death, respawn, and `pane_pid` changes are observation facts, not just content-diff edge cases.
- #54 covers notification-only risk signals: `ctx.warning()` may not reach the model or programmatic caller, so best-effort correctness must be exposed in structured tool results.
- #18 fixed the server-side version of this class for `wait_for_channel`: blocking `subprocess.run()` on the FastMCP event loop could stall the server. That is separate from agent lockout, where the server may stay healthy but the agent is still trapped inside an uninformative call.

The upstream substrates matter:

- tmux `wait-for` is the right zero-poll primitive for authored command synchronization; see [`cmd-wait-for.c`](https://github.com/tmux/tmux/blob/134ba6c/cmd-wait-for.c#L121).
- tmux grid/capture APIs are snapshots, not a durable log stream. History rollover, resize, alternate screen, and wrap behavior all affect what can be reconstructed from [`capture-pane`](https://github.com/tmux/tmux/blob/134ba6c/cmd-capture-pane.c#L107), [`grid_collect_history()`](https://github.com/tmux/tmux/blob/134ba6c/grid.c#L386), and alternate-screen handling in [`screen.c`](https://github.com/tmux/tmux/blob/134ba6c/screen.c#L676).
- FastMCP can report progress through [`Context.report_progress()`](https://github.com/PrefectHQ/fastmcp/blob/2bff372/fastmcp_slim/fastmcp/server/context.py#L389), and background tasks can send task status notifications, but UI/client notifications are not a substitute for agent-visible tool results.
- Python cancellation and timeouts need explicit async subprocess care. CPython documents `asyncio.create_subprocess_exec()` and `asyncio.wait_for()` as the async path, while `subprocess.run(timeout=...)` kills only the child it is managing after timeout; see [`asyncio-subprocess.rst`](https://github.com/python/cpython/blob/2b6a137/Doc/library/asyncio-subprocess.rst#L165) and [`subprocess.py`](https://github.com/python/cpython/blob/2b6a137/Lib/subprocess.py#L642).
- Shell composition is portable only at the simple `cmd; tmux wait-for -S channel` level. zsh documents list/separator and status behavior in [`grammar.yo`](https://github.com/zsh-users/zsh/blob/3e72a52/Doc/Zsh/grammar.yo#L66) and [`params.yo`](https://github.com/zsh-users/zsh/blob/3e72a52/Doc/Zsh/params.yo#L719). fish has different status variables, events, and prompt markers; see [`fish_for_bash_users.rst`](https://github.com/fish-shell/fish-shell/blob/fb29c85/doc_src/fish_for_bash_users.rst#L176), [`language.rst` events](https://github.com/fish-shell/fish-shell/blob/fb29c85/doc_src/language.rst#L2108), and [`mark-prompt`](https://github.com/fish-shell/fish-shell/blob/fb29c85/doc_src/language.rst#L2058). Status-preserving recipes should be shell-specific, not implied as universal.

## 2. Avenues of investigation

Primary investment: build `capture_since` (#60) as the observation-first primitive. It should be non-blocking, cursor-based, and bounded by `max_lines` / `max_bytes`, returning actual pane output plus cursor and correctness metadata. The first call should be immediately useful by returning visible screen content and establishing a cursor; later calls should return only deltas. This lets the agent tail dev servers, watch tests, inspect multiple panes, and self-correct without betting on one regex.

Keep `wait_for_channel` as the deterministic authored-command primitive. When the agent sends the command, compose a signal into the command text and wait on tmux's channel:

```console
$ command; tmux wait-for -S libtmux_mcp_done
```

That path is zero-poll inside tmux and avoids the prompt/regex/stale-output trap. The docs should keep the simple recipe portable, then add shell-specific status-preserving variants only after they are verified for zsh, fish, and POSIX-like shells.

Investigate FastMCP `task=True` as an optional enhancement, not the foundation. FastMCP supports task-enabled async tools through [`TaskConfig`](https://github.com/PrefectHQ/fastmcp/blob/2bff372/fastmcp_slim/fastmcp/utilities/tasks.py#L35), server-side task docs ([`docs/servers/tasks.mdx`](https://github.com/PrefectHQ/fastmcp/blob/2bff372/docs/servers/tasks.mdx#L49)), and client-side `task=True` calls ([`docs/clients/tasks.mdx`](https://github.com/PrefectHQ/fastmcp/blob/2bff372/docs/clients/tasks.mdx#L19)). This may reduce client/server lockout for long operations, but it does not by itself solve the core problem: the agent still needs model-visible pane output while deciding what to do.

Investigate event-driven pane streams separately. tmux has real streaming surfaces: [`pipe-pane`](https://github.com/tmux/tmux/blob/134ba6c/cmd-pipe-pane.c#L43) and control-mode output via [`control_write_output()`](https://github.com/tmux/tmux/blob/134ba6c/control.c#L474). These may support a future `stream_pane`, but they introduce lifecycle management, cleanup, raw PTY bytes, ANSI/control-sequence handling, buffering, safety-tier concerns, and stateful subscriptions. They should not block the simpler `capture_since` path.

Tighten the existing wait/search tools while the observation primitive is explored. #50-#55 still reduce real traps: race accounting, same-row behavior, adaptive backoff, lifecycle parity, structured risk fields, and wrap-aware search fallback all improve the current surface even if the recommended workflow shifts away from blocking waits.

Rewrite docs and server instructions around scenarios rather than tool names:

| Scenario | Recommended path |
|---|---|
| I launched a command and need completion | `send_keys("cmd; tmux wait-for -S chan")` + `wait_for_channel("chan")` |
| I need to inspect an already-running pane | `snapshot_pane` / `capture_pane` first |
| I need to tail or monitor output | `capture_since` once available |
| I need to search visible/current pane text | `search_panes`; use slow path where wrap-spanning text matters |
| I need a known future line from third-party output | `wait_for_text` with bounded timeout |
| I only need to know something changed | `wait_for_content_change`, followed by inspection |

Recommended ordering:

1. Ship documentation improvements immediately so agents stop reaching for `wait_for_text` as a generic completion tool.
2. Implement `capture_since` as the main new primitive (#60).
3. Continue the targeted correctness tickets (#50-#55) where they remain valuable.
4. Add optional FastMCP task support only after client behavior and dependency cost are understood.
5. Treat `pipe-pane` / control-mode streaming as a separate future `stream_pane` design, not a retrofit of blocking wait tools.

This keeps the architecture simple: make observation cheap, explicit, and model-visible; reserve blocking waits for cases where blocking is actually the right abstraction.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Observation-first architecture: eliminate agent lockout from blocking wait tools #61

1. Problems

2. Avenues of investigation

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Agent intent	Better primitive	Common trap
"Did the command I sent finish?"	`wait_for_channel` with `tmux wait-for -S`	Waiting for a prompt or regex in `wait_for_text`
"What is visible right now?"	`snapshot_pane` / `capture_pane`	Calling a future-only wait
"Which pane mentions this?"	`search_panes`	Listing pane metadata and missing terminal text
"What changed since I last looked?"	Proposed `capture_since` (#60)	Repeated full snapshots or a long blocking wait
"Did this exact third-party line appear?"	`wait_for_text`	Treating it as a generic completion primitive
"Did anything change?"	`wait_for_content_change`, then inspect	Treating `changed=True` as semantic success

Scenario	Recommended path
I launched a command and need completion	`send_keys("cmd; tmux wait-for -S chan")` + `wait_for_channel("chan")`
I need to inspect an already-running pane	`snapshot_pane` / `capture_pane` first
I need to tail or monitor output	`capture_since` once available
I need to search visible/current pane text	`search_panes`; use slow path where wrap-spanning text matters
I need a known future line from third-party output	`wait_for_text` with bounded timeout
I only need to know something changed	`wait_for_content_change`, followed by inspection

Observation-first architecture: eliminate agent lockout from blocking wait tools #61

Description

1. Problems

2. Avenues of investigation

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions