feat: chat runtime - pause/resume, SSE transport, React bindings#5

Merged
yyyyaaa merged 26 commits into main from feat/chat-runtime
May 13, 2026
Conversation

@marslavish
Contributor

@marslavish marslavish commented Apr 27, 2026

Builds on feat/features-complete. Adds the chat-runtime layer on top of the redesigned core: pausable tool execution, an SSE-serializable run handle, a headless React hook, a Next.js reference demo, and shared test infrastructure.

Summary

  • @agentic-kit/agent — pausable tools, AgentRunHandle (events / ReadableStream / SSE Response), maxSteps, decision lookup by toolCallId.
  • @agentic-kit/react (new package) — useChat hook that POSTs to an SSE endpoint and folds events into messages, streaming snapshot, pending decisions, and executing tools.
  • apps/nextjs-chat-demo (new) — Next.js App Router demo wiring agent.prompt(...).toResponse() to useChat, with a tool-approval UI.
  • agentic-kit — injectDeferralResults helper for the "user types instead of approving" flow; cross-fetch dropped in the OpenAI adapter in favor of native fetch.
  • Test infra — shared helpers under tools/test/ (scripted provider, SSE stub, fixtures), SSE parser tests, run-handle tests (443 LOC), useChat tests (1011 LOC).

What's New

@agentic-kit/agent — pause/resume + SSE

  • Pausable tools. Tools declare an optional decision JSON Schema. When the agent reaches a call with no attached decision, it emits tool_decision_pending and stops. Attach the decision to the matching toolCall block and call continue() to resume.
  • AgentRunHandle returned by prompt() / continue(), consumable exactly once as:
    • await handle — run to completion
    • handle.events() — async iterator of AgentEvents
    • handle.toReadableStream() — ReadableStream<AgentEvent>
    • handle.toResponse() — SSE Response ready to return from a Next.js / Hono / Express handler
  • parseSSEStream() exported from the package for clients consuming toResponse().
  • maxSteps cap on model invocations per run (resets in prompt(), persists across continue()); stopReason: 'completed' | 'max_steps' on agent_end.
  • Decision lookup by id. continue() and the underlying loop walk the message log backwards to find the most recent un-decided toolCall matching a given toolCallId, so callers may append unrelated messages between the pause and the response.
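The decision lookup can be sketched as a backwards walk over the message log. The type shapes below are simplified stand-ins for illustration, not the package's real types:

```typescript
// Simplified stand-ins; the real agentic-kit message types are richer.
type Block =
  | { type: "text"; text: string }
  | { type: "toolCall"; toolCallId: string; decision?: unknown };
type Message = { role: "user" | "assistant" | "tool"; blocks: Block[] };

// Walk the log backwards and return the most recent toolCall with the
// given id that has no decision attached yet, so callers may append
// unrelated messages between the pause and the response.
function findPendingToolCall(log: Message[], toolCallId: string) {
  for (let i = log.length - 1; i >= 0; i--) {
    for (const block of log[i].blocks) {
      if (
        block.type === "toolCall" &&
        block.toolCallId === toolCallId &&
        block.decision === undefined
      ) {
        return block;
      }
    }
  }
  return undefined;
}
```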

@agentic-kit/react — new package

  • Single hook useChat({ api, body?, initialMessages?, fetch?, on* }).
  • State: messages, streamingMessage, isStreaming, pendingDecisions: ReadonlyMap<string, ToolDecisionPendingEvent>, executingToolCallIds: ReadonlySet<string>, error.
  • Actions: send, sendMessages, setMessages (array or updater), respondWithDecision(toolCallId, value), abort().
  • abort() finalizes any visible streamed text as an assistant message and drops orphan toolCall blocks so the next call doesn't re-pause.
  • Callbacks: onMessage, onFinish, onDecisionPending, onToolExecutionStart/End, onError.
  • Headless — no UI, no run store, no runId. State lives in the message log.
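A minimal sketch of the event-folding idea behind the hook. The event shapes here are hypothetical; the real AgentEvent union and message model are richer:

```typescript
// Hypothetical event shapes for illustration only.
type ChatEvent =
  | { type: "text_delta"; delta: string }
  | { type: "message_end" };

type ChatState = { messages: string[]; streaming: string };

// Fold one streamed event into the hook's state: deltas accumulate into
// the streaming snapshot; message_end finalizes it as a message.
function reduceChat(state: ChatState, ev: ChatEvent): ChatState {
  switch (ev.type) {
    case "text_delta":
      return { ...state, streaming: state.streaming + ev.delta };
    case "message_end":
      return { messages: [...state.messages, state.streaming], streaming: "" };
  }
}
```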

agentic-kit — injectDeferralResults

For the case where the user types a new message while a tool is paused: synthesizes a stand-in toolResult for every toolCall that lacks both a decision and a paired result, so the server picks up a well-formed transcript.

```ts
import { injectDeferralResults, createUserMessage } from 'agentic-kit';

await sendMessages([
  ...injectDeferralResults(messages),
  createUserMessage(text),
]);
```
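Under the hood, such a helper might work roughly like this. Everything below (type shapes, the stand-in wording) is an illustrative guess, not the actual agentic-kit implementation:

```typescript
// Hypothetical message shapes; the real agentic-kit types differ.
type Block =
  | { type: "toolCall"; toolCallId: string; decision?: unknown }
  | { type: "toolResult"; toolCallId: string; content: string };
type Msg = { role: string; blocks: Block[] };

// For every toolCall that has neither a decision nor a paired toolResult,
// append a stand-in result so the server picks up a well-formed transcript.
function injectDeferralResultsSketch(messages: Msg[]): Msg[] {
  const resolved = new Set<string>();
  for (const m of messages)
    for (const b of m.blocks) {
      if (b.type === "toolResult") resolved.add(b.toolCallId);
      if (b.type === "toolCall" && b.decision !== undefined) resolved.add(b.toolCallId);
    }
  const standIns: Block[] = [];
  for (const m of messages)
    for (const b of m.blocks)
      if (b.type === "toolCall" && !resolved.has(b.toolCallId))
        standIns.push({
          type: "toolResult",
          toolCallId: b.toolCallId,
          content: "Deferred: the user replied without deciding.",
        });
  return standIns.length
    ? [...messages, { role: "tool", blocks: standIns }]
    : messages;
}
```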

apps/nextjs-chat-demo

  • /api/chat/route.ts constructs an Agent, applies prior messages, and returns agent.prompt(...).toResponse().
  • Client uses useChat with chat-input, chat-messages, tool-call-card, tool-approval-card components.

Test infrastructure

  • tools/test/ — repo-internal helpers (no package.json, imported via tsconfig paths). Scripted provider, SSE stub, fixtures, shared index.
  • Provider unit suites refactored onto the shared helpers; default pnpm test stays deterministic and offline.
  • New suites: sse.test.ts (parser), run-handle.test.ts (443 LOC), use-chat.test.ts (1011 LOC under jsdom), inject-deferral-results.test.ts.
  • @agentic-kit/react is the only package on jsdom; everything else stays on node.

Cleanup

  • cross-fetch removed from the OpenAI adapter — runtimes are expected to provide fetch.
  • Packages expose a source export condition so workspace consumers can resolve TypeScript directly.

Test Plan

  • pnpm install && pnpm build && pnpm test is green across packages
  • apps/nextjs-chat-demo boots, streams a chat turn, and a paused tool can be approved/denied via respondWithDecision
  • Abort mid-stream preserves visible text and clears orphan toolCalls; next send() does not re-pause
  • injectDeferralResults flow: pause a tool, send a fresh user message instead of deciding, verify the next request carries synthesized stand-in results

@marslavish marslavish changed the base branch from main to feat/features-complete April 27, 2026 14:44
@marslavish marslavish changed the title from feat: chat runtime foundation — pausable tools and test infra to (WIP) feat: pause/resume runtime, run store, useChat Apr 27, 2026
@marslavish marslavish changed the title from (WIP) feat: pause/resume runtime, run store, useChat to feat: pause/resume runtime, run store, useChat May 12, 2026
@marslavish marslavish changed the title from feat: pause/resume runtime, run store, useChat to feat: chat runtime - pause/resume, SSE transport, React bindings May 12, 2026
@marslavish marslavish changed the base branch from feat/features-complete to main May 12, 2026 02:09
Comment thread packages/agent/src/run-handle.ts
@yyyyaaa
Contributor

yyyyaaa commented May 13, 2026

wow great work, this is pretty complicated. I just have some design questions:

  1. Decision-resume ordering. When a user message arrives during a pause, continue() appends the tool result at the tail of the log instead of
    adjacent to its assistant block — the transform layer then synthesizes a placeholder for OpenAI/Anthropic. Is the non-trailing case in scope, and
    how do you want to handle it (insert adjacent? reorder? reject continue() if a user message intervened? document as a constraint)?

  2. Concurrency contract for prompt() / continue(). isStreaming is set on consumption, not on call, so two synchronous prompt()s both build handles
    and race. What's the intended contract — reject-second, queue/steering, preempt, or doc-only?

  3. events() early-break does not cancel. readableStreamToAsyncIterable releases the lock but never cancels (run-handle.ts:187), so breaking out of
    for await parks the producer forever. toReadableStream/toResponse cancel correctly. Should events() match them, or is the asymmetry intentional?

  4. Semantics of abort() during tool execution. executeOneTool catches the abort, records it as an isError tool result, and the loop calls the model
    again. Abort during stream-generation does terminate. What is abort() supposed to mean during tool exec — stop immediately, drain remaining tools then stop, or per-tool only (current)?

@marslavish
Contributor Author

Thanks for the deep review — all four questions addressed in the latest push:

1. Decision-resume ordering. Went with reject-with-pointer. continue() now throws when non-toolResult messages have been appended after the pending assistant, with the error message pointing at injectDeferralResults() + prompt() for the user-typed-instead-of-approving flow. The transform-layer placeholder remains a fallback for legacy data but the typed path is now enforced. Added a test covering the throw.

2. Concurrency contract for prompt() / continue(). Reject-second. The agent now tracks an outstandingHandle and assertIdle() throws if prompt() or continue() is invoked while a prior handle hasn't been consumed (or before abort()). The handle clears itself as soon as its binder runs, so single-use is enforced at call time rather than at consumption time. Added a test.

3. events() early-break does not cancel. Was a bug, now matches toReadableStream / toResponse. readableStreamToAsyncIterable calls reader.cancel() in finally if the iteration didn't drain, so breaking out of for await propagates cancellation upstream and aborts the producer. Added a test.
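The cancel-on-early-break pattern described here can be sketched as follows. Structural stand-ins mirror the ReadableStream reader interface so the snippet is self-contained; the real run-handle code operates on actual web streams:

```typescript
// Structural stand-ins mirroring the ReadableStream reader interface.
type ReadResult<T> = { done: true; value?: undefined } | { done: false; value: T };
interface ReaderLike<T> {
  read(): Promise<ReadResult<T>>;
  cancel(): Promise<void>;
  releaseLock(): void;
}
interface StreamLike<T> { getReader(): ReaderLike<T>; }

// Yield values until the stream ends; if the consumer breaks out early,
// the finally block cancels the reader so the producer is not parked.
async function* streamToAsyncIterable<T>(stream: StreamLike<T>): AsyncGenerator<T> {
  const reader = stream.getReader();
  let drained = false;
  try {
    for (;;) {
      const result = await reader.read();
      if (result.done) { drained = true; return; }
      yield result.value;
    }
  } finally {
    if (!drained) await reader.cancel(); // propagate cancellation upstream
    reader.releaseLock();
  }
}
```

Breaking out of a for await loop calls the generator's return(), which resumes execution in the finally block, so the cancel reaches the producer.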

4. abort() during tool execution. Went with "stop after current". The already-running tool receives the AbortSignal and can opt to abort itself; the loop no longer dispatches subsequent tools and won't re-invoke the model. agent_end.stopReason now carries 'completed' | 'max_steps' | 'aborted' so consumers can distinguish how a run ended. Added a test.
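A rough sketch of the "stop after current" semantics, simplified to the tool-dispatch loop only (the real loop also re-invokes the model and emits events):

```typescript
// A tool receives the run's AbortSignal and may opt to abort itself.
type Tool = (signal: AbortSignal) => Promise<string>;

// Let the in-flight tool settle, then short-circuit: no further tools
// are dispatched and the caller would not re-invoke the model.
async function runTools(
  tools: Tool[],
  signal: AbortSignal
): Promise<{ results: string[]; stopReason: "completed" | "aborted" }> {
  const results: string[] = [];
  for (const tool of tools) {
    if (signal.aborted) return { results, stopReason: "aborted" };
    results.push(await tool(signal)); // current tool runs to completion
    if (signal.aborted) return { results, stopReason: "aborted" }; // stop after current
  }
  return { results, stopReason: "completed" };
}
```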

@yyyyaaa
Contributor

yyyyaaa commented May 13, 2026

Looks good, thanks! I'll merge and publish now

@yyyyaaa yyyyaaa merged commit cd00eaf into main May 13, 2026
12 checks passed
yyyyaaa added a commit that referenced this pull request May 13, 2026
…gaps (#7)

* fix(react): close three useChat gaps surfaced by PR #5 review

- send(): sync messagesRef before runStream so rapid synchronous sends
  both reach the outgoing request body
- useState init: hydrate pendingDecisions from initialMessages so
  rehydrated paused tool calls render decision UI immediately
- unmount: abort the in-flight fetch on cleanup to prevent leaked
  streams when the consumer unmounts mid-request

* test: lock down regressions surfaced by PR #5 review

Adds four regression tests that act as acceptance criteria for the
fixes shipped in PR #5 and the companion useChat fixes in this branch.

agent.test.ts:
- injectDeferralResults() + prompt() places the synthetic toolResult
  adjacent to its assistant block (verifies the documented "user typed
  instead of deciding" recovery pattern produces provider-valid order).

use-chat.test.ts:
- initialMessages with a paused tool call hydrates pendingDecisions.
- Two rapid synchronous send() calls both reach the outgoing body.
- Unmount aborts the in-flight fetch.
