Skip to content

Silveroboros-dev/Decision-Kernel

Repository files navigation

Decision Kernel

Decision Kernel is the verification and spend-governance layer sponsors use before releasing milestone-based budgets to hybrid human-agent teams.

It is the pivot of Governance OS into a broader verification and cost-governance layer for hybrid human-agent systems.

Governance OS proved the kernel in high-stakes finance: signal extraction, deterministic policy evaluation, exception routing, replayability, and audit-grade evidence. Decision Kernel generalizes that kernel for a different target: hybrid teams where work is done by humans, agents, or both, and where capital or authority cannot move safely unless the work becomes legible enough to verify.

Decision Kernel extends the validator idea beyond the chain. It turns milestone, spend, and authority claims in hybrid human-agent teams into governed state transitions: evidence is compressed into signal, policy validates or escalates the claim, and realized outcomes rescore the policy over time.

This repository was created in response to the Outlier Ventures Conviction Markets Request for Builders open call.

Submission Snapshot

  • Current state: runnable TypeScript simulation kernel with Zod schemas, fixtures, focused tests, CLI demo, and a prototype UI backed by generated trace JSON
  • Best-fit problems: 04 Verification without managers, 07 Cost governance for agents, with 05 Coordination without companies as the extension
  • Biggest blocker: the main blocker is access to live sponsor milestone workflows and spend data to calibrate policy thresholds and retrospective scoring beyond synthetic traces
  • What we want from Outlier Ventures: sharp feedback on wedge and business model, plus access to builders, operators, or funders already hitting these coordination problems

Why this repo exists

The Outlier Ventures Request for Builders is asking a real question: how do hybrid teams fund, coordinate, and share upside when the company is no longer the natural container?

Our answer is narrow and operational. Before programmable ownership or capital release, there has to be a trusted verification layer:

  • messy work has to become structured signal
  • policy has to decide routine cases
  • true exceptions have to be routed cleanly
  • agent spend has to be governed like payroll
  • outcomes have to update future trust and authority

Decision Kernel is that layer.

Best-fit problem framing

Decision Kernel maps most directly to:

  • 04 Verification without managers
  • 07 Cost governance for agents

With a credible extension into:

  • 05 Coordination without companies

Why this is a credible pivot

This is not a rebrand. Governance OS already proved the kernel in a narrower domain:

  • deterministic kernel and replayability
  • signal extraction with provenance
  • explicit policy evaluation
  • exception routing
  • audit log and evidence packs
  • approval-gated agent tooling

The pivot changes the governed object and learning loop:

  • from treasury and wealth workflows to general hybrid human-agent work
  • from a finance-specific policy engine to a general verification substrate
  • from human-only final authority to bounded authority for humans or agents under policy
  • from decision support alone to retrospective scoring of policies and decision-makers
  • from compliance framing to verification plus cost governance

It is a generalization of the same kernel to the problem set Outlier is actually pointing at.

Concrete Walkthrough

The shortest way to understand the product is the first target case:

  1. A hybrid team claims a milestone is complete and requests release of the next budget tranche.
  2. Decision Kernel extracts structured signal from artifacts, spend logs, test outputs, and workflow state.
  3. Policy checks whether the milestone is sufficiently evidenced and whether agent or tool spend stayed inside limits.
  4. Routine cases auto-clear. Missing evidence, threshold breaches, or high-materiality cases escalate.
  5. The chosen human or agent decision-maker logs a bounded decision with rationale and provenance.
  6. Later outcomes rescore the policy and the decision-maker, which changes future routing, trust, and authority.
  7. The result is not just a yes or no. It is a governed release of budget, authority, or further work.

Read the full example in docs/demo-walkthrough.md.

Runnable MVP

There is now a minimal TypeScript kernel plus a prototype trace viewer.

The kernel demonstrates both halves of the loop:

  1. routine auto-clear when evidence is complete and spend is inside policy
  2. missing-evidence escalation to a human operator
  3. spend breach handling that separates work verification from cost governance
  4. repeated bad auto-clear creating a policy revision candidate and supervised routing
  5. scoped trust updates that do not automatically change authority

To run the kernel:

npm ci --ignore-scripts
npm run verify
npm run demo

To open the prototype trace viewer:

npm run generate:traces
npm run prototype

Then open http://localhost:4175. The UI reads prototype/traces/generated-demo-trace.json; it is no longer the source of the scenario logic.

Security note: this repo pins npm to ignore-scripts=true in .npmrc, and the prototype server binds to 127.0.0.1 instead of all network interfaces.

What Is Real Today

  • Real today: strict Zod schemas, a runnable kernel, four fixture scenarios, generated trace output, scoped trust updates, authority-change decisions, policy revision candidates, and tests for policy evaluation, exception routing, and retrospective scoring
  • Simulated today: milestone evidence, spend events, routing, and rescoring are driven by synthetic traces rather than live sponsor workflow data
  • Not yet done: public hosted demo link and production data integration

Proof Lineage From Governance OS

This repo is early, but the predecessor project already contains the kinds of artifacts that make the kernel credible:

Artifact Why it matters
Decision journey Shows the deterministic evaluation → exception → immutable decision → evidence pack flow
Treasury demo script Shows a runnable end-to-end demo path with seeded scenarios and exported evidence
Constraint registry Shows explicit completeness, severity, and dedupe rules in code-native form
Canonicalized eval output Shows raw candidate signals being downgraded, classified, and normalized deterministically
Evidence export test Shows the audit bundle is treated as a first-class output, not presentation fluff
Grounding test and hallucination test Shows the coprocessor layer is verified against evidence instead of trusted implicitly

There is a fuller guide in docs/proof-artifacts.md.

Core loop

  1. Ingest a messy input: request, milestone claim, workflow state, spend event, anomaly, or agent output.
  2. Extract decision-relevant signal into structured state with provenance.
  3. Evaluate explicit policy against the state, budget, and authority constraints.
  4. Auto-clear routine cases when confidence and rules permit.
  5. Route exceptions to the right human or agent decision-maker.
  6. Log action, rationale, and evidence.
  7. Observe realized outcomes later.
  8. Rescore policies and decision-makers.
  9. Adapt future routing, trust, and authority allocations.

Design principles

  • Policy first. Discretion is expensive and should be reserved for edge cases.
  • Evidence before opinion. Claims must carry provenance.
  • Humans and agents share the same coordination plane.
  • Agent spend is governed spend.
  • Ex post learning matters more than ex ante confidence.
  • Governance should scale without collapsing into bureaucracy.

First Payer

The first payer is a protocol foundation running milestone-based grant programs for hybrid human-agent contributor teams.

The first user is the program operator reviewing whether work is sufficiently verified to release the next tranche. The governed subjects are contributors and agents spending against that budget.

They adopt this because three pains are immediate:

  • milestone claims are noisy and hard to verify
  • agent and tool spend can run ahead of judgment
  • every exception currently turns into ad hoc chat, meetings, or manual review

Why This Can Become a Business

The near-term business case is milestone release control for sponsors that need explicit approval, budget governance, and auditability around hybrid execution.

The moat is not just the interface. It is the accumulated policy library, decision history, provenance graph, outcome data, and trust-routing feedback loop. If Decision Kernel becomes the place where teams verify work, release budget, and learn which human or agent decision-makers are reliable, it compounds into a coordination system rather than a thin approval tool.

If that loop works, high-performing policies become reusable coordination assets: the basis for future markets around verified execution, not just votes.

Reviewer Order

If you only read three things, read these in order:

  1. this README
  2. npm run demo
  3. npm test
  4. docs/system-model.md
  5. docs/demo-walkthrough.md

Repo shape

  • docs/project-brief.md: short internal brief on the pivot, positioning, and how to describe the project
  • docs/demo-walkthrough.md: one concrete end-to-end case around milestone verification and spend governance
  • docs/proof-artifacts.md: Governance OS artifacts that support the credibility of the pivot
  • docs/thesis.md: the conceptual argument
  • docs/system-model.md: entities, flows, authority, and scoring logic
  • docs/mvp-roadmap.md: what to build first
  • src/schemas/: Zod boundary schemas for cases, signal, policy, budget, routing, decisions, outcomes, and traces
  • src/kernel/: executable extraction, policy evaluation, budget checks, routing, decision logging, and retrospective scoring
  • src/demo/: CLI demo and prototype trace generator
  • fixtures/: four synthetic milestone verification scenarios
  • tests/: focused kernel tests for policy evaluation, exception routing, and retrospective scoring
  • prototype/: static trace viewer backed by generated kernel output

Working name

Decision Kernel is the preferred name because it points to a small, composable control layer rather than a generic governance platform or a top-heavy management product.

About

Verification and cost-governance layer for hybrid human-agent systems

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors