Skip to content

What is Agent Runtime?

Agent Runtime defines how agent work is accepted, executed, observed, controlled, resumed, and audited. It is the layer below Agent UI and above concrete model providers, tool systems, context stores, artifact services, and host application storage.

Use Agent Runtime when a product needs stable semantics for:

  • submitted user turns and agent tasks with attempts, progress, graph edges, and delivery state
  • model routing, fallback, limits, and token/cost accounting
  • streaming text, reasoning, and structured output
  • tool calls, tool results, large output refs, and tool errors
  • human approval, structured input, interruption, and resume
  • queues, steering, long-running turns, and subagents
  • context assembly, memory retrieval, compaction, and missing context
  • artifact refs, evidence refs, replay cases, and review exports

Do not use it to define the visual interface, model provider API, connector protocol, business database schema, artifact file format, or evidence review policy. Those systems remain adjacent owners.

Layer map

LayerMain questionRuntime facts
inputWhat work was submitted and by whom?session, thread, turn, draft, attachments, source channel, request ids.
executionWhat is running and why?turn status, task lifecycle, task attempts, model routing, tool calls, action requests, subagents.
stateWhat can be resumed or inspected later?snapshots, thread read model, queue, pending requests, incidents, checkpoints.
coordinationWhat external systems were used?tool inventory, context refs, artifact refs, evidence refs, policy decisions.
observabilityCan the work be traced, replayed, reviewed, or audited?trace ids, spans, timeline, evidence pack, replay case, verification summaries.

The runtime may be embedded in a desktop app, hosted behind an HTTP API, run in a worker, or coordinate local and remote agents. The standard constrains facts and control semantics, not deployment shape.

Draft standard for portable agent execution runtimes.