Concepts

What it is / when to use it

This page defines the major moving parts of PenguiFlow and how they fit together.

Use it when you want to:

If you want runnable code first, start with Quickstart.

This page is not a full API reference. It focuses on the mental model and the contracts that matter.
It does not cover external tool configuration and auth (see Tooling and Tools & integrations).
It does not prescribe an application architecture; “planner-only”, “runtime-only”, and “mixed” are all valid.

A flow is a directed graph of async nodes. It provides:

Canonical: Flows & nodes.

A node is an async function wrapped with metadata and policy:

PenguiFlow supports two message styles:

Payload-only (fastest start): nodes receive and return plain Pydantic models.
Envelope-based (recommended for production): nodes pass a Message(payload=..., headers=..., trace_id=...), enabling:
per-trace correlation (trace_id),
per-trace cancellation,
deadlines,
streaming chunks that inherit routing metadata,
multi-tenant isolation via Headers.tenant.
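
To make the difference between the two styles concrete, here is a minimal sketch of what an envelope carries. It uses only the standard library. `SketchMessage`, `SketchHeaders`, and `reply` are hypothetical stand-ins written for this illustration, not PenguiFlow's own `Message` and `Headers` classes, and the UUID default for `trace_id` is an assumption.

```python
import uuid
from dataclasses import dataclass, field
from typing import Any


@dataclass(frozen=True)
class SketchHeaders:
    # Hypothetical stand-in for Headers; only `tenant` is named in the text above.
    tenant: str


@dataclass(frozen=True)
class SketchMessage:
    # Hypothetical stand-in for Message(payload=..., headers=..., trace_id=...).
    payload: Any
    headers: SketchHeaders
    # Assumption: a random, unique id per request.
    trace_id: str = field(default_factory=lambda: uuid.uuid4().hex)


def reply(parent: SketchMessage, payload: Any) -> SketchMessage:
    """Build a follow-up message that inherits routing metadata from its parent."""
    return SketchMessage(payload=payload, headers=parent.headers, trace_id=parent.trace_id)


request = SketchMessage(payload={"query": "status"}, headers=SketchHeaders(tenant="acme"))
answer = reply(request, {"answer": "ok"})
```

A payload-only node would see just `{"query": "status"}`. With the envelope, every downstream message keeps the same `trace_id` and tenant, and that shared metadata is what correlation, cancellation, and tenant isolation rely on.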

Every node receives a ctx that can:

emit follow-up work (await ctx.emit(...)),
emit streaming chunks (await ctx.emit_chunk(...)),
access trace-scoped metadata used for observability and control (cancellation / deadlines).

Canonical: Streaming and Cancellation.
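
As an illustration of deadline-style control, the sketch below enforces an absolute deadline around a unit of async work using plain `asyncio`. It does not show how PenguiFlow implements deadlines. The helper name `run_before_deadline` and the choice to represent the deadline as epoch seconds are assumptions made for this example.

```python
import asyncio
import time


async def run_before_deadline(work, deadline_s):
    """Await work() but give up once the absolute deadline (epoch seconds) passes."""
    remaining = deadline_s - time.time()
    if remaining <= 0:
        return None  # already expired: skip the work entirely
    try:
        return await asyncio.wait_for(work(), timeout=remaining)
    except asyncio.TimeoutError:
        return None  # deadline hit mid-flight; wait_for cancelled the work


async def quick():
    return "done"


async def slow():
    await asyncio.sleep(10)
    return "never"


async def demo_deadlines():
    now = time.time()
    return (
        await run_before_deadline(quick, now + 5.0),   # finishes in time
        await run_before_deadline(slow, now + 0.05),   # cut off by the deadline
        await run_before_deadline(quick, now - 1.0),   # deadline already in the past
    )


deadline_results = asyncio.run(demo_deadlines())
```

Only the first call returns a value. The other two return `None`, one because the work overran the deadline and one because the deadline had passed before the work started.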

The planner is an LLM-driven loop that selects tools and orchestrates their execution (including parallel calls) with:

Prefer bounded queues (queue_maxsize > 0) and treat queue depth as a first-class signal.
Prefer envelopes (Message) when you need streaming, cancellation, deadlines, multi-tenant boundaries, or deterministic correlation.
Keep trace_id unique per request/session; treat it as part of your authorization story (don’t let a user fetch/cancel another user’s trace).
In production, add:
middlewares for structured FlowEvent logging, and/or
a StateStore for durability and event persistence.
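
The bounded-queue recommendation can be illustrated with a plain `asyncio.Queue`. This is a sketch of the general backpressure idea, not of PenguiFlow's internals: the `maxsize=2` value and the `demo_backpressure` helper are chosen only for this example.

```python
import asyncio


async def demo_backpressure():
    queue = asyncio.Queue(maxsize=2)  # bounded: at most 2 pending items
    max_depth = 0
    consumed = []

    async def producer():
        nonlocal max_depth
        for item in range(5):
            await queue.put(item)  # suspends while the queue is full
            max_depth = max(max_depth, queue.qsize())  # depth as a signal

    async def consumer():
        for _ in range(5):
            item = await queue.get()
            await asyncio.sleep(0)  # stand-in for real work
            consumed.append(item)
            queue.task_done()

    await asyncio.gather(producer(), consumer())
    return {"max_depth": max_depth, "consumed": consumed}


backpressure = asyncio.run(demo_backpressure())
```

Because `put()` suspends when the queue is full, a slow consumer slows the producer down instead of letting memory grow without bound. The recorded depth never exceeds the bound, which is why it is a useful saturation signal to export.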

fetch() hangs: nothing reached the Rookery sink (no egress node, egress returns None, or you didn’t call run()).
Cross-trace mixups: you reused a trace_id across concurrent requests (use trace-scoped fetch or a unique trace_id per request).
Streaming “does nothing”: you’re using payload-only messages; switch to envelope style and call ctx.emit_chunk(parent=Message(...), ...).
Retries amplify side effects: the node is not idempotent (use idempotency keys, or emit side effects only once).
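
The idempotency-key remedy for retried nodes can be sketched as follows. `IdempotentSideEffect` is a hypothetical in-memory helper written for this illustration and is not part of PenguiFlow. The key format `"trace-1:charge"` is likewise an assumption, and a production version would need a durable store with an atomic check-and-set.

```python
import asyncio


class IdempotentSideEffect:
    """Run a side effect at most once per idempotency key (in-memory sketch)."""

    def __init__(self):
        self._results = {}
        self.calls = 0  # how many times an effect actually executed

    async def run(self, key, effect):
        if key in self._results:
            return self._results[key]  # retry path: reuse the recorded result
        result = await effect()
        self.calls += 1
        self._results[key] = result
        return result


async def demo_retry():
    guard = IdempotentSideEffect()
    attempts = 0

    async def charge_card():
        return "charged"

    async def node_body():
        nonlocal attempts
        attempts += 1
        receipt = await guard.run("trace-1:charge", charge_card)
        if attempts < 3:
            raise RuntimeError("transient failure after the side effect")
        return receipt

    result = None
    for _ in range(3):  # simple retry loop standing in for a retry policy
        try:
            result = await node_body()
            break
        except RuntimeError:
            continue
    return result, attempts, guard.calls


retry_outcome = asyncio.run(demo_retry())
```

The node body runs three times, but the side effect executes once, because the second and third attempts find the recorded result under the same key.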

Canonical runbooks live in the core pages:

The runtime emits structured FlowEvent for:

Operationally:

attach a middleware (e.g. penguiflow.middlewares.log_flow_events) early, and
decide where you persist events (often via StateStore) before production rollout.

Always set Headers.tenant when you use envelopes, and keep tenant boundaries consistent across a trace.
Don’t put secrets in payloads or message meta if you persist events/logs; prefer secret managers + redaction.
Treat trace_id + fetch(trace_id=...) + cancel(trace_id) as sensitive control surfaces in applications.
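
One way to treat `trace_id` as a control surface is to record who owns each trace and check ownership before forwarding a fetch or cancel request. The `TraceRegistry` class below is a hypothetical application-side helper, not a PenguiFlow API, and the `(tenant, user_id)` ownership model is an assumption.

```python
class TraceRegistry:
    """Map each trace_id to the (tenant, user) that created it (in-memory sketch)."""

    def __init__(self):
        self._owners = {}

    def register(self, trace_id, tenant, user_id):
        if trace_id in self._owners:
            # Reusing an id would let two requests share results and cancellation.
            raise ValueError("trace_id already in use")
        self._owners[trace_id] = (tenant, user_id)

    def authorize(self, trace_id, tenant, user_id):
        """Return True only if the caller owns the trace."""
        return self._owners.get(trace_id) == (tenant, user_id)


registry = TraceRegistry()
registry.register("trace-a", tenant="acme", user_id="alice")

owner_ok = registry.authorize("trace-a", tenant="acme", user_id="alice")
other_user = registry.authorize("trace-a", tenant="acme", user_id="bob")
other_tenant = registry.authorize("trace-a", tenant="globex", user_id="alice")
unknown_trace = registry.authorize("trace-z", tenant="acme", user_id="alice")
```

An application would call `authorize(...)` in its request handler and only then pass the id on to `fetch(trace_id=...)` or `cancel(trace_id)`. Unknown ids are rejected the same way as ids owned by someone else.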

Run a minimal flow:

uv run python examples/quickstart/flow.py

Run a streaming example (chunks + final answer):

uv run python examples/roadmap_status_updates/flow.py

If you’re building an LLM agent, start with the planner template:

uv run penguiflow new my-agent --template react
uv run penguiflow dev --project-root my-agent

Need cancellation/deadlines/streaming: switch to envelopes (Message) and use trace_id per request.
Need parallel fan-out + join: use join_k and ensure you pass trace_id (see Concurrency).
Need pause/resume / HITL: use the planner (see Pause/resume).
Need tool integrations: use ToolNode and configure auth (see Tools configuration).