without-dag¶

Concurrent execution of DAG-shaped async workflows, liftable into a without Processor. A Processor[In, Out] is otherwise an opaque stream-to-stream closure; this package lets the inside of a per-event step be a graph of async sub-steps that run with bounded concurrency and recombine into one output. It is the value-level fan-out/fan-in the substrate leaves room for: one input value drives many concurrent computations, distinct from the stream-level splitting a distribute/merge vocabulary would do. See the without_dag API reference for the full surface.

Like without-asgi, this package handles only the mechanism, execution, and leaves the graph-defining policy to a frontend. The two layers live here for now but meet at one narrow seam, so a different frontend (a YAML loader, a signature-inspecting decorator) could be layered on later without either side importing the other's opinions.

The execution core¶

A set of Node values is compiled once into a Plan, then run with at most limit nodes in flight. Plan.of does the compiling; drive and evaluate run it:

class Plan:
    @classmethod
    def of(cls, nodes: Iterable[Node]) -> Plan: ...

async def drive(plan: Plan, inputs: Mapping[NodeKey, object], limit: int | None
) -> AsyncGenerator[tuple[NodeKey, object]]: ...

async def evaluate(plan: Plan, target: NodeKey, inputs: Mapping[NodeKey, object], limit: int | None
) -> object: ...

A Node is the seam: a value carrying its key, its ordered dependencies, and an async run that takes its dependencies' results as a tuple and returns its own. Results cross the seam as object, the same honest move without-web makes when it collects a heterogeneous mix of Extractor[object]; the typed frontend restores precision above it. There is only the compiled form: a caller runs a graph by compiling a Plan and reusing it, the way an HTTP client owns one pooled session rather than a fresh connection per request. The input-independent work (nodes by key, dependency edges, consumer counts) is done once in Plan.of, so a graph driven per event never repeats it.

Two properties fall out of the model:

Each node runs once. Its result is memoized and fed to every dependent, so a diamond's shared ancestor executes a single time with no glitch.
Acyclicity is proven at the boundary. The scheduler sits on stdlib graphlib.TopologicalSorter, whose prepare raises graphlib.CycleError on a cycle. Scheduling replicates the asyncio.wait(..., return_when=FIRST_COMPLETED) shape of without.limit_concurrency rather than calling it, because the scheduler needs the completed task's key to unlock its successors, and that per-completion identity is what limit_concurrency's lazy source hides.

A node that raises fails the whole run: the exception surfaces and any in-flight siblings are cancelled via without.cancel_futures.

drive and evaluate are the events and behavior reads of one scheduler, mirroring the substrate's own split. drive yields each (key, result) the instant it completes (in whatever order nodes finish, a node always after the dependencies it consumed), useful to react as results land or to read several outputs. evaluate is a consumer of it that keeps the one value target produces and drops the rest. Both run the whole graph: there is no target-based pruning, since fixing the target up front is not a real early return and the graph is small (its size is bounded by the per-event work, not the stream). A node with no path to the output still runs.

drive is pull-driven: the DAG advances only as the consumer iterates. To drive it in the background instead (so the graph makes progress while a slower consumer catches up), wrap it with without.buffer, which pumps any stream into a bounded queue on a background task.

The typed frontend¶

Graph is a builder that threads value types through the wiring. Graph.of opens a graph over its entry types and hands back the graph plus a tuple of one Handle per type, node adds a step wired to the handles it depends on (an arity-overload ladder ties each Handle[X] to the matching parameter of the step's function), and build freezes the result into a CompiledGraph[*Ins, Out] so its call is checked for argument count and types:

from without_dag import Graph

async def fetch(request: Request) -> Fetched: ...
async def parse(fetched: Fetched) -> Parsed: ...
async def render(fetched: Fetched, parsed: Parsed) -> Report: ...

graph, (request,) = Graph.of(Request)
fetched = graph.node(fetch, request)
parsed = graph.node(parse, fetched)          # parse must take a Fetched
report = graph.node(render, fetched, parsed)  # render must take (Fetched, Parsed)
run = graph.build(output=report, limit=4)

result: Report = await run(some_request)

Passing parse a handle whose type does not line up with its parameter is a mypy error, not a runtime surprise. Because a step can only depend on handles that already exist, a cycle is unrepresentable through this API; graphlib's check is a backstop for the object seam.

The graph carries its entry types in its own type (Graph[*Ins]), so build takes only the output handle: it recovers the inputs the graph already knows, rather than making you list them a second time and keep the two in sync. A general DAG may take several: graph, (a, b) = Graph.of(A, B) opens two entries, and the compiled graph is called run(a_value, b_value) with the count and types checked. Wrong arity or a mismatched value type is a static error.

build compiles the scheduling structure once, into the same object-seam Plan: the nodes by key, the dependency edges, and the consumer counts are all input-independent, so a CompiledGraph driven per event runs the nodes without re-analyzing the graph each time. build is the typed graph's Plan.of.

The behavior/events duality is typed here too: run(*inputs) samples the single output, while run.stream(*inputs) drives the precompiled Plan to yield each node's (key, result) as it completes (match a yielded key against a Handle's key to pick one out). Both check the inputs against *Ins.

Lifting into a Processor¶

A Stream carries one value per event, so only a single-input graph lifts into a Processor. In that case a CompiledGraph is an async (In) -> Out callable, which is exactly what from_map wants: this package adds no wrapper of its own, because there is nothing to add.

from without import collect, from_map, stream_from_iterable

processor = from_map(run)                      # Processor[Request, Report]
reports = await collect(processor(stream_from_iterable([request_a, request_b])))

Each event drives one bounded-concurrency DAG execution and yields its output, so the graph composes with compose, stream_from_iterable, collect, and the rest of the substrate unchanged. When a step needs several values, group them into one input object (a dataclass or tuple) so the graph keeps a single entry and still lifts.