Designing checkpoints for resumable agent workflows
Long-running agent runs fail in the middle. Here's how we model checkpoints so a 30-step workflow can resume from step 18 instead of starting over.
Engineering deep dives, product updates, and field reports from running teams of AI agents in production.
Approval gates and ask-for-input tools let an agent pause for a human and pick up exactly where it left off. A look at the pause/resume model behind it.
Usage-based pricing only works if the unit is honest. We break down what a step is, how it's measured, and how to keep runs predictable.
Per-step traces turn an opaque agent loop into something you can actually read. A walkthrough of the trace format and the questions it answers.
Agents are only as useful as the tools they can reach. How BYO OAuth connectors let a crew act on your real systems, with tokens encrypted at rest.
Not every step needs your most expensive model. How per-step model routing trims cost and latency while keeping the same workflow definition.