Labs

See an agent run, live

Two working demos of the patterns behind the case studies. The playground runs a miniature agent loop — plan, execute, output. The terminal replays real build sessions. Neither is a mockup of intelligence; both are honest about what automation actually looks like: structured, observable, and verifiable.

// Agent playground○ idle

input: "Hi, my invoice for May shows $0 but my card was charged $89."

○Parse customer message

○Identify category: billing.discrepancy

○Check tier: priority high

○Draft response with refund-flow link

The same plan → execute → verify loop that runs the support triage system.

Replays of three real build patterns: an agent scaffold, a document pipeline deploy, and a RAG index.

Why demos

What these prove

Agents plan before acting — the step list exists before execution starts
Every run produces structured, verifiable output, not vibes
Evals gate deployment: 47/50 passing is a number a business can reason about
Confidence and status are always visible — no black boxes in production

$ erick --find-bottleneck

Request a similar build

If one of these patterns maps to your operations, a call is the fastest way to find out what a production version would take.

30 minutes · no pitch deck · reply within 24h if you write instead

Book a call →