Published: October 30, 2025 · 8 min
Written by Nicolas Belissent, AI Engineer

When Your AI Agent Needs Contractors, Not Apprentices: Building Async Subagents

You’re a builder facing a job too big for your tools. You could hire an apprentice. They’d use your tools but need constant supervision. Or hire a contractor. They bring their own tools, have their own specialised skillset, work independently, and report back when done. For big jobs, the contractor is better. You keep working while they handle their part.

That’s exactly the problem we faced building our Integration Builder Agent.

The agent generates Integration configurations: transforming provider/SaaS endpoints into Unified APIs and AI Tools, researching documentation, writing descriptions, and discovering comprehensive API actions. But time-consuming subtasks kept blocking the main agent, either by requiring human interaction or by awaiting long-running MCP tool calls.

We needed contractors, not apprentices. We needed specialist subagents that work autonomously in the background with their own tools and context, only reporting back when finished. This post walks through how we implemented async subagents using Cloudflare’s Durable Objects and native RPC, transforming our Integration Builder agent from a single-threaded worker into a parallel task orchestrator.

The Challenge

Our Integration Builder Agent generates Integration Configurations (in YAML) that map provider endpoints (like Hibob, Zendesk, HubSpot, Workday, and hundreds more) so they can be easily consumed by B2B SaaS & AI Agents. Each configuration includes between 20 and 100+ actions with detailed endpoint descriptions, comprehensive action coverage discovered through documentation, and proper field mappings and authentication configs.

We hit three fundamental constraints:

Context Window Limits
Generating configs, testing them, and researching API actions consumes massive context. Testing alone requires passing entire configs (1K to 10K tokens each) back and forth repeatedly. Add documentation research, OpenAPI spec analysis, and competitor analysis, and the Integration Builder would hit token limits while managing 50+ operations, losing context about earlier work.

Manual Subagent Orchestration in Claude Code
Claude Code supports subagents natively, but they’re “apprentices” from our analogy. They use the parent agent’s tools and context, but require manual user interaction to trigger and retrieve results, all within the same conversation thread. The Integration Builder couldn’t fire-and-forget; it had to stop and wait for human intervention to orchestrate each sub-task.

Long-Running Jobs Make Manual Orchestration Impractical
Even if manual orchestration were acceptable, research tasks take too long. Improving descriptions means reading docs and analyzing examples. Longer jobs could require exhaustive research across documentation, SDKs, and competitor implementations. The main agent can focus on aggregating information from other tools or subagents, rather than waiting.

Approaches We Evaluated

Claude Code Native Subagents

Claude Code CLI subagents are specialized AI assistants that operate in separate context windows with specific purposes. The main agent can invoke these subagents either automatically based on task descriptions or explicitly by mentioning them.

Pros:

  • Built-in, zero setup
  • Separate context windows preserve the parent agent's state

Cons:

  • No programmatic orchestration: you can't fire-and-forget and poll for results. Even when running with --dangerously-skip-permissions, a Claude Code subagent must be manually triggered
  • Relies on conversational flow: subagents return results through the conversation thread
  • No persistent state between invocations
  • Limited to synchronous delegation patterns

This “apprentice” model works well for human-in-the-loop workflows where users manually trigger subagents and review results. But it doesn’t fit autonomous use cases where the Integration Builder needs to delegate long-running research tasks, continue building other config components, and automatically integrate results when ready, without human intervention.

MCP Tools (HTTP-Based)

Our first production attempt: expose the subagent as an MCP tool that the Integration Builder could call via HTTP.

Pros:

  1. Standard protocol: Follows established MCP patterns that work across different environments and clients
  2. Language-agnostic: Any client that speaks HTTP can connect, not tied to specific infrastructure
  3. Easy integration: Works with existing MCP tooling like Claude Desktop without custom setup

Cons:

  1. Not designed for long-running jobs: MCP’s request-response architecture expects immediate results, making it unsuitable for big jobs.
  2. Wrong abstraction for Worker-to-Worker calls: MCP was designed for external tool servers, not in-process communication within the same infrastructure
  3. Black box execution: Subagent’s autonomous iterations happen with no visibility into progress, logs, or intermediate state until completion

Our Solution: Async Subagents with Native RPC

We needed “contractor” agents that could work autonomously in the background without blocking the main agent, handle research-intensive tasks like improving 20+ endpoint descriptions or discovering comprehensive API actions, maintain their own context and tools, and report back only when finished. To achieve this, we built specialist subagents as separate Durable Objects that the Integration Builder calls via native RPC through an MCP tool. Each subagent runs independently with its own autonomous loop, making its own tool calls and research decisions while the Integration Builder continues other work.

How It Works: Native RPC on Cloudflare

What is RPC? Remote Procedure Call lets one program call a function in another program as if it were local code. Instead of constructing HTTP requests and parsing responses, you simply call a method: stub.improveDescriptions({ config, provider }).

Traditional RPC frameworks (like JSON-RPC or gRPC) use HTTP, but Cloudflare’s Durable Object RPC uses an internal communication protocol. When both Workers run in the same isolate, calls are essentially function calls across process boundaries: zero network latency and no HTTP overhead. This enables true fire-and-forget delegation and persistent state that continues execution after returning a response, which HTTP’s request-response cycle cannot provide.

Implementation:

  1. Configure a binding in wrangler.toml that gives the Integration Builder access to the subagent Durable Object.
  2. Get a stub (local proxy object) that represents the remote subagent: const id = env.SUBAGENT.idFromName(providerId); const stub = env.SUBAGENT.get(id)
  3. Call methods on the stub as if they’re local: stub.improveDescriptions({ config, provider })
  4. Receive a task ID immediately while the subagent queues work and executes in the background
  5. Poll for results when needed and integrate the improved descriptions
  6. Once the results are extracted, Cloudflare will auto-evict inactive DOs from memory.
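Step 1 above looks roughly like the following wrangler.toml fragment. This is a minimal sketch, not our actual config; the binding and class names (SUBAGENT, DescriptionSubagent) are illustrative:

```toml
# Expose the subagent Durable Object to the Integration Builder Worker.
[[durable_objects.bindings]]
name = "SUBAGENT"                  # becomes env.SUBAGENT in the Worker
class_name = "DescriptionSubagent"

# Durable Object classes must be declared in a migration on first deploy.
[[migrations]]
tag = "v1"
new_classes = ["DescriptionSubagent"]
```

With the binding in place, the Worker derives a deterministic ID from the provider name and gets back a stub whose method calls are routed to the Durable Object over native RPC.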

The Integration Builder delegates work via RPC, gets a task ID back immediately, and continues working. The subagent runs in its own autonomous loop—making its own tool calls, managing its own iterations, and making decisions about research depth. When complete, the Integration Builder retrieves and integrates the results.
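The delegation flow above can be modeled as a minimal, framework-free TypeScript sketch. All names here are illustrative; in production the subagent is a Durable Object whose public methods are invoked over native RPC via a stub, and the agentic loop makes real tool calls rather than returning a canned string:

```typescript
type TaskStatus = "running" | "done";

interface TaskRecord {
  status: TaskStatus;
  result?: string;
}

class DescriptionSubagent {
  private tasks = new Map<string, TaskRecord>();
  private pending = new Map<string, Promise<void>>();
  private nextId = 0;

  // Returns a task ID immediately; the work continues in the background.
  improveDescriptions(input: { config: string; provider: string }): { taskId: string } {
    const taskId = `task-${++this.nextId}`;
    this.tasks.set(taskId, { status: "running" });
    // Fire-and-forget: the promise is deliberately not awaited. A real
    // subagent would run its agentic loop here (doc research, tool calls
    // back to the parent's MCP server), then persist the result.
    this.pending.set(
      taskId,
      this.runAgenticLoop(input).then((result) => {
        this.tasks.set(taskId, { status: "done", result });
      }),
    );
    return { taskId };
  }

  // The polling side of the two-tool pattern.
  getTaskStatus(taskId: string): TaskRecord {
    return this.tasks.get(taskId) ?? { status: "running" };
  }

  // Test helper: await background completion (real callers just poll).
  async waitFor(taskId: string): Promise<void> {
    await this.pending.get(taskId);
  }

  private async runAgenticLoop(input: { config: string; provider: string }): Promise<string> {
    // Stand-in for the autonomous research loop.
    return `improved descriptions for ${input.provider}`;
  }
}
```

The key property is that improveDescriptions returns before the loop finishes, so the caller (the Integration Builder) keeps working and polls getTaskStatus at its own pace.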

Architecture

The following diagram shows the complete end-to-end flow when using the improve_descriptions tool, from the user’s initial request through Claude Code CLI to the final result retrieval. It illustrates how the Integration Builder delegates work to the subagent via native RPC, and how the subagent autonomously executes its agentic loop while calling back to the Integration Builder’s tools when needed.

Example scenario: The Integration Builder generates a Zendesk config with 25 operations but generic descriptions. It delegates to the Improve Descriptions subagent, which autonomously researches each endpoint in the background while the Integration Builder continues building auth configs and field mappings.

Claude Code (or any other MCP client) calls the improve_descriptions tool to start the background description-improvement task, then calls the get_improve_descriptions_task_status tool whenever it wants to check whether the job is done. Once it is, the agent retrieves the result from the subagent's state.

Key implementation details:

  • Deterministic IDs: Each subagent run gets its own Durable Object instance, enabling per-subagent-run state, caching, and rate limiting.
  • Queue system: Cloudflare’s Agent SDK provides this.queue() which writes tasks to an internal queue and executes them in the background
  • Bidirectional tool access: RPC flows one way (Integration Builder → subagent), but tool calls flow back (subagent → Integration Builder Agent’s MCP server for vector search, web search, etc.)
  • Persistent state: Task results survive server restarts and hibernation, making the system production-ready
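The "persistent state" point can be illustrated with a small sketch: task records are written to durable storage rather than held only in instance memory, so a fresh instance of the subagent (e.g. after eviction or a restart) can still answer a poll for an earlier task. Here a Map stands in for Durable Object storage; the class and method names are hypothetical:

```typescript
interface TaskRecord {
  status: "running" | "done";
  result?: string;
}

class SubagentState {
  // The storage handle is injected; in a real Durable Object this would be
  // this.ctx.storage, which outlives the in-memory instance.
  constructor(private storage: Map<string, TaskRecord>) {}

  complete(taskId: string, result: string): void {
    this.storage.set(taskId, { status: "done", result });
  }

  poll(taskId: string): TaskRecord {
    return this.storage.get(taskId) ?? { status: "running" };
  }
}

// Storage outlives any one instance, just as DO storage outlives eviction.
const durableStorage = new Map<string, TaskRecord>();
const first = new SubagentState(durableStorage);
first.complete("task-1", "improved descriptions");

// Simulate eviction + re-instantiation: same storage, fresh instance.
const second = new SubagentState(durableStorage);
// second.poll("task-1") → { status: "done", result: "improved descriptions" }
```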

Implementation: Two-Tool Pattern

We expose two MCP tools to the Integration Builder:

  1. improve_descriptions(config, provider)
    => Starts the task, returns { taskId } immediately
  2. get_improve_descriptions_task_status(taskId, provider)
    => Polls for status (whenever the caller chooses) and retrieves results

The Integration Builder generates a config, delegates description improvement, and continues building other components. Meanwhile, the subagent autonomously researches 20+ operations: reading docs, analysing examples, making tool calls to the parent agent’s MCP server for vector search and web research. The Integration Builder can poll for results, retrieve the config if the task is complete and then integrate the improved descriptions.
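The two-tool surface can be sketched as a pair of handlers delegating to the subagent stub. The tool names are the ones from this post; the stub interface and fake implementation below are stand-ins for illustration, not Cloudflare's actual RPC API:

```typescript
interface SubagentStub {
  start(config: string, provider: string): { taskId: string };
  status(taskId: string): { status: "running" | "done"; result?: string };
}

// Tool 1: kick off the background task, return immediately.
function improve_descriptions(stub: SubagentStub, config: string, provider: string) {
  return stub.start(config, provider); // → { taskId }
}

// Tool 2: poll; the caller decides when and how often.
function get_improve_descriptions_task_status(stub: SubagentStub, taskId: string) {
  return stub.status(taskId);
}

// In-memory fake stub so the pattern is runnable without Workers.
function makeFakeStub(): SubagentStub {
  const tasks = new Map<string, { status: "running" | "done"; result?: string }>();
  let n = 0;
  return {
    start(_config, provider) {
      const taskId = `task-${++n}`;
      tasks.set(taskId, { status: "running" });
      // Pretend the agentic loop finishes shortly after the call returns.
      setTimeout(() => tasks.set(taskId, { status: "done", result: `${provider}: done` }), 0);
      return { taskId };
    },
    status(taskId) {
      return tasks.get(taskId) ?? { status: "running" };
    },
  };
}
```

An MCP server would register these two handlers as tools; everything between start and completion happens inside the subagent, invisible to the caller except through polling.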

Key Takeaways

  • For AI agent builders: Native RPC (direct method calls between Workers, not over HTTP) provides zero-latency communication with persistent state that HTTP-based abstractions cannot match. MCP’s request-response protocol requires a two-tool pattern (start task, then poll status) because HTTP connections cannot maintain state or continue execution after returning a response.
  • The architecture: Durable Objects enable true fire-and-forget delegation with persistent state and background execution. This is something conversational tools like Claude Code subagents can’t provide.
  • The impact: Async subagents transformed our workflow from single-threaded blocking to parallel execution. The Integration Builder now delegates complex tasks and continues building while research happens autonomously in the background.

The future may bring streaming responses or protocol extensions. For now, native RPC with Durable Object persistence gives us reliable, scalable subagent delegation.

Want to see it in action? Book a demo to watch our Claude Code-based Integration Builder Agent.

Building something similar? Reach out to ai@stackone.com - we'd love to chat!
