The Future of APIs: Forward Deployed Agents and Agent Cards

"Never send a human to do a machine's job."

‍

– Agent Smith, The Matrix (1999)

‍

The Three Eras

‍

Computing has had three eras. AI is repeating all of them, in fast forward.

‍

1. Mainframe

‍

One brain, dumb terminal. You send a line, you get a line back. ChatGPT perfected this.

‍

2. Desktop Application

‍

Intelligence moves to the device. You download the data and process it locally. The remote end is storage. This is the MCP assumption: pull the context down, reason on your machine.

‍

3. SaaS

‍

‍The remote end is the capable one. The client explores. The server guides that exploration. Salesforce isn't an Oracle client. It doesn't just store your data. It surfaces what matters, suggests what to do next, shapes how you work.

‍

SaaS companies live and die by FDEs integrating them into their customers' environments. Every new account needs the integration wired up. Someone still has to update the client's integration code by hand when things change.

‍

The Forward Deployed Agent does that job without the ticket.

‍

This is the future of APIs i.e. Agent Programming Interfaces, leading to a universe where enterprises derive the product of capabilities of individual agents, not their sum.

‍

Agents Should Be Distributed

‍

The MCP model works until you need to take action, not just read data.

‍

When an action fails, the retry loop runs on the client. But the remote system knows whether a retry makes sense. Claude's scheduled actions today require your laptop to stay awake. That is the wrong architecture. Execution that belongs to a task should live where the task lives, on the server, running whether or not your device is online.

‍

The same applies to orchestration, error recovery, and state. The intelligence that should own a task is often the one closest to the system it's operating on.

‍

The right model is distributed. A lightweight agent on the client side that understands user intent. A capable agent on the server side that owns execution. They negotiate. They communicate. The user-side agent doesn't pretend to know how the remote system works. The server-side agent doesn't need to understand the user's context.

‍

A2A is one step forward in this direction.

‍

A Product of Capabilities

‍

The most immediate version of this future already exists in how some people work today.

‍

WisprFlow captures your voice, interprets your intent, and hands it off to Claude, which changes the code. WisprFlow is the forward deployed agent. It understands your local context and delegates to the more capable remote agent. Claude doesn't need to know how you speak. WisprFlow doesn't need to know your codebase. The hard part, and the interesting part, is knowing exactly when to hand off, and when the answer is already good enough to act on.

‍

Today, the channel is one way. Someday, soon, WisprFlow will talk back. Reading Claude's response aloud, asking a clarifying question, telling you the build failed.

‍

The Bidirectional Future

‍

Imagine your iPhone had a framework to run a local model. Chrome ships with Gemini Nano built in. In case you missed it, because it was so long ago, they both did that last year. Every modern device has enough inference locally to understand what you want and to answer quickly.

‍

This changes what the server-side agent is for.

‍

When the client is dumb, the server has to do everything. When the client is capable, the two sides can divide the work intelligently. The local agent knows who you are, what you're doing, and what you meant. The remote agent knows the domain, the data, and the system. Neither can do the other's job well.

‍

The server can now guide exploration rather than just answer queries. It can say: you asked about revenue, but given what you've pulled so far, you probably want cohort retention. The client can push back: no, the user already has that, they need the segment breakdown. That exchange, two agents negotiating on behalf of their respective contexts, is not possible when one side is just a data pipe.

‍

This is not science fiction. It is an undercurrent already making waves. The question is not whether this wave arrives. It is whether your agent will get on top of it or be swept under.

‍

The Shift

‍

If you are building a platform that other agents will integrate with, the question is no longer "What does our SDK look like?"

‍

It is: "What does our Agent Card say?"

‍

The Agent Card is your new developer contract. Machine-readable, versioned, negotiated at runtime rather than at npm install time. The constraint is not hardware. It is not even the protocol. It is whether you still think of your SDK as code, or whether you have started thinking of it as a peer agent.

‍

The agent goes forward deployed, and it never needs a flight home.

‍

⬅ Read more Blogs