What you'll learn: How to grow from one agent to a portfolio without fragmentation or duplicated effort.
Key takeaways:
- Expansion follows three pathways. Deepening (adding intents to existing agents), broadening (building new agents), and extending (adding languages or channels).
- Sequence expansion by leverage and risk. Deepening reuses the most infrastructure. Extending carries the highest brand risk.
- The most common expansion failure is starving existing agent maintenance to fund new builds. Allocate capacity deliberately. 70% new build, 20% maintenance, 10% exploration.
- Shared components create compounding returns. Identity verification, context lookup, and monitoring tooling should be built once and reused everywhere.
- Portfolio governance becomes critical when there are three or more agents. Treat agents as a system with shared standards and operational practices.
The Install Reminder Agent had been running for eight months. Completion rates were stable. Operations were smooth. Leadership wanted more.
Marcus had three expansion candidates on the roadmap. The COI Verification Agent would automate the collection of certificates of insurance from drivers. An install verifier, which would confirm successful app installation days after the initial reminder. And Spanish-language support for the existing agent, serving the 30% of drivers who preferred Spanish.
Each represented a different kind of expansion. Marcus needed to understand which to pursue first and how to allocate resources across them.
Nina helped him see the pattern. Expansion followed three distinct pathways, each with different investment requirements and risk profiles.
Three pathways
Deepening meant adding intents to an existing agent. The install verifier would become a new intent within the Install Reminder Agent. Same infrastructure, same telephony, same operational practices. The agent would handle "remind and install" calls and also "verify installation" calls.
Deepening was low risk because it reused existing investment. The telephony was already provisioned. The monitoring was already running. The team already knew how to operate the agent. The new intent added capability without adding operational complexity.
Broadening meant building new agents for different use cases. The COI Verification Agent was a completely new workflow. Different tools, different conversation design, different success metrics. It shared the platform but nothing else.
Broadening required more investment. New tool contracts, new prompts, new test suites, new operational runbooks. But it also unlocked new value. COI collection was a manual process that consumed hundreds of operator hours monthly.
Extending meant adapting existing agents to new channels, languages, or markets. Spanish support for the Install Reminder Agent required new STT/TTS configurations, translated prompts, and a culturally adapted persona. The conversation logic stayed the same. The delivery changed.
Extending the required specialized investment. Translation was more than word substitution. The agent needed to sound natural in Spanish, not like an English speaker translating into Spanish. Pacing, phrasing, and cultural references all needed adaptation.
What to build first
Marcus evaluated each pathway against three criteria.
Shared infrastructure leverage measured how much existing investment could be reused. Deepening had the highest leverage. The install verifier reused everything except the new intent's prompt and tools. Broadening had moderate leverage. The COI Agent reused the platform and operational practices but needed everything else built anew. Extending had variable leverage. Spanish support reused the conversation logic but needed new voice infrastructure.
The risk profile measured what could go wrong and how severe the consequences would be. Deepening was the lowest risk because failures would be isolated to the new intent while the existing agent continued working. Broadening was a moderate risk because a new agent's failure wouldn't affect existing agents. Extending was the highest risk because a poorly localized agent could damage the brand with an entire language segment.
Value unlocked measured the business impact of success. Deepening unlocked incremental value. The install verifier improved completion rates for an existing process. Broadening unlocked new value. The COI Agent automated a manual process. Extending the unlocked market value. Spanish support reached 30% of drivers with a degraded experience.
Marcus plotted each option and chose the sequence: deepen first, broaden second, extend third.
The install verifier went first. High leverage, low risk, incremental value. A good way to build confidence and demonstrate consistent execution.
The COI Agent went second. Moderate leverage, moderate risk, new value. More investment is required, but with a clearer business impact.
Spanish support went third. Lower leverage, higher risk, significant market value. Worth doing, but worth doing carefully.
Where the time goes
With multiple agents in production and more in development, Marcus faced a resource allocation problem. How much capacity went to maintaining existing agents versus building new ones?
He'd seen the common failure. Teams starved existing agent maintenance to fund new builds. The existing agents degraded slowly. Small issues accumulated. By the time anyone noticed, the maintenance backlog was overwhelming.
Marcus established an allocation rule. 70% of capacity went to the current priority initiative. 20% went to maintenance and optimization of existing agents. 10% went to exploration and experimentation.
The 70% built the install verifier, then the COI Agent, then Spanish support. One major initiative at a time, fully staffed.
The 20% kept the Install Reminder Agent healthy. Weekly optimization reviews continued. Bugs got fixed. Monitoring stayed current. The existing agent didn't degrade while attention went elsewhere.
The 10% explored future possibilities. Could the platform handle inbound calls? Could agents transfer between each other? What would multi-language support require? Exploration seeded the roadmap for future prioritization.
The allocation wasn't rigid. Some weeks, a production incident demanded more maintenance attention. Some weeks, a major deadline demanded more building attention. But the baseline allocation prevented any category from being starved.
What gets reused
As the agent portfolio grew, Marcus noticed duplication. Each agent implemented identity verification in a slightly different way. Each had its own error-handling patterns. Each built monitoring dashboards from scratch.
Nina identified candidates for shared components.
Identity verification became a shared module. The same verification flow, prompts, and error handling are used by every agent who needs to verify callers. Changes to verification updated all agents simultaneously.
Context lookup became a shared service. When an agent needed to retrieve caller history, it used the same lookup service regardless of which agent was requesting it. The service was optimized once and used everywhere.
Knowledge base infrastructure became shared. Multiple agents could access the same policy documents, FAQ content, and operational procedures. New documents were indexed once and are available to all agents.
Operational tooling became standardized. Monitoring dashboards followed a template. Alerting configurations followed a pattern. Runbooks used consistent formats. New agents launched with operational infrastructure already in place.
Shared components created compounding returns. The second agent was faster to build than the first because it reused components. The third was faster than the second. The investment in getting components right early paid dividends as the portfolio grew.
Managing multiple agents
At three agents, informal coordination worked. With five agents, Marcus needed governance.
He established a portfolio review cadence. Monthly meetings with engineering, operations, and business stakeholders. Each agent's performance was reviewed. Resource allocation decisions made. Expansion priorities set.
The review covered standardized metrics across all agents. Completion rates, error rates, cost per interaction, business outcomes. Comparing agents on consistent metrics revealed which were healthy and which needed attention.
The review also covered dependencies. Did Agent A's performance depend on a backend service that Agent B also used? Could a change to that service affect both agents? Dependencies needed visibility across the portfolio.
Versioning became critical. Which prompt version was each agent running? Which tool contract versions? Which platform version? When an issue appeared, the governance process enabled rapid identification of which agents might be affected.
The automotive marketplace, managing over a thousand agent configurations across five countries, demonstrated the endpoint of portfolio governance. Their center of excellence treated agents as a system. No agent stood alone. Each was part of a portfolio with shared standards, shared infrastructure, and shared operational practices.
A year after the Install Reminder Agent launched, the logistics company operated three agents. The Install Reminder (now with verification), the COI Agent, and the Spanish Install Reminder.
The roadmap showed three more for the following year. Inbound support triage, callback scheduling, and Portuguese language support.
Marcus pulled up the timeline for the first agent. Four months from concept to production. The COI Agent, the third one they'd built, had taken six weeks. The Spanish Install Reminder, which reused almost everything from the English version, had taken two weeks.
The pattern was clear. Each agent launched faster than the one before. Shared components accumulated. Operational practices solidified. The team improved at a skill they'd use for years.
The Install Reminder Agent had been a project. The portfolio was becoming something else. A capability that would keep compounding as long as they kept building on what came before.

