Mercury Voice Agent
1. Overview
2. Problem
- Respond naturally in real time
- Integrate business tools and data sources
- Scale under high call volumes
- Handle edge conditions like voicemail, noise, and interruptions
3. Core Use Cases
-
Sales and lead qualification
- Automated first-touch calling for inbound leads
- Qualification scoring and CRM handoff
-
Support triage and routing
- Instant intent capture and issue classification
- Smart routing to specialized queues when escalation is needed
-
Collections and reminders
- Payment reminders with contextual follow-up prompts
- Adaptive retry logic based on call outcomes
-
Appointment and operations automation
- Confirmations, rescheduling, and reminder calls
- Backend tool calls for calendar and ticketing updates
4. System Design
4.1 FastAPI Runtime
4.2 Streaming Voice Pipeline
- Audio ingress from telephony provider
- STT transcription in near real time
- LLM reasoning and tool decisions
- TTS synthesis and playback to caller
4.3 Configurable Model Layers
- LLM
- Speech-to-text
- Text-to-speech
4.4 MCP Tool Calling
- Fetching customer records
- Updating CRM states
- Creating support tickets
- Scheduling callbacks
5. Reliability Features
5.1 Noise Handling And Interruption Control
5.2 Voicemail And Machine Detection
5.3 Concurrent Call Stability
6. Performance Outcomes
- ~800ms end-to-end conversational latency in optimized paths
- 1000+ concurrent call handling on tuned FastAPI infrastructure
- Consistent call-state management under long-running campaign loads








