Vapi promotes a deceptively simple promise: $0.05 per minute for voice AI calls. Scan the pricing page for three seconds and you might think you’ve found a bargain. But seasoned agencies know the $0.05 figure is only the orchestration fee — the tip of a much deeper cost iceberg.
The advertised price is almost never what you pay. Vapi says $0.05 per minute. That is true if you ignore the LLM, the transcription engine, the voice synthesis, and the telephony carrier. Add those in and you are at $0.15 to $0.31 per minute before your first call ends.
This guide is a line‑by‑line dissection of what a single minute of Vapi voice AI actually costs. You’ll get the real‑world figures agencies report after dozens of productions, learn why Vapi’s dashboard often hides the true cost, and walk away with a clear model for pricing your services — and keeping your margins intact.
A Vapi voice call isn’t one service. It’s five distinct billable components stacked on top of each other, each invoiced by a different provider:
| Component | What it does | Typical cost per minute |
|---|---|---|
| Vapi Orchestration | Manages WebSocket connections, audio streaming, and provider coordination | $0.05 |
| LLM (Large Language Model) | Powers the agent’s conversation intelligence | $0.02 – $0.20 |
| TTS (Text-to-Speech) | Synthesizes the agent’s spoken responses | $0.01 – $0.07 |
| STT (Speech-to-Text) | Transcribes what the caller says | $0.004 – $0.02 |
| Telephony | Routes the actual phone call | $0.005 – $0.03 |
Real‑world all‑in cost: $0.15 – $0.36 per minute (commonly $0.23 – $0.33), depending on your model and provider choices.
This is the only fixed component. Vapi charges $0.05 per minute for hosting and orchestrating the call — managing the WebSocket, keeping latency low, and stitching together the other four providers. Every other cost is passed through from third‑party services you select.
The language model you pick is the biggest swing factor. GPT‑4o can cost $0.08–$0.20 per minute depending on prompt length and token usage, while smaller models like GPT‑4o mini, Claude 3.5 Haiku, or Gemini Flash can bring this below $0.05/min. Longer prompts and multi‑turn conversations with large context windows push this number higher, sometimes dramatically.
Voice quality directly affects this line. Premium providers like ElevenLabs with the most natural human voices run ~$0.036/min, while budget options like Deepgram or Azure TTS cost ~$0.011/min. Open source voices can lower this to $0.002/min if you self‑host, but at the cost of setup complexity.
Deepgram (~$0.01/min) is the default, but agencies often switch to Assembly AI ($0.00025/min) or ElevenLabs ($0.00667/min) based on accuracy needs. This is typically the smallest line item.
Vapi doesn’t sell phone numbers. You bring your own telephony provider — Twilio, Vonage, or Telnyx — and pay that provider directly. Twilio inbound is ~$0.008–$0.014/min, outbound slightly more. Telnyx is often cheaper at $0.0055/min.
Even when you understand the four‑layer model, Vapi’s dashboard often obscures the real cost. Community forums are full of users reporting the same surprise:
“I can see the latency but I don’t see the costs.”
“The dashboard indicated our agent cost was approximately $0.09/min, which is what we relied on when scaling volume. Actual charges came in at roughly $0.16/min.”
“I use the GPT‑4o cluster model for my assistant (which is $0.08/min). But on the assistant dashboard, the average cost is ~$0.16/min – why?”
The discrepancy comes from two places. First, the dashboard sometimes displays only the Vapi platform fee, excluding provider costs that appear later on separate invoices. Second, Structured Output and Analysis fees, charges for post‑call data extraction, are often missing from the live cost estimate but appear on the final bill.
For agencies, this is poison. How do you price a client engagement when you can’t even see your own costs in real time?
Vapi offers a pay‑as‑you‑go Ad‑Hoc plan and several bundled tiers with included minutes. Here’s the effective per‑minute breakdown after accounting for typical provider costs:
| Tier | Monthly fee | Minutes included | Effective per‑min (incl. providers) | Overage rate |
|---|---|---|---|---|
| Ad‑Hoc | $0 | Pay‑as‑you‑go | ~$0.18/min | ~$0.18/min |
| Enterprise | Custom | Custom | Negotiable | Negotiable |
For enterprise deployments ($40,000–$70,000+/year contracts), Vapi bundles LLM, TTS, STT, and telephony into a single predictable invoice, but that’s only available at scale.
Agency owners who’ve deployed 40+ production agents have a consistent message: budget $0.23–$0.33/min all‑in, and don’t trust the dashboard alone:
“A single 3‑minute call can cost $0.30–0.60. At 500 calls/month, you’re looking at $150–300/mo just in usage — before you’ve paid a developer to build and maintain it.”
Vapi’s $0.05/min platform fee is the lowest among major voice AI platforms, but the total all‑in cost depends on your stack choices:
| Platform | Platform fee | Typical all‑in | Notes |
|---|---|---|---|
| Vapi | $0.05/min | $0.15–$0.36/min | Most flexible; you pick every provider |
| Retell AI | $0.07–$0.23/min | $0.15–$0.30/min | More inclusive pricing; fewer provider decisions |
| Bland AI | $0.09–$0.12/min all‑in | $0.09–$0.12/min | Most predictable; least flexible |
| Safina | $15–$25/month | ~$0.09/min | Bundled telephony + AI; limited customization |
Vapi is the cheapest platform fee per minute, but because you bring your own providers (BYOK), you carry the full cost of LLM, TTS, and STT yourself. The BYOK costs (ElevenLabs, OpenAI, Deepgram) are identical across platforms because they’re the same providers. The difference is that platforms like Retell and Bland negotiate volume discounts with these providers and pass some savings on to you.
If you need maximum control and have the engineering bandwidth to optimize your stack, Vapi is unmatched. If you need predictable billing and faster deployment, Retell or Bland may be better fits. The tradeoff is always control versus predictability.
Given these costs, how do you charge clients and maintain margin?
Model 1: Cost‑Plus Markup. Track actual Vapi + provider costs per client, apply a 1.3×–1.5× markup, and bill monthly. Risk: if costs spike (e.g., a client uses longer prompts), your margin compresses unless you monitor constantly.
Model 2: Flat Per‑Minute Rate. Charge $0.40–$0.55/min regardless of your underlying cost. Pro: predictable for both sides. Con: you need to know your average cost well. At $0.30/min cost, $0.40/min leaves you a 25% margin — slim.
Model 3: Tiered Subscription. $299/month for up to 1,000 minutes, then $0.35/min overage. Best for agencies with stable call volumes. Aligns client expectations with your cost structure.
Model 4: Flat Monthly Retainer. $400–$1,000/month based on forecasted volume. The Reddit thread from an agency owner captures the real market: “When I told him it would be 20 cents/min, he literally said ‘no way’ — but he was willing to pay $400/month fixed.” The agency owner was worried about losing money — but at $400/month for 1,000 minutes, that’s $0.40/min, more than enough to cover the $0.23/min all‑in cost.
The pricing model you choose should reflect your client’s psychology (some prefer fixed, others prefer usage‑based) and your ability to track real costs.
If you’re relying on Vapi’s native dashboard for cost tracking, you’re flying partially blind. The Five white‑label Vapi portal pulls per‑client analytics directly from Vapi’s API, calculates total cost (platform + providers), and applies your chosen pricing model automatically.
Vapi is an extraordinary platform for building voice AI agents. But its pricing model requires active management. Understand the five cost layers. Pick your providers based on both quality and cost. Model your pricing so your margins survive a spike in token usage. And if you’re serving multiple clients, put a system in place that tracks costs automatically — because spreadsheets don’t scale.
The agencies that win in voice AI aren’t the ones with the best prompts. They’re the ones who’ve mastered the economics.
Track costs, automate billing, and give clients a white‑label dashboard. Contact us to learn more about our white-label Vapi dashboard for client and cost management.