Calculating... until our next FREE Code-Along Session. Secure your spot now
GET INSTANT ACCESS READ MORE ABOUT FIVE

Vapi Pricing Demystified

Avatar photo
Dominik Keller
Apr 30th, 2026
Blog

Vapi Pricing Demystified: What Agencies Actually Pay Per Minute

Vapi promotes a deceptively simple promise: $0.05 per minute for voice AI calls. Scan the pricing page for three seconds and you might think you’ve found a bargain. But seasoned agencies know the $0.05 figure is only the orchestration fee — the tip of a much deeper cost iceberg.

The advertised price is almost never what you pay. Vapi says $0.05 per minute. That is true if you ignore the LLM, the transcription engine, the voice synthesis, and the telephony carrier. Add those in and you are at $0.15 to $0.31 per minute before your first call ends.

This guide is a line‑by‑line dissection of what a single minute of Vapi voice AI actually costs. You’ll get the real‑world figures agencies report after dozens of productions, learn why Vapi’s dashboard often hides the true cost, and walk away with a clear model for pricing your services — and keeping your margins intact.



The Real Cost Stack: Why $0.05 Is Just the Start

A Vapi voice call isn’t one service. It’s five distinct billable components stacked on top of each other, each invoiced by a different provider:

ComponentWhat it doesTypical cost per minute
Vapi OrchestrationManages WebSocket connections, audio streaming, and provider coordination$0.05
LLM (Large Language Model)Powers the agent’s conversation intelligence$0.02 – $0.20
TTS (Text-to-Speech)Synthesizes the agent’s spoken responses$0.01 – $0.07
STT (Speech-to-Text)Transcribes what the caller says$0.004 – $0.02
TelephonyRoutes the actual phone call$0.005 – $0.03

Real‑world all‑in cost: $0.15 – $0.36 per minute (commonly $0.23 – $0.33), depending on your model and provider choices.

1. Vapi Platform Fee (~$0.05/min)

This is the only fixed component. Vapi charges $0.05 per minute for hosting and orchestrating the call — managing the WebSocket, keeping latency low, and stitching together the other four providers. Every other cost is passed through from third‑party services you select.

2. LLM Costs ($0.02 – $0.20/min)

The language model you pick is the biggest swing factor. GPT‑4o can cost $0.08–$0.20 per minute depending on prompt length and token usage, while smaller models like GPT‑4o mini, Claude 3.5 Haiku, or Gemini Flash can bring this below $0.05/min. Longer prompts and multi‑turn conversations with large context windows push this number higher, sometimes dramatically.

3. Text-to-Speech ($0.01 – $0.07/min)

Voice quality directly affects this line. Premium providers like ElevenLabs with the most natural human voices run ~$0.036/min, while budget options like Deepgram or Azure TTS cost ~$0.011/min. Open source voices can lower this to $0.002/min if you self‑host, but at the cost of setup complexity.

4. Speech-to-Text ($0.004 – $0.02/min)

Deepgram (~$0.01/min) is the default, but agencies often switch to Assembly AI ($0.00025/min) or ElevenLabs ($0.00667/min) based on accuracy needs. This is typically the smallest line item.

5. Telephony ($0.005 – $0.03/min)

Vapi doesn’t sell phone numbers. You bring your own telephony provider — Twilio, Vonage, or Telnyx — and pay that provider directly. Twilio inbound is ~$0.008–$0.014/min, outbound slightly more. Telnyx is often cheaper at $0.0055/min.


What the Dashboards Won’t Tell You

Even when you understand the four‑layer model, Vapi’s dashboard often obscures the real cost. Community forums are full of users reporting the same surprise:

“I can see the latency but I don’t see the costs.”

“The dashboard indicated our agent cost was approximately $0.09/min, which is what we relied on when scaling volume. Actual charges came in at roughly $0.16/min.”

“I use the GPT‑4o cluster model for my assistant (which is $0.08/min). But on the assistant dashboard, the average cost is ~$0.16/min – why?”

The discrepancy comes from two places. First, the dashboard sometimes displays only the Vapi platform fee, excluding provider costs that appear later on separate invoices. Second, Structured Output and Analysis fees, charges for post‑call data extraction, are often missing from the live cost estimate but appear on the final bill.

For agencies, this is poison. How do you price a client engagement when you can’t even see your own costs in real time?


Vapi Pricing Tiers: What Each Plan Actually Costs

Vapi offers a pay‑as‑you‑go Ad‑Hoc plan and several bundled tiers with included minutes. Here’s the effective per‑minute breakdown after accounting for typical provider costs:

TierMonthly feeMinutes includedEffective per‑min (incl. providers)Overage rate
Ad‑Hoc$0Pay‑as‑you‑go~$0.18/min~$0.18/min
EnterpriseCustomCustomNegotiableNegotiable

For enterprise deployments ($40,000–$70,000+/year contracts), Vapi bundles LLM, TTS, STT, and telephony into a single predictable invoice, but that’s only available at scale.


What Real Agencies Are Reporting

Agency owners who’ve deployed 40+ production agents have a consistent message: budget $0.23–$0.33/min all‑in, and don’t trust the dashboard alone:

  • A single 3‑minute call costs $0.30–$0.60 at standard provider settings.
  • 500 calls/month runs $150–$300/month just in usage, before you’ve paid for development or maintenance.
  • At 300 minutes/month, agencies report $36–$72. At 2,000 minutes/month, costs climb to $260–$620 — a wide band that depends entirely on your LLM and TTS choices.
  • An agency with 10 clients averaging 500 minutes/month each spends ~$1,049/month on a Vapi + wrapper stack. If they negotiate better telephony and use smaller models, that can drop to $749/month.

“A single 3‑minute call can cost $0.30–0.60. At 500 calls/month, you’re looking at $150–300/mo just in usage — before you’ve paid a developer to build and maintain it.”


Vapi vs. Competitors: How Does It Stack Up?

Vapi’s $0.05/min platform fee is the lowest among major voice AI platforms, but the total all‑in cost depends on your stack choices:

PlatformPlatform feeTypical all‑inNotes
Vapi$0.05/min$0.15–$0.36/minMost flexible; you pick every provider
Retell AI$0.07–$0.23/min$0.15–$0.30/minMore inclusive pricing; fewer provider decisions
Bland AI$0.09–$0.12/min all‑in$0.09–$0.12/minMost predictable; least flexible
Safina$15–$25/month~$0.09/minBundled telephony + AI; limited customization

Vapi is the cheapest platform fee per minute, but because you bring your own providers (BYOK), you carry the full cost of LLM, TTS, and STT yourself. The BYOK costs (ElevenLabs, OpenAI, Deepgram) are identical across platforms because they’re the same providers. The difference is that platforms like Retell and Bland negotiate volume discounts with these providers and pass some savings on to you.

If you need maximum control and have the engineering bandwidth to optimize your stack, Vapi is unmatched. If you need predictable billing and faster deployment, Retell or Bland may be better fits. The tradeoff is always control versus predictability.


How to Price Your Agency Services (Without Losing Money)

Given these costs, how do you charge clients and maintain margin?

Model 1: Cost‑Plus Markup. Track actual Vapi + provider costs per client, apply a 1.3×–1.5× markup, and bill monthly. Risk: if costs spike (e.g., a client uses longer prompts), your margin compresses unless you monitor constantly.

Model 2: Flat Per‑Minute Rate. Charge $0.40–$0.55/min regardless of your underlying cost. Pro: predictable for both sides. Con: you need to know your average cost well. At $0.30/min cost, $0.40/min leaves you a 25% margin — slim.

Model 3: Tiered Subscription. $299/month for up to 1,000 minutes, then $0.35/min overage. Best for agencies with stable call volumes. Aligns client expectations with your cost structure.

Model 4: Flat Monthly Retainer. $400–$1,000/month based on forecasted volume. The Reddit thread from an agency owner captures the real market: “When I told him it would be 20 cents/min, he literally said ‘no way’ — but he was willing to pay $400/month fixed.” The agency owner was worried about losing money — but at $400/month for 1,000 minutes, that’s $0.40/min, more than enough to cover the $0.23/min all‑in cost.

The pricing model you choose should reflect your client’s psychology (some prefer fixed, others prefer usage‑based) and your ability to track real costs.


How Five Closes the Visibility Gap

If you’re relying on Vapi’s native dashboard for cost tracking, you’re flying partially blind. The Five white‑label Vapi portal pulls per‑client analytics directly from Vapi’s API, calculates total cost (platform + providers), and applies your chosen pricing model automatically.

  • Real‑time margins — See cost vs. revenue per client, updated hourly.
  • Automated Stripe billing — Stop exporting CSVs and manually creating invoices.
  • Client dashboards without the Vapi UI — Give clients a branded view of their usage, not the technical Vapi canvas.
  • No dashboard surprises — Five aggregates costs from all providers into a single client‑facing number, so you’re never surprised by a bill.

Take Control of Your Vapi Economics

Vapi is an extraordinary platform for building voice AI agents. But its pricing model requires active management. Understand the five cost layers. Pick your providers based on both quality and cost. Model your pricing so your margins survive a spike in token usage. And if you’re serving multiple clients, put a system in place that tracks costs automatically — because spreadsheets don’t scale.

The agencies that win in voice AI aren’t the ones with the best prompts. They’re the ones who’ve mastered the economics.


Build Your Profitable Voice AI Agency With Five

Track costs, automate billing, and give clients a white‑label dashboard. Contact us to learn more about our white-label Vapi dashboard for client and cost management.


Start developing your first application!

Get Started For Free Today

Sign Up Free Book a demo

Build Your Web App With Five

200+ Free Trials Started This Week

Start Free

Thank you for your message!

Our friendly staff will contact you shortly.

CLOSE