Voice agents API

Build real-time voice agents that listen, think, and speak. Wire up tools, knowledge, and phone numbers — and let agents hold the conversation.

$0.07/min
all-in — LLM, STT & TTS
60 min
free every month
SOC 2
Type II certified
bash
curl -X POST https://api.speechify.ai/v1/agents \
  -H "Authorization: Bearer $SPEECHIFY_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "Support Bot",
    "prompt": "You are a friendly support agent for a SaaS product. Greet callers, answer questions about billing and account settings, and transfer to a human if you cannot help.",
    "first_message": "Hi, this is Sabrina with support. How can I help today?",
    "voice_id": "sabrina",
    "language": "en",
    "temperature": 0.7
  }'

Everything an agent needs

01

Tools & function calling

Let agents call your APIs mid-conversation to look things up, take actions, and complete tasks in real time.

02

Knowledge base

Ground every answer in your own content with built-in retrieval. Upload documents, files, and sitemaps.

03

Memory

Agents remember context within a conversation — and across calls — for interactions that feel continuous.

04

Phone numbers & telephony

Inbound and outbound calling over SIP. Provision numbers and connect agents straight to the phone network.

05

Webhooks & events

Stream conversation events to your backend as they happen, with signed, verifiable payloads.

06

Testing & simulation

Validate behavior before you ship with reply, tool, and full-conversation simulation tests.

By industry

Tuned to the workflows that matter most in your industry.

How we compare

Honest, side-by-side comparisons with the other voice-agent platforms.

One rate. Everything included.

Most platforms quote a platform fee, then bill the LLM, speech, and telephony on top. Speechify is one all-in rate per minute.

Most platforms
Platform fee
LLM passthrough +
Speech-to-text +
Text-to-speech +
Telephony +

Five line items, each metered separately. The bill takes a spreadsheet.

Speechify
From $0.07/min
  • LLM
  • Speech-to-text
  • Text-to-speech
  • Telephony orchestration

One line item. No passthrough, no token math — and 60 free minutes every month.

Read the full Voice Agents documentation

Guides for tools, knowledge bases, memory, phone numbers, webhooks, and testing.