Lifelike, expressive speech from text — in a single API call.
Real-time agents that listen, think, and speak.