Every voice model. Ultra-fast.


Now in public beta!

Every voice model.
Ultra-Fast.

A fast, stable, and scalable API for top new text-to-speech models, including Sesame CSM-1B and Dia. 

Vogent Voicelab is officially in public beta!

Get started in seconds

Sign up to get $5 in credits to run top voice models.

Use the latest voice models in seconds

Run the latest super-realistic voice models – like Sesame CSM-1B, Dia, Chatterbox, Orpheus, and more – with one API. Our hosted models run on an optimized voice inference stack and are post-trained to improve quality, so you can run state-of-the-art research in production. Get started in a few lines of code without managing compute.

nari-labs/dia

canopyai/orpheus

hexgrad/kokoro

resemble-ai/chatterbox

sesame/csm-1b

Better, faster, cheaper

Higher-quality than popular closed-source text-to-speech for a fraction of the price. Optimized compute for real-time inference and sub-200ms time-to-first-token, so you can use ultra-realistic models for voice agents.

Voice clone, fine-tune, and make it your own

Use zero-shot voice cloning natively, or run our fine-tune recipes to more deeply adjust style while training and hosting on our infrastructure.

Everything you need

Scale up instantly

From a single voiceover to thousands of concurrent voice agents, our infrastructure scales compute with your usage and deploys across the globe. 

Secure and Compliant

SOC 2 Type II and HIPAA compliant

Enterprise-level Deployment

Use our API's, or host our inference stack on-prem or in your VPC.

Committed-use Discounts

High-volume user? Get discounts by committing to monthly usage.