Every voice model. Ultra-fast.


Vogent Voicelab is officially in public beta!

Voice clone, fine-tune, and make it your own

Use zero-shot voice cloning natively, or run our fine-tune recipes to adjust style more deeply, with training and hosting on our infrastructure.

Scale up instantly

From a single voiceover to thousands of concurrent voice agents, our infrastructure scales compute with your usage and deploys across the globe. 

Free

$0

/month

180 minutes of high-quality Text to Speech

Instant Voice Cloning

API Access

Studio Access

Additional credits for ~6c/minute

Discord Support Channel

Starter

$20

/month

Everything in Free, plus

500 minutes of high-quality Text to Speech

3 concurrent requests

Additional credits for ~4c/minute

Pro

Most Popular

$150

/month

Everything in Starter, plus

5000 minutes of high-quality Text to Speech

Hosted fine-tunes

30 concurrent requests

HIPAA-compliant workspace

Additional credits for ~3c/minute

Dedicated Slack channel

Business

Contact Us

Everything in Pro, plus

Dedicated account manager

On-prem/VPC deployments

Custom-trained voices

Unlimited concurrency

Volume discounts
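
To compare plans for a given monthly volume, the arithmetic can be sketched as below. This is a rough illustration, not an official calculator: it assumes additional minutes are billed per minute beyond the included allowance, and it treats the approximate "~Nc/minute" rates listed above as exact. The `Plan`, `monthly_cost`, and `cheapest_plan` names are made up for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plan:
    name: str
    base_usd: float             # monthly subscription price
    included_minutes: int       # text-to-speech minutes included each month
    overage_usd_per_min: float  # approximate price of each additional minute


# Figures copied from the plan list above; overage rates are approximate.
PLANS = [
    Plan("Free", 0.0, 180, 0.06),
    Plan("Starter", 20.0, 500, 0.04),
    Plan("Pro", 150.0, 5000, 0.03),
]


def monthly_cost(plan: Plan, minutes: int) -> float:
    """Estimated monthly bill, assuming overage is charged per extra minute."""
    extra = max(0, minutes - plan.included_minutes)
    return round(plan.base_usd + extra * plan.overage_usd_per_min, 2)


def cheapest_plan(minutes: int) -> Plan:
    """Return the self-serve plan with the lowest estimated bill."""
    return min(PLANS, key=lambda plan: monthly_cost(plan, minutes))


if __name__ == "__main__":
    for minutes in (100, 1000, 6000):
        best = cheapest_plan(minutes)
        print(f"{minutes} min/month: {best.name} at ${monthly_cost(best, minutes):.2f}")
```

Under these assumptions, 1,000 minutes a month costs about $40 on Starter (500 extra minutes at ~4c), compared with about $49.20 on Free and $150 on Pro. Business pricing is not listed, so it is left out of the comparison.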

Secure and Compliant

SOC 2 Type II and HIPAA compliant

Enterprise-level Deployment

Use our APIs, or host our inference stack on-prem or in your VPC.

Committed-use Discounts

High-volume user? Get discounts by committing to monthly usage.

Use the latest voice models in seconds

Run the latest super-realistic voice models – like Sesame CSM-1B, Dia, Chatterbox, Orpheus, and more – with one API. Our hosted models run on an optimized voice inference stack and are post-trained to improve quality, so you can run state-of-the-art research in production. Get started in a few lines of code without managing compute.

nari-labs/dia

canopyai/orpheus

hexgrad/kokoro

resemble-ai/chatterbox

sesame/csm-1b

A fast, stable, and scalable API for top new text-to-speech models, including Sesame CSM-1B and Dia. 

Better, faster, cheaper

Higher quality than popular closed-source text-to-speech, at a fraction of the price. Optimized compute delivers real-time inference and sub-200 ms time-to-first-token, so you can use ultra-realistic models for voice agents.

Free

$0

/month

180 minutes of high-quality Text to Speech

Instant Voice Cloning

API Access

Studio Access

Additional credits for ~6c/minute

Discord Support Channel

Starter

$20

/month

Everything in Free, plus

780 minutes of high-quality Text to Speech

3 concurrent requests

Additional credits for ~4c/minute

Pro

Most popular

$150

/month

Everything in Starter, plus

Hosted fine-tunes

30 concurrent requests

HIPAA-compliant workspace

Additional credits for ~3c/minute

Dedicated Slack channel

Enterprise

Contact Us

Everything in Pro, plus

Dedicated account manager

On-prem/VPC deployments

Custom-trained voices

Unlimited concurrency

Volume discounts

Get started in seconds

Sign up to get $5 in credits to run top voice models.

Ultra-Fast, Ultra-Realistic Voice AI

The most realistic text-to-speech API, powered by top research like CSM-1B and Dia.

Higher quality than other TTS, for a fraction of the cost.
