Cartesia blog

[News]

Introducing Ink-2:The #1-ranked STT built for voice agents

Eli PughJul 9, 2026

A spoken question spiraling word by word into a microphone, evoking speech-to-text transcription

A Guide to Choosing Voice AI Models

A practical framework for evaluating ASR, TTS, and turn-detection models against your real-world use case — not lab conditions.

[Product]

Jun 30, 2026

A Beginner's Guide to Voice AI Terminology

A plain-language glossary of the 20 key terms you need to understand, evaluate, and build conversational voice AI.

[Product]

Jun 30, 2026

Cartesia achieves GDPR compliance

Cartesia's text-to-speech platform is now GDPR compliant, reaffirming our commitment to data protection, privacy, and responsible human-centered AI.

[News]

Sep 24, 2025

Introducing Line: The Modern Voice Agent Development Platform

Introducing Line by Cartesia: the modern voice agent development platform. Line was built to be code-first, because best-in-class products are built in code.

[Product]

Aug 19, 2025

Hierarchical modeling

Introducing H-Nets, hierarchical architectures that learn directly from raw data by grouping inputs into higher-level concepts, overcoming tokenization limits.

[Research]

Jul 11, 2025

Introducing Ink: speech-to-text models for real-time conversation

Today we’re introducing Ink, a new family of streaming speech-to-text (STT) models for developers building real-time voice applications. Ink-Whisper is the fastest, most affordable STT model–designed for enterprise-grade voice agents.

[Product]

Jun 10, 2025

Introducing Organizations and Dashboards

We’re building Cartesia for developers scaling voice AI. Today, we’re introducing two features to make collaboration and visibility easier: Organizations and Dashboards.

[Product]

May 15, 2025

Introducing Professional Voice Cloning

Introducing Professional Voice Cloning (PVC), professional-quality voice clones trained on Sonic, now available on Startup+ plans.

[Product]

May 6, 2025

Cartesia Python SDK v2.0.0

We are excited to announce the release of v2.0.0 of our Python SDK, polishing the developer experience when using Cartesia's AI voice capabilities with Python.

[Engineering]

Mar 28, 2025

Cartesia Named to 7th Annual Enterprise Tech 30 List Presented by Wing Venture Capital

Cartesia, a leading provider of generative audio models, today announced it has been named to the seventh annual Enterprise Tech 30—a definitive list of the most promising, private enterprise tech companies across all stages of maturity.

[News]

Mar 25, 2025

Introducing Narrations: create and edit long-form audio content with precision

Today we're excited to introduce Narrations, a platform that enables creators to transform written content into polished audio productions with unprecedented control and efficiency.

[Product]

Mar 18, 2025

Series A and the future of voice AI

We’re thrilled to announce our $64 million Series A led by Kleiner Perkins. The new funding will help us expand our team and invest in research to build the next generation of models.

[News]

Mar 11, 2025

Llamba: scaling distilled recurrent models for efficient language processing

The next few years will usher in a new era of on-device AI. On-device models will power a wide range of applications.

[Research]

Mar 5, 2025

How to build a voice AI agent with Cartesia

A step-by-step guide to building a natural, low-latency voice AI agent with Cartesia's Sonic text-to-speech, from architecture to production deployment.

[Engineering]

Mar 3, 2025

State of voice AI 2024

In our first 2024 State of Voice report, we highlight the key infrastructure breakthroughs and emerging use cases driving the industry forward, and look ahead to what’s next in 2025.

[Research]

Dec 19, 2024

Announcing our seed round

We’re excited to announce our $27M seed round, led by Index Ventures with participation from Lightspeed, Factory, Conviction, General Catalyst, A*, SV Angel, and 90 amazing angel investors.

[News]

Dec 12, 2024

‘Tis the Hackathon season at Cartesia

October 2024 was a busy month for Cartesia. We brought together over 2,000 builders across San Francisco and gave away $20,000 in prizes for the most innovative ideas built on Sonic.

[Engineering]

Nov 14, 2024

Introducing voice changer: transform audio your way

Voice Changer transforms the voice of any audio clip while preserving its original delivery, emotion, and prosody — with studio-quality voices or clones.

[Product]

Oct 30, 2024

Introducing our next 8 languages on Sonic Multilingual

Today, we're excited to announce the Alpha Release of our next 8 languages—Hindi, Italian, Korean, Dutch, Polish, Russian, Swedish, and Turkish—on Sonic Multilingual.

[Product]

Oct 8, 2024

The on-device intelligence update

At Cartesia, our mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are.

[Research]

Aug 28, 2024

Announcing Sonic: a low‑latency voice model for lifelike speech

We're releasing Sonic, our low-latency voice model that generates lifelike speech today.

[Research]

May 31, 2024

Based: Simple linear attention language models balance the recall‑throughput tradeoff

Based is 56% and 44% faster at processing prompts than FlashAttention-2 and Mamba respectively. Based achieves 24x higher text generation throughput than FlashAttention-2.

[Research]

Mar 4, 2024