From the team.

Announcements, engineering deep-dives, and notes from the team building real-time AI.

Featured post

Cartesia achieves GDPR compliance

· Karan Goel

GDPR logo next to text, Cartesia is now GDPR compliant
Introducing Line: The Modern Voice Agent Development Platform

· Karan Goel

Introducing Line: The Modern Voice Agent Development Platform

Introducing Line by Cartesia: the modern voice agent development platform. Line was built to be code-first, because best-in-class products are built in code.

Read post
Hierarchical modeling

· Albert Gu, Brandon Wang

Hierarchical modeling

Read post
Introducing Ink: speech-to-text models for real-time conversation

· Arjun Desai

Introducing Ink: speech-to-text models for real-time conversation

Today we’re introducing Ink, a new family of streaming speech-to-text (STT) models for developers building real-time voice applications. Ink-Whisper is the fastest, most affordable STT model–designed for enterprise-grade voice agents.

Read post
Introducing Organizations and Dashboards

· Brandon Yang

Introducing Organizations and Dashboards

We’re building Cartesia for developers scaling voice AI. Today, we’re introducing two features to make collaboration and visibility easier: Organizations and Dashboards.

Read post
Introducing Professional Voice Cloning

· Brandon Yang

Introducing Professional Voice Cloning

Introducing Professional Voice Cloning (PVC), professional-quality voice clones trained on Sonic, now available on Startup+ plans.

Read post
Cartesia Python SDK v2.0.0

· Arjun Desai

Cartesia Python SDK v2.0.0

We are excited to announce the release of v2.0.0 of our Python SDK, polishing the developer experience when using Cartesia's AI voice capabilities with Python.

Read post
Cartesia Named to 7th Annual Enterprise Tech 30 List Presented by Wing Venture Capital

· Karan Goel

Cartesia Named to 7th Annual Enterprise Tech 30 List Presented by Wing Venture Capital

Cartesia, a leading provider of generative audio models,  today announced it has been named to the seventh annual Enterprise Tech 30—a definitive list of the most promising, private enterprise tech companies across all stages of maturity.

Read post
Introducing Narrations: create and edit long-form audio content with precision

· Chang Chen

Introducing Narrations: create and edit long-form audio content with precision

Today we're excited to introduce Narrations, a platform that enables creators to transform written content into polished audio productions with unprecedented control and efficiency.

Read post
Series A and the future of voice AI

· Karan Goel

Series A and the future of voice AI

We’re thrilled to announce our $64 million Series A led by Kleiner Perkins. The new funding will help us expand our team and invest in research to build the next generation of models.

Read post
Llamba: scaling distilled recurrent models for efficient language processing

· Aviv Bick, Tobias Katsch, Nimit Sohoni, Arjun Desai, Albert Gu

Llamba: scaling distilled recurrent models for efficient language processing

The next few years will usher in a new era of on-device AI. On-device models will power a wide range of applications.

Read post
How to build a voice AI agent with Cartesia

· Chang Chen

How to build a voice AI agent with Cartesia

How to Build a Voice AI Agent with Cartesia

Read post
State of voice AI 2024

· Karan Goel

State of voice AI 2024

In our first 2024 State of Voice report, we highlight the key infrastructure breakthroughs and emerging use cases driving the industry forward, and look ahead to what’s next in 2025.

Read post
Announcing our seed round

· Karan Goel

Announcing our seed round

We’re excited to announce our $27M seed round, led by Index Ventures with participation from Lightspeed, Factory, Conviction, General Catalyst, A*, SV Angel, and 90 amazing angel investors.

Read post
‘Tis the Hackathon season at Cartesia

· Karan Goel

‘Tis the Hackathon season at Cartesia

October 2024 was a busy month for Cartesia. We brought together over 2,000 builders across San Francisco and gave away $20,000 in prizes for the most innovative ideas built on Sonic.

Read post
Introducing voice changer: transform audio your way

· Karan Goel

Introducing voice changer: transform audio your way

Read post
Introducing our next 8 languages on Sonic Multilingual

· Karan Goel

Introducing our next 8 languages on Sonic Multilingual

Today, we're excited to announce the Alpha Release of our next 8 languages—Hindi, Italian, Korean, Dutch, Polish, Russian, Swedish, and Turkish—on Sonic Multilingual.

Read post
The on-device intelligence update

· Karan Goel

The on-device intelligence update

At Cartesia, our mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are.

Read post
Announcing Sonic: a low‑latency voice model for lifelike speech

· Karan Goel

Announcing Sonic: a low‑latency voice model for lifelike speech

We're releasing Sonic, our low-latency voice model that generates lifelike speech today.

Read post
Based: Simple linear attention language models balance the recall‑throughput tradeoff

· Sabri Eyuboglu, Simran Arora, Michael Zhang

Based: Simple linear attention language models balance the recall‑throughput tradeoff

Based is 56% and 44% faster at processing prompts than FlashAttention-2 and Mamba respectively. Based achieves 24x higher text generation throughput than FlashAttention-2.

Read post
Mamba‑3B-SlimPJ: State-space models rivaling the best Transformer architecture

· Albert Gu, Tri Dao

Mamba‑3B-SlimPJ: State-space models rivaling the best Transformer architecture

We're releasing the strongest Mamba language model yet, Mamba-3B-SlimPJ, in partnership with Cartesia & Together under an Apache 2.0 license.

Read post