From the team.
Announcements, engineering deep-dives, and notes from the team building real-time AI.
Featured post
Cartesia achieves GDPR compliance
· Karan Goel

· Karan Goel
Introducing Line: The Modern Voice Agent Development Platform
Introducing Line by Cartesia: the modern voice agent development platform. Line was built to be code-first, because best-in-class products are built in code.
Read post
· Albert Gu, Brandon Wang
Hierarchical modeling
Read post
· Arjun Desai
Introducing Ink: speech-to-text models for real-time conversation
Today we’re introducing Ink, a new family of streaming speech-to-text (STT) models for developers building real-time voice applications. Ink-Whisper is the fastest, most affordable STT model–designed for enterprise-grade voice agents.
Read post
· Brandon Yang
Introducing Organizations and Dashboards
We’re building Cartesia for developers scaling voice AI. Today, we’re introducing two features to make collaboration and visibility easier: Organizations and Dashboards.
Read post
· Brandon Yang
Introducing Professional Voice Cloning
Introducing Professional Voice Cloning (PVC), professional-quality voice clones trained on Sonic, now available on Startup+ plans.
Read post
· Arjun Desai
Cartesia Python SDK v2.0.0
We are excited to announce the release of v2.0.0 of our Python SDK, polishing the developer experience when using Cartesia's AI voice capabilities with Python.
Read post
· Karan Goel
Cartesia Named to 7th Annual Enterprise Tech 30 List Presented by Wing Venture Capital
Cartesia, a leading provider of generative audio models, today announced it has been named to the seventh annual Enterprise Tech 30—a definitive list of the most promising, private enterprise tech companies across all stages of maturity.
Read post
· Chang Chen
Introducing Narrations: create and edit long-form audio content with precision
Today we're excited to introduce Narrations, a platform that enables creators to transform written content into polished audio productions with unprecedented control and efficiency.
Read post
· Karan Goel
Series A and the future of voice AI
We’re thrilled to announce our $64 million Series A led by Kleiner Perkins. The new funding will help us expand our team and invest in research to build the next generation of models.
Read post
· Aviv Bick, Tobias Katsch, Nimit Sohoni, Arjun Desai, Albert Gu
Llamba: scaling distilled recurrent models for efficient language processing
The next few years will usher in a new era of on-device AI. On-device models will power a wide range of applications.
Read post
· Chang Chen
How to build a voice AI agent with Cartesia
How to Build a Voice AI Agent with Cartesia
Read post
· Karan Goel
State of voice AI 2024
In our first 2024 State of Voice report, we highlight the key infrastructure breakthroughs and emerging use cases driving the industry forward, and look ahead to what’s next in 2025.
Read post
· Karan Goel
Announcing our seed round
We’re excited to announce our $27M seed round, led by Index Ventures with participation from Lightspeed, Factory, Conviction, General Catalyst, A*, SV Angel, and 90 amazing angel investors.
Read post
· Karan Goel
‘Tis the Hackathon season at Cartesia
October 2024 was a busy month for Cartesia. We brought together over 2,000 builders across San Francisco and gave away $20,000 in prizes for the most innovative ideas built on Sonic.
Read post
· Karan Goel
Introducing voice changer: transform audio your way
Read post
· Karan Goel
Introducing our next 8 languages on Sonic Multilingual
Today, we're excited to announce the Alpha Release of our next 8 languages—Hindi, Italian, Korean, Dutch, Polish, Russian, Swedish, and Turkish—on Sonic Multilingual.
Read post
· Karan Goel
The on-device intelligence update
At Cartesia, our mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are.
Read post
· Karan Goel
Announcing Sonic: a low‑latency voice model for lifelike speech
We're releasing Sonic, our low-latency voice model that generates lifelike speech today.
Read post
· Sabri Eyuboglu, Simran Arora, Michael Zhang
Based: Simple linear attention language models balance the recall‑throughput tradeoff
Based is 56% and 44% faster at processing prompts than FlashAttention-2 and Mamba respectively. Based achieves 24x higher text generation throughput than FlashAttention-2.
Read post
· Albert Gu, Tri Dao
Mamba‑3B-SlimPJ: State-space models rivaling the best Transformer architecture
We're releasing the strongest Mamba language model yet, Mamba-3B-SlimPJ, in partnership with Cartesia & Together under an Apache 2.0 license.
Read postCompany
Regions
Company