Former Google Text‑to‑Speech Product Lead partners with Cartesia for Business AI Phone Agents
October 9, 2024
“I became an early adopter of Cartesia the day they launched as soon as I saw how low their latency was. As the former Product Lead for Google Text-to-Speech, I've been closely monitoring advancements in voice AI technology. It was only a year ago that the industry celebrated achieving latency times under one second, a milestone that seemed groundbreaking at the time. Sonic is the only product in existence with model latency of less than 100 ms, outperforming its next best alternative by a factor of four. This level of performance represents a quantum leap forward, surpassing what I had anticipated as feasible in the short term.”
— Bob Summers, CEO & Founder of Goodcall
Goodcall helps businesses, from solo operations to large enterprises with in-house call centers, grow with an AI phone agent for reception, sales, and service that sets up in minutes. Goodcall supports a wide range of applications that contribute directly to customers' bottom line, such as capturing sales leads, qualifying customers, providing an answering service, scheduling for service businesses, and automating frequent inbound workflows. Additionally, Goodcall’s agents continuously learn from customer interactions to offer every business a custom, 100% accurate agent without code or prompting.
Bob’s expertise in speech technologies stems from his time leading Google’s Text-to-Speech product and co-founding a business phone assistant within Google’s product incubator, Area 120. Bob’s early product feedback has been integral in helping Cartesia build a best-in-class solution for conversational AI. Today, just three months after Bob discovered Cartesia, Goodcall has switched 100% of its text-to-speech generation for 2,217 unique voice agents from Eleven Labs to Cartesia.
Try out Goodcall’s Sonic-powered agents on their website. They cover a wide range of use cases, such as sales lead qualification, insurance logistics, and concierge services.
Goodcall chose Sonic for the following comprehensive feature set:
Industry-Leading Latency - With Sonic, Goodcall achieves an average time-to-first-audio of 90 ms, including both model and network latency, which is more than four times as fast as the performance it experienced with Eleven Labs.
Best-In-Class Conversational Quality - Bob was able to silently switch his voice provider to Sonic with zero customer complaints, and Goodcall has consistently hit a 97% interaction rate, measured by whether a call receiver stayed on the line to complete their request when an AI voice was running the call. “One customer asked whether we’d made a significant change because the voices sounded much closer to a real human,” said Bob.
Voice Design Capabilities - “The manner in which I can instantly clone a voice and blend it with another one is something I’ve never seen before. I love that I can do it in just a couple clicks, with the most intuitive user interface.”
“With his extensive experience as the Product Lead for Google’s Text-to-Speech product, Bob is a pioneer in the generative speech industry, whose voice technology continues to power widely used products such as Google Maps. We’re thankful to have had his input on our product from day one and for the opportunity to push forward the field of conversational voice agents with Goodcall.”
— Karan Goel, CEO, Cartesia