AI

Speechmatics raises $62M for its inclusive approach to speech-to-text AI

Comment

Image Credits: Speechmatics

Last week I wrote about an AI startup that’s building technology that can alter, in real time, the accent of someone’s speech. But what if the AI goal instead is to make it possible for people speaking in whatever way they do, to be understood just as they are, and to remove some of the bias inherent in a lot of AI systems in the process? There’s a major need for that, too, and now a U.K. startup called Speechmatics — which has built AI to translate speech to text, regardless of the accent or how the person speaks — is announcing $62 million in funding to expand its business.

Susquehanna Growth Equity out of the U.S. led the round with U.K. investors AlbionVC and IQ Capital also participating. This Series B is a big step up for Speechmatics. The company was originally spun out back in 2006 of AI research in Cambridge by founder Dr. Tony Robinson, and prior to this had only raised around $10 million (Albion and IQ are among those past backers, along with the CIA-backed In-Q-Tel and others).

In the interim it has built up a customer base of some 170 — it only sells B2B, to power consumer-facing or business-facing services — and while it doesn’t disclose the full list, some of the names include what3words, 3Play Media, Veritone, Deloitte UK and Vonage, which variously use the tech not just for making transcriptions in the traditional sense; but for taking in spoken words to help other aspects of an app function, such as automatic captioning, or to power wider accessibility features.

Its engine today is able to translate speech to text in 34 languages, and in addition to using the funding both to continue improving the accuracy there, and for business development, it will be adding more languages and looking at different use cases, such as building speech to text that can be used in the more tricky environment of motor vehicles (where motor noise and vibrations impact how AIs can ingest the sounds).

“What we have done is gather millions of hours of data in our effort to tackle AI bias. Our goal is to understand any and every voice, in multiple languages,” said Katy Wigdahl, the CEO of the startup (a title she co-held with Robinson, who has since stepped back from an executive role recently).

This manifests in the company’s product focus as well as its mission, and that’s something it’s also looking to expand.

“The way we look at language is global,” Wigdahl said. “Google will have a different pack for every version of English but our one pack will understand every one.” It initially only made its tech available by way of a private API that it sold to customers; now in an effort to bring in more users and potentially more paying users, it’s also offering more open API tools to developers to play with the tech, and a drag-and-drop sampler on its site.

And indeed, if one of Speechmatics’ challenges is in training AI to be more human in its understanding of how people speak, the other is to carve out a name for itself against other major providers of speech-to-text technology.

Wigdahl said the company today competes against “Big Tech” — that is, major companies like Amazon, Google and Microsoft (which now has Nuance) that have built speech recognition engines and provide the tech as a service to third parties.

But it says it consistently scores better than these in tests for being able to comprehend when languages are spoken in the many ways that they are. (One test it cited to me was Stanford’s ‘Racial Disparities in Speech Recognition’ study, where it recorded “an overall accuracy of 82.8% for African American voices compared to Google (68.6%) and Amazon (68.6).” It said that “equates to a 45% reduction in speech recognition errors — the equivalent of three words in an average sentence. It also provided TC with a “competitor weighted average”: 

Image Credits: Speechmatics (opens in a new window)

There is indeed a massive opportunity here, though, when you consider that between smaller developers and massive, outsized technology giants like Apple, Google, Microsoft and Amazon there are hundreds of giant companies that might not be quite at the level (or interest) of building in-house AI for this purpose, but if you take for example a company like Spotify, are definitely are interested in it, and definitely would prefer not to be reliant on those huge companies, which are also sometimes their competitors, and sometimes their outright foils. (To be clear, Wigdahl did not tell me Spotify was a customer, but said that that is a typical example of the kind of size and situation in which someone might knock on Speechmatics’ door.)

That too has been partly why investors are so keen to fund this company. Susquehanna has a history of backing companies that look like they might give the power players a run for their money (it was an early and big backer of Tik Tok).

“The Speechmatics team are undoubtedly a different pedigree of technologists,” said Jonathan Klahr, managing director of Susquehanna Growth Equity, in a statement. “We started tracking Speechmatics when our portfolio companies told us that again and again Speechmatics win on accuracy against all the other options including those coming from ‘Big Tech’ players. We are primed to work with the team to ensure that more companies can get exposed to and adopt this superior technology.” Klahr is joining the board with this round.

Indeed, as tech becomes more naturalized and those making it look for more ways to reduce any and all friction that there might be around usage of that tech, voice has emerged as a major opportunity point, as well as a pain point. So having tech that works in “reading” and understanding all kinds of voices can potentially get applied in all kinds of ways.

“Our view is voice will become the increasingly dominant human-machine interface and Speechmatics are the category leaders in applying deep learning to speech, with category defining accuracy and understanding across industry use-case and requirements,” added Robert Whitby-Smith, a partner at AlbionVC. “We have witnessed the impressive growth of the team and product over the last few years since our Series A investment in 2019 and as responsible investors we are delighted to support the company’s inclusive mission to understand every voice globally.” 

More TechCrunch

Snowflake is the latest company in a string of high-profile security incidents and sizable data breaches caused by the lack of MFA.

Hundreds of Snowflake customer passwords found online are linked to info-stealing malware

The buy will benefit ChromeOS, Google’s lightweight Linux-based operating system, by giving ChromeOS users greater access to Windows apps “without the hassle of complex installations or updates.”

Google acquires Cameyo to bring Windows apps to ChromeOS

Mistral is no doubt looking to grow revenue as it faces considerable — and growing — competition in the generative AI space.

Mistral launches new services and SDK to let customers fine-tune its models

The warning for the Ai Pin was issued “out of an abundance of caution,” according to Humane.

Humane urges customers to stop using charging case, citing battery fire concerns

The keynote will be focused on Apple’s software offerings and the developers that power them, including the latest versions of iOS, iPadOS, macOS, tvOS, visionOS and watchOS.

Watch Apple kick off WWDC 2024 right here

As WWDC 2024 nears, all sorts of rumors and leaks have emerged about what iOS 18 and its AI-powered apps and features have in store.

What to expect from Apple’s AI-powered iOS 18 at WWDC 2024

Welcome to Elon Musk’s X. The social network formerly known as Twitter where the rules are made up and the check marks don’t matter. Or do they? The Tesla and…

Elon Musk’s X: A complete timeline of what Twitter has become

TechCrunch has kept readers informed regarding Fearless Fund’s courtroom battle to provide business grants to Black women. Today, we are happy to announce that Fearless Fund CEO and co-founder Arian…

Fearless Fund’s Arian Simone coming to Disrupt 2024

Bridgy Fed is one of the efforts aimed at connecting the fediverse with the web, Bluesky and, perhaps later, other networks like Nostr.

Bluesky and Mastodon users can now talk to each other with Bridgy Fed

Zoox, Amazon’s self-driving unit, is bringing its autonomous vehicles to more cities.  The self-driving technology company announced Wednesday plans to begin testing in Austin and Miami this summer. The two…

Zoox to test self-driving cars in Austin and Miami 

Called Stable Audio Open, the generative model takes a text description and outputs a recording up to 47 seconds in length.

Stability AI releases a sound generator

It’s not just instant-delivery startups that are struggling. Oda, the Norway-based online supermarket delivery startup, has confirmed layoffs of 150 jobs as it drastically scales back its expansion ambitions to…

SoftBank-backed grocery startup Oda lays off 150, resets focus on Norway and Sweden

Newsletter platform Substack is introducing the ability for writers to send videos to their subscribers via Chat, its private community feature, the company announced on Wednesday. The rollout of video…

Substack brings video to its Chat feature

Hiya, folks, and welcome to TechCrunch’s inaugural AI newsletter. It’s truly a thrill to type those words — this one’s been long in the making, and we’re excited to finally…

This Week in AI: Ex-OpenAI staff call for safety and transparency

Ms. Rachel isn’t a household name, but if you spend a lot of time with toddlers, she might as well be a rockstar. She’s like Steve from Blues Clues for…

Cameo fumbles on Ms. Rachel fundraiser as fans receive credits instead of videos  

Cartwheel helps animators go from zero to basic movement, so creating a scene or character with elementary motions like taking a step, swatting a fly or sitting down is easier.

Cartwheel generates 3D animations from scratch to power up creators

The new tool, which is set to arrive in Wix’s app builder tool this week, guides users through a chatbot-like interface to understand the goals, intent and aesthetic of their…

Wix’s new tool taps AI to generate smartphone apps

ClickUp Knowledge Management combines a new wiki-like editor and with a new AI system that can also bring in data from Google Drive, Dropbox, Confluence, Figma and other sources.

ClickUp wants to take on Notion and Confluence with its new AI-based Knowledge Base

New York City, home to over 60,000 gig delivery workers, has been cracking down on cheap, uncertified e-bikes that have resulted in battery fires across the city.  Some e-bike providers…

Whizz wants to own the delivery e-bike subscription space, starting with NYC

This is the last major step before Starliner can be certified as an operational crew system, and the first Starliner mission is expected to launch in 2025. 

Boeing’s Starliner astronaut capsule is en route to the ISS 

TechCrunch Disrupt 2024 in San Francisco is the must-attend event for startup founders aiming to make their mark in the tech world. This year, founders have three exciting ways to…

Three ways founders can shine at TechCrunch Disrupt 2024

Google’s newest startup program, announced on Wednesday, aims to bring AI technology to the public sector. The newly launched “Google for Startups AI Academy: American Infrastructure” will offer participants hands-on…

Google’s new startup program focuses on bringing AI to public infrastructure

eBay’s newest AI feature allows sellers to replace image backgrounds with AI-generated backdrops. The tool is now available for iOS users in the U.S., U.K., and Germany. It’ll gradually roll…

eBay debuts AI-powered background tool to enhance product images

If you’re anything like me, you’ve tried every to-do list app and productivity system, only to find yourself giving up sooner rather than later because managing your productivity system becomes…

Hoop uses AI to automatically manage your to-do list

Asana is using its work graph to train LLMs with the goal of creating AI assistants that work alongside human employees in company workflows.

Asana introduces ‘AI teammates’ designed to work alongside human employees

Taloflow, an early stage startup changing the way companies evaluate and select software, has raised $1.3M in a seed round.

Taloflow puts AI to work on software vendor selection to reduce costs and save time

The startup is hoping its durable filters can make metals refining and battery recycling more efficient, too.

SiTration uses silicon wafers to reclaim critical minerals from mining waste

Spun out of Bosch, Dive wants to change how manufacturers use computer simulations by both using modern mathematical approaches and cloud computing.

Dive goes cloud-native for its computational fluid dynamics simulation service

The tension between incumbents and fintechs has existed for decades. But every once in a while, the two groups decide to put their competition aside and work together. In an…

When foes become friends: Capital One partners with fintech giants Stripe, Adyen to prevent fraud

After growing 500% year-over-year in the past year, Understory is now launching a product focused on the renewable energy sector.

Insurance provider Understory gets into renewable energy following $15M Series A