Transportation

Parallel Domain says autonomous driving won’t scale without synthetic data

Comment

parallel domain team photo
Image Credits: Parallel Domain

Achieving autonomous driving safely requires near endless hours of training software on every situation that could possibly arise before putting a vehicle on the road. Historically, autonomy companies have collected hordes of real-world data with which to train their algorithms, but it’s impossible to train a system how to handle edge cases based on real-world data alone. Not only that, but it’s time-consuming to even collect, sort and label all that data in the first place.

Most self-driving vehicle companies, like Cruise, Waymo and Waabi, use synthetic data for training and testing perception models with speed and a level of control that’s impossible with data collected from the real world. Parallel Domain, a startup that has built a data-generation platform for autonomy companies, says synthetic data is a critical component to scaling the AI that powers vision and perception systems and preparing them for the unpredictability of the physical world.

The startup just closed a $30 million Series B led by March Capital, with participation from return investors Costanoa Ventures, Foundry Group, Calibrate Ventures and Ubiquity Ventures. Parallel Domain has been focused on the automotive market, supplying synthetic data to some of the major OEMs that are building advanced driver assistance systems and autonomous driving companies building much more advanced self-driving systems. Now, Parallel Domain is ready to expand into drones and mobile computer vision, according to co-founder and CEO Kevin McNamara.

“We’re also really doubling down on generative AI approaches for content generation,” McNamara told TechCrunch. “How can we use some of the advancements in generative AI to bring a much broader diversity of things and people and behaviors into our worlds? Because again, the hard part here is really, once you have a physically accurate renderer, how do you actually go build the million different scenarios a car is going to need to encounter?”

The startup also wants to hire a team to support its growing customer base across North America, Europe and Asia, according to McNamara.

Virtual world building

A sample of Parallel Domain's synthetic data
A sample of Parallel Domain’s synthetic data. Image Credit: Parallel Domain

When Parallel Domain was founded in 2017, the startup was hyper focused on creating virtual worlds based on real-world map data. Over the past five years, Parallel Domain has added to its world generation by filling it with cars, people, different times of day, weather and all the range of behaviors that make those worlds interesting. This enables customers — of which Parallel Domain counts Google, Continental, Woven Planet and Toyota Research Institute — to generate dynamic camera, radar and lidar data that they would need to actually train and test their vision and perception systems, said McNamara. 

Parallel Domain’s synthetic data platform consists of two modes: training and testing. When training, customers will describe high-level parameters — for example, highway driving with 50% rain, 20% at night and an ambulance in every sequence — on which they want to train their model, and the system will generate hundreds of thousands of examples to meet those parameters.

On the testing side, Parallel Domain offers an API that allows the customer to control the placement of dynamic things in the world, which can then be hooked up to their simulator to test specific scenarios.

Waymo, for example, is particularly keen on using synthetic data to test for different weather conditions, the company told TechCrunch. (Disclaimer: Waymo is not a confirmed Parallel Domain customer.) Waymo sees weather as a new lens it can apply to all the miles it has driven in the real world and in simulation, since it would be impossible to recollect all those experiences with arbitrary weather conditions.

Whether it’s testing or training, whenever Parallel Domain’s software creates a simulation, it is able to automatically generate labels to correspond with each simulated agent. This helps machine learning teams do supervised learning and testing without having to go through the arduous process of labeling data themselves.

Parallel Domain envisions a world in which autonomy companies use synthetic data for most, if not all, of their training and testing needs. Today, the ratio of synthetic to real-world data varies from company to company. More established businesses with the historical resources to have collected lots of data are using synthetic data for about 20% to 40% of their needs, whereas companies that are earlier in their product development process are relying 80% on synthetic versus 20% real world, according to McNamara.

Julia Klein, partner at March Capital and now one of Parallel Domain’s board members, said she thinks synthetic data will play a critical role in the future of machine learning. 

“Obtaining the real-world data that you need to train computer vision models is oftentimes an obstacle and there’s hold ups in terms of being able to get that data in, to label that data, to get it ready to a position where it can actually be used,” Klein told TechCrunch. “What we’ve seen with Parallel Domain is that they’re expediting that process considerably, and they’re also addressing things that you may not even get in real world datasets.”

More TechCrunch

Long-time Android Engineering VP Dave Burke said today that he is stepping down from the role. Burke, who spent 14 years building Android, is not leaving Alphabet and is exploring…

Android Engineering VP Dave Burke steps down, as he explores “AI/bio” roles within the company

When Jordan Nathan launched his DTC nontoxic cookware company, Caraway, in 2019, he knew he was not the only founder trying to sell a new brand of pots and pans…

Why being the last company to launch in a category can pay off

Out of an abundance of caution, the car took two minutes to turn a corner.

This humanoid robot can drive cars — sort of

There has been a silly amount of drama in the run-up to Tesla‘s annual shareholder meeting on Thursday. The company is set to hold a vote on “re-ratifying” the $56…

Ahead of Tesla’s big shareholder vote, let’s re-read the judge’s opinion that got us here

To give users more control over the contacts an app can and cannot access, the permissions screen has two stages.

iOS 18 cracks down on apps asking for full address book access

The push to produce a robotic intelligence that can fully leverage the wide breadth of movements opened up by bipedal humanoid design has been a key topic for researchers.

Generative AI takes robots a step closer to general purpose

A TechCrunch review of LinkedIn data found that Ford has built this team up to around 300 employees over the last year.

Ford’s secretive, low-cost EV team is growing with talent from Rivian, Tesla and Apple

The most critical systems of our modern world rely on GPS, from aviation and road networks to emergency and disaster response, from precision farming and power grids to weather forecasting…

Tern AI wants to reduce reliance on GPS with low-cost navigation alternative 

Since fintech startup Brex’s inception in 2017, its two co-founders Henrique Dubugras and Pedro Franceschi have run the company as co-CEOs. But starting today, the pair told TechCrunch in an…

Fintech Brex abandons co-CEO model, talks IPO, cash burn and plans for a secondary sale

Hiya, folks, and welcome to TechCrunch’s regular AI newsletter. This week in AI, Apple stole the spotlight. At the company’s Worldwide Developers Conference (WWDC) in Cupertino, Apple unveiled Apple Intelligence,…

This Week in AI: Apple won’t say how the sausage gets made

India’s largest wealth manager focused on ultra-high-net-worth individuals, 360 One WAM, has agreed to acquire popular Indian mutual fund investment app ET Money for about $44 million. Earlier called IIFL…

India’s 360 One acquires mutual fund app ET Money for $44M

Helen Toner, a former OpenAI board member and the director of strategy at Georgetown’s Center for Security and Emerging Technology, is worried Congress might react in a “knee-jerk” way where…

Helen Toner worries ‘not super functional’ Congress will flub AI policy

Layoffs are tough. This year alone, we’ve already seen 60,000 job cuts across 254 companies according to layoffs.fyi. Looking for ways to grow your network can be even harder during…

Layoffs Got You Down? Get a Half-Price Expo+ Pass at Disrupt 2024

YouTube announced this week the rollout of “Thumbnail Test & Compare,” a new tool for creators to see which thumbnail performs the best. The feature first launched to select creators…

YouTube creators can now test multiple video thumbnails

Waymo has voluntarily issued a software recall to all 672 of its Jaguar I-Pace robotaxis after one of them collided with a telephone pole. This is Waymo’s second recall. The…

Waymo issues second recall after robotaxi hit telephone pole

The hotel guest management technology company’s platform digitizes the hotel guest journey from post-booking through checkout.

Insight Partners backs Canary Technologies’ mission to elevate hotel guest experiences

The TechCrunch team runs down all of the biggest news from the Apple WWDC 2024 keynote in an easy-to-skim digest.

Here’s everything Apple announced at the WWDC 2024 keynote, including Apple Intelligence, Siri makeover

InScope leverages machine learning and large language models to provide financial reporting and auditing processes for mid-market and enterprises.

Lightspeed Venture Partners leads $4.3M seed in automated financial reporting fintech InScope

Venture fundraising has been a slog over the last few years, even for firms with a strong track record. That’s Foresite Capital’s experience. Despite having 47 IPOs, 28 M&As and…

Foresite Capital raises $900M sixth fund for investing in life sciences companies

A year ago, Databricks acquired MosaicML for $1.3 billion. Now rebranded as Mosaic AI, the platform has become integral to Databricks’ AI solutions. Today, at the company’s Data + AI…

Databricks expands Mosaic AI to help enterprises build with LLMs

RetailReady targets the $40 billion compliance market to help reduce the number of retail compliance losses that shippers incur annually due to incorrectly shipped packages.

YC grad RetailReady raises $3.3M for an AI warehouse app that hopes to save brands billions

Since its launch in 2013, Databricks has relied on its ecosystem of partners, such as Fivetran, Rudderstack, and dbt, to provide tools for data preparation and loading. But now, at…

Databricks launches LakeFlow to help its customers build their data pipelines

A big shoutout to the early-stage founders who missed the application window for the Startup Battlefield 200 (SB 200) at TechCrunch Disrupt. We have exciting news just for you! You…

Bonus: An extra week to apply to Startup Battlefield 200

When one of the co-creators of the popular open source stream-processing framework Apache Flink launches a new startup, it’s worth paying attention. Stephan Ewen was among the founding team of…

Restate raises $7M for its lightweight workflows-as-code platform

With most residential solar panels installed by smaller companies, customer experience can be a mixed bag. To try to address the quality and consistency problem, Civic Renewables is buying small…

Civic Renewables is rolling up residential solar installers to improve quality and grow the market

Small VC firms require deep trust, mutual support and long-term commitment among the partners — a kinship that, in many ways, resembles a family dynamic. Colin Anderson (Palantir’s ex-CFO and…

Friends & Family Capital, a fund founded by ex-Palantir CFO and son of IVP’s founder, unveils third $118M fund

Fisker is issuing the first recall for its all-electric Ocean SUV because of problems with the warning lights, according to new information published by the National Highway Traffic Safety Administration…

Fisker’s troubled Ocean SUV gets its first recall

Gorilla, a Belgian company that serves the energy sector with real-time data and analytics for pricing and forecasting, has raised €23 million ($25 million) in a Series B round led…

Gorilla, a Belgian startup that helps energy providers crunch big data, raises $25M

South Korea’s fabless AI chip industry saw a slew of fundraising events over the last couple of years as demand for hardware to power AI applications skyrocketed, and it seems…

Fabless AI chip makers Rebellions and Sapeon to merge as competition heats up in global AI hardware industry

Here’s a list of third-party apps that were Sherlocked by Apple at this year’s WWDC.

The apps that Apple sherlocked at WWDC 2024