Enterprise

MOSTLY AI raises $25 million to further commercialize synthetic data in Europe and the US

Comment

Computer Processor Processing Artificial Intelligence Data. Glowing Chip. Computer And Technology Related 3D Illustration Render.
Image Credits: yucelyilmaz / Getty Images

Austrian synthetic data startup MOSTLY AI today announced that it has raised a $25 million Series B round. British VC firm Molten Ventures led the operation, with participation from new investor Citi Ventures. Two existing investors also returned: Munich-based 42CAP, and Berlin-based Earlybird, which had led MOSTLY AI’s $5 million Series A round in 2020.

Synthetic data is fake data, but not random: MOSTLY AI uses artificial intelligence to achieve a high degree of fidelity to its clients’ databases. Its data sets “look just as real as a company’s original customer data with just as many details, but without the original personal data points,” the company says.

Talking to TechCrunch, MOSTLY AI CEO Tobias Hann said that the company plans to use the proceeds to push the boundaries of what its product can do, grow its team and gain more customers both in Europe and in the U.S., where it already has offices in New York City.

MOSTLY AI was founded in Vienna in 2017, and the General Data Protection Regulation (GDPR) was implemented across the EU one year later. This demand for privacy-preserving solutions and the concomitant rise of machine learning have created significant momentum for synthetic data. Gartner predicts that by 2024, 60% of the data used for the de­vel­op­ment of AI and an­a­lyt­ics projects will be syn­thet­i­cally gen­er­ated.

MOSTLY AI’s typical clients are Fortune 100 banks and insurers, as well as telcos. These three highly regulated sectors drive most of the demand for synthetic tabular data, alongside healthcare.

Unlike some of its competitors, MOSTLY AI hasn’t put its focus on healthcare in the past, but it could change. “It’s certainly something that we are watching closely and we are actually starting some pilot projects this year,” the CEO said.

The democratization of AI means that synthetic data will eventually be used well beyond Fortune 100 companies, Hann told TechCrunch. His company therefore plans to serve smaller organizations and a wider range of sectors in the future. But until now, it made sense for MOSTLY AI to focus on enterprise-level clients.

At the moment, enterprise companies are the ones that have the budgets, need and sophistication to work with synthetic data, Hann said. To match their expectations, MOSTLY AI obtained ISO certifications.

Talking to Hann, one thing becomes clear: While the startup has a solid technical footing, it is equally invested in the commercialization of its technology and in the business value it can add for its clients. “MOSTLY AI is leading this emerging and rapidly-growing space in terms of both customer deployments and expertise,” Molten Ventures’ investment director Christoph Hornung said.

The need to comply with privacy laws such as the GDPR and CCPA clearly drives demand for synthetic data, but it’s not the only factor at play. For instance, demand in Europe is also driven by a wider cultural context; while in the U.S., it also results from a desire to innovate. For instance, use cases can include advanced analytics, predictive algorithms, fraud detection and pricing models — but without data that can be traced back to specific users.

“Many companies are proactively approaching the space because they understand that customers value privacy,” Hann said. “These companies understand that they can also gain a competitive advantage when dealing and working with data in a privacy-preserving way.”

Seeing more U.S. companies wanting to adopt synthetic data in innovative ways is the key reason MOSTLY AI wants to grow its team in the U.S. But it is also recruiting more generally, both in Vienna and remotely. Its plan is to increase its headcount from 35 to 65 people by the end of the year.

Hann expects 2022 to be “the year where synthetic data will take off,” and beyond this year, “a really strong decade for synthetic data.” This will be supported by growing demand for responsible AI, articulated around key concepts such as AI fairness and explainability. Synthetic data helps answer these challenges. “It enables enterprises to augment and de-bias their data sets,” Hann said.

Machine learning aside, MOSTLY AI sees lots of potential for synthetic data to be leveraged in software testing. Supporting these use cases requires making synthetic data accessible not only to data scientists, but also to software engineers and quality testers. It’s with them in mind that MOSTLY AI came up a few months ago with version 2.0 of its platform. “MOSTLY AI 2.0 can be implemented on premise or in a private cloud, and adapts to different data structures of the company using it,” the company wrote at the time.

“We are clearly a B2B software infrastructure company,” Hann said. Both in its Series A and B rounds, the company looked for investors who understood that approach.

Molten Ventures being a publicly listed VC and consequently not subject to typical funding cycles also carried some weight, Hann confirmed when I asked. “Having this long-term commitment from a partner is something that was very appealing to us, because it’s a little more flexible.”

It doesn’t hurt either that Citi Ventures is the venture arm of Citigroup, and that it is headquartered in the U.S. “We’re significantly increasing the team in the U.S., and it’s always great to also have a U.S.-based investor that can help with network and relationships there,” Hann said.

With $25 million in new funding and an increased U.S. presence, MOSTLY AI will now have more resources to compete against other companies in its segment of the synthetic data space. These include Tonic.ai, which raised a $35 million Series B last September; Gretel AI, which disclosed a $50 million Series B round last October; and seed-funded British startup Hazy, as well as players that focus on specific verticals.

“We do see more and more players emerging in the space and in the market in general, so it certainly shows that there’s a lot of interest there,” Hann said.

More TechCrunch

Mike Krieger, one of the co-founders of Instagram and, more recently, the co-founder of personalized news app Artifact (which TechCrunch corporate parent Yahoo recently acquired), is joining Anthropic as the…

Anthropic hires Instagram co-founder as head of product

Seven firms so far have signed on to standardize the way data is collected and shared.

Venture firms form alliance to standardize data collection

As cloud adoption continues to surge towards the $1 trillion mark in annual spend, we’re seeing a wave of enterprise startups gaining traction with customers and investors for tools to…

Alkira connects with $100M for a solution that connects your clouds

Charging has long been the Achilles’ heel of electric vehicles. One startup thinks it has a better way for apartment dwelling EV drivers to charge overnight.

Orange Charger thinks a $750 outlet will solve EV charging for apartment dwellers

So did investors laugh them out of the room when they explained how they wanted to replace Quickbooks? Kind of.

Embedded accounting startup Layer secures $2.3M toward goal of replacing Quickbooks

While an increasing number of companies are investing in AI, many are struggling to get AI-powered projects into production — much less delivering meaningful ROI. The challenges are many. But…

Weka raises $140M as the AI boom bolsters data platforms

PayHOA, a previously bootstrapped Kentucky-based startup that offers software for self-managed homeowner associations (HOAs), is an example of how real-world problems can translate into opportunity. It just raised a $27.5…

Meet PayHOA, a profitable and once-bootstrapped SaaS startup that just landed a $27.5M Series A

Restaurant365, which offers a restaurant management suite, has raised a hot $175M from ICONIQ Growth, KKR and L Catterton.

Restaurant365 orders in $175M at $1B+ valuation to supersize its food service software stack 

Venture firm Shilling has launched a €50M fund to support growth-stage startups in its own portfolio and to invest in startups everywhere else. 

Portuguese VC firm Shilling launches €50M opportunity fund to back growth-stage startups

Chang She, previously the VP of engineering at Tubi and a Cloudera veteran, has years of experience building data tooling and infrastructure. But when She began working in the AI…

LanceDB, which counts Midjourney as a customer, is building databases for multimodal AI

Trawa simplifies energy purchasing and management for SMEs by leveraging an AI-powered platform and downstream data from customers. 

Berlin-based trawa raises €10M to use AI to make buying renewable energy easier for SMEs

Lydia is splitting itself into two apps — Lydia for P2P payments and Sumeria for those looking for a mobile-first bank account.

Lydia, the French payments app with 8 million users, launches mobile banking app Sumeria

Cargo ships docking at a commercial port incur costs called “disbursements” and “port call expenses.” This might be port dues, towage, and pilotage fees. It’s a complex patchwork and all…

Shipping logistics startup Harbor Lab raises $16M Series A led by Atomico

AWS has confirmed its European “sovereign cloud” will go live by the end of 2025, enabling greater data residency for the region.

AWS confirms will launch European ‘sovereign cloud’ in Germany by 2025, plans €7.8B investment over 15 years

Go Digit, an Indian insurance startup, has raised $141 million from investors including Goldman Sachs, ADIA, and Morgan Stanley as part of its IPO.

Indian insurance startup Go Digit raises $141M from anchor investors ahead of IPO

Peakbridge intends to invest in between 16 and 20 companies, investing around $10 million in each company. It has made eight investments so far.

Food VC Peakbridge has new $187M fund to transform future of food, like lab-made cocoa

For over six decades, the nonprofit has been active in the financial services sector.

Accion’s new $152.5M fund will back financial institutions serving small businesses globally

Meta’s newest social network, Threads, is starting its own fact-checking program after piggybacking on Instagram and Facebook’s network for a few months.

Threads finally starts its own fact-checking program

Looking Glass makes trippy-looking mixed-reality screens that make things look 3D without the need of special glasses. Today, it launches a pair of new displays, including a 16-inch mode that…

Looking Glass launches new 3D displays

Replacing Sutskever is Jakub Pachocki, OpenAI’s director of research.

Ilya Sutskever, OpenAI co-founder and longtime chief scientist, departs

Intuitive Machines made history when it became the first private company to land a spacecraft on the moon, so it makes sense to adapt that tech for Mars.

Intuitive Machines wants to help NASA return samples from Mars

As Google revamps itself for the AI era, offering AI overviews within its search results, the company is introducing a new way to filter for just text-based links. With the…

Google adds ‘Web’ search filter for showing old-school text links as AI rolls out

Blue Origin’s New Shepard rocket will take a crew to suborbital space for the first time in nearly two years later this month, the company announced on Tuesday.  The NS-25…

Blue Origin to resume crewed New Shepard launches on May 19

This will enable developers to use the on-device model to power their own AI features.

Google is building its Gemini Nano AI model into Chrome on the desktop

It ran 110 minutes, but Google managed to reference AI a whopping 121 times during Google I/O 2024 (by its own count). CEO Sundar Pichai referenced the figure to wrap…

Google mentioned ‘AI’ 120+ times during its I/O keynote

Firebase Genkit is an open source framework that enables developers to quickly build AI into new and existing applications.

Google launches Firebase Genkit, a new open source framework for building AI-powered apps

In the coming months, Google says it will open up the Gemini Nano model to more developers.

Patreon and Grammarly are already experimenting with Gemini Nano, says Google

As part of the update, Reddit also launched a dedicated AMA tab within the web post composer.

Reddit introduces new tools for ‘Ask Me Anything,’ its Q&A feature

Here are quick hits of the biggest news from the keynote as they are announced.

Google I/O 2024: Here’s everything Google just announced

LearnLM is already powering features across Google products, including in YouTube, Google’s Gemini apps, Google Search and Google Classroom.

LearnLM is Google’s new family of AI models for education