Startups

Tonic is betting that synthetic data is the new big data to solve scalability and security

Comment

Image Credits: Vertigo3d (opens in a new window) / Getty Images

Big data is a sham. For years now, we have been told that every company should save every last morsel of digital exhaust in some sort of database, lest management lose some competitive intelligence against … a competitor, or something.

There is just one problem with big data though: It’s honking huge.

Processing petabytes of data to generate business insights is expensive and time-consuming. Worse, all that data hanging around paints a big, bright red target on the back of the company for every hacker group in the world. Big data is expensive to maintain, expensive to protect and expensive to keep private. And the upshot might not be all that much in the end after all — oftentimes, well-curated and chosen data sets can provide faster and better insight than endless quantities of raw data.

What should a company do? Well, they need a Tonic to ameliorate their big data sins.

Tonic is a “synthetic data” platform that transforms raw data into more manageable and private data sets usable by software engineers and business analysts. Along the way, Tonic’s algorithms de-identify the original data and create statistically identical but synthetic data sets, which means that personal information isn’t shared insecurely.

For instance, an online shopping platform will have transaction history on its customers and what they purchased. Sharing that data with every engineer and analyst in the company is dangerous, since that purchase history could have personally identifying details to which no one without a need-to-know should have access. Tonic could take that original payments data and transform it into a new, smaller data set with exactly the same statistical properties, but not tied to original customers. That way, an engineer could test their app or an analyst could test their marketing campaign, all without triggering concerns about privacy.

Synthetic data and other ways to handle the privacy of large data sets has garnered massive attention from investors in recent months. We reported last week on Skyflow, which raised a round to use polymorphic encryption to ensure that employees only have access to the data they need and are blocked from accessing the rest. BigID takes a more overarching view of just tracking what data is where and who should have access to it (i.e. data governance) based on local privacy laws.

Tonic’s approach has the benefit of helping solve not just privacy issues, but also scalability challenges as data sets get larger and larger in size. That combination has attracted the attention of investors: This morning, the company announced that it has raised $8 million in a Series A led by Glenn Solomon and Oren Yunger of GGV, the latter of whom will join the company’s board.

The company was founded in 2018 by a quad of founders: CEO Ian Coe worked with COO Karl Hanson (they first met in middle school as well) and CTO Andrew Colombi while they were all working at Palantir, and Coe also formerly worked with the company’s head of engineering Adam Kamor while at Tableau. That training at some of the largest and most successful data infrastructure companies from the Valley forms part of the product DNA for Tonic.

Tonic’s team. Photo via Tonic.

Coe explained that Tonic is designed to prevent some of the most obvious security flaws that arise in modern software engineering. In addition to saving data pipelining time for engineering teams, Tonic “also means that they’re not worried about sensitive data going from production environments to lower environments that are always less secure than your production systems.”

He said that the idea for what would become Tonic originated while troubleshooting problems at a Palantir banking client. They needed data to solve a problem, but that data was super sensitive, and so the team ended up using synthetic data to bridge the difference. Coe wants to expand the utility of synthetic data to more people in a more rigorous way, particularly given the legal changes these days. “I think regulatory pressure is really pushing teams to change their practices” around data, he noted.

The key to Tonic’s technology is its subsetter, which evaluates raw data and starts to statistically define the relationships between all the records. Some of that analysis is automated depending on the data sources, and when it can’t be automated, Tonic’s UI can help a data scientist onboard data sets and define those relationships manually. In the end, Tonic generates these synthetic data sets usable by all the customers of that data inside a company.

With the new round of funding, Coe wants to continue doubling down on ease-of-use and onboarding and proselytizing the benefit of this model for his clients. “In a lot of ways, we’re creating a category, and that means that people have to understand and also get the value [and have] the early-adopter mindset,” he said.

In addition to lead investor GGV, Bloomberg Beta, Xfund, Heavybit and Silicon Valley CISO Investments participated in the round, as well as angels Assaf Wand and Anthony Goldbloom.

Skyflow raises $17.5M more to help companies protect your personal data

More TechCrunch

The National Democratic Alliance (NDA) has emerged victorious in India’s 2024 general election, but with a smaller majority compared to 2019. According to post-election analysis by Goldman Sachs, JP Morgan,…

Modi-led coalition’s election win signals policy continuity in India – but also spending cuts

Featured Article

A comprehensive list of 2024 tech layoffs

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according to independent layoffs tracker Layoffs.fyi. Companies like Tesla, Amazon, Google, TikTok, Snap and Microsoft have conducted sizable layoffs in the…

10 hours ago
A comprehensive list of 2024 tech layoffs

Featured Article

What to expect from WWDC 2024: iOS 18, macOS 15 and so much AI

Apple is hoping to make WWDC 2024 memorable as it finally spells out its generative AI plans.

11 hours ago
What to expect from WWDC 2024: iOS 18, macOS 15 and so much AI

We just announced the breakout session winners last week. Now meet the roundtable sessions that really “rounded” out the competition for this year’s Disrupt 2024 audience choice program. With five…

The votes are in: Meet the Disrupt 2024 audience choice roundtable winners

The malicious attack appears to have involved malware transmitted through TikTok’s DMs.

TikTok acknowledges exploit targeting high-profile accounts

It’s unusual for three major AI providers to all be down at the same time, which could signal a broader infrastructure issues or internet-scale problem.

AI apocalypse? ChatGPT, Claude and Perplexity all went down at the same time

Welcome to TechCrunch Fintech! This week, we’re looking at LoanSnap’s woes, Nubank’s and Monzo’s positive milestones, a plethora of fintech fundraises and more! To get a roundup of TechCrunch’s biggest…

A look at LoanSnap’s troubles and which neobanks are having a moment

Databricks, the analytics and AI giant, has acquired data management company Tabular for an undisclosed sum. (CNBC reports that Databricks paid over $1 billion.) According to Tabular co-founder Ryan Blue,…

Databricks acquires Tabular to build a common data lakehouse standard

ChatGPT, OpenAI’s text-generating AI chatbot, has taken the world by storm. What started as a tool to hyper-charge productivity through writing essays and code with short text prompts has evolved…

ChatGPT: Everything you need to know about the AI-powered chatbot

The next few weeks could be pivotal for Worldcoin, the controversial eyeball-scanning crypto venture co-founded by OpenAI’s Sam Altman, whose operations remain almost entirely shuttered in the European Union following…

Worldcoin faces pivotal EU privacy decision within weeks

OpenAI’s chatbot ChatGPT has been down for several users across the globe for the last few hours.

OpenAI fixes the issue that caused ChatGPT outage for several hours

True Fit, the AI-powered size-and-fit personalization tool, has offered its size recommendation solution to thousands of retailers for nearly 20 years. Now, the company is venturing into the generative AI…

True Fit leverages generative AI to help online shoppers find clothes that fit

Audio streaming service TuneIn is teaming up with Discord to bring free live radio to the platform. This is TuneIn’s first collaboration with a social platform and one that is…

Discord and TuneIn partner to bring live radio to the social platform

The early victors in the AI gold rush are selling the picks and shovels needed to develop and apply artificial intelligence. Just take a look at data-labeling startup Scale AI…

Scale AI founder Alexandr Wang is coming to Disrupt 2024

Try to imagine the number of parts that go into making a rocket engine. Now imagine requesting and comparing quotes for each of those parts, getting approvals to purchase the…

Engineer brothers found Forge to modernize hardware procurement

Raspberry Pi has released a $70 AI extension kit with a neural network inference accelerator that can be used for local inferencing, for the Raspberry Pi 5.

Raspberry Pi partners with Hailo for its AI extension kit

When Stacklet’s founders, Travis Stanfield and Kapil Thangavelu, came out of Capital One in 2020 to launch their startup, most companies weren’t all that concerned with constraining cloud costs. But…

Stacklet sees demand grow as companies take cloud cost control more seriously

Fivetran’s Managed Data Lake Service aims to remove the repetitive work of managing data lakes.

Fivetran launches a managed data lake service

Lance Riedel and Nigel Daley both spent decades in search discovery, but it was while working at Pinterest that they began trying to understand how to use search engines to…

How a couple of former Pinterest search experts caught Biz Stone’s attention

GetWhy helps businesses carry out market studies and extract insights from video-based interviews using AI.

GetWhy, a market research AI platform that extracts insights from video interviews, raises $34.5M

AI-powered virtual physical therapy platform Sword Health has seen its valuation soar 50% to $3 billion.

Sword Health raises $130M and its valuation soars to $3B

Jeffrey Katzenberg and Sujay Jaswa, along with three general partners, manage $1.5 billion in assets today through their Build, Venture and Seed strategies.

WndrCo officially gets into venture capital with fresh $460M across two funds

The startup targets the middle ground between platforms that offer rigid templates, and those that facilitate a full-control approach.

Storyblok raises $80M to add more AI to its ‘headless’ CMS aimed at non-technical people

The startup has been pursuing a ground-up redesign of a well-understood technology.

‘Star Wars’ lasers and waterfalls of molten salt: How Xcimer plans to make fusion power happen

Sēkr, a startup that offers a mobile app for outdoor enthusiasts and campers, is launching a new AI tool for planning road trips. The new tool, called Copilot, is available…

Travel app Sēkr can plan your next road trip with its new AI tool

Microsoft’s education-focused flavor of its cloud productivity suite, Microsoft 365 Education, is facing investigation in the European Union. Privacy rights nonprofit noyb has just lodged two complaints with Austria’s data…

Microsoft hit with EU privacy complaints over schools’ use of 365 Education suite

Since the shock of Russia’s 2022 invasion of Ukraine, solar energy has been having a moment in Europe. Electricity prices have been going up while the investment required to get…

Samara is accelerating the energy transition in Spain one solar panel at a time

Featured Article

DEI backlash: Stay up-to-date on the latest legal and corporate challenges

It’s clear that this year will be a turning point for DEI.

1 day ago
DEI backlash: Stay up-to-date on the latest legal and corporate challenges

The keynote will be focused on Apple’s software offerings and the developers that power them, including the latest versions of iOS, iPadOS, macOS, tvOS, visionOS and watchOS.

Watch Apple kick off WWDC 2024 right here

Hello and welcome back to TechCrunch Space. Unfortunately, Boeing’s Starliner launch was delayed yet again, this time due to issues with one of the three redundant computers used by United…

TechCrunch Space: China’s victory