Enterprise

Predibase exits stealth with a low-code platform for building AI models

Comment

abstract multicolored wave length
Image Credits: MR.Cole_Photographer / Getty Images

Data science teams are stymied by disorganization at their companies, impacting efforts to deploy timely AI and analytics projects. In a recent survey of “data executives” at U.S.-based companies, 44% said that they’ve not hired enough, were too siloed off to be effective and haven’t been given clear roles. Respondents said that they were most concerned about the impact of a revenue loss or hit to brand reputation stemming from failing AI systems and a trend toward splashy investments with short-term payoffs.

These are ultimately organizational challenges. But Piero Molino, the co-founder of AI development platform Predibase, says that inadequate tooling often exacerbates them.

“The major challenges we see today in the industry are that machine learning projects tend to have elongated time-to-value and very low access across an organization. As a result, most machine learning tasks in an organization are bottlenecked on an oversubscribed centralized data science team,” Molino told TechCrunch via email. “Given these challenges, organizations today need to choose between two flawed approaches when it comes to developing machine learning. They can build their own systems from data to deployment using low-level APIs that give them the flexibility machine learning tasks typically require at the cost of complexity. Or they can choose to use a blackbox off-the-shelf ‘AutoML’ solution that simplifies their problem at the expense of flexibility and control.”

The market for synthetic data is bigger than you think

Indeed, while worldwide spending on AI technologies was estimated at $35.8 billion in 2019, nearly 80% of companies have seen their AI projects stall as a result of issues with data quality and a lack of confidence in AI systems, according to an Alegion report. Being an entrepreneur (and a salesperson), Molino asserts that his product, Predibase, is a solution to this — or at least a step toward one.

Predibase, which today emerged from stealth with $16.25 million in Series A funding led by Greylock with participation from the Factory and angel investors, allows a user to specify an AI system as a file that tells the platform what the user wants (e.g., recognizing objects in an image) and figures out a way to fill that need. Molino describes it as a “declarative” approach to AI development, borrowing a term from computer science that refers to code written to describe what a developer wishes to accomplish.

“Machine learning projects today usually take six months to a year at most organizations we’ve worked with. We want to drastically reduce that [by bringing] a low-code but high-ceiling machine learning tool to organizations” Molino continued. “Typically, most companies are bottlenecked by data science resources, meaning product and analyst teams are blocked by a scarce and expensive resource. With Predibase, we’ve seen engineers and analysts build and operationalize models directly.”

Predibase is built on top of open source technologies including Horovod, a framework for AI model training, and Ludwig, a suite of machine learning tools. Both were originally developed at Uber, which several years ago transitioned governance of the projects to the Linux Foundation.

Molino, who joined Uber by way of the company’s acquisition of startup Geometric Intelligence, helped to create Ludwig in 2019. Predibase’s other co-founder, Travis Addair, was the lead maintainer for Horovod while working as a senior software engineer at Uber.

To launch Predibase, Molino and Addair teamed up with former Google Cloud AI product manager Devvret Rishi and Stanford computer science professor Chris Ré, one of the co-founders of Lattice.io, a data mining and machine learning company that Apple purchased in 2017.

Predibase is designed to enable developers to define AI pipelines in just a few lines of code while scaling up to petabytes of data across thousands of machines. As Molino explains it, using the platform, a user can create a text-analyzing AI system in six lines of code that specifies the input and output data. If they want to iterate and customize that system, Predibase lets them add parameters in the configuration file that affords a more granular level of control.

Predibase integrates with data sources including Snowflake, Google BigQuery and Amazon S3 for model training. Users can train models through the platform or programmatically, depending on the use case, and then host and serve or deploy those models into local production environments.

“Apart from lowering time to value, Predibase allows users to work with different modalities of data using the same toolset. With Predibase, we’ve seen users train models on images for classification, text data like emails for triage, tabular data for detection and regression tasks, and even audio datasets that would’ve required heavy in-house sophistication without the native capabilities in the platform,” Molino said. “For many working in this space, Predibase provides a net new capability when tackling use cases on unstructured data.”

Broadly speaking, no-code development platforms are on the rise, and a number of startups compete directly with Predibase, including AI orchestration startup Union.ai and low-code data engineering platform Prophecy (not to mention SageMaker and Vertex AI). But Molino’s view is that while rivals satisfy the demand in the enterprise for simple solutions, they do so at the cost of flexibility, leading customers to “hit a ceiling and churn out.”

“[L]ike infrastructure as code simplified IT, our platform allows users to focus on the ‘what’ of their models rather than the ‘how,’ allowing them to break free of the usual limits of low-code systems using an extensible configuration … We provide model explainability out of the box so users can understand which features are driving predictions,” he said. “[Our platform] has been used at Fortune 500 companies like a leading U.S. tech company, a large national bank and large U.S. healthcare company.”

The pitch sufficiently impressed angels like Kaggle CEO Anthony Goldbloom and former Intel AI COO Remi El-Ouazzane, both of whom invested. Other notable backers include Kaggle CTO Ben Hamner and Zoubin Ghahramani, a professor of information engineering at Cambridge and senior research scientist at Google Brain.

Molino says that the fresh capital from the Series A will be used to take Predibase’s beta product to a wider market — it’s currently invite only. It’ll also be put toward growing Predibase’s team of machine learning engineers and building out a go-to-market organization, expanding the company’s 21-person team.

More TechCrunch

Featured Article

A comprehensive list of 2024 tech layoffs

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according to independent layoffs tracker Layoffs.fyi. Companies like Tesla, Amazon, Google, TikTok, Snap and Microsoft have conducted sizable layoffs in the…

4 hours ago
A comprehensive list of 2024 tech layoffs

Featured Article

What to expect from WWDC 2024: iOS 18, macOS 15 and so much AI

Apple is hoping to make WWDC 2024 memorable as it finally spells out its generative AI plans.

5 hours ago
What to expect from WWDC 2024: iOS 18, macOS 15 and so much AI

We just announced the breakout session winners last week. Now meet the roundtable sessions that really “rounded” out the competition for this year’s Disrupt 2024 audience choice program. With five…

The votes are in: Meet the Disrupt 2024 audience choice roundtable winners

The malicious attack appears to have involved malware transmitted through TikTok’s DMs.

TikTok acknowledges exploit targeting high-profile accounts

It’s unusual for three major AI providers to all be down at the same time, which could signal a broader infrastructure issues or internet-scale problem.

AI apocalypse? ChatGPT, Claude and Perplexity all went down at the same time

Welcome to TechCrunch Fintech! This week, we’re looking at LoanSnap’s woes, Nubank’s and Monzo’s positive milestones, a plethora of fintech fundraises and more! To get a roundup of TechCrunch’s biggest…

A look at LoanSnap’s troubles and which neobanks are having a moment

Databricks, the analytics and AI giant, has acquired data management company Tabular for an undisclosed sum. (CNBC reports that Databricks paid over $1 billion.) According to Tabular co-founder Ryan Blue,…

Databricks acquires Tabular to build a common data lakehouse standard

ChatGPT, OpenAI’s text-generating AI chatbot, has taken the world by storm. What started as a tool to hyper-charge productivity through writing essays and code with short text prompts has evolved…

ChatGPT: Everything you need to know about the AI-powered chatbot

The next few weeks could be pivotal for Worldcoin, the controversial eyeball-scanning crypto venture co-founded by OpenAI’s Sam Altman, whose operations remain almost entirely shuttered in the European Union following…

Worldcoin faces pivotal EU privacy decision within weeks

OpenAI’s chatbot ChatGPT has been down for several users across the globe for the last few hours.

OpenAI fixes the issue that caused ChatGPT outage for several hours

True Fit, the AI-powered size-and-fit personalization tool, has offered its size recommendation solution to thousands of retailers for nearly 20 years. Now, the company is venturing into the generative AI…

True Fit leverages generative AI to help online shoppers find clothes that fit

Audio streaming service TuneIn is teaming up with Discord to bring free live radio to the platform. This is TuneIn’s first collaboration with a social platform and one that is…

Discord and TuneIn partner to bring live radio to the social platform

The early victors in the AI gold rush are selling the picks and shovels needed to develop and apply artificial intelligence. Just take a look at data-labeling startup Scale AI…

Scale AI founder Alexandr Wang is coming to Disrupt 2024

Try to imagine the number of parts that go into making a rocket engine. Now imagine requesting and comparing quotes for each of those parts, getting approvals to purchase the…

Engineer brothers found Forge to modernize hardware procurement

Raspberry Pi has released a $70 AI extension kit with a neural network inference accelerator that can be used for local inferencing, for the Raspberry Pi 5.

Raspberry Pi partners with Hailo for its AI extension kit

When Stacklet’s founders, Travis Stanfield and Kapil Thangavelu, came out of Capital One in 2020 to launch their startup, most companies weren’t all that concerned with constraining cloud costs. But…

Stacklet sees demand grow as companies take cloud cost control more seriously

Fivetran’s Managed Data Lake Service aims to remove the repetitive work of managing data lakes.

Fivetran launches a managed data lake service

Lance Riedel and Nigel Daley both spent decades in search discovery, but it was while working at Pinterest that they began trying to understand how to use search engines to…

How a couple of former Pinterest search experts caught Biz Stone’s attention

GetWhy helps businesses carry out market studies and extract insights from video-based interviews using AI.

GetWhy, a market research AI platform that extracts insights from video interviews, raises $34.5M

AI-powered virtual physical therapy platform Sword Health has seen its valuation soar 50% to $3 billion.

Sword Health raises $130M and its valuation soars to $3B

Jeffrey Katzenberg and Sujay Jaswa, along with three general partners, manage $1.5 billion in assets today through their Build, Venture and Seed strategies.

WndrCo officially gets into venture capital with fresh $460M across two funds

The startup targets the middle ground between platforms that offer rigid templates, and those that facilitate a full-control approach.

Storyblok raises $80M to add more AI to its ‘headless’ CMS aimed at non-technical people

The startup has been pursuing a ground-up redesign of a well-understood technology.

‘Star Wars’ lasers and waterfalls of molten salt: How Xcimer plans to make fusion power happen

Sēkr, a startup that offers a mobile app for outdoor enthusiasts and campers, is launching a new AI tool for planning road trips. The new tool, called Copilot, is available…

Travel app Sēkr can plan your next road trip with its new AI tool

Microsoft’s education-focused flavor of its cloud productivity suite, Microsoft 365 Education, is facing investigation in the European Union. Privacy rights nonprofit noyb has just lodged two complaints with Austria’s data…

Microsoft hit with EU privacy complaints over schools’ use of 365 Education suite

Since the shock of Russia’s 2022 invasion of Ukraine, solar energy has been having a moment in Europe. Electricity prices have been going up while the investment required to get…

Samara is accelerating the energy transition in Spain one solar panel at a time

Featured Article

DEI backlash: Stay up-to-date on the latest legal and corporate challenges

It’s clear that this year will be a turning point for DEI.

1 day ago
DEI backlash: Stay up-to-date on the latest legal and corporate challenges

The keynote will be focused on Apple’s software offerings and the developers that power them, including the latest versions of iOS, iPadOS, macOS, tvOS, visionOS and watchOS.

Watch Apple kick off WWDC 2024 right here

Hello and welcome back to TechCrunch Space. Unfortunately, Boeing’s Starliner launch was delayed yet again, this time due to issues with one of the three redundant computers used by United…

TechCrunch Space: China’s victory

The court ruling said that Fearless Fund’s Strivers Grant likely violates the Civil Rights Act of 1866, which bans the use of race in contracts.

An appeals court rules that VC Fearless Fund cannot issue grants to Black women, but the fight continues