Enterprise

Predibase exits stealth with a low-code platform for building AI models

Comment

abstract multicolored wave length
Image Credits: MR.Cole_Photographer / Getty Images

Data science teams are stymied by disorganization at their companies, impacting efforts to deploy timely AI and analytics projects. In a recent survey of “data executives” at U.S.-based companies, 44% said that they’ve not hired enough, were too siloed off to be effective and haven’t been given clear roles. Respondents said that they were most concerned about the impact of a revenue loss or hit to brand reputation stemming from failing AI systems and a trend toward splashy investments with short-term payoffs.

These are ultimately organizational challenges. But Piero Molino, the co-founder of AI development platform Predibase, says that inadequate tooling often exacerbates them.

“The major challenges we see today in the industry are that machine learning projects tend to have elongated time-to-value and very low access across an organization. As a result, most machine learning tasks in an organization are bottlenecked on an oversubscribed centralized data science team,” Molino told TechCrunch via email. “Given these challenges, organizations today need to choose between two flawed approaches when it comes to developing machine learning. They can build their own systems from data to deployment using low-level APIs that give them the flexibility machine learning tasks typically require at the cost of complexity. Or they can choose to use a blackbox off-the-shelf ‘AutoML’ solution that simplifies their problem at the expense of flexibility and control.”

The market for synthetic data is bigger than you think

Indeed, while worldwide spending on AI technologies was estimated at $35.8 billion in 2019, nearly 80% of companies have seen their AI projects stall as a result of issues with data quality and a lack of confidence in AI systems, according to an Alegion report. Being an entrepreneur (and a salesperson), Molino asserts that his product, Predibase, is a solution to this — or at least a step toward one.

Predibase, which today emerged from stealth with $16.25 million in Series A funding led by Greylock with participation from the Factory and angel investors, allows a user to specify an AI system as a file that tells the platform what the user wants (e.g., recognizing objects in an image) and figures out a way to fill that need. Molino describes it as a “declarative” approach to AI development, borrowing a term from computer science that refers to code written to describe what a developer wishes to accomplish.

“Machine learning projects today usually take six months to a year at most organizations we’ve worked with. We want to drastically reduce that [by bringing] a low-code but high-ceiling machine learning tool to organizations” Molino continued. “Typically, most companies are bottlenecked by data science resources, meaning product and analyst teams are blocked by a scarce and expensive resource. With Predibase, we’ve seen engineers and analysts build and operationalize models directly.”

Predibase is built on top of open source technologies including Horovod, a framework for AI model training, and Ludwig, a suite of machine learning tools. Both were originally developed at Uber, which several years ago transitioned governance of the projects to the Linux Foundation.

Molino, who joined Uber by way of the company’s acquisition of startup Geometric Intelligence, helped to create Ludwig in 2019. Predibase’s other co-founder, Travis Addair, was the lead maintainer for Horovod while working as a senior software engineer at Uber.

To launch Predibase, Molino and Addair teamed up with former Google Cloud AI product manager Devvret Rishi and Stanford computer science professor Chris Ré, one of the co-founders of Lattice.io, a data mining and machine learning company that Apple purchased in 2017.

Predibase is designed to enable developers to define AI pipelines in just a few lines of code while scaling up to petabytes of data across thousands of machines. As Molino explains it, using the platform, a user can create a text-analyzing AI system in six lines of code that specifies the input and output data. If they want to iterate and customize that system, Predibase lets them add parameters in the configuration file that affords a more granular level of control.

Predibase integrates with data sources including Snowflake, Google BigQuery and Amazon S3 for model training. Users can train models through the platform or programmatically, depending on the use case, and then host and serve or deploy those models into local production environments.

“Apart from lowering time to value, Predibase allows users to work with different modalities of data using the same toolset. With Predibase, we’ve seen users train models on images for classification, text data like emails for triage, tabular data for detection and regression tasks, and even audio datasets that would’ve required heavy in-house sophistication without the native capabilities in the platform,” Molino said. “For many working in this space, Predibase provides a net new capability when tackling use cases on unstructured data.”

Broadly speaking, no-code development platforms are on the rise, and a number of startups compete directly with Predibase, including AI orchestration startup Union.ai and low-code data engineering platform Prophecy (not to mention SageMaker and Vertex AI). But Molino’s view is that while rivals satisfy the demand in the enterprise for simple solutions, they do so at the cost of flexibility, leading customers to “hit a ceiling and churn out.”

“[L]ike infrastructure as code simplified IT, our platform allows users to focus on the ‘what’ of their models rather than the ‘how,’ allowing them to break free of the usual limits of low-code systems using an extensible configuration … We provide model explainability out of the box so users can understand which features are driving predictions,” he said. “[Our platform] has been used at Fortune 500 companies like a leading U.S. tech company, a large national bank and large U.S. healthcare company.”

The pitch sufficiently impressed angels like Kaggle CEO Anthony Goldbloom and former Intel AI COO Remi El-Ouazzane, both of whom invested. Other notable backers include Kaggle CTO Ben Hamner and Zoubin Ghahramani, a professor of information engineering at Cambridge and senior research scientist at Google Brain.

Molino says that the fresh capital from the Series A will be used to take Predibase’s beta product to a wider market — it’s currently invite only. It’ll also be put toward growing Predibase’s team of machine learning engineers and building out a go-to-market organization, expanding the company’s 21-person team.

More TechCrunch

Pinecone, the vector database startup founded by Edo Liberty, the former head of Amazon’s AI Labs, has long been at the forefront of helping businesses augment large language models (LLMs)…

Pinecone launches its serverless vector database out of preview

Young geothermal energy wells can be like budding prodigies, each brimming with potential to outshine their peers. But like people, most decline with age. In California, for example, the amount…

Special mud helps XGS Energy get more power out of geothermal wells

The market play is clear from the outset: The $449 headphones are firmly targeted at an audience that would otherwise be purchasing the Bose QC Ultra or Apple AirPods Max.

Sonos finally made some headphones

Adobe says the feature is up to the task, regardless of how complex of a background the object is set against.

Adobe brings Firefly AI-powered Generative Remove to Lightroom

All cars suffer when the mercury drops, but electric vehicles suffer more than most as heaters draw more power and batteries charge more slowly as the liquid electrolyte inside thickens.…

Porsche invests in battery startup South 8 to boost cold-weather EV performance

Scale AI has raised a $1 billion Series F round from a slew of big-name institutional and corporate investors including Amazon and Meta.

Data-labeling startup Scale AI raises $1B as valuation doubles to $13.8B

The new coalition, Tech Against Scams, will work together to find ways to fight back against the tools used by scammers and to better educate the public against financial scams.

Meta, Match, Coinbase and others team up to fight online fraud and crypto scams

It’s a wrap: European Union lawmakers have given the final approval to set up the bloc’s flagship, risk-based regulations for artificial intelligence.

EU Council gives final nod to set up risk-based regulations for AI

London-based fintech Vitesse has closed a $93 million Series C round of funding led by investment giant KKR.

Vitesse, a payments and treasury management platform for insurers, raises $93M to fuel US expansion

Zen Educate, an online marketplace that connects schools with teachers, has raised $37 million in a Series B round of funding. The raise comes amid a growing teacher shortage crisis…

Zen Educate raises $37M and acquires Aquinas Education as it tries to address the teacher shortage

“When I heard the released demo, I was shocked, angered and in disbelief that Mr. Altman would pursue a voice that sounded so eerily similar to mine.”

Scarlett Johansson says that OpenAI approached her to use her voice

A new self-driving truck — manufactured by Volvo and loaded with autonomous vehicle tech developed by Aurora Innovation — could be on public highways as early as this summer.  The…

Aurora and Volvo unveil self-driving truck designed for a driverless future

The European venture capital firm raised its fourth fund as fund as climate tech “comes of age.”

ETF Partners raises €285M for climate startups that will be effective quickly — not 20 years down the road

Copilot, Microsoft’s brand of generative AI, will soon be far more deeply integrated into the Windows 11 experience.

Microsoft wants to make Windows an AI operating system, launches Copilot+ PCs

Hello and welcome back to TechCrunch Space. For those who haven’t heard, the first crewed launch of Boeing’s Starliner capsule has been pushed back yet again to no earlier than…

TechCrunch Space: Star(side)liner

When I attended Automate in Chicago a few weeks back, multiple people thanked me for TechCrunch’s semi-regular robotics job report. It’s always edifying to get that feedback in person. While…

These 81 robotics companies are hiring

The top vehicle safety regulator in the U.S. has launched a formal probe into an April crash involving the all-electric VinFast VF8 SUV that claimed the lives of a family…

VinFast crash that killed family of four now under federal investigation

When putting a video portal in a public park in the middle of New York City, some inappropriate behavior will likely occur. The Portal, the vision of Lithuanian artist and…

NYC-Dublin real-time video portal reopens with some fixes to prevent inappropriate behavior

Longtime New York-based seed investor, Contour Venture Partners, is making progress on its latest flagship fund after lowering its target. The firm closed on $42 million, raised from 64 backers,…

Contour Venture Partners, an early investor in Datadog and Movable Ink, lowers the target for its fifth fund

Meta’s Oversight Board has now extended its scope to include the company’s newest platform, Instagram Threads, and has begun hearing cases from Threads.

Meta’s Oversight Board takes its first Threads case

The company says it’s refocusing and prioritizing fewer initiatives that will have the biggest impact on customers and add value to the business.

SeekOut, a recruiting startup last valued at $1.2 billion, lays off 30% of its workforce

The U.K.’s self-proclaimed “world-leading” regulations for self-driving cars are now official, after the Automated Vehicles (AV) Act received royal assent — the final rubber stamp any legislation must go through…

UK’s autonomous vehicle legislation becomes law, paving the way for first driverless cars by 2026

ChatGPT, OpenAI’s text-generating AI chatbot, has taken the world by storm. What started as a tool to hyper-charge productivity through writing essays and code with short text prompts has evolved…

ChatGPT: Everything you need to know about the AI-powered chatbot

SoLo Funds CEO Travis Holoway: “Regulators seem driven by press releases when they should be motivated by true consumer protection and empowering equitable solutions.”

Fintech lender SoLo Funds is being sued again by the government over its lending practices

Hard tech startups generate a lot of buzz, but there’s a growing cohort of companies building digital tools squarely focused on making hard tech development faster, more efficient and —…

Rollup wants to be the hardware engineer’s workhorse

TechCrunch Disrupt 2024 is not just about groundbreaking innovations, insightful panels, and visionary speakers — it’s also about listening to YOU, the audience, and what you feel is top of…

Disrupt Audience Choice vote closes Friday

Google says the new SDK would help Google expand on its core mission of connecting the right audience to the right content at the right time.

Google is launching a new Android feature to drive users back into their installed apps

Jolla has taken the official wraps off the first version of its personal server-based AI assistant in the making. The reborn startup is building a privacy-focused AI device — aka…

Jolla debuts privacy-focused AI hardware

The ChatGPT mobile app’s net revenue first jumped 22% on the day of the GPT-4o launch and continued to grow in the following days.

ChatGPT’s mobile app revenue saw its biggest spike yet following GPT-4o launch

Dating app maker Bumble has acquired Geneva, an online platform built around forming real-world groups and clubs. The company said that the deal is designed to help it expand its…

Bumble buys community building app Geneva to expand further into friendships