Enterprise

Predibase exits stealth with a low-code platform for building AI models

Comment

abstract multicolored wave length
Image Credits: MR.Cole_Photographer / Getty Images

Data science teams are stymied by disorganization at their companies, impacting efforts to deploy timely AI and analytics projects. In a recent survey of “data executives” at U.S.-based companies, 44% said that they’ve not hired enough, were too siloed off to be effective and haven’t been given clear roles. Respondents said that they were most concerned about the impact of a revenue loss or hit to brand reputation stemming from failing AI systems and a trend toward splashy investments with short-term payoffs.

These are ultimately organizational challenges. But Piero Molino, the co-founder of AI development platform Predibase, says that inadequate tooling often exacerbates them.

“The major challenges we see today in the industry are that machine learning projects tend to have elongated time-to-value and very low access across an organization. As a result, most machine learning tasks in an organization are bottlenecked on an oversubscribed centralized data science team,” Molino told TechCrunch via email. “Given these challenges, organizations today need to choose between two flawed approaches when it comes to developing machine learning. They can build their own systems from data to deployment using low-level APIs that give them the flexibility machine learning tasks typically require at the cost of complexity. Or they can choose to use a blackbox off-the-shelf ‘AutoML’ solution that simplifies their problem at the expense of flexibility and control.”

The market for synthetic data is bigger than you think

Indeed, while worldwide spending on AI technologies was estimated at $35.8 billion in 2019, nearly 80% of companies have seen their AI projects stall as a result of issues with data quality and a lack of confidence in AI systems, according to an Alegion report. Being an entrepreneur (and a salesperson), Molino asserts that his product, Predibase, is a solution to this — or at least a step toward one.

Predibase, which today emerged from stealth with $16.25 million in Series A funding led by Greylock with participation from the Factory and angel investors, allows a user to specify an AI system as a file that tells the platform what the user wants (e.g., recognizing objects in an image) and figures out a way to fill that need. Molino describes it as a “declarative” approach to AI development, borrowing a term from computer science that refers to code written to describe what a developer wishes to accomplish.

“Machine learning projects today usually take six months to a year at most organizations we’ve worked with. We want to drastically reduce that [by bringing] a low-code but high-ceiling machine learning tool to organizations” Molino continued. “Typically, most companies are bottlenecked by data science resources, meaning product and analyst teams are blocked by a scarce and expensive resource. With Predibase, we’ve seen engineers and analysts build and operationalize models directly.”

Predibase is built on top of open source technologies including Horovod, a framework for AI model training, and Ludwig, a suite of machine learning tools. Both were originally developed at Uber, which several years ago transitioned governance of the projects to the Linux Foundation.

Molino, who joined Uber by way of the company’s acquisition of startup Geometric Intelligence, helped to create Ludwig in 2019. Predibase’s other co-founder, Travis Addair, was the lead maintainer for Horovod while working as a senior software engineer at Uber.

To launch Predibase, Molino and Addair teamed up with former Google Cloud AI product manager Devvret Rishi and Stanford computer science professor Chris Ré, one of the co-founders of Lattice.io, a data mining and machine learning company that Apple purchased in 2017.

Predibase is designed to enable developers to define AI pipelines in just a few lines of code while scaling up to petabytes of data across thousands of machines. As Molino explains it, using the platform, a user can create a text-analyzing AI system in six lines of code that specifies the input and output data. If they want to iterate and customize that system, Predibase lets them add parameters in the configuration file that affords a more granular level of control.

Predibase integrates with data sources including Snowflake, Google BigQuery and Amazon S3 for model training. Users can train models through the platform or programmatically, depending on the use case, and then host and serve or deploy those models into local production environments.

“Apart from lowering time to value, Predibase allows users to work with different modalities of data using the same toolset. With Predibase, we’ve seen users train models on images for classification, text data like emails for triage, tabular data for detection and regression tasks, and even audio datasets that would’ve required heavy in-house sophistication without the native capabilities in the platform,” Molino said. “For many working in this space, Predibase provides a net new capability when tackling use cases on unstructured data.”

Broadly speaking, no-code development platforms are on the rise, and a number of startups compete directly with Predibase, including AI orchestration startup Union.ai and low-code data engineering platform Prophecy (not to mention SageMaker and Vertex AI). But Molino’s view is that while rivals satisfy the demand in the enterprise for simple solutions, they do so at the cost of flexibility, leading customers to “hit a ceiling and churn out.”

“[L]ike infrastructure as code simplified IT, our platform allows users to focus on the ‘what’ of their models rather than the ‘how,’ allowing them to break free of the usual limits of low-code systems using an extensible configuration … We provide model explainability out of the box so users can understand which features are driving predictions,” he said. “[Our platform] has been used at Fortune 500 companies like a leading U.S. tech company, a large national bank and large U.S. healthcare company.”

The pitch sufficiently impressed angels like Kaggle CEO Anthony Goldbloom and former Intel AI COO Remi El-Ouazzane, both of whom invested. Other notable backers include Kaggle CTO Ben Hamner and Zoubin Ghahramani, a professor of information engineering at Cambridge and senior research scientist at Google Brain.

Molino says that the fresh capital from the Series A will be used to take Predibase’s beta product to a wider market — it’s currently invite only. It’ll also be put toward growing Predibase’s team of machine learning engineers and building out a go-to-market organization, expanding the company’s 21-person team.

More TechCrunch

Firebase Genkit is an open source framework that enables developers to quickly build AI into new and existing applications.

Google launches Firebase Genkit, a new open source framework for building AI-powered apps

In the coming months, Google says it will open up the Gemini Nano model to more developers.

Patreon and Grammarly are already experimenting with Gemini Nano, says Google

As part of the update, Reddit also launched a dedicated AMA tab within the web post composer.

Reddit introduces new tools for ‘Ask Me Anything,’ its Q&A feature

Here are quick hits of the biggest news from the keynote as they are announced.

Google I/O 2024: Here’s everything Google just announced

LearnLM is already powering features across Google products, including in YouTube, Google’s Gemini apps, Google Search and Google Classroom.

LearnLM is Google’s new family of AI models for education

The official launch comes almost a year after YouTube began experimenting with AI-generated quizzes on its mobile app. 

Google is bringing AI-generated quizzes to academic videos on YouTube

Around 550 employees across autonomous vehicle company Motional have been laid off, according to information taken from WARN notice filings and sources at the company.  Earlier this week, TechCrunch reported…

Motional cut about 550 employees, around 40%, in recent restructuring, sources say

The keynote kicks off at 10 a.m. PT on Tuesday and will offer glimpses into the latest versions of Android, Wear OS and Android TV.

Google I/O 2024: Watch all of the AI, Android reveals

It ran 110 minutes, but Google managed to reference AI a whopping 121 times during Google I/O 2024 (by its own count). CEO Sundar Pichai referenced the figure to wrap…

Google mentioned ‘AI’ 120+ times during its I/O keynote

Google Play has a new discovery feature for apps, new ways to acquire users, updates to Play Points, and other enhancements to developer-facing tools.

Google Play preps a new full-screen app discovery feature and adds more developer tools

Soon, Android users will be able to drag and drop AI-generated images directly into their Gmail, Google Messages and other apps.

Gemini on Android becomes more capable and works with Gmail, Messages, YouTube and more

Veo can capture different visual and cinematic styles, including shots of landscapes and timelapses, and make edits and adjustments to already-generated footage.

Google Veo, a serious swing at AI-generated video, debuts at Google I/O 2024

In addition to the body of the emails themselves, the feature will also be able to analyze attachments, like PDFs.

Gemini comes to Gmail to summarize, draft emails, and more

The summaries are created based on Gemini’s analysis of insights from Google Maps’ community of more than 300 million contributors.

Google is bringing Gemini capabilities to Google Maps Platform

Google says that over 100,000 developers already tried the service.

Project IDX, Google’s next-gen IDE, is now in open beta

The system effectively listens for “conversation patterns commonly associated with scams” in-real time. 

Google will use Gemini to detect scams during calls

The standard Gemma models were only available in 2 billion and 7 billion parameter versions, making this quite a step up.

Google announces Gemma 2, a 27B-parameter version of its open model, launching in June

This is a great example of a company using generative AI to open its software to more users.

Google TalkBack will use Gemini to describe images for blind people

This will enable developers to use the on-device model to power their own AI features.

Google is building its Gemini Nano AI model into Chrome on the desktop

Google’s Circle to Search feature will now be able to solve more complex problems across psychics and math word problems. 

Circle to Search is now a better homework helper

People can now search using a video they upload combined with a text query to get an AI overview of the answers they need.

Google experiments with using video to search, thanks to Gemini AI

A search results page based on generative AI as its ranking mechanism will have wide-reaching consequences for online publishers.

Google will soon start using GenAI to organize some search results pages

Google has built a custom Gemini model for search to combine real-time information, Google’s ranking, long context and multimodal features.

Google is adding more AI to its search results

At its Google I/O developer conference, Google on Tuesday announced the next generation of its Tensor Processing Units (TPU) AI chips.

Google’s next-gen TPUs promise a 4.7x performance boost

Google is upgrading Gemini, its AI-powered chatbot, with features aimed at making the experience more ambient and contextually useful.

Google’s Gemini updates: How Project Astra is powering some of I/O’s big reveals

Veo can generate few-seconds-long 1080p video clips given a text prompt.

Google’s image-generating AI gets an upgrade

At Google I/O, Google announced upgrades to Gemini 1.5 Pro, including a bigger context window. .

Google’s generative AI can now analyze hours of video

The AI upgrade will make finding the right content more intuitive and less of a manual search process.

Google Photos introduces an AI search feature, Ask Photos

Apple released new data about anti-fraud measures related to its operation of the iOS App Store on Tuesday morning, trumpeting a claim that it stopped over $7 billion in “potentially…

Apple touts stopping $1.8B in App Store fraud last year in latest pitch to developers

Online travel agency Expedia is testing an AI assistant that bolsters features like search, itinerary building, trip planning, and real-time travel updates.

Expedia starts testing AI-powered features for search and travel planning