AI

Union.ai raises $10M to simplify AI and ML workflow orchestration

Comment

Abstract background of wires and glowing particles
Image Credits: shulz / Getty Images

Union.ai, a startup emerging from stealth with a commercial version of the open source AI orchestration platform Flyte, today announced that it raised $10 million in a round contributed by NEA and “select” angel investors. CEO Ketan Umare says that the proceeds will be put toward supporting the Flyte community by “improving the accessibility, performance and reliability of Flyte” and broadening the array of systems that Flyte integrates with.

While companies find AI’s predictive power alluring, particularly on the data analytics side of the organization, achieving meaningful results with AI often proves to be a challenge. It’s true that AI can help to project revenue, for example, by identifying trends in buying and selling. But implementing and maintaining the data pipelines necessary to keep AI systems from drifting to inaccuracy can require substantial technical resources.

That’s where Flyte comes in — a platform for programming and processing concurrent AI and data analytics workflows. Union’s team, including Umare, helped to build Flyte while at Lyft, where it was used to help create a system to calculate the estimated time of arrival (ETA) for drivers to get from point A to point B.

“[Union’s] founders first met at Lyft, where we joined the team responsible for calculating the ETA for a Lyft driver to get from point A to point B,” Umare told TechCrunch via email. “Searching for the right solution led the team deep into machine learning techniques, which came with requirements to use large amounts of data and deliver robust models to production consistently … The techniques used were platformized, and the solution was used widely at Lyft.”

Lyft contributed Flyte to open source in 2020, granting the trademark to the Linux Foundation a year later. That’s when Union’s team saw an opportunity to layer paid services on top of the project in the cloud.

“A managed version of Flyte, called Union Cloud, will allow smaller teams and organizations to use the power of Flyte without the need to staff up on infrastructure teams,” Umare continued. “We [founded Union] because we believe that machine learning and data workflows are fundamentally different from software deployments. This is because software is more precise with a slower lifecycle while machine learning and data workflows start off being experimental and may need to be quickly productionized.”

Taking Flyte

Umare and Union’s other co-founders, Haytham Abuelfutuh and George Snelling, all have deep backgrounds in the tech industry. Prior to joining Lyft, Umare was a senior software engineer at Amazon and a principal engineer at Oracle, where he led development of a block storage product for an infrastructure-as-a-service and bare metal offering. Abuelfutuh spent seven years as an engineer at Microsoft and three as a developer at Google, where he helped to ship an internal software library for first-party apps including Google Photos. Snelling — also a Microsoft veteran — co-founded several startups (Westside, LabKey and Patchr) and spent time at Salesforce as a senior director of engineering.

With Union Cloud — the launch of which coincides with the release of Flyte version 1.0 — Umare says the goal is to reduce (and ideally eliminate) the unwieldy infrastructure that can crop up in data science projects and hamstring development. At their worst, messy abstractions can necessitate rebuilding infrastructure to deploy AI to production, Umare points out — negatively affecting the potential return on investment.

According to a 2021 Wakefield Research report, enterprise data engineers spend nearly half their time building and maintaining data pipelines. Sixty-nine percent of respondents to the survey — mainly data engineers — said that business outcomes would improve if their teams could contribute more to business decisions and spend less time on manual pipeline management.

“Production machine learning is still in its infancy at the moment, especially at companies outside big tech. Thus, most companies start off with DIY — that is our primary competition,” Umare said. “We took a radically different, first-principles approach to defining what a workflow means for machine learning and data scientists. We started with a goal to minimize human errors and try to help predict problems ahead of time [and worked] closely with extremely sophisticated and a diverse set of partners like Spotify, Gojek and Freenome [to help] refine the solution.”

Union Cloud inherits all of Flyte’s characteristics and capabilities, including connectors between computation back ends that record all changes to an AI pipeline. Union Cloud also stores a history of all a pipeline’s executions and provides a dashboard, command-line interface and API to interact with the computations.

Union Cloud — and Flyte — define workflows as multiple tasks. Workflows and tasks can be written in any programming language and stay on-premises, as does data moving through those components.

Cloud advantage

So what’s the value add with Union Cloud? Umare says that it adds “agility, reproducibility, and security” to Flyte by centralizing infrastructure management and maintaining “high” privacy and compliance standards. “Our products are built with zero-trust principles in mind and thus our users can use [it] to build a self-serve platform that still maintains high security standards,” he continued. “Data science is very academic, which directly affects machine learning. There is a lot of fantastic research and literature that is available in academia, which is hard to productionize. We need to bridge both these worlds in a structured and repeatable way.”

Umare also sees Union Cloud as a way to reduce the cost of developing new products and systems in a way that the open source Flyte project can’t accomplish. While he concedes that similar efforts from other vendors exist, like AWS Sagemaker, he believes that they fail to integrate well with the rest of the data science ecosystem.

“We have been at this problem for over five years, refining our solution and iterating based on real-world feedback and requirements,” Umare said. “The machine learning sector is already large and growing within traditional companies as well. We view growth potential to not be limited by the size of the current demand however, but rather by the experience we can deliver, which is why we’ve focused purely on customer success and open source adoption. This will lead to revenue growth in the near future.”

On the topic of growth, Union plans to double its 20-person headcount by the end of the year as it focuses on product buildout. Umare didn’t have statistics to share on Union Cloud interest or uptake, but reiterated that “thousands” of users across companies such as Lyft, Spotify, Toyota subsidiary Woven Planet, and biotech and finance brands have adopted Flyte.

More TechCrunch

Google says it’s developed a new family of generative AI models “fine-tuned” for learning: LearnLM. A collaboration between Google’s DeepMind AI research division and Google Research, LearnLM models — built…

LearnLM is Google’s new family of AI models for education

The official launch comes almost a year after YouTube began experimenting with AI-generated quizzes on its mobile app. 

Google is bringing AI-generated quizzes to academic videos on YouTube

Around 550 employees across autonomous vehicle company Motional have been laid off, according to information taken from WARN notice filings and sources at the company.  Earlier this week, TechCrunch reported…

Motional cut about 550 employees, around 40%, in recent restructuring, sources say

The keynote kicks off at 10 a.m. PT on Tuesday and will offer glimpses into the latest versions of Android, Wear OS and Android TV.

Google I/O 2024: Watch all of the AI, Android reveals

It ran 110 minutes, but Google managed to reference AI a whopping 121 times during Google I/O 2024 (by its own count). CEO Sundar Pichai referenced the figure to wrap…

Google mentioned ‘AI’ 120+ times during its I/O keynote

Here are quick hits of the biggest news from the keynote as they are announced.

Google I/O 2024: Here’s everything Google just announced

Google Play has a new discovery feature for apps, new ways to acquire users, updates to Play Points, and other enhancements to developer-facing tools.

Google Play preps a new full-screen app discovery feature and adds more developer tools

Soon, Android users will be able to drag and drop AI-generated images directly into their Gmail, Google Messages and other apps.

Gemini on Android becomes more capable and works with Gmail, Messages, YouTube and more

Veo can capture different visual and cinematic styles, including shots of landscapes and timelapses, and make edits and adjustments to already-generated footage.

Google Veo, a serious swing at AI-generated video, debuts at Google I/O 2024

In addition to the body of the emails themselves, the feature will also be able to analyze attachments, like PDFs.

Gemini comes to Gmail to summarize, draft emails, and more

The summaries are created based on Gemini’s analysis of insights from Google Maps’ community of more than 300 million contributors.

Google is bringing Gemini capabilities to Google Maps Platform

Google says that over 100,000 developers already tried the service.

Project IDX, Google’s next-gen IDE, is now in open beta

The system effectively listens for “conversation patterns commonly associated with scams” in-real time. 

Google will use Gemini to detect scams during calls

The standard Gemma models were only available in 2 billion and 7 billion parameter versions, making this quite a step up.

Google announces Gemma 2, a 27B-parameter version of its open model, launching in June

This is a great example of a company using generative AI to open its software to more users.

Google TalkBack will use Gemini to describe images for blind people

Firebase Genkit is an open source framework that enables developers to quickly build AI into new and existing applications.

Google launches Firebase Genkit, a new open source framework for building AI-powered apps

This will enable developers to use the on-device model to power their own AI features.

Google is building its Gemini Nano AI model into Chrome on the desktop

Google’s Circle to Search feature will now be able to solve more complex problems across psychics and math word problems. 

Circle to Search is now a better homework helper

People can now search using a video they upload combined with a text query to get an AI overview of the answers they need.

Google experiments with using video to search, thanks to Gemini AI

A search results page based on generative AI as its ranking mechanism will have wide-reaching consequences for online publishers.

Google will soon start using GenAI to organize some search results pages

Google has built a custom Gemini model for search to combine real-time information, Google’s ranking, long context and multimodal features.

Google is adding more AI to its search results

At its Google I/O developer conference, Google on Tuesday announced the next generation of its Tensor Processing Units (TPU) AI chips.

Google’s next-gen TPUs promise a 4.7x performance boost

Google is upgrading Gemini, its AI-powered chatbot, with features aimed at making the experience more ambient and contextually useful.

Google’s Gemini updates: How Project Astra is powering some of I/O’s big reveals

Veo can generate few-seconds-long 1080p video clips given a text prompt.

Google’s image-generating AI gets an upgrade

At Google I/O, Google announced upgrades to Gemini 1.5 Pro, including a bigger context window. .

Google’s generative AI can now analyze hours of video

The AI upgrade will make finding the right content more intuitive and less of a manual search process.

Google Photos introduces an AI search feature, Ask Photos

Apple released new data about anti-fraud measures related to its operation of the iOS App Store on Tuesday morning, trumpeting a claim that it stopped over $7 billion in “potentially…

Apple touts stopping $1.8B in App Store fraud last year in latest pitch to developers

Online travel agency Expedia is testing an AI assistant that bolsters features like search, itinerary building, trip planning, and real-time travel updates.

Expedia starts testing AI-powered features for search and travel planning

Welcome to TechCrunch Fintech! This week, we look at the drama around TabaPay deciding to not buy Synapse’s assets, as well as stocks dropping for a couple of fintechs, Monzo raising…

Inside TabaPay’s drama-filled decision to abandon its plans to buy Synapse’s assets

The person who claimed to have stolen the physical addresses of 49 million Dell customers appears to have taken more data from a different Dell portal, TechCrunch has learned. The…

Threat actor scraped Dell support tickets, including customer phone numbers