Startups

Replicate wants to take the pain out of running and hosting ML models

Comment

Big and small metal gear with copy space. negotiating with corporate venture capital startups
Image Credits: Ivan Bajic (opens in a new window) / Getty Images

Replicate, a startup that runs machine learning models in the cloud, today launched out of stealth with $17.8 million in venture capital backing; $12.5 million of the total came from a Series A led by Andreessen Horowitz with participation from Y Combinator, Sequoia and angel investors including Figma CEO Dylan Field and Vercel’s Guillermo Rauch, while the rest was from a previously undisclosed seed round.

The company was co-founded by Ben Firshman, who led open source product efforts at Docker, and Andreas Jansson, previously a machine learning engineer at Spotify. The way Firshman tells it, he and Jansson came to the mutual realization that AI was accelerating at an “absurd” pace, but that technical barriers were standing in the way of mass adoption.

Enter Replicate, which offers a library of open source models that software developers can run with a few lines of code. The platform can automatically generate an API server for custom machine learning models, deployed on a large cluster of GPUs.

“If you get a ton of traffic, we scale up to handle the demand. If you don’t get any traffic, we scale down to zero and don’t charge a thing,” Firshman explained. “We only bill you for how long your code is running. The alternative is usually deploying models yourself on Amazon Web Services. Typically, you’d have to battle with servers, Kubernetes, GPUs, API servers, auto-scaling and more.”

Core to Replicate is Cog, an open source tool that lets developers package machine learning models in a standard, production-ready container format. Firshman and Jansson developed Cog, which runs on any newer macOS, Linux or Windows 11 machine.

“AI is currently too hard to use for software engineers and you have to be a machine learning engineer to use it,” Firshman said. “Companies and the industry as a whole is being held back by the lack of machine learning experts. We’re making it possible for software engineers to use machine learning with zero experience, with just a few lines of code, so they can build products with AI and apply it to business problems.”

Replicate
Replicate hosts thousands of ready-to-use models, including text-to-image and image-to-text models (à la Stable Diffusion). Image Credits: Replicate

Replicate isn’t the only one doing this. The startup competes with vendors including Hugging Face and OctoML (and to an extent Runway ML), which collectively have raised hundreds of millions in venture capital. Google, Amazon and Microsoft could be considered rivals, as well — offering their own solutions for developing, launching and maintaining machine leaning models in the cloud. (See SageMaker, AutoML and Azure’s no-code ML tools).

So what sets Replicate apart? Firshman claims the developer experience is “much better,” which of course remains to be seen — after all, Replicate is brand-spanking new. One clear point of differentiation, though, is the expansiveness of Replicate’s AI library. The platform offers diffusion models including Stable Diffusion, models for creating and editing videos, upscaling models for images and various image-to-text and text-to-image models.

Swift, painless deployment is the focus. Replicate’s website promises: “With Replicate and tools like Next.js and Vercel, you can wake up with an idea and watch it hit the front page of Hacker News by the time you go to bed.”

The marketing appears to be resonating with the developer community, which has enthusiastically embraced Replicate over the past few months — at least according to Firshman. He says that the platform’s seen 149% month-over-month growth in active users and 125% growth in API calls since the middle of last year. Enterprise customers include Character.ai, Labelbox and Unsplash.

“We’ve effectively been indexing the growth in generative AI,” Firshman said. “Founders are building tons of new products, investors are investing in it and users are clamoring for all these new things.”

Leaning into generative AI is certainly a wise decision on Replicate’s part. The segment — under which technologies like ChatGPT and Stable Diffusion fall — has seen a massive uptick in investment over the past several years. PitchBook (via Bezinga) reports that VCs funneled 425% more dollars into generative AI in 2022 compared to 2020, with the space reaching $2.1 billion total capital pledged in 2022.

Firshman sees the growth continuing — and Replicate benefitting.

“It hasn’t yet entered the enterprise’s consciousness how much generative AI is going to upend so many parts of their business: customer support, marketing, sales, content creation, and probably other things we haven’t anticipated yet,” he said. “Very soon, customer support will be mostly automated and extremely good — not the terrible chatbots of the past. Creating assets for marketing will be mostly automated. Most of the ads you see will be automatically generated and personalized. Creating assets for video games will be mostly automated. And this is with the technology we have today.”

More TechCrunch

Accurate weather forecasts are critical to industries like agriculture, and they’re also important to help prevent and mitigate harm from inclement weather events or natural disasters. But getting forecasts right…

Deal Dive: Can blockchain make weather forecasts better? WeatherXM thinks so

pcTattletale’s website was briefly defaced and contained links containing files from the spyware maker’s servers, before going offline.

Spyware app pcTattletale was hacked and its website defaced

Featured Article

With a16z-backed Synapse’s collapse, BaaS fintech is a mess and 10 million consumers could be hurt

Synapse’s bankruptcy shows just how treacherous things are for the often-interdependent fintech world when one key player hits trouble. 

5 hours ago
With a16z-backed Synapse’s collapse, BaaS fintech is a mess and 10 million consumers could be hurt

Sarah Myers West, profiled as part of TechCrunch’s Women in AI series, is managing director at the AI Now institute.

Women in AI: Sarah Myers West says we should ask, ‘Why build AI at all?’

Keeping up with an industry as fast-moving as AI is a tall order. So until an AI can do it for you, here’s a handy roundup of recent stories in the world…

This Week in AI: OpenAI and publishers are partners of convenience

Evan, a high school sophomore from Houston, was stuck on a calculus problem. He pulled up Answer AI on his iPhone, snapped a photo of the problem from his Advanced…

AI tutors are quietly changing how kids in the US study, and the leading apps are from China

Welcome to Startups Weekly — Haje‘s weekly recap of everything you can’t miss from the world of startups. Sign up here to get it in your inbox every Friday. Well,…

Startups Weekly: Drama at Techstars. Drama in AI. Drama everywhere.

Last year’s investor dreams of a strong 2024 IPO pipeline have faded, if not fully disappeared, as we approach the halfway point of the year. 2024 delivered four venture-backed tech…

From Plaid to Figma, here are the startups that are likely — or definitely — not having IPOs this year

Federal safety regulators have discovered nine more incidents that raise questions about the safety of Waymo’s self-driving vehicles operating in Phoenix and San Francisco.  The National Highway Traffic Safety Administration…

Feds add nine more incidents to Waymo robotaxi investigation

Terra One’s pitch deck has a few wins, but also a few misses. Here’s how to fix that.

Pitch Deck Teardown: Terra One’s $7.5M Seed deck

Chinasa T. Okolo researches AI policy and governance in the Global South.

Women in AI: Chinasa T. Okolo researches AI’s impact on the Global South

TechCrunch Disrupt takes place on October 28–30 in San Francisco. While the event is a few months away, the deadline to secure your early-bird tickets and save up to $800…

Disrupt 2024 early-bird tickets fly away next Friday

Another week, and another round of crazy cash injections and valuations emerged from the AI realm. DeepL, an AI language translation startup, raised $300 million on a $2 billion valuation;…

Big tech companies are plowing money into AI startups, which could help them dodge antitrust concerns

If raised, this new fund, the firm’s third, would be its largest to date.

Harlem Capital is raising a $150 million fund

About half a million patients have been notified so far, but the number of affected individuals is likely far higher.

US pharma giant Cencora says Americans’ health information stolen in data breach

Attention, tech enthusiasts and startup supporters! The final countdown is here: Today is the last day to cast your vote for the TechCrunch Disrupt 2024 Audience Choice program. Voting closes…

Last day to vote for TC Disrupt 2024 Audience Choice program

Featured Article

Signal’s Meredith Whittaker on the Telegram security clash and the ‘edge lords’ at OpenAI 

Among other things, Whittaker is concerned about the concentration of power in the five main social media platforms.

1 day ago
Signal’s Meredith Whittaker on the Telegram security clash and the ‘edge lords’ at OpenAI 

Lucid Motors is laying off about 400 employees, or roughly 6% of its workforce, as part of a restructuring ahead of the launch of its first electric SUV later this…

Lucid Motors slashes 400 jobs ahead of crucial SUV launch

Google is investing nearly $350 million in Flipkart, becoming the latest high-profile name to back the Walmart-owned Indian e-commerce startup. The Android-maker will also provide Flipkart with cloud offerings as…

Google invests $350 million in Indian e-commerce giant Flipkart

A Jio Financial unit plans to purchase customer premises equipment and telecom gear worth $4.32 billion from Reliance Retail.

Jio Financial unit to buy $4.32B of telecom gear from Reliance Retail

Foursquare, the location-focused outfit that in 2020 merged with Factual, another location-focused outfit, is joining the parade of companies to make cuts to one of its biggest cost centers –…

Foursquare just laid off 105 employees

“Running with scissors is a cardio exercise that can increase your heart rate and require concentration and focus,” says Google’s new AI search feature. “Some say it can also improve…

Using memes, social media users have become red teams for half-baked AI features

The European Space Agency selected two companies on Wednesday to advance designs of a cargo spacecraft that could establish the continent’s first sovereign access to space.  The two awardees, major…

ESA prepares for the post-ISS era, selects The Exploration Company, Thales Alenia to develop cargo spacecraft

Expressable is a platform that offers one-on-one virtual sessions with speech language pathologists.

Expressable brings speech therapy into the home

The French Secretary of State for the Digital Economy as of this year, Marina Ferrari, revealed this year’s laureates during VivaTech week in Paris. According to its promoters, this fifth…

The biggest French startups in 2024 according to the French government

Spotify is notifying customers who purchased its Car Thing product that the devices will stop working after December 9, 2024. The company discontinued the device back in July 2022, but…

Spotify to shut off Car Thing for good, leading users to demand refunds

Elon Musk’s X is preparing to make “likes” private on the social network, in a change that could potentially confuse users over the difference between something they’ve favorited and something…

X should bring back stars, not hide ‘likes’

The FCC has proposed a $6 million fine for the scammer who used voice-cloning tech to impersonate President Biden in a series of illegal robocalls during a New Hampshire primary…

$6M fine for robocaller who used AI to clone Biden’s voice

Welcome back to TechCrunch Mobility — your central hub for news and insights on the future of transportation. Sign up here for free — just click TechCrunch Mobility! Is it…

Tesla lobbies for Elon and Kia taps into the GenAI hype

Crowdaa is an app that allows non-developers to easily create and release apps on the mobile store. 

App developer Crowdaa raises €1.2M and plans a US expansion