Replicate wants to take the pain out of running and hosting ML models

Image Credits: Ivan Bajic / Getty Images

Replicate, a startup that runs machine learning models in the cloud, today launched out of stealth with $17.8 million in venture capital backing. Of that total, $12.5 million came from a Series A led by Andreessen Horowitz with participation from Y Combinator, Sequoia and angel investors including Figma CEO Dylan Field and Vercel’s Guillermo Rauch; the rest came from a previously undisclosed seed round.

The company was co-founded by Ben Firshman, who led open source product efforts at Docker, and Andreas Jansson, previously a machine learning engineer at Spotify. The way Firshman tells it, he and Jansson came to the mutual realization that AI was accelerating at an “absurd” pace, but that technical barriers were standing in the way of mass adoption.

Enter Replicate, which offers a library of open source models that software developers can run with a few lines of code. The platform can automatically generate an API server for custom machine learning models, deployed on a large cluster of GPUs.

“If you get a ton of traffic, we scale up to handle the demand. If you don’t get any traffic, we scale down to zero and don’t charge a thing,” Firshman explained. “We only bill you for how long your code is running. The alternative is usually deploying models yourself on Amazon Web Services. Typically, you’d have to battle with servers, Kubernetes, GPUs, API servers, auto-scaling and more.”
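The pay-for-runtime model Firshman describes can be sketched in a few lines of Python. This is only an illustration of the idea, not Replicate's billing code: the `Run` type, the `usage_cost` function and the per-second rate are all hypothetical, and the article does not state actual prices.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Run:
    """One model invocation (hypothetical type, for illustration only)."""
    seconds: float  # time the model code was actually executing


def usage_cost(runs: Iterable[Run], rate_per_second: float) -> float:
    """Charge only for execution time; idle periods add nothing.

    rate_per_second is an assumed, made-up price, not a published one.
    """
    if rate_per_second < 0:
        raise ValueError("rate_per_second must be non-negative")
    total_seconds = sum(run.seconds for run in runs)
    return total_seconds * rate_per_second


if __name__ == "__main__":
    # No traffic means no runs, so the bill is zero ("scale down to zero").
    print(usage_cost([], rate_per_second=0.5))
    # Two runs of 10 and 20 seconds are billed for 30 seconds in total.
    print(usage_cost([Run(10), Run(20)], rate_per_second=0.5))
```

Under this sketch, a model that receives no requests accrues no runs and therefore no charge, which is the contrast Firshman draws with keeping self-managed servers running.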

Core to Replicate is Cog, an open source tool that lets developers package machine learning models in a standard, production-ready container format. Firshman and Jansson developed Cog, which runs on any newer macOS, Linux or Windows 11 machine.

“AI is currently too hard to use for software engineers and you have to be a machine learning engineer to use it,” Firshman said. “Companies and the industry as a whole is being held back by the lack of machine learning experts. We’re making it possible for software engineers to use machine learning with zero experience, with just a few lines of code, so they can build products with AI and apply it to business problems.”

Replicate hosts thousands of ready-to-use models, including text-to-image and image-to-text models (à la Stable Diffusion). Image Credits: Replicate

Replicate isn’t the only one doing this. The startup competes with vendors including Hugging Face and OctoML (and to an extent Runway ML), which collectively have raised hundreds of millions in venture capital. Google, Amazon and Microsoft could be considered rivals as well, since each offers its own solutions for developing, launching and maintaining machine learning models in the cloud. (See SageMaker, AutoML and Azure’s no-code ML tools.)

So what sets Replicate apart? Firshman claims the developer experience is “much better,” which of course remains to be seen — after all, Replicate is brand-spanking new. One clear point of differentiation, though, is the expansiveness of Replicate’s AI library. The platform offers diffusion models including Stable Diffusion, models for creating and editing videos, upscaling models for images and various image-to-text and text-to-image models.

Swift, painless deployment is the focus. Replicate’s website promises: “With Replicate and tools like Next.js and Vercel, you can wake up with an idea and watch it hit the front page of Hacker News by the time you go to bed.”

The marketing appears to be resonating with the developer community, which has enthusiastically embraced Replicate over the past few months — at least according to Firshman. He says that the platform’s seen 149% month-over-month growth in active users and 125% growth in API calls since the middle of last year. Enterprise customers include Character.ai, Labelbox and Unsplash.

“We’ve effectively been indexing the growth in generative AI,” Firshman said. “Founders are building tons of new products, investors are investing in it and users are clamoring for all these new things.”

Leaning into generative AI is certainly a wise decision on Replicate’s part. The segment, under which technologies like ChatGPT and Stable Diffusion fall, has seen a massive uptick in investment over the past several years. PitchBook (via Benzinga) reports that VCs funneled 425% more dollars into generative AI in 2022 compared to 2020, with the space reaching $2.1 billion total capital pledged in 2022.

Firshman sees the growth continuing — and Replicate benefitting.

“It hasn’t yet entered the enterprise’s consciousness how much generative AI is going to upend so many parts of their business: customer support, marketing, sales, content creation, and probably other things we haven’t anticipated yet,” he said. “Very soon, customer support will be mostly automated and extremely good — not the terrible chatbots of the past. Creating assets for marketing will be mostly automated. Most of the ads you see will be automatically generated and personalized. Creating assets for video games will be mostly automated. And this is with the technology we have today.”
