Startups

Replicate wants to take the pain out of running and hosting ML models

Comment

Big and small metal gear with copy space. negotiating with corporate venture capital startups
Image Credits: Ivan Bajic (opens in a new window) / Getty Images

Replicate, a startup that runs machine learning models in the cloud, today launched out of stealth with $17.8 million in venture capital backing; $12.5 million of the total came from a Series A led by Andreessen Horowitz with participation from Y Combinator, Sequoia and angel investors including Figma CEO Dylan Field and Vercel’s Guillermo Rauch, while the rest was from a previously undisclosed seed round.

The company was co-founded by Ben Firshman, who led open source product efforts at Docker, and Andreas Jansson, previously a machine learning engineer at Spotify. The way Firshman tells it, he and Jansson came to the mutual realization that AI was accelerating at an “absurd” pace, but that technical barriers were standing in the way of mass adoption.

Enter Replicate, which offers a library of open source models that software developers can run with a few lines of code. The platform can automatically generate an API server for custom machine learning models, deployed on a large cluster of GPUs.

“If you get a ton of traffic, we scale up to handle the demand. If you don’t get any traffic, we scale down to zero and don’t charge a thing,” Firshman explained. “We only bill you for how long your code is running. The alternative is usually deploying models yourself on Amazon Web Services. Typically, you’d have to battle with servers, Kubernetes, GPUs, API servers, auto-scaling and more.”

Core to Replicate is Cog, an open source tool that lets developers package machine learning models in a standard, production-ready container format. Firshman and Jansson developed Cog, which runs on any newer macOS, Linux or Windows 11 machine.

“AI is currently too hard to use for software engineers and you have to be a machine learning engineer to use it,” Firshman said. “Companies and the industry as a whole is being held back by the lack of machine learning experts. We’re making it possible for software engineers to use machine learning with zero experience, with just a few lines of code, so they can build products with AI and apply it to business problems.”

Replicate
Replicate hosts thousands of ready-to-use models, including text-to-image and image-to-text models (à la Stable Diffusion). Image Credits: Replicate

Replicate isn’t the only one doing this. The startup competes with vendors including Hugging Face and OctoML (and to an extent Runway ML), which collectively have raised hundreds of millions in venture capital. Google, Amazon and Microsoft could be considered rivals, as well — offering their own solutions for developing, launching and maintaining machine leaning models in the cloud. (See SageMaker, AutoML and Azure’s no-code ML tools).

So what sets Replicate apart? Firshman claims the developer experience is “much better,” which of course remains to be seen — after all, Replicate is brand-spanking new. One clear point of differentiation, though, is the expansiveness of Replicate’s AI library. The platform offers diffusion models including Stable Diffusion, models for creating and editing videos, upscaling models for images and various image-to-text and text-to-image models.

Swift, painless deployment is the focus. Replicate’s website promises: “With Replicate and tools like Next.js and Vercel, you can wake up with an idea and watch it hit the front page of Hacker News by the time you go to bed.”

The marketing appears to be resonating with the developer community, which has enthusiastically embraced Replicate over the past few months — at least according to Firshman. He says that the platform’s seen 149% month-over-month growth in active users and 125% growth in API calls since the middle of last year. Enterprise customers include Character.ai, Labelbox and Unsplash.

“We’ve effectively been indexing the growth in generative AI,” Firshman said. “Founders are building tons of new products, investors are investing in it and users are clamoring for all these new things.”

Leaning into generative AI is certainly a wise decision on Replicate’s part. The segment — under which technologies like ChatGPT and Stable Diffusion fall — has seen a massive uptick in investment over the past several years. PitchBook (via Bezinga) reports that VCs funneled 425% more dollars into generative AI in 2022 compared to 2020, with the space reaching $2.1 billion total capital pledged in 2022.

Firshman sees the growth continuing — and Replicate benefitting.

“It hasn’t yet entered the enterprise’s consciousness how much generative AI is going to upend so many parts of their business: customer support, marketing, sales, content creation, and probably other things we haven’t anticipated yet,” he said. “Very soon, customer support will be mostly automated and extremely good — not the terrible chatbots of the past. Creating assets for marketing will be mostly automated. Most of the ads you see will be automatically generated and personalized. Creating assets for video games will be mostly automated. And this is with the technology we have today.”

More TechCrunch

TechCrunch has kept readers informed regarding Fearless Fund’s courtroom battle to provide business grants to Black women. Today, we are happy to announce that Fearless Fund CEO and co-founder Arian…

Fearless Fund’s Arian Simone coming to Disrupt 2024

Bridgy Fed is one of the efforts aimed at connecting the fediverse with the web, Bluesky and, perhaps later, other networks like Nostr.

Bluesky and Mastodon users can now talk to each other with Bridgy Fed

Zoox, Amazon’s self-driving unit, is bringing its autonomous vehicles to more cities.  The self-driving technology company announced Wednesday plans to begin testing in Austin and Miami this summer. The two…

Zoox to test self-driving cars in Austin and Miami 

Called Stable Audio Open, the generative model takes a text description and outputs a recording up to 47 seconds in length.

Stability AI releases a sound generator

It’s not just instant-delivery startups that are struggling. Oda, the Norway-based online supermarket delivery startup, has confirmed layoffs of 150 jobs as it drastically scales back its expansion ambitions to…

SoftBank-backed grocery startup Oda lays off 150, resets focus on Norway and Sweden

Newsletter platform Substack is introducing the ability for writers to send videos to their subscribers via Chat, its private community feature, the company announced on Wednesday. The rollout of video…

Substack brings video to its Chat feature

Hiya, folks, and welcome to TechCrunch’s inaugural AI newsletter. It’s truly a thrill to type those words — this one’s been long in the making, and we’re excited to finally…

This Week in AI: Ex-OpenAI staff call for safety and transparency

Ms. Rachel isn’t a household name, but if you spend a lot of time with toddlers, she might as well be a rockstar. She’s like Steve from Blues Clues for…

Cameo fumbles on Ms. Rachel fundraiser as fans receive credits instead of videos  

Cartwheel helps animators go from zero to basic movement, so creating a scene or character with elementary motions like taking a step, swatting a fly or sitting down is easier.

Cartwheel generates 3D animations from scratch to power up creators

The new tool, which is set to arrive in Wix’s app builder tool this week, guides users through a chatbot-like interface to understand the goals, intent and aesthetic of their…

Wix’s new tool taps AI to generate smartphone apps

ClickUp Knowledge Management combines a new wiki-like editor and with a new AI system that can also bring in data from Google Drive, Dropbox, Confluence, Figma and other sources.

ClickUp wants to take on Notion and Confluence with its new AI-based Knowledge Base

New York City, home to over 60,000 gig delivery workers, has been cracking down on cheap, uncertified e-bikes that have resulted in battery fires across the city.  Some e-bike providers…

Whizz wants to own the delivery e-bike subscription space, starting with NYC

This is the last major step before Starliner can be certified as an operational crew system, and the first Starliner mission is expected to launch in 2025. 

Boeing’s Starliner astronaut capsule is en route to the ISS 

TechCrunch Disrupt 2024 in San Francisco is the must-attend event for startup founders aiming to make their mark in the tech world. This year, founders have three exciting ways to…

Three ways founders can shine at TechCrunch Disrupt 2024

Google’s newest startup program, announced on Wednesday, aims to bring AI technology to the public sector. The newly launched “Google for Startups AI Academy: American Infrastructure” will offer participants hands-on…

Google’s new startup program focuses on bringing AI to public infrastructure

eBay’s newest AI feature allows sellers to replace image backgrounds with AI-generated backdrops. The tool is now available for iOS users in the U.S., U.K., and Germany. It’ll gradually roll…

eBay debuts AI-powered background tool to enhance product images

If you’re anything like me, you’ve tried every to-do list app and productivity system, only to find yourself giving up sooner than later because sooner than later, managing your productivity…

Hoop uses AI to automatically manage your to-do list

Asana is using its work graph to train LLMs with the goal of creating AI assistants that work alongside human employees in company workflows.

Asana introduces ‘AI teammates’ designed to work alongside human employees

Taloflow, an early stage startup changing the way companies evaluate and select software, has raised $1.3M in a seed round.

Taloflow puts AI to work on software vendor selection to reduce costs and save time

The startup is hoping its durable filters can make metals refining and battery recycling more efficient, too.

SiTration uses silicon wafers to reclaim critical minerals from mining waste

Spun out of Bosch, Dive wants to change how manufacturers use computer simulations by both using modern mathematical approaches and cloud computing.

Dive goes cloud-native for its computational fluid dynamics simulation service

The tension between incumbents and fintechs has existed for decades. But every once in a while, the two groups decide to put their competition aside and work together. In an…

When foes become friends: Capital One partners with fintech giants Stripe, Adyen to prevent fraud

After growing 500% year-over-year in the past year, Understory is now launching a product focused on the renewable energy sector.

Insurance provider Understory gets into renewable energy following $15M Series A

Ashkenazi will start her new role at Google’s parent company on July 31, after 23 years at Eli Lilly.

Alphabet brings on Eli Lilly’s Anat Ashkenazi as CFO

Tobiko aims to reimagine how teams work with data by offering a dbt-compatible data transformation platform.

With $21.8M in funding, Tobiko aims to build a modern data platform

In 1816, French physician René Laennec invented an instrument that allowed doctors to listen to the heart and lungs. That device — a stethoscope — eventually evolved from a simple…

Eko Health scores $41M to detect heart and lung disease earlier and more accurately

The number of satellites on low Earth orbit is poised to explode over the coming years as more mega-constellations come online. This will create new opportunities for bad actors to…

DARPA and Slingshot build system to detect ‘wolf in sheep’s clothing’ adversary satellites

SAP sees WalkMe’s focus on automating contextual, in-app support as bringing value to its own enterprise customers.

SAP to acquire digital adoption platform WalkMe for $1.5B

The National Democratic Alliance (NDA) has emerged victorious in India’s 2024 general election, but with a smaller majority compared to 2019. According to post-election analysis by Goldman Sachs, JPMorgan, CLSA,…

Modi-led coalition’s election win signals policy continuity in India — and spending cuts

Featured Article

A comprehensive list of 2024 tech layoffs

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according to independent layoffs tracker Layoffs.fyi. Companies like Tesla, Amazon, Google, TikTok, Snap and Microsoft have conducted sizable layoffs in the…

24 hours ago
A comprehensive list of 2024 tech layoffs