AI

Deepset raises $14M to help companies build NLP apps

Comment

Image Credits: Getty Images

Natural language processing (NLP), the field of AI that involves parsing text for tasks including summarization and generation, is a fast-growing technology. According to a 2021 survey from John Snow Labs and Gradient Flow, 60% of tech leaders indicated that their NLP budgets grew by at least 10% compared to 2020, while a third said that their spending climbed by more than 30%. Fortune Business Insights pegged the NLP market at $16.53 billion in 2020.

Against this backdrop, Deepset, the startup behind the open source NLP framework Haystack, today announced that it raised $14 million in a Series A investment led by GV with participation from Harpoon Ventures, System.One, Lunar Ventures and Acequia Capital. The capital infusion arrived alongside Deepset Cloud, a new subscription product for building NLP-powered software.

“Driven by [our] belief in open source, the Deepset team has … been contributing models and research outcomes to the open source NLP community [for years],” Rusic told TechCrunch via email. “Haystack, the company’s flagship open source product, was born out of the experiences, expertise and know-how gained while building NLP for large organizations and the need for a proper set of building blocks for scalable, API-driven NLP back-end applications.”

CEO Milos Rusic co-founded Deepset with Malte Pietsch and Timo Möller in 2018. Pietsch and Möller — who have data science backgrounds — came from Plista, an adtech startup, where they worked on products including an AI-powered ad creation tool.

Haystack lets developers build pipelines for NLP use cases. Originally created for search applications, the framework can power engines that answer specific questions (e.g., “Why are startups moving to Berlin?”) or sift through documents.

Haystack can also field “knowledge-based” searches that look for granular information on websites with a lot of data or internal wikis. Rusic says that Haystack has been used to automate risk management workflows at financial services companies, returning results for queries like “What is the business outlook?” and “How did revenues evolve in the past years?” Other organizations, like Alcatel-Lucent Enterprise, have leveraged Haystack to launch virtual assistants that recommend documents to field technicians.

Haystack
A screenshot of the Haystack interface. Image Credits: Haystack

According to Rusic, the goal with Haystack was to enable developers and product divisions to build modern, API-driven NLP apps successfully — and quickly. He notes that, while it’s often straightforward for a data science team to come up with a prototype, challenges can arise in transitioning from prototype to production. About 80% of AI projects — including NLP projects — never make it into production, according to a 2019 Gartner survey.

“[With Haystack,] development teams … are equipped with all the components to build a full-stack NLP application and are guided with the proper workflows … Modern NLP moves very fast, and it’s much easier to bridge the gap between the cutting-edge research and the actual production-ready technologies through open source,” Rusic said. “[Prebuilt NLP systems] are the basis [for Haystack] and often provide great results in pipelines without additional training. Customization, if needed, happens with end users and experts who provide feedback by testing and using new iterations of a [system] or a pipeline.”

But not every company chooses — or wishes — to go the DIY route. For those preferring a managed solution, there’s the aforementioned Deepset Cloud, which supports customers across the NLP service lifecycle. The service starts with experimentation — i.e., testing and evaluating an app, and adjusting it to a use case, and building a proof of concept — and ends with labeling and monitoring the app in production.

“All NLP services that are developed [with Deepset Cloud] can be used in any end application, simply by integrating an API,” Rusic said. “Example applications are NLP-driven enterprise search (think ‘modern Google-like’ search) and knowledge management.”

With the new financing secured ($15.6 million in total), Deepset aims to translate its open source success — thousands of organizations currently use Haystack — into increased revenue. Rusic says that the 30-person, Berlin, Germany-based company was bootstrapped and break-even before raising its first funding round in 2021, and now has large enterprise customers including Airbus.

“[With the new funding,] we’ll continue to build the open source Haystack NLP project — adding more features, making it even more straightforward for NLP-savvy back-end developers to create NLP services,” Rusic said. “[We’ll also] develop Deepset Cloud into a fully fledged enterprise software-as-a-service to build language-aware applications. This will include enabling more flexible workflows, more granular product lifecycle guidance, and offering essential and supplemental tools, like labeling and data integrations.”

More TechCrunch

Google DeepMind has taken the wraps off a new version AlphaFold, their transformative machine learning model that predicts the shape and behavior of proteins. AlphaFold 3 is not only more…

Google DeepMind debuts huge AlphaFold update and free proteomics-as-a-service web app

Close to a decade ago, brothers Aviv and Matteo Shapira co-founded a company, Replay, that created a video format for 360-degree replays — the sorts of replays that have become…

Controversial drone company Xtend leans into defense with new $40 million round

Usually, when something starts to rot, it gets pitched in the trash. But Joanne Rodriguez wants to turn the concept of rot on its head by growing fungus on trash…

Mycocycle uses mushrooms to upcycle old tires and construction waste

Mushrooms continue to be a big area for alternative proteins. Canada-based Maia Farms recently raised $1.7 million to develop a blend of mushroom and plant-based protein using biomass fermentation. There’s…

Meati Foods bites into another $100M amid growth to 7,000 retail locations

Cleaning the outside of buildings is a dirty job, and it’s also dangerous. Lucid Bots came on the scene in 2018 with its Sherpa line of drones to clean windows…

Lucid Bots secures $9M for drones to clean more than your windows

High interest rates and financial pressures make it more important than ever for finance teams to have a better handle on their cash flow, and several startups are hoping to…

Israeli startup Panax raises a $10M Series A for its AI-driven cash flow management platform

For the founders of Atlan, a data governance startup, data has always been at the heart of what they do, even before they launched the company. In fact, co-founders Prukalpa…

Atlan scores $105M for its data control plane, as LLMs boost importance of data

For decades, the Global Positioning System (GPS) has maintained a de facto monopoly on positioning, navigation and timing, because it’s cheap and already integrated into billions of devices around the…

Xona Space Systems closes $19M Series A to build out ultra-accurate GPS alternative

Kyle Kuzma is a lot of things. He’s a forward for the Washington Wizards NBA team and a 2020 NBA champion. He’s also a style icon — depending on who…

NBA champion Kyle Kuzma looks to bring his team mentality to Scrum Ventures

Lipids are fatty, waxy or oily compounds that, for instance, typically come in the form of fats and oils. As a result they are heavily used in the production of…

After a $20M Series A funding, Germany’s Insempra plans eco-friendly lipid production

Tesla CEO Elon Musk has said that lidar sensors are a “crutch” for autonomous vehicles. But his company has bought so many from Luminar that Tesla is now the lidar-maker’s…

Tesla is Luminar’s largest lidar customer

U.S. realty trust giant Brandywine Realty Trust has confirmed a cyberattack that resulted in the theft of data from its network. In a filing with regulators on Tuesday, the Philadelphia-based…

Brandywine Realty Trust says data stolen in ransomware attack

Rivian lost $1.45 billion in the first quarter, showing that its recent company-wide cost-cutting measures have a ways to go before it can approach profitability. The EV-maker brought in $1.2…

Rivian loses $1.45B as cost-cutting measures continue

Meta is rolling out an expanded set of generative AI tools for advertisers, after first announcing a set of AI features last October. Now, instead of only being able to…

Meta’s AI tools for advertisers can now create full new images, not just new backgrounds

On April 29, Senators Jon Ossoff (D-GA) and Marsha Blackburn (R-SC) proposed a bipartisan bill to protect children from online sexual exploitation. President Biden officially signed the REPORT Act into…

Biden signs bill to protect children from online sexual abuse and exploitation

The pandemic ushered in an e-bike boom. But like so many other pandemic trends, that boom didn’t last. The last year has seen e-bike startups VanMoof and Cake file for…

Bloom is reinventing how e-bikes are made in the US

At its iPad-focused event on Monday, Apple announced a new and improved Magic Keyboard, its keyboard accessory for iPad. The Magic Keyboard has been “completely redesigned” to be much thinner…

Apple unveils a new Magic Keyboard at iPad event

Apple isn’t yet ready to unveil its broader AI strategy — it’s saving that for its Worldwide Developer Conference in June — but the tech giant did make sure to…

Apple highlights AI features, including M4 neural engine, at iPad event

The New York Times Games announced on Tuesday that it’s launching a Wordle archive, offering subscribers access to more than 1,000 past Wordle puzzles. The company has started rolling out the Wordle…

NYT Games launches a Wordle archive with access to more than 1,000 past puzzles

Robert Kahn has been a consistent presence on the Internet since its creation — obviously, since he was its co-creator. But like many tech pioneers his resumé is longer than…

Crypto? AI? Internet co-creator Robert Kahn already did it … decades ago

Amazon is launching a new tool, Bedrock Studio, designed to let organizations experiment with generative AI models, collaborate on those models, and ultimately build generative AI-powered apps. Available in public…

Bedrock Studio is Amazon’s attempt to simplify generative AI app development

Featured Article

A comprehensive list of 2024 tech layoffs

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according to independent layoffs tracker Layoffs.fyi. Companies like Tesla, Amazon, Google, TikTok, Snap and Microsoft have conducted sizable layoffs in the first months of 2024. Smaller-sized…

23 hours ago
A comprehensive list of 2024 tech layoffs

Oyo, the Indian budget-hotel chain startup, is negotiating with investors to raise a new round of funding that could cut the Indian firm’s valuation to $3 billion or lower, three…

India’s Oyo, once valued at $10B, seeks new funding at 70% discount

Five takeaways from the indictment of Dmitry Yuryevich Khoroshev, the hacker who U.S. and U.K. authorities accuse of being the mastermind of the LockBit ransomware gang.

What we learned from the indictment of LockBit’s mastermind

Jumia’s revenue and gross merchandise volume showed growth despite a decrease in quarterly active customers, according to its Q1 2024 report. Revenue increased by 19% year-over-year (57% in constant currency)…

Jumia is back, growing total sales and orders in Q1 2024

Welcome to TechCrunch Fintech! This week, we’re looking at Mercury’s latest expansions, wallet-as-a-service startup Ansa’s raise and more! To get a roundup of TechCrunch’s biggest and most important fintech stories…

Inside Mercury’s competitive push into software and Ramp’s potential M&A targets

Today is Apple iPad Event day, and we bring you all the iPad goodness you can stand, including if some of the rumors are true of what’s coming, like a…

Here’s everything Apple just announced at its Let Loose event, including new iPad Pro with M4 chip, iPad Air, Apple Pencil and more

TikTok is suing the United States government in an effort to block a law that would ban TikTok if its parent company, ByteDance, fails to sell it within a year.…

TikTok sues the US government over law that could ban the app

Meta is encouraging more users to post to its X rival Threads. In its latest experiment, the company is providing an easy toggle for users to cross-post from Instagram to…

Threads is testing cross-posting from Instagram globally

Apple just updated its two high-end tablets: the iPad Air and the iPad Pro. While the entry-level iPad didn’t receive an update, the company lowered its price, too. And of…

Here’s Apple’s new iPad lineup