Startups

NeuReality lands $35M to bring AI accelerator chips to market

Comment

computer circuit board
Image Credits: CHRISTOPH BURGSTEDT/SCIENCE PHOTO LIBRARY / Getty Images

The growing demand for AI, particularly generative AI (i.e., AI that generates images, text and more), is supercharging the AI inferencing chip market. Inferencing chips accelerate the AI inferencing process, which is where AI systems generate outputs (e.g., text, images, audio) based on what they learned while “training” on a specific set of data. AI inferencing chips can be — and have been — used to yield faster generations from systems such as Stable Diffusion, which translates text prompts into artwork, and OpenAI’s GPT-3, which extends a few lines of prose into full-length poems, essays and more.

A number of vendors — both startups and well-established players — are actively developing and selling access to AI inferencing chips. There’s Hailo, Mythic and Flex Logix, to name a few upstarts. And on the incumbent side, Google’s competing for dominance with its tensor processing units (TPUs) while Amazon’s betting on Inferentia. But the competition, while fierce, hasn’t scared away firms like NeuReality, which occupy the AI chip inferencing market but aim to differentiate themselves by offering a suite of software and services to support their hardware.

On the subject, NeuReality today announced that it raised $35 million in a Series A funding round led by Samsung Ventures, Cardumen Capital, Varana Capital, OurCrowd and XT Hi-Tech with participation from SK Hynix, Cleveland Avenue, Korean Investment Partners, StoneBridge, and Glory Ventures. Co-founder and CEO Moshe Tanach tells TechCrunch that the tranche will be put toward finalizing the design of NeuReality’s flagship AI inferencing chip in early 2023 and shipping it to customers.

NeuReality was founded with the vision to build a new generation of AI inferencing solutions that are unleashed from traditional CPU-centric architectures and deliver high performance and low latency, with the best possible efficiency in cost and power consumption,” Tanach told TechCrunch via email. “Most companies that can leverage AI don’t have the funds nor the huge R&D that Amazon, Meta and other huge companies investing in AI have. NeuReality will bring AI tech to anyone who wants to deploy easily and affordably.”

NeuReality was co-founded in 2019 by Tzvika Shmueli, Yossi Kasus and Tanach, who previously served as a director of engineering at Marvell and Intel. Shmueli was formerly the VP of back-end infrastructure at Mellanox Technologies and the VP of engineering at Habana Labs. As for Kasus, he held a senior director of engineering role at Mellanox and was the head of integrations at semiconductor company EZchip.

From the start, NeuReality focused on bringing to market AI hardware for cloud data centers and “edge” computers, or machines that run on-premises and do most of their data processing offline. Tanach says that the startup’s current-generation product lineup, the Network Attached Processing Unit (NAPU), is optimized for AI inference applications, including computer vision (think algorithms that recognize objects in photos), natural language processing (text-generating and classifying systems) and recommendation engines (like the type that suggest products on e-commerce sites).

NeuReality’s NAPU is essentially a hybrid of multiple types of processors. It can perform functions like AI inferencing load balancing, job scheduling and queue management, which have traditionally been done in software but not necessarily very efficiently.

NeuReality
Image Credits: NeuReality

NeuReality’s NR1, an FPGA-based SKU within the NAPU family, is a network-attached “server on a chip” with an embedded AI inferencing accelerator along with networking and virtualization capabilities. NeuReality also offers the NR1-M module, a PCIe card containing an NR1 and a network-attached inference server, and a separate module — the NR1-S — that pairs several NR1-Ms with the NR1.

On the software side, NeuReality delivers a set of tools, including a software development kit for cloud and local workloads, a deployment manager to help with runtime issues and a monitoring dashboard.

“The software for AI inference [and] the tools for heterogeneous compute and automated flow of compilation and deployment … is the magic that supports our innovative hardware approach,” Tanach said. “The first beneficiaries of the NAPU technology are enterprises and cloud solution providers that need infrastructure to support their chatbots, voice bots, automatic transcriptions and sentiment analysis as well as computer vision use cases for document scans, defect detection, etc.  … While the world was focusing on the deep learning processor improvements, NeuReality focused on optimizing the system around it and the software layers above it to provide higher efficiency and a much easier flow to deploy inference.”

NeuReality, it must be noted, has yet to back up some of its performance claims with empirical evidence. It told ZDNet in a recent article that it estimates its hardware will deliver a 15x improvement in performance per dollar compared to the available GPUs and ASICs offered by deep learning accelerator vendors, but NeuReality hasn’t released validating benchmarking data. The startup also hasn’t detailed its proprietary networking protocol, a protocol that it has previously claimed is more performant than existing solutions.

Those items aside, delivering hardware at massive scale isn’t easy — particularly where it involves custom AI inferencing chips. But Tanach argues that NeuReality has laid the necessary groundwork, partnering with AMD-owned semiconductor manufacturer Xilinx for production and inking a partnership with IBM to work on hardware requirements for the NR1. (IBM, which is also a NeuReality design partner, previously said it’s “evaluating” the startup’s products for use in the IBM cloud.) NeuReality has been shipping prototypes to partners since May 2021, Tanach says.

According to Tanach, beyond IBM, NeuReality is working with Lenovo, AMD and unnamed cloud solution providers, system integrators, deep learning accelerator vendors and “inference-consuming” enterprises on deployments. Tanach declined, however, to reveal how many customers the startup currently has or what roughly it’s projecting in terms of revenue.

“We see that the pandemic is slowing companies down and pushing for consolidation between the many deep learning vendors. However, for us it doesn’t change anything, since late next year or sometime through 2024 inference deployment is expected to explode — and our technology is exactly the enabler and driver of that growth,” Tanach said. “The NAPU will bring AI for a broader set of less technical companies. It is also set to allow large-scale users such as ‘hyperscalers’ and next-wave data center customers to support their growing scale of AI usage.”

Ori Kirshner, the head of Samsung Ventures in Israel, added in an emailed statement: “We see substantial and immediate need for higher efficiency and easy-to-deploy inference solutions for data centers and on-premises use cases, and this is why we are investing in NeuReality. The company’s innovative disaggregation, data movement and processing technologies improve computation flows, compute-storage flows, and in-storage compute — all of which are critical for the ability to adopt and grow AI solutions.”

NeuReality, which currently has 40 employees, plans to hire 20 more over the next two fiscal quarters. To date, it’s raised $48 million in venture capital.

More TechCrunch

In 1816, French physician René Laennec invented an instrument that allowed doctors to listen to human hearts and lungs. That device — a stethoscope — eventually evolved from a simple…

Eko Health scores $41M to detect heart disease earlier and more accurately

The number of satellites on low Earth orbit is poised to explode over the coming years as more mega-constellations come online, and it will create new opportunities for bad actors…

DARPA and Slingshot build system to detect ‘wolf in sheep’s clothing’ adversary satellites

SAP sees WalkMe’s focus on automating contextual, in-app support as bringing value to its own enterprise customers.

SAP to acquire digital adoption platform WalkMe for $1.5B

The National Democratic Alliance (NDA) has emerged victorious in India’s 2024 general election, but with a smaller majority compared to 2019. According to post-election analysis by Goldman Sachs, JP Morgan,…

Modi-led coalition’s election win signals policy continuity in India – but also spending cuts

Featured Article

A comprehensive list of 2024 tech layoffs

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according to independent layoffs tracker Layoffs.fyi. Companies like Tesla, Amazon, Google, TikTok, Snap and Microsoft have conducted sizable layoffs in the…

15 hours ago
A comprehensive list of 2024 tech layoffs

Featured Article

What to expect from WWDC 2024: iOS 18, macOS 15 and so much AI

Apple is hoping to make WWDC 2024 memorable as it finally spells out its generative AI plans.

15 hours ago
What to expect from WWDC 2024: iOS 18, macOS 15 and so much AI

We just announced the breakout session winners last week. Now meet the roundtable sessions that really “rounded” out the competition for this year’s Disrupt 2024 audience choice program. With five…

The votes are in: Meet the Disrupt 2024 audience choice roundtable winners

The malicious attack appears to have involved malware transmitted through TikTok’s DMs.

TikTok acknowledges exploit targeting high-profile accounts

It’s unusual for three major AI providers to all be down at the same time, which could signal a broader infrastructure issues or internet-scale problem.

AI apocalypse? ChatGPT, Claude and Perplexity all went down at the same time

Welcome to TechCrunch Fintech! This week, we’re looking at LoanSnap’s woes, Nubank’s and Monzo’s positive milestones, a plethora of fintech fundraises and more! To get a roundup of TechCrunch’s biggest…

A look at LoanSnap’s troubles and which neobanks are having a moment

Databricks, the analytics and AI giant, has acquired data management company Tabular for an undisclosed sum. (CNBC reports that Databricks paid over $1 billion.) According to Tabular co-founder Ryan Blue,…

Databricks acquires Tabular to build a common data lakehouse standard

ChatGPT, OpenAI’s text-generating AI chatbot, has taken the world by storm. What started as a tool to hyper-charge productivity through writing essays and code with short text prompts has evolved…

ChatGPT: Everything you need to know about the AI-powered chatbot

The next few weeks could be pivotal for Worldcoin, the controversial eyeball-scanning crypto venture co-founded by OpenAI’s Sam Altman, whose operations remain almost entirely shuttered in the European Union following…

Worldcoin faces pivotal EU privacy decision within weeks

OpenAI’s chatbot ChatGPT has been down for several users across the globe for the last few hours.

OpenAI fixes the issue that caused ChatGPT outage for several hours

True Fit, the AI-powered size-and-fit personalization tool, has offered its size recommendation solution to thousands of retailers for nearly 20 years. Now, the company is venturing into the generative AI…

True Fit leverages generative AI to help online shoppers find clothes that fit

Audio streaming service TuneIn is teaming up with Discord to bring free live radio to the platform. This is TuneIn’s first collaboration with a social platform and one that is…

Discord and TuneIn partner to bring live radio to the social platform

The early victors in the AI gold rush are selling the picks and shovels needed to develop and apply artificial intelligence. Just take a look at data-labeling startup Scale AI…

Scale AI founder Alexandr Wang is coming to Disrupt 2024

Try to imagine the number of parts that go into making a rocket engine. Now imagine requesting and comparing quotes for each of those parts, getting approvals to purchase the…

Engineer brothers found Forge to modernize hardware procurement

Raspberry Pi has released a $70 AI extension kit with a neural network inference accelerator that can be used for local inferencing, for the Raspberry Pi 5.

Raspberry Pi partners with Hailo for its AI extension kit

When Stacklet’s founders, Travis Stanfield and Kapil Thangavelu, came out of Capital One in 2020 to launch their startup, most companies weren’t all that concerned with constraining cloud costs. But…

Stacklet sees demand grow as companies take cloud cost control more seriously

Fivetran’s Managed Data Lake Service aims to remove the repetitive work of managing data lakes.

Fivetran launches a managed data lake service

Lance Riedel and Nigel Daley both spent decades in search discovery, but it was while working at Pinterest that they began trying to understand how to use search engines to…

How a couple of former Pinterest search experts caught Biz Stone’s attention

GetWhy helps businesses carry out market studies and extract insights from video-based interviews using AI.

GetWhy, a market research AI platform that extracts insights from video interviews, raises $34.5M

AI-powered virtual physical therapy platform Sword Health has seen its valuation soar 50% to $3 billion.

Sword Health raises $130M and its valuation soars to $3B

Jeffrey Katzenberg and Sujay Jaswa, along with three general partners, manage $1.5 billion in assets today through their Build, Venture and Seed strategies.

WndrCo officially gets into venture capital with fresh $460M across two funds

The startup targets the middle ground between platforms that offer rigid templates, and those that facilitate a full-control approach.

Storyblok raises $80M to add more AI to its ‘headless’ CMS aimed at non-technical people

The startup has been pursuing a ground-up redesign of a well-understood technology.

‘Star Wars’ lasers and waterfalls of molten salt: How Xcimer plans to make fusion power happen

Sēkr, a startup that offers a mobile app for outdoor enthusiasts and campers, is launching a new AI tool for planning road trips. The new tool, called Copilot, is available…

Travel app Sēkr can plan your next road trip with its new AI tool

Microsoft’s education-focused flavor of its cloud productivity suite, Microsoft 365 Education, is facing investigation in the European Union. Privacy rights nonprofit noyb has just lodged two complaints with Austria’s data…

Microsoft hit with EU privacy complaints over schools’ use of 365 Education suite

Since the shock of Russia’s 2022 invasion of Ukraine, solar energy has been having a moment in Europe. Electricity prices have been going up while the investment required to get…

Samara is accelerating the energy transition in Spain one solar panel at a time