Featured Article

Is there a future in light-powered AI chips?

Photonics is proving to be a tough nut to crack

Comment

Central Computer Processor digital technology and innovations
Image Credits: Olemedia / Getty Images

The growing compute power necessary to train sophisticated AI models such as OpenAI’s ChatGPT might eventually run up against a wall with mainstream chip technologies.

In a 2019 analysis, OpenAI found that from 1959 to 2012, the amount of power used to train AI models doubled every two years, and that the power usage began rising seven times faster after 2012.

It’s already causing strain. Microsoft is reportedly facing an internal shortage of the server hardware needed to run its AI, and the scarcity is driving prices up. CNBC, speaking to analysts and technologists, estimates the current cost of training a ChatGPT-like model from scratch to be over $4 million.

One solution to the AI training dilemma that’s been proposed is photonic chips, which use light to send signals rather than the electricity that conventional processors use. Photonic chips could in theory lead to higher training performance because light produces less heat than electricity, can travel faster and is far less susceptible to changes in temperature and electromagnetic fields.

LightmatterLightOn, Luminous Computing, Intel and NTT are among the companies developing photonic technologies. But while the technology generated much excitement a few years ago — and attracted a lot of investment — the sector has cooled noticeably since then.

There are various reasons why, but the general message from investors and analysts studying photonics is that photonic chips for AI, while promising, aren’t the panacea they were once believed to be.

“One important insight for us has been that, for data center applications — and this is where most people think photonic AI compute would likely be applied — nobody is interested in inference or training alone,” said Christian Patze, the investment director at M Ventures, over email. M Ventures backed Celestial AI, one of the companies developing light-based AI accelerator chips. “A new technology, be it photonic or electronic, has to be able to work equally well for both applications, otherwise it’s not of interest for large customers in the space.”

Inference — that is, running trained models — is fairly expensive. According to one estimate, ChatGPT was costing OpenAI $100,000 per day at its peak, which works out to an eye-watering $3 million per month.

But on a more fundamental level, Patze isn’t convinced that “photonic compute” approaches are the best possible option for training. Photonic chips are physically larger than their electronic counterparts and difficult to mass-produce, for one, owing to the immaturity of photonic chip fabrication plants. Moreover, photonic architectures still largely rely on electronic control circuits, which can create bottlenecks.

“Where it concerns photonic chips, it very much depends on what the algorithm and workload is for the training and how those algorithms can be mapped onto photonic compute architectures,” Patze said. “People missed that AI compute is a lot more than doing [certain operations] efficiently and that mapping advanced AI models onto theoretically fast optical compute engines is not trivial — or rather very hard.”

The other issue with photonics chip tech, according to Patze, is the power required to convert data into a format that the chips can work with. With conventional, silicon-based hardware, data is transferred and stored in the digital domain, whereas in photonics, it’s analog. Converting between digital and analog necessitates dedicated converter components, which tend to be power hungry.

There’s yet another unresolved challenge in photonics: signal regeneration, or the process of regenerating optical signals degraded during transmission through photonic chips. Most chips perform signal regeneration through “optical to electrical to optical” conversion, in which a weak and distorted optical signal is detected and restored in a digital, electronic piece of hardware and then retransmitted to the optical domain. But this causes the signal to become distorted over time, leading to issues downstream.

For these reasons and others, Gartner analyst Anushree Verma, who studies emerging technologies and trends, believes that it’ll take more than eight years for photonic computing to become mainstream.

“If we think of an all-optical computing system, it needs a variety of components such as optical gates, optical switches, optical memories, optical interconnects and light sources,” Verma said. “Commercial availability of the critical components for photonic technology is still very far away for various reasons … The customer concentration as well as the technological challenges are not particularly hospitable to entrants.”

So is there a future in photonics for AI yet? Probably. It might just require a reframing.

Patze sees photonics helping to overcome one of the big bottlenecks in AI compute: delivering speedy access to large amounts of memory. For large AI models like OpenAI’s GPT-4, AI accelerator cards — think chips like Google’s TPUs or AWS’ Trainium — in server racks usually have tens or hundreds of cores and hundreds of gigabytes of memory that need to be interlinked. When strained with data, these classical electrical interconnects start to fail or become quite slow. But photonic interconnects don’t; they have far more capacity.

“Longer-distance connections at high bandwidth and low latency is exactly where photonic interconnects shine,” Patze said. “Hence, it’s not surprising that some companies are turning to those to overcome the data movement bottleneck in modern AI compute. The photonics interconnect approach also has much shorter development timelines, as photonic interconnect components and circuit design in principle are well known from the optical communication industry.”

Verma sees the photonics AI accelerator chip industry evolving in three stages over the next several years. First, she expects a wave of hybrid connectivity — silicon and photonics packaged together — followed by a flood of integrated platforms (the second phase), and then eventually full stack photonic computing as the third stage.

By 2027, Verma thinks that 10% of network switch deployments will have co-packaging optics, driven by the need for increasing bandwidth and reducing power consumption, up from a negligible percentage now.

“The primary challenge is the manufacturing cost reduction,” Verma said, “but the market will be driven by the need of increasing bandwidth and reducing power consumption, up from negligible now … Photonic computing is still in its very early stage.”

To her point, Mordor Intelligence predicts that the photonic integrated circuit market will be worth $26.42 billion by 2027.

More TechCrunch

Welcome to Startups Weekly — Haje‘s weekly recap of everything you can’t miss from the world of startups. Sign up here to get it in your inbox every Friday. Well,…

Startups Weekly: Drama at Techstars. Drama in AI. Drama everywhere.

Last year’s investor dreams of a strong 2024 IPO pipeline have faded, if not fully disappeared, as we approach the halfway point of the year. 2024 delivered four venture-backed tech…

From Plaid to Figma, here are the startups that are likely — or definitely — not having IPOs this year

Federal safety regulators have discovered nine more incidents that raise questions about the safety of Waymo’s self-driving vehicles operating in Phoenix and San Francisco.  The National Highway Traffic Safety Administration…

Feds add nine more incidents to Waymo robotaxi investigation

Terra One’s pitch deck has a few wins, but also a few misses. Here’s how to fix that.

Pitch Deck Teardown: Terra One’s $7.5M Seed deck

Chinasa T. Okolo researches AI policy and governance in the Global South.

Women in AI: Chinasa T. Okolo researches AI’s impact on the Global South

TechCrunch Disrupt takes place on October 28–30 in San Francisco. While the event is a few months away, the deadline to secure your early-bird tickets and save up to $800…

Disrupt 2024 early-bird tickets fly away next Friday

Another week, and another round of crazy cash injections and valuations emerged from the AI realm. DeepL, an AI language translation startup, raised $300 million on a $2 billion valuation;…

Big tech companies are plowing money into AI startups, which could help them dodge antitrust concerns

If raised, this new fund, the firm’s third, would be its largest to date.

Harlem Capital is raising a $150 million fund

About half a million patients have been notified so far, but the number of affected individuals is likely far higher.

US pharma giant Cencora says Americans’ health information stolen in data breach

Attention, tech enthusiasts and startup supporters! The final countdown is here: Today is the last day to cast your vote for the TechCrunch Disrupt 2024 Audience Choice program. Voting closes…

Last day to vote for TC Disrupt 2024 Audience Choice program

Featured Article

Signal’s Meredith Whittaker on the Telegram security clash and the ‘edge lords’ at OpenAI 

Among other things, Whittaker is concerned about the concentration of power in the five main social media platforms.

12 hours ago
Signal’s Meredith Whittaker on the Telegram security clash and the ‘edge lords’ at OpenAI 

Lucid Motors is laying off about 400 employees, or roughly 6% of its workforce, as part of a restructuring ahead of the launch of its first electric SUV later this…

Lucid Motors slashes 400 jobs ahead of crucial SUV launch

Google is investing nearly $350 million in Flipkart, becoming the latest high-profile name to back the Walmart-owned Indian e-commerce startup. The Android-maker will also provide Flipkart with cloud offerings as…

Google invests $350 million in Indian e-commerce giant Flipkart

A Jio Financial unit plans to purchase customer premises equipment and telecom gear worth $4.32 billion from Reliance Retail.

Jio Financial unit to buy $4.32B of telecom gear from Reliance Retail

Foursquare, the location-focused outfit that in 2020 merged with Factual, another location-focused outfit, is joining the parade of companies to make cuts to one of its biggest cost centers –…

Foursquare just laid off 105 employees

“Running with scissors is a cardio exercise that can increase your heart rate and require concentration and focus,” says Google’s new AI search feature. “Some say it can also improve…

Using memes, social media users have become red teams for half-baked AI features

The European Space Agency selected two companies on Wednesday to advance designs of a cargo spacecraft that could establish the continent’s first sovereign access to space.  The two awardees, major…

ESA prepares for the post-ISS era, selects The Exploration Company, Thales Alenia to develop cargo spacecraft

Expressable is a platform that offers one-on-one virtual sessions with speech language pathologists.

Expressable brings speech therapy into the home

The French Secretary of State for the Digital Economy as of this year, Marina Ferrari, revealed this year’s laureates during VivaTech week in Paris. According to its promoters, this fifth…

The biggest French startups in 2024 according to the French government

Spotify is notifying customers who purchased its Car Thing product that the devices will stop working after December 9, 2024. The company discontinued the device back in July 2022, but…

Spotify to shut off Car Thing for good, leading users to demand refunds

Elon Musk’s X is preparing to make “likes” private on the social network, in a change that could potentially confuse users over the difference between something they’ve favorited and something…

X should bring back stars, not hide ‘likes’

The FCC has proposed a $6 million fine for the scammer who used voice-cloning tech to impersonate President Biden in a series of illegal robocalls during a New Hampshire primary…

$6M fine for robocaller who used AI to clone Biden’s voice

Welcome back to TechCrunch Mobility — your central hub for news and insights on the future of transportation. Sign up here for free — just click TechCrunch Mobility! Is it…

Tesla lobbies for Elon and Kia taps into the GenAI hype

Crowdaa is an app that allows non-developers to easily create and release apps on the mobile store. 

App developer Crowdaa raises €1.2M and plans a US expansion

Back in 2019, Canva, the wildly successful design tool, introduced what the company was calling an enterprise product, but in reality it was more geared toward teams than fulfilling true…

Canva launches a proper enterprise product — and they mean it this time

TechCrunch Disrupt 2024 isn’t just an event for innovation; it’s a platform where your voice matters. With the Disrupt 2024 Audience Choice Program, you have the power to shape the…

2 days left to vote for Disrupt Audience Choice

The United States Department of Justice and 30 state attorneys general filed a lawsuit against Live Nation Entertainment, the parent company of Ticketmaster, for alleged monopolistic practices. Live Nation and…

Ticketmaster antitrust lawsuit could give new hope to ticketing startups

The U.K. will shortly get its own rulebook for Big Tech, after peers in the House of Lords agreed Thursday afternoon to pass the Digital Markets, Competition and Consumer bill…

‘Pro-competition’ rules for Big Tech make it through UK’s pre-election wash-up

Spotify’s addition of its AI DJ feature, which introduces personalized song selections to users, was the company’s first step into an AI future. Now, Spotify is developing an alternative version…

Spotify experiments with an AI DJ that speaks Spanish

Call Arc can help answer immediate and small questions, according to the company. 

Arc Search’s new Call Arc feature lets you ask questions by ‘making a phone call’