AI

How any SaaS company can monetize generative AI

Comment

Hydroelectricity meter with dials tracking the customer's usage
Image Credits: Image Source (opens in a new window) / Getty Images

Puneet Gupta

Contributor

Puneet Gupta is the CEO and co-founder of Amberflo.io. He was formerly a general manager at AWS.

More posts from Puneet Gupta

If you work in SaaS, you’ve likely already been part of a conversation at your company about how your customers can benefit with increased value from your products infused with generative AI, large language models (LLMs) or custom AI/ML models.

As you hash out your approach and draw up the product roadmap, I wanted to call out an important aspect — one that I couldn’t help but draw an analogy to the good ol’ California Gold Rush. Don’t show up to the gold rush without a shovel!

Similarly, don’t overlook the monetization aspect of your SaaS + AI. Factor it in at the outset and integrate the right plumbing at the start — not as an afterthought or post-launch.

Last year, I wrote about the inevitable shift to metered pricing for SaaS. The catalyst that would propel the shift was unknown at the time, but the foundational thesis was intact. No one could have predicted back then that a particular form of AI would serve to be that catalyst.

SaaS + AI — what got you here won’t get you there!

First thing to realize is that what is required is not merely a “pricing” change. It is a business model change. Traditionally, SaaS pricing has been a relatively lightweight exercise with a simple per seat model and a price point set sufficiently high above underlying costs to attain desired margins.

A pricing change would be a change in what you charge; for example, going from $79 per user/month to $99 per user/month. A monetization model change is a fundamental shift in how you charge, and with AI as a consumption vector, it inevitably requires a need for accurate metering and usage-based pricing models.

There’s already a handful of great examples of companies leveraging usage-based pricing to monetize AI, including OpenAI and all companies that provide foundational AI models and services, and the likes of Twilio, Snap, Quizlet, Instacart, and Shopify that are integrating with these services to offer customer-facing tooling.

Why usage-based pricing is a natural fit for generative AI

One challenge of monetizing generative AI is that the prompts and outputs vary in length, and the prompt/output size and resource consumption are directly related — with a larger prompt requiring greater resources to process and vice versa.

Adding to the complexity, one customer may use the tool sparingly while another could be generating new text multiple times daily for weeks on end, resulting in a much larger cost footprint. Any viable pricing model must account for this variability and scale accordingly.

On top of this, services like ChatGPT are themselves priced according to a usage-based model. This means that any tools leveraging ChatGPT or other models will be billed based on the usage; since the back-end costs of providing service are inherently variable, the customer-facing billing should be usage-based as well.

To deliver the most fair and transparent pricing, and enable frictionless adoption and user growth, companies should look to usage-based pricing. Having both elastic front-end usage and back-end costs position generative AI products as ideal fits with usage-based pricing. Here’s how to get started.

Meter front-end usage and back-end resource consumption

Companies leverage prebuilt or trained models from a plethora of companies and may further train them with their custom dataset and then incorporate them into their technology stack as features. To obtain complete visibility into usage costs and margins, each usage call (be it API or direct) to AI infrastructure should be metered to understand the usage (underlying cost footprint).

How much resources were consumed to service the request (e.g., token counts, duration, result size, frequency, and any other performance metrics)?

You may choose to do any of the following:

  • Add a markup over the underlying generative AI providers’ cost structure.
  • Price the customer-facing pricing plans on a tiered model based on volume consumption.
  • Create hybrid charge vectors (some that carry forward generative AI’s cost models, in addition to new vectors that are unique to your products or services).

By metering both the customer-facing charge vectors and the corresponding back-end consumption vectors, companies can create and iterate on usage-based pricing plans and have a real-time view into business KPIs like margin and costs, as well as technical KPIs like service performance and overall traffic. After creating the meters, deploy them to the solution or application where events are originating to begin tracking real-time usage.

Track usage, margins, and account health for all customers

Once the metering infrastructure is deployed, begin visualizing usage and costs in real time as usage occurs and customers leverage the generative services. Identify power users and lagging accounts and empower customer-facing teams with contextual data to provide value at every touchpoint.

Since generative AI services like ChatGPT use a token-based billing model, obtain granular token-level consumption information for each customer using your service. This helps to inform customer-level margins and usage for AI services in your products and is valuable intel going into sales and renewal conversations. Without a highly accurate and available real-time metering service, this level of fidelity into customer-level consumption, costs, and margins would not be possible.

Launch and iterate with flexible usage-based pricing

After deploying meters to track the usage and performance of the generative AI solution, the next step is to monetize this usage with usage-based pricing. Identify the value metrics that customers should be charged for.

For text generation, this could be the markup over tokens or underlying resources used or the total processing time to serve the response. For image generation, it could be the size of the input prompt, the resolution of the image generated, or the number of images generated. Commonly, the final pricing will be built from some combination of multiple factors like those described.

After creating the pricing plan and assigning to customers, real-time usage will be tracked by metering and will be rated and billed with the pricing engine. The on-demand invoice will need to be kept up-to-date, so at any time both the vendor or customers can view current usage and charges.

Integrate with your existing tools for next-generation customer success

Once metering is deployed and the billing service is configured, the final step is to integrate with third-party tools inside your organization to make usage and billing data visible and actionable. Integrate with CRM tooling to augment customer records with live usage data or help streamline support ticket resolution.

With real-time usage data being collected, integrate this system with finance and accounting tools for usage-based revenue recognition, invoice tracking, and other tasks.

The emergence of ChatGPT welcomed a new era of interest and investment in AI technology. Consequently, there is an ongoing boom of new products and services coming to market that integrate with ChatGPT and similar tools to deliver customer-facing generative AI capabilities, with ongoing discussions about pricing and go-to-market.

Don’t show up to the gold rush without a shovel. As you experiment with leveraging generative AI and building it into your applications, set up usage-based metering in parallel so that you have a deeper understanding of how your customers are using the application and where they are getting value.

From there, leverage these insights to build a transparent and fair business model that’s profitable at scale.

More TechCrunch

Welcome back to TechCrunch’s Week in Review — TechCrunch’s newsletter recapping the week’s biggest news. Want it in your inbox every Saturday? Sign up here. Over the past eight years,…

Fisker collapsed under the weight of its founder’s promises

What is AI? We’ve put together this non-technical guide to give anyone a fighting chance to understand how and why today’s AI works.

WTF is AI?

President Joe Biden has vetoed H.J.Res. 109, a congressional resolution that would have overturned the Securities and Exchange Commission’s current approach to banks and crypto. Specifically, the resolution targeted the…

President Biden vetoes crypto custody bill

Featured Article

Industries may be ready for humanoid robots, but are the robots ready for them?

How large a role humanoids will play in that ecosystem is, perhaps, the biggest question on everyone’s mind at the moment.

11 hours ago
Industries may be ready for humanoid robots, but are the robots ready for them?

VCs are clamoring to invest in hot AI companies, willing to pay exorbitant share prices for coveted spots on their cap tables. Even so, most aren’t able to get into…

VCs are selling shares of hot AI companies like Anthropic and xAI to small investors in a wild SPV market

The fashion industry has a huge problem: Despite many returned items being unworn or undamaged, a lot, if not the majority, end up in the trash. An estimated 9.5 billion…

Deal Dive: How (Re)vive grew 10x last year by helping retailers recycle and sell returned items

Tumblr officially shut down “Tips,” an opt-in feature where creators could receive one-time payments from their followers.  As of today, the tipping icon has automatically disappeared from all posts and…

You can no longer use Tumblr’s tipping feature 

Generative AI improvements are increasingly being made through data curation and collection — not architectural — improvements. Big Tech has an advantage.

AI training data has a price tag that only Big Tech can afford

Keeping up with an industry as fast-moving as AI is a tall order. So until an AI can do it for you, here’s a handy roundup of recent stories in the world…

This Week in AI: Can we (and could we ever) trust OpenAI?

Jasper Health, a cancer care platform startup, laid off a substantial part of its workforce, TechCrunch has learned.

General Catalyst-backed Jasper Health lays off staff

Featured Article

Live Nation confirms Ticketmaster was hacked, says personal information stolen in data breach

Live Nation says its Ticketmaster subsidiary was hacked. A hacker claims to be selling 560 million customer records.

1 day ago
Live Nation confirms Ticketmaster was hacked, says personal information stolen in data breach

Featured Article

Inside EV startup Fisker’s collapse: how the company crumbled under its founders’ whims

An autonomous pod. A solid-state battery-powered sports car. An electric pickup truck. A convertible grand tourer EV with up to 600 miles of range. A “fully connected mobility device” for young urban innovators to be built by Foxconn and priced under $30,000. The next Popemobile. Over the past eight years, famed vehicle designer Henrik Fisker…

1 day ago
Inside EV startup Fisker’s collapse: how the company crumbled under its founders’ whims

Late Friday afternoon, a time window companies usually reserve for unflattering disclosures, AI startup Hugging Face said that its security team earlier this week detected “unauthorized access” to Spaces, Hugging…

Hugging Face says it detected ‘unauthorized access’ to its AI model hosting platform

Featured Article

Hacked, leaked, exposed: Why you should never use stalkerware apps

Using stalkerware is creepy, unethical, potentially illegal, and puts your data and that of your loved ones in danger.

1 day ago
Hacked, leaked, exposed: Why you should never use stalkerware apps

The design brief was simple: each grind and dry cycle had to be completed before breakfast. Here’s how Mill made it happen.

Mill’s redesigned food waste bin really is faster and quieter than before

Google is embarrassed about its AI Overviews, too. After a deluge of dunks and memes over the past week, which cracked on the poor quality and outright misinformation that arose…

Google admits its AI Overviews need work, but we’re all helping it beta test

Welcome to Startups Weekly — Haje‘s weekly recap of everything you can’t miss from the world of startups. Sign up here to get it in your inbox every Friday. In…

Startups Weekly: Musk raises $6B for AI and the fintech dominoes are falling

The product, which ZeroMark calls a “fire control system,” has two components: a small computer that has sensors, like lidar and electro-optical, and a motorized buttstock.

a16z-backed ZeroMark wants to give soldiers guns that don’t miss against drones

The RAW Dating App aims to shake up the dating scheme by shedding the fake, TikTok-ified, heavily filtered photos and replacing them with a more genuine, unvarnished experience. The app…

Pitch Deck Teardown: RAW Dating App’s $3M angel deck

Yes, we’re calling it “ThreadsDeck” now. At least that’s the tag many are using to describe the new user interface for Instagram’s X competitor, Threads, which resembles the column-based format…

‘ThreadsDeck’ arrived just in time for the Trump verdict

Japanese crypto exchange DMM Bitcoin confirmed on Friday that it had been the victim of a hack resulting in the theft of 4,502.9 bitcoin, or about $305 million.  According to…

Hackers steal $305M from DMM Bitcoin crypto exchange

This is not a drill! Today marks the final day to secure your early-bird tickets for TechCrunch Disrupt 2024 at a significantly reduced rate. At midnight tonight, May 31, ticket…

Disrupt 2024 early-bird prices end at midnight

Instagram is testing a way for creators to experiment with reels without committing to having them displayed on their profiles, giving the social network a possible edge over TikTok and…

Instagram tests ‘trial reels’ that don’t display to a creator’s followers

U.S. federal regulators have requested more information from Zoox, Amazon’s self-driving unit, as part of an investigation into rear-end crash risks posed by unexpected braking. The National Highway Traffic Safety…

Feds tell Zoox to send more info about autonomous vehicles suddenly braking

You thought the hottest rap battle of the summer was between Kendrick Lamar and Drake. You were wrong. It’s between Canva and an enterprise CIO. At its Canva Create event…

Canva’s rap battle is part of a long legacy of Silicon Valley cringe

Voice cloning startup ElevenLabs introduced a new tool for users to generate sound effects through prompts today after announcing the project back in February.

ElevenLabs debuts AI-powered tool to generate sound effects

We caught up with Antler founder and CEO Magnus Grimeland about the startup scene in Asia, the current tech startup trends in the region and investment approaches during the rise…

VC firm Antler’s CEO says Asia presents ‘biggest opportunity’ in the world for growth

Temu is to face Europe’s strictest rules after being designated as a “very large online platform” under the Digital Services Act (DSA).

Chinese e-commerce marketplace Temu faces stricter EU rules as a ‘very large online platform’

Meta has been banned from launching features on Facebook and Instagram that would have collected data on voters in Spain using the social networks ahead of next month’s European Elections.…

Spain bans Meta from launching election features on Facebook, Instagram over privacy fears

Stripe, the world’s most valuable fintech startup, said on Friday that it will temporarily move to an invite-only model for new account sign-ups in India, calling the move “a tough…

Stripe curbs its India ambitions over regulatory situation