A broad new array of generative AI-focused tools for developers is available in Nvidia AI Enterprise 5.0.

Version 5.0 of Nvidia’s enterprise-spanning AI software platform will feature a smorgasbord of microservices designed to speed app development and provide quick ways to ramp up deployments, the company announced today at its GPU Technology Conference.

These microservices are provided as downloadable software containers used to deploy enterprise applications, Nvidia said in an official blog post. They’re split into two main categories: Nvidia NIM, which covers microservices related to deploying production AI models, and CUDA-X, for microservices like cuOpt, the company’s optimization engine.

For NIM microservices the focus is on deployment times for generative AI apps, which the company said can be reduced “from weeks to minutes” with its services. The microservices include Triton Inference Server for standardizing AI model deployment, and TensorRT-LLM to help optimize and define large language models, making it easier for companies to experiment with LLMs without having to delve into C++ or Nvidia CUDA. They’ll be accessible via Amazon SageMaker, Google Kubernetes Engine, and Microsoft Azure AI, and integrations with AI frameworks like Deepset, LangChain and LlamaIndex are also supported.

CUDA-X microservices, by contrast, are more focused on data preparation and model training, as well as tools to enable developers to tie their generative AI apps to business data, whether that’s numerical information, text, or images. Other microservices in this category are almost applications of their own, like Nvidia Riva for translation and speech AI, the aforementioned cuOpt for process and routing optimization, and Earth-2 for climate and weather simulations.

A host of further integrations is also coming to AI Enterprise 5.0, the company said.
Business data hosted on Box, Cloudera, Cohesity, Datastax and the like can be used in AI applications as of version 5.0, and Nvidia-powered hardware can be found in servers and PCs from most major vendors, including Dell, HPE and Lenovo.

Nvidia described the microservices as a new layer in its full-stack computing platform, connecting model developers with platform providers and enterprises and providing a standardized path for running custom AI models across clouds, data centers, workstations and PCs.

Nvidia’s AI Enterprise 5.0 is available for developers to tinker with for free as of now, and enterprise licenses can be purchased for $4,500 per GPU per year, or $1 per GPU per hour in the cloud.
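
To put the two quoted license prices side by side, here is a rough back-of-the-envelope sketch in Python. It uses only the article's figures ($4,500 per GPU per year and $1 per GPU per hour) and assumes they cover the software license alone, with no hardware or cloud instance charges, discounts, or minimum terms. The function names are hypothetical and exist only for illustration.

```python
# Rough comparison of the two license prices quoted in the article.
# Assumption: both figures are software-license costs only; hardware,
# cloud instance fees, discounts and minimum terms are ignored.

HOURS_PER_YEAR = 365 * 24        # 8,760 hours in a non-leap year

ANNUAL_LICENSE_USD = 4500.0      # per GPU per year (from the article)
CLOUD_RATE_USD = 1.0             # per GPU per hour (from the article)


def break_even_hours(annual_usd: float = ANNUAL_LICENSE_USD,
                     hourly_usd: float = CLOUD_RATE_USD) -> float:
    """GPU-hours per year at which hourly billing equals the annual price."""
    return annual_usd / hourly_usd


def break_even_utilization() -> float:
    """Fraction of the year one GPU must be in use to reach break-even."""
    return break_even_hours() / HOURS_PER_YEAR


def cheaper_option(gpu_hours_per_year: float) -> str:
    """Return 'hourly' or 'annual' for a given yearly usage of one GPU."""
    hourly_cost = gpu_hours_per_year * CLOUD_RATE_USD
    return "hourly" if hourly_cost < ANNUAL_LICENSE_USD else "annual"


if __name__ == "__main__":
    print(f"Break-even: {break_even_hours():.0f} GPU-hours per year")
    print(f"Utilization at break-even: {break_even_utilization():.1%}")
    print(f"2,000 hours per year: {cheaper_option(2000)}")
    print(f"Always on (8,760 hours): {cheaper_option(HOURS_PER_YEAR)}")
```

Under these assumptions the two prices meet at 4,500 GPU-hours per year, or a little over half of the year in use. Below that, hourly billing costs less; a GPU that runs around the clock is cheaper on the annual license.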