article thumbnail

Accelerating generative AI requires the right storage

CIO

In generative AI, data is the fuel, storage is the fuel tank and compute is the engine. All this data means that organizations adopting generative AI face a potential, last-mile bottleneck, and that is storage. Novel approaches to storage are needed because generative AI’s requirements are vastly different.

article thumbnail

EnCharge AI emerges from stealth with $21.7M to develop AI accelerator hardware

TechCrunch

EnCharge AI , a company building hardware to accelerate AI processing at the edge , today emerged from stealth with $21.7 Speaking to TechCrunch via email, co-founder and CEO Naveen Verma said that the proceeds will be put toward hardware and software development as well as supporting new customer engagements.

Hardware 216
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Inferencing holds the clues to AI puzzles

CIO

As with many data-hungry workloads, the instinct is to offload LLM applications into a public cloud, whose strengths include speedy time-to-market and scalability. Inferencing funneled through RAG must be efficient, scalable, and optimized to make GenAI applications useful.

article thumbnail

Prioritizing AI? Don’t shortchange IT fundamentals

CIO

Fundamentals like security, cost control, identity management, container sprawl, data management, and hardware refreshes remain key strategic areas for CIOs to deal with. But rapid hardware advances may mean CIOs need to budget for much shorter hardware refresh cycles in future to stay up to date.

article thumbnail

Microsoft acquires Fungible, a maker of data processing units, to bolster Azure

TechCrunch

In December, reports suggested that Microsoft had acquired Fungible, a startup fabricating a type of data center hardware known as a data processing unit (DPU), for around $190 million. ” A DPU is a dedicated piece of hardware designed to handle certain data processing tasks, including security and network routing for data traffic. .”

Azure 282
article thumbnail

ASUS unveils powerful, cost-effective AI servers based on modular design

CIO

That means hardware designed from the ground up for maximum performance, data center integration, AI development support, optimal cooling, and easy vertical and horizontal scaling. That architecture lets ASUS servers exploit the latest NVIDIA advances in GPUs, CPUs, NVME storage, and PCIe Gen5 interfaces.

article thumbnail

Addressing cloud waste: 4 steps to cloud computing cost optimization

CIO

This is accomplished by lowering the burden on an application’s underlying operating system and hardware to increase utilization of that hardware to do more work with the same resources or shut down/use fewer instances. Cloud Computing, Cloud Storage In the latter case, your cloud bill can see immediate savings.

Cloud 305