Dell AI

Dell Technologies & NVIDIA Collaborate On Full-Stack Generative AI Solutions

Context

Dell Technologies and NVIDIA are working together to reduce the complexity of building and deploying infrastructure for Generative AI. The two companies announced Project Helix earlier this year at Dell Technologies World, which Dell described as delivering full-stack solutions with technical expertise and pre-built tools based on Dell and NVIDIA infrastructure and software.

Dell and NVIDIA have announced the first concrete elements resulting from Project Helix. The companies are delivering validated designs for inference systems based on NVIDIA accelerators and software, a professional services offering to help enterprises embrace generative AI, and a new Dell Precision workstation for AI development.  

News: Generative AI Validated Designs

Validated designs allow IT organizations to take a recipe-driven approach to building infrastructure. Dell has a long history of enabling rapid technology adoption with validated designs, including Dell validated designs for analytics, HPC, and ORAN, among many others. Now Dell offers a set of validated designs for generative AI inference.

Dell and NVIDIA co-engineered the Dell Validated Design for Generative AI, providing a blueprint for building infrastructure for generative AI inference. The validated design is a range of pre-tested, proven configurations based on Dell PowerEdge servers combined with the appropriate NVIDIA accelerators.

Dell’s Validated Design for Generative AI with NVIDIA is available globally through traditional Dell channels. Dell also makes systems based on the validated design available through its APEX as-a-service offering.

Servers & Storage

The blueprint for generative AI provides a choice of server, either the Dell PowerEdge XE8640, PowerEdge XE9680 or PowerEdge R760xa. All of these servers are based on the latest generation Intel Xeon processor. Given AMD’s success in this space, it’s surprising that there doesn’t seem to be an AMD option. The servers in the validated designs support between four and eight NVIDIA Hopper H100 GPUs connected with NVIDIA’s NVLink technology.

Diagram showing various configuration options for Dell's Validated Design for Generative AI
Dell Validated Design Options for Generative AI

The storage options are broader than the server offerings. Dell is supporting its PowerScale filter and ECS object storage. Both systems support NVIDIA’s GPUDirect for increased performance and decreased latency when serving data to the GPUs within the cluster.

Based on NVIDIA’s Enterprise AI Software Stack

While generative AI is hardware intensive, it’s the software stack that makes the difference. Each Dell validated design for generative AI relies heavily on NVIDIA’s enterprise AI stack. This includes the following NVIDIA software elements:

NVIDIA AI Enterprise provides an end-to-end, cloud-native suite of AI and data analytics software.

  • Triton Inference Server for standardizing and accelerating AI model deployment and execution in production environments.
  • Triton Model Analyzer analyzes AI models to identify potential deployment issues, including latency and sufficiency of the hardware.
  • Faster Transformer technology, for optimized language processing.
  • NVIDIA NeMo Framework for building, customizing, and deploying generative AI models with billions of parameters.
  • Cluster Manager manages the provisioning and operation of AI nodes with an AI cluster.
Block diagram showing the elements of Dell & NVIDIA's validated reference design for GenAI
Dell/NVIDIA Validated Reference Design

News: Dell Precision AI Workstation

Dell’s Precision Workstations have long been popular with data scientists and AI researchers. As part of its recent announcement, Dell announced new Dell Precision AI Workstations.

The new workstations arrive with a mix of models based on AMD Threadripper and Intel Xeon processors and are equipped with up to four NVIDIA RTX 6000 GPUs. The new workstation models will be available through the usual Dell sales channels in August.

News: Professional Services for AI

Professional Services has always been central to Dell’s overall customer experience. The company offers services across a broad range of spaces, including cloud adoption, APEX, big data, business resiliency, and several digital transformation specialties. Dell has extended its services offerings to now include generative AI.

Dell’s professional services team works directly with its customers across the full lifecycle of an AI implementation. This includes working with the customer to create a generative AI strategy that identifies high-value use cases and following through with full-stack implementation services. Post-deployment, Dell professional services can stay involved to ensure operational efficiency, provide managed services, and even handle staff training.

Analysis

AI is everywhere. It’s nearly impossible to go a day without hearing about yet another way generative AI will change how enterprises operate. It’s not all hype; generative AI is disruptive. Generative AI will impact business processes and is already changing how enterprises think about digital transformation. More critically, AI is changing how IT practitioners think about infrastructure.

Given the constant bombardment of AI-focused news, it’s easy to forget how new the technology really is. The industry is still figuring out best practices. None of this technology is commodity. Any IT organization with a generative AI project will struggle with the cost and complexity of the solution, and anything that mitigates that is a benefit.

I’ve always been a fan of Dell’s validated designs, and I really like the new options for generative AI. Following Dell and NVIDIA’s blueprints removes much of the risk in building and deploying infrastructure for generative AI. Engaging Dell’s professional services team to help removes even more risk, coming as close as you can get to guaranteeing success. Anything that simplifies life for an IT practitioner, as Dell’s new offerings do, is goodness.

Disclosure: The author is an industry analyst, and NAND Research an industry analyst firm, that engages in, or has engaged in, research, analysis, and advisory services with many technology companies, which may include those mentioned in this article. The author does not hold any equity positions with any company mentioned in this article.