In Conversation: Extending HCI to the Edge

I recently sat with Nutanix’s Greg White, Senior Director of Product Marketing, to understand the challenges of deploying resources, including AI, to the edge. Greg also talked in-depth about how hyper-converged architectures (HCI) can be leveraged to address many of these challenges.
Research Note: HPE OpsRamp Enhancements

Hewlett Packard Enterprises introduced several enhancements to its OpsRamp solution at its recent HPE Discover event in Las Vegas that bolster its autonomous IT operations vision.
HPE Updates Partner Programs for AI

At HPE Discover 2024, HPE announced an ambitious new AI enablement program in collaboration with NVIDIA to boost profitability and deliver new revenue streams for partners. This program includes enhanced competencies and resources across AI, compute, storage, networking, hybrid cloud, sustainability, and HPE GreenLake offerings.
Research Note: NVIDIA AI Computing by HPE

At its HPE Discover event in Las Vegas, Hewlett Packard Enterprise and NVIDIA announced a collaborative effort to accelerate the adoption of generative AI across enterprises.
The collaboration, branded as NVIDIA AI Computing by HPE, introduces a portfolio of co-developed AI solutions and joint go-to-market integrations to address the complexities and barriers of large-scale AI adoption.
Research Note: VAST Data/Cisco AI Collaboration

VAST Data and Cisco announced a new collaboration to deliver a robust, high-performance AI data infrastructure that integrates seamlessly with an Ethernet-based AI fabric designed to handle data at an exabyte scale. Each company brings specialized expertise to create a unified, high-performance AI data platform.
Research Note: AMD Computex MI325x & MI350 Accelerator Announcements

At the 2024 Computex event in Taiwan, AMD CEO Lisa Su revealed details about AMD’s upcoming MI350 and MI325X accelerators, follow-ons to its current MI300x products, highlighting significant advancements in AI performance and memory capacity. The new products are positioned as key components in AMD’s strategy to lead the AI accelerator market.
Quick Take: Lenovo & Cisco Expand Relationship for AI Innovation

Ahead of Cisco Live, Lenovo and Cisco announced a global strategic partnership to deliver fully integrated infrastructure and networking solutions that accelerate digital transformation and AI innovation for businesses of all sizes.
Quick Take: IBM & AWS Collaborate on Responsible AI

At the IBM Think conference, IBM announced a collaboration with Amazon Web Services (AWS) to integrate the full suite of IBM’s watsonx AI and data platform with AWS services. The partnership helps streamline enterprise AI scaling through an open, hybrid approach with comprehensive governance.
Research Note: Red Hat Summit 2024 AI Announcements

At its recent Red Hat Summit, Red Hat announced several new products and enhancements, many of which simplify or enable the use of AI within the enterprise. These include its new OpenShift AI, OpenShift LightSpeed enhancements, and a new RHEL AI release.
Research Note: UALink Alliance & Accelerator Interconnect Specification

UALink is a new open standard designed to rival NVIDIA’s proprietary NVLink technology. It facilitates high-speed, direct GPU-to-GPU communication crucial for scaling out complex computational tasks across multiple graphics processing units (GPUs) or accelerators within servers or computing pods.
Research Note: IBM InstructLab LLM Tool Kit

At IBM’s 2024 Think conference in Boston, IBM Research unveiled InstructLab, developed in collaboration with Red Hat, to address the inefficiencies of existing training approaches by enabling collaborative, cost-effective model customization.
Quick Take: Microsoft Corp. and G42 Digital Investments in Kenya

Microsoft and G42 announced a substantial digital investment initiative in Kenya, partnering with the Republic of Kenya’s Ministry of Information, Communications, and the Digital Economy.
The initiative, supported by an initial $1 billion investment arranged by G42, aims to enhance Kenya’s digital infrastructure, AI capabilities, and connectivity, fostering the region’s digital transformation and economic growth.
Quick Take: CoreWeave $7.5B Debt Financing to Expand AI Infrastructure

GPU-cloud provider CoreWeave announced a definitive agreement for a $7.5 billion debt financing round. The funds will be used to expand CoreWeave’s high-performance computing infrastructure to meet the demands of existing contracts with enterprise customers and AI innovators.
Microsoft New Phi-3 Model Additions

This week at the Microsoft Build 2024 conference, the tech giant announced an exciting set of updates to its Phi-3 family of small, open models. The news includes the introduction of Phi-3-vision, a multimodal model that combines language and vision capabilities, providing developers with powerful tools for generative AI applications.
Quick Take: IBM and Salesforce Strategic Partnership

At IBM Think in Boston this week, IBM and Salesforce announced an expanded strategic partnership integrating IBM’s watsonx AI and Data Platform capabilities with the Salesforce Einstein 1 Platform. This collaboration gives enterprise customers great choice and flexibility in AI and data deployment.
Research Brief: Dell AI Factory

At NVIDIA GTC earlier this year, Dell announced a collaboration with NVIDIA to deliver the Dell AI Factory with NVIDIA that was heavily based on NVIDIA technology. At this year’s Dell Tech World, Dell went further, introducing its own Dell AI Factory, while also updating the Dell AI Factory with NVIDIA.
Research Note: Google Trillium TPU

The Trillium TPU, Google’s sixth-generation TPU, was announced at Google I/O. It promises unprecedented compute performance, memory capacity, and energy efficiency for generative AI training and inference.
Research Note: Palantir Q1 2024 Earnings

Panatir Technologies reported its Q1 2024 earnings, surpassed revenue expectations with its robust 21% year-over-year growth, achieving $634 million. While the company met consensus estimates on EPS, it delivered weaker-than-expected guidance.
Research Note: IBM Open Sources Granite Code Models

IBM is releasing its Granite code models to the open-source community, aiming to simplify coding for a broad range of developers. The Granite models, part of IBM’s wider initiative to harness AI in software development, range from 3 to 34 billion parameters and include base and instruction-following variants.
Quick Take: Panasas Rebrands as VDURA, Shifts to SaaS Model

Legacy parallel file system provider Panasas is transitioning from hardware sales to focusing on the public cloud under its new brand, VDURA. Traditionally, Panasas offered the PanFS software platform, which supports high-capacity drives and diverse connectivity options targeting the HPC market.
Research Report: Oracle Database Vector Search

Oracle introduced full support for vectors, including vector search, in its just-released Oracle Database 23ai. Known as AI Vector Search, this capability represents a significant advancement in how databases can store, index, and search data semantically.
Nvidia And Dell Build An AI Factory Together

At last month’s Nvidia GTC conference, Dell Technologies unveiled the Dell AI Factory with Nvidia, a comprehensive set of enterprise AI solutions aimed at simplifying the adoption and integration of AI for businesses.
Dell also announced enhancements to its flagship PowerEdge XE9680 server, including options for liquid-cooling, which allows the server to deliver the full capabilities of Nvidia’s newly announced AI accelerators.
Research Note: NVIDIA H100 Confidential Computing

This week, NVIDIA made its confidential computing capabilities for its flagship NVIDIA Hopper H100 GPU, previewed in August 2023, generally available. This makes NVIDIA’s H100 the first GPU with these capabilities, which are critical for protecting data as it is being processed.
This Research Note looks at confidential computing and how it works on the NVIDIA H100 GPU.
AWS And Nvidia Expand AI Relationship

At the Nvidia GTC event in San Jose, Nvidia and Amazon Web Services made a series of wide-ranging announcements that showed a broad and strategic collaboration to accelerate AI innovation and infrastructure capabilities globally.
The joint announcements included the introduction of Nvidia Grace Blackwell GPU-based Amazon EC2 instances, Nvidia DGX Cloud integration, and, most critically, a pivotal collaboration called Project Ceiba.
Quick Take: Apple’s OpenELM Small Language Model

Apple this week unveiled its OpenELM, a set of small AI language models designed to run on local devices like smartphones rather than rely on cloud-based data centers. This reflects a growing trend toward smaller, more efficient AI models that can operate on consumer devices without significant computational resources.
Research Note: Lenovo’s AMD-based AI Portfolio Update

Lenovo announced a comprehensive set of AMD-based updates to its AI infrastructure portfolio, which include GPU-rich and thermal-efficient systems designed for compute-intensive workloads in various industries, including financial services and healthcare.
The new offerings, designed in partnership with AMD, address the growing demand for compute-intensive workloads across industries, providing the flexibility and scalability required for AI deployments.
Research Note: NVIDIA Acquires Run:AI

NVIDIA announced an agreement to acquire Run:ai, a startup specializing in chip management and orchestration software based on Kubernetes. The acquisition is part of CEO Jensen Huang’s strategy to diversify Nvidia’s revenue streams from chips to software.
Quick Take: Vultr Launches Global Inference Cloud

Vultr recently announced the launch of Vultr Cloud Inference, a new serverless platform aimed at transforming AI scalability and reach. The new solution facilitates AI model deployment and inference capabilities worldwide.
Research Note: Microsoft Phi-3 Small Language Model
Microsoft recently announced its Phi-3 small language models (SLMs), designed to deliver powerful performance at a reduced cost. These SLMs offer a compelling option for developers and businesses looking to harness the potential of generative AI.
This Research Note looks at what Microsoft announced.
Research Note: AWS Bedrock GenAI Enhancements

Amazon announced updates to its Bedrock generative AI platform that expands its capabilities while improving the user experience. These enhancements focus on helping developers create AI applications quickly and securely.
Quick Take: SAP’s New Business AI Capabilities
At the recent NVIDIA GTC event, SAP and NVIDIA announced an expanded partnership to enhance generative AI integration across SAP’s cloud solutions and applications. The collaboration focuses on developing SAP Business AI, which integrates scalable, business-specific generative AI capabilities within various SAP offerings, including SAP Datasphere, and SAP BTP.
Quick Take: VAST Data’s Nvidia DPU-Based AI Cloud Architecture

VAST Data recently introduced a new AI cloud architecture based on Nvidia’s BlueField-3 DPU technology. The architecture is designed to improve performance, security, and efficiency for AI data services. The approach seeks to enhance data center operations and introduce a secure, zero-trust environment by integrating storage and database processing into AI servers.
Research Note: Intel Gaudi 3

Intel announced its long-anticipated new Intel Gaudi 3 AI accelerator at its Intel Vision event. The new accelerator offers significant improvements over the previous generation Gaudi 3 processor and promises to challenge Nvidia’s current generation accelerators in training and inference for LLMs and multimodal models.
Research Note; Arm Ethos U-65 microNPU

Arm introduced its new Ethos-U65 microNPU (Neural Processing Unit). This state-of-the-art AI accelerator facilitates machine learning (ML) inference in many embedded systems and high-performance devices.
Quick Take: Wasabi AiR Intelligent Media Storage

Wasabi AiR integrates artificial intelligence to transform how video content is stored, accessed, and utilized. The new offering combines the cost-effectiveness and high performance of Wasabi’s object storage with sophisticated AI capabilities, including automatic metadata tagging and multilingual searchable speech-to-text transcription.
Quick Take: Hammerspace Hyperscale NAS For AI & HPC

Hammerspace unveiled its new high-performance NAS architecture, Hyperscale NAS, to cater to the growing demands of enterprise AI, machine learning, deep learning initiatives, and the increasing use of GPU computing both on-premises and in the cloud.
Quick Take: Datastax Acquires Langflow

DataStax announced the acquisition of Logspace, the company behind Langflow, a low-code tool for building applications based on Retrieval-Augmented Generation (RAG). The terms of the deal were not disclosed.
Research Note: MLPerf Inference 4.0 Results
MLCommons released the results of its MLPerf Inference v4.0 benchmarks, which introduced two new workloads, Llama 2 and Stable Diffusion XL.
Since its inception in 2018, MLPerf has established itself as a crucial benchmark in the accelerator market. The benchmarks offer detailed comparisons across a variety of system configurations for specific use cases.
Research Note: Databricks DBRX LLM

Databricks launched DBRX, a new open, general-purpose Large Language Model (LLM) that sets a new benchmark for performance and efficiency.
DBRX surpasses the capabilities of existing models like GPT-3.5 while also demonstrating competitive performance with closed models such as Gemini 1.0 Pro, making it a formidable player in general-purpose applications and specialized coding tasks.
Is NVIDIA Lagging in Lucrative Automotive Segment?

Nvidia’s most recent earnings release is a tremendous achievement for the company, with reported revenue of $22.1 billion, up an incredible 265% year-on-year. Earnings grew an equally unbelievable 765% year-on-year.
Its automotive revenue was $281 million.
Research Note: Dell AI Factory with NVIDIA

At the 2024 NVIDIA GTC conference, Dell Technologies unveiled the Dell AI Factory with Nvidia, a comprehensive set of enterprise AI solutions aimed at simplifying the adoption and integration of AI for businesses.
Dell also announced enhancements to its flagship PowerEdge XE9680 server, including introducing Dell’s first liquid-cooled server solution, which allows the server to deliver the full capabilities of NVIDIA’s newly announced AI accelerators.
Research Note: NVIDIA & AWS’s Broad AI-Focused Collaboration

At the Nvidia GTC event in San Jose, Nvidia and Amazon Web Services made a series of wide-ranging announcements that showed a broad and strategic collaboration to accelerate global AI innovation and infrastructure capabilities.
The joint announcements included the introduction of NVIDIA Grace Blackwell GPU-based Amazon EC2 instances, NVIDIA DGX Cloud integration, and, most critically, a pivotal collaboration called Project Ceiba.
Quick Take: VAST Data and Supermicro Collaborate on Scalable AI Solution

Supermicro/VAST Data’s new solution provides innovative parallel architecture and unified global namespace ensure optimal GPU utilization, scalability, and smooth data access from edge to cloud, eliminating the usual trade-offs between performance and capacity.
Quick Take: IBM & the GSMA’s Collaboration for GenAI Adoption in Telecom

The GSMA and IBM recently announced a significant new collaboration to promote the adoption and development of generative artificial intelligence (AI) skills within the telecom industry.
This partnership launches through two main initiatives: the GSMA Advance’s AI Training program and the GSMA Foundry Generative AI program.
Quick Take: Juniper Network’s AI-Native Networking Platform

Juniper Networks announced its AI-Native Networking Platform, designed to fully integrate AI into network operations to enhance experiences for users and operators. The platform, a first in the industry, is built to use AI to make network connections more reliable, secure, and measurable.
Quick Take: WEKA Brings Data Platform to NexGen Cloud

WekaIO is partnering with NexGen Cloud, the leading UK-based sustainable infrastructure-as-a-service provider, to establish a high-performance foundation for NexGen Cloud’s upcoming AI Supercloud. This Supercloud and NexGen Cloud’s Hyperstack GPU-as-a-Service platform will leverage WEKA’s technology.
Quick Take: Oracle’s OCI Generative AI Service

Oracle announced the general availability of its OCI Generative AI Service, along with several substantial enhancements to its data science and cloud offerings. Let’s take a look at what Oracle announced.
Research Note: Inside Juniper’s Next-Gen AI Networking

Juniper Networks introduced its AI-Native Networking Platform, designed to fully integrate AI into network operations to enhance experiences for users and operators. This platform, a first in the industry, is built to use AI to make network connections more reliable, secure, and measurable.
Research Note: Inside Intel’s 4Q 2023 Data Center & AI Earnings

The latest earnings release from Intel Corporation offers a comprehensive overview of the company’s current trajectory and outlook, underscoring significant strides in its ambitious IDM 2.0 transformation.
This Research Note focuses primarily on Intel’s Data Center & AI Group (DCAI) and those elements that impact enterprise infrastructure.
Equinix Launches Fully Managed NVIDIA DGX AI

Equinix launched a fully managed private cloud service to facilitate enterprises’ acquisition and management of NVIDIA DGX AI supercomputing infrastructure. This service is aimed at helping businesses build and run custom generative AI models.
Research Note: Oracle’s New GenAI Features for Oracle Cloud

Oracle announced the general availability of its OCI Generative AI Service, along with several substantial enhancements to its data science and cloud offerings, including the beta release of new genAI agents.
IBM And Meta Launch an AI Alliance for Safe AI

Created by IBM and Meta, the AI Alliance is a testament to the belief that open and transparent innovation is crucial for harnessing AI advancements in a way that prioritizes safety, diversity, and widespread economic opportunity.
Research Note: HPE Intends to Acquire Juniper Networks

Hewlett Packard Enterprise (HPE) has announced its definitive agreement to acquire Juniper Networks, Inc., a leader in AI-native networks, for approximately $14 billion in an all-cash transaction, a premium of about 32% over Juniper’s closing stock price on the day the deal was announced.
Research Note: Intel “Emerald Rapids” 5th Generation Xeon Processor
Intel unveiled its new Emerald Rapids processors, part of its 5th-Gen Xeon Scalable lineup at its recent “AI Everywhere” event. The new processors arrive with multiple features designed to enhance performance across workloads, including AI and HPC.
IBM Introduces watsonx.governance for AI Governance

IBM’s focus on enterprise-readiness continues with its announcement of watsonx.governance, designed to assist business in managing and governing AI models, will be generally available in early December.
Dell Partners with Hugging Face & Meta for Enterprise AI

Dell and Hugging Face are working together to aid enterprise adoption of AI by providing a platform where businesses can easily select, deploy, and fine-tune AI models for their specific use cases using Dell’s infrastructure.
Research Note: AMD’s new MI300X & MI300A AI Accelerators

AMD recently announced its new MI300x and MI300a AI accelerators, its most significant challenge to NVIDIA yet.
Research Note: AMD ROCm 6 Software Stack

At its AI-focused event on December 6, Advanced Micro Devices (AMD) introduced the latest release of its ROCm software stack, marking a significant update in the field of high-performance computing (HPC) and artificial intelligence (AI). The new release introduces enhancements that align closely with the company’s latest hardware, notably the MI300 series GPUs.
Research Note: Microsoft’s New Processor & Accelerator

At its recent Microsoft Ignite event, the company announced the launch of the Azure Cobalt 100 CPU and the Maia 100 AI accelerator, marking a significant pivot in Microsoft’s approach to its cloud infrastructure and representing the company’s commitment to driving innovation in high-performance computing and AI.
Research Note: Elastic Fiscal Q2 2024 Earnings

Elastic this week released its fiscal Q2 2024 earnings, handily beating consensus estimates for both revenue and EPS. This Research Note explores Elastic’s earnings for the quarter, taking a special look at the impact of generative AI on the company’s business.
Research Note: Lenovo Fiscal Q2 2024 Earnings

This Research Note explores Lenovo Group Limited’s financial and operational performance across its distinct business units in its Fiscal Q2 2024. Focusing on the Solutions and Services Group (SSG), Infrastructure Solutions Group (ISG), and Intelligent Device Group (IDG), looking at each business unit’s contributions to Lenovo’s overall business strategy.
IBM Launches $500M AI Venture Fund

IBM this week unveiled a $500 million venture fund to invest in AI companies, ranging from early-stage startups to high-growth enterprises.
Research Note: Inside the IBM Research NorthPole Accelerator

IBM Research has developed and released details on a groundbreaking AI chip called NorthPole, which could revolutionize AI hardware systems. Unlike traditional computer chips, NorthPole integrates processing units and memory on the same chip, eliminating the von Neumann bottleneck and significantly improving efficiency.
Research Note: Databricks Acquires Arcion

Databricks announced its intention to acquire Arcion, an enterprise data replication specialist and a part of the Databricks Ventures portfolio. The acquisition, valued at over $100 million, is set to bolster Databricks’ capability to natively ingest data from a myriad of databases and SaaS applications into their Lakehouse Platform.
Lenovo Relentlessly Focused on AI at Tech World 2023

Lenovo’s Global Tech World is underway in Austin this week where the company shows off the latest innovations from its wide range of product teams. Despite its expansive portfolio, the clear focus for Lenovo is on capturing mindshare around enterprise AI while also riding the wave propelling most of the growth in the IT industry.
IBM Acquires Manta, Bolsters Watson.x

IBM announced the acquisition of Manta Software for an undisclosed amount. Manta is a leading data lineage platform that allows enterprises to understand their data better.
VAST Data to Provide Data Infrastructure for Lambda

Lambda and VAST Data have engaged in a new strategic partnership that brings the VAST Data Platform to Lambda.
Confluent Introduces Partner-Driven Real-Time AI Initiative + New AI Tools

Confluent, a leader in data streaming and steward of the open-source Apache Kafka system, recently announced a new initiative called “Data Streaming for AI” to aid organizations in developing real-time AI applications. It also introduced new AI-enabled tools.
ServiceNow Introduces Now Assist, Bringing GenAI to its Now Platform

ServiceNow unveiled its new “Now Assist” family of solutions earlier this month as part of a major expansion of its powerful Now Platform, introducing new generative AI capabilities for IT Service Management, Customer Service Management, HR Service Delivery, and creators.
OpenText Releases Aviator AI Tools for Business & Technologists

OpenText unveiled a broad expansion of its enterprise-class Aviator platform with new AI capabilities spanning business and technology needs. Aviator is OpenText’s AI platform, designed to enable organizations to leverage AI for various applications without moving their data.
Google Brings Vertex AI Search to Healthcare

Google Cloud introduced enhancements to its Vertex AI Search tailored for healthcare and life sciences organizations. The solution enables the application of medical-tuned generative AI search on diverse data sources, including clinical data like FHIR data and clinical notes.
Inside Microsoft’s New Cloud-Based Data & AI Solutions for Healthcare

Microsoft announced its first industry-specific application of its Fabric analytics platform, introducing new healthcare-focused AI models, and releasing new AI-driven tools to simplify the clinician experience.
Dell’s Validated Design for Generative AI + New Tools & Services

Dell Technologies announced a significant expansion of its Generative AI Solutions portfolio to empower businesses embarking on their GenAI journeys. This expansion delivers advanced infrastructure, professional services, and a collaborative data solution to help organizations derive intelligence from their data securely and efficiently.
IBM Releases Granite Foundation Models for Enterprise AI

IBM takes a big step towards addressing enterprise requirements for generative AI with its new Granite Foundation Models. Let’s look at what IBM announced.
VAST Data Powers the Infrastructure Behind CoreWeave’s AI Cloud

CoreWeave, a leading specialty cloud provider for AI systems, announced that VAST Data has been selected to provide the data infrastructure underpinning CoreWeave’s offerings.
Quick Take: Amazon Invests $4B in AI Model Developer Anthropic

Amazon has invested $4 billion in model developer Anthropic, with Amazon Web Services (AWS) becoming its primary cloud provider.
Research Note: Inside IBM WatsonX

IBM recently unveiled its updated AI portfolio, “watsonx” The new set of offerings provides an enterprise-ready AI and data platform consisting of three intertwined solution stacks:
Google Cloud Introduces New Vertex AI Capabilities

Google introduced its Vertex AI machine learning (ML) platform in 2021 to allow its customers to manage the entire ML lifecycle, from model development to deployment. Vertex AI combines data engineering, data science, and ML engineering workflows, enabling teams to collaborate using a common toolset and scale your application. Last week at Google’s Google Cloud […]
Marvell Sees Momentum in Cloud, AI, and Automotive

Last week, Marvell released its earnings for the second quarter of its fiscal 2024, demonstrating robust performance with $1.34 billion in top-line revenue. While that number was down year-over-year, it surpassed the midpoint of the company’s guidance. Insight: Growth in Cloud & AI Marvell’s data center business was a bright spot. Revenue from that business […]
Nutanix Introduces new Generative AI-in-a-Box

Earlier this month, Nutanix announced its new GPT-in-a-Box™ solution, which promises to simplify the implementation of Generative Pre-trained Transformer (GPT) capabilities. Nutanix’s solution aims to help organizations harness the power of AI while maintaining robust control over their data and applications. New: Nutanix GPT-in-a-Box Nutanix’s new GPT-in-a-Box brings together the elements necessary to deploy a […]
Dynatrace Beats Earnings, Acquires A Company & Embraces Generative AI

Over the past month Dynatrace has beat earnings, announced the acquisition of Rookout, a developer-focused observability start-up, and is bringing new generative AI features to its a Davis AI engine.
Dell Technologies & NVIDIA Collaborate On Full-Stack Generative AI Solutions

Dell & NVIDIA deliver new validated designs for inference systems based on NVIDIA accelerators and software, a professional services offering to help enterprises embrace generative AI, and a new Dell Precision workstation for AI development.
HPE Brings AI-as-a-Service to the Enterprise with Its GreenLake for LLMs

Hewlett Packard Enterprise has always taken a decidedly infrastructure-centric approach to delivering flexible consumption-based compute, storage, and networking to enterprise IT. HPE provides that flexibility through its popular HPE GreenLake offerings. Last week at its annual HPE Discover Event in Las Vegas, HPE introduced the first of what the company says will be many domain-specific […]
MongoDB Embraces AI & Reduces Developer Friction With New Features

Servicing the quickly evolving needs of modern application development requires rapid innovation and fast product cycles. MongoDB demonstrated both last week at its MongoDB.local 2023 event in New York City, introducing a broad set of new features and services. The announcements cover a wide breadth of territory, with new capabilities to leverage the latest AI […]
Elastic’s Elasticsearch Relevance Engine Enables Generative AI Search
Background The challenge for an enterprise wanting to harness the power of large language models (LLMs) is that a language model is only as capable as the data it’s trained on and understands. This hampers the ability to leverage the technology to solve real-world business problems. LLMs become infinitely more powerful when deeply integrated with […]
The Innovative Cooling Approach Behind NVIDIA’s $5M COOLERCHIPS Grant
Background Cooling a data center was a challenge even before the current AI-driven boom in accelerated computing heated up. Servers run hot, with processor thermal designs reaching 500 watts by 2025. Add GPUs to the mix, some of which approach 700W today, and the problems of power consumption and heat dissipation begin to expand exponentially. […]
Nutanix’s Project Beacon Lights The Way To A Unified Hybrid Multi-Cloud Future
Background In the hybrid-cloud world, every infrastructure has a distinct way of managing workloads. Every cloud, even on-prem consumption-based offerings, forces a different control plane. Unfortunately, for all the flexibility that the cloud enables, that flexibility is offset by increased complexity. This is a problem facing nearly every enterprise IT shop. Nutanix’s Cloud Manager offering […]
NVIDIA Grows Momentum in Public Cloud
NVIDIA lives at the center of the AI revolution. Its GPUs are the most common, and most powerful. Beyond its hardware, NVIDIA is enabling the adoption of AI with software tools that span the gamut from edge inference to autonomous driving to medical imaging. The list truly is limitless.
NVIDIA Updates Data Center Platform Strategy at GTC 2023
It’s become clear that NVIDIA strives to own the entire platform for AI-infused analytics. The time is right, as accelerated AI is fueling a shift in how enterprises derive value from their data and how businesses operate and engage with their customers.
Dynatrace Blends AI, Automation & Observability With New Offerings
Understanding what’s happening across that infrastructure is a difficult challenge — predicting what may happen next in that infrastructure? That seems impossible.