submitted by /u/Denis_Vo

NVIDIA AI Enterprise is a suite of AI software, certified to run on VMware vSphere 7 Update 2 with NVIDIA-Certified volume servers. It includes key enabling technologies and software from NVIDIA for rapid deployment, management and scaling of AI workloads in the virtualized data center running on VMware vSphere. The NVIDIA AI Enterprise suite also enables IT Administrators, Data Scientists, and AI Researchers to quickly run NVIDIA AI applications and libraries optimized for GPU acceleration by reducing deployment time and ensuring reliable performance.
The NVIDIA AI Enterprise suite is licensed and supported by NVIDIA. Since the joint announcement at VMworld in September 2020, NVIDIA and VMware have continued to improve the integration between their offerings, and both companies remain committed to tightly coupling VMware vSphere with the NVIDIA AI Enterprise suite. This article discusses the new features introduced with the VMware vSphere 7 Update 2 release and the new NVIDIA AI Enterprise software suite.
The introduction of NVIDIA RDMA capabilities into vSphere for NVIDIA virtualized GPU (vGPU) allows deep learning training to scale out to multiple nodes with near-bare-metal performance, even for the largest training workloads.
RDMA technology is featured in NVIDIA ConnectX SmartNICs and BlueField DPUs and improves the bandwidth and latency when moving data directly between a network interface card (NIC) and GPU memory.
IT administrators can use the tools they are familiar with, like VMware vCenter, to provision multiple nodes as VMs. These VMs can be configured to use NVIDIA networking and vGPU resources for RDMA.
VMware’s integration with RDMA over Converged Ethernet (RoCE) accelerates AI and ML workloads. vSphere 7 Update 2 with NVIDIA AI Enterprise software supports RDMA with Address Translation Services (ATS) on Intel CPUs, which further optimizes the GPUDirect bandwidth between NIC and GPU so that throughput is not limited by PCIe bus speeds. As a result, a data scientist can iterate on new data and retrain many more times in a day, dramatically increasing productivity.
Now let’s look at the new VMware features that further enable deep learning inferencing workloads. vSphere 7 Update 2 supports the NVIDIA Ampere GPU architecture, including the NVIDIA A100 GPU. This GPU can be configured to use Multi-Instance GPU (MIG). This type of GPU partitioning can be particularly beneficial for inferencing workloads that do not fully saturate the GPU’s compute capacity and for use cases that require low-latency response and error isolation. The graph below illustrates the performance of natural language inference using virtualized GPUs enabled with MIG compared to virtualized CPU as well as bare metal.
Let’s look at an example of how a single NVIDIA A100 with MIG mode enabled can be used for multiple inferencing workloads with VMware vSphere. NVIDIA Triton Inference Server is an AI application framework included in the NVIDIA AI Enterprise suite. Available as a Docker container, it integrates with Kubernetes for orchestration and auto-scaling. This solution allows front-end client applications to submit inferencing requests to the AI inference cluster, which serves the models held in the AI model repository.
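To make the client side of this concrete, here is a minimal sketch of how a front-end application might assemble an inference request for Triton over HTTP. It assumes Triton's HTTP endpoint follows the KServe v2 inference protocol (`POST /v2/models/<model>/infer`). The helper name `build_infer_request`, the model name `object_detector`, and the input tensor name `input__0` are hypothetical and are not taken from the article. The code only builds the request and does not send it.

```python
import json


def build_infer_request(model_name, input_name, shape, data, datatype="FP32"):
    """Build the URL path and JSON body for a KServe v2 style inference call.

    All names passed in are illustrative; a real deployment defines its own
    model and tensor names in the model repository configuration.
    """
    # The flattened data must contain exactly as many elements as the shape implies.
    expected = 1
    for dim in shape:
        expected *= dim
    if len(data) != expected:
        raise ValueError(f"shape {tuple(shape)} needs {expected} values, got {len(data)}")

    path = f"/v2/models/{model_name}/infer"
    body = {
        "inputs": [
            {
                "name": input_name,
                "shape": list(shape),
                "datatype": datatype,
                "data": list(data),
            }
        ]
    }
    return path, json.dumps(body)


# Hypothetical request: one 2x2 single-channel image tile, flattened row by row.
path, body = build_infer_request("object_detector", "input__0", (1, 2, 2), [0.0, 0.1, 0.2, 0.3])
```

A client would then POST `body` to the server's HTTP port at `path`. Several clients or departments can issue such requests independently, which is the situation the use case below describes.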
Let’s look further at this use case. Suppose multiple end users or departments submit inference requests to perform object detection on satellite imagery. Within the AI model repository, there are pre-trained object detection models that detect the presence of multiple objects in the satellite imagery, such as buildings, trees, fire hydrants or well pads. A single NVIDIA A100 GPU can service these multiple inferencing requests by leveraging MIG spatial partitioning, thereby optimizing the utilization of a valuable and powerful GPU resource within the enterprise. The graph below illustrates the performance of ResNet-50 object detection inference using virtualized GPUs with MIG enabled compared to virtualized CPU only as well as bare metal.
Using Triton Inference Server with the added MIG support in vSphere 7.0 U2, the NVIDIA A100 40GB GPU can be partitioned into up to seven GPU slices. Each slice, or instance, has its own dedicated compute resources and runs in parallel with the others, with predictable throughput and latency. IT administrators use vCenter to assign a single MIG partition to a VM. Read VMware’s technical blog post for additional details, “Multiple Machine Learning Workloads Using GPUs: New Features in vSphere 7 Update 2”.
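The partitioning arithmetic can be sketched as a simple budget check. This assumes the A100 40GB exposes seven compute slices and eight 5 GB memory slices, and uses the MIG profile names and sizes as published in NVIDIA's MIG documentation (they are not stated in this article, so verify them against that documentation). The function `fits_on_a100_40gb` is a hypothetical helper. It checks only the slice totals and does not model the placement rules that the real hardware also enforces.

```python
# Profile name -> (compute slices, memory slices); each memory slice is 5 GB.
# Values are an assumption drawn from NVIDIA's A100 40GB MIG profile list.
MIG_PROFILES = {
    "1g.5gb": (1, 1),
    "2g.10gb": (2, 2),
    "3g.20gb": (3, 4),
    "4g.20gb": (4, 4),
    "7g.40gb": (7, 8),
}

MAX_COMPUTE_SLICES = 7
MAX_MEMORY_SLICES = 8


def fits_on_a100_40gb(requested):
    """Return True if the requested profiles fit within the slice budgets.

    This is a coarse check on totals only; actual MIG placement has
    additional constraints that are not modeled here.
    """
    compute = sum(MIG_PROFILES[name][0] for name in requested)
    memory = sum(MIG_PROFILES[name][1] for name in requested)
    return compute <= MAX_COMPUTE_SLICES and memory <= MAX_MEMORY_SLICES


# Seven small instances, one per VM, as in the inference use case.
seven_small = fits_on_a100_40gb(["1g.5gb"] * 7)
# An eighth instance exceeds the seven available compute slices.
eight_small = fits_on_a100_40gb(["1g.5gb"] * 8)
```

With these assumptions, a request for seven `1g.5gb` instances fits within the budget and a request for eight does not, which matches the limit of seven slices described above.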
As enterprises move toward AI and cloud computing, a new data center architecture is needed to enable both existing and modern applications. Accelerated servers can be added to the core enterprise data center and managed with standard tools like VMware vCenter. As a result of its close partnership with NVIDIA, VMware has brought new features to vSphere 7.0 U2 that provide low-latency response for ML/AI applications backed by vGPU in the enterprise.
Online Conference to Feature Jensen Huang Keynote and 1,300 Talks from Leaders in Data Center, Networking, Graphics and Autonomous Vehicles
SANTA CLARA, Calif., March 09, 2021 (GLOBE NEWSWIRE) …
An A-list of female AI researchers and industry executives will take the stage at next month’s GPU Technology Conference to share the latest breakthroughs in every industry imaginable. Recognized in Forbes as a top conference for women to attend to further their careers in AI, GTC runs online, April 12-16, and is set to draw …
The post Innovators, Researchers, Industry Leaders: Meet the Women Headlining at GTC appeared first on The Official NVIDIA Blog.
With GeForce NOW, over 5 million gamers are playing their favorite games in the cloud on PC, Mac, Chromebook, NVIDIA SHIELD TV, Android and iOS devices. With over 800 instantly available games and 80+ free-to-play games, there’s something for everyone. And there are multiple ways to build your library. We’ll review how to sync your …
The post How to Build Your Game Library in the Cloud appeared first on The Official NVIDIA Blog.
This April, learn about the future of AI-powered transportation from those who are building it. The NVIDIA GPU Technology Conference returns to the virtual stage April 12-16, featuring autonomous vehicle leaders in a range of talks, panels and virtual networking events. Attendees will also have access to hands-on training for self-driving development and other deep …
The post From Audi to Zoox: Autonomous Vehicle Innovators to Showcase Latest Breakthroughs at GTC 2021 appeared first on The Official NVIDIA Blog.
The autonomous trucking industry is about to get a major new addition. Self-driving truck company Plus announced that its upcoming autonomous vehicle platform will be built on NVIDIA DRIVE Orin. This software-defined system will continuously improve upon the safety and efficiency of the delivery and logistics industry with high-performance compute and AI algorithms that can …
The post A Plus for Autonomous Trucking: Startup to Build Next-Gen Self-Driving Platform with NVIDIA DRIVE Orin appeared first on The Official NVIDIA Blog.
Runs on VMware vSphere; Optimized, Certified and Supported by NVIDIA; Hundreds of Thousands of Customers in World’s Largest Industries Can Now Adopt NVIDIA AI Enterprise at Scale
SANTA CLARA, …
As enterprises modernize their data centers to power AI-driven applications and data science, NVIDIA and VMware are making it easier than ever to develop and deploy a multitude of different AI workloads in the modern hybrid cloud. The companies have teamed up to optimize the just-announced update to vSphere, VMware vSphere 7 Update 2 …
The post How Suite It Is: NVIDIA and VMware Deliver AI-Ready Enterprise Platform appeared first on The Official NVIDIA Blog.
Artists, this is your chance to push past creative limits, and win great prizes, while exploring NVIDIA Omniverse through a new design contest. Called “Create with Marbles,” the contest is set in Omniverse, the groundbreaking platform for virtual collaboration, creation and simulation, and based on the Marbles RTX demo that first previewed at …
The post Artists: Unleash Your Marble Arts in NVIDIA Omniverse Design Challenge appeared first on The Official NVIDIA Blog.