GeForce NOW is turning up the heat this summer with a hot new deal. For a limited time, save 40% on six-month Performance memberships and enjoy premium GeForce RTX-powered gaming for half a year. Members can jump into all the action this summer, whether traveling or staying cool at home. Eleven new games join the
Read Article
The introduction of the llm-d community at Red Hat Summit 2025 marks a significant step forward in accelerating generative AI inference innovation for the open…
The introduction of the llm-d community at Red Hat Summit 2025 marks a significant step forward in accelerating generative AI inference innovation for the open source ecosystem. Built on top of vLLM and Inference Gateway, llm-d extends the capabilities of vLLM with Kubernetes-native architecture for large-scale inference deployments. This post explains key NVIDIA Dynamo components that…
Just Released: NVIDIA HPC SDK v25.5
The new release includes support for CUDA 12.9, updated library components, and performance improvements.
The new release includes support for CUDA 12.9, updated library components, and performance improvements.
As robots increasingly make their way to the largest enterprises’ manufacturing plants and warehouses, the need for access to critical business and operational data has never been more crucial. At its Sapphire conference, SAP announced it is collaborating with NEURA Robotics and NVIDIA to enable its SAP Joule agents to connect enterprise data and processes
Read Article
Exploring Quantization Backends in Diffusers
Master AI with Google Cloud & NVIDA. Access an exclusive community, resources, and rewards.
Master AI with Google Cloud & NVIDA. Access an exclusive community, resources, and rewards.
Assembly of multiple parts plays a critical role across nearly every major industry such as manufacturing, automotive, aerospace, electronics, and medical…
Assembly of multiple parts plays a critical role across nearly every major industry such as manufacturing, automotive, aerospace, electronics, and medical devices. Despite its widespread use, robotic assembly continues to be a significant challenge. It involves complex interactions where robots must manipulate objects through continuous physical contact, requiring a high level of precision and…
At NVIDIA GTC 2025, we announced NVIDIA Dynamo, a high-throughput, low-latency open-source inference serving framework for deploying generative AI and reasoning…
At NVIDIA GTC 2025, we announced NVIDIA Dynamo, a high-throughput, low-latency open-source inference serving framework for deploying generative AI and reasoning models in large-scale distributed environments. The latest v0.2 release of Dynamo includes: In this post, we’ll walk through these features and how they can help you get more out of your GPU investments.
The exponential growth of AI workloads is increasing data center power demands. Traditional 54 V in-rack power distribution, designed for kilowatt (KW)-scale…
The exponential growth of AI workloads is increasing data center power demands. Traditional 54 V in-rack power distribution, designed for kilowatt (KW)-scale racks, isn’t designed to support the megawatt (MW)-scale racks coming soon to modern AI factories. NVIDIA is leading the transition to 800 V HVDC data center power infrastructure to support 1 MW IT racks and beyond, starting in 2027.