DataBloom - Part 10

Misc

R²D²: Training Generalist Robots with NVIDIA Research Workflows and World Foundation Models

Post author By
Post date July 16, 2025
No Comments on R²D²: Training Generalist Robots with NVIDIA Research Workflows and World Foundation Models

Robot GIF. A major challenge in robotics is training robots to perform new tasks without the massive effort of collecting and labeling datasets for every new task and…

A major challenge in robotics is training robots to perform new tasks without the massive effort of collecting and labeling datasets for every new task and environment. Recent research efforts from NVIDIA aim to solve this challenge through the use of generative AI, world foundation models (WFMs) like NVIDIA Cosmos, and data generation blueprints such as NVIDIA Isaac GR00T-Mimic and GR00T-Dreams.

Source

Misc

CUTLASS: Principled Abstractions for Handling Multidimensional Data Through Tensors and Spatial Microkernels

Post author By
Post date July 16, 2025
No Comments on CUTLASS: Principled Abstractions for Handling Multidimensional Data Through Tensors and Spatial Microkernels

In the era of generative AI, utilizing GPUs to their maximum potential is essential to training better models and serving users at scale. Often, these models…

In the era of generative AI, utilizing GPUs to their maximum potential is essential to training better models and serving users at scale. Often, these models have layers that cannot be expressed as off-the-shelf library operations due to subtle modifications, and DL compilers typically forgo the last few percentage points of optimizations to make their deployment feasible.

Source

Misc

CUTLASS 3.x: Orthogonal, Reusable, and Composable Abstractions for GEMM Kernel Design

Post author By
Post date July 16, 2025
No Comments on CUTLASS 3.x: Orthogonal, Reusable, and Composable Abstractions for GEMM Kernel Design

GEMM optimization on GPUs is a modular problem. Performant implementations need to specify hyperparameters such as tile shapes, math and copy instructions, and…

Source

Misc

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

Post author By
Post date July 16, 2025
No Comments on Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

Misc

Migrating the Hub from Git LFS to Xet

Post author By
Post date July 15, 2025
No Comments on Migrating the Hub from Git LFS to Xet

Misc

Deadline Extended — Create a Project G-Assist Plug-In for a Chance to Win an NVIDIA GeForce RTX GPU and Laptop

Post author By
Post date July 15, 2025
No Comments on Deadline Extended — Create a Project G-Assist Plug-In for a Chance to Win an NVIDIA GeForce RTX GPU and Laptop

Submissions for NVIDIA’s Plug and Play: Project G-Assist Plug-In Hackathon are due Sunday, July 20, at 11:59pm PT. RTX AI Garage offers all the tools and resources to help. The hackathon invites the community to expand the capabilities of Project G-Assist, an experimental AI assistant available through the NVIDIA App that helps users control and
Read Article

Misc

NVIDIA Dynamo Adds Support for AWS Services to Deliver Cost-Efficient Inference at Scale

Post author By
Post date July 15, 2025
No Comments on NVIDIA Dynamo Adds Support for AWS Services to Deliver Cost-Efficient Inference at Scale

Dynamo image. Amazon Web Services (AWS) developers and solution architects can now take advantage of NVIDIA Dynamo on NVIDIA GPU-based Amazon EC2, including Amazon EC2 P6…

Amazon Web Services (AWS) developers and solution architects can now take advantage of NVIDIA Dynamo on NVIDIA GPU-based Amazon EC2, including Amazon EC2 P6 accelerated by NVIDIA Blackwell, with added support for Amazon Simple Storage (S3), in addition to existing integrations with Amazon Elastic Kubernetes Services (EKS) and AWS Elastic Fabric Adapter (EFA). This update unlocks a new level of…

Source

Misc

Accelerate AI Model Orchestration with NVIDIA Run:ai on AWS

Post author By
Post date July 15, 2025
No Comments on Accelerate AI Model Orchestration with NVIDIA Run:ai on AWS

When it comes to developing and deploying advanced AI models, access to scalable, efficient GPU infrastructure is critical. But managing this infrastructure…

When it comes to developing and deploying advanced AI models, access to scalable, efficient GPU infrastructure is critical. But managing this infrastructure across cloud-native, containerized environments can be complex and costly. That’s where NVIDIA Run:ai can help. NVIDIA Run:ai is now generally available on AWS Marketplace, making it even easier for organizations to streamline their AI…

Source

Misc

NVIDIA CEO Jensen Huang Promotes AI in Washington, DC and China

Post author By
Post date July 14, 2025
No Comments on NVIDIA CEO Jensen Huang Promotes AI in Washington, DC and China

This month, NVIDIA founder and CEO Jensen Huang promoted AI in both Washington, D.C. and Beijing — emphasizing the benefits that AI will bring to business and society worldwide. In the U.S. capital, Huang met with President Trump and U.S. policymakers, reaffirming NVIDIA’s support for the Administration’s effort to create jobs, strengthen domestic AI infrastructure and onshore
Read Article

Misc

Enabling Fast Inference and Resilient Training with NCCL 2.27

Post author By
Post date July 14, 2025
No Comments on Enabling Fast Inference and Resilient Training with NCCL 2.27

As AI workloads scale, fast and reliable GPU communication becomes vital, not just for training, but increasingly for inference at scale. The NVIDIA Collective…

As AI workloads scale, fast and reliable GPU communication becomes vital, not just for training, but increasingly for inference at scale. The NVIDIA Collective Communications Library (NCCL) delivers high-performance, topology-aware collective operations: , , , , and optimized for NVIDIA GPUs and a variety of interconnects including PCIe, NVLink, Ethernet (RoCE), and InfiniBand (IB).

Source