Categories
Misc

Migrating the Hub from Git LFS to Xet

Categories
Misc

Deadline Extended — Create a Project G-Assist Plug-In for a Chance to Win an NVIDIA GeForce RTX GPU and Laptop

Submissions for NVIDIA’s Plug and Play: Project G-Assist Plug-In Hackathon are due Sunday, July 20, at 11:59pm PT. RTX AI Garage offers all the tools and resources to help. The hackathon invites the community to expand the capabilities of Project G-Assist, an experimental AI assistant available through the NVIDIA App that helps users control and
Read Article

Categories
Misc

NVIDIA Dynamo Adds Support for AWS Services to Deliver Cost-Efficient Inference at Scale

Dynamo image.Amazon Web Services (AWS) developers and solution architects can now take advantage of NVIDIA Dynamo on NVIDIA GPU-based Amazon EC2, including Amazon EC2 P6…Dynamo image.

Amazon Web Services (AWS) developers and solution architects can now take advantage of NVIDIA Dynamo on NVIDIA GPU-based Amazon EC2, including Amazon EC2 P6 accelerated by NVIDIA Blackwell, with added support for Amazon Simple Storage (S3), in addition to existing integrations with Amazon Elastic Kubernetes Services (EKS) and AWS Elastic Fabric Adapter (EFA). This update unlocks a new level of…

Source

Categories
Misc

Accelerate AI Model Orchestration with NVIDIA Run:ai on AWS

When it comes to developing and deploying advanced AI models, access to scalable, efficient GPU infrastructure is critical. But managing this infrastructure…

When it comes to developing and deploying advanced AI models, access to scalable, efficient GPU infrastructure is critical. But managing this infrastructure across cloud-native, containerized environments can be complex and costly. That’s where NVIDIA Run:ai can help. NVIDIA Run:ai is now generally available on AWS Marketplace, making it even easier for organizations to streamline their AI…

Source

Categories
Misc

NVIDIA CEO Jensen Huang Promotes AI in Washington, DC and China

This month, NVIDIA founder and CEO Jensen Huang promoted AI in both Washington, D.C. and Beijing — emphasizing the benefits that AI will bring to business and society worldwide.  In the U.S. capital, Huang met with President Trump and U.S. policymakers, reaffirming NVIDIA’s support for the Administration’s effort to create jobs, strengthen domestic AI infrastructure and onshore
Read Article

Categories
Misc

Enabling Fast Inference and Resilient Training with NCCL 2.27

As AI workloads scale, fast and reliable GPU communication becomes vital, not just for training, but increasingly for inference at scale. The NVIDIA Collective…

As AI workloads scale, fast and reliable GPU communication becomes vital, not just for training, but increasingly for inference at scale. The NVIDIA Collective Communications Library (NCCL) delivers high-performance, topology-aware collective operations: , , , , and optimized for NVIDIA GPUs and a variety of interconnects including PCIe, NVLink, Ethernet (RoCE), and InfiniBand (IB).

Source

Categories
Misc

Upcoming Livestream: Techniques for Building High-Performance RAG Applications

Discover leaderboard-winning RAG techniques, integration strategies, and deployment best practices.

Discover leaderboard-winning RAG techniques, integration strategies, and deployment best practices.

Source

Categories
Misc

NCCL Deep Dive: Cross Data Center Communication and Network Topology Awareness

As the scale of AI training increases, a single data center (DC) is not sufficient to deliver the required computational power. Most recent approaches to…

As the scale of AI training increases, a single data center (DC) is not sufficient to deliver the required computational power. Most recent approaches to address this challenge rely on multiple data centers being co-located or geographically distributed. In a recently open-sourced feature, the NVIDIA Collective Communication Library (NCCL) is now able to communicate across multiple data centers…

Source

Categories
Misc

Just Released: NVDIA Run:ai 2.22

NVDIA Run:ai 2.22 is now here. It brings advanced inference capabilities, smarter workload management, and more controls.

NVDIA Run:ai 2.22 is now here. It brings advanced inference capabilities, smarter workload management, and more controls.

Source

Categories
Misc

Enhancing Multilingual Human-Like Speech and Voice Cloning with NVIDIA Riva TTS

While speech AI is used to build digital assistants and voice agents, its impact extends far beyond these applications. Core technologies like text-to-speech…

While speech AI is used to build digital assistants and voice agents, its impact extends far beyond these applications. Core technologies like text-to-speech (TTS) and automatic speech recognition (ASR) are driving innovation across industries. They’re enabling real-time translation, powering interactive digital humans, and even helping restore speech for individuals who’ve lost their voices.

Source