Categories
Misc

Train a Reasoning-Capable LLM in One Weekend with NVIDIA NeMo

Have you ever wanted to build your own reasoning model but thought it was too complicated or required massive resources? Think again. With NVIDIA’s powerful…

Have you ever wanted to build your own reasoning model but thought it was too complicated or required massive resources? Think again. With NVIDIA’s powerful tools and datasets, you can train a small, effective reasoning model in about 48 hours, all on a single GPU. Even better, we’ve made all the code available to you to get started right away. Let’s dive in.

Source

Categories
Misc

Understanding NCCL Tuning to Accelerate GPU-to-GPU Communication

The NVIDIA Collective Communications Library (NCCL) is essential for fast GPU-to-GPU communication in AI workloads, using various optimizations and tuning to…

The NVIDIA Collective Communications Library (NCCL) is essential for fast GPU-to-GPU communication in AI workloads, using various optimizations and tuning to boost performance. However, as platforms diversify, default NCCL settings may not always deliver optimal results. This post discusses why tuning is important and how users can enhance performance with custom tuner plugins. It also presents a…

Source

Categories
Misc

AI On: How Financial Services Companies Use Agentic AI to Enhance Productivity, Efficiency and Security

Editor’s note: This post is part of the AI On blog series, which explores the latest techniques and real-world applications of agentic AI, chatbots and copilots. The series also highlights the NVIDIA software and hardware powering advanced AI agents, which form the foundation of AI query engines that gather insights and perform tasks to transform
Read Article

Categories
Misc

Building Robotic Mental Models with NVIDIA Warp and Gaussian Splatting

A decorative GIF.This post explores a promising direction for building dynamic digital representations of the physical world, a topic gaining increasing attention in recent…A decorative GIF.

This post explores a promising direction for building dynamic digital representations of the physical world, a topic gaining increasing attention in recent research. We introduce an approach for constructing a digital twin in a robotic setting that stays continuously synchronized with the real world in real time. Such a twin can provide rich state information that supports and enhances a wide…

Source

Categories
Misc

Kimi-K2-Instruct Now Available as NVIDIA NIM

Try the new 1T-parameter open source MoE LLM today.

Try the new 1T-parameter open source MoE LLM today.

Source

Categories
Misc

Traditional RAG vs. Agentic RAG—Why AI Agents Need Dynamic Knowledge to Get Smarter

Ever relied on an old GPS that didn’t know about the new highway bypass, or a sudden road closure? It might get you to your destination, but not in the most…

Ever relied on an old GPS that didn’t know about the new highway bypass, or a sudden road closure? It might get you to your destination, but not in the most efficient or accurate way. AI agents face a similar challenge: they often rely on static training data. This data is fixed at a point in time—while it was current when created, it can quickly become outdated. This limitation can cause…

Source

Categories
Misc

Automating Network Design in NVIDIA Air with Ansible and Git

Black and white topology of connected nodes in NVIDIA Air.At its core, NVIDIA Air is built for automation. Every part of your network can be coded, versioned, and set to trigger automatically. This includes creating…Black and white topology of connected nodes in NVIDIA Air.

At its core, NVIDIA Air is built for automation. Every part of your network can be coded, versioned, and set to trigger automatically. This includes creating the topology, configuring the network, and validating its setup. Automation reduces manual error, speeds up testing, and brings the same rigor to networking that modern DevOps teams apply to software development. Let’s discuss the basic…

Source

Categories
Misc

Consilium: When Multiple LLMs Collaborate

Categories
Misc

Optimizing for Low-Latency Communication in Inference Workloads with JAX and XLA

Running inference with large language models (LLMs) in production requires meeting stringent latency constraints. A critical stage in the process is LLM decode,…

Source

Categories
Misc

3 pandas Workflows That Slowed to a Crawl on Large Datasets—Until We Turned on GPUs

If you work with pandas, you’ve probably hit the wall. It’s that moment when your trusty workflow, so elegant on smaller datasets, grinds to a halt on a…

If you work with pandas, you’ve probably hit the wall. It’s that moment when your trusty workflow, so elegant on smaller datasets, grinds to a halt on a large one. A script that once took seconds now crawls for minutes. Your next steps are predictable and frustrating. You might downsample your data and lose fidelity, rewrite your logic to process data in chunks, or face the daunting task of…

Source