Categories
Misc

Consilium: When Multiple LLMs Collaborate

Categories
Misc

Optimizing for Low-Latency Communication in Inference Workloads with JAX and XLA

Running inference with large language models (LLMs) in production requires meeting stringent latency constraints. A critical stage in the process is LLM decode,…

Source

Categories
Misc

3 pandas Workflows That Slowed to a Crawl on Large Datasets—Until We Turned on GPUs

If you work with pandas, you’ve probably hit the wall. It’s that moment when your trusty workflow, so elegant on smaller datasets, grinds to a halt on a…

If you work with pandas, you’ve probably hit the wall. It’s that moment when your trusty workflow, so elegant on smaller datasets, grinds to a halt on a large one. A script that once took seconds now crawls for minutes. Your next steps are predictable and frustrating. You might downsample your data and lose fidelity, rewrite your logic to process data in chunks, or face the daunting task of…

Source

Categories
Misc

Arc Virtual Cell Challenge: A Primer

Categories
Misc

Hackathon Winners Bring Agentic AI to Life with the NVIDIA NeMo Agent Toolkit

Decorative image.The best way to learn a new toolkit is to build something real, and that’s exactly what developers did at the recent NVIDIA NeMo Agent Toolkit Hackathon. Over…Decorative image.

The best way to learn a new toolkit is to build something real, and that’s exactly what developers did at the recent NVIDIA NeMo Agent Toolkit Hackathon. Over two weeks, participants across skill levels—from students to seasoned professionals—experimented, prototyped, and created intelligent multi-agent AI workflows using the open-source NeMo Agent toolkit (formerly known as the AIQ toolkit).

Source

Categories
Misc

Isambard-AI, the UK’s Most Powerful AI Supercomputer, Goes Live

The University of Bristol’s Isambard-AI, powered by NVIDIA Grace Hopper Superchips, delivers 21 exaflops of AI performance, making it the fastest system in the U.K. and among the most energy-efficient globally.

Categories
Misc

NVIDIA Canary‑Qwen‑2.5B: Open‑Source ASR/LLM for Superior Transcription and Summarization

Top‑ranked on the HuggingFace Open‑ASR leaderboard, the model is production‑ready.

Top‑ranked on the HuggingFace Open‑ASR leaderboard, the model is production‑ready.

Source

Categories
Misc

Safeguard Agentic AI Systems with the NVIDIA Safety Recipe

As large language models (LLMs) power more agentic systems capable of performing autonomous actions, tool use, and reasoning, enterprises are drawn to their…

As large language models (LLMs) power more agentic systems capable of performing autonomous actions, tool use, and reasoning, enterprises are drawn to their flexibility and low inference costs. This growing autonomy elevates risks, introducing goal misalignment, prompt injection, unintended behaviors, and reduced human oversight, making the incorporation of robust safety measures paramount.

Source

Categories
Misc

Feature Engineering at Scale: Optimizing ML Models in Semiconductor Manufacturing with NVIDIA CUDA‑X Data Science

In our previous post, we introduced the setup of predictive modeling in chip manufacturing and operations, highlighting common challenges such as imbalanced…

In our previous post, we introduced the setup of predictive modeling in chip manufacturing and operations, highlighting common challenges such as imbalanced datasets and the need for more nuanced evaluation metrics. We also explored how NVIDIA CUDA-X data science libraries—like cuDF and cuML—can help overcome these challenges and accelerate machine learning workflows.In this blog…

Source

Categories
Misc

Back to The Future: Evaluating AI Agents on Predicting Future Events