Top‑ranked on the HuggingFace Open‑ASR leaderboard, the model is production‑ready.
As large language models (LLMs) power more agentic systems capable of performing autonomous actions, tool use, and reasoning, enterprises are drawn to their flexibility and low inference costs. This growing autonomy elevates risks, introducing goal misalignment, prompt injection, unintended behaviors, and reduced human oversight, making the incorporation of robust safety measures paramount.
In our previous post, we introduced the setup of predictive modeling in chip manufacturing and operations, highlighting common challenges such as imbalanced datasets and the need for more nuanced evaluation metrics. We also explored how NVIDIA CUDA-X data science libraries, such as cuDF and cuML, can help overcome these challenges and accelerate machine learning workflows. In this blog…
Listen up citizens, the law is back and patrolling the cloud. Nacon’s RoboCop Rogue City — Unfinished Business launches today in the cloud, bringing justice to every device, everywhere. Log in, lock and load, and take charge with the ten games heading to the cloud this week. Plus, Cyberpunk 2077’s 2.3 update amps up the
By 2030, the World Health Organization projects a global shortage of over 15 million healthcare workers, including surgeons, radiologists, and nurses. In the US, the shortfall could reach 124,000 physicians by 2034. Rising demand, aging populations, and limited human capacity are pushing healthcare systems to a breaking point. AI-enabled robotic systems offer a path forward.
A major challenge in robotics is training robots to perform new tasks without the massive effort of collecting and labeling datasets for every new task and environment. Recent research efforts from NVIDIA aim to solve this challenge through the use of generative AI, world foundation models (WFMs) like NVIDIA Cosmos, and data generation blueprints such as NVIDIA Isaac GR00T-Mimic and GR00T-Dreams.
In the era of generative AI, utilizing GPUs to their maximum potential is essential to training better models and serving users at scale. Often, these models have layers that cannot be expressed as off-the-shelf library operations due to subtle modifications, and DL compilers typically forgo the last few percentage points of optimizations to make their deployment feasible.
GEMM optimization on GPUs is a modular problem. Performant implementations need to specify hyperparameters such as tile shapes, math and copy instructions, and…