Categories
Misc

NVIDIA GTC to Feature CEO Jensen Huang Keynote Announcing New AI and Metaverse Technologies, 200+ Sessions With Top Tech, Business Execs

Deep Learning Pioneers Yoshua Bengio, Geoff Hinton, Yann LeCun Among the Scores of Industry Experts to Present at World’s Premier AI Conference, Sept. 19-22. SANTA CLARA, Calif., Aug. 15, 2022 …

Categories
Misc

From Sapling to Forest: Five Sustainability and Employment Initiatives We’re Nurturing in India

For over a decade, NVIDIA has invested in social causes and communities in India as part of our commitment to corporate social responsibility. Bolstering those efforts, we’re unveiling this year’s investments in five projects that have been selected by the NVIDIA Foundation team, focused on the areas of environmental conservation, ecological restoration, social innovation and job …


Categories
Offsites

Rax: Composable Learning-to-Rank Using JAX

Ranking is a core problem across a variety of domains, such as search engines, recommendation systems, or question answering. As such, researchers often utilize learning-to-rank (LTR), a set of supervised machine learning techniques that optimize for the utility of an entire list of items (rather than a single item at a time). A noticeable recent focus is on combining LTR with deep learning. Existing libraries, most notably TF-Ranking, offer researchers and practitioners the necessary tools to use LTR in their work. However, none of the existing LTR libraries work natively with JAX, a new machine learning framework that provides an extensible system of function transformations that compose: automatic differentiation, JIT-compilation to GPU/TPU devices and more.

Today, we are excited to introduce Rax, a library for LTR in the JAX ecosystem. Rax brings decades of LTR research to the JAX ecosystem, making it possible to apply JAX to a variety of ranking problems and combine ranking techniques with recent advances in deep learning built upon JAX (e.g., T5X). Rax provides state-of-the-art ranking losses, a number of standard ranking metrics, and a set of function transformations to enable ranking metric optimization. All this functionality is provided with a well-documented and easy to use API that will look and feel familiar to JAX users. Please check out our paper for more technical details.

Learning-to-Rank Using Rax
Rax is designed to solve LTR problems. To this end, Rax provides loss and metric functions that operate on batches of lists, not batches of individual data points as is common in other machine learning problems. An example of such a list is the multiple potential results from a search engine query. The figure below illustrates how tools from Rax can be used to train neural networks on ranking tasks. In this example, the green items (B, F) are very relevant, the yellow items (C, E) are somewhat relevant and the red items (A, D) are not relevant. A neural network is used to predict a relevancy score for each item, then these items are sorted by these scores to produce a ranking. A Rax ranking loss incorporates the entire list of scores to optimize the neural network, improving the overall ranking of the items. After several iterations of stochastic gradient descent, the neural network learns to score the items such that the resulting ranking is optimal: relevant items are placed at the top of the list and non-relevant items at the bottom.

Using Rax to optimize a neural network for a ranking task. The green items (B, F) are very relevant, the yellow items (C, E) are somewhat relevant and the red items (A, D) are not relevant.

Approximate Metric Optimization
The quality of a ranking is commonly evaluated using ranking metrics, e.g., the normalized discounted cumulative gain (NDCG). An important objective of LTR is to optimize a neural network so that it scores highly on ranking metrics. However, ranking metrics like NDCG can present challenges because they are often discontinuous and flat, so stochastic gradient descent cannot directly be applied to these metrics. Rax provides state-of-the-art approximation techniques that make it possible to produce differentiable surrogates to ranking metrics that permit optimization via gradient descent. The figure below illustrates the use of rax.approx_t12n, a function transformation unique to Rax, which allows for the NDCG metric to be transformed into an approximate and differentiable form.

Using an approximation technique from Rax to transform the NDCG ranking metric into a differentiable and optimizable ranking loss (approx_t12n and gumbel_t12n).

First, notice how the NDCG metric (in green) is flat and discontinuous, making it hard to optimize using stochastic gradient descent. By applying the rax.approx_t12n transformation to the metric, we obtain ApproxNDCG, an approximate metric that is now differentiable with well-defined gradients (in red). However, it potentially has many local optima — points where the loss is locally optimal, but not globally optimal — in which the training process can get stuck. When the loss encounters such a local optimum, training procedures like stochastic gradient descent will have difficulty improving the neural network further.

To overcome this, we can obtain the gumbel-version of ApproxNDCG by using the rax.gumbel_t12n transformation. This gumbel version introduces noise in the ranking scores which causes the loss to sample many different rankings that may incur a non-zero cost (in blue). This stochastic treatment may help the loss escape local optima and often is a better choice when training a neural network on a ranking metric. Rax, by design, allows the approximate and gumbel transformations to be freely used with all metrics that are offered by the library, including metrics with a top-k cutoff value, like recall or precision. In fact, it is even possible to implement your own metrics and transform them to obtain gumbel-approximate versions that permit optimization without any extra effort.

Ranking in the JAX Ecosystem
Rax is designed to integrate well in the JAX ecosystem and we prioritize interoperability with other JAX-based libraries. For example, a common workflow for researchers that use JAX is to use TensorFlow Datasets to load a dataset, Flax to build a neural network, and Optax to optimize the parameters of the network. Each of these libraries composes well with the others and the composition of these tools is what makes working with JAX both flexible and powerful. For researchers and practitioners of ranking systems, the JAX ecosystem was previously missing LTR functionality, and Rax fills this gap by providing a collection of ranking losses and metrics. We have carefully constructed Rax to function natively with standard JAX transformations such as jax.jit and jax.grad and various libraries like Flax and Optax. This means that users can freely use their favorite JAX and Rax tools together.

Ranking with T5
While giant language models such as T5 have shown great performance on natural language tasks, how to leverage ranking losses to improve their performance on ranking tasks, such as search or question answering, is under-explored. With Rax, it is possible to tap this potential: Rax is written as a JAX-first library, making it easy to integrate with other JAX libraries. Since T5X is an implementation of T5 in the JAX ecosystem, Rax works with it seamlessly.

To this end, we have an example that demonstrates how Rax can be used in T5X. By incorporating ranking losses and metrics, it is now possible to fine-tune T5 for ranking problems, and our results indicate that enhancing T5 with ranking losses can offer significant performance improvements. For example, on the MS-MARCO QNA v2.1 benchmark we are able to achieve a +1.2% NDCG and +1.7% MRR by fine-tuning a T5-Base model using the Rax listwise softmax cross-entropy loss instead of a pointwise sigmoid cross-entropy loss.

Fine-tuning a T5-Base model on MS-MARCO QNA v2.1 with a ranking loss (softmax, in blue) versus a non-ranking loss (pointwise sigmoid, in red).

Conclusion
Overall, Rax is a new addition to the growing ecosystem of JAX libraries. Rax is entirely open source and available to everyone at github.com/google/rax. More technical details can also be found in our paper. We encourage everyone to explore the examples included in the GitHub repository: (1) optimizing a neural network with Flax and Optax, (2) comparing different approximate metric optimization techniques, and (3) integrating Rax with T5X.

Acknowledgements
Many collaborators within Google made this project possible: Xuanhui Wang, Zhen Qin, Le Yan, Rama Kumar Pasumarthi, Michael Bendersky, Marc Najork, Fernando Diaz, Ryan Doherty, Afroz Mohiuddin, and Samer Hassan.

Categories
Misc

Get Hands-On Training from NVIDIA Experts at GTC

What if you could spend 8 hours with an AI legend while getting hands-on experience using some of the most advanced GPU and DPU technology available?

As part of the upcoming GPU Technical Conference, the NVIDIA Deep Learning Institute (DLI) is offering 20 full-day workshops covering a range of deep learning, data science, and accelerated computing topics. In each workshop, you are given access to a fully configured, GPU-accelerated server in the cloud. You gain experience building and deploying an end-to-end project using industry-standard software, tools, and frameworks while learning from some of the most experienced AI practitioners in the industry.

DLI workshops are currently $99 until August 29, and $149 as of August 30. Register now!

All workshops are created and taught by NVIDIA experts. Here are three who are teaching DLI workshops at GTC:

  • Bob Crovella (USA)
  • Adam Grzywaczewski (England)
  • Gwangsoo Hong (Korea)

Bob Crovella, NVIDIA solution architect (USA)  

Photo of Bob Crovella
Bob Crovella

Bob has been a solution architect and field application engineer in the areas of scientific simulation, HPC, and deep learning for almost 25 years at NVIDIA. He and his teams have helped hundreds of customers and partners figure out how to leverage the capabilities of accelerated computing to solve some of the world’s most difficult problems.

After NVIDIA introduced CUDA in 2007, Bob was one of the first to train customers and partners on how to unlock the power of GPUs and has since become one of the leading experts on parallel computing architecture.   

“It’s breathtaking. When I first learned to program CUDA, I was amazed by what the machine is capable of and the power you can unlock with your code. You witness something speed up dramatically, like 10X or 100X faster. And suddenly you have this realization that this thing is everything they said it was. I know this is kind of geeky, right? But through my work and teaching DLI, I get to give that same opportunity to others to experience that kind of excitement—to program the most powerful piece of processing hardware on the planet. It’s not an experience that everyone gets.” 

Bob earned a BS in electrical and electronics engineering from the University at Buffalo and an MS in electrical engineering from Rensselaer Polytechnic Institute. He is currently certified to teach four DLI courses.

At GTC, Bob will be teaching Scaling CUDA C++ Applications to Multiple Nodes on Monday, September 19 from 9 AM to 5 PM PDT. “This is one of the most advanced CUDA programming classes that we offer. We help students take GPU programming to the next level: using multiple computers in a cluster to solve bigger and bigger problems.”

Adam Grzywaczewski, NVIDIA senior deep learning solution architect (England)

Photo of Adam Grzywaczewski
Adam Grzywaczewski

Adam is a senior deep learning solution architect at NVIDIA. Over the last 5 years, he has worked with hundreds of customers helping them design and implement AI solutions. He specializes in large training workloads and neural networks that require hundreds of GPUs to train and run.

“When I first started at NVIDIA, DGX was new. In fact, I have the first prototype of a DGX Station here under my desk. Over time, I have seen customers systematically start to migrate to intensive work on very large systems, very large training jobs, and a surprisingly large number of inference workloads. We are seeing customers have a lot of very serious conversations and engineering work around deployment to production.”

Adam has co-authored two DLI workshops, is certified in six workshops, and has taught the most workshops among the EMEA solution architects in the past year.    

“Our workshops are very focused, and they are designed with a very pragmatic attitude—to solve the problems that they are advertised to solve. We distill huge amounts of knowledge into each course, information that doesn’t exist in such a distilled format anywhere else. And you get direct access to fully configured GPUs. In a course that we just released, the student starts the training process of an extremely large language model and then deploys that model to production. And with just a couple of clicks, teaches that model how to translate and how to answer the questions. It’s actually quite empowering.”

Adam received his BS in information retrieval from Coventry University, his MS in computer science from the Silesian University of Technology, and his Ph. D. from Coventry University.

At GTC, Adam will be teaching Model Parallelism: Building and Deploying Large Neural Networks.

Gwangsoo Hong, NVIDIA solution architect (Korea)

Photo of Gwangsoo Hong
Gwangsoo Hong

Gwangsoo has been a solution architect with NVIDIA for almost 4 years. His current responsibilities include helping customers get the most value out of their NVIDIA full-stack platform. He specializes in computer vision and NLP with deep learning with expertise in GPU acceleration for large-scale models. He is certified in eight DLI workshops and is one of our most sought-after instructors in Korea.

“The part I love the most about being a DLI instructor is working with various students and teaching them about end-to-end deep learning workloads like training, inference, and services; helping them learn about different workloads and application domains; and materializing their ideas. It’s also rewarding to teach students of all backgrounds and ages and see them successfully complete the DLI course. I learn something from each of them. The reaction I get most often from my students is, ‘This can’t be.’”

At GTC, he will also be teaching Model Parallelism: Building and Deploying Large Neural Networks on Wednesday, September 21 from 9 AM to 6 PM KST.

Register now for early discounts

Don’t miss this unique opportunity to take your AI skills to the next level. Registration for the conference is free, and the DLI workshops are offered at a special price of $149, or $99 if you register by August 29 (normally $500 per seat).

For the complete list, see GTC Workshops & Training. Some workshops are available in Taiwanese, Korean, and Japanese and are scheduled in those respective time zones.

Categories
Misc

New NVIDIA Neural Graphics SDKs Make Metaverse Content Creation Available to All

A dozen tools and programs—including new releases NeuralVDB and Kaolin Wisp—make 3D content creation easy and fast for millions of designers and creators.

Categories
Misc

Upcoming Webinar: Designing Efficient Vision Transformer Networks for Autonomous Vehicles

Explore design principles for efficient transformers in production and how innovative model design can help achieve better accuracy in AV perception.

Categories
Misc

Top Israel Medical Center Partners with AI Startups to Help Detect Brain Bleeds, Other Critical Cases

Israel’s largest private medical center is working with startups and researchers to bring potentially life-saving AI solutions to real-world healthcare workflows. With more than 1.5 million patients across eight medical centers, Assuta Medical Centers conduct over 100,000 surgeries, 800,000 imaging tests and hundreds of thousands of other health diagnostics and treatments each year. These create …


Categories
Misc

GFN Thursday Brings Thunder to the Cloud With ‘Rumbleverse’ Arriving on GeForce NOW

It’s time to rumble in Grapital City with Rumbleverse launching today on GeForce NOW. Punch your way into the all-new, free-to-play Brawler Royale from Iron Galaxy Studios and Epic Games Publishing, streaming from the cloud to nearly all devices. That means gamers can tackle, uppercut, body slam and more from any GeForce NOW-compatible device, including …


Categories
Misc

New Releases of NVIDIA Nsight Systems and Nsight Graphics Debut at SIGGRAPH 2022

Graphics professionals and researchers have come together at SIGGRAPH 2022 to share their expertise and learn about recent innovations in the computer graphics industry.

NVIDIA Developer Tools is excited to be a part of this year’s event, hosting the hands-on lab Using Nsight to Optimize Ray-Tracing Applications, and announcing new releases for NVIDIA Nsight Systems and NVIDIA Nsight Graphics that are available for download now.

NVIDIA Nsight Systems 2022.3

The new 2022.3 release of Nsight Systems brings expanded Vulkan API support alongside improvements to the user experience.

Nsight Systems now supports Vulkan Video, the Vulkan solution for processing hardware-accelerated video files. In previous versions of Nsight Systems, a Vulkan Video workload would not be identified as a subset of the larger queue command it occupied. 

With full integration in Nsight Systems 2022.3, Vulkan Video coding ambiguity is removed and the process can be profiled in the timeline. 

Screenshot showing that Vulkan Video workload can be identified in the Nsight System timeline below the Vulkan tab
Figure 1. Vulkan Video workload can be identified in the Nsight System timeline below the Vulkan tab

With the new VK_KHR_graphics_pipeline_library extension, Vulkan applications can now precompile shaders and link them at runtime at a substantially reduced cost. This is a critical feature for shader-heavy applications such as games, making its full support an exciting addition to Nsight Systems 2022.3.

To round out the new version, visual improvements to multi-report viewing have been made for better clarity. For Linux machines, improved counters for the CPU, PMU, and OS make system-wide performance tracing more precise. A host of bug fixes accompany these updates.

Learn more about Nsight Systems 2022.3.

NVIDIA Nsight Graphics 2022.4

Nsight Graphics 2022.4 introduces a robust set of upgrades to its most powerful profiling tools.

In the 2022.4 release, the API inspector has been redesigned. The new design includes an improved display, search functions within API Inspector pages, significantly enhanced constant buffer views, and data export for data persistence and offline comparison.

Watch the updated demonstration video (below) from the Nsight Graphics team to learn about all the new features and improved interface:

Video 1. A demonstration of the new Nsight Graphics features and improved interface

Nsight Graphics GPU Trace is a detailed performance timeline that tracks GPU throughput, enabling meticulous profiling of hardware utilization. To aid the work of graphics development across all specifications, GPU Trace now supports generating trace and analysis for OpenGL applications on Windows and Linux.

Screenshot showing full GPU utilization timeline for an OpenGL application captured by NVIDIA Nsight Graphics
Figure 2. Full GPU utilization timeline for an OpenGL application captured by NVIDIA Nsight Graphics

Also new to GPU Trace, you can now identify subchannel switches with an event overlay. Subchannel switches occur when the GPU swaps between Compute or Graphics calls in the same hardware queue, causing the GPU to briefly idle. In the interest of performance, it is best to minimize subchannel switches, which can now be identified within the timeline.

The shader profiler summary has also been expanded, with new columns for per-shader register numbers as well as theoretical warp occupancy.

Image showing expanded shader profile summary section with new columns on the left that identify shader count and warp occupancy
Figure 3. Expanded shader profile summary section with new columns on the left that identify shader count and warp occupancy

Nsight Graphics 2022.4 is wrapped up with support for enhanced barriers that are available in recent DirectX 12 Agility SDKs. Applications that use either enhanced barriers or traditional barriers will now be equally supported. Learn more about all of the new additions to Nsight Graphics 2022.4.

Nsight Deep Learning Designer 2022.2

A new version of Nsight Deep Learning Designer is available now. The 2022.2 update features expanded support for importing PyTorch models as well as launching the PyTorch exporter from a virtual environment. Performance improvements have also been made to the Channel Inspector as well as path-finding to reduce overhead. 

Paired with this release, NVIDIA Feature Map Explorer 2022.1 is available now, offering measurable performance boosts to its feature map loading process alongside additional metrics for tracking tensor values. Learn more about Nsight Deep Learning Designer 2022.2 and NVIDIA Feature Map Explorer 2022.1.

Get the latest Nsight releases

Additional resources

Watch a guided walkthrough about using Nsight tools to work through real-life development scenarios.

For even more information, see:

Want to help us build better tools for you? Share your thoughts with the NVIDIA Nsight Graphics Survey that takes less than one minute to complete. 

Categories
Misc

Reimagining Drug Discovery with Computational Biology at GTC 2022

Take a deep dive into the latest advances in drug research with AI and accelerated computing at these GTC 2022 featured sessions.