Categories
Misc

Maximum Performance and Minimum Footprint for AI Apps with NVIDIA TensorRT Weight-Stripped Engines

NVIDIA TensorRT, an established inference library for data centers, has rapidly emerged as a desirable inference backend for NVIDIA GeForce RTX and NVIDIA RTX GPUs. Now, deploying TensorRT into apps has gotten even easier with prebuilt TensorRT engines. The newly released TensorRT 10.0 with weight-stripped engines offers a unique solution for minimizing the engine shipment size by reducing…

Source

Categories
Misc

Spotlight: Cisco Enhances Workload Security and Operational Efficiency with NVIDIA BlueField-3 DPUs

As cyberattacks become more sophisticated, organizations must constantly adapt with cutting-edge solutions to protect their critical assets. One such solution is Cisco Secure Workload, a comprehensive security solution designed to safeguard application workloads across diverse infrastructures, locations, and form factors. Cisco recently announced version 3.9 of the Cisco Secure Workload…

Source

Categories
Misc

Confidential and Self-Sovereign AI: Best Practices for Enhancing Security and Autonomy

Join the webinar on June 11th with NVIDIA and Super Protocol to learn about the benefits of Confidential Computing for Web3 AI.

Source

Categories
Misc

Reallusion Brings Digital Characters to Life with NVIDIA AI

In today’s digital age, creating realistic animated characters is crucial for filmmakers, game developers, and content creators looking to bring their visions to life. Reallusion is at the forefront of this cutting-edge art form, using powerful AI technologies like NVIDIA Audio2Face and NVIDIA Maxine to craft lifelike digital humans and character animations. A major challenge exists in…

Source

Categories
Misc

Introducing SDXL-Lightning: New Lightning-Fast Model on NVIDIA API Catalog

Create high-resolution images efficiently with SDXL-Lightning, an advanced text-to-image generation model now available and optimized on the NVIDIA API Catalog.

Source

Categories
Misc

SOLAR-10.7B: Optimized Model Tailored for Instruction Following, Reasoning, and Mathematical Tasks

Enhance efficiency and performance in instruction-based NLP tasks with SOLAR-10.7B, especially in following instructions, reasoning, and mathematical tasks.

Source

Categories
Misc

NVIDIA Text Embedding Model Tops MTEB Leaderboard

The latest embedding model from NVIDIA—NV-Embed—set a new record for embedding accuracy with a score of 69.32 on the Massive Text Embedding Benchmark (MTEB), which covers 56 embedding tasks. Highly accurate and effective models like NV-Embed are key to transforming vast amounts of data into actionable insights. NVIDIA provides top-performing models through the NVIDIA API catalog.

Source

Categories
Misc

Explainer: What Is Generative AI?

Generative AI enables users to quickly generate new content based on a variety of inputs. Inputs and outputs to these models can include text, images, sounds, animation, 3D models, or other types of data.

Source

Categories
Misc

Seamlessly Deploying a Swarm of LoRA Adapters with NVIDIA NIM

The latest state-of-the-art foundation large language models (LLMs) have billions of parameters and are pretrained on trillions of tokens of input text. They often achieve striking results on a wide variety of use cases without any need for customization. Despite this, studies have shown that the best accuracy on downstream tasks can be achieved by adapting LLMs with high-quality…

Source

Categories
Misc

Why Accelerated Data Processing Is Crucial for AI Innovation in Every Industry

Across industries, AI is supercharging innovation with machine-powered computation. In finance, bankers are using AI to detect fraud more quickly and keep accounts safe, telecommunications providers are improving networks to deliver superior service, scientists are developing novel treatments for rare diseases, utility companies are building cleaner, more reliable energy grids and automotive companies are making