NV-CLIP, a cutting-edge multimodal embeddings model for image and text, is now generally available.
Microsoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft’s TuringMM visual embedding model that maps images and text into a shared high-dimensional space. Operating on billions of images across the web, performance is critical. This post details efforts to optimize the TuringMM pipeline using NVIDIA…
Producing commercials is resource-intensive, requiring physical locations and various props and setups to display products in different settings and environments for more accurate consumer targeting. This traditional process is not only expensive and time-consuming but can also be destructive to the physical environment. It also leaves you unable to capture a new angle after you return home.
Polars, one of the fastest-growing data analytics tools, has just crossed 9M monthly downloads. As a modern DataFrame library, it is designed for efficiently processing datasets that fit on a single machine, without the overhead and complexity of distributed computing systems that are required for massive-scale workloads. As enterprises grapple with complex data problems—ranging from…
Developers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve surgical workflows. One such challenge is efficiently combining multi-modal imaging data, such as preoperative 3D patient images with intra-operative video. This is key to providing surgeons with real-time…
Updates include tensor parallel support for Mamba2, sparse mixer normalization for MoE models, and more.
NeMo Curator now supports images, enabling you to process data for training accurate generative AI models.