DataBloom - Part 87

Misc

Deploy Agents, Assistants, and Avatars on NVIDIA RTX AI PCs with New Small Language Models

Post author By
Post date December 17, 2024
No Comments on Deploy Agents, Assistants, and Avatars on NVIDIA RTX AI PCs with New Small Language Models

Image of a photorealistic digital human looking at the camera. NVIDIA just announced a series of small language models (SLMs) that increase the amount and type of information digital humans can use to augment their…

NVIDIA just announced a series of small language models (SLMs) that increase the amount and type of information digital humans can use to augment their responses. This includes new large-context models that provide more relevant answers and new multi-modal models that allow images as inputs. These models are available now as part of NVIDIA ACE, a suite of digital human technologies that brings…

Source

Misc

Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding

Post author By
Post date December 17, 2024
No Comments on Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding

Meta’s Llama collection of open large language models (LLMs) continues to grow with the recent addition of Llama 3.3 70B, a text-only instruction-tuned model….

Meta’s Llama collection of open large language models (LLMs) continues to grow with the recent addition of Llama 3.3 70B, a text-only instruction-tuned model. Llama 3.3 provides enhanced performance respective to the older Llama 3.1 70B model and can even match the capabilities of the larger, more computationally expensive Llama 3.1 405B model on several tasks including math, reasoning, coding…

Source

Misc

AI in Your Own Words: NVIDIA Debuts NeMo Retriever Microservices for Multilingual Generative AI Fueled by Data

Post author By
Post date December 17, 2024
No Comments on AI in Your Own Words: NVIDIA Debuts NeMo Retriever Microservices for Multilingual Generative AI Fueled by Data

In enterprise AI, understanding and working across multiple languages is no longer optional — it’s essential for meeting the needs of employees, customers and users worldwide. Multilingual information retrieval — the ability to search, process and retrieve knowledge across languages — plays a key role in enabling AI to deliver more accurate and globally relevant
Read Article

Misc

Develop Multilingual and Cross-Lingual Information Retrieval Systems with Efficient Data Storage

Post author By
Post date December 17, 2024
No Comments on Develop Multilingual and Cross-Lingual Information Retrieval Systems with Efficient Data Storage

Efficient text retrieval is critical for a broad range of information retrieval applications such as search, question answering, semantic textual similarity,…

Efficient text retrieval is critical for a broad range of information retrieval applications such as search, question answering, semantic textual similarity, summarization, and item recommendation. It also plays a pivotal role in retrieval-augmented generation (RAG), a technique that enables large language models (LLMs) to access external context without modifying underlying parameters.

Source

Misc

NVIDIA Unveils Its Most Affordable Generative AI Supercomputer

Post author By
Post date December 17, 2024
No Comments on NVIDIA Unveils Its Most Affordable Generative AI Supercomputer

NVIDIA is taking the wraps off a new compact generative AI supercomputer, offering increased performance at a lower price with a software upgrade. The new NVIDIA Jetson Orin Nano Super Developer Kit, which fits in the palm of a hand, provides everyone from commercial AI developers to hobbyists and students, gains in generative AI capabilities
Read Article

Misc

NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost

Post author By
Post date December 17, 2024
No Comments on NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost

The generative AI landscape is rapidly evolving, with new large language models (LLMs), visual language models (VLMs), and vision language action (VLA) models…

The generative AI landscape is rapidly evolving, with new large language models (LLMs), visual language models (VLMs), and vision language action (VLA) models emerging daily. To stay at the forefront of this transformative era, developers need a platform powerful enough to seamlessly deploy the latest models from the cloud to the edge with optimized inferencing and open ML frameworks using CUDA.

Source

Misc

Welcome the Falcon 3 Family of Open Models!

Post author By
Post date December 17, 2024
No Comments on Welcome the Falcon 3 Family of Open Models!

Misc

Benchmarking Language Model Performance on 5th Gen Xeon at GCP

Post author By
Post date December 17, 2024
No Comments on Benchmarking Language Model Performance on 5th Gen Xeon at GCP

Misc

Sandboxing Agentic AI Workflows with WebAssembly

Post author By
Post date December 16, 2024
No Comments on Sandboxing Agentic AI Workflows with WebAssembly

Agentic AI workflows often involve the execution of large language model (LLM)-generated code to perform tasks like creating data visualizations. However, this…

Agentic AI workflows often involve the execution of large language model (LLM)-generated code to perform tasks like creating data visualizations. However, this code should be sanitized and executed in a safe environment to mitigate risks from prompt injection and errors in the returned code. Sanitizing Python with regular expressions and restricted runtimes is insufficient…

Source

Misc

Tech Leader, AI Visionary, Endlessly Curious Jensen Huang to Keynote CES 2025

Post author By
Post date December 16, 2024
No Comments on Tech Leader, AI Visionary, Endlessly Curious Jensen Huang to Keynote CES 2025

On Jan. 6 at 6:30 p.m. PT, NVIDIA founder and CEO Jensen Huang — with his trademark leather jacket and an unwavering vision — will step onto the CES 2025 stage. From humble beginnings as a busboy at a Denny’s to founding NVIDIA, Huang’s story embodies innovation and perseverance. Huang has been named the world’s
Read Article