DataBloom - Part 138

Misc

Accelerate AI Infrastructure Using an NVIDIA Bluefield-3 DPU Integration with DDN Storage

Post author By
Post date July 23, 2024
No Comments on Accelerate AI Infrastructure Using an NVIDIA Bluefield-3 DPU Integration with DDN Storage

Decorative image of a car driving at night with light streaks. As AI becomes integral to organizational innovation and competitive advantage, the need for efficient and scalable infrastructure is more critical than ever. A…

As AI becomes integral to organizational innovation and competitive advantage, the need for efficient and scalable infrastructure is more critical than ever. A partnership between NVIDIA and DDN Storage is setting new standards in this area. By integrating NVIDIA BlueField DPUs into DDN EXAScaler and DDN Infinia and using them innovatively, DDN Storage is transforming data-centric workloads.

Source

Misc

Supercharging Llama 3.1 across NVIDIA Platforms

Post author By
Post date July 23, 2024
No Comments on Supercharging Llama 3.1 across NVIDIA Platforms

Decorative image of a llama in cool sunglasses against a sunny landscape. Meta’s Llama collection of large language models are the most popular foundation models in the open-source community today, supporting a variety of use cases….

Meta’s Llama collection of large language models are the most popular foundation models in the open-source community today, supporting a variety of use cases. Millions of developers worldwide are building derivative models, and are integrating these into their applications. With Llama 3.1, Meta is launching a suite of large language models (LLMs) as well as a suite of trust and safety models…

Source

Misc

Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever

Post author By
Post date July 23, 2024
No Comments on Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever

An illustration representing text retrieval pipelines for RAG. Enterprises are sitting on a goldmine of data waiting to be used to improve efficiency, save money, and ultimately enable higher productivity. With generative…

Enterprises are sitting on a goldmine of data waiting to be used to improve efficiency, save money, and ultimately enable higher productivity. With generative AI, developers can build and deploy an agentic flow or a retrieval-augmented generation (RAG) chatbot, while ensuring the insights provided are based on the most accurate and up-to-date information. Building these solutions requires not…

Source

Misc

Build an Agentic RAG Pipeline with Llama 3.1 and NVIDIA NeMo Retriever NIMs

Post author By
Post date July 23, 2024
No Comments on Build an Agentic RAG Pipeline with Llama 3.1 and NVIDIA NeMo Retriever NIMs

An illustrations representing agnetic RAG. Employing retrieval-augmented generation (RAG) is an effective strategy for ensuring large language model (LLM) responses are up-to-date and not…

Employing retrieval-augmented generation (RAG) is an effective strategy for ensuring large language model (LLM) responses are up-to-date and not hallucinated. While various retrieval strategies can improve the recall of documents for generation, there is no one-size-fits-all approach. The retrieval pipeline depends on your data, from hyperparameters like the chunk size…

Source

Misc

Customize Generative AI Models for Enterprise Applications with Llama 3.1

Post author By
Post date July 23, 2024
No Comments on Customize Generative AI Models for Enterprise Applications with Llama 3.1

Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server The newly unveiled Llama 3.1 collection of 8B, 70B, and 405B large language models (LLMs) is narrowing the gap between proprietary and open-source models. Their…

The newly unveiled Llama 3.1 collection of 8B, 70B, and 405B large language models (LLMs) is narrowing the gap between proprietary and open-source models. Their open nature is attracting more developers and enterprises to integrate these models into their AI applications. These models excel at various tasks including content generation, coding, and deep reasoning, and can be used to power…

Source

Misc

Creating Synthetic Data Using Llama 3.1 405B

Post author By
Post date July 23, 2024
No Comments on Creating Synthetic Data Using Llama 3.1 405B

An illustration representing syntheti Synthetic data isn’t about creating new information. It’s about transforming existing information to create different variants. For over a decade, synthetic…

Synthetic data isn’t about creating new information. It’s about transforming existing information to create different variants. For over a decade, synthetic data has been used to improve model accuracy across the board—whether it is transforming images to improve object detection models, strengthening fraudulent credit card detection, or improving BERT models for QA. What’s new?

Source

Misc

NVIDIA AI Foundry Builds Custom Llama 3.1 Generative AI Models for the World’s Enterprises

Post author By
Post date July 23, 2024
No Comments on NVIDIA AI Foundry Builds Custom Llama 3.1 Generative AI Models for the World’s Enterprises

NVIDIA today announced a new NVIDIA AI Foundry service and NVIDIA NIM™ inference microservices to supercharge generative AI for the world’s enterprises with the Llama 3.1 collection of openly available models, also introduced today.

Misc

AI, Go Fetch! New NVIDIA NeMo Retriever Microservices Boost LLM Accuracy and Throughput

Post author By
Post date July 23, 2024
No Comments on AI, Go Fetch! New NVIDIA NeMo Retriever Microservices Boost LLM Accuracy and Throughput

Generative AI applications have little, or sometimes negative, value without accuracy — and accuracy is rooted in data. To help developers efficiently fetch the best proprietary data to generate knowledgeable responses for their AI applications, NVIDIA today announced four new NVIDIA NeMo Retriever NIM inference microservices. Combined with NVIDIA NIM inference microservices for the Llama
Read Article

Misc

How NVIDIA AI Foundry Lets Enterprises Forge Custom Generative AI Models

Post author By
Post date July 23, 2024
No Comments on How NVIDIA AI Foundry Lets Enterprises Forge Custom Generative AI Models

Businesses seeking to harness the power of AI need customized models tailored to their specific industry needs. NVIDIA AI Foundry is a service that enables enterprises to use data, accelerated computing and software tools to create and deploy custom models that can supercharge their generative AI initiatives. Just as TSMC manufactures chips designed by other
Read Article

Misc

Llama 3.1 – 405B, 70B & 8B with multilinguality and long context

Post author By
Post date July 23, 2024
No Comments on Llama 3.1 – 405B, 70B & 8B with multilinguality and long context