DataBloom - Part 138

Misc

Build an Agentic RAG Pipeline with Llama 3.1 and NVIDIA NeMo Retriever NIMs

Post author By
Post date July 23, 2024
No Comments on Build an Agentic RAG Pipeline with Llama 3.1 and NVIDIA NeMo Retriever NIMs

An illustrations representing agnetic RAG. Employing retrieval-augmented generation (RAG) is an effective strategy for ensuring large language model (LLM) responses are up-to-date and not…

Employing retrieval-augmented generation (RAG) is an effective strategy for ensuring large language model (LLM) responses are up-to-date and not hallucinated. While various retrieval strategies can improve the recall of documents for generation, there is no one-size-fits-all approach. The retrieval pipeline depends on your data, from hyperparameters like the chunk size…

Source

Misc

Customize Generative AI Models for Enterprise Applications with Llama 3.1

Post author By
Post date July 23, 2024
No Comments on Customize Generative AI Models for Enterprise Applications with Llama 3.1

Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server The newly unveiled Llama 3.1 collection of 8B, 70B, and 405B large language models (LLMs) is narrowing the gap between proprietary and open-source models. Their…

The newly unveiled Llama 3.1 collection of 8B, 70B, and 405B large language models (LLMs) is narrowing the gap between proprietary and open-source models. Their open nature is attracting more developers and enterprises to integrate these models into their AI applications. These models excel at various tasks including content generation, coding, and deep reasoning, and can be used to power…

Source

Misc

Creating Synthetic Data Using Llama 3.1 405B

Post author By
Post date July 23, 2024
No Comments on Creating Synthetic Data Using Llama 3.1 405B

An illustration representing syntheti Synthetic data isn’t about creating new information. It’s about transforming existing information to create different variants. For over a decade, synthetic…

Synthetic data isn’t about creating new information. It’s about transforming existing information to create different variants. For over a decade, synthetic data has been used to improve model accuracy across the board—whether it is transforming images to improve object detection models, strengthening fraudulent credit card detection, or improving BERT models for QA. What’s new?

Source

Misc

NVIDIA AI Foundry Builds Custom Llama 3.1 Generative AI Models for the World’s Enterprises

Post author By
Post date July 23, 2024
No Comments on NVIDIA AI Foundry Builds Custom Llama 3.1 Generative AI Models for the World’s Enterprises

NVIDIA today announced a new NVIDIA AI Foundry service and NVIDIA NIM™ inference microservices to supercharge generative AI for the world’s enterprises with the Llama 3.1 collection of openly available models, also introduced today.

Misc

AI, Go Fetch! New NVIDIA NeMo Retriever Microservices Boost LLM Accuracy and Throughput

Post author By
Post date July 23, 2024
No Comments on AI, Go Fetch! New NVIDIA NeMo Retriever Microservices Boost LLM Accuracy and Throughput

Generative AI applications have little, or sometimes negative, value without accuracy — and accuracy is rooted in data. To help developers efficiently fetch the best proprietary data to generate knowledgeable responses for their AI applications, NVIDIA today announced four new NVIDIA NeMo Retriever NIM inference microservices. Combined with NVIDIA NIM inference microservices for the Llama
Read Article

Misc

How NVIDIA AI Foundry Lets Enterprises Forge Custom Generative AI Models

Post author By
Post date July 23, 2024
No Comments on How NVIDIA AI Foundry Lets Enterprises Forge Custom Generative AI Models

Businesses seeking to harness the power of AI need customized models tailored to their specific industry needs. NVIDIA AI Foundry is a service that enables enterprises to use data, accelerated computing and software tools to create and deploy custom models that can supercharge their generative AI initiatives. Just as TSMC manufactures chips designed by other
Read Article

Misc

Llama 3.1 – 405B, 70B & 8B with multilinguality and long context

Post author By
Post date July 23, 2024
No Comments on Llama 3.1 – 405B, 70B & 8B with multilinguality and long context

Misc

Transforming Telco Network Operations Centers with NVIDIA NeMo Retriever and NVIDIA NIM

Post author By
Post date July 23, 2024
No Comments on Transforming Telco Network Operations Centers with NVIDIA NeMo Retriever and NVIDIA NIM

Two individuals looking at a computer in a telco network operations center. Telecom companies are challenged with consistently meeting service level agreements (SLAs) for end customers that ensure network quality of service. This…

Telecom companies are challenged with consistently meeting service level agreements (SLAs) for end customers that ensure network quality of service. This includes quickly troubleshooting network devices with complex issues, identifying root causes, and resolving issues efficiently at their network operations centers (NOCs). Current network troubleshooting and repair processes are often time…

Source

Misc

Automating Telco Network Design using NVIDIA NIM and NVIDIA NeMo

Post author By
Post date July 23, 2024
No Comments on Automating Telco Network Design using NVIDIA NIM and NVIDIA NeMo

Telco wireless network design. Telecom wireless network design demands streamlined processes and standardized approaches. Network architects, engineers, and IT professionals are challenged…

Telecom wireless network design demands streamlined processes and standardized approaches. Network architects, engineers, and IT professionals are challenged with manually retrieving and customizing Topology and Orchestration Specification for Cloud Applications (TOSCA) templates to meet firm industry specifications. This leads to reduced productivity and increases the risk of human errors and…

Source

Misc

NVIDIA’s AI Masters Sweep KDD Cup 2024 Data Science Competition

Post author By
Post date July 22, 2024
No Comments on NVIDIA’s AI Masters Sweep KDD Cup 2024 Data Science Competition

Team NVIDIA has triumphed at the Amazon KDD Cup 2024, securing first place Friday across all five competition tracks. The team — consisting of NVIDIANs Ahmet Erdem, Benedikt Schifferer, Chris Deotte, Gilberto Titericz, Ivan Sorokin and Simon Jegou — demonstrated its prowess in generative AI, winning in categories that included text generation, multiple-choice questions, name
Read Article