Phi-3-Medium accelerates research with logic-rich features in both short (4K) and long (128K) context.
Phi-3-Medium accelerates research with logic-rich features in both short (4K) and long (128K) context.
Phi-3-Medium accelerates research with logic-rich features in both short (4K) and long (128K) context.
Phi-3-Medium accelerates research with logic-rich features in both short (4K) and long (128K) context.
Edgeless Systems introduced Continuum AI, the first generative AI (GenAI) framework that keeps prompts encrypted at all times with confidential computing by…
Edgeless Systems introduced Continuum AI, the first generative AI (GenAI) framework that keeps prompts encrypted at all times with confidential computing by combining confidential VMs with NVIDIA H100 GPUs and secure sandboxing. The launch of this platform underscores a new era in AI deployment, where the benefits of powerful LLMs can be realized without compromising data privacy and…
On a weekday afternoon, Ashwini Ashtankar sat on the bank of the Doodhpathri River, in a valley nestled in the Himalayas. Taking a deep breath, she noticed that there was no city noise, no pollution — and no work emails. Ashtankar, a senior tools development engineer in NVIDIA’s Pune, India, office, took advantage of the
Read Article
Trained on 600+ programming languages, StarCoder2-15B is now packaged as a NIM inference microservice available for free from the NVIDIA API catalog.
Trained on 600+ programming languages, StarCoder2-15B is now packaged as a NIM inference microservice available for free from the NVIDIA API catalog.
Featured in Nature, this post delves into how GPUs and other advanced technologies are meeting the computational challenges posed by AI.
Featured in Nature, this post delves into how GPUs and other advanced technologies are meeting the computational challenges posed by AI.
Brev.dev is making it easier to develop AI solutions by leveraging software libraries, frameworks, and Jupyter Notebooks on the NVIDIA NGC catalog. You can use…
Brev.dev is making it easier to develop AI solutions by leveraging software libraries, frameworks, and Jupyter Notebooks on the NVIDIA NGC catalog. You can use Brev.dev to easily deploy software on an NVIDIA GPU by pairing a cloud orchestration tool with a simple UI. Get an on-demand GPU reliably from any cloud, access the notebook in-browser, or SSH directly into the machine with the Brev…
Gemma 2, the next generation of Google Gemma models, is now optimized with TensorRT-LLM and packaged as NVIDIA NIM inference microservice.
Gemma 2, the next generation of Google Gemma models, is now optimized with TensorRT-LLM and packaged as NVIDIA NIM inference microservice.