<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Nim on Confidential Containers</title>
    <link>/tags/nim/</link>
    <description>Recent content in Nim on Confidential Containers</description>
    <generator>Hugo</generator>
    <language>en-us</language>
    <atom:link href="/tags/nim/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>NVIDIA confidential NIM deployment</title>
      <link>/docs/examples/nvidia-nim-confidential-gpu-attestation/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      <guid>/docs/examples/nvidia-nim-confidential-gpu-attestation/</guid>
      <description>&lt;p&gt;This example adapts an &lt;a href=&#34;https://docs.nvidia.com/nim/index.html&#34;&gt;NVIDIA NIM&lt;/a&gt; inference deployment on&#xA;Kubernetes to run with Confidential Containers. This particular scenario targets one AMD SEV-SNP&#xA;Kubernetes worker node with NVIDIA GPU confidential computing support. The same NIM deployment&#xA;pattern can be adapted to Intel TDX nodes, but the reference values and attestation policy must be&#xA;generated for TDX rather than SNP. Those TDX-specific steps are out of scope for this exercise.&lt;/p&gt;&#xA;&lt;p&gt;NVIDIA NIM is a set of inference microservices that package foundation models as containers with&#xA;optimized runtimes and HTTP APIs for GPU infrastructure. This example starts with a plain NIM Pod&#xA;manifest for the &lt;code&gt;nvcr.io/nim/meta/llama-3.1-8b-instruct:1.13.1&lt;/code&gt; image, which serves the Meta&#xA;Llama 3.1 8B Instruct model through a chat completions API. The optional baseline step runs that&#xA;manifest with the non-confidential &lt;code&gt;kata-qemu-nvidia-gpu&lt;/code&gt; runtime class and queries its health,&#xA;model list, and chat completion endpoints on port 8000. The confidential scenario uses the&#xA;&lt;code&gt;kata-qemu-nvidia-gpu-snp&lt;/code&gt; runtime class which moves the Pod into a confidential VM, but the change&#xA;alone is not sufficient: A secure deployment also needs Trustee&amp;rsquo;s Key Broker Service (KBS), guest&#xA;pull, Attestation Agent (AA) and Confidential Data Hub (CDH) configuration, sealed secrets, image&#xA;signature policy, a generated Kata agent policy, trusted storage, and a KBS policy that approves&#xA;the expected CPU, GPU, and initdata evidence. The checkpoints below add those pieces one at a time.&lt;/p&gt;</description>
    </item>
  </channel>
</rss>
