From f28bb288d75e86f5b45a1e6f9ed374598e93aea2 Mon Sep 17 00:00:00 2001 From: Jeff Boudier <74218629+jeffboudier@users.noreply.github.com> Date: Fri, 23 Aug 2024 11:48:40 -0700 Subject: [PATCH] Update container_types.mdx Moving NIM container to end of list --- docs/source/others/container_types.mdx | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/docs/source/others/container_types.mdx b/docs/source/others/container_types.mdx index 6c7ed9c..34c7c38 100644 --- a/docs/source/others/container_types.mdx +++ b/docs/source/others/container_types.mdx @@ -10,11 +10,6 @@ The default container type is the easiest way to deploy endpoints and is flexibl Select a custom container type if you'd like to customize the image and include a [custom container](https://huggingface.co/docs/inference-endpoints/guides/custom_container). -## NVIDIA NIM (no longer available in UI) - -*The NIM container type will no longer be officially supported for already existing Endpoints in Inference Endpoints beginning October 1st, 2024.* -Select the NIM container type for models supported by NVIDIA. You'll see this option in the UI if supported for that model. - ## Text Embeddings Inference Select the Text Embeddings Inference container type to gain all the benefits of [TEI](https://huggingface.co/docs/text-embeddings-inference/en/index) for your Endpoint. You'll see this option in the UI if supported for that model. @@ -30,3 +25,8 @@ Select the Text Generation Inference Inferentia2 Neuron container type for model ## Text Generation Inference (TPU) Select the Text Generation Inference TPU container type for models you'd like to deploy with TGI on a Google Cloud TPU instance. You'll see this option in the UI if supported for that model. + +## NVIDIA NIM (no longer available in UI) + +*The NIM container type will no longer be officially supported for already existing Endpoints in Inference Endpoints beginning October 1st, 2024.* +Select the NIM container type for models supported by NVIDIA. You'll see this option in the UI if supported for that model.