Amazon SageMaker Llama 2 Inference via Response Streaming
sagemaker sagemaker-endpoint response-streaming large-language-models text-generation-inference llama2 large-model-inference
-
Updated
Jun 28, 2024 - Jupyter Notebook