Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

test: Add L0_additional_dependency_dirs #7707

Merged
merged 13 commits into from
Oct 23, 2024
17 changes: 15 additions & 2 deletions docs/customization_guide/deploy.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
<!--
# Copyright (c) 2020-2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
# Copyright (c) 2020-2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
#
# Redistribution and use in source and binary forms, with or without
# modification, are permitted provided that the following conditions
Expand Down Expand Up @@ -244,7 +244,9 @@ to `False` if not required to avoid capturing or exposing any sensitive informat

#### `--trace-config level=<string> default "off"`

Tracing mode. Trace mode supports `triton` and `opentelemetry`. Unless required `--trace-config level=off` should be set to avoid capturing or exposing any sensitive information.
Tracing mode. Trace mode supports `triton` and `opentelemetry`. Unless required
`--trace-config level=off` should be set to avoid capturing or exposing any
sensitive information.


##### `backend-directory <string> default /opt/tritonserver/backends`
Expand Down Expand Up @@ -273,6 +275,17 @@ Directory where cache shared libraries are found.
> must be access controlled. Adding untrusted files
> can lead to arbitrarty code execution.

##### `backend-config=<backend>,additional-dependency-dirs=<string>`

This is an optional Windows feature that enables Triton to search custom
dependency directories when loading a specific backend. The user can input
these directories as a string of semicolon-separated paths (including a
trailing semicolon). These directories are programmatically prepended to
the process's PATH and are removed when the backend is loaded successfully.
Windows will search PATH last in its search sequence, so be cautious that
no untrusted files of same name exist in a location of higher search priority
(e.g., System32). It is still recommended to add backend-specific dependencies
to their corresponding backend folder when possible.



Expand Down
269 changes: 269 additions & 0 deletions qa/L0_additional_dependency_dirs/test.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,269 @@
#!/bin/bash
# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
#
# Redistribution and use in source and binary forms, with or without
# modification, are permitted provided that the following conditions
# are met:
# * Redistributions of source code must retain the above copyright
# notice, this list of conditions and the following disclaimer.
# * Redistributions in binary form must reproduce the above copyright
# notice, this list of conditions and the following disclaimer in the
# documentation and/or other materials provided with the distribution.
# * Neither the name of NVIDIA CORPORATION nor the names of its
# contributors may be used to endorse or promote products derived
# from this software without specific prior written permission.
#
# THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS ``AS IS'' AND ANY
# EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
# IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
# PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR
# CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL,
# EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO,
# PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
# PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY
# OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
# (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
# OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

REPO_VERSION=${NVIDIA_TRITON_SERVER_VERSION}
if [ "$#" -ge 1 ]; then
REPO_VERSION=$1
fi
if [ -z "$REPO_VERSION" ]; then
echo -e "Repository version must be specified"
echo -e "\n***\n*** Test Failed\n***"
exit 1
fi

if [[ -v WSL_DISTRO_NAME ]] || [[ -v MSYSTEM ]]; then
TRITON_DIR=${TRITON_DIR:=c:/tritonserver}
SERVER=${SERVER:=c:/tritonserver/bin/tritonserver.exe}
BACKEND_DIR=${BACKEND_DIR:=c:/tritonserver/backends}
MODELDIR=${MODELDIR:=c:/}
TRITONSERVER_IPADDR=${TRITONSERVER_IPADDR:="localhost"}
else
echo -e "Test is not currently supported for Linux"
exit 1
fi

# FIXME: TPRD-244 - Do not use hard-coded dependency paths when TRT models
# are being regularly generated on Windows.
DEPENDENCY_PATH="C:/ci_test_deps/24.07"
STALE_DEPENDENCY_PATH="C:/ci_test_deps/24.05"
GuanLuo marked this conversation as resolved.
Show resolved Hide resolved
CUSTOM_DEPENDENCY_DIR=${MODELDIR}/custom_dependency_location
STALE_DEPENDENCY_DIR=${MODELDIR}/stale_dependency_location
TRT_MODEL_DIR="C:/tmp/24.07_trt_models"
Copy link
Contributor

@rmccorm4 rmccorm4 Oct 23, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this feature is agnostic to the backend, so having TRT models in this test will probably make it more painful to maintain or run locally than necessary. In your follow-up ticket TPRD-244, I would suggest revisiting this.

For example running CI with the BACKENDS="python" or something would have no effect on this test and fail looking for TRT models.

Instead I think TRT backend specific tests could add a test case for this feature if there's a specific TRT related flow it might be used for, and this test could be generic to something easier or more portable like a python model.

This would make it easier to compartmentalize running the tests on scenarios where it would be painful to get the TRT models for, or migrate to other runners, other versions, etc without compatibility issues.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If the main use case driving the feature is intended for use with TRT backend for now, it's fine - just food for thought.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I chose TRT for 2 reasons:

  1. The TRT backend was the motivating backend for this feature (as you say)
  2. The backend requires implicit dependencies that are expected to exist outside of the backend directory.

The python backend does not depend on any implicit dependencies outside of what exists in its corresponding backend directory and those dependencies must remain in that folder since their path relative to libtriton_python.so matters. I agree that having any other than TRT would be more maintainable, but I'd have to think on how I'd introduce an additional dependency to the python backend specifically for this test.

# Unlike the other commands, the mv command requires the path to be fully in
# the UNIX style in order for regex to work properly. We cannot apply this
# uniformly because the command line requires Windows path style.
LOCAL_CI_TEST_DEPS_DIR=${MODELDIR_POSIX}/ci_test_deps

source ../common/util.sh
rm -rf ${CUSTOM_DEPENDENCY_DIR} ${LOCAL_CI_TEST_DEPS_DIR} ${STALE_DEPENDENCY_DIR} ${MODELDIR_POSIX}/models
rm -f ./*.log ./*.out
RET=0;

mkdir ${LOCAL_CI_TEST_DEPS_DIR} ${CUSTOM_DEPENDENCY_DIR} ${STALE_DEPENDENCY_DIR}

# Make a copy of the ci_test_deps directory and move out the TRT dependencies
# A pre-test step will add ${LOCAL_CI_TEST_DEPS_DIR} to the PATH
cp -r ${DEPENDENCY_PATH}/* ${LOCAL_CI_TEST_DEPS_DIR} && mv ${LOCAL_CI_TEST_DEPS_DIR}/nvinfer* ${CUSTOM_DEPENDENCY_DIR}/

cp -r ${STALE_DEPENDENCY_PATH}/nvinfer* ${STALE_DEPENDENCY_DIR}/

mkdir ${MODELDIR_POSIX}/models && \
cp -r ${TRT_MODEL_DIR}/qa_model_repository/plan_int32_int32_int32 ${MODELDIR_POSIX}/models/plan_int32_int32_int32

function simple_inference_check()
{
INFER_SUCCESS=1
set +e
code=`curl -s -w %{http_code} -o ./curl.out -d'{"inputs":[{"name":"INPUT0","datatype":"INT32","shape":[1,16],"data":[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16]},{"name":"INPUT1","datatype":"INT32","shape":[1,16],"data":[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16]}]}' ${TRITONSERVER_IPADDR}:8000/v2/models/plan_int32_int32_int32/infer`
set -e
if [ "$code" != "200" ]; then
cat ./curl.out
INFER_SUCCESS=0
fi
}

# Test Case 1: Run server when TRT implicit dependencies are not on path (expect FAIL)
SERVER_LOG="./not_on_path_server.log"
SERVER_ARGS="--model-repository=${MODELDIR}/models --backend-directory=${BACKEND_DIR} --log-verbose=2"
run_server
if [ "$SERVER_PID" != "0" ]; then
echo -e "\n***\n*** FAILED on line ${LINENO}: unexpected success starting $SERVER\n***"
kill_server
RET=1
fi

# Test Case 2: Launch server with additional options to point to custom dependency location (expect SUCCESS)
SERVER_LOG="./custom_dependency_dir_server.log"
SERVER_ARGS="--model-repository=${MODELDIR}/models --backend-directory=${BACKEND_DIR} --log-verbose=2 --backend-config=tensorrt,additional-dependency-dirs=${CUSTOM_DEPENDENCY_DIR};"
run_server
if [ "$SERVER_PID" == "0" ]; then
echo -e "\n***\n*** Failed to start $SERVER\n***"
cat $SERVER_LOG
RET=1
fi

simple_inference_check
if [ "${INFER_SUCCESS}" == "0" ]; then
echo -e "\n***\n*** FAILED on line ${LINENO}: simple inference check failed\n***"
RET=1
fi

kill_server

# Test Case 3: Launch server with additional options to point to custom dependency location and load model dynamically (expect SUCCESS)
SERVER_LOG="./dynamic_loading_server.log"
SERVER_ARGS="--model-repository=${MODELDIR}/models --backend-directory=${BACKEND_DIR} --log-verbose=2 --backend-config=tensorrt,additional-dependency-dirs=${CUSTOM_DEPENDENCY_DIR}; --model-control-mode=explicit"
run_server
if [ "$SERVER_PID" == "0" ]; then
echo -e "\n***\n*** Failed to start $SERVER\n***"
cat $SERVER_LOG
RET=1
fi

code=`curl -s -w %{http_code} -o ./curl.out -X POST ${TRITONSERVER_IPADDR}:8000/v2/repository/models/plan_int32_int32_int32/load`
if [ "$code" != "200" ]; then
echo -e "\n***\n*** FAILED on line ${LINENO}: Unable to load model: plan_int32_int32_int32\n***"
cat ./curl.out
RET=1
fi

simple_inference_check
if [ "${INFER_SUCCESS}" == "0" ]; then
echo -e "\n***\n*** FAILED on line ${LINENO}: simple inference check failed\n***"
RET=1
fi

kill_server

# Test Case 4: Run server when pointing to stale TRT dependencies (expect FAIL)
SERVER_LOG="./stale_dependencies_server.log"
SERVER_ARGS="--model-repository=${MODELDIR}/models --backend-directory=${BACKEND_DIR} --log-verbose=2 --backend-config=tensorrt,additional-dependency-dirs=${STALE_DEPENDENCY_DIR};"
run_server
if [ "$SERVER_PID" != "0" ]; then
echo -e "\n***\n*** FAILED on line ${LINENO}: unexpected success starting $SERVER\n***"
kill_server
RET=1
fi

# Test Case 5: [Test ordering] Run server when pointing to stale and correct TRT dependencies (stale first - expect FAIL).
SERVER_LOG="./stale_first_server.log"
SERVER_ARGS="--model-repository=${MODELDIR}/models --backend-directory=${BACKEND_DIR} --log-verbose=2 --backend-config=tensorrt,additional-dependency-dirs=${STALE_DEPENDENCY_DIR};${CUSTOM_DEPENDENCY_DIR};"
run_server
if [ "$SERVER_PID" != "0" ]; then
echo -e "\n***\n*** FAILED on line ${LINENO}: unexpected success starting $SERVER\n***"
kill_server
RET=1
fi

# Test Case 6: [Test ordering] Run server when pointing to stale and correct TRT dependencies (correct first - expect SUCCESS).
SERVER_LOG="./correct_first_server.log"
SERVER_ARGS="--model-repository=${MODELDIR}/models --backend-directory=${BACKEND_DIR} --log-verbose=2 --backend-config=tensorrt,additional-dependency-dirs=${CUSTOM_DEPENDENCY_DIR};${STALE_DEPENDENCY_DIR};"
run_server
if [ "$SERVER_PID" == "0" ]; then
echo -e "\n***\n*** Failed to start $SERVER\n***"
cat $SERVER_LOG
RET=1
fi

simple_inference_check
if [ "${INFER_SUCCESS}" == "0" ]; then
echo -e "\n***\n*** FAILED on line ${LINENO}: simple inference check failed\n***"
RET=1
fi

kill_server

ORIGINAL_PATH=$PATH
# Test Case 7: [Test ordering] Run server when correct TRT dependencies exist in environment PATH (expect SUCCESS)
SERVER_LOG="./correct_in_environment.log"
SERVER_ARGS="--model-repository=${MODELDIR}/models --backend-directory=${BACKEND_DIR} --log-verbose=2"
PATH="${ORIGINAL_PATH};${CUSTOM_DEPENDENCY_DIR};"
run_server
if [ "$SERVER_PID" == "0" ]; then
echo -e "\n***\n*** Failed to start $SERVER\n***"
cat $SERVER_LOG
RET=1
fi

simple_inference_check
if [ "${INFER_SUCCESS}" == "0" ]; then
echo -e "\n***\n*** FAILED on line ${LINENO}: simple inference check failed\n***"
RET=1
fi

PATH=$ORIGINAL_PATH
kill_server

# Test Case 8: [Test ordering] Run server when correct TRT dependencies exist in environment PATH, but user specifies stale additional dependency directory (user input takes priority - expect FAIL)
SERVER_LOG="./correct_in_environment_but_user_adds_stale.log"
SERVER_ARGS="--model-repository=${MODELDIR}/models --backend-directory=${BACKEND_DIR} --log-verbose=2 --backend-config=tensorrt,additional-dependency-dirs=${STALE_DEPENDENCY_DIR};"
PATH="${ORIGINAL_PATH};${CUSTOM_DEPENDENCY_DIR};"
run_server
if [ "$SERVER_PID" != "0" ]; then
echo -e "\n***\n*** FAILED on line ${LINENO}: unexpected success starting $SERVER\n***"
kill_server
RET=1
fi

PATH=$ORIGINAL_PATH

# Test Case 9: [Test ordering] Run server when stale TRT dependencies exist in environment PATH (expect FAIL)
SERVER_LOG="./stale_in_environment.log"
SERVER_ARGS="--model-repository=${MODELDIR}/models --backend-directory=${BACKEND_DIR} --log-verbose=2"
PATH="${ORIGINAL_PATH};${STALE_DEPENDENCY_DIR};"
run_server
if [ "$SERVER_PID" != "0" ]; then
echo -e "\n***\n*** FAILED on line ${LINENO}: unexpected success starting $SERVER\n***"
kill_server
RET=1
fi

PATH=$ORIGINAL_PATH

# Test Case 10: [Test ordering] Run server when stale TRT dependencies exist in environment PATH, but user specifies correct additional dependency directory (user input takes priority - expect SUCCESS)
SERVER_LOG="./stale_in_environment_but_user_adds_correct.log"
SERVER_ARGS="--model-repository=${MODELDIR}/models --backend-directory=${BACKEND_DIR} --log-verbose=2 --backend-config=tensorrt,additional-dependency-dirs=${CUSTOM_DEPENDENCY_DIR};"
PATH="${ORIGINAL_PATH};${STALE_DEPENDENCY_DIR};"
run_server
if [ "$SERVER_PID" == "0" ]; then
echo -e "\n***\n*** Failed to start $SERVER\n***"
cat $SERVER_LOG
RET=1
fi

simple_inference_check
if [ "${INFER_SUCCESS}" == "0" ]; then
echo -e "\n***\n*** FAILED on line ${LINENO}: simple inference check failed\n***"
RET=1
fi

PATH=$ORIGINAL_PATH
kill_server

# Test Case 11: Incorrect extension usage. User provided path(s) that are not semi-colon separated (expect FAIL).
SERVER_LOG="./incorrect_usage_server.log"
SERVER_ARGS="--model-repository=${MODELDIR}/models --backend-directory=${BACKEND_DIR} --log-verbose=2 --backend-config=tensorrt,additional-dependency-dirs=C:/not_semicolon_terminated"
run_server
if [ "$SERVER_PID" != "0" ]; then
echo -e "\n***\n*** FAILED on line ${LINENO}: unexpected success starting $SERVER\n***"
kill_server
RET=1
fi

if [ $(cat ${SERVER_LOG} | grep "malformed" | wc -l) -eq 0 ]; then
echo -e "\n***\n*** FAILED on line ${LINENO}: expected error statement not found $SERVER\n***"
cat $SERVER_LOG
RET=1
fi

if [ $RET -eq 0 ]; then
echo -e "\n***\n*** Test Passed\n***"
else
echo -e "\n***\n*** Test FAILED\n***"
fi

exit $RET
Loading