feat: Load new model version should not reload loaded existing model version(s) #388

kthui · 2024-08-14T17:05:49Z

What does the PR do?

It is requested that any model version(s) already loaded should not be reloaded upon adding and loading new version(s), if the already loaded version(s) are not modified. This is a small optimization to the model load/unload logic to avoid reloading unchanged model version(s) unload a load request.

Checklist

Commit Type:

Check the conventional commit type
box here and add the label to the github PR.

Related PRs:

triton-inference-server/server#7527

Where should the reviewer start?

Start with the changes to model_lifecycle to see why the small twist to the logic enables the previously reloaded model version to be updated, and then move towards model_config_utils and model_repository_manager to see how those changes support the decision to update vs reload.

Test plan:

New tests on not reloading already loaded model version upon loading other model versions is added. See the tests on server PR.

CI Pipeline ID: 17511909 17650341

Caveats:

N/A

Background

Previously, the model (all versions) will always be reloaded if there is a change in the model directory beyond the model config. This will cause unnecessary reload of unmodified model version(s), which this change makes model directory change detection more granular and decides if a model version should be reloaded or updated based on the model file(s) of the version.

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

N/A

… model version files

src/model_config_utils.cc

GuanLuo

Overall LGTM, have a few comments.

GuanLuo · 2024-08-16T23:05:13Z

src/model_repository_manager/model_repository_manager.cc

-      // model files
-      mtime.second = std::max(mtime.second, GetModifiedTime(full_path));
+    model_timestamps_.clear();
+    return;


raise exception and try catch?

Did a quick test on L0_lifecycle, the test currently expects any error related to retrieving timestamp to remain hidden and simply set the failed timestamp to 0. To translate it into this ModelTimestamp object, setting timestamp to 0 is the same as initializing the object using the default constructor, which is leaving the model_timestamps_ object empty. Thus, if an exception is raised here, the caller catching the exception will simply re-initialize ModelTimestamp object using the default constructor (to remain aligned with the current logic of setting the timestamp to 0). In this case, raising an exception is redundant and can be handled by the constructor in one or two lines, or with a helper function specifically for keeping everything empty.

Another approach is we can change the test cases to accept the behavior change that failure to determine timestamp will fail the model load.

I think to get this feature out quickly, we can retain the existing error handling on reading timestamps, and file another ticket for enhancing the error handling at a later time. @GuanLuo @rmccorm4 what do you think?

Retaining existing error behavior for the cherry-pick and enhancing in a follow-up sounds fine to me. If we really intend to do it - please put in a // DLIS-XXXX: blah blah so it doesn't get lost.

Sure, added the comment: Comment on should raise an exception when failed to create timestamp

Can be addressed as follow-up, but I do think an exception should be raised, and it is fine to have the caller catch the exception and then initialize a timestamp with default constructor (if not failing). This is from the stand point where if the user initializes a timestamp with a path, the user is expecting the returned timestamp to carry correct timestamp w.r.t. the given path, not 0. And thus an error should be raised programmatically and it is up to the user to decide how to handle the error.

src/model_repository_manager/model_repository_manager.cc

This reverts commit 41e57e5.

…version(s) (#388) * Do not reload unmodified loaded model version * Track model directory timestamps more granularly to detect updates to model version files * Rename model config util config change require reload function * Re-organize ModelTimestamp() and throw exception * Revert "Re-organize ModelTimestamp() and throw exception" This reverts commit 41e57e5. * Break constructor into multiple functions * Comment on should raise an exception when failed to create timestamp

…version(s) (#388) (#390) * Do not reload unmodified loaded model version * Track model directory timestamps more granularly to detect updates to model version files * Rename model config util config change require reload function * Re-organize ModelTimestamp() and throw exception * Revert "Re-organize ModelTimestamp() and throw exception" This reverts commit 41e57e5. * Break constructor into multiple functions * Comment on should raise an exception when failed to create timestamp

Do not reload unmodified loaded model version

2c255e4

kthui added the PR: feat A new feature label Aug 14, 2024

kthui mentioned this pull request Aug 14, 2024

test: Load new model version should not reload loaded existing model version(s) triton-inference-server/server#7527

Merged

20 tasks

Track model directory timestamps more granularly to detect updates to…

3d2f5c3

… model version files

kthui force-pushed the jacky-load-new-model-version branch from bf815d6 to 3d2f5c3 Compare August 14, 2024 17:47

kthui mentioned this pull request Aug 16, 2024

docs: Load new model version should not reload loaded existing model version(s) triton-inference-server/server#7537

Closed

20 tasks

kthui requested review from nnshah1, rmccorm4 and GuanLuo August 16, 2024 17:52

kthui marked this pull request as ready for review August 16, 2024 17:56

nnshah1 reviewed Aug 16, 2024

View reviewed changes

src/model_config_utils.cc Outdated Show resolved Hide resolved

Rename model config util config change require reload function

e6da04c

kthui requested a review from nnshah1 August 16, 2024 22:13

GuanLuo reviewed Aug 16, 2024

View reviewed changes

kthui added 3 commits August 16, 2024 17:26

Re-organize ModelTimestamp() and throw exception

41e57e5

Revert "Re-organize ModelTimestamp() and throw exception"

413e510

This reverts commit 41e57e5.

Break constructor into multiple functions

c35fc4d

kthui requested a review from GuanLuo August 19, 2024 21:19

Comment on should raise an exception when failed to create timestamp

f73ab61

GuanLuo approved these changes Aug 20, 2024

View reviewed changes

rmccorm4 approved these changes Aug 20, 2024

View reviewed changes

kthui merged commit ae99d04 into main Aug 20, 2024
1 check passed

kthui deleted the jacky-load-new-model-version branch August 20, 2024 19:26

kthui mentioned this pull request Aug 20, 2024

feat: Load new model version should not reload loaded existing model … #390

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Load new model version should not reload loaded existing model version(s) #388

feat: Load new model version should not reload loaded existing model version(s) #388

kthui commented Aug 14, 2024 •

edited

Loading

GuanLuo left a comment

GuanLuo Aug 16, 2024

kthui Aug 17, 2024

kthui Aug 19, 2024

rmccorm4 Aug 19, 2024 •

edited

Loading

kthui Aug 20, 2024

GuanLuo Aug 20, 2024

feat: Load new model version should not reload loaded existing model version(s) #388

feat: Load new model version should not reload loaded existing model version(s) #388

Conversation

kthui commented Aug 14, 2024 • edited Loading

What does the PR do?

Checklist

Commit Type:

Related PRs:

Where should the reviewer start?

Test plan:

Caveats:

Background

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

GuanLuo left a comment

Choose a reason for hiding this comment

GuanLuo Aug 16, 2024

Choose a reason for hiding this comment

kthui Aug 17, 2024

Choose a reason for hiding this comment

kthui Aug 19, 2024

Choose a reason for hiding this comment

rmccorm4 Aug 19, 2024 • edited Loading

Choose a reason for hiding this comment

kthui Aug 20, 2024

Choose a reason for hiding this comment

GuanLuo Aug 20, 2024

Choose a reason for hiding this comment

kthui commented Aug 14, 2024 •

edited

Loading

rmccorm4 Aug 19, 2024 •

edited

Loading