This repository has been archived by the owner on Sep 18, 2024. It is now read-only.
correctly deal with job retries on openpai #3769
QuanluZhang
started this conversation in
New Feature Design Discussion
Replies: 1 comment
-
I think it is an important bug need to be fixed. Related to #863 and #865. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Short summary about the issue/question: a job on openpai may be retried. In the current version, nni is not aware of such event. This may induce potential issues, for example, an assessor may find such a trial's learning curve is strange, leading to incorrect behavior.
Brief what process you are following: normal
How to reproduce it: when a trial is retried on openpai
nni Environment:
Anything else we need to know:
Related to #863 and #865.
Beta Was this translation helpful? Give feedback.
All reactions