-
Notifications
You must be signed in to change notification settings - Fork 301
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DAOS-16127 tools: Add daos health check command (#14730) #14885
Conversation
Perform basic system health checks from the client perspective. Checks the following: * Client/Server versions * Key library versions and paths * Connected sytem information * Pool status for all pools to which the user has access * Container status for all containers in the checked pools Change-Id: I9154ee7f3632996e0e67ad6f320874e1df2e0d23 Signed-off-by: Michael MacDonald <mjmac@google.com>
The commit landed for DAOS-16127 resulted in some differences in JSON output for pool query. Several tests were written in such a way that the code expected the (en|dis)abled_ranks keys to always be set, even if those arrays were NULL. This isn't very idiomatic and is awkward to work with. The test code has been updated to instead use the get() operator which will return None if the response dict does not have the requested key. Also fixes a problem reported in DAOS-16283, where the JSON output of `dmg pool query` differed from the JSON output of `daos pool query` because it didn't include the usage array. Features: pool control Required-githooks: true Change-Id: I4b69ed55ce6df8b3122573b4c7df8f2118a57d1b Signed-off-by: Michael MacDonald <mjmac@google.com>
Ticket title is 'Add daos health check command' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ftest LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add clean/unclean cherry pick label. Thanks
Apparently these are needed on Ubuntu. Features: control pool Required-githooks: true Change-Id: Ieb0446760f0b53e2f09feeae0226ea26dd455d58 Signed-off-by: Michael MacDonald <mjmac@google.com>
478a60d
to
326d15f
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ftest LGTM. Thanks!
Test stage Functional Hardware Medium Verbs Provider completed with status UNSTABLE. https://build.hpdd.intel.com/job/daos-stack/job/daos//view/change-requests/job/PR-14885/7/testReport/ |
Test stage Functional Hardware Medium UCX Provider completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14885/7/execution/node/1524/log |
Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14885/7/execution/node/1616/log |
Test failure appears to be an instance of DAOS-16035. The other failures are the usual UCX/MD-on-SSD failures that happen when I forget to deselect those stages. :/ |
@tanabarr: Mind giving this a +1 so we can get it landed? TIA |
Perform basic system health checks from the client
perspective. Checks the following:
has access
checked pools
Signed-off-by: Michael MacDonald mjmac@google.com