Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DAOS-16477 mgmt: return suspect engines for pool healthy query #15196

Merged
merged 16 commits into from
Nov 1, 2024

Commits on Oct 8, 2024

  1. DAOS-16477 mgmt: return suspect engines for pool healthy query

    After significant failures, the system may leave behind some suspect
    engines that were marked as DEAD by the SWIM protocol, but were not
    excluded from the system to prevent data loss. An administrator
    can bring these ranks back online by restarting them.
    
    This PR aims to provide an administrative interface for querying
    suspect engines following a massive failure. These suspect engines
    can be retrieved using the daos/dmg --health-only command.
    
    An example of output of dmg pool query --health-only:
    
    Pool 6f450a68-8c7d-4da9-8900-02691650f6a2, ntarget=8, disabled=2, leader=3, version=4, state=Degraded
    Pool health info:
    - Disabled ranks: 1
    - Suspect ranks: 2
    - Rebuild busy, 0 objs, 0 recs
    
    Required-githooks: true
    Signed-off-by: Wang Shilong <shilong.wang@intel.com>
    wangshilong committed Oct 8, 2024
    Configuration menu
    Copy the full SHA
    79cfb27 View commit details
    Browse the repository at this point in the history

Commits on Oct 10, 2024

  1. control plane improvements

    Required-githooks: true
    
    Signed-off-by: Wang Shilong <shilong.wang@intel.com>
    wangshilong committed Oct 10, 2024
    Configuration menu
    Copy the full SHA
    acac64f View commit details
    Browse the repository at this point in the history

Commits on Oct 11, 2024

  1. Merge branch 'master' of github.com:daos-stack/daos into shilongw/DAO…

    …S-16477
    
    Required-githooks: true
    wangshilong committed Oct 11, 2024
    Configuration menu
    Copy the full SHA
    8c4e30a View commit details
    Browse the repository at this point in the history
  2. extend ftest to cover new interface and doc improvments

    Test-tag: DmgPoolQueryRanks
    Required-githooks: true
    Signed-off-by: Wang Shilong <shilong.wang@intel.com>
    wangshilong committed Oct 11, 2024
    Configuration menu
    Copy the full SHA
    b76d811 View commit details
    Browse the repository at this point in the history
  3. Merge branch 'master' of github.com:daos-stack/daos into shilongw/DAO…

    …S-16477
    
    Required-githooks: true
    wangshilong committed Oct 11, 2024
    Configuration menu
    Copy the full SHA
    7545298 View commit details
    Browse the repository at this point in the history
  4. bump timeout

    Test-tag: DmgPoolQueryRanks
    Required-githooks: true
    
    Signed-off-by: Wang Shilong <shilong.wang@intel.com>
    wangshilong committed Oct 11, 2024
    Configuration menu
    Copy the full SHA
    7a2b72f View commit details
    Browse the repository at this point in the history

Commits on Oct 14, 2024

  1. collect more debug logs

    Test-tag: DmgPoolQueryRanks
    Required-githooks: true
    Signed-off-by: Wang Shilong <shilong.wang@intel.com>
    wangshilong committed Oct 14, 2024
    Configuration menu
    Copy the full SHA
    2048ef4 View commit details
    Browse the repository at this point in the history

Commits on Oct 15, 2024

  1. improve test according to feedbacks.

    Required-githooks: true
    Test-tag: DmgPoolQueryRanks
    Signed-off-by: Wang Shilong <shilong.wang@intel.com>
    wangshilong committed Oct 15, 2024
    Configuration menu
    Copy the full SHA
    1e9f4df View commit details
    Browse the repository at this point in the history
  2. Merge branch 'master' of github.com:daos-stack/daos into shilongw/DAO…

    …S-16477
    
    Test-tag: DmgPoolQueryRanks
    Required-githooks: true
    
    Signed-off-by: Wang Shilong <shilong.wang@intel.com>
    wangshilong committed Oct 15, 2024
    Configuration menu
    Copy the full SHA
    0508a79 View commit details
    Browse the repository at this point in the history
  3. Fix warning

    Test-tag: DmgPoolQueryRanks
    Required-githooks: true
    
    Signed-off-by: Wang Shilong <shilong.wang@intel.com>
    wangshilong committed Oct 15, 2024
    Configuration menu
    Copy the full SHA
    9ff31a9 View commit details
    Browse the repository at this point in the history
  4. fix format

    Test-tag: DmgPoolQueryRanks
    Required-githooks: true
    Signed-off-by: Wang Shilong <shilong.wang@intel.com>
    wangshilong committed Oct 15, 2024
    Configuration menu
    Copy the full SHA
    dce3a51 View commit details
    Browse the repository at this point in the history

Commits on Oct 16, 2024

  1. Merge branch 'master' of github.com:daos-stack/daos into shilongw/DAO…

    …S-16477
    
    Features: DmgPoolQueryRanks
    
    Required-githooks: true
    
    Signed-off-by: Wang Shilong <shilong.wang@intel.com>
    wangshilong committed Oct 16, 2024
    Configuration menu
    Copy the full SHA
    ef6221a View commit details
    Browse the repository at this point in the history

Commits on Oct 18, 2024

  1. Merge branch 'master' of github.com:daos-stack/daos into shilongw/DAO…

    …S-16477
    
    Features: DmgPoolQueryRanks
    
    Signed-off-by: Wang Shilong <shilong.wang@intel.com>
    wangshilong committed Oct 18, 2024
    Configuration menu
    Copy the full SHA
    aa050fe View commit details
    Browse the repository at this point in the history

Commits on Oct 21, 2024

  1. test

    Features: DmgPoolQueryRanks
    Skip-nlt: true
    Required-githooks: true
    
    Signed-off-by: Wang Shilong <shilong.wang@intel.com>
    wangshilong committed Oct 21, 2024
    Configuration menu
    Copy the full SHA
    c944603 View commit details
    Browse the repository at this point in the history

Commits on Oct 24, 2024

  1. Configuration menu
    Copy the full SHA
    8bfee5d View commit details
    Browse the repository at this point in the history
  2. codes improvements

    Features: DmgPoolQueryRanks
    Skip-nlt: true
    Signed-off-by: Wang Shilong <shilong.wang@intel.com>
    Required-githooks: true
    wangshilong committed Oct 24, 2024
    Configuration menu
    Copy the full SHA
    3166a87 View commit details
    Browse the repository at this point in the history