DAOS-18239 test: DO NOT LAND backport discard retry instrumentation on old base#18459
Draft
kccain wants to merge 2 commits into
Draft
DAOS-18239 test: DO NOT LAND backport discard retry instrumentation on old base#18459kccain wants to merge 2 commits into
kccain wants to merge 2 commits into
Conversation
Choose ps leader engine rank in dmg pool exclude command while testing test_osa_online_reintegration_with_multiple_ranks, to see if that case may have an impact on a pool_discard() hang on that engine. Test based on older master commit 0ff9ca7 where pool_discard() hang was originally observed. Test-tag: OSAOnlineReintegration,test_osa_online_reintegration_with_multiple_ranks Test-Repeat: 5 Skip-unit-tests: true Skip-fault-injection-test: true Skip-test-rpms: true Test-provider-hw-medium: ofi+tcp Signed-off-by: Kenneth Cain <kenneth.cain@hpe.com>
|
Ticket title is 'osa/online_reintegration.py:OSAOnlineReintegration.test_osa_online_reintegration_with_multiple_ranks - dmg: rank 5 failed on pool TestPool_1' |
…n old base Choose ps leader engine rank in dmg pool exclude command while testing test_osa_online_reintegration_with_multiple_ranks, to see if that case may have an impact on a pool_discard() hang on that engine. And instrument cont_discard_cb() for any retries e.g., that might continuously get -DER_INPROGRESS (resulting in overall hang), as seen in original observation. Test based on older master commit 0ff9ca7 where pool_discard() hang was originally observed. Test-tag: OSAOnlineReintegration,test_osa_online_reintegration_with_multiple_ranks Test-Repeat: 10 Skip-unit-tests: true Skip-fault-injection-test: true Skip-test-rpms: true Test-provider-hw-medium: ofi+tcp Signed-off-by: Kenneth Cain <kenneth.cain@hpe.com>
5c39b3d to
c8204c5
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Choose ps leader engine rank in dmg pool exclude command while testing
test_osa_online_reintegration_with_multiple_ranks, to see if that
case may have an impact on a pool_discard() hang on that engine.
And instrument cont_discard_cb() for any retries e.g., that might
continuously get -DER_INPROGRESS (resulting in overall hang),
as seen in original observation.
Test based on older master commit 0ff9ca7 where pool_discard() hang
was originally observed.
Test-tag: OSAOnlineReintegration,test_osa_online_reintegration_with_multiple_ranks
Test-Repeat: 10
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-test-rpms: true
Test-provider-hw-medium: ofi+tcp
Steps for the author:
After all prior steps are complete: