OCPBUGS-76961: Extends the time for the extractor liveness probe#1231
Conversation
|
@jmesnil: This pull request references Jira Issue OCPBUGS-66996, which is invalid:
Comment The bug has been updated to refer to the pull request using the external bug tracker. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
Important Review skippedAuto reviews are limited based on label configuration. 🚫 Review skipped — only excluded labels are configured. (1)
Please check the settings in the CodeRabbit UI or the ⚙️ Run configurationConfiguration used: Organization UI Review profile: CHILL Plan: Pro Run ID: You can disable this status message by setting the Use the checkbox below for a quick retry:
✨ Finishing Touches🧪 Generate unit tests (beta)
📝 Coding Plan
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
|
Upstream is #1198 |
|
/retest |
Before: * `crictl info` was timing out after 2 seconds * the command was executed every 10 seconds * 2 failures were making the container unhealthy * => 10 seconds of unavailability was making the pod crash This was too constraining as there are occasions where `crictl` can be unavailable for a longer period of time (eg when the TLS CA bundle requires to restart some pods) Now: * `crictl info` is timing out after 10 seconds * the command is executed every 30 seconds * 3 failures (default) are making the container unhealthy * => 1m30s of unaivailability is making the pod crash Note: The liveness probe is used instead of the readiness probe as the container MUST crash if the crictl connection has been changed (eg following a TLS CA bundle update) and at this point, the pod must be recreated to be able to connect to cri-o socket with an updated TLS certificate. This fixes https://issues.redhat.com/browse/OCPBUGS-76961 Upstream issue is https://issues.redhat.com/browse/OCPBUGS-66996 Signed-off-by: Jeff Mesnil <jmesnil@redhat.com>
5186e84 to
63dbf6f
Compare
|
/jira refresh |
|
@jmesnil: This pull request references Jira Issue OCPBUGS-66996, which is invalid:
Comment DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/retest |
|
/override ci/prow/lint |
|
/label cherry-pick-approved |
|
@BaiyangZhou: Overrode contexts on behalf of BaiyangZhou: ci/prow/lint DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
|
/verified bypass |
|
@BaiyangZhou: The DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/jira refresh |
|
@opokornyy: This pull request references Jira Issue OCPBUGS-66996, which is invalid:
Comment DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/title OCPBUGS-76961: Extends the time for the extractor liveness probe |
|
/retitle OCPBUGS-76961: Extends the time for the extractor liveness probe |
|
/jira refresh |
|
/label backport-risk-assessed |
|
@jmesnil: An error was encountered querying GitHub for users with public email (jmesnil@redhat.com) for bug OCPBUGS-76961 on the Jira server at https://redhat.atlassian.net. No known errors were detected, please see the full error message for details. Full error message.
Post "http://ghproxy/graphql": dial tcp: lookup ghproxy on 172.30.0.10:53: no such host
Please contact an administrator to resolve this issue, then request a bug refresh with DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
The bug has been updated to refer to the pull request using the external bug tracker. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
@ncaak: This pull request references Jira Issue OCPBUGS-76961, which is valid. 7 validation(s) were run on this bug
No GitHub users were found matching the public email listed for the QA contact in Jira (jmesnil@redhat.com), skipping review request. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/approve |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: jmesnil, ncaak The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
/retest |
|
@jmesnil: all tests passed! Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
385e38c
into
openshift:release-4.20
|
@jmesnil: Jira Issue Verification Checks: Jira Issue OCPBUGS-76961 Jira Issue OCPBUGS-76961 has been moved to the MODIFIED state and will move to the VERIFIED state when the change is available in an accepted nightly payload. 🕓 DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
Categories
Before:
crictl infowas timing out after 2 secondsThis was too constraining as there are occasions where
crictlcan be unavailable for a longer period of time (eg when the TLS CA bundle requires to restart some pods)Now:
crictl infois timing out after 10 secondsNote: The liveness probe is used instead of the readiness probe as the container MUST crash if the crictl connection has been changed (eg following a TLS CA bundle update) and at this point, the pod must be recreated to be able to connect to cri-o socket with an updated TLS certificate.
This fixes https://issues.redhat.com/browse/OCPBUGS-76961
Upstream issue is https://issues.redhat.com/browse/OCPBUGS-66996