OCPBUGS-86554: Wait for operators after removing master machine#6091
OCPBUGS-86554: Wait for operators after removing master machine#6091sergiordlr wants to merge 1 commit into
Conversation
|
Pipeline controller notification For optional jobs, comment This repository is configured in: LGTM mode |
|
@sergiordlr: This pull request references Jira Issue OCPBUGS-86554, which is invalid:
Comment The bug has been updated to refer to the pull request using the external bug tracker. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
No actionable comments were generated in the recent review. 🎉 ℹ️ Recent review info⚙️ Run configurationConfiguration used: Repository: openshift/coderabbit/.coderabbit.yaml Review profile: CHILL Plan: Enterprise Run ID: 📒 Files selected for processing (1)
🚧 Files skipped from review as they are similar to previous changes (1)
WalkthroughThe Ginkgo spec is reorganized into nested disruptive and longduration contexts; the longduration "Bootimage upgrade stub ignition to spec 3" test now waits for cluster operators to stabilize via WaitForStableCluster(oc.AsAdmin(), "3m", "50m") after control plane machine recreation and logs success. ChangesControlPlaneMachineSet tests
Estimated code review effort🎯 3 (Moderate) | ⏱️ ~20 minutes Suggested labels
🚥 Pre-merge checks | ✅ 15✅ Passed checks (15 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing Touches🧪 Generate unit tests (beta)
Warning There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure. 🔧 golangci-lint (2.12.2)Command failed Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
|
/payload-job periodic-ci-openshift-machine-config-operator-release-5.0-periodics-e2e-aws-mco-disruptive-techpreview-3of3 |
|
@sergiordlr: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/2e7ec460-59d2-11f1-823e-6414ed7b54ca-0 |
|
@sergiordlr: An error was encountered searching for bug OCPBUGS-86554 on the Jira server at https://redhat.atlassian.net. No known errors were detected, please see the full error message for details. Full error message.
No response returned: Get "https://redhat.atlassian.net/rest/api/2/issue/OCPBUGS-86554": GET https://redhat.atlassian.net/rest/api/2/issue/OCPBUGS-86554 giving up after 5 attempt(s)
Please contact an administrator to resolve this issue, then request a bug refresh with DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
@sergiordlr: The referenced Jira(s) [OCPBUGS-86554] could not be located, all automatically applied jira labels will be removed. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/retitle OCPBUGS-86554: Wait for operators after removing master machine |
3814c82 to
398e0d3
Compare
|
/payload-job periodic-ci-openshift-machine-config-operator-release-5.0-periodics-e2e-aws-mco-disruptive-techpreview-3of3 |
|
@sergiordlr: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/a44edf80-59d3-11f1-9aab-20190993fc53-0 |
|
@sergiordlr: This pull request references Jira Issue OCPBUGS-86554, which is invalid:
Comment The bug has been updated to refer to the pull request using the external bug tracker. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/jira refresh |
|
@djoshy: This pull request references Jira Issue OCPBUGS-86554, which is valid. The bug has been moved to the POST state. 3 validation(s) were run on this bug
DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/retest |
|
/payload-job periodic-ci-openshift-machine-config-operator-release-5.0-periodics-e2e-aws-mco-disruptive-techpreview-3of3 |
|
@sergiordlr: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/e0ce64f0-5a67-11f1-9ec2-5a31d598467f-0 |
398e0d3 to
ef7e0be
Compare
|
/payload-job periodic-ci-openshift-machine-config-operator-release-5.0-periodics-e2e-aws-mco-disruptive-techpreview-3of3 |
|
@sergiordlr: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/1faef760-5aa5-11f1-98eb-c0fb1841307d-0 |
|
@sergiordlr: This pull request references Jira Issue OCPBUGS-86554, which is valid. 3 validation(s) were run on this bug
DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
Actionable comments posted: 0 |
|
/lgtm Thank you for the fix - seems sane to me, let's make sure the suites are green before merging. |
|
/payload-job periodic-ci-openshift-machine-config-operator-release-5.0-periodics-e2e-aws-mco-disruptive-techpreview-3of3 |
|
@sergiordlr: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/1fbd97c0-5b31-11f1-8b31-c7894e2a6a3e-0 |
|
/retest |
ef7e0be to
10cbf0c
Compare
|
New changes are detected. LGTM label has been removed. |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: djoshy, sergiordlr The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
The test is now in periodic-ci-openshift-machine-config-operator-release-4.23-periodics-e2e-aws-mco-fips-proxy-longduration-1of2 Nevertheless, it seems that the etcd operator is still updating when the test ends. 30s of stability is not enough, etcd started updating again after 40-50 seconds. We have modified the stability period to 3 minutes to make sure that it is really stable. |
…ster machine
- What I did
Wait for all cluster operators to be stable after re-creating a master machine in test case
[PolarionID:85467][OTP] ControlPlaneMachineSets. Bootimage upgrade stub ignition to spec 3Since now we wait for the operators to be idle, the duration of the test has greatly increased making the prow job timeout. Since we don't need signals for GA, it has been decided to move the test to the long duration suite.
- How to verify it
When the test ends, all operators should be stable.
Check the intervals in the execution
In this execution the test ends while the operators are still updating. It should not happen. The test should not end until all operators as stable:
https://sippy.dptools.openshift.org/sippy-ng/job_runs/2058601931412082688/periodic-ci-openshift-machine-config-operator-release-5.0-periodics-e2e-aws-mco-disruptive-techpreview-3of3/intervals?end=2026-05-24T22%3A10%3A34Z&filterText=&intervalFile=e2e-timelines_spyglass_20260524-183519.json&overrideDisplayFlag=0&selectedSources=OperatorAvailable&selectedSources=OperatorProgressing&selectedSources=Alert&selectedSources=Disruption&selectedSources=E2EFailed&selectedSources=E2EPassed&start=2026-05-24T21%3A01%3A30Z
Summary by CodeRabbit