NO-JIRA: Exclude KubeJobFailed for periodic-gathering jobs in openshift-insights#30810
Conversation
|
Pipeline controller notification For optional jobs, comment This repository is configured in: automatic mode |
WalkthroughA Prometheus test in Changes
Estimated code review effort🎯 2 (Simple) | ⏱️ ~10 minutes 🚥 Pre-merge checks | ✅ 4 | ❌ 1❌ Failed checks (1 warning)
✅ Passed checks (4 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing Touches🧪 Generate unit tests (beta)
Comment |
|
/hold Should not be needed anymore, just wanted to have a record |
|
Scheduling required tests: |
|
/lgtm |
The periodic-gathering jobs in the openshift-insights namespace can transiently fail, causing KubeJobFailed alerts that make the "shouldn't report any alerts in firing state" e2e test fail. Use a PromQL `unless` clause to exclude specifically KubeJobFailed alerts from openshift-insights namespace where job_name matches periodic-gathering-*, while still catching any other unexpected alerts. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
da20cdd to
18f455a
Compare
|
@petr-muller: This pull request explicitly references no jira issue. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/lgtm |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: neisw, petr-muller The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
/hold cancel |
|
Scheduling required tests: |
|
No signs of trouble in the jobs /verified by CI |
|
@petr-muller: This PR has been marked as verified by DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
Job Failure Risk Analysis for sha: 18f455a
|
|
/override ci/prow/e2e-aws-ovn-fips The job actually failed due to TRT-2560 |
|
@petr-muller: Overrode contexts on behalf of petr-muller: ci/prow/e2e-aws-ovn-fips DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
|
/override ci/prow/e2e-gcp-ovn-upgrade ci/prow/e2e-metal-ipi-ovn-ipv6 |
|
@petr-muller: Overrode contexts on behalf of petr-muller: ci/prow/e2e-gcp-ovn-upgrade, ci/prow/e2e-metal-ipi-ovn-ipv6 DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
|
@petr-muller: all tests passed! Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
Mitigates impact of OCPBUGS-77314 / CCXDEV-16087.
Summary
periodic-gatheringjobs in theopenshift-insightsnamespace can fail because of the cluster-external API failures, causingKubeJobFailedalerts that make the "shouldn't report any alerts in firing state" e2e test fail.unlessclause to exclude specificallyKubeJobFailedalerts fromopenshift-insightsnamespace wherejob_namematchesperiodic-gathering-.*, while still catching any other unexpected alerts.🤖 Generated with Claude Code
Summary by CodeRabbit