Skip to content

OCPBUGS-77261: Increase timeout for waiting control plane#10370

Merged
openshift-merge-bot[bot] merged 1 commit intoopenshift:mainfrom
mhanss:fix-timeout
Mar 11, 2026
Merged

OCPBUGS-77261: Increase timeout for waiting control plane#10370
openshift-merge-bot[bot] merged 1 commit intoopenshift:mainfrom
mhanss:fix-timeout

Conversation

@mhanss
Copy link
Contributor

@mhanss mhanss commented Mar 9, 2026

This PR fixes an issue with cluster-managed networking on real bare-metal hardware, where the system may take longer to configure networking. With the original 60-minute timeout, control plane nodes may fail to join the bootstrap node, leading to installation failures. This issue does not occur with user-managed networking because the networking is already configured by the user, making the 60-minute timeout sufficient.

Summary by CodeRabbit

  • Chores
    • Updated timeout configuration for the control plane initialization stage to enhance system reliability.

@openshift-ci-robot openshift-ci-robot added jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. jira/invalid-bug Indicates that a referenced Jira bug is invalid for the branch this PR is targeting. labels Mar 9, 2026
@openshift-ci-robot
Copy link
Contributor

@mhanss: This pull request references Jira Issue OCPBUGS-77261, which is invalid:

  • expected the bug to target the "4.22.0" version, but no target version was set

Comment /jira refresh to re-evaluate validity if changes to the Jira bug are made, or edit the title of this pull request to link to a different bug.

The bug has been updated to refer to the pull request using the external bug tracker.

Details

In response to this:

This PR fixes an issue with cluster-managed networking on real bare-metal hardware, where the system may take longer to configure networking. With the original 60-minute timeout, control plane nodes may fail to join the bootstrap node, leading to installation failures. This issue does not occur with user-managed networking because the networking is already configured by the user, making the 60-minute timeout sufficient.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@openshift-ci openshift-ci bot requested review from andfasano and bfournie March 9, 2026 13:58
@bfournie
Copy link
Contributor

bfournie commented Mar 9, 2026

/lgtm

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Mar 9, 2026
@mhanss
Copy link
Contributor Author

mhanss commented Mar 9, 2026

/jira refresh

@openshift-ci-robot openshift-ci-robot added jira/valid-bug Indicates that a referenced Jira bug is valid for the branch this PR is targeting. and removed jira/invalid-bug Indicates that a referenced Jira bug is invalid for the branch this PR is targeting. labels Mar 9, 2026
@openshift-ci-robot
Copy link
Contributor

@mhanss: This pull request references Jira Issue OCPBUGS-77261, which is valid. The bug has been moved to the POST state.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target version (4.22.0) matches configured target version for branch (4.22.0)
  • bug is in the state ASSIGNED, which is one of the valid states (NEW, ASSIGNED, POST)

Requesting review from QA contact:
/cc @mhanss

Details

In response to this:

/jira refresh

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Mar 9, 2026

@openshift-ci-robot: GitHub didn't allow me to request PR reviews from the following users: mhanss.

Note that only openshift members and repo collaborators can review this PR, and authors cannot review their own PRs.

Details

In response to this:

@mhanss: This pull request references Jira Issue OCPBUGS-77261, which is valid. The bug has been moved to the POST state.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target version (4.22.0) matches configured target version for branch (4.22.0)
  • bug is in the state ASSIGNED, which is one of the valid states (NEW, ASSIGNED, POST)

Requesting review from QA contact:
/cc @mhanss

In response to this:

/jira refresh

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@coderabbitai
Copy link

coderabbitai bot commented Mar 9, 2026

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info
⚙️ Run configuration

Configuration used: Repository: openshift/coderabbit/.coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 3a135e39-daf9-479a-bbf4-1983dbbfe7ec

📥 Commits

Reviewing files that changed from the base of the PR and between 4006847 and e1515d3.

📒 Files selected for processing (1)
  • data/data/agent/files/usr/local/share/assisted-service/assisted-service.env.template

Walkthrough

A new environment variable HOST_STAGE_WAITING_FOR_CONTROL_PLANE_TIMEOUT with a value of 90 minutes was added to the assisted-service configuration template.

Changes

Cohort / File(s) Summary
Configuration Addition
data/data/agent/files/usr/local/share/assisted-service/assisted-service.env.template
Added environment variable HOST_STAGE_WAITING_FOR_CONTROL_PLANE_TIMEOUT=90m for control plane timeout configuration.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~2 minutes

🚥 Pre-merge checks | ✅ 5
✅ Passed checks (5 passed)
Check name Status Explanation
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.
Title check ✅ Passed The title clearly describes the main change: increasing a timeout for waiting on the control plane, which matches the single environment variable addition shown in the changeset.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Stable And Deterministic Test Names ✅ Passed This custom check validates Ginkgo test names for stability, but the PR only modifies an environment variable template file, not test files.
Test Structure And Quality ✅ Passed PR only modifies a configuration file, not Ginkgo test code. Custom check for test quality is not applicable.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Post copyable unit tests in a comment

Tip

Try Coding Plans. Let us write the prompt for your AI agent so you can ship faster (with fewer bugs).
Share your feedback on Discord.


Comment @coderabbitai help to get the list of available commands and usage tips.

@openshift-ci-robot
Copy link
Contributor

@mhanss: This pull request references Jira Issue OCPBUGS-77261, which is valid.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target version (4.22.0) matches configured target version for branch (4.22.0)
  • bug is in the state POST, which is one of the valid states (NEW, ASSIGNED, POST)

Requesting review from QA contact:
/cc @mhanss

Details

In response to this:

This PR fixes an issue with cluster-managed networking on real bare-metal hardware, where the system may take longer to configure networking. With the original 60-minute timeout, control plane nodes may fail to join the bootstrap node, leading to installation failures. This issue does not occur with user-managed networking because the networking is already configured by the user, making the 60-minute timeout sufficient.

Summary by CodeRabbit

  • Chores
  • Updated timeout configuration for the control plane initialization stage to enhance system reliability.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Mar 9, 2026

@openshift-ci-robot: GitHub didn't allow me to request PR reviews from the following users: mhanss.

Note that only openshift members and repo collaborators can review this PR, and authors cannot review their own PRs.

Details

In response to this:

@mhanss: This pull request references Jira Issue OCPBUGS-77261, which is valid.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target version (4.22.0) matches configured target version for branch (4.22.0)
  • bug is in the state POST, which is one of the valid states (NEW, ASSIGNED, POST)

Requesting review from QA contact:
/cc @mhanss

In response to this:

This PR fixes an issue with cluster-managed networking on real bare-metal hardware, where the system may take longer to configure networking. With the original 60-minute timeout, control plane nodes may fail to join the bootstrap node, leading to installation failures. This issue does not occur with user-managed networking because the networking is already configured by the user, making the 60-minute timeout sufficient.

Summary by CodeRabbit

  • Chores
  • Updated timeout configuration for the control plane initialization stage to enhance system reliability.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@mhanss
Copy link
Contributor Author

mhanss commented Mar 10, 2026

/test okd-scos-images

@mhanss
Copy link
Contributor Author

mhanss commented Mar 10, 2026

/verified bypass

@openshift-ci-robot openshift-ci-robot added the verified Signifies that the PR passed pre-merge verification criteria label Mar 10, 2026
@openshift-ci-robot
Copy link
Contributor

@mhanss: The verified label has been added.

Details

In response to this:

/verified bypass

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@andfasano
Copy link
Contributor

/approve

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Mar 10, 2026

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: andfasano

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Mar 10, 2026
@mhanss
Copy link
Contributor Author

mhanss commented Mar 10, 2026

/cherry-pick release-4.21

@openshift-cherrypick-robot

@mhanss: once the present PR merges, I will cherry-pick it on top of release-4.21 in a new PR and assign it to you.

Details

In response to this:

/cherry-pick release-4.21

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@mhanss
Copy link
Contributor Author

mhanss commented Mar 10, 2026

/test okd-scos-images

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Mar 10, 2026

@mhanss: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/prow/e2e-agent-compact-ipv4-iso-no-registry e1515d3 link false /test e2e-agent-compact-ipv4-iso-no-registry
ci/prow/e2e-agent-two-node-fencing-ipv4 e1515d3 link false /test e2e-agent-two-node-fencing-ipv4

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@mhanss
Copy link
Contributor Author

mhanss commented Mar 11, 2026

/test okd-scos-images

@openshift-merge-bot openshift-merge-bot bot merged commit 112d506 into openshift:main Mar 11, 2026
26 of 28 checks passed
@openshift-ci-robot
Copy link
Contributor

@mhanss: Jira Issue Verification Checks: Jira Issue OCPBUGS-77261
✔️ This pull request was pre-merge verified.
✔️ All associated pull requests have merged.
✔️ All associated, merged pull requests were pre-merge verified.

Jira Issue OCPBUGS-77261 has been moved to the MODIFIED state and will move to the VERIFIED state when the change is available in an accepted nightly payload. 🕓

Details

In response to this:

This PR fixes an issue with cluster-managed networking on real bare-metal hardware, where the system may take longer to configure networking. With the original 60-minute timeout, control plane nodes may fail to join the bootstrap node, leading to installation failures. This issue does not occur with user-managed networking because the networking is already configured by the user, making the 60-minute timeout sufficient.

Summary by CodeRabbit

  • Chores
  • Updated timeout configuration for the control plane initialization stage to enhance system reliability.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@openshift-cherrypick-robot

@mhanss: new pull request created: #10383

Details

In response to this:

/cherry-pick release-4.21

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. jira/valid-bug Indicates that a referenced Jira bug is valid for the branch this PR is targeting. jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. lgtm Indicates that a PR is ready to be merged. verified Signifies that the PR passed pre-merge verification criteria

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants