Skip to content

[release-4.19] Backport Mellanox firmware reset defaults#1233

Open
SchSeba wants to merge 3 commits into
openshift:release-4.19from
SchSeba:cp-release-4.19-20260601
Open

[release-4.19] Backport Mellanox firmware reset defaults#1233
SchSeba wants to merge 3 commits into
openshift:release-4.19from
SchSeba:cp-release-4.19-20260601

Conversation

@SchSeba

@SchSeba SchSeba commented Jun 1, 2026

Copy link
Copy Markdown
Contributor

Summary

  • install kmod in the downstream config daemon image so mstfwreset support works with newer mstflint
  • enable MellanoxFirmwareResetFeatureGate by default on release-4.19
  • adapt the older release-4.19 feature-gate implementation so the new default behavior works with the APIs available on that branch

Test plan

  • go test ./pkg/featuregate ./pkg/plugins/mellanox
  • Verify cherry-picks apply cleanly on release-4.19
  • Validate the downstream image still builds successfully

Made with Cursor

SchSeba added 2 commits June 1, 2026 11:54
this is needed when the version of mstfwreset will be >= v4.32

Signed-off-by: Sebastian Sch <sebassch@gmail.com>
This feature gate is used to enable the Mellanox firmware reset feature.

The reason we enable this feature gate is because on newer mstflint version
mstconfig command sends MFRL with bit 6 AND bit 3 ((1<<6)|(1<<3)).
some cards like ConnectX-6 Dx firmware does not support the bit 3 (PCIe link toggle) reset trigger for this operation,
so the MFRL SET command fails (rc != 0).
The code returns "Please power cycle machine to load new configurations."

The operator does not have access to out of band management system to make a power cycle.
so instead we run mstfwreset command to reset the firmware.

Signed-off-by: Sebastian Sch <sebassch@gmail.com>
(cherry picked from commit 518df93)
@openshift-ci openshift-ci Bot requested review from dougbtv and kunalmemane June 1, 2026 11:58
@openshift-ci

openshift-ci Bot commented Jun 1, 2026

Copy link
Copy Markdown
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: SchSeba

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci Bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jun 1, 2026
Limit the default feature-state backport to feature gates that exist on release-4.19 so the branch builds and the Mellanox reset default remains enabled.

Signed-off-by: Sebastian Sch <sebassch@gmail.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
@SchSeba SchSeba force-pushed the cp-release-4.19-20260601 branch from f0a1e5d to a1270ed Compare June 9, 2026 11:47
@openshift-ci

openshift-ci Bot commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

@SchSeba: all tests passed!

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant