Update disable_reason when evacuation fails and other fixes #339

Open, wants to merge 1 commit into main

lmiccini
Contributor

This commit brings the following changes:

  1. When an evacuation fails, we update the disable_log_reason field so users can immediately see the reason. This also lets us filter out these computes so we do not try to evacuate them again.

  2. We now filter out evacuations that are 'failed' when deciding whether a compute can be re-enabled after it has been fenced, to spare users the additional burden of manually re-enabling hypervisors when it is not necessary.
    If necessary, the LEAVE_DISABLED option can be used to keep a host down.

  3. We wrap the initial call to the Nova API in try/except so the logs show evidence when Nova is not available.

  4. We added a FORCE_ENABLE parameter. If true, any failed evacuation is ignored and computes are enabled after evacuation.
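
For reviewers, here is a minimal sketch of how items 1 to 3 could fit together. It is not the code in this commit: services and evacuations are plain dicts, and the `FAILED_MARKER` string, the `disable_reason` key, the function names, and the `done`/`completed` status values are all assumptions made for illustration. FORCE_ENABLE is left out because the description does not spell out how it interacts with LEAVE_DISABLED.

```python
import logging

log = logging.getLogger("evacuation-sketch")

# Hypothetical marker written into the disable reason; not taken from the commit.
FAILED_MARKER = "evacuation FAILED"


def mark_evacuation_failed(service, detail):
    """Item 1: record the failure in the service's disable reason."""
    service["disable_reason"] = f"{FAILED_MARKER}: {detail}"
    return service


def hosts_to_evacuate(services):
    """Item 1: skip computes whose disable reason already records a failure."""
    return [
        s["host"]
        for s in services
        if FAILED_MARKER not in (s.get("disable_reason") or "")
    ]


def can_reenable(evacuations, leave_disabled=False):
    """Item 2: ignore 'failed' evacuations when deciding whether to re-enable."""
    if leave_disabled:
        # LEAVE_DISABLED keeps the host down regardless of evacuation state.
        return False
    remaining = [e for e in evacuations if e["status"] != "failed"]
    # Assumed terminal statuses; anything else counts as still in progress.
    return all(e["status"] in ("done", "completed") for e in remaining)


def fetch_services(list_services):
    """Item 3: wrap the first API call so an outage leaves evidence in the logs."""
    try:
        return list_services()
    except Exception as exc:
        log.error("Nova API not reachable: %s", exc)
        return None
```

`list_services` is any zero-argument callable, so the wrapper can be exercised without a running Nova endpoint.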

Jira: https://issues.redhat.com/browse/OSPRH-11279

@openshift-ci openshift-ci bot requested review from lewisdenny and viroel January 24, 2025 07:57
Contributor

openshift-ci bot commented Jan 24, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: lmiccini
Once this PR has been reviewed and has the lgtm label, please assign stuggi for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@lmiccini
Contributor Author

/test functional

@stuggi
Contributor

stuggi commented Jan 24, 2025

/test functional

@stuggi stuggi requested a review from dciabrin February 10, 2025 10:37