Log Ceph health and do not fail on HEALTH_WARN #2611
Thanks for the PR! ❤️ |
For the second part of this patch I removed the failure when health is only in WARN. What do you think of that? If we know the cluster can survive an upgrade while in WARN, then I think it makes sense to keep that change; if we know it can't, I'll take it out. Either way, I thought we should get at least one run of the job with both changes so we can start debugging the problem.
What is weird is that I didn't expect that job to perform a ceph minor update. It was added in |
/lgtm
The Ceph upgrade tasks in the cifmw_cephadm role fail before the upgrade starts if the health status is HEALTH_WARN or HEALTH_ERR. This patch changes that so the upgrade fails only if the cluster is in HEALTH_ERR. The job has failed in CI, but we do not know why. The task should also log the Ceph health before starting the upgrade so that CI results give the job owner more insight into why the job failed.

Signed-off-by: John Fulton <fulton@redhat.com>
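For illustration only, the behaviour described in the commit message could be sketched in Python as below. This is not the role's actual Ansible code: the function name `check_health_before_upgrade`, the exception class `CephHealthError`, and the logger name are hypothetical, and the input is assumed to be the JSON printed by `ceph health --format json`, with a top-level `status` field and an optional `checks` map.

```python
import json
import logging

log = logging.getLogger("ceph_pre_upgrade")  # hypothetical logger name


class CephHealthError(RuntimeError):
    """Raised when the reported cluster health blocks the upgrade."""


def check_health_before_upgrade(health_json: str) -> str:
    """Log the reported health and fail only on HEALTH_ERR.

    Assumption: `health_json` is the output of `ceph health --format json`.
    """
    health = json.loads(health_json)
    status = health.get("status", "UNKNOWN")

    # Always log the health first, so a CI run shows what the cluster
    # looked like even if the upgrade is refused afterwards.
    log.info("Ceph health before upgrade: %s", status)
    for name, check in health.get("checks", {}).items():
        message = check.get("summary", {}).get("message", "")
        log.info("  %s: %s", name, message)

    # HEALTH_WARN no longer blocks the upgrade; only HEALTH_ERR does.
    if status == "HEALTH_ERR":
        raise CephHealthError(f"refusing to upgrade: cluster is in {status}")
    return status
```

In this sketch a status other than `HEALTH_ERR` (including a missing one) lets the upgrade proceed, which mirrors the "only fail on error" wording above; a stricter variant could reject unknown statuses as well.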
per our discussion we now know that |
Yeah, it's already |
/lgtm
/approve |
/lgtm |
/approve |
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: fmount, pablintino
Merged commit 17a08ad into openstack-k8s-operators:main