Check that the ceph osd df tree weight and size are equal and reflect the current osd size #10641

@yitzhak12 yitzhak12 commented Oct 9, 2024

Add a new check to the resize OSD tests:
Check that the WEIGHT and SIZE values in the ceph osd df tree output are equal and reflect the current OSD size.
See more details in the thread: https://ibm-systems-storage.slack.com/archives/C06DWD30QTZ/p1724321789366579?thread_ts=1724316882.105889&cid=C06DWD30QTZ.
One more note on the implementation in this PR: when getting the ceph osd df tree output, I used the regular (plain-text) format rather than the JSON format, because the JSON format lacks the WEIGHT and SIZE values.
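
For readers who want a feel for the check described above, here is a minimal sketch, not the code from this PR. It assumes the plain-text output has columns in the order ID, CLASS, WEIGHT, REWEIGHT, SIZE (a number followed by a unit), and that OSD rows end with a name such as osd.0; the sample output and the names `parse_osd_weight_and_size` and `weight_and_size_match` are hypothetical. It also assumes the common convention that the CRUSH weight is the device capacity expressed in TiB.

```python
import re

# Bytes per unit, used to normalize the SIZE column to TiB.
_UNIT_BYTES = {"B": 1, "KiB": 2**10, "MiB": 2**20,
               "GiB": 2**30, "TiB": 2**40, "PiB": 2**50}
_OSD_NAME = re.compile(r"^osd\.\d+$")

# Hypothetical sample; values are invented and the real layout may differ
# between Ceph releases.
SAMPLE_OUTPUT = """\
ID  CLASS  WEIGHT   REWEIGHT  SIZE   RAW USE  DATA     OMAP  META     AVAIL    %USE  VAR   PGS  STATUS  TYPE NAME
-1         6.00000         -  6 TiB  1.2 GiB  300 MiB  0 B   930 MiB  6.0 TiB  0.02  1.00    -          root default
 0  ssd    2.00000   1.00000  2 TiB  410 MiB  100 MiB  0 B   310 MiB  2.0 TiB  0.02  1.00  113  up      osd.0
 1  ssd    2.00000   1.00000  2 TiB  410 MiB  100 MiB  0 B   310 MiB  2.0 TiB  0.02  1.00  113  up      osd.1
 2  ssd    2.00000   1.00000  2 TiB  410 MiB  100 MiB  0 B   310 MiB  2.0 TiB  0.02  1.00  113  up      osd.2
"""


def _is_number(token):
    """Return True if the token parses as a float."""
    try:
        float(token)
    except ValueError:
        return False
    return True


def parse_osd_weight_and_size(output):
    """Map each OSD name to (weight, size in TiB) from plain-text output."""
    result = {}
    for line in output.splitlines():
        tokens = line.split()
        # Only rows whose last token is an OSD name are parsed; the header
        # and the root/host bucket rows are skipped.
        if len(tokens) < 6 or not _OSD_NAME.match(tokens[-1]):
            continue
        # The CLASS column may be empty, so find where WEIGHT starts.
        start = 1 if _is_number(tokens[1]) else 2
        weight = float(tokens[start])
        size_value = float(tokens[start + 2])
        size_unit = tokens[start + 3]
        size_tib = size_value * _UNIT_BYTES[size_unit] / 2**40
        result[tokens[-1]] = (weight, size_tib)
    return result


def weight_and_size_match(parsed, expected_tib, tolerance=0.01):
    """Check that every OSD's weight and size agree with the expected size."""
    return bool(parsed) and all(
        abs(weight - size_tib) <= tolerance
        and abs(size_tib - expected_tib) <= tolerance
        for weight, size_tib in parsed.values()
    )
```

A caller could run `weight_and_size_match(parse_osd_weight_and_size(output), expected_tib=2.0)` after the resize and treat a False result as the new size not being reflected yet. The tolerance is there because the printed values are rounded.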

@yitzhak12 yitzhak12 requested a review from a team as a code owner October 9, 2024 14:01
@pull-request-size pull-request-size bot added the size/M PR that changes 30-99 lines label Oct 9, 2024

@ocs-ci ocs-ci left a comment

PR validation on existing cluster

Cluster Name: ikave-vm416
Cluster Configuration:
PR Test Suite: tier1
PR Test Path: tests/functional/z_cluster/cluster_expansion/test_resize_osd.py
Additional Test Params:
OCP VERSION: 4.17
OCS VERSION: 4.17
tested against branch: master

Job UNSTABLE (some or all tests failed).


@ocs-ci ocs-ci left a comment

PR validation on existing cluster

Cluster Name: ikave-vm416
Cluster Configuration:
PR Test Suite: tier1
PR Test Path: tests/functional/z_cluster/cluster_expansion/test_resize_osd.py
Additional Test Params:
OCP VERSION: 4.17
OCS VERSION: 4.17
tested against branch: master

Job PASSED.

@yitzhak12 yitzhak12 added the Verified Mark when PR was verified and log provided label Oct 10, 2024
@@ -2929,3 +2929,94 @@ def get_csi_images_for_client_ocp_version(ocp_version=None):
    csi_ocp_version_images = csi_images.split(first_str)[1].split(last_str)[0]
    csi_ocp_version_images_urls = extract_image_urls(csi_ocp_version_images)
    return csi_ocp_version_images_urls


def parse_ceph_osd_df_tree_weight_and_size():

Contributor Author

The values are not the same, but changing the header value may work in our case. I will check it.

…sd size values are equal to the expected osd size

- Raise 'StorageSizeNotReflectedException' if the OSDs failed to restart

Signed-off-by: Itzhak Kave <[email protected]>
@yitzhak12 yitzhak12 force-pushed the check-size-and-wieght-post-resize-osd branch from ff84684 to 0c31ffc Compare October 14, 2024 15:46
@pull-request-size pull-request-size bot added size/L PR that changes 100-499 lines and removed size/M PR that changes 30-99 lines labels Oct 14, 2024

@ocs-ci ocs-ci left a comment

PR validation on existing cluster

Cluster Name: ikave-vm416
Cluster Configuration:
PR Test Suite: tier1
PR Test Path: tests/functional/z_cluster/cluster_expansion/test_resize_osd.py
Additional Test Params:
OCP VERSION: 4.17
OCS VERSION: 4.17
tested against branch: master

Job PASSED.

ebenahar previously approved these changes Oct 15, 2024
@openshift-ci openshift-ci bot added the lgtm label Oct 15, 2024

@ocs-ci ocs-ci left a comment

PR validation on existing cluster

Cluster Name: ikave-gcp417
Cluster Configuration: conf/deployment/gcp/ipi_1az_rhcos_3m_3w_shielded_machines.yaml
PR Test Suite: tier1
PR Test Path: tests/functional/z_cluster/cluster_expansion/test_resize_osd.py
Additional Test Params:
OCP VERSION: 4.17
OCS VERSION: 4.17
tested against branch: master

Job UNSTABLE (some or all tests failed).


@ocs-ci ocs-ci left a comment

PR validation on existing cluster

Cluster Name: ikave-gcp416
Cluster Configuration: conf/deployment/gcp/ipi_1az_rhcos_3m_3w_shielded_machines.yaml
PR Test Suite: tier1
PR Test Path: tests/functional/z_cluster/cluster_expansion/test_resize_osd.py
Additional Test Params:
OCP VERSION: 4.16
OCS VERSION: 4.16
tested against branch: master

Job UNSTABLE (some or all tests failed).

Signed-off-by: Itzhak Kave <[email protected]>

@ocs-ci ocs-ci left a comment

PR validation on existing cluster

Cluster Name: ikave-gcp416
Cluster Configuration: conf/deployment/gcp/ipi_1az_rhcos_3m_3w_shielded_machines.yaml
PR Test Suite: tier1
PR Test Path: tests/functional/z_cluster/cluster_expansion/test_resize_osd.py
Additional Test Params:
OCP VERSION: 4.16
OCS VERSION: 4.16
tested against branch: master

Job PASSED.

@@ -60,6 +60,7 @@ def check_resources_state_post_resize_osd(old_osd_pods, old_osd_pvcs, old_osd_pv
        old_osd_pvs (list): The old osd PV objects before resizing the osd

    Raises:
        StorageSizeNotReflectedException: If the OSD pods failed to restart
        ResourceWrongStatusException: If the following occurs:
Contributor

If this Raises entry is not used, can we remove it?

Contributor Author

Do you mean to remove the line:

  1. The OSD pods failed to reach the status Terminated or to be deleted

Contributor Author

fixed

Signed-off-by: Itzhak Kave <[email protected]>
@openshift-ci openshift-ci bot added the lgtm label Nov 7, 2024

openshift-ci bot commented Nov 11, 2024

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: ebenahar, prsurve, yitzhak12

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment


@ocs-ci ocs-ci left a comment

PR validation

Cluster Name:
Cluster Configuration:
PR Test Suite: tier1
PR Test Path: tests/functional/z_cluster/cluster_expansion/test_resize_osd.py
Additional Test Params:
OCP VERSION: 4.17
OCS VERSION: 4.17
tested against branch: master

Job PASSED.

@ebenahar ebenahar merged commit 5fe47d9 into red-hat-storage:master Nov 11, 2024
7 of 8 checks passed
Labels
lgtm size/L PR that changes 100-499 lines Squad/Brown Verified Mark when PR was verified and log provided