-
Notifications
You must be signed in to change notification settings - Fork 1.5k
Target GA for recover from Volume Expansion failure feature #5353
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
c94f4b5
to
bd6bae1
Compare
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: gnufied The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
@@ -22,11 +22,12 @@ replaces: | |||
superseded-by: | |||
|
|||
latest-milestone: "v1.32" | |||
stage: "alpha" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(starting a new "thread") I would also expect https://github.com/kubernetes/enhancements/tree/master/keps/sig-storage/1790-recover-resize-failure#graduation-criteria having GA graduation criteria.
@@ -22,11 +22,12 @@ replaces: | |||
superseded-by: | |||
|
|||
latest-milestone: "v1.32" | |||
stage: "alpha" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(starting another unrelated "thread")
Should we mention the new annotation added in kubernetes/kubernetes#131907 ? The enhancement itself does not aim at RWX volumes, still, an API (annotation is an API) deserves some note somewhere. I guess that can be solved in the annotation "API review".
Shadowing @deads2k on this. I had one item that jumped at me reading the PRR. Was this metric added?
|
It would be worth expanding on https://github.com/kubernetes/enhancements/blob/d52c7d43a67a3f337e9c8a1869b820e8204af5dd/keps/sig-storage/1790-recover-resize-failure/README.md#scalability as part of the GA graduation. The answers were a bit vague so we should revisit that section and comment on the scalability questions with more information. |
No, we didn't add the metric. I was just discussing with storage folks and while the idea sounds nice on paper, there is very little practical benefit from this particular metric. From operational point of view, I can't see a reason why will an admin be interested in this metric. There are already other resizing related metrics, which work well enough. The metric item was tentative, so not sure if blocker for GA. |
I am adding some more information for GA graduation criteria but scalability requirements as such has not changed between beta and GA. Which bits you found vague? I answered couple of questions with - "Potentially Yes", because answer depends on whether a recovery was attempted. |
It would be worth cleaning up the KEP then. I think you mention using that metric in a few places in the PRR. |
The potentially yes jumped out at me because it sounded like we have not yet investigated if this would occur. Maybe you can expand on what you mean by recovery. |
What is the alternative method we recommend to a cluster-admin to determine when resize operations are failing on a particular node and/or failing across all nodes. |
So there is Some of these metrics have evolved since KEP (This KEP is now 5 years old) was originally written, I will update the KEP with newer metric names. |
@@ -140,6 +142,17 @@ This will allow external-resizer to recover safely from node expansion failures | |||
|
|||
 | |||
|
|||
### Handling of RWX volumes that don't require node expansion |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@jsafrane section about annotation stuff.
I have reworded those, ptal. |
#1790