
kubelet: Handle UID reuse in pod worker #104847

Merged
merged 2 commits into kubernetes:master on Sep 16, 2021

Conversation

smarterclayton
Contributor

@smarterclayton smarterclayton commented Sep 8, 2021

If a pod is killed (no longer wanted) and a subsequent create/add/update event is then seen in the pod worker, assume that the pod UID was reused (as it can be for static pods). Have the next SyncKnownPods after the pod terminates remove the worker history so that the config loop can restart the static pod, and return to the caller the fact that this termination was not final.

  1. If the pod worker sees the update sequence KILL -> CREATE, it sets a flag on the worker status, restartRequested = true
  2. When SyncKnownPods runs, it reports any terminated worker with that flag as TemporarilyTerminatedWork
  3. The pod housekeeping loop (which reconciles the pod worker with the config state) checks during the sync which pods were terminated but may need a restart (TemporarilyTerminatedWork), and starts them if they are not terminal (i.e. still desired and admitted)

A pod that restarts this way will wait at most one housekeeping loop period (2s) between being terminated and starting again; a rough sketch of this flow follows.
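To make the flow above concrete, here is a minimal, self-contained sketch in Go. Only restartRequested, SyncKnownPods, and TemporarilyTerminatedWork come from the description; podSyncStatus, UpdateType, PodWorkType, and the desired-pods argument are simplified stand-ins for illustration and do not reproduce the actual pkg/kubelet/pod_workers.go types.

```go
package main

import "fmt"

// UID stands in for k8s.io/apimachinery/pkg/types.UID.
type UID string

// UpdateType is a simplified stand-in for the pod worker's update kinds.
type UpdateType int

const (
	UpdateTypeCreate UpdateType = iota
	UpdateTypeUpdate
	UpdateTypeKill
)

// PodWorkType mirrors the per-UID state that SyncKnownPods reports in this sketch.
type PodWorkType int

const (
	SyncPodWork PodWorkType = iota
	TerminatedPodWork
	// TemporarilyTerminatedWork marks a worker that finished terminating but whose
	// UID was reused, so the housekeeping loop should start the pod again.
	TemporarilyTerminatedWork
)

// podSyncStatus is a simplified view of the per-UID record the pod worker keeps.
type podSyncStatus struct {
	terminating      bool // a kill was requested
	finished         bool // the worker has fully terminated
	restartRequested bool // a create/update arrived after a kill: the UID was reused
}

// podWorkers tracks one status record per pod UID.
type podWorkers struct {
	podSyncStatuses map[UID]*podSyncStatus
}

// UpdatePod records an incoming update. If a create/update is observed for a UID
// that is already being killed, the worker assumes the UID was reused (as happens
// with static pods) and remembers that a restart is wanted.
func (p *podWorkers) UpdatePod(uid UID, update UpdateType) {
	status, ok := p.podSyncStatuses[uid]
	if !ok {
		status = &podSyncStatus{}
		p.podSyncStatuses[uid] = status
	}
	switch update {
	case UpdateTypeKill:
		status.terminating = true
	case UpdateTypeCreate, UpdateTypeUpdate:
		if status.terminating && !status.finished {
			status.restartRequested = true // KILL -> CREATE for the same UID
		}
	}
}

// SyncKnownPods prunes workers for pods that are no longer desired and reports the
// state of the rest. Workers that finished terminating with restartRequested set
// are reported as TemporarilyTerminatedWork and their history is dropped so the
// config loop can recreate the pod.
func (p *podWorkers) SyncKnownPods(desired map[UID]bool) map[UID]PodWorkType {
	result := make(map[UID]PodWorkType)
	for uid, status := range p.podSyncStatuses {
		switch {
		case status.finished && status.restartRequested:
			result[uid] = TemporarilyTerminatedWork
			delete(p.podSyncStatuses, uid) // forget the old incarnation
		case status.finished:
			result[uid] = TerminatedPodWork
			if !desired[uid] {
				delete(p.podSyncStatuses, uid)
			}
		default:
			result[uid] = SyncPodWork
		}
	}
	return result
}

func main() {
	pw := &podWorkers{podSyncStatuses: map[UID]*podSyncStatus{}}
	pw.UpdatePod("static-pod-uid", UpdateTypeCreate)
	pw.UpdatePod("static-pod-uid", UpdateTypeKill)   // manifest deleted
	pw.UpdatePod("static-pod-uid", UpdateTypeCreate) // manifest recreated with the same UID
	pw.podSyncStatuses["static-pod-uid"].finished = true // pretend termination completed
	fmt.Println(pw.SyncKnownPods(map[UID]bool{"static-pod-uid": true}))
	// prints map[static-pod-uid:2], i.e. TemporarilyTerminatedWork
}
```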

/kind bug
/sig node

Fixes #104648

TODO:

  • verifying this fixes the race
Fix a 1.22 regression where, if a static pod file is deleted and recreated while using a fixed UID, the pod was not properly restarted.

@k8s-ci-robot k8s-ci-robot added release-note Denotes a PR that will be considered when it comes time to generate release notes. kind/bug Categorizes issue or PR as related to a bug. size/S Denotes a PR that changes 10-29 lines, ignoring generated files. sig/node Categorizes an issue or PR as relevant to SIG Node. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. needs-priority Indicates a PR lacks a `priority/foo` label and requires one. labels Sep 8, 2021
@k8s-ci-robot k8s-ci-robot added area/kubelet approved Indicates a PR has been approved by an approver from all required OWNERS files. labels Sep 8, 2021
@smarterclayton smarterclayton changed the title kubelet: Handle UID reuse in pod worker WIP: kubelet: Handle UID reuse in pod worker Sep 8, 2021
@k8s-ci-robot k8s-ci-robot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Sep 8, 2021
@smarterclayton
Contributor Author

smarterclayton commented Sep 8, 2021

In theory this fixes the problem but you have to wait for a reconcile loop of ~90s. A slightly more complex impl might queue the next pod. I'll take a look at that.

Ideally SyncKnownPods would be able to restart workers, but SyncKnownPods doesn't get passed "the pods the pod worker should know about" today. It needs to be "the admitted pods that should be running" which is a subset of what is in the pod manager.

@ehashman ehashman added this to Waiting on Author in SIG Node PR Triage Sep 9, 2021
@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/S Denotes a PR that changes 10-29 lines, ignoring generated files. labels Sep 9, 2021
@smarterclayton
Contributor Author

Ok, after thinking through this some more I'm fairly convinced this is safe. Builds on top of #104817 (which renames a bit of the admission logic):

  1. If the pod worker sees the update sequence KILL -> CREATE, it sets a flag on the worker status, restartRequested = true
  2. When SyncKnownPods runs, it reports any terminated worker with that flag as TemporarilyTerminatedWork
  3. The pod housekeeping loop (which reconciles the pod worker with the config state) checks during the sync which pods were terminated but may need a restart, and starts them if they are not terminal (i.e. still desired and admitted)

A pod that restarts this way will wait at most one housekeeping loop period (2s) between being terminated and starting again; the housekeeping side of step 3 is sketched below.
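A rough sketch of that housekeeping step, continuing the earlier sketch. The housekeeping function, the trimmed Pod type, and the admit callback are hypothetical stand-ins for the kubelet's cleanup and admission machinery, not its real signatures; they only illustrate the "restart if still desired and admitted" rule.

```go
package main

import "fmt"

// UID stands in for k8s.io/apimachinery/pkg/types.UID.
type UID string

// PodWorkType and TemporarilyTerminatedWork mirror the values in the previous sketch.
type PodWorkType int

const (
	SyncPodWork PodWorkType = iota
	TerminatedPodWork
	TemporarilyTerminatedWork
)

// Pod is a trimmed-down stand-in for v1.Pod with only what this sketch needs.
type Pod struct {
	UID      UID
	Name     string
	Terminal bool // phase is Succeeded or Failed
}

// housekeeping reconciles the pod worker view with the desired (config) state.
// Any pod whose worker was reported as TemporarilyTerminatedWork is started again,
// but only if it is still desired, not terminal, and would be (re)admitted.
func housekeeping(workers map[UID]PodWorkType, desired map[UID]*Pod, admit func(*Pod) bool) []*Pod {
	var restart []*Pod
	for uid, work := range workers {
		if work != TemporarilyTerminatedWork {
			continue
		}
		pod, ok := desired[uid]
		if !ok || pod.Terminal || !admit(pod) {
			// No longer wanted, already terminal, or rejected by admission:
			// leave the pod terminated.
			continue
		}
		restart = append(restart, pod)
	}
	return restart
}

func main() {
	workers := map[UID]PodWorkType{"static-pod-uid": TemporarilyTerminatedWork}
	desired := map[UID]*Pod{"static-pod-uid": {UID: "static-pod-uid", Name: "etcd-node1"}}
	admitAll := func(*Pod) bool { return true }
	for _, pod := range housekeeping(workers, desired, admitAll) {
		// In the kubelet this would dispatch the pod back to the pod worker.
		fmt.Println("restarting", pod.Name)
	}
}
```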

@249043822
Member

/priority critical-urgent
/triage accepted

@k8s-ci-robot k8s-ci-robot added priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now. triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-priority Indicates a PR lacks a `priority/foo` label and requires one. labels Sep 10, 2021
@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 16, 2021
@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: derekwaynecarr, smarterclayton

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@derekwaynecarr
Member

/test pull-kubernetes-node-kubelet-serial-containerd
/test pull-kubernetes-node-kubelet-serial

@rphillips
Member

/hold cancel

@k8s-ci-robot k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Sep 16, 2021
@k8s-ci-robot
Contributor

@smarterclayton: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Rerun command
pull-kubernetes-node-kubelet-serial-containerd d571980 link /test pull-kubernetes-node-kubelet-serial-containerd
pull-kubernetes-node-kubelet-serial d571980 link /test pull-kubernetes-node-kubelet-serial
pull-kubernetes-integration d571980 link /test pull-kubernetes-integration

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@rphillips
Member

TestCustomResourceCascadingDeletion flake in pull-kubernetes-integration

/test pull-kubernetes-integration

@k8s-ci-robot k8s-ci-robot merged commit 51384aa into kubernetes:master Sep 16, 2021
SIG Node CI/Test Board automation moved this from Archive-it to Done Sep 16, 2021
SIG Node PR Triage automation moved this from Waiting on Author to Done Sep 16, 2021
@k8s-ci-robot k8s-ci-robot added this to the v1.23 milestone Sep 16, 2021
k8s-ci-robot added a commit that referenced this pull request Sep 17, 2021
…04847-upstream-release-1.22

Automated cherry pick of #104847: kubelet: Handle UID reuse in pod worker
ehashman pushed a commit to ehashman/kubernetes that referenced this pull request Nov 24, 2021
A pod that has been rejected by admission will have status manager
set the phase to Failed locally, which may take some time to
propagate to the apiserver. The rejected pod will be included in
admission until the apiserver propagates the change back, which
was an unintended regression when checking pod worker state as
authoritative.

A pod that is terminal in the API may still be consuming resources
on the system, so it should still be included in admission.

[ehashman] Rebased on top of kubernetes#104847.
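The commit message above describes which pods should still count toward admission. One way to read that rule is illustrated below; filterForAdmission, localPhase, and couldHaveRunningContainers are invented names for this sketch and do not reproduce the kubelet's actual admission filtering code.

```go
package main

import "fmt"

// UID stands in for k8s.io/apimachinery/pkg/types.UID.
type UID string

// PodPhase mirrors v1.PodPhase for this sketch.
type PodPhase string

const (
	PodRunning   PodPhase = "Running"
	PodSucceeded PodPhase = "Succeeded"
	PodFailed    PodPhase = "Failed"
)

// Pod is a trimmed stand-in for v1.Pod carrying only the API-reported phase.
type Pod struct {
	UID      UID
	Name     string
	APIPhase PodPhase
}

func isTerminalPhase(phase PodPhase) bool {
	return phase == PodSucceeded || phase == PodFailed
}

// filterForAdmission returns the pods that should count against node resources
// when admitting a new pod (hypothetical helper, not the kubelet's real code):
//
//   - localPhase is the kubelet's own status view, which can be ahead of the
//     apiserver: a pod rejected by admission is Failed locally before the API
//     copy reflects it, so it stops counting immediately.
//   - couldHaveRunningContainers is the pod worker's view: a pod that is
//     terminal in the API may still have containers consuming resources on the
//     node, so it keeps counting until the worker says otherwise.
func filterForAdmission(pods []*Pod, localPhase map[UID]PodPhase, couldHaveRunningContainers map[UID]bool) []*Pod {
	var active []*Pod
	for _, pod := range pods {
		terminal := isTerminalPhase(pod.APIPhase)
		if p, ok := localPhase[pod.UID]; ok && isTerminalPhase(p) {
			terminal = true // the kubelet already knows this pod is done
		}
		if terminal && !couldHaveRunningContainers[pod.UID] {
			continue // done and holding no resources: drop from admission math
		}
		active = append(active, pod)
	}
	return active
}

func main() {
	pods := []*Pod{
		{UID: "a", Name: "rejected-pod", APIPhase: PodRunning},    // rejection only recorded locally so far
		{UID: "b", Name: "finishing-pod", APIPhase: PodSucceeded}, // API says done, node still tearing down
	}
	localPhase := map[UID]PodPhase{"a": PodFailed}
	couldRun := map[UID]bool{"b": true}
	for _, p := range filterForAdmission(pods, localPhase, couldRun) {
		fmt.Println("counted for admission:", p.Name) // prints only finishing-pod
	}
}
```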
@liggitt liggitt added the kind/regression Categorizes issue or PR as related to a regression from a prior release. label Apr 27, 2022
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. area/kubelet area/test cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/bug Categorizes issue or PR as related to a bug. kind/regression Categorizes issue or PR as related to a regression from a prior release. lgtm "Looks good to me", indicates that a PR is ready to be merged. priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now. release-note Denotes a PR that will be considered when it comes time to generate release notes. sig/node Categorizes an issue or PR as relevant to SIG Node. sig/testing Categorizes an issue or PR as relevant to SIG Testing. size/L Denotes a PR that changes 100-499 lines, ignoring generated files. triage/accepted Indicates an issue or PR is ready to be actively worked on.
Development

Successfully merging this pull request may close these issues.

1.22 regression: removing and recreating static pod manifest leaves pod in error state
10 participants