Bug 1920221: Allow test invokers to skip test waits before and after #98781

smarterclayton · 2021-02-05T01:20:53Z

A number of e2e tests are useful to run after the system has been disrupted or is in the progress of being disrupted, but the current
suite and test logic blocks progress waiting for all nodes to be healthy.

By passing -1 to --minStartupPods or --allowed-not-ready-nodes flags the caller can bypass wait logic before and after test suites that would prevent running e2e during disruption. This allows use of parts of the e2e suite during cluster duress to verify that controllers or components still function.

A specific example of this includes testing clusters that have a number of nodes marked unschedulable or not ready and verifying that the system still functions.

In general, some of the hardcoded waits won't make sense on all Kube distributions anyway (those without pods in kube-system), so bypassing may be useful for others who wrap e2e with their own logic. A caller should not have to have pods in kube-system to be conformant, for example.

This should not impact any existing callers of these APIs since previously -1 would fail or wait forever.

/kind cleanup

The e2e suite can be instructed not to wait for pods in kube-system to be ready or for all nodes to be ready by passing `--allowed-not-ready-nodes=-1` when invoking the e2e.test program. This allows callers to run subsets of the e2e suite in scenarios other than perfectly healthy clusters.

A number of e2e tests are useful to run after the system has been disrupted or is in the progress of being disrupted, but the current suite and test logic blocks progress waiting for all nodes to be healthy. By passing -1 to --minStartupPods or --allowed-not-ready-nodes flags the caller can bypass wait logic before and after test suites that would prevent running e2e during disruption. This allows use of parts of the e2e suite during cluster duress to verify that controllers or components still function.

k8s-ci-robot · 2021-02-05T01:20:59Z

@smarterclayton: This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

k8s-ci-robot · 2021-02-05T01:22:04Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: smarterclayton

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~test/OWNERS~~ [smarterclayton]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

smarterclayton · 2021-02-05T02:39:41Z

/retest

smarterclayton · 2021-02-05T03:06:14Z

/retest

smarterclayton · 2021-02-05T03:35:33Z

/retest

deads2k · 2021-02-05T13:18:39Z

controlled skips of preflights makes sense and doesn't impact callers making use of them.

/lgtm

k8s-ci-robot added do-not-merge/needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. needs-priority Indicates a PR lacks a `priority/foo` label and requires one. labels Feb 5, 2021

k8s-ci-robot requested review from pohly and SataQiu February 5, 2021 01:21

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 5, 2021

openshift-ci-robot mentioned this pull request Feb 5, 2021

Bug 1920221: Prevent GCP e2e tests from triggering a rate limit on the listZone API openshift/kubernetes#552

Merged

smarterclayton changed the title ~~test/e2e: Allow test invokers to skip test waits before and after~~ Bug 1920221: Allow test invokers to skip test waits before and after Feb 5, 2021

k8s-ci-robot assigned deads2k Feb 5, 2021

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Feb 5, 2021

k8s-ci-robot merged commit 26744ac into kubernetes:master Feb 5, 2021

k8s-ci-robot added this to the v1.21 milestone Feb 5, 2021

openshift-ci-robot mentioned this pull request Feb 8, 2021

[release-4.6] Bug 1926262: Prevent GCP e2e tests from triggering a rate limit on the listZone API openshift/kubernetes#556

Merged

openshift-ci-robot mentioned this pull request Jul 8, 2021

[WIP] [release-4.6] Rebase onto v1.19.12 openshift/kubernetes#850

Closed

openshift-ci-robot mentioned this pull request Sep 7, 2021

Bug 2003027: Rebase 1.20.10 openshift/kubernetes#935

Merged

openshift-ci-robot mentioned this pull request Sep 16, 2021

[release-4.6] Bug 2008266: Rebase 1.19.14 openshift/kubernetes#962

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug 1920221: Allow test invokers to skip test waits before and after #98781

Bug 1920221: Allow test invokers to skip test waits before and after #98781

smarterclayton commented Feb 5, 2021 •

edited

k8s-ci-robot commented Feb 5, 2021

k8s-ci-robot commented Feb 5, 2021

smarterclayton commented Feb 5, 2021

smarterclayton commented Feb 5, 2021

smarterclayton commented Feb 5, 2021

deads2k commented Feb 5, 2021

Bug 1920221: Allow test invokers to skip test waits before and after #98781

Bug 1920221: Allow test invokers to skip test waits before and after #98781

Conversation

smarterclayton commented Feb 5, 2021 • edited

k8s-ci-robot commented Feb 5, 2021

k8s-ci-robot commented Feb 5, 2021

smarterclayton commented Feb 5, 2021

smarterclayton commented Feb 5, 2021

smarterclayton commented Feb 5, 2021

deads2k commented Feb 5, 2021

smarterclayton commented Feb 5, 2021 •

edited