test/utils/image: Support a single repository #93510

smarterclayton · 2020-07-28T18:39:22Z

In downstream contexts, it's extremely useful to be able to combine all the "testable" images in Kubernetes into a single repo so that a user could mirror these offline in one chunk, and audit the set of images for changes. For instance, within OpenShift we would like to have a single place we can place all the images used by all the tests with a single authentication scheme. While some images are not "real" and can't be mirrored (for instance, the images that point to an auth protected registry), that is not the majority.

This code makes it possible to specify an environment variable KUBE_TEST_REPO that maps the static strings of the registry to a single repository by placing the uniqueness in a tag. For instance:

KUBE_TEST_REPO=quay.io/openshift/community-e2e-images

would translate k8s.gcr.io/prometheus-to-sd:v0.5.0 to quay.io/openshift/community-e2e-images:e2e-30-k8s-gcr-io-prometheus-to-sd-v0-5-0-6JI59Yih4oaj3oQOjRfhyQ. When running the e2e tests with this variable set all code that uses the images will pull from the later location.

The tag is a safe form of the name, plus the index (the constant within manifest.go), plus a hash of the full input. The length of the tag is constrained to the minimum of hash + index + the safe name. Since we will continue to add new images, there should be no assumption that the tag creation cannot evolve over time - someone downstream will need to use a process to map the source images to destination images no matter what, and that is a mechanical transformation that should be independent of the tag value. Old tags should definitely be preserved for older versions of code, but there is no hard requirement that this have stability across code versions.

The public method is changed to return two maps - index to original name and index to test repo name. These maps would be the same if the env var is not set. A consumer that wished to build that mapping (not implemented here, but fairly easy) is to read the map and output the transformation in a form that could be scripted by users using image mirroring command line tools.

Follow up

Add a doc on this
Add a simple tool for getting a FROM->TO map for CLI automation

Specifying the KUBE_TEST_REPO environment variable when e2e tests are executed will instruct the test infrastructure to load that image from a location within the specified repo, using a predefined pattern.

k8s-ci-robot · 2020-07-28T18:40:20Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: smarterclayton

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~test/utils/image/OWNERS~~ [smarterclayton]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

fejta-bot · 2020-10-26T21:14:24Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

smarterclayton · 2020-12-02T19:55:15Z

/remove-lifecycle stale

wilsonehusin

I'm in favor for adding this feature! small question on hashing part 😄

wilsonehusin · 2020-12-03T20:15:17Z

test/utils/image/manifest.go

+// getRepositoryMappedConfig maps an existing image to the provided repo, generating a
+// tag that is unique with the input config. The tag will contain the index, a hash of
+// the image spec (to be unique) and shorten and make the pull spec "safe" so it will
+// fit in the tag to allow a human to recognize the value. If index is -1, then no
+// index will be added to the tag.
+func getRepositoryMappedConfig(index int, config Config, repo string) Config {


I'd like to understand this use case -- is it because image tags are mutable and we want to make sure it's consistent through hash?

alternatively, I'm not sure if I follow the what scenario is implied with "safe" in the description

Since tags have character limits that are shorter than the limit of a full image reference, the hash is to dedup similar images with very long names from one registry. I.e. something like:

docker.io/my-very-long-repo-name/very-very-very-very-very-very-very-very-very-very-very-very-very-very-very-very-long:lots_of_characters_in_tags docker.io/my-very-long-repo-name/very-very-very-very-very-very-very-very-very-very-very-very-very-very-very-very-long:lots_of_characters_in_tags_2

needs to be able to fit into the character limit of an image tag, but we want to be able to guarantee that if you had two of these you could collapse them into a tag in the same repo (shorten the pull_spec, add the hash that uniquifies the rest)

Oh I see, I missed this part of PR description:

k8s.gcr.io/prometheus-to-sd:v0.5.0 to quay.io/openshift/community-e2e-images:e2e-30-k8s-gcr-io-prometheus-to-sd-v0-5-0-6JI59Yih4oaj3oQOjRfhyQ

I just realized that all images are defined with the same name (i.e. community-e2e-images) but with different tags -- that part doesn't seem very intuitive to me. Am I missing something from this proposed approach vs having each image be its own entry in the repository? i.e. quay.io/openshift-k8s-e2e/prometheus-to-sd:v0.5.0

It's mostly simplification - you may not own 50 different repositories (like on docker.io you couldn't mirror the same way). So then if you have a choice between one repo vs multiple - having one repo is more flexibility if you have quota or cost limits (i.e. on docker.io can't have 30 private repos without paying). So really trying to simplify it all down as much as possible to be most flexible for both people who have cloud provider registries, or docker.io accounts, or quay accounts, or maybe even on prem where you are only allowed to touch one repo in your artifactory server.

Also, think about if you're testing this. If you want to sync for 1.20, and for 1.21, you might actually want to test it first. To test it, you want a way to catch "oops". If you're mirroring into individual repos, then your oops factor could be much higher. If each mirror is one repo (and the code is designed to ensure the same image ends up in the same spot) then you can share AND separate. Once you've tested you can delete that repo - that's easier to do than 50 different repos. A repo only having one type of image is just a convention, it's not a hard rule.

Thanks for elaborating. The scenario of mirroring to docker.io isn't something that I considered nor has a use case for. My perspective on supporting this repository override is to make it easy to run conformance on airgapped clusters, which typically is locked to self-hosted container registries, making the cost to host individual image repository somewhat negligible (vs SaaS registry).

I do see value in minimizing "oops" factor though -- I agree it'd be a good idea to have such system!

dims · 2021-01-11T15:13:39Z

@smarterclayton @wilsonehusin do we want to pick this back up and get to a consensus?

wilsonehusin · 2021-01-11T17:22:57Z

@dims yes I'd love to! I'm still trying to understand the use case for the hashing part, awaiting @smarterclayton

Generally I'm okay with it, just that I'm wondering if it's a solution that is still relevant today or not

smarterclayton · 2021-01-12T20:16:49Z

To really fill this out, I think a second part would be to add a simple stub command that wrote out FROM TO statements that could be used with image tooling (like docker, skopeo, etc etc etc). I would also want to write up docs for "how you run e2e tests from a private mirror without access to public internet".

In downstream contexts, it's extremely useful to be able to combine all the "testable" images in Kubernetes into a single repo so that a user could mirror these offline in one chunk, and audit the set of images for changes. For instance, within OpenShift we would like to have a single place we can place all the images used by all the tests with a single authentication scheme. While some images are not "real" and can't be mirrored (for instance, the images that point to an auth protected registry), that is not the majority. This code makes it possible to specify an environment variable KUBE_TEST_REPO that maps the static strings of the registry to a single repository by placing the uniqueness in a tag. For instance: KUBE_TEST_REPO=quay.io/openshift/community-e2e-images would translate `k8s.gcr.io/prometheus-to-sd:v0.5.0` to `quay.io/openshift/community-e2e-images:e2e-30-k8s-gcr-io-prometheus-to-sd-v0-5-0-6JI59Yih4oaj3oQOjRfhyQ`. The tag is a safe form of the name, plus the index (the constant within manifest.go), plus a hash of the full input. The length of the tag is constrained to the minimum of hash + index + the safe name. The public method is changed to return two maps - index to original name and index to test repo name. These maps would be the same if the env var is not set.

smarterclayton · 2021-01-13T18:54:00Z

I can do the stub command in this PR or separate - up to you two (if the argument I make about single repo being compelling, obviously)

wilsonehusin · 2021-01-13T20:35:05Z

The FROM TO statements seem nice to have, but this seem to be useful by itself. I'm leaning towards having this merged in and iterate on the FROM TO statements afterwards, but I don't feel strongly on either approach.

/lgtm

smarterclayton · 2021-01-14T14:36:42Z

/kind bug

k8s-ci-robot · 2021-01-14T16:06:40Z

@smarterclayton: The following tests failed, say /retest to rerun all failed tests:

Test name	Commit	Details	Rerun command
pull-kubernetes-files-remake	1fb7931ade9e2c70da6ec0b8d3c55b6eea1e951f	link	`/test pull-kubernetes-files-remake`
pull-kubernetes-e2e-gce	1fb7931ade9e2c70da6ec0b8d3c55b6eea1e951f	link	`/test pull-kubernetes-e2e-gce`
pull-kubernetes-kubemark-e2e-gce-big	1fb7931ade9e2c70da6ec0b8d3c55b6eea1e951f	link	`/test pull-kubernetes-kubemark-e2e-gce-big`

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

fejta-bot · 2021-01-14T18:16:23Z

/retest
This bot automatically retries jobs that failed/flaked on approved PRs (send feedback to fejta).

Review the full test history for this PR.

Silence the bot with an /lgtm cancel or /hold comment for consistent failures.

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jul 28, 2020

k8s-ci-robot requested review from listx and mkumatag July 28, 2020 18:40

k8s-ci-robot added release-note Denotes a PR that will be considered when it comes time to generate release notes. and removed do-not-merge/release-note-label-needed Indicates that a PR should not merge because it's missing one of the release note labels. labels Jul 28, 2020

smarterclayton force-pushed the one_repo_test_images branch from 18194c4 to 1fb7931 Compare July 28, 2020 18:49

sttts mentioned this pull request Oct 12, 2020

Bug 1816812: Allow test images to be in a single mirror openshift/kubernetes#291

Merged

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 26, 2020

spiffxp added do-not-merge/needs-kind Indicates a PR lacks a `kind/foo` label and requires one. and removed needs-kind Indicates a PR lacks a `kind/foo` label and requires one. labels Nov 5, 2020

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 2, 2020

smarterclayton mentioned this pull request Dec 2, 2020

[Umbrella] Considerations around a mirror registry kubernetes/sig-release#1369

Closed

wilsonehusin mentioned this pull request Dec 2, 2020

Ability to get list of overridable registries in e2e.test #96475

Closed

wilsonehusin reviewed Dec 3, 2020

View reviewed changes

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jan 11, 2021

smarterclayton force-pushed the one_repo_test_images branch from 1fb7931 to 386f94f Compare January 12, 2021 20:23

smarterclayton changed the title ~~DO NOT MERGE: test/utils/image: Support a single repository~~ test/utils/image: Support a single repository Jan 12, 2021

k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jan 12, 2021

k8s-ci-robot assigned wilsonehusin Jan 13, 2021

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Jan 13, 2021

k8s-ci-robot added kind/bug Categorizes issue or PR as related to a bug. and removed do-not-merge/needs-kind Indicates a PR lacks a `kind/foo` label and requires one. labels Jan 14, 2021

k8s-ci-robot merged commit 0b4a30b into kubernetes:master Jan 14, 2021

k8s-ci-robot added this to the v1.21 milestone Jan 14, 2021

github-actions bot mentioned this pull request Jan 19, 2021

Week Ending January 17, 2021 dev-obs/actus#320

Open

openshift-ci-robot mentioned this pull request Jul 8, 2021

[WIP] [release-4.6] Rebase onto v1.19.12 openshift/kubernetes#850

Closed

openshift-ci-robot mentioned this pull request Sep 7, 2021

Bug 2003027: Rebase 1.20.10 openshift/kubernetes#935

Merged

openshift-ci-robot mentioned this pull request Sep 16, 2021

[release-4.6] Bug 2008266: Rebase 1.19.14 openshift/kubernetes#962

Merged

johnSchnake mentioned this pull request Dec 14, 2021

can not pull image from china: docker pull k8s.gcr.io/conformance:v1.22.1 vmware-tanzu/sonobuoy#1535

Closed

aojea mentioned this pull request Jan 15, 2022

Modify test/utils/image/manifest.go to handle the images based on a yaml file #107537

Closed

johnSchnake mentioned this pull request Apr 17, 2022

testing image translation needed for KUBE_TEST_REPO #109523

Closed

aojea mentioned this pull request Jan 13, 2023

[E2E] Support for pulling images from private registry #114615

Closed

BenTheElder mentioned this pull request Sep 14, 2023

Add Mirroring Guide for e2e.test / Conformance tests kubernetes/registry.k8s.io#259

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test/utils/image: Support a single repository #93510

test/utils/image: Support a single repository #93510

smarterclayton commented Jul 28, 2020 •

edited

k8s-ci-robot commented Jul 28, 2020

fejta-bot commented Oct 26, 2020

smarterclayton commented Dec 2, 2020

wilsonehusin left a comment

wilsonehusin Dec 3, 2020

smarterclayton Dec 3, 2020

wilsonehusin Dec 9, 2020

smarterclayton Jan 12, 2021 •

edited

wilsonehusin Jan 13, 2021

dims commented Jan 11, 2021

wilsonehusin commented Jan 11, 2021

smarterclayton commented Jan 12, 2021

smarterclayton commented Jan 13, 2021

wilsonehusin commented Jan 13, 2021

smarterclayton commented Jan 14, 2021

k8s-ci-robot commented Jan 14, 2021 •

edited

fejta-bot commented Jan 14, 2021

test/utils/image: Support a single repository #93510

test/utils/image: Support a single repository #93510

Conversation

smarterclayton commented Jul 28, 2020 • edited

k8s-ci-robot commented Jul 28, 2020

fejta-bot commented Oct 26, 2020

smarterclayton commented Dec 2, 2020

wilsonehusin left a comment

Choose a reason for hiding this comment

wilsonehusin Dec 3, 2020

Choose a reason for hiding this comment

smarterclayton Dec 3, 2020

Choose a reason for hiding this comment

wilsonehusin Dec 9, 2020

Choose a reason for hiding this comment

smarterclayton Jan 12, 2021 • edited

Choose a reason for hiding this comment

wilsonehusin Jan 13, 2021

Choose a reason for hiding this comment

dims commented Jan 11, 2021

wilsonehusin commented Jan 11, 2021

smarterclayton commented Jan 12, 2021

smarterclayton commented Jan 13, 2021

wilsonehusin commented Jan 13, 2021

smarterclayton commented Jan 14, 2021

k8s-ci-robot commented Jan 14, 2021 • edited

fejta-bot commented Jan 14, 2021

smarterclayton commented Jul 28, 2020 •

edited

smarterclayton Jan 12, 2021 •

edited

k8s-ci-robot commented Jan 14, 2021 •

edited