podvm: Understand and reduce podvm permutations #1890

stevenhorsman · 2024-07-01T09:53:01Z

At the moment we have a matrix of 4 possible options for podvm (mkosi/packer) x (cloud-init/process-user-data). We then multiple this by base OSs too (ubuntu/fedora/rhel) (we will ignore OS version at the moment on the assumption we can sync on that?) and cloud-providers that can support it and it explodes quite a lot and becomes complicated to understand and test
We want to reduce this, so we can minimise differences and duplicated code. One possible plan is:

Identify all the variants of podvm builds we have and track who is using what
Try and switch all podvm builds to use mkosi
Switch the CI to mkosi and then deprecate packer
Later investigate removing cloud-init and getting process-user-data to read the data from there if possible?

stevenhorsman · 2024-07-01T10:08:28Z

Part 1 - Identify podvm builds

Note: I think we decided a while ago that mkosi didn't work so well with ubuntu, so we wanted to switch to a fedora-like stack and deprecate the Ubuntu based podvm builds upstream?

Base OS	Architecture	Cloud provider(s)	mkosi/packer	cloud-init/process-user-data	Being used?	Being tested	Notes
Ubuntu	amd64	aws	packer	cloud-init	❓
Ubuntu	amd64	azure	packer	cloud-init	❓	✅
Ubuntu	amd64	docker	packer	cloud-init	❓	✅ ?
Ubuntu	amd64	ibmcloud	packer	cloud-init	✅	❓
Ubuntu	amd64	libvirt	packer	cloud-init	✅	✅
Ubuntu	amd64	powervs	packer	cloud-init	❓
Ubuntu	amd64	vsphere	packer	cloud-init	❓		Deprecate this?
Ubuntu	s390x	ibmcloud/libvirt	packer	cloud-init	✅
Fedora	amd64	aws	mkosi	process-user-data>	✅
Fedora	amd64	azure	mkosi	process-user-data>	✅
Fedora	amd64	docker	mkosi?	<cloud-init/process-user-data>	✅
Fedora	amd64	ibmcloud	mkosi?	cloud-init?	❓	❓
Fedora	amd64	libvirt	mkosi?	cloud-init?	❓	❓
Fedora	amd64	powervs	mkosi/packer	<cloud-init/process-user-data>	❓
Fedora	amd64	vsphere	packer?	cloud-init	❓		Deprecate this?
Fedora	s390x	ibmcloud/libvirt	mkosi?	cloud-init?	❓	❓
RHEL	amd64	aws	mkosi?	<cloud-init/process-user-data>	❓		Any upstream testing?
RHEL	amd64	azure	mkosi?	<cloud-init/process-user-data>	❓		Any upstream testing?
RHEL	amd64	docker	mkosi?	<cloud-init/process-user-data>			Any upstream testing?
RHEL	amd64	ibmcloud	mkosi?	cloud-init?	❓		Any upstream testing?
RHEL	amd64	libvirt	mkosi?	cloud-init?	❓		Any upstream testing?
RHEL	amd64	powervs	mkosi/packer	<cloud-init/process-user-data>	❓		Any upstream testing?
RHEL	amd64	vsphere	packer	cloud-init	❓		Deprecate this?
RHEL	s390x	ibmcloud/libvirt	<mkosi/packer>	cloud-init?	❓		Any upstream testing?

mkulke · 2024-07-01T10:25:29Z

did you mean "Any downstream testing?"

maybe we can have a "being tested" column

mkosi_x86_64 should work on both AWS + Azure.

Afaik all the packer images use cloud-init?

stevenhorsman · 2024-07-01T10:30:28Z

did you mean "Any downstream testing?"

For RHEL I meant testing of the upstream podvm build, but that testing itself could be manual testing, or testing in a downstream environment (as I'm pretty confident we don't have any upstream automated testing for RHEL). We have some documentation for it though. I hope that helps clarify?

maybe we can have a "being tested" column

Will do

mkosi_x86_64 should work on both AWS + Azure.

These are both using process-user-data I believe and primarly fedora based in the upstream testing?

mkulke · 2024-07-01T10:43:35Z

did you mean "Any downstream testing?"

From RHEL I meant testing of the upstream podvm build, but that testing itself could be manual testing, or testing in a downstream environment (as I'm pretty confident we don't have automated testing for RHEL). We have some documentation for it though. I hope that helps clarify?

not quite :) I guess we have either (automated) testing in the project or potentially "downstream" (e.g a vendor product that uses CAA). One could argue that untested images, if they are consumed and tested downstream should also be maintained downstream?

These are both using process-user-data I believe and primarly fedora based in the upstream testing?

yes. I think we can just check for "cloud-init" yes/no. cloud-init will not work on dm-verity protected root-fs's. so we could also just check for dm-verity yes/no?

mkulke · 2024-07-01T10:45:42Z

none of the mkosi image is being tested atm, afaict

mkulke · 2024-07-01T10:46:46Z

amd64 azure packer image is being tested

stevenhorsman · 2024-07-01T10:53:32Z

not quite :) I guess we have either (automated) testing in the project or potentially "downstream" (e.g a vendor product that uses CAA). One could argue that untested images, if they are consumed and tested downstream should also be maintained downstream?

So the grey area that I was hinting at was for when pure upstream versions were tested internall. e.g. for ibmcloud, we tested the pure upstream version, but due to lack of publicly available resources those tests were done internally. I agree that if the versions are downstream then the downstream teams are responsible for maintenance (though we want to do our best to not break them, so it's potentially interesting). Sorry, I think I'm mostly overcomplicating an already complicated chart!

stevenhorsman · 2024-07-08T10:00:19Z

Part 2 - Identify To-Be of podvm builds

As the attempt to list out our "As-is" set of podvm images hasn't seemed to work, probably as it is too complex, maybe we should focus on out To-Be set and what we are aiming for in Step 2 of this work. My starting list of suggestions, based on quite a lot of ignorance is:

Base OS	Architecture	Cloud provider(s)	mkosi/packer	cloud-init/process-user-data	Being used?	Being tested?	Notes
Fedora	amd64	aws	mkosi	process-user-data	✅
Fedora	amd64	azure	mkosi	process-user-data	✅
Fedora	amd64	docker	mkosi	process-user-data	✅
Fedora	amd64	ibmcloud	mkosi	cloud-init	❓	❓	Requires UEFI, which isn't available in all IBM Cloud regions
Fedora	amd64	libvirt	mkosi	cloud-init	✅	❓
Fedora	amd64	powervs	mkosi	<cloud-init/process-user-data>	❓
Fedora	s390x	ibmcloud	mkosi	cloud-init	✅	❓
Fedora	s390x	libvirt	mkosi	cloud-init	✅	❓

It would be good to understand where we are - particularly with libvirt and ibmcloud to understand what gaps we have in just trying out podvm builds.

wainersm · 2024-07-08T15:39:15Z

Hi @stevenhorsman !

What if we have an entry for Ubuntu/amd64/libvirt/packer/cloud-init for a while mkosi equivalent isn't running on CI?

stevenhorsman · 2024-07-08T15:46:40Z

Hi @stevenhorsman !

What if we have an entry for Ubuntu/amd64/libvirt/packer/cloud-init for a while mkosi equivalent isn't running on CI?

So I'm just trying to track and understand if anyone has already tried these combination above for, the short-term goals. It isn't designed to say that we will remove the Ubuntu builds until we have replacements. Ideally if/when someone confirms that Fedora/amd64/libvirt/mkosi/cloud-init works then, we'd start a parallel builds in our CI to create that version of podvms.

snir911 · 2024-07-09T07:19:18Z

AFAIK RHEL is supported only by packer builds and for AWS, Azure and libvirt, and there's no upstream testing for RHEL

stevenhorsman added the podvm Related to podvm images label Jul 1, 2024

wainersm mentioned this issue Jul 22, 2024

podvm-mkosi: update podvm-builder for fedora image #1939

Merged

stevenhorsman self-assigned this Sep 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

podvm: Understand and reduce podvm permutations #1890

podvm: Understand and reduce podvm permutations #1890

stevenhorsman commented Jul 1, 2024 •

edited

Loading

stevenhorsman commented Jul 1, 2024 •

edited

Loading

mkulke commented Jul 1, 2024

stevenhorsman commented Jul 1, 2024 •

edited

Loading

mkulke commented Jul 1, 2024

mkulke commented Jul 1, 2024

mkulke commented Jul 1, 2024

stevenhorsman commented Jul 1, 2024

stevenhorsman commented Jul 8, 2024 •

edited

Loading

wainersm commented Jul 8, 2024

stevenhorsman commented Jul 8, 2024

snir911 commented Jul 9, 2024

podvm: Understand and reduce podvm permutations #1890

podvm: Understand and reduce podvm permutations #1890

Comments

stevenhorsman commented Jul 1, 2024 • edited Loading

stevenhorsman commented Jul 1, 2024 • edited Loading

Part 1 - Identify podvm builds

mkulke commented Jul 1, 2024

stevenhorsman commented Jul 1, 2024 • edited Loading

mkulke commented Jul 1, 2024

mkulke commented Jul 1, 2024

mkulke commented Jul 1, 2024

stevenhorsman commented Jul 1, 2024

stevenhorsman commented Jul 8, 2024 • edited Loading

Part 2 - Identify To-Be of podvm builds

wainersm commented Jul 8, 2024

stevenhorsman commented Jul 8, 2024

snir911 commented Jul 9, 2024

stevenhorsman commented Jul 1, 2024 •

edited

Loading

stevenhorsman commented Jul 1, 2024 •

edited

Loading

stevenhorsman commented Jul 1, 2024 •

edited

Loading

stevenhorsman commented Jul 8, 2024 •

edited

Loading