
Conversation

@beraldoleal
Member

This is a first draft of our deployment via charts. The CoCo project is moving from the operator to an umbrella chart that will have other charts as dependencies, including kata-containers.

This way we can list peerpods as a dependency of that chart.

Ideally, our kustomize folder could be replaced by this chart, to avoid duplication. But I would like to start simple with the chart first and replace it later.
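To make the dependency idea concrete, here is a minimal sketch of how such an umbrella chart could declare peerpods (and kata-containers) as dependencies in its Chart.yaml; the chart names, versions, and repository URL are placeholders, not the actual CoCo layout:

```yaml
# Hypothetical umbrella Chart.yaml (names, versions and repository are placeholders)
apiVersion: v2
name: confidential-containers
version: 0.1.0
dependencies:
  - name: kata-containers
    version: "0.1.0"
    repository: "https://example.org/charts"
  - name: peerpods
    version: "0.1.0"
    repository: "https://example.org/charts"
    condition: peerpods.enabled   # only deployed when the user enables peer pods
```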

@beraldoleal beraldoleal requested a review from a team as a code owner October 27, 2025 11:32
@wainersm
Member

Hi @beraldoleal, first of all, thanks for working on that!

I only glanced at the changes, and the first thing that caught my eye is that you are grouping configuration for all cloud providers in the same files. However, most of our code is organized by cloud provider directories (e.g. src/cloud-api-adaptor/, src/cloud-providers/, etc...). It's common for pull requests to be reviewed by the maintainers of provider X because the code changes files in provider X's directory; likewise, CI runs only test X... is it possible to make the helm charts modular per cloud provider too?

Also, does this PR contain everything needed to properly replace the operator? Asking because I was thinking maybe we could run CI on this PR to see if it works fine.

@beraldoleal
Member Author

beraldoleal commented Oct 28, 2025

I only glanced at the changes, and the first thing that caught my eye is that you are grouping configuration for all cloud providers in the same files. However, most of our code is organized by cloud provider directories (e.g. src/cloud-api-adaptor/, src/cloud-providers/, etc...). It's common for pull requests to be reviewed by the maintainers of provider X because the code changes files in provider X's directory; likewise, CI runs only test X... is it possible to make the helm charts modular per cloud provider too?

I thought about splitting as well, but I'm not entirely convinced it's the right approach here. A few things have been bugging me:

  1. Users would need to set the provider in two places: once inside the values file itself, and again when running the install command with -f values/<provider>.yaml. Feels a bit redundant and could be confusing;

  2. Provider configurations may diverge significantly over time. Keeping them in separate files might make it easier for that divergence to happen unnoticed, whereas having them in one file makes differences more visible and encourages consistency.

  3. The templates are already generic enough. Our ConfigMap template dynamically captures any value under the provider section: there's no provider-specific template logic needed (see the sketch just below this list). So we'd only be splitting the values, not the templates themselves.
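For illustration only, a generic ConfigMap template along these lines can copy whatever keys live under the selected provider's values section; the values layout (a providers map keyed by provider name) is an assumption for this sketch, not necessarily what this PR uses:

```yaml
# Sketch of a provider-agnostic ConfigMap template (values layout is assumed)
apiVersion: v1
kind: ConfigMap
metadata:
  name: peer-pods-cm
data:
  CLOUD_PROVIDER: "{{ .Values.provider }}"
  # Emit every key/value found under the active provider's section,
  # e.g. .Values.providers.gcp when provider=gcp.
  {{- range $key, $value := index .Values.providers .Values.provider }}
  {{ $key }}: {{ $value | quote }}
  {{- end }}
```

With a template like this, adding a provider mostly means adding a new block of values rather than new template logic.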

For testing multiple providers in CI, we'd need multiple values files anyway, regardless of which approach we take.

That said... I guess my main question is: do we actually have templates that diverge from provider to provider, or is it just the variables that go into the ConfigMap? If it's only the values that differ and the templates work generically across all providers, I'm leaning toward keeping things together for now.

But I'm open to being convinced otherwise. wdyt?

Also, thinking more about the location, I think we should add this as part of the confidential-containers/helms repository, to give it better visibility and make it a dependency of the kata chart. (Ignore)

Also, does this PR contain everything needed to properly replace the operator? Asking because I was thinking maybe we could run CI on this PR to see if it works fine.

I haven't deployed it yet. It's on my todo list for this week. But it's a good idea to run a CI job.

Member

@stevenhorsman stevenhorsman left a comment


Thanks for starting this @beraldoleal - I am very ignorant about helm, so I really appreciate it. As Wainer has said - let's get some CI on this as soon as we can. I'm not sure about all the providers being together and whether that is confusing for folks who only deal with one of them, but we can canvass opinion more widely in the community on this.

@beraldoleal
Member Author

beraldoleal commented Dec 4, 2025

Hi, @wainersm @stevenhorsman... I addressed your first comments, and this is a first look at how it could be. For now I did only gcp, to get early blockers/no-gos from the community. Once you agree with the approach, I will work on the next providers and on the remaining parts of the chart itself; right now it covers only the provider operands.

Member

@stevenhorsman stevenhorsman left a comment


It looks like many of these commits are for updating the config-extractor, so I think it would be good to handle them as a separate PR rather than merging them into the charts one

Member


Can these changes go into a prior commit in the tool update PR? Although they are the mechanism to generate the charts, I think separating the code out will help keep this commit clean (which I think is needed, as I'm hoping to get some people with more charts experience involved in the review, and they don't need the distraction of the mechanism we use to create the templates). Does that make sense?

Member Author


Hum.. I'm splitting the PR, no problem... but I'm not sure about the Makefile changes. Those targets depend on the charts existing. They are chart-specific.

```yaml
  app.kubernetes.io/managed-by: {{ .Release.Service }}
data:
  CLOUD_PROVIDER: "{{ .Values.provider }}"
  DISABLECVM: "{{ not .Values.confidential.enabled }}"
```
Member


Hi @beraldoleal !

User can set DISABLECVM in the provider's values. In this case it should honor that value, right?

i.e. if .Values.DISABLECVM then DISABLECVM else not .Values.confidential.enabled

Then in the loop for "all" configs below, it should skip DISABLECVM otherwise DISABLECVM will be set twice.
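One way that precedence could look in the template, as a sketch only (it assumes provider settings live under a providers map, which may not match the PR's actual values layout):

```yaml
# Sketch: an explicitly set DISABLECVM wins over the value derived from confidential.enabled
{{- $provider := index .Values.providers .Values.provider }}
{{- if hasKey $provider "DISABLECVM" }}
  DISABLECVM: {{ index $provider "DISABLECVM" | quote }}
{{- else }}
  DISABLECVM: "{{ not .Values.confidential.enabled }}"
{{- end }}
```

Whatever loop copies the remaining provider keys would then need to skip DISABLECVM so it isn't emitted twice.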

Member


Actually, what I said above about the loop for "all" is wrong. DISABLECVM is in the provider-specific config. We could move it to the common "all" configs, right?

Member Author


Hi @wainersm, yes, currently DISABLECVM is not global/common code. Each provider implements its own. Because of that, I don't think I want to move it to the global/common code in this PR, otherwise this PR would turn into a refactoring of the current code. Some providers don't have it, like docker, vsphere and ibmcloud-powervs.

That said, I'm leaning towards users not needing to be aware of these internal details. Whether a flag is per-provider or common code shouldn't be exposed to users. Even if a flag is in common code with a single default, users might need different values depending on their provider scenario; for instance, PROXY_TIMEOUT or VXLAN_PORT might need different values for AWS vs libvirt. This seems like a good reason to keep the "all" abstraction out of the chart and let each provider config be self-contained and explicit. WDYT?
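As an illustration of that self-contained layout (the numbers below are made up, not recommended defaults), each provider block would carry its own copy of such settings:

```yaml
# Hypothetical values.yaml excerpt: every provider section is explicit and self-contained
providers:
  aws:
    PROXY_TIMEOUT: "5m"
    VXLAN_PORT: "4789"
  libvirt:
    PROXY_TIMEOUT: "15m"
    VXLAN_PORT: "8472"
```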

User can set DISABLECVM in the provider's values. In this case it should honor that value, right?

Yes, you are right. The reason I had confidential.enabled was as syntactic sugar: a single toggle that could set multiple related configs. For instance, confidential.enabled=true could trigger DISABLECVM=false along with other CVM-related settings.

I really would like to avoid having users set values in multiple places just to change something. However, I realized this abstraction is more relevant for downstream. I'm removing it for now; we can revisit it if a clear use case emerges.

Member


Hi @wainersm, yes, currently DISABLECVM is not global/common code. Each provider implements its own. Because of that, I don't think I want to move it to the global/common code in this PR, otherwise this PR would turn into a refactoring of the current code. Some providers don't have it, like docker, vsphere and ibmcloud-powervs.

True. DISABLECVM is not available in all providers.

That said, I'm leaning towards users not needing to be aware of these internal details. Whether a flag is per-provider or common code shouldn't be exposed to users. Even if a flag is in common code with a single default, users might need different values depending on their provider scenario; for instance, PROXY_TIMEOUT or VXLAN_PORT might need different values for AWS vs libvirt. This seems like a good reason to keep the "all" abstraction out of the chart and let each provider config be self-contained and explicit. WDYT?

This seems like a good reason to keep the "all" abstraction out of the chart ... I didn't understand what it means to be out of the chart. :(

User can set DISABLECVM in the provider's values. In this case it should honor that value, right?

Yes, you are right. The reason I had confidential.enabled was as syntactic sugar: a single toggle that could set multiple related configs. For instance, confidential.enabled=true could trigger DISABLECVM=false along with other CVM-related settings.

I really would like to avoid having users set values in multiple places just to change something. However, I realized this abstraction is more relevant for downstream. I'm removing it for now; we can revisit it if a clear use case emerges.

Cool!

```
@@ -0,0 +1,54 @@
# TODO: This pre-install hook should be avoided!
```
Member


My helm knowledge is lacking to fully understand the problem here; if you could explain a bit more...
My concern with this batch stuff, IIUC, is that we won't be able to easily test a change to peerpod-ctrl on pull requests, as it references a released version (referencing a commit might be fine, though even then it would already need to be merged).

Member Author


Actually, the plan is to eliminate this Job and pre-install hooks as well... instead, template the peerpod-ctrl resources directly in a specific peerpod-ctrl chart. That way helm uninstall properly cleans up everything, and we could test peerpod-ctrl changes in a PR just by running helm install from the branch.
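For context on why templating helps with cleanup: a hook Job like the sketch below (names and image are placeholders) applies manifests with kubectl, but anything created that way is not tracked as part of the helm release, so helm uninstall cannot remove it; resources rendered as regular chart templates are tracked and cleaned up automatically.

```yaml
# Placeholder sketch of the kind of pre-install hook Job being replaced.
# Helm can delete the Job itself via the hook-delete-policy, but whatever the
# Job creates with kubectl is invisible to the release and survives uninstall.
apiVersion: batch/v1
kind: Job
metadata:
  name: peerpod-ctrl-install   # placeholder name
  annotations:
    "helm.sh/hook": pre-install
    "helm.sh/hook-delete-policy": hook-succeeded
spec:
  template:
    spec:
      restartPolicy: Never
      containers:
        - name: apply-manifests
          image: bitnami/kubectl:latest   # placeholder image
          command: ["kubectl", "apply", "-f", "/manifests/"]
```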

Member


Got it. So peerpod-ctrl will be addressed later.

@wainersm
Member

wainersm commented Dec 8, 2025

Hi @beraldoleal !

The charts look promising. I am only concerned with the pre-install hook that calls kubectl, as I mentioned above. Done this way, it seems that helm does not add much value and actually makes the deployment more complicated; I might be wrong, as I don't know much about helm.

Other than that, could we make a fully working version for docker or libvirt so that we could run it locally to experiment with?

This is a first draft of our deployment via charts. The CoCo project is
moving from the operator to an umbrella chart that will have other
charts as dependencies, including kata-containers.

This way we can list peerpods as a dependency of that chart.

Ideally, our kustomize folder could be replaced by this chart, to avoid
duplication. But I would like to start simple with the chart first and
replace it later.

Signed-off-by: Beraldo Leal <[email protected]>
Let's run the make target on CI to check for inconsistencies between the
templates and the source code values.

Signed-off-by: Beraldo Leal <[email protected]>
