[Granular resource limits] Add support for granular resource quotas #8662
Conversation
[APPROVALNOTIFIER] This PR is NOT APPROVED
This pull-request has been approved by: norbertcyran. The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing /approve in a comment.
Hi @norbertcyran. Thanks for your PR. I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test. Once the patch is verified, the new status will be reflected by the ok-to-test label. I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.
FYI: not ready for review yet
Force-pushed from 9a781ed to 48db6ad
Force-pushed from 48db6ad to 475cac9
Force-pushed from 475cac9 to e313251
Force-pushed from e313251 to 080fd15
Ready now
elmiko left a comment
the code is looking good to me, i have a couple questions. i like the tests too.
		continue
	}

	if limitsLeft < resourceDelta*int64(nodeDelta) {
i'm not following the math here, could you explain what resourceDelta*int64(nodeDelta) is calculating?
i might be confused about nodeDelta
nodeDelta is the number of nodes (of the same shape) to be added to the cluster, and resourceDelta is the quantity of a specific resource in a node of that shape. For instance, if we want to add 3 nodes with 4 CPU each, resourceDelta*int64(nodeDelta) will evaluate to 12. This condition basically checks whether adding 12 CPUs to the cluster would exceed the limit.
Perhaps it would be cleaner to call these nodesToBeAdded and resourcesToBeAdded or something similar. However, I'm thinking about adding support for negative deltas later on to remove duplication in the scale down logic (https://github.com/kubernetes/autoscaler/blob/master/cluster-autoscaler/core/scaledown/planner/planner.go#L164, https://github.com/kubernetes/autoscaler/blob/master/cluster-autoscaler/core/scaledown/resource/limits.go).
I can add some comments to clarify what deltas mean, unless you have other suggestions?
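To make the 3-node example concrete, here is a minimal sketch of the check; limitsLeft, resourceDelta and nodeDelta are the names from the snippet above, while the wrapping function and package name are only illustrative, not code from this PR.

package resourcequotas // assumed package name, for illustration only

// exceedsQuota is a hypothetical helper mirroring the condition quoted above.
// nodeDelta is the number of identically-shaped nodes the scale-up would add,
// and resourceDelta is the amount of one resource (e.g. CPU) contributed by a
// single node of that shape, so resourceDelta*int64(nodeDelta) is the total
// amount of that resource the scale-up would add to the cluster.
func exceedsQuota(limitsLeft, resourceDelta int64, nodeDelta int) bool {
	// Example: 3 nodes with 4 CPU each -> resourceDelta = 4, nodeDelta = 3,
	// so the scale-up is rejected unless at least 12 CPUs are still allowed.
	return limitsLeft < resourceDelta*int64(nodeDelta)
}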
this is great, thank you for the explanation. it makes sense to me now.
Perhaps it would be cleaner to call these nodesToBeAdded and resourcesToBeAdded or something similar.
i like this, perhaps names that are more descriptive with what is planned next, but this would definitely help with readability.
I can add some comments to clarify what deltas mean, unless you have other suggestions?
i think changing the variable names would help, and i also like having more comments here. i think even something as brief as what you described here would be helpful.
// NewQuotasTracker calculates resources used by the nodes for every
// quota returned by the Provider. Then, based on usages and limits it calculates
// how many resources can still be added to the cluster. Returns a Tracker object.
func (f *TrackerFactory) NewQuotasTracker(ctx *context.AutoscalingContext, nodes []*corev1.Node) (*Tracker, error) {
just a question out of curiosity: is the intention that a new Tracker will be created on each scan interval of the core?
yes, it will probably be created here, replacing the legacy logic: https://github.com/kubernetes/autoscaler/blob/master/cluster-autoscaler/core/scaleup/orchestrator/orchestrator.go#L124
Performance-wise it's not ideal, but it's not very different from the current logic, except that the loop over nodes will be repeated for every quota. Still, the complexity will be negligible compared to scheduling simulations and bin-packing. Ideally we'd have a goroutine updating the tracker state in the background, but that seems like a lot of effort with many consistency-related edge cases. At this point I would say that would be a premature optimization, but we might want to improve it in the future.
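For context, a rough sketch of what per-iteration creation could look like at the call site. Only the NewQuotasTracker signature quoted above comes from this PR; the orchestrator type, its field names, and the resourcequotas import path are assumptions made purely for illustration.

package orchestrator

import (
	corev1 "k8s.io/api/core/v1"

	"k8s.io/autoscaler/cluster-autoscaler/context"
	// Assumed import path for the new quotas package introduced in this PR.
	"k8s.io/autoscaler/cluster-autoscaler/resourcequotas"
)

// scaleUpOrchestrator is a hypothetical stand-in for the real orchestrator type.
type scaleUpOrchestrator struct {
	trackerFactory *resourcequotas.TrackerFactory
}

// newTracker builds a fresh Tracker from the current node list on every
// scale-up attempt, replacing the legacy cluster-wide limits computation.
func (o *scaleUpOrchestrator) newTracker(ctx *context.AutoscalingContext, nodes []*corev1.Node) (*resourcequotas.Tracker, error) {
	return o.trackerFactory.NewQuotasTracker(ctx, nodes)
}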
got it, thank you for the explanation =)
What type of PR is this?
/kind feature
What this PR does / why we need it:
This PR is part of the granular resource limits initiative (#8703). It implements the foundation for the new resource quotas system. The legacy system supports only cluster-wide resource limits coming from the cloud provider. This PR introduces the possibility of providing multiple quotas that can apply to different subsets of nodes.
For now, the new package is not integrated with the rest of the codebase. This is done on purpose to safely ship the new system in smaller chunks. Therefore, this PR does not introduce any user-facing changes.
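As a rough illustration of what "multiple quotas that can apply to different subsets of nodes" could look like, here is a sketch of the two central shapes; the real types live in the new package and may differ, and every name below is a guess rather than the actual API.

package resourcequotas // assumed package name

import (
	corev1 "k8s.io/api/core/v1"
	"k8s.io/apimachinery/pkg/api/resource"
)

// Quota is a hypothetical shape for a single granular quota: a set of resource
// limits plus a predicate selecting the nodes the quota applies to.
type Quota struct {
	ID        string
	Limits    map[corev1.ResourceName]resource.Quantity
	AppliesTo func(node *corev1.Node) bool
}

// Provider returns all quotas to enforce. The legacy behaviour (one
// cluster-wide limit from the cloud provider) corresponds to a Provider that
// returns a single Quota matching every node.
type Provider interface {
	Quotas() ([]*Quota, error)
}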
Which issue(s) this PR fixes:
Part of #8703.
Special notes for your reviewer:
This PR ended up larger than I expected. Caching of node deltas, support for storage and ephemeral storage, and integration with scale up and scale down will be implemented in the next PRs. See the proposal #8702 for more details.
Does this PR introduce a user-facing change?
Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.: