add flag to paginate list calls by oliviassss · Pull Request #177 · aws/amazon-network-policy-controller-k8s

oliviassss · 2025-05-16T23:27:33Z

What type of PR is this?

Which issue does this PR fix:
When the pod scale is large, the current unpaginated list pod call puts heavy load on the apiserver and can lead to apiserver timeouts.
Add a flag --list-page-size to paginate k8s list pod calls; the default value is 1000.
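
For illustration, the limit/continue loop that a page-size flag implies can be sketched as below. This is a minimal, stdlib-only model and not the code in this PR: `page`, `lister`, `fakeLister`, and `listAll` are hypothetical names, and the continue token is simulated as a plain index rather than the opaque token a real apiserver returns.

```go
package main

import (
	"fmt"
	"strconv"
)

// page is a hypothetical stand-in for one chunk of a list response.
type page struct {
	items         []string
	continueToken string // empty when there are no more pages
}

// lister is a hypothetical interface modeling a list call that accepts
// a limit and a continue token.
type lister interface {
	List(limit int, continueToken string) (page, error)
}

// fakeLister serves pod names from memory. Its token is simply the index
// of the next item (an assumption made for this sketch only).
type fakeLister struct {
	pods  []string
	calls int
}

func (f *fakeLister) List(limit int, continueToken string) (page, error) {
	f.calls++
	start := 0
	if continueToken != "" {
		n, err := strconv.Atoi(continueToken)
		if err != nil || n < 0 || n > len(f.pods) {
			return page{}, fmt.Errorf("invalid continue token %q", continueToken)
		}
		start = n
	}
	end := start + limit
	if limit <= 0 || end > len(f.pods) {
		end = len(f.pods) // no limit, or last page
	}
	p := page{items: f.pods[start:end]}
	if end < len(f.pods) {
		p.continueToken = strconv.Itoa(end)
	}
	return p, nil
}

// listAll requests pages of pageSize items until the token comes back empty.
func listAll(l lister, pageSize int) ([]string, error) {
	var all []string
	token := ""
	for {
		p, err := l.List(pageSize, token)
		if err != nil {
			return nil, err
		}
		all = append(all, p.items...)
		if p.continueToken == "" {
			return all, nil
		}
		token = p.continueToken
	}
}

func main() {
	pods := make([]string, 2500)
	for i := range pods {
		pods[i] = fmt.Sprintf("pod-%d", i)
	}
	f := &fakeLister{pods: pods}
	all, err := listAll(f, 1000)
	if err != nil {
		panic(err)
	}
	fmt.Printf("listed %d pods in %d calls\n", len(all), f.calls)
}
```

With a page size of 1000 and 2500 items, the loop issues three calls. Whether a real list call is actually split this way depends on which client serves it and on how the apiserver treats the limit, which is what the review discussion below is about.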

What does this PR do / Why do we need it:
Avoid apiserver timeouts, which may crash the controller when the pod scale is large.

If an issue # is not available please add steps to reproduce and the controller logs:

Testing done on this change:

Automation added to e2e:

Will this PR introduce any new dependencies?:

Will this break upgrades or downgrades. Has updating a running cluster been tested?:

Does this PR introduce any user-facing change?:

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

M00nF1sh

lgtm for this code alone, but I have the concerns below:

Have we verified from the audit logs that this pagination is really happening in the apiserver? Are we using a real client instead of a cached one?
- edit: I just read through the entire codebase, and from the code it seems we are using the cached client, so I'm wondering how this would work at all.
I'm curious whether there is already pagination support in client-go or controller-runtime.

M00nF1sh

k8sClient needs to be a real (uncached) client for this to work, and it seems we are using a cached one, unless I made a mistake here.
https://github.com/aws/amazon-network-policy-controller-k8s/blob/main/pkg/config/runtime_config.go#L96

oliviassss · 2025-05-17T00:35:05Z

@M00nF1sh thanks, yes it's a cached client. I'm running the controller in a cluster with 130k pods to confirm the behavior from the apiserver side.

oliviassss · 2025-05-17T07:46:51Z

i'm curious that there isn't pagination support in client-go or controller-runtime

client-go enforces a default page size of 500, but for the initial list call, with RV=0, the limit is ignored: the apiserver ignores the page size when RV=0. I saw this in a source code comment:
https://github.com/kubernetes/kubernetes/blob/master/staging/src/k8s.io/apiserver/pkg/storage/cacher/cacher.go#L693

// computeListLimit determines whether the cacher should
// apply a limit to an incoming LIST request and returns its value.
//
// note that this function doesn't check RVM nor the Continuation token.
// these parameters are validated by the shouldDelegateList function.
//
// as of today, the limit is ignored for requests that set RV == 0
func computeListLimit(opts storage.ListOptions) int64 {
	if opts.Predicate.Limit <= 0 || opts.ResourceVersion == "0" {
		return 0
	}
	return opts.Predicate.Limit
}
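
To make the effect of that check concrete, here is a small standalone sketch. It assumes a simplified, hypothetical `listOptions` struct that flattens `opts.Predicate.Limit` and `opts.ResourceVersion` into two fields; the real function takes `storage.ListOptions`.

```go
package main

import "fmt"

// listOptions is a simplified, hypothetical stand-in for storage.ListOptions,
// reduced to the two values the quoted function reads.
type listOptions struct {
	Limit           int64
	ResourceVersion string
}

// computeListLimit applies the same condition as the quoted function,
// rewritten against the simplified type above.
func computeListLimit(opts listOptions) int64 {
	if opts.Limit <= 0 || opts.ResourceVersion == "0" {
		return 0
	}
	return opts.Limit
}

func main() {
	cases := []listOptions{
		{Limit: 500, ResourceVersion: "0"},     // RV=0: the limit is dropped
		{Limit: 500, ResourceVersion: "12345"}, // any other RV: the limit is kept
		{Limit: 0, ResourceVersion: "12345"},   // no limit requested
	}
	for _, c := range cases {
		fmt.Printf("limit=%d rv=%q -> effective limit %d\n",
			c.Limit, c.ResourceVersion, computeListLimit(c))
	}
}
```

A returned limit of 0 means the cacher applies no limit, so a request with limit=500 and RV=0 is answered in a single unpaginated response.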

have we tested that this pagination is really happening in apiServer from audit logs? are we use real client instead of cached one?

I confirmed this behavior on a 130k-pod cluster:

- During startup, only one list pods call is made (although limit=500, I don't see pagination, for the reason above).
- During reconciliation, only watch calls are made, no list calls (as expected, because the list is served from the cached client). Also, in reconciliation the list pod calls have a selector, so they don't list all pods, which is less of a concern. I will do more scale testing to confirm.

oliviassss · 2026-03-04T22:15:35Z

Closed, as paginated lists from the apiserver cache are not supported before Kubernetes 1.34.

oliviassss requested a review from a team as a code owner May 16, 2025 23:27

add flag to paginzate list calls

e82cd96

oliviassss force-pushed the list-page branch from 60adc96 to e82cd96 Compare May 16, 2025 23:32

M00nF1sh approved these changes May 16, 2025

M00nF1sh self-requested a review May 16, 2025 23:42

M00nF1sh suggested changes May 16, 2025

yash97 changed the title ~~add flag to paginzate list calls~~ add flag to paginate list calls May 17, 2025

oliviassss closed this Mar 4, 2026

oliviassss wants to merge 1 commit into aws:main from oliviassss:list-page
