tests: filter out outliers in performance tests#1788

Open
peaBerberian wants to merge 1 commit into dev from perf-tests-improv

Conversation

@peaBerberian
Collaborator

For multiple years now, we have run performance tests on each PR to detect performance regressions in some key scenarios (load, seek, track switching).

They should be able to catch genuinely large regressions, but it bothers me that they sometimes seem to detect, with high confidence, a very minor regression in the "cold loading multithread" scenario.

That scenario could be particularly sensitive to test ordering and to optimizations made by the browser's cache.

So here I'm experimenting with some strategies to limit possible bias in our performance tests:

  • I run more test iterations. We previously hit what seems to be a CI limitation when launching the browser 128 times; I want to check whether that is still the case, as it's limiting.

  • I remove the 10% most extreme outliers from all samples, for both the previous state and the current state. That may be enough to remove the difference in our cold-loading test.

  • I added a function that tries to detect ordering bias.
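The outlier-trimming step described above could be sketched like this (a minimal illustration, not this PR's actual implementation; the `trimOutliers` name and the symmetric drop of `ratio / 2` from each end are assumptions):

```typescript
/**
 * Drop the most extreme measurements before comparing runs.
 * Hypothetical sketch: removes the lowest and the highest `ratio / 2`
 * fraction of a sorted copy, i.e. 10% of all samples by default.
 */
function trimOutliers(samples: number[], ratio: number = 0.1): number[] {
  const sorted = [...samples].sort((a, b) => a - b);
  const toDrop = Math.floor((sorted.length * ratio) / 2);
  return sorted.slice(toDrop, sorted.length - toDrop);
}
```

Trimming both tails (rather than only the highest values) keeps the comparison symmetric: a suspiciously fast sample biased by a warm cache is discarded just like a suspiciously slow one.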

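One simple way such an ordering-bias check could work (a hypothetical sketch; the PR's actual function is not shown here) is to compare the first half of the collected samples against the second half:

```typescript
/**
 * Relative drift between the first and second half of a run's samples.
 * A large magnitude suggests that measurement order (e.g. browser
 * caches warming up across iterations) is influencing the results.
 * Hypothetical sketch, not this PR's actual implementation.
 */
function orderingBiasRatio(samples: number[]): number {
  const avg = (xs: number[]): number =>
    xs.reduce((a, b) => a + b, 0) / xs.length;
  const half = Math.floor(samples.length / 2);
  const firstHalf = avg(samples.slice(0, half));
  const secondHalf = avg(samples.slice(half));
  return (secondHalf - firstHalf) / firstHalf;
}
```

A ratio close to 0 means both halves measured about the same; a strongly negative ratio (second half faster) would hint at warm-up effects biasing whichever variant ran first.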
@github-actions

✅ Automated performance checks have passed on commit 99b9af7ff62e331e7c354904c6792c468e7e625c with the base branch dev.

Performance tests 1st run output

No significant change in performance for tests:

| Name | Mean | Median |
| --- | --- | --- |
| loading | 24.11ms -> 24.10ms (0.003ms, z: 0.50429) | 36.00ms -> 35.85ms |
| seeking | 308.40ms -> 311.83ms (-3.435ms, z: 0.30503) | 17.25ms -> 17.10ms |
| audio-track-reload | 32.16ms -> 32.22ms (-0.053ms, z: 1.63560) | 48.15ms -> 48.30ms |
| cold loading multithread | 51.26ms -> 50.45ms (0.814ms, z: 24.35218) | 76.65ms -> 75.60ms |
| seeking multithread | 13.72ms -> 13.68ms (0.044ms, z: 1.58353) | 20.55ms -> 20.40ms |
| audio-track-reload multithread | 30.29ms -> 30.13ms (0.160ms, z: 5.85138) | 45.15ms -> 44.85ms |
| hot loading multithread | 20.12ms -> 20.00ms (0.123ms, z: 6.53893) | 30.00ms -> 29.85ms |
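For reference, a z value like the ones in the table can be derived from a two-sample comparison of means. A minimal sketch (the exact statistic computed by the CI may differ; `zScore` and its formula here are assumptions):

```typescript
// Arithmetic mean of a sample set.
const mean = (xs: number[]): number =>
  xs.reduce((a, b) => a + b, 0) / xs.length;

// Population variance of a sample set.
const variance = (xs: number[]): number => {
  const m = mean(xs);
  return xs.reduce((a, b) => a + (b - m) ** 2, 0) / xs.length;
};

/**
 * Two-sample z statistic on the difference of means: how many standard
 * errors apart the base and current runs are. Hypothetical sketch; the
 * CI's exact formula is not shown in this PR.
 */
function zScore(base: number[], current: number[]): number {
  const se = Math.sqrt(
    variance(base) / base.length + variance(current) / current.length,
  );
  return se === 0 ? 0 : Math.abs(mean(current) - mean(base)) / se;
}
```

This illustrates why the "cold loading multithread" row stands out: its z of 24.35 means the means differ by many standard errors, i.e. high statistical confidence in what is nevertheless a sub-millisecond difference — exactly the kind of result that outlier trimming and ordering-bias checks are meant to sanity-check.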

@canalplus canalplus deleted a comment from github-actions Bot Feb 27, 2026
@peaBerberian peaBerberian added the Priority: 3 (Low) This issue or PR has a low priority. label Apr 2, 2026