Understand the outlier benchmarks on 3.14 (main) vs. 3.13.0

As suggested in the last sync meeting, we should understand why some of the benchmarks regressed and others progressed. There are three possible outcomes for each:

1. The benchmark is poorly designed
2. There are low-hanging fixes in CPython to reduce the regression
3. We are reasonably comfortable with the regression given improvements elsewhere

I think as a first pass, we should just try to classify each benchmark along these lines. After that, we should fix CPython first (where possible) and fix the benchmarks at a lower priority.

For the progressions, the findings may just be a source of WHATSNEW content.

Let's crowdsource this where possible, reporting back to the checklist below.

Using the [last weekly as a guide](https://github.com/faster-cpython/benchmarking/blob/main/results/bm-20250419-3.14.0a7%2B-71da68d/bm-20250419-linux-x86_64-python-71da68d5887b6c058907-3.14.0a7%2B-71da68d-vs-3.13.0.svg), the statistically significant regressions are below.  For longitudinal details, see the [plot of benchmark performance](https://github.com/faster-cpython/ideas/issues/726#issuecomment-2829028656) over time below.

- [ ] subparsers, many_optionals (argparse)
- [x] python_startup / python_startup_no_site
- [x] json_dumps / json_loads
- [ ] mako
- [ ] nbody
- [ ] coroutines
- [ ] typing_runtime_protocols
- [ ] fannkuch
- [ ] deltablue
- [ ] shortest_path (networkx)
- [ ] pickle_pure_python

The most statistically significant progressions are:

- [x] mdp (tuple hash caching provided a major speedup)
- [x] deepcopy / deepcopy_memo
- [ ] go
- [ ] regex / regex_effbot / regex_v8
- [ ] float
- [ ] pylint
- [ ] spectral_norm
- [ ] richards / richards_super
- [ ] xml_etree_parse
- [ ] dulwich_log
- [ ] tomli_loads
- [ ] genshi_text
- [ ] 2to3
- [x] async stuff
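
For anyone picking up an item from the lists above and re-running a single benchmark locally, the sketch below shows one way to label a result as a regression or progression from raw timing samples. It is an illustration only: the `classify` function, the Welch's t statistic it uses, and the `threshold=2.0` cutoff are all assumptions made for this example, not the method used to produce the weekly plot linked above.

```python
import math
import statistics


def classify(base, head, threshold=2.0):
    """Label a benchmark from timing samples (seconds) of two builds.

    `base` and `head` are lists of wall-clock timings, e.g. from 3.13.0
    and from main. `threshold` is a hypothetical cutoff on the absolute
    Welch's t statistic; it is an assumption for this sketch.
    Returns (label, ratio) where ratio = mean(head) / mean(base).
    """
    mean_base = statistics.mean(base)
    mean_head = statistics.mean(head)

    # Standard error of the difference of means (unequal variances).
    se = math.sqrt(
        statistics.variance(base) / len(base)
        + statistics.variance(head) / len(head)
    )

    diff = mean_head - mean_base
    if se == 0:
        # Identical samples within each build: any difference is "infinite".
        t = 0.0 if diff == 0 else math.copysign(math.inf, diff)
    else:
        t = diff / se

    ratio = mean_head / mean_base
    if abs(t) < threshold:
        return "no significant change", ratio
    # Timings are durations, so a larger mean on head is a slowdown.
    return ("regression" if t > 0 else "progression"), ratio


if __name__ == "__main__":
    # Made-up sample data, purely to show the call shape.
    label, ratio = classify([1.00, 1.01, 0.99, 1.00], [1.10, 1.11, 1.09, 1.10])
    print(f"{label}: head/base = {ratio:.2f}")
```

A label from a check like this only says whether a difference is distinguishable from noise on one machine. Deciding which of the three outcomes above applies still requires looking at what the benchmark actually exercises.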


