perf: use chunks="auto" in a single thread#2137
Conversation
- `pixi run generate-config`
|
very good! we should also check this for higher resolution and I would vote for removing the spoiling logs, they don't make much sense with parallel and fast execution. #2135 is still a bit faster with 248 secs but I am fine with dropping this for the sake of less code in atlite. |
It's quite possible that my computer is slower. My master run also took a couple more minutes :) |
|
I guess its mostly because of the the raster cache that I implemented in the atlite branch which makes matrix calculations ~70% faster. but this is nothing urgent now. |
|
For more clusters: memory consumption is quite manageable even for 1024 clusters (~ 6 GB for solar-hsat) and the higher number of clusters does not affect runtime negatively. Runtime statistics for 512 clusters: On master: On this branch: |
|
I would say, let's merge this in |
|
Not sure what this failing pypsa-app:config.validator.yaml action is? @lkstrp |
Yeah just ignore, you don't have access yet. I think it's a stability problem on the cluster and nothing in here, but need to check. |
Changes proposed in this Pull Request
Alternative to #2135 and #2136 , that simply relies xarray's automatic chunk selection which falls back to the netcdf stored chunks and picks nprocesses=1 in the default xarray backend to clock in similar speed improvements as #2135 .
on master:
on this perf branch:
ie. 24min reduces to 5min (out of which 21s are snakemake's dag generation).
Checklist
Required:
doc/release_notes.rst.If applicable:
scripts/lib/validation.doc/*.rstfiles.