Feature/urban heat islands#47
Conversation
Restructuring to hydra
Develop into main
New dataset structure, tests, second dataset.
Update minor README
Fix pooch and setup
|
CI tests were failing due to a |
|
Very good changes and nice documentation of changes! I made small comments in the code, and I made a change to remove the Additionally, I wonder whether it would make sense to keep only one type of predictive model and decouple the metric calculation into a separate class that can be assigned via configuration? |
|
From @gabrieletijunaityte
Yes good idea! Let's also discuss this in next meeting. |
We store these in teams for now. Butterfly UC is an exempion
What does this PR do?
Adds the Guatemala Urban Heat Islands use case (WP330) and several
framework improvements required to support it.
Framework changes (general — affect all use cases):
New: Regression model and head
src/models/predictive_model_regression.py— Lightning modulefor regression tasks (MSE loss, MAE/RMSE/R² metrics)
src/models/components/pred_heads/mlp_regression_head.py— MLPprediction head for continuous targets
New: Multimodal encoder
src/models/components/eo_encoders/multimodal_encoder.py—unified encoder replacing three separate encoders (coords-only,
tabular-only, fusion). Controlled via
use_coordsanduse_tabularflags.Fix: BaseDataModule
tabular_dimfor "feat_" propertyspatial_split_distance_mas configurable parameter(was hardcoded to 4000m)
Fix: BaseDataset
feat_*tabular columnsUse case specific changes (Guatemala LST):
scripts/make_model_ready_heat_guatemala.py— preprocessingscript with NaN handling, convert categorical features and rename columns to "feat_*"
src/data/heat_guatemala_dataset.py— dataset classBefore submitting
kept together as they are tightly coupled
ModuleNotFoundError: No module named 'pkg_resources'(setuptools missing in Python 3.12 env, exists on main branch too)