Skip to content

Conversation

@iamdonovan
Copy link
Collaborator

@iamdonovan iamdonovan commented Nov 3, 2025

This PR addresses most of the remaining issues raised by reviewers:

  • Adds a GitHub Action to check the dataset before anything can be merged into origin/main
  • Adds a script to update the dataset based on new/updated files in dataset/contributor_files, scripts/update_dataset.py
  • Updates contributing instructions based on level of familiarity/comfort with git/github
  • Adds a table, script showing a confusion matrix of mapping conflicts (Table 4)
  • Adds a figure, script demonstrating two glaciers with mapping conflicts (Fig. 9)
  • Fixes a few additional "level" uses and old category values

Additional changes/updates:

  • fixed DOI link for glaciers in regions 13, 15

- change legend label to 'category'
- decrease font size
add zenodo badge
update badge to point to latest doi
- print confusion matrix as .tex
- print information about different agreements/disagreements
"mappingconflicts.csv" only has 83 rows - counting the regions, region 11 should have 9 rather than 10
- add script to plot two conflict examples
- move mapping tools to map_tools.py (scale bar, image stretch)
- add conflict examples to category_examples.csv
- add lake outlines for conflict examples to lake_outlines.gpkg
- pan-sharpen with bands 543
- fix crlf lines in pansharpen_all.py
- plot added lakes for Fig. 2b
- add Fig. 9
move from essd/ to main directory, add gitpython as a dependency
- test whether new/updated csv files have correct columns
- test whether all geopackage files (a) exist, and (b) have the correct columns
- re-name test_column_names -> test_columns
- add list of rgi regions rather than parsing filenames
- add test to compare lake_cat values in csv files and geopackage files
add tests + github action
- add github/not github sub-sections
- explain that PR will trigger checks for formatting
- start including information about (eventual) update_dataset.py script
@iamdonovan
Copy link
Collaborator Author

Note: the geopackage files still need to be updated, and I still have to add the update_dataset.py script with instructions.

- check for new/updated contributor files
- check for conflicts
- if no conflicts, merge + update gpkg files
- update summary_table.csv
@iamdonovan iamdonovan marked this pull request as ready for review November 5, 2025 00:10
@iamdonovan
Copy link
Collaborator Author

gpkg files have been updated for regions 13, 15, and I have added the update script now. Final (possible) additions will be to update the mappingconflicts.csv file with glaciers that contributors agreed on, to update the confusion matrix (Table 4).

- fix figure caption italics
- add information about the different scripts contained in the scripts folder
- fix script name
scatter plot of # of glaciers for each region, identified using global lake inventories and variable buffer around RGI terminus.

- script, .png file
- add CSV file for global comparison
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants