Cache GeoTIFF metadata to skip remote reads on rebuild#19
Conversation
Replace subprocess-based check_geotiff_cog() with geotiff_extract_metadata() that extracts CRS, bounds, shape, and transform via rasterio in one read. Add item_create_from_cache() to build pystac Items from cached metadata without any network I/O. Full rebuild drops from ~5.5 hours to minutes once cache is populated. Closes #10 Relates to NewGraphEnvironment/sred#8 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
ff03f89 to
3ad5ac4
Compare
ReviewBug:
|
Summary
rio cogeo validatewith rasterio-basedgeotiff_extract_metadata()that extracts spatial metadata (CRS, bounds, shape, transform) + validates COG status in one remote readitem_create_from_cache()to build pystac Items from cached metadata with zero network I/Ostac_geotiff_checks.csvwith spatial columns (epsg, height, width, transform, bounds) — backward compatible with old-format rowsitem_create.pyanditem_reprocess.pyto use cache hit path, fall back to rio_stac on missPerformance
Test plan
Relates to NewGraphEnvironment/sred-2025-2026#8
🤖 Generated with Claude Code