Geospatial data scientist. Spatial machine learning, remote sensing, and reproducible analysis pipelines. M.S. Geography (GIS and remote sensing), Texas State University.
I build models on spatial data and care about whether they actually hold up: spatial cross-validation, honest reporting of where a model stops working, and code someone else can run. My work spans energy siting, water resources, land-cover change, and urban environment, applied across the US and Nepal.
Open to GIS Analyst, GIS Developer, and geospatial data science roles. US, open to relocation and remote.
Where Solar Gets Built (and Why It Isn't About Sun) A Random Forest model of utility-scale solar siting in Texas, with a North Carolina transfer test. Finding: distance to transmission infrastructure predicts siting roughly an order of magnitude better than solar irradiance. Spatial cross-validated ROC-AUC 0.92, with a full validation ladder and a cross-state transfer test that localizes exactly where the model stops generalizing. Python, scikit-learn, GeoPandas, Earth Engine, SHAP
Watershed Land-Cover Simulation (Markov-CA) A Markov chain and Cellular Automata model that learns 2010 to 2020 land-cover transitions and forecasts where San Antonio grows by 2030, validated at Figure of Merit 0.85 against a persistence baseline. Python, ArcGIS Pro, NLCD
Rooftop Solar Across Data Environments The same rooftop-solar method run on 1 m airborne LiDAR and a 30 m open DSM, quantifying how input-data resolution systematically biases solar estimates. Python, ArcGIS Pro, USGS 3DEP, Copernicus GLO-30
Austin Urban Heat Island Story Map Summer land surface temperature across 65 Austin neighborhoods, measured in Earth Engine and joined to census income through a PostGIS spatial join. Canopy and heat correlate at r = -0.87. Earth Engine, PostGIS, R, Leaflet
More projects at nirajan550123.github.io.
Modeling and data: Python (scikit-learn, GeoPandas, Rasterio, pandas), R (tidyverse, sf), spatial cross-validation, SHAP Remote sensing: Google Earth Engine, Landsat and Sentinel, NDVI/LST, land-cover classification Spatial and database: PostgreSQL/PostGIS, spatial SQL, ArcGIS Pro, ArcPy, QGIS Delivery: Leaflet, GitHub Pages, reproducible project structure