Ensemble Learning Models combines open source Python tools to support large scale learning on satellite imagery and other Earth science data. The current phase of NASA funding for elm aims to:
- Provide a wide variety of ensemble learning options, such as ensemble averaging, hierarchical modeling, and cross validation
- Provide a variety of spatial preprocessors
- Create a web-based user interface for common machine learning tasks with satellite imagery, such as labeling features
- Develop several spectral clustering algorithms for large data
- Promote
elmfor NASA projects and the satellite / climate science industry
The following milestones describe Phase II work over an 18 to 24 month period.
- Coordinate with NASA Goddard Space Flight Center on applications of ML
- Review the state of
elm, updates toxarray,dask,dask-learn elmpackage rename requirement (currently has a conflict with Extreme Learning Machine)
- Cross Validation and Model Quality Assurance
- Hierarchical Modeling
- Vote Count Ensemble Averaging
- Zonal Statistics and Spatial Filters
- Change Detection
- Develop 4 spectral clustering / embedding methods
- More intuitive interface for scikit-learn spectral clustering methods
- Bokeh Map Drawing Tools
- Feature Labeling Tools
- Visualizing Inputs and Predictions
- Support for xarray’s multi-file datasets
- Feature engineering options for 3-D and 4-D data sources
- Allow numpy arrays or pandas data frames in place of ElmStores
- Automated Simple Configurations of Elm
- Runtime User Interface
- Validation Through Realistic Examples
- Climatic Regions Publication
- Hardening ELM