You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This project aims to help firefighters in fighting forest fires by predicting whether it will rain tomorrow or not. By doing so, the firefighters can focus more on areas that will not be rainy.
A CatBoost classification model is applied to forecast the rain status tomorrow based on the Rain in Australia dataset from Kaggle.
The project notebook includes a total of 28 experiments including:
4 Models : Random Forest, Logistic Regression, XGBoost and CatBoost.
3 Sampling Techniques: SMOTE, ADASYN and Random Oversampling.
Feature Engineering (+5 features).
MinMax Scaling.
The final selected model is the CatBoost model with:
Metric
Score
Precision
0.82
Recall
0.63
F1
0.71
Accuracy
0.89
Using ADASYN, MinMax scaling and no exta features from feature engineering.
Confusion Matrix:
AUC - ROC Curve
Note: The final model was selected based on precision since we care about reducing the false positive which is predicting that it will rain tomorrow when it will actually not. This will affect the firefighters work as they may ignore an area since it is predicted to be rainy while it's actually not, risking that it could cause fires to spread further.
We've also developed an app on Android that gets the location input by the user, uses the Weatherstack API to get weather data for that location and displays them, then predicts whether it will rain tomorrow or not.