This guide provides step-by-step instructions on how to operate the Delivery ETA system. Follow the steps below to process data, perform exploratory data analysis (EDA), train models, make predictions, and evaluate the results.
- Purpose: This script processes raw data and prepares it for analysis and modeling.
- Input: Raw data files located in
data/raw/. - Output: Processed data saved in
data/processed/. - How to Run:
python src/preprocessing.py
- Purpose: This script generates a summary of the dataset, including key statistics and visualizations.
- Input: Processed data from
data/processed/. - Output: EDA results saved in
results/(e.g.,eda_summary.csv) and visualizations inresults/figures/. - How to Run:
python src/eda_summary.py
- Purpose: This script trains machine learning models using the processed data.
- Input: Processed data from
data/processed/. - Output: Trained model artifacts saved in
models/. - How to Run:
python src/train_model.py
- Purpose: This script uses the trained model to make predictions on new data.
- Input: New data or test data.
- Output: Predictions saved in
results/(e.g.,amazon_model_results.csv). - How to Run:
python src/predict.py
- Purpose: This script evaluates the performance of the trained model using metrics such as accuracy, precision, recall, etc.
- Input: Predictions and ground truth data.
- Output: Evaluation metrics saved in
results/tables/(e.g.,amazon_test_metrics.csv). - How to Run:
python src/evaluate_model.py
- Purpose: This script contains a deep learning model built from scratch for advanced predictions.
- How to Run:
python src/dl_model.py
- Ensure all dependencies are installed before running the scripts. Use the
requirements.txtfile to install them:pip install -r requirements.txt