Medical Insurance Cost Prediction

Predicted vs Actual Log Charges

Linear Regression model performance: predicted vs. actual charges on a log scale.

Project Overview

This project predicts individual medical insurance costs from demographic and health data. Because insurance charges are right-skewed, a log transformation is applied to the target variable (charges) before training to improve model accuracy.
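To illustrate why the log transformation helps, here is a minimal sketch that compares skewness before and after applying np.log(). It uses a synthetic, right-skewed series as a stand-in for the charges column, so the numbers are illustrative only and are not taken from the project's dataset.

```python
import numpy as np
import pandas as pd

# Synthetic right-skewed values standing in for the `charges` column.
# (Assumption: this is NOT the real insurance data.)
rng = np.random.default_rng(0)
charges = pd.Series(rng.lognormal(mean=9.0, sigma=0.9, size=1000), name="charges")

# Log-transform the target, as described in the overview.
log_charges = np.log(charges)

# Sample skewness: strongly positive before, close to zero after.
skew_before = charges.skew()
skew_after = log_charges.skew()
print(f"skew before: {skew_before:.2f}, skew after: {skew_after:.2f}")

# Values on the log scale can be mapped back to the original scale with np.exp.
restored = np.exp(log_charges)
```

A model trained on the transformed target predicts log charges, so np.exp() is needed to report predictions in the original currency units.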

Key Objectives:

Analyze the impact of features like age, bmi, and smoker on total charges.
Train a Linear Regression model using Scikit-Learn.
Evaluate the model using standard regression metrics.

Model Performance

Based on the final evaluation, the model achieved the following results:

R² Score: 0.894
Mean Absolute Error (MAE): 0.21
Mean Squared Error (MSE): 0.098
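The three metrics above can be computed with Scikit-Learn as sketched below. The sketch assumes the metrics were evaluated on log-transformed charges, which the magnitudes of the MAE and MSE suggest but the README does not state outright. The helper name log_scale_metrics and the sample arrays are hypothetical and exist only for illustration.

```python
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score


def log_scale_metrics(y_true_log, y_pred_log):
    """Return R², MAE, and MSE for targets and predictions on the log scale."""
    return {
        "r2": r2_score(y_true_log, y_pred_log),
        "mae": mean_absolute_error(y_true_log, y_pred_log),
        "mse": mean_squared_error(y_true_log, y_pred_log),
    }


# Made-up log-scale values, used only to show the call pattern.
y_true_log = np.array([8.0, 9.0, 10.0, 11.0])
y_pred_log = np.array([8.2, 8.9, 10.1, 10.8])
metrics = log_scale_metrics(y_true_log, y_pred_log)
print(metrics)

# Rough reading of the reported values, under the log-scale assumption:
# an absolute error of 0.21 in log units is a multiplicative factor of exp(0.21).
typical_factor = np.exp(0.21)
# The RMSE is the square root of the reported MSE.
rmse_log = np.sqrt(0.098)
print(f"exp(MAE) = {typical_factor:.3f}, RMSE = {rmse_log:.3f}")
```

Under that assumption, an MAE of 0.21 means a typical prediction is off by a factor of about 1.23 on the original scale, which is easier to interpret than the raw log-unit figure.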

Project Workflow

Data Cleaning: Handled duplicates and verified no missing values existed.
Feature Engineering:
- Encoded categorical variables (sex, smoker, region).
- Applied np.log() to the charges column to normalize the distribution.
Training: Split the data into training and testing sets.
Prediction: Generated predictions on the log-scale and visualized them against actual values.
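The workflow above can be sketched end to end as follows. This is not the notebook's code: it runs on synthetic data generated by a hypothetical helper (make_synthetic_insurance), and the one-hot encoding, the 80/20 split, and the random seed are assumptions, since the README does not specify them. To run it on the real data, the synthetic frame would be replaced with pd.read_csv("dataset/insurance.csv").

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split


def make_synthetic_insurance(n=500, seed=0):
    """Build a synthetic stand-in with the column names mentioned in the README."""
    rng = np.random.default_rng(seed)
    df = pd.DataFrame({
        "age": rng.integers(18, 65, size=n),
        "sex": rng.choice(["female", "male"], size=n),
        "bmi": rng.normal(30, 6, size=n).round(1),
        "smoker": rng.choice(["yes", "no"], size=n, p=[0.2, 0.8]),
        "region": rng.choice(
            ["northeast", "northwest", "southeast", "southwest"], size=n
        ),
    })
    # Invented relationship, chosen only so the example has some signal.
    log_charges = (
        7.0
        + 0.035 * df["age"]
        + 0.01 * df["bmi"]
        + 1.5 * (df["smoker"] == "yes")
        + rng.normal(0, 0.3, size=n)
    )
    df["charges"] = np.exp(log_charges)
    return df


# Data cleaning: drop duplicate rows and confirm there are no missing values.
df = make_synthetic_insurance().drop_duplicates()
missing_total = int(df.isna().sum().sum())

# Feature engineering: one-hot encode the categoricals (assumed encoding)
# and log-transform the target.
X = pd.get_dummies(
    df.drop(columns="charges"),
    columns=["sex", "smoker", "region"],
    drop_first=True,
    dtype=float,
)
y = np.log(df["charges"])

# Training: hold out 20% of the rows for testing (assumed split).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)
model = LinearRegression().fit(X_train, y_train)

# Prediction: outputs are on the log scale; np.exp maps them back to charges.
y_pred_log = model.predict(X_test)
y_pred_charges = np.exp(y_pred_log)
test_r2 = model.score(X_test, y_test)
print(f"R² on held-out synthetic data: {test_r2:.3f}")
```

Plotting y_test against y_pred_log, for example with Matplotlib's scatter, would give a predicted-vs-actual view on the log scale like the one described in the Prediction step.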

Tech Stack

Language: Python
Libraries: Pandas, NumPy, Matplotlib, Seaborn, Scikit-Learn

File Structure

dataset/insurance.csv: Input dataset.
images/linear_trend.png: Visualization of results.
medical_insurance_cost_prediction.ipynb: Complete Python code and analysis.

Let's Connect!

Nosheen Khan on LinkedIn | Nosheen Khan on Kaggle
