A machine learning project that trains and evaluates a fake news classification model on text data using Jupyter notebooks. The workflow demonstrates best practices for reproducible ML development — data preprocessing, feature engineering, cross-validation, and pipeline-based model training.
```
FakeNews-Detection/
│
├── data/                        # Dataset (download separately)
├── notebooks/
│   ├── Untitled.ipynb           # Baseline workflow
│   └── Untitled_optimized.ipynb # Reproducible + CV/Pipeline version
├── requirements.txt             # Project dependencies
└── README.md                    # Documentation
```
- Data loading & cleaning
- Feature engineering & encoding
- Stratified train/validation/test splits
- Model training with scikit-learn Pipelines (avoiding leakage)
- Cross-validated evaluation (Accuracy, F1, Confusion Matrix)
- Optional hyperparameter search (GridSearchCV / RandomizedSearchCV)
- Reproducibility with fixed seeds
- Lightweight profiling/timers for faster iterations
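The stratified-split, pipeline, and cross-validation steps above can be sketched as follows. This is a minimal illustration on synthetic data, not the notebooks' actual code; the `text`/`label` column names and the logistic-regression baseline are assumptions — the real workflow loads the Kaggle CSV and may use other models.

```python
import numpy as np
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score, train_test_split
from sklearn.pipeline import Pipeline

SEED = 42  # fixed seed for reproducibility
np.random.seed(SEED)

# Tiny synthetic stand-in for the Kaggle data (column names are assumptions)
df = pd.DataFrame({
    "text": ["breaking shocking news"] * 10 + ["official report released"] * 10,
    "label": [1] * 10 + [0] * 10,
})

# Stratified hold-out split keeps class ratios consistent across train/test
X_train, X_test, y_train, y_test = train_test_split(
    df["text"], df["label"], test_size=0.2,
    stratify=df["label"], random_state=SEED,
)

# Putting the vectorizer inside the Pipeline means it is re-fit on each
# CV fold's training portion only, avoiding train/validation leakage
pipe = Pipeline([
    ("tfidf", TfidfVectorizer()),
    ("clf", LogisticRegression(max_iter=1000, random_state=SEED)),
])

cv = StratifiedKFold(n_splits=4, shuffle=True, random_state=SEED)
scores = cross_val_score(pipe, X_train, y_train, cv=cv, scoring="f1")
print(f"CV F1: {scores.mean():.3f} +/- {scores.std():.3f}")
```

The same `pipe` object can be dropped directly into `GridSearchCV` or `RandomizedSearchCV` for the optional hyperparameter search, keeping the leakage-safe behavior.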
This project uses the Fake News Dataset from Kaggle:
🔗 Fake News Dataset (Kaggle)
1. Clone the repo

```bash
git clone https://github.com/GauravP1101/FakeNews-Detection.git
cd FakeNews-Detection
```
2. Create & activate a virtual environment

```bash
python -m venv .venv

# Windows
.venv\Scripts\activate

# macOS/Linux
source .venv/bin/activate
```
3. Install dependencies

```bash
pip install -r requirements.txt
```

If requirements.txt is missing, you can start with:

```bash
pip install jupyter numpy pandas scikit-learn matplotlib seaborn xgboost lightgbm catboost
pip freeze > requirements.txt
```
4. Run the notebooks

```bash
jupyter notebook
```