Unlocking Yelp with AI 🚀

Welcome to Unlocking Yelp with AI, where we leverage the power of machine learning and artificial intelligence to analyze Yelp business data and predict ratings. Dive in to explore how data science can help unlock insights into customer reviews and business success. 🌟

🌟 Project Overview

This project aims to predict Yelp business ratings using a variety of machine learning models, such as Random Forest. We utilize Yelp's extensive dataset, perform Exploratory Data Analysis (EDA), clean the data, and build predictive models to understand what makes a business stand out.

Key Features:

Predict Yelp business ratings based on customer reviews and features.
Analyze key factors contributing to business ratings.
Visualize the distribution and trends in Yelp data.

📁 Repository Structure

data/: Contains raw and processed data files.
- raw/: Original Yelp dataset files (not tracked).
- processed/: Cleaned and preprocessed data files.
notebooks/: Jupyter notebooks for EDA, feature engineering, and feature selection.
models/: Stored machine learning model.
visualizations/: Images and HTML files of visualizations.
README.md: Project overview and documentation (you're here!).

🚀 Getting Started

Prerequisites

Make sure you have Python 3.8+ installed. You'll also need to install the required packages listed in requirements.txt:

pip install -r requirements.txt

Running the Project

Clone the Repository:

git clone https://github.com/your-username/Unlocking-Yelp-with-AI.git
cd Unlocking-Yelp-with-AI

Install Dependencies:
```
pip install -r requirements.txt
```
Explore the Data:
- Run the notebooks in the notebooks/ folder to explore the data and perform EDA.

🌐 Dataset

The dataset used for this project is the Yelp Academic Dataset, which contains information about businesses, reviews, and users. Due to its size, the raw data files are not included in the repository, but you can download them directly from Yelp's website or access the processed files provided.

📊 Visualizations

We have created several visualizations to help understand the data better, including:

Geographical maps showing business distribution.
Rating trends over time.
Feature correlations to understand what factors influence ratings the most.

Visualizations can be found in the visualizations/ directory.

🧠 Models

We use machine learning models like Random Forest and K-Nearest Neighbors (KNN) to predict business ratings. Model training and evaluation can be found in the notebooks/03_Modeling.ipynb notebook.

🤖 Technologies Used

Python: Core programming language.
Pandas and NumPy: For data manipulation and processing.
Scikit-learn: For machine learning models.
Matplotlib and Seaborn: For data visualization.
GeoPandas: For geographical data representation.
SciPy: For scientific and technical computing.

🤝 Contributing

We welcome contributions! Feel free to fork this repository, create a branch, and submit a pull request. For major changes, please open an issue first to discuss what you would like to change.

📧 Contact

If you have any questions or feedback, please feel free to reach out via email victoriiavu@g.ucla.edu

⭐ Acknowledgments

Yelp for providing the dataset.
Scikit-learn and Pandas for their powerful data processing and machine learning tools.

If you found this project interesting, don't forget to star ⭐ the repository!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Unlocking Yelp with AI 🚀

🌟 Project Overview

Key Features:

📁 Repository Structure

🚀 Getting Started

Prerequisites

Running the Project

🌐 Dataset

📊 Visualizations

🧠 Models

🤖 Technologies Used

🤝 Contributing

📧 Contact

⭐ Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data/processed		data/processed
models		models
notebooks		notebooks
visualizations		visualizations
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Unlocking Yelp with AI 🚀

🌟 Project Overview

Key Features:

📁 Repository Structure

🚀 Getting Started

Prerequisites

Running the Project

🌐 Dataset

📊 Visualizations

🧠 Models

🤖 Technologies Used

🤝 Contributing

📧 Contact

⭐ Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages