Skip to content

Latest commit

 

History

History
27 lines (21 loc) · 1.78 KB

File metadata and controls

27 lines (21 loc) · 1.78 KB

Student-Performance-Kaggle-Data-Set

Table of Contents

  1. Installation
  2. Project Description
  3. File Description
  4. Results
  5. Licensing, Authors, and Acknowledgements

Installation

The only libraries needed to run this code are the standard ones in Data Science: Python 3.X, Jupyter, NumPy, Pandas, Matplotlib, SKLearn, and Seaborn

Project Description

I used the Kaggle dataset about Student Performance in Math over the course of 3 years to try to better understand the following:

  1. What are the primary environmental factors that affect math performance in this dataset?
  2. What are the primary controllable factors that affect math performance?
  3. What factors actually had the largest effect on final grades?

File Description

There is the original CSV data from Kaggle, a text file describing it, and a jupyter notebook available to explore answers to each of the questions posed above.

Results

The conclusions reached can be found at the post here

Licensing, Authors, and Acknowledgements

Full credit goes to Kaggle for the dataset which is available here. This was done as part of Udemy's Data Scientist Nanodegree