Skip to content

SPARTANX21/EDA-Sugarcane-Production-in-World

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Project Title: Exploratory Data Analysis (EDA) - Sugarcane Production in World

Overview:

This GitHub repository contains an EDA project focusing on the production of sugarcane across various countries worldwide. The project utilizes Python libraries such as Pandas, NumPy, Seaborn, and Matplotlib for data manipulation, visualization, and analysis.

Key Features:

  1. Data Cleaning:

    • Formats data columns to standardize numerical values.
    • Renames columns for clarity and consistency.
    • Handles missing values by dropping rows with null values.
  2. Analysis:

    • Visualizes the percentage share of sugarcane production by continent.
    • Explores the relationship between acreage and production of sugarcane.
    • Identifies outliers and distributions in numerical columns.
    • Determines the top sugarcane-producing countries based on various metrics.
    • Calculates correlation between different columns using a heatmap.
  3. Visualization:

    • Utilizes pie charts, bar plots, line plots, violin plots, and box plots to represent data insights.
    • Incorporates Seaborn styles for aesthetic visualizations.
    • Provides detailed labels and titles for clarity.
  4. Documentation:

    • Includes detailed code comments for each step of the analysis.
    • Utilizes Markdown format for structured documentation.

Project Structure:

  • Data Cleaning: Cleansing and formatting the dataset.
  • Analysis: Exploration and visualization of sugarcane production data.
  • Visualization: Visual representation of data insights.
  • Documentation: Detailed explanation of each code segment.

Sources:

How to Use:

  1. Clone this repository to your local machine.
  2. Ensure you have Python and the required libraries installed.
  3. Run the Python script to execute the analysis.
  4. Explore visualizations and insights obtained from the project.

Contributions:

  • Contributions to improve code efficiency, add new analyses, or enhance visualizations are welcome.
  • Fork the repository, make changes, and submit a pull request for review.

Author:

  • Pranay Shah

About

This repository contains a Jupyter Notebook that performs an analysis of sugarcane production data using Python and popular data analysis libraries such as Pandas, Seaborn, and Matplotlib

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors