🤖 Machine Learning App

This application offers a simple, hands-on walkthrough of creating an ML model and releasing a web application that uses it.

Workflow

Model Creation: A Random Forest model is trained on a dataset of penguins and then saved with the joblib library for easy integration into the Streamlit app.
Model Integration in Streamlit: The saved model is loaded and integrated into the Streamlit application, allowing for interactive prediction based on user-selected features.
Deployment: The application is deployed on Streamlit Community Cloud, providing an easily accessible, hands-on experience of the complete model deployment pipeline.
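The first two workflow steps can be sketched as follows. This is a minimal illustration, not the repository's actual training script: the function names `train_and_save` and `load_model`, the hyperparameters, the file name `demo_model.pkl`, and the synthetic stand-in data are all assumptions made for the example.

```python
import tempfile
from pathlib import Path

import joblib
import numpy as np
from sklearn.ensemble import RandomForestClassifier


def train_and_save(X, y, model_path):
    """Fit a Random Forest classifier and persist it with joblib."""
    # Hyperparameters here are illustrative, not the project's settings.
    model = RandomForestClassifier(n_estimators=50, random_state=0)
    model.fit(X, y)
    joblib.dump(model, model_path)
    return model


def load_model(model_path):
    """Reload the persisted model, as a web app might do at startup."""
    return joblib.load(model_path)


# Synthetic stand-in data (NOT the penguin dataset):
# two numeric features, three well-separated classes of 30 rows each.
rng = np.random.default_rng(0)
centers = np.repeat([[0, 0], [5, 5], [10, 0]], 30, axis=0)
X_demo = rng.normal(size=(90, 2)) + centers
y_demo = np.repeat(["A", "B", "C"], 30)

with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "demo_model.pkl"  # hypothetical file name
    trained = train_and_save(X_demo, y_demo, path)
    restored = load_model(path)

# The reloaded model should reproduce the original model's predictions.
same_predictions = bool(
    (trained.predict(X_demo) == restored.predict(X_demo)).all()
)
```

Saving the fitted model to a file keeps training out of the web app: the app only needs to load the file and call `predict`.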

Demo App

Demo: https://yh-machine-learning.streamlit.app/

What This App Does

This app lets users explore the predictions of a Random Forest model trained on the Palmer Penguins dataset. The model predicts a penguin's species from features such as island, bill measurements, flipper length, body mass, and sex. Users can change the feature values in the app to see how they influence the model's predictions.
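One way to turn user-selected values into model input is to build a single-row table with the categorical features one-hot encoded. The sketch below assumes that layout; the function name `build_input_row` and the column names are hypothetical and may not match the columns the saved model was trained on. In the app, the argument values would come from the interactive controls.

```python
import pandas as pd

# Category values as listed in the dataset description.
ISLANDS = ["Biscoe", "Dream", "Torgersen"]
SEXES = ["male", "female"]


def build_input_row(island, bill_length_mm, bill_depth_mm,
                    flipper_length_mm, body_mass_g, sex):
    """Return a one-row DataFrame with one-hot encoded island and sex.

    Column names are assumptions made for this sketch.
    """
    if island not in ISLANDS:
        raise ValueError(f"unknown island: {island!r}")
    if sex not in SEXES:
        raise ValueError(f"unknown sex: {sex!r}")

    row = {
        "bill_length_mm": float(bill_length_mm),
        "bill_depth_mm": float(bill_depth_mm),
        "flipper_length_mm": float(flipper_length_mm),
        "body_mass_g": float(body_mass_g),
    }
    # One indicator column per category value.
    for name in ISLANDS:
        row[f"island_{name}"] = int(island == name)
    for name in SEXES:
        row[f"sex_{name}"] = int(sex == name)
    return pd.DataFrame([row])


# Example values chosen arbitrarily for illustration.
example = build_input_row("Dream", 45.0, 17.0, 200, 4200, "female")
```

The resulting row can be passed to a fitted classifier's `predict` or `predict_proba`, provided its columns match the training columns in name and order.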

About Dataset

Artwork by @allison_horst (https://github.com/allisonhorst)

The model is trained using the Palmer Penguins dataset, a widely recognized dataset for practicing machine learning techniques. This dataset provides information on three penguin species (Adelie, Chinstrap, and Gentoo) from the Palmer Archipelago in Antarctica. Key columns include:

Species: The species of the penguin (Adelie, Chinstrap, Gentoo). This is the label the model predicts.
Island: The specific island where the penguin was observed (Biscoe, Dream, Torgersen).
Bill Length: The length of the penguin's bill (mm).
Bill Depth: The depth of the penguin's bill (mm).
Flipper Length: The length of the penguin's flipper (mm).
Body Mass: The mass of the penguin (g).
Sex: The sex of the penguin (male or female).

This dataset is sourced from Kaggle. Its mix of categorical and numeric features makes it a good choice for building a classification model and for studying the importance of each feature in species prediction.
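Feature importance can be read from a fitted Random Forest through scikit-learn's `feature_importances_` attribute. The sketch below shows the idea on synthetic data rather than the penguin dataset; the helper `rank_feature_importances` and the column names `informative` and `noise` are assumptions made for the example.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier


def rank_feature_importances(model, feature_names):
    """Return impurity-based importances, sorted from most to least important."""
    importances = pd.Series(model.feature_importances_, index=feature_names)
    return importances.sort_values(ascending=False)


# Synthetic stand-in data: one column tracks the class label closely,
# the other is pure noise.
rng = np.random.default_rng(0)
labels = np.repeat([0, 1, 2], 50)
X = pd.DataFrame({
    "informative": labels * 5.0 + rng.normal(size=150),
    "noise": rng.normal(size=150),
})

model = RandomForestClassifier(n_estimators=50, random_state=0)
model.fit(X, labels)

ranking = rank_feature_importances(model, X.columns)
```

These importances are impurity-based and normalized to sum to 1, so they show relative contribution within one model rather than an absolute effect size.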

Technology Stack

Python
Streamlit: For creating the web application interface.
scikit-learn: For training the Random Forest model and running predictions with it.
NumPy & Pandas: For data manipulation and processing.
Matplotlib & Seaborn: For generating visualizations.

