CEAM 3D Object Classifier

CEAM PROJECT: A CNN ML Model built using PyTorch (TorchVision). It is a 3D Image Classifier trained to recognize three 3D shapes (cone, cube, sphere). Trained on a Dataset crafted from scratch with 300+ Images, this model is quite accurate in its predictions with an ~90% Training Accuracy and 75% Validation Accuracy.

Project Overview

The idea was to make a CNN that would differentiate between three easily recongnizable 3D shapes : a cone, a cube and a sphere.

In my earlier project on Fashion MNIST Dataset, I used TensorFlow and the Keras Library to make the Fashion Classifier. However, this project involved 3D figures of various colours, shapes, orientations and lighting conditions, in comparision to the Fashion MNIST Dataset which contained only 2D Images and greyscale. Hence I upgraded to PyTorch.

Find Dataset Here

STEP 1: DATASET CREATION

Generated solid colour 3D models renderings in Blender with a script generated using AI (I'm not familiar with Blender or the bpy module) and ran the script within Blender to produce 60 randomly generated 3D rendering of cones, cubes and spheres, all in various colours, sizes, orientations and lighting conditions.

My initial dataset consisted of 60 images (48 Training, 12 Validation). Later while running my model, it was still doing poorly even at the 3rd Trial, despite tweaking the epochs, adding an extra convolutional layer and even adding greyscale to prevent the model from cheating.

Therefore, I updated my Dataset folder to include 300+ images (60 Validation Images) which includes:

Solid Colour 3D Rendered Figures (different sizes, colours, orientations, lighting conditions)
Gradient Coloured 3D Rendered Figures (different sizes, colours, orientations, lighting conditions)
3D Figures against different coloured backgrounds (From the Internet)
Real life object images (eg. dice, rubik cubes, traffic cones, party caps, marbles, balls, etc. from the Internet)

The scripts I generated for Blender are in the repository. There are two separate scripts:

shape_generator_1.py is the solid colour script.

gradient_shape_generator.py is the gradient colour script.

STEP 2: BUILDING THE MODEL

For all the modules used in the project, please refer to Prerequisites.

First I started by creating a class called CEAMShapeNet that is basically a child class from the nn.Module class in torchvision. Initially it had on 2 Conv2D layers but I later updated it to have 3 Conv2D layers due to the lack of precision my model was showing (rather, the absence of precision. It was consistently yelling CUBE! at every shape I threw it) Then I set up the Dense Layers and defined the forward function.
Activated Dropout during trainign to prevent the model from learning shortcuts.
Set up transforms in __main__ for both Training and Validation Datasets. To prevent my model from cheating and learning only based on colour I stripped it off its ability to see colour using transforms.Grayscale. Moreover I used transforms.RandomHorizontalFLip and transforms.RandomRotation to effectively increase the dataset images without physically doing any of these operations on my actual datatset.
Used os module to map directories and load assets.
The accuracies and losses of both Training and Validation was then collected and stored in a list defined at the beginning of the __main__ to be able to produce line graphs of the same. The console also prints the raw data in real time as each Epoch completes.
At last the weights would be saved in a .pth file for the predict.py script.

It took me 6 Trials to get to the near accurate version of this model.

STEP 3: THE PREDICTOR - THE MAIN PROGRAM

predict.py script is the main script to be run. It utilizes the weights from the latest run of model.py. Here I made a new function predict_custom_image which contains all the instructions to predict any image that is uploaded from any directory (the ones I used for testing is available in Test folder.)

Make sure you write the file paths properly while testing or else you'll get an error.

STEP 4: TEST THE MODEL ON EVERY CASE

Tested the model to see where this model fails with nearly 30+ images based off different parameters from the training data. This was genuinely the fun part because I wanted to prove my initial hypothesis right, but turns out the model was smarter than I accounted it for!

Metrics and Parameters

Optimizer : Adam
Learning Rate : 0.001
Batch Size : 32
Regularization : Dropout (p=0.5) enabled during training.
Convolutional Layers : 3
Colour (RGB) : Currently disabled
Batch Shuffle : True
Epochs : 25

6th TRIAL Data:

(Epoch 25/25)

Metric	Training Phase	Validation Phase
Loss	$0.3312$	$0.6325$
Accuracy	$88.6$%	$75.0$%

Graphs:

Prerequisites

This project was built with Python 3.

Modules used in this project are:

os
torch
seaborn and matplotlib (for graphs)
PIL (for Image)

To install the modules required type this in your terminal (within PyCharm for .venv users)

pip install torch seaborn matplotlib pillow

Mac Users, please replace pip with pip3.

How to Run the Project

You need not run model.py unless ceam_shapenet_weights.pth is not found. If that is the case, run model.py first to generate the weights. Along with it you'll generate and save two Analysis Graphs : Accuracy Analysis and Loss Analysis Graphs for both Training and Validation.

To run model.py you need to have downloaded the Dataset beforehand. It will not run otherwise.

Run predict.py after typing in the image path. You have at your disposal a collection of test images I used myself in the Test directory. If you wish, you may also test some from the Dataset folder

If you have Blender, you can run shape_generator_1.py and gradient_shape_generator.py within Blender NOT in Python. The script will not run there. The script will generate beautiful 3D renders for you and save it to a folder.

To clone this repository:

git clone https://github.com/shadow-edge9/CEAM-Task-3D-Image-Classifier
cd CEAM-Task-3D-Image-Classifier

To run model.py:

python3 model.py

To run predict.py:

python3 predict.py

Experimentation Phase

Tested the model with the 30+ Images from Test folder and documented my observations in a presentation. Go to Project Assets

Observations and Conclusions

After Experimentation Phase, the following conclusions were drawn:

The model struggles with backgrounds that have several elements (blurred or otherwise) or more than two colours.
The model might slightly struggle with gradients
The model seems to recognize cones well, even with a slightly messy background and gradients.
When the model encounters a shape unknown to it, it doesn’t give an equal probability; rather it forces the shape into the nearest identical category.

Learnings, Challenges and Bug Fixes

Trial 1 & 2

Initially I ran the model for 20 epochs both for Trial 1 and Trial 2 with the old 60 image dataset. It had only 2 Convolutional layers, and was doing terribly. It called every object a CUBE, and in Trial 2 it called everything a CUBE with 100% Confidence. Assuming it to be because of the colours I stripped off its ability to see colour with Grayscale. Clearly, it's doing so poorly in its Validation phase with 42% accuracy. Since I didn't plot the graphs back then, I only have the raw data for it. Trial 2 ended with these metrics:

Epoch (20/20)

Metric	Training Phase	Validation Phase
Loss	$0.6876$	$1.0582$
Accuracy	$64.6$%	$41.7$%

Trial 3

In Trial 3 I updated my Dataset to have 300+ images, hoping to get better results. Little did I know that the model was still going do worse. Its Training and Validation Accuracies were capped at a mere 58.3% each, meaning it was still guessing.

Epoch (25/25)

Metric	Training Phase	Validation Phase
Loss	$0.7218$	$1.0857$
Accuracy	$58.3$%	$58.3$%

The only good outcome from this was that it stopped yelling 100% CUBE for every shape and starting to have slight doubts at Sphere and Cone as well. I thought of increasing the number of epochs but as the graphs showed me, the model was learning well but doing worse in the exam. So I added a third Convolutional Layer and tested it out in the next Trial.

Trial 4

Increased the number of out-channels to 128.
Clearly this one did much better in terms of accuracy from the previous three trials.
Training accuracy reached 77.1% and the validation accuracy reached 66.7%, which is actually a good improvement.
However the training loss and validation losses were still rather dismal.

Metric	Training Phase	Validation Phase
Loss	$0.5632$	$1.1658$
Accuracy	$77.1$%	$66.7$%

After running predict.py things got even worse, and amusing Somehow, the model figured everything had to be a cube.

When I actually gave it a cube, it said 59.6% cube and 37% cone and 3.5% probability for a sphere.
The moment I gave it a cone or a sphere, it would say 100% cube.

It's like when you confidently yell out the wrong answer in class. That's exactly what my model was doing.

So I decided to either cut down the number of epochs or change the rendering because despite the large dataset and 3 Convolutional Layers, it was still failing to learn.

Trial 5

Ran it at 16 Epochs and finally the Validation Accuracy was better than Training Accuracy.
The first time the graphs did not plateau and both actually decreased together.
The first time Validation Losses dropped to a value below 1.0.

Epoch(16/16)

Metric	Training Phase	Validation Phase
Loss	$0.7117$	$0.8270$
Accuracy	$64.6$%	$75.0$%

But it STILL called my sphere a CUBE.

And that's when I realised...

I never did update my Dataset.

I did, but I updated it on my Desktop and not in my Python .venv, and of course, it did not reflect the changes in PyCharm. This whole time I was running my tests with the old 60 Images and hoping for improvement.

NOTE: If you're on a Python .venv, you might want to reupload any Desktop folders if you ever update it. Then I went ahead a reuploaded my 300+ Image Dataset.

fixed the dataset and loaded updated one

Trial 6

The metrics of the Trial 6 are documented in Metrics and Parameters
Trial 6 marked the end of my experimentation with the models parameters. All that was left was to test different Images which I documented in Experimentation Phase.

Next Steps

I intend to run the model without the greyscale filter and see if it's accuracy improves or worsens.

Project Assets

Link to Dataset and Experiment Conclusion pdf available here in this Google Drive Link:

View Assets

DISCLAIMER: I do not own the rights to any of these images in this dataset. Some of the images in this dataset were collected from Google.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CEAM 3D Object Classifier

Project Overview

STEP 1: DATASET CREATION

STEP 2: BUILDING THE MODEL

STEP 3: THE PREDICTOR - THE MAIN PROGRAM

STEP 4: TEST THE MODEL ON EVERY CASE

Metrics and Parameters

(Epoch 25/25)

Prerequisites

How to Run the Project

Experimentation Phase

Observations and Conclusions

Learnings, Challenges and Bug Fixes

Trial 1 & 2

Epoch (20/20)

Trial 3

Epoch (25/25)

Trial 4

Trial 5

Epoch(16/16)

Trial 6

Next Steps

Project Assets

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
Test		Test
LICENSE		LICENSE
README.md		README.md
ceam_shapenet_weights.pth		ceam_shapenet_weights.pth
gradient_shape_generator.py		gradient_shape_generator.py
model.py		model.py
predict.py		predict.py
shape_generator_1.py		shape_generator_1.py

Folders and files

Latest commit

History

Repository files navigation

CEAM 3D Object Classifier

Project Overview

STEP 1: DATASET CREATION

STEP 2: BUILDING THE MODEL

STEP 3: THE PREDICTOR - THE MAIN PROGRAM

STEP 4: TEST THE MODEL ON EVERY CASE

Metrics and Parameters

(Epoch 25/25)

Prerequisites

How to Run the Project

Experimentation Phase

Observations and Conclusions

Learnings, Challenges and Bug Fixes

Trial 1 & 2

Epoch (20/20)

Trial 3

Epoch (25/25)

Trial 4

Trial 5

Epoch(16/16)

Trial 6

Next Steps

Project Assets

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages