Skip to content

Latest commit

 

History

History
18 lines (10 loc) · 542 Bytes

File metadata and controls

18 lines (10 loc) · 542 Bytes

Digital Forensics using Machine Learning

  • Analyzing Email Content with Kmeans & MNB

  • Analyzing Images with LinearSVC (Grayscale + Hog)

  • Analyzing URLs with RandomForest (TF-IDF + Entropy-XFeatures)


Unsupervised Dataset: https://www.kaggle.com/datasets/wcukierski/enron-email-dataset

Supervised Dataset: enron_spam_data_label.csv

Graphic Image Classification Dataset: Public DaSCIS.es Image Dataset

URL Dataset: url_data_mega_deep_learning_checked.csv