Hijabistahub TikTok Sentiment Analysis — Engagement Insights & Model Benchmarking

This project analyzes TikTok comments related to Hijabistahub to understand public sentiment and engagement patterns, then benchmarks multiple text-classification models (Naive Bayes, SVM, Gradient Boosting). The objective is to convert unstructured social feedback into actionable insights that can support content strategy and brand perception monitoring.

What This Project Delivers

Sentiment classification of TikTok comments (positive vs negative)
Engagement insights using view-count trends and influencer comparisons
End-to-end ML workflow (preprocessing → training/testing → evaluation → visualization)

Key Visuals (Project Overview)

Sentiment Distribution	Top Comment Terms
View Count Trend Over Time	Influencer vs View Count

RapidMiner Pipelines (Reproducible Workflow Evidence)

Training & Testing Workflow

Text Preprocessing Pipeline

Methodology (Summary)

1) Data Preparation

Cleaned noisy social text (symbols, duplicates, inconsistent casing)
Structured comments into a usable dataset for modeling

2) NLP Preprocessing

Tokenization
Case transformation
Token-length filtering
Stopword removal (English + additional filtering)

3) Model Benchmarking

The following models were trained and evaluated with consistent preprocessing:

Naive Bayes
SVM
Gradient Boosting

Tools & Tech Stack

RapidMiner Studio (workflow-based modeling and evaluation)
Python (Jupyter Notebooks) for preprocessing / labeling support
CSV datasets for training/testing inputs

How to Run (Fast Start)

RapidMiner

Open RapidMiner Studio
Load the .rmp processes
Ensure CSV paths are mapped correctly
Run the training/testing workflows and visualization process

Python

Open the notebooks (.ipynb)
Run cells in order to reproduce preprocessing and dataset preparation

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
data		data
models		models
notebooks		notebooks
screenshots		screenshots
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hijabistahub TikTok Sentiment Analysis — Engagement Insights & Model Benchmarking

What This Project Delivers

Key Visuals (Project Overview)

RapidMiner Pipelines (Reproducible Workflow Evidence)

Methodology (Summary)

1) Data Preparation

2) NLP Preprocessing

3) Model Benchmarking

Tools & Tech Stack

How to Run (Fast Start)

RapidMiner

Python

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Hijabistahub TikTok Sentiment Analysis — Engagement Insights & Model Benchmarking

What This Project Delivers

Key Visuals (Project Overview)

RapidMiner Pipelines (Reproducible Workflow Evidence)

Methodology (Summary)

1) Data Preparation

2) NLP Preprocessing

3) Model Benchmarking

Tools & Tech Stack

How to Run (Fast Start)

RapidMiner

Python

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages