TikTok Content Moderation System

Claims vs. Opinions Classifier

Google Advanced Data Analytics Capstone Project Author: Daniel Steven Rodriguez Sandoval

Overview

TikTok receives millions of user reports daily. Human moderators cannot review every flagged video immediately, creating backlogs that allow misinformation to remain live longer than necessary. This project builds a machine learning pipeline that classifies TikTok video content as claims (verifiable assertions) or opinions (personal viewpoints), enabling the moderation team to automatically prioritize high-risk videos for expedited review.

Problem Statement

The core challenge is a binary classification task:

Given a set of video metadata and transcription features, can we reliably predict whether a video contains a claim or an opinion?

This matters because claim-based videos are more likely to spread misinformation and require human review before wide distribution.

Priority metric: Recall — a missed claim (false negative) is more costly than a false positive, since it means potential misinformation stays on the platform unreviewed.

Dataset

Note on Data Privacy & Reproducibility: To ensure this pipeline can be run publicly while respecting data privacy, the script utilizes a dynamically generated synthetic dataset. This data was engineered using numpy to precisely mirror the statistical distributions, engagement patterns, and feature relationships found in the original Google Advanced Data Analytics Capstone dataset.

Records: 19,382 TikTok videos (Synthetic)
Target variable: claim_status (claim / opinion) — near-balanced (~50/50)
Features: 12 columns including video engagement metrics, author account status, verification status, and video transcription text

Feature	Description
`video_view_count`	Total views
`video_like_count`	Total likes
`video_share_count`	Total shares
`video_download_count`	Total downloads
`video_comment_count`	Total comments
`video_duration_sec`	Duration in seconds
`verified_status`	Whether the author is verified
`author_ban_status`	Author account status (active / banned / under review)
`video_transcription_text`	Auto-generated caption text

Methodology — PACE Framework

Plan

Defined the business problem and ethical considerations
Identified recall as the key optimization metric
Selected tree-based models (Random Forest, XGBoost) — robust to outliers, no feature scaling required

Analyze

Inspected data shape, null values, and class distribution
Explored engagement patterns: claim videos receive ~10× more views, likes, and shares than opinion videos
Analyzed author ban status: claim videos are disproportionately associated with banned/under-review accounts
Computed derived engagement rate features (likes_per_view, shares_per_view, comments_per_view)

Construct

Extracted text_length from video transcriptions
Applied pd.get_dummies encoding for categorical variables
Split data into Train / Validation / Test sets (60% / 20% / 20%)
Applied CountVectorizer (2–3 grams, top 15 features) to transcription text — fit on training set only to prevent data leakage
Final feature matrix: 27 features

Execute

Trained Random Forest and XGBoost with best hyperparameters identified via 5-fold cross-validated GridSearchCV (scoring = recall)
Evaluated on held-out validation and test sets
Selected champion model based on recall
Deployed probability threshold (≥ 0.85) for operational HIGH_PRIORITY flagging

Hypothesis Testing

Chi-squared tests of independence (α = 0.05) were conducted to identify categorical variables significantly associated with claim status:

Test	χ²	p-value	Result
`claim_status` vs `author_ban_status`	5736.68	< 0.001	Reject H₀ — significant association
`claim_status` vs `verified_status`	336.38	< 0.001	Reject H₀ — significant association

Both variables carry strong predictive signal and are included in the model feature set.

Results

Model	Recall	Precision	F1	Accuracy
Random Forest	99.95%	100.00%	99.97%	99.97%
XGBoost	99.84%	99.95%	99.90%	99.90%

Champion model: Random Forest

Operational Flagging

Using a probability threshold of 0.85:

~50% of videos flagged as HIGH_PRIORITY_REVIEW
99.69% of all actual claims captured in the priority queue
Allows moderation teams to clear the highest-risk content first, significantly reducing average time misinformation stays live

Top Predictive Features

video_view_count
video_like_count
video_share_count
video_download_count
author_ban_status_banned

Engagement metrics dominate — claim videos go viral at a fundamentally different rate than opinions.

Project Structure

TikTok-Content-Moderation/
├── TikTok_Capstone.py       # Main pipeline script
├── requirements.txt         # Python dependencies
├── README.md                # This file
└── outputs/                 # Generated plots (created on run)
    ├── plot_01_distribution.png
    ├── plot_02_engagement.png
    ├── plot_03_ban_status.png
    ├── plot_04_confusion_matrices.png
    ├── plot_05_feature_importance.png
    └── plot_06_claim_probability.png

How to Run

Requirements: Python 3.9+, conda recommended (XGBoost requires OpenMP on macOS)

# Install dependencies
pip install -r requirements.txt

# Run the full pipeline
python TikTok_Capstone.py

All plots are saved to the outputs/ folder. Expected runtime: < 30 seconds.

Certificate

Google Advanced Data Analytics Professional Certificate — Coursera

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
README.md		README.md
TikTok_Capstone.py		TikTok_Capstone.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TikTok Content Moderation System

Claims vs. Opinions Classifier

Overview

Problem Statement

Dataset

Dataset

Methodology — PACE Framework

Plan

Analyze

Construct

Execute

Hypothesis Testing

Results

Operational Flagging

Top Predictive Features

Project Structure

How to Run

Certificate

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TikTok Content Moderation System

Claims vs. Opinions Classifier

Overview

Problem Statement

Dataset

Dataset

Methodology — PACE Framework

Plan

Analyze

Construct

Execute

Hypothesis Testing

Results

Operational Flagging

Top Predictive Features

Project Structure

How to Run

Certificate

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages