LSTM–RoBERTa Sentiment Analyzer

A deep learning sentiment analysis solution for interpreting real user feedback and extracting actionable insights from social media data.

Please click here for video demo.

Highlights

End-to-end sentiment analysis combining custom LSTM and transformer-based (RoBERTa) models
Interactive Streamlit web application deployed on Hugging Face Spaces (zero cost free-tier deployment)
Supports sentiment classification for real social media content (YouTube comments)
Demonstrates practical business applications for marketing and product development teams
Custom-trained deep learning model (LSTM) outperforms rule-based baselines (TextBlob and VADER)

Skills Demonstrated

✔ Deep Learning & Natural Language Processing (NLP): tokenization, embeddings, transformer inference

✔ API & Data Handling: YouTube sentiment datasets, API quota workarounds, preloading strategy for demo app

✔ Model Development: custom LSTM training, performance evaluation, model export

✔ Comparative Benchmarking: baseline comparison vs TextBlob and VADER

✔ Application Deployment: Streamlit UI, Hugging Face Spaces hosting with awareness of resource constraints

✔ Model Section: User can switch between a custom-trained LSTM and a robust RoBERTa transformer

Problem Statement

In the age of social media, understanding public sentiment has become essential for businesses. Twitter, as one of the largest platforms for public expression, offers a vast and valuable source of data for sentiment analysis.

The goal of this project is to develop machine learning models (LSTM and roBERTa transformer) capable of accurately classifying the sentiment of social media comments as either positive, neutral or negative. This analysis can provide actionable insights to help organizations tailor their products and marketing strategies, improve customer service, and ultimately enhance user satisfaction.

Overview

Note:
This Streamlit application is hosted on the free tier of Hugging Face Spaces. If the app has been idle for more than 24 hours, it may take some time to reactivate. In such cases, please click “Restart this Space” to relaunch the application. Thank you for your patience.

This project presents an end-to-end sentiment analysis system for YouTube comments, combining a custom deep learning model (LSTM) and a transformer-based model (RoBERTa), deploying them through an interactive Streamlit web application (demo).

Due to daily quota limitations of the YouTube Data API, this demonstration uses preloaded comments to ensure a stable and consistent user experience while effectively showcasing the system’s sentiment analysis capabilities. In a full implementation (please click here for video demo), users would be able to input any YouTube video URL and extract comments in real time using the YouTube Data API.

Sentiment & Business Insights

In the demo application, different YouTube videos are used for sentiment analysis which include new product teaser and game trailer.

New Product Teaser - Everything New with GoPro HERO13 Black

Business Impacts

Measure customer excitement before product launch
Identify potential concerns or negative reactions
Improve marketing strategies and product positioning

Model Comparison & Results

	Custom-trained LSTM	Transformer (RoBERTa)
Avg. Score out of 1.00	0.58	0.58
% of Positive Comments	44.8%	32.8%
% of Neutral Comments	28.9%	51.2%
% of Negative Comments	26.4%	15.9%

Analysis

The overall sentiment analysis indicates a slightly positive but polarized user perception. Both models show an average sentiment score of 0.58 / 1.00 across 402 comments. The wide score dispersion highlights a clear divide between enthusiastic supporters and dissatisfied users.

From a business perspective, user sentiment is strongly feature-driven rather than brand-driven. Word cloud analysis shows that discussions are centered on product capabilities such as lens quality, sensor performance, battery life, software updates, and upgrades. Positive sentiment aligns closely with purchase intent and upgrade interest, indicating strong demand among early adopters and existing users.

Additionally, frequent mentions of competitors including DJI and Osmo highlight a highly competitive market with low switching costs, increasing the importance of clear differentiation and transparent communication.

Actionable Insights

Since there are concerns over specific product capabilities, the company can anchor marketing messages around quantifiable improvements: battery life increase, sensor performance or software upgrades.

In view of the fierce competition with low switching cost, the company can differentiate itself by explicitly positioning HERO13 against competitor by identifying the strengths competitors can’t match, for instance durability and accessories.

Game Trailer - Fallout 4: Anniversary Edition

Business Impacts

Understand player expectations and engagement
Anticipate audience reception
Support promotional decision-making

Model Comparison & Results

	Custom-trained LSTM	Transformer (RoBERTa)
Avg. Score out of 1.00	0.47	0.43
% of Positive Comments	30.8%	15.7%
% of Neutral Comments	24.2%	54.1%
% of Negative Comments	45.0%	30.1%

Analysis

The overall sentiment for this dataset skews negative and polarized, with an average sentiment score of 0.47 (LSTM) and 0.43 (RoBERTa) across 458 comments. The wide dispersion of sentiment scores confirms a strong divide between dissatisfied users and a smaller but vocal positive group.

From a business perspective, conversation is heavily franchise- and release-driven rather than centered on gameplay mechanics alone. Word cloud analysis shows dominant themes such as “new” “version” “remaster” “release” and “DLC” (i.e. Downloadable Content) indicating that user sentiment is shaped largely by expectations around new releases, remakes, and updates.

Negative sentiment, however, is more prevalent and more specific. Common terms such as “old,” “nothing,” “mod,” (i.e. modification), “wait,” “money,” “need,” and “already” point to frustration with perceived lack of innovation, repetitive re-releases, and reliance on modding communities. Complaints appear less about technical failure and more about strategic direction and content freshness.

The recurring appearance of “remaster” and “new version” in negative contexts suggests release fatigue. Users expect substantive changes, not incremental updates to existing titles. When expectations are unmet, disappointment translates quickly into negative sentiment.

Actionable Insights

Sentiment is strongly influenced by release strategy, with disappointment in release fatigue. The company can lead messaging with what is genuinely new including systems, content depth and mechanics.

Stages of Development

The whole project consists of two main stages:

Model Development
- Data preprocessing and exploratory analysis
- Training a custom LSTM sentiment analysis model with 1.6 million pre-labeled tweets
- Compare the performance of the custom LSTM model with traditional approaches - TextBlob and VADER
- Saving and exporting the trained model for deployment (.keras)
Model Deployment
- Hosting the trained LSTM model on Hugging Face
- Integrating a transformer-based model (RoBERTa)
- Deploying a Streamlit application that allows users to select and compare models

Model Comparison

Custom LSTM Model

Trained on labeled sentiment data (Sentiment140)
Captures sequential and contextual patterns in text
Custom LSTM achieved test accuracy of 0.78, outperforming TextBlob (0.61) and VADER (0.63)
Saved and uploaded to Hugging Face for inference
Lightweight and fast to run

Transformer Model (RoBERTa)

Pre-trained transformer-based sentiment analysis model released by META
Provides robust contextual understanding and strong performance
High accuracy on negations and subtle emotional cues
Longer processing time due to larger model and heavier memory usage

Streamlit Application

The final deliverable of this project is a Streamlit web application deployed on Hugging Face Spaces. We do not consider Streamlit Community Cloud due to the large file size of the model and complexity of the project.

Features of the Demo Application

Model selection:
- Custom LSTM
- Transformer (RoBERTa)
Sentiment analysis on preloaded YouTube comments
Interactive results display

Full Application

In a full implementation of this application, users would be able to input any YouTube video URL, allowing the system to extract comments in real time using the YouTube Data API. Please check out here for the video demonstration of full implementation.

Author

Carmen Wong

Name		Name	Last commit message	Last commit date
Latest commit History 133 Commits
dataset		dataset
demo_video		demo_video
images		images
scripts		scripts
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LSTM–RoBERTa Sentiment Analyzer

Highlights

Skills Demonstrated

Problem Statement

Overview

Sentiment & Business Insights

New Product Teaser - Everything New with GoPro HERO13 Black

Business Impacts

Model Comparison & Results

Analysis

Actionable Insights

Game Trailer - Fallout 4: Anniversary Edition

Business Impacts

Model Comparison & Results

Analysis

Actionable Insights

Stages of Development

Model Comparison

Custom LSTM Model

Transformer Model (RoBERTa)

Streamlit Application

Features of the Demo Application

Full Application

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LSTM–RoBERTa Sentiment Analyzer

Highlights

Skills Demonstrated

Problem Statement

Overview

Sentiment & Business Insights

New Product Teaser - Everything New with GoPro HERO13 Black

Business Impacts

Model Comparison & Results

Analysis

Actionable Insights

Game Trailer - Fallout 4: Anniversary Edition

Business Impacts

Model Comparison & Results

Analysis

Actionable Insights

Stages of Development

Model Comparison

Custom LSTM Model

Transformer Model (RoBERTa)

Streamlit Application

Features of the Demo Application

Full Application

Author

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages