Sarcastically

What is it ?

Sarcastically is machine learning algorithm based on a text and audio multi-model approach developed using Tensorflow and Keras on top of the MUStARD dataset. Built for the purpose of the Bachelors of Computer Science at the University of Westminster.

Technology Used

The technology for the project is defined in the environment.yml file listed in the root of the project.

Purpose of the application

Sarcastically was built in order to improve sarcasm detection by utilizing better audio models focused on natural human voice which is able to provide better features which can be used to improve the accuracy of the sarcasm detection. This accompanied by the text model helps in improving the goal of this project as a whole.

Before you use the model

You will need to download the following files:

The WikiWord Vectors Dataset from FastText
The MUStARD Dataset Files
1. The video files
2. The JSON file containing the labelled data for the dataset

After you download the above, do the following:

Make sure that the audio files are stored in a folder named mmsd_raw_data in the root of the project.
Run the cells in the audio-model-preprocessing.ipynb file in order to convert the mp4 videos into .wav files.
Run the cells in mustard_normalizer.ipynb to normalize the data into a CSV file named normalized_mustard_dataset.csv

Finally,

Create a conda environment based on the environment.yml file as below

conda create -n <your_env_name> -f environment.yml

Activate the conda environment

source activate <your_env_name>

To train the model

Open the sarcastically.ipynb file and run all the cells. (make sure you've downloaded the files in the previous step)

If you run into any issues

Contact me at ryanjk.kuruppu@gmail.com

If you want access to my research paper

Checkout my final report at docs/sarcastically.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
audio_model		audio_model
common		common
docs		docs
models		models
.gitignore		.gitignore
README.md		README.md
environment.yml		environment.yml
mustard_dataset.json		mustard_dataset.json
mustard_normalizer.ipynb		mustard_normalizer.ipynb
normalized_mustard_dataset.csv		normalized_mustard_dataset.csv
sarcastically.ipynb		sarcastically.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sarcastically

What is it ?

Technology Used

Purpose of the application

Before you use the model

To train the model

If you run into any issues

If you want access to my research paper

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Sarcastically

What is it ?

Technology Used

Purpose of the application

Before you use the model

To train the model

If you run into any issues

If you want access to my research paper

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages