What this?

this project using wav2vec2 to supervise generate audio with target emotion vectors.

Getting start

put some train files in ./train_dataset/*.wav
preprocess wav files to mel-spectrogram by runing preprocess.py
download wav2vec2 model
modifies training option from train.py
run train.py

About wav2vec2

wav2vec2 pretrain model link

download model and extract demo code.

import os
import audeer

url = 'https://zenodo.org/record/6221127/files/w2v2-L-robust-12.6bc4a7fd-1.1.0.zip'
model_path = './model.zip'

audeer.download_url(
    url, 
    model_path, 
    verbose=True,
)

audeer.extract_archive(
    model_path, 
    ".", 
    verbose=True,
)

all code and pretrain emotion wav2vec2 model from:

About emotion vectors

valence (the pleasantness of a stimulus)
arousal (the intensity of emotion provoked by a stimulus)
dominance (the degree of control exerted by a stimulus)

Norms of English lemmas PAD_emotional_state_model

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
config.py		config.py
dataset.py		dataset.py
model.py		model.py
preprocess.py		preprocess.py
readme.md		readme.md
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What this?

Getting start

About wav2vec2

About emotion vectors

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

What this?

Getting start

About wav2vec2

About emotion vectors

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages