Correction for text with punctuation and dash by hyunjoolee · Pull Request #33 · NVIDIA/mellotron

hyunjoolee · 2020-01-17T06:34:52Z

I have found that if there are punctuation and dash characters in the text, they are not converted to clean text in text/init.py get_arpabet().

For examples, words like "recommendations.", "fbi," and "policy-making" are not searchable in the cmu_dict.
I think these will reduce model performance.

So I suggest some code as attached.

ava added 2 commits January 17, 2020 23:08

Correction for text with punctuation and dash

79a929c

Correction for text with punctuation and dash

ddf8492

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Correction for text with punctuation and dash#33

Correction for text with punctuation and dash#33
hyunjoolee wants to merge 2 commits intoNVIDIA:masterfrom
hyunjoolee:master

hyunjoolee commented Jan 17, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

hyunjoolee commented Jan 17, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant