Skip to content

Dara Loader bug: t + ʃ -> tʃ and d + ʒ -> dʒ #15

@bi1101

Description

@bi1101

With the current data loader pipeline, affricates are merged together, affecting model performance

Example: "that she" ['ð', 'æ', 't', 'ʃ', 'i'] becomes ['ð', 'æ', 'tʃ', 'i']

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions