With the current data loader pipeline, affricates are merged together, affecting model performance Example: "that she" ['ð', 'æ', 't', 'ʃ', 'i'] becomes ['ð', 'æ', 'tʃ', 'i']
With the current data loader pipeline, affricates are merged together, affecting model performance
Example: "that she" ['ð', 'æ', 't', 'ʃ', 'i'] becomes ['ð', 'æ', 'tʃ', 'i']