Skip to content

Training a new model: Ascii codec can't decode byte 0xc3 #31

@Steven1791

Description

@Steven1791

There seems to be some kind of encoding problem when training a new model (en-fr). I'm pretty sure it's because of the french alphabet (e.g. é, è ...).
To make sure the problem wasn't caused by me, I followed the instructions provided. However I was running the code on a GPU cluster in a docker container. See the files attached for a complete list of apt, pip and pip3 packages available in my container (I provided python 2 and 3, since I wasn't sure if 2 is still needed).

I downloaded the LibriSpeech Data set and used the LibriSpeech AST config file to train a new model, the error occured within 30s after starting training.

image

installed-software.txt
packages_pip.txt
packages_pip3.txt

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions