Training a new model: ASCII codec can't decode byte 0xc3 #31

There seems to be an encoding problem **when training a new model** (en-fr). I suspect it is caused by the accented characters of the French alphabet (e.g. é, è): in UTF-8 these start with the byte 0xc3, which matches the error message.
To make sure the problem wasn't caused by me, I followed the instructions provided. However, I was running the code on a GPU cluster inside a Docker container. See the attached files for a complete list of the apt, pip and pip3 packages available in my container (I included both Python 2 and Python 3 packages, since I wasn't sure whether Python 2 is still needed).

I downloaded the LibriSpeech dataset and used the LibriSpeech AST config file to train a new model. The error occurred within 30 seconds of starting training.

![image](https://user-images.githubusercontent.com/34455478/65311078-e88ef500-db8f-11e9-81b7-06e0047a9353.png)
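To illustrate what I suspect is going on, here is a minimal Python 3 sketch. It is not taken from the seq2seq code: `decode_error_message` and `read_lines_utf8` are hypothetical helpers I made up for this example. It shows that UTF-8 encoded `é` starts with byte 0xc3, that decoding it as ASCII produces the same kind of message, and that reading a file with an explicit `encoding="utf-8"` avoids depending on the container's locale (which, in a minimal Docker image, may default to ASCII).

```python
import locale

# 'é' encoded as UTF-8 is the two bytes 0xc3 0xa9.
ACCENTED_BYTES = "é".encode("utf-8")


def decode_error_message(raw, encoding):
    """Return the UnicodeDecodeError text for `raw`, or '' if decoding works."""
    try:
        raw.decode(encoding)
    except UnicodeDecodeError as err:
        return str(err)
    return ""


def read_lines_utf8(path):
    """Read a text file as UTF-8 regardless of the locale's default encoding.

    Hypothetical helper: without the explicit `encoding` argument, open()
    falls back to the locale's preferred encoding.
    """
    with open(path, encoding="utf-8") as handle:
        return [line.rstrip("\n") for line in handle]


if __name__ == "__main__":
    print(ACCENTED_BYTES)
    print(decode_error_message(ACCENTED_BYTES, "ascii"))
    # Shows which encoding open() would use by default in this environment.
    print(locale.getpreferredencoding(False))
```

If the last line prints an ASCII variant inside the container, that would support the locale being the cause rather than the training data itself.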

[installed-software.txt](https://github.com/eske/seq2seq/files/3634858/installed-software.txt)
[packages_pip.txt](https://github.com/eske/seq2seq/files/3634859/packages_pip.txt)
[packages_pip3.txt](https://github.com/eske/seq2seq/files/3634860/packages_pip3.txt)

