Hi! I'm running the following command to train the model:
$ python train.py --data_dir=./test/data --output_dir=./outputs
The GPU I'm using has 16276 MiB of memory. However, I get an out-of-memory error immediately:
/wavenet/networks.py", line 88, in forward
gated = gated_tanh * gated_sigmoid
RuntimeError: cuda runtime error (2) : out of memory at /opt/conda/conda-bld/pytorch_1524584710464/work/aten/src/THC/generic/THCStorage.cu:58
Any thoughts as to why this might be happening? By my calculation, the input size is 1x100,000x256, which should easily fit in the GPU's 16276 MiB of memory.
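
A rough sketch of the arithmetic behind an estimate like this, assuming float32 values (4 bytes per element). The helper name `tensor_mib` is hypothetical and not part of the repository. It counts a single tensor of the stated shape only. The intermediate activations that autograd keeps for the backward pass are not included, so the result is a lower bound on the real footprint.

```python
# Back-of-the-envelope size of one tensor of shape 1 x 100,000 x 256.
# Assumption: float32 elements (4 bytes each); the post does not state the dtype.

def tensor_mib(shape, bytes_per_element=4):
    """Return the size in MiB of a dense tensor with the given shape."""
    num_elements = 1
    for dim in shape:
        num_elements *= dim
    return num_elements * bytes_per_element / 2**20

input_shape = (1, 100_000, 256)
gpu_mib = 16276  # GPU memory reported in the post

single_tensor_mib = tensor_mib(input_shape)

# How many tensors of this size would fit if nothing else used the GPU.
tensors_that_fit = int(gpu_mib // single_tensor_mib)

print(f"One tensor: {single_tensor_mib:.2f} MiB")
print(f"Tensors of this size that fit in {gpu_mib} MiB: {tensors_that_fit}")
```

A single tensor is small relative to the GPU, but each layer may keep several tensors of this size alive during training (for example, the tanh and sigmoid outputs on the line in the traceback), so the total can grow quickly with network depth.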