Transformer-BiGRU The codebase and result for using BiGRU to predict GPU memory usage in deep learning training.