The calculation of input_file_size is based on the OUTPUT data, not the INPUT data at Aggregation testing and Scan testing. I have formatted a patch for this bug, but i'm not sure how to commit it to the community. The patch as attachment.
0001-Fix-an-error-about-caculating-of-input_file_size.patch