Please consider adding this result to your list.
A VGG-like network with 6 convolutional layers and 1 fully connected layer.
conv128-conv256-maxpool-conv256-conv512-maxpool-conv512-maxpool-conv512-maxpool-fc1024-fc10
Standard preprocessing (mean/std subtraction/division)
Cutout data augmentation
7.3M parameters
Layer-wise training, no global back-propagation
Code and more results: https://github.com/anokland/local-loss
Please consider adding this result to your list.
A VGG-like network with 6 convolutional layers and 1 fully connected layer.
conv128-conv256-maxpool-conv256-conv512-maxpool-conv512-maxpool-conv512-maxpool-fc1024-fc10
Standard preprocessing (mean/std subtraction/division)
Cutout data augmentation
7.3M parameters
Layer-wise training, no global back-propagation
Code and more results: https://github.com/anokland/local-loss