Why Training Set accuracy decrease dramatically after stopping the trainNetwork?
8 views (last 30 days)
Don Mathis on 8 Feb 2019
Maybe your minibatch size is too small. The accuracy drop may be due to batchnormalization layers getting finalized, during which time the mean and variance of the incoming activations of each batchnorm layer are computed using the whole training set. If those full-batch statistics don't match the minibatch statistics very well, the finalized batchnorm layers will not be performing a very good normalization.