May 2019
Intermediate to advanced
272 pages
7h 19m
English
It is important to track failure early on to speed up the training process.
When the loss of the discriminator goes to 0, as mentioned, the updates of the generator are insignificant.
When the norm of the gradients is large, it means relatively large weight updates. Although this is expected on the first iterations, the norm should decrease as training proceeds.
It is expected that the variance of the discriminator's loss decreases over time and does not have sudden spikes.
Read now
Unlock full access