The Microsoft ResNet-50 model

ResNet was proposed by He et al. (https://arxiv.org/pdf/1512.03385.pdf) and won the ImageNet competition in 2015. This method showed that much deeper networks can be trained. In a plain network, accuracy saturates and then degrades as more layers are added. This degradation is not caused by overfitting or by the larger number of parameters, because the training error itself increases. It is attributed to the difficulty of backpropagating gradients through many layers. This can be overcome by giving the gradients a direct path back to the earlier layers through a residual block, as follows:

Every two layers are joined by a skip connection, forming a residual block. You can see that the ...
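
To make the idea concrete, here is a minimal sketch of a two-layer residual block in NumPy. It is an illustration only, not the book's or the paper's implementation: it assumes fully connected layers in place of convolutions, omits batch normalization, and the names `relu`, `residual_block`, `w1`, and `w2` are hypothetical. The block computes relu(F(x) + x), where F is the two-layer transformation and x is the input carried over unchanged.

```python
import numpy as np


def relu(x):
    """Element-wise rectified linear unit."""
    return np.maximum(x, 0.0)


def residual_block(x, w1, w2):
    """Two-layer residual block: relu(F(x) + x), with F(x) = relu(x @ w1) @ w2.

    Assumption for this sketch: dense layers without biases, and w1, w2
    are square so that F(x) has the same shape as x.
    """
    out = relu(x @ w1)    # first layer followed by its activation
    out = out @ w2        # second layer, activation applied after the sum
    return relu(out + x)  # skip connection adds the input back


rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))          # batch of 4 inputs, 8 features each

# With all-zero weights F(x) is zero, so the block reduces to relu(x):
# the input still passes through the skip connection.
zeros = np.zeros((8, 8))
y_identity = residual_block(x, zeros, zeros)

# With random weights the block learns a correction on top of the input.
w1 = rng.standard_normal((8, 8)) * 0.1
w2 = rng.standard_normal((8, 8)) * 0.1
y = residual_block(x, w1, w2)
```

Because the input is added back before the final activation, the derivative of the sum with respect to x contains an identity term, which is what gives gradients a direct route past the two layers.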
