January 2019
Intermediate to advanced
390 pages
9h 16m
English
Regularization adds a term in the loss function to ensure that the cost increases as the model increases the number of features. Hence, we force the model to stay simpler. If L(X, Y) was the loss function earlier, we replace it with the following:

In the preceding, N can be L1 norm, L2 norm, or a combination of the two, and λ is the regularization coefficient. Regularization helps in reducing the model variance, without losing any important properties of the data distribution:
Read now
Unlock full access