November 2019
Intermediate to advanced
304 pages
8h 40m
English
Consider the following experiments to illustrate step 1.
The following training was performed on 10,000 records with a batch size of 8 and a learning rate of 0.008:

The following is the evaluation performed on the same dataset for a batch size of 50 and a learning rate of 0.008:

To perform step 2, we increased the learning rate to 0.6, to observe the results. Note that a learning rate beyond a certain limit will not help efficiency in any way. Our job is to find that limit:
You can observe that Accuracy is reduced to 82.40% ...