Adam is another method that computes adaptive learning rates for each parameter. In addition to storing an exponentially decaying average of past squared gradients like Adadelta and RMSprop, Adam also keeps an exponentially decaying average of past gradients, similar to momentum.
When in doubt, just use Adam!
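To make the update concrete, here is a minimal sketch of a single Adam step for one parameter, assuming NumPy and the commonly used default hyperparameters (learning rate 0.001, beta1 = 0.9, beta2 = 0.999, epsilon = 1e-8); the function name `adam_update` and its signature are illustrative, not a reference implementation.

```python
import numpy as np

def adam_update(param, grad, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam step. m and v are the running first and second moment
    estimates for this parameter; t is the (1-based) step count."""
    # Exponentially decaying average of past gradients (first moment, like momentum)
    m = beta1 * m + (1 - beta1) * grad
    # Exponentially decaying average of past squared gradients (second moment,
    # as in Adadelta / RMSprop)
    v = beta2 * v + (1 - beta2) * grad ** 2
    # Bias correction: both averages start at zero and are biased low early on
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    # Per-parameter adaptive update
    param = param - lr * m_hat / (np.sqrt(v_hat) + eps)
    return param, m, v
```

In practice the moment estimates `m` and `v` are kept per parameter across steps, so each weight effectively gets its own adaptive learning rate.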