book

Java Deep Learning Projects

by Md. Rezaul Karim

June 2018

Intermediate to advanced

436 pages

10h 33m

English

Packt Publishing

Read now

Unlock full access

Content preview from Java Deep Learning Projects

LSTM network construction

As discussed in the Titanic survival prediction section, again everything starts with MultiLayerConfiguration, which organizes those layers and their hyperparameters. Our LSTM network consists of five layers. The input layer is followed by three LSTM layers. Then the last layer is an RNN layer, which is also the output layer.

More technically, the first layer is the input layer, and then three layers are placed as LSTM layers. For the LSTM layers, we initialized the weights using Xavier. We use SGD as the optimization algorithm with Adam updater and the activation function is tanh.

Finally, the RNN output layer has a softmax activation function, which gives us a probability distribution over classes (that is, outputs ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Start your free trial

Publisher Resources

ISBN: 9781788997454Supplemental Content

Java Deep Learning Projects

by Md. Rezaul Karim

LSTM network construction

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

You might also like

Java Deep Learning Essentials

Java Deep Learning Cookbook

Machine Learning in Java - Second Edition

Mastering Java Machine Learning

Publisher Resources