Policy gradients in AlphaGo

For AlphaGo's policy-gradient stage, the network was set up to play games against itself. The reward is 0 at every time step until the final one, where the game is won or lost and the reward is +1 or -1. This final reward is then applied to every time step of the game, and the network is trained with policy gradients in the same way as our Tic-tac-toe example. To prevent overfitting, games were played against a randomly selected previous version of the network rather than always against the current one: if the network constantly plays against itself, it risks settling on very niche strategies that would not work against varied opponents, a local minimum of sorts.
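The sketch below illustrates this training loop, not AlphaGo's actual code: a toy PyTorch policy network, a hypothetical `play_game()` helper that stands in for real self-play, and a REINFORCE-style update in which the single end-of-game reward is used as the return for every move, with opponents drawn from a pool of frozen earlier versions of the network.

```python
import copy
import random
import torch
import torch.nn as nn

class PolicyNet(nn.Module):
    """Toy stand-in for a policy network: board features -> move logits."""
    def __init__(self, n_features=81, n_moves=82):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(n_features, 128), nn.ReLU(),
            nn.Linear(128, n_moves),
        )

    def forward(self, x):
        return self.net(x)

def play_game(policy, opponent):
    """Hypothetical self-play helper: samples random positions and a random
    outcome so the sketch runs end to end. A real version would play out a
    full game against the opponent network."""
    log_probs = []
    for _ in range(30):                        # pretend the game lasts 30 moves
        board = torch.randn(1, 81)             # fake board features
        dist = torch.distributions.Categorical(logits=policy(board))
        move = dist.sample()
        log_probs.append(dist.log_prob(move))
    z = random.choice([1.0, -1.0])             # final reward: +1 win, -1 loss
    return log_probs, z

policy = PolicyNet()
optimizer = torch.optim.SGD(policy.parameters(), lr=1e-3)
opponent_pool = [copy.deepcopy(policy)]        # frozen earlier versions

for game in range(100):
    # Play against a randomly chosen previous version, not the current
    # network, to avoid overfitting to a single opponent.
    opponent = random.choice(opponent_pool)
    log_probs, z = play_game(policy, opponent)

    # REINFORCE update: the end-of-game reward z is applied to every time
    # step, so every move in a won game is reinforced and every move in a
    # lost game is discouraged.
    loss = -(torch.stack(log_probs) * z).sum()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

    if game % 20 == 19:                        # periodically snapshot the policy
        opponent_pool.append(copy.deepcopy(policy))
```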

Building the initial supervised learning network that predicted ...
