3.1 BACKPROPAGATION AND ADJOINT ALGORITHMIC DIFFERENTIATION
3.2 DATA PREPARATION AND SCALING
3.3 WEIGHT INITIALISATION
3.4 THE CHOICE OF LOSS FUNCTION
3.5 OPTIMISATION ALGORITHMS
3.6 COMMON TRAINING PROBLEMS
3.7 BATCH NORMALISATION
3.8 EVALUATION AND VALIDATION
3.9 SOBOLEV TRAINING: USING FUNCTION DERIVATIVES
3.10 CONCLUSIONS
NOTES