Chapter 7. Training a State-of-the-Art Model

This chapter introduces more advanced techniques for training an image classification model and getting state-of-the-art results. You can skip it if you want to learn more about other applications of deep learning and come back to it later—knowledge of this material will not be assumed in later chapters.

We will look at what normalization is, a powerful data augmentation technique called Mixup, the progressive resizing approach, and test time augmentation. To show all of this, we are going to train a model from scratch (not using transfer learning) on a subset of ImageNet called Imagenette. It contains 10 very different categories from the original ImageNet dataset, making for quicker training when we want to experiment.
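
As a concrete starting point, here is a minimal sketch of how you might download Imagenette and build dataloaders for it with fastai. The image sizes, augmentation settings, and batch size here are illustrative choices, not requirements:

    from fastai.vision.all import *

    # Download Imagenette and get the path to the extracted files
    path = untar_data(URLs.IMAGENETTE)

    # Labels are encoded as parent folder names; resize each item to a
    # generous size, then augment and crop to a smaller size per batch
    dblock = DataBlock(blocks=(ImageBlock, CategoryBlock),
                       get_items=get_image_files,
                       get_y=parent_label,
                       item_tfms=Resize(460),
                       batch_tfms=aug_transforms(size=224, min_scale=0.75))
    dls = dblock.dataloaders(path, bs=64)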

This is going to be much harder to do well than with our previous datasets because we’re using full-size, full-color images: photos of objects at different sizes, in different orientations, under different lighting, and so forth. So in this chapter we’re going to introduce important techniques for getting the most out of your dataset, especially when you’re training from scratch, or using transfer learning on a kind of dataset very different from the one the pretrained model was trained on.
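
To make "training from scratch" concrete, a baseline on the dataloaders built above might look like the sketch below. The choice of fastai's xresnet50 architecture, the epoch count, and the learning rate are our assumptions for illustration, not prescribed by the text:

    # Build a model with randomly initialized weights (no pretrained
    # parameters); dls.c is the number of target categories
    model = xresnet50(n_out=dls.c)

    # Use a plain Learner, rather than a fine-tuning helper, since there
    # are no pretrained layers to freeze
    learn = Learner(dls, model, loss_func=CrossEntropyLossFlat(),
                    metrics=accuracy)
    learn.fit_one_cycle(5, 3e-3)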

Imagenette

When fast.ai first started, people used three main datasets for building and testing computer vision models:

ImageNet

1.3 million images of various sizes, around 500 pixels across, in 1,000 categories
