Chapter 7. Recurrent Neural Networks for Natural Language Processing
In Chapter 5, you saw how to tokenize and sequence text, turning sentences into tensors of numbers that could then be fed into a neural network. You extended that in Chapter 6 with embeddings, which give words with similar meanings similar vectors so that they cluster together, making it possible to calculate sentiment. This worked really well, as you saw when you built a sarcasm classifier. But there's a limitation: sentences aren't just collections of words. Often, the order in which the words appear dictates their overall meaning. Also, adjectives can add to or change the meaning of the nouns they appear beside. For example, the word blue might be meaningless from a sentiment perspective, as might sky, but put them together into blue sky and you get a clear sentiment that's usually positive. Finally, some nouns may qualify others, as in rain cloud, writing desk, and coffee mug.
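As a quick refresher, here is a minimal sketch of how those two earlier steps fit together, using the tokenizer and embedding APIs from Chapters 5 and 6. The sentences, vocabulary size, sequence length, and embedding dimension are illustrative assumptions for this example, not the values used with the sarcasm dataset.

```python
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences
from tensorflow.keras.layers import Embedding

# Illustrative sentences; Chapters 5 and 6 used the sarcasm dataset instead.
sentences = ['blue sky ahead', 'a dark rain cloud', 'coffee mug on the writing desk']

# Chapter 5: tokenize the text and turn each sentence into a padded sequence of numbers.
tokenizer = Tokenizer(num_words=100, oov_token='<OOV>')
tokenizer.fit_on_texts(sentences)
sequences = tokenizer.texts_to_sequences(sentences)
padded = pad_sequences(sequences, maxlen=6, padding='post')

# Chapter 6: map each token ID to a dense vector so words with similar meanings can cluster.
embedding = Embedding(input_dim=100, output_dim=16)
vectors = embedding(padded)
print(vectors.shape)  # (3, 6, 16): sentences, tokens per sentence, embedding dimensions
```

Note that nothing in this pipeline looks at the order of the tokens once the vectors are pooled for classification, which is exactly the limitation described above.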
To take sequences like this into account, you need an additional approach: factoring recurrence into the model architecture. In this chapter, you'll look at different ways of doing that. We'll explore how sequence information can be learned, and how you can use it to create a type of model that's better able to understand text: the recurrent neural network (RNN).
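To make that idea concrete before digging into the details, here is a minimal sketch of what such a model might look like in Keras: instead of pooling the embedded word vectors, a recurrent layer reads them in order, carrying state from one word to the next. The layer choice (a simple RNN), vocabulary size, sequence length, and layer sizes are assumptions for illustration only; the chapter builds up the real architectures step by step.

```python
import tensorflow as tf

# A sketch of factoring recurrence into the architecture. All sizes here are
# illustrative assumptions, not the model built later in the chapter.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(100,), dtype='int32'),     # padded sequences of 100 token IDs
    tf.keras.layers.Embedding(input_dim=10000, output_dim=16),
    tf.keras.layers.SimpleRNN(32),                   # reads the sequence word by word, carrying state
    tf.keras.layers.Dense(1, activation='sigmoid')   # e.g., sarcastic vs. not sarcastic
])
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])
model.summary()
```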
The Basis of Recurrence
To understand how recurrence might work, let’s first consider the limitations ...