book

Deep Learning for the Life Sciences

by Bharath Ramsundar, Peter Eastman, Pat Walters, Vijay Pande

April 2019

Intermediate to advanced

233 pages

6h 42m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Conventions Used in This BookUsing Code ExamplesO’Reilly Online LearningHow to Contact UsAcknowledgments
Why Deep Learning?Contemporary Life Science Is About DataWhat Will You Learn?
Linear ModelsMultilayer PerceptronsTraining ModelsValidationRegularizationHyperparameter OptimizationOther Types of ModelsConvolutional Neural NetworksRecurrent Neural NetworksFurther Reading
DeepChem DatasetsTraining a Model to Predict Toxicity of MoleculesCase Study: Training an MNIST ModelThe MNIST Digit Recognition DatasetA Convolutional Architecture for MNISTConclusion
What Is a Molecule?What Are Molecular Bonds?Molecular GraphsMolecular ConformationsChirality of MoleculesFeaturizing a MoleculeSMILES Strings and RDKitExtended-Connectivity FingerprintsMolecular DescriptorsGraph ConvolutionsTraining a Model to Predict SolubilityMoleculeNetSMARTS StringsConclusion
Protein StructuresProtein SequencesA Short Primer on Protein BindingBiophysical FeaturizationsGrid FeaturizationAtomic FeaturizationThe PDBBind Case StudyPDBBind DatasetFeaturizing the PDBBind DatasetConclusion
DNA, RNA, and ProteinsAnd Now for the Real WorldTranscription Factor BindingA Convolutional Model for TF BindingChromatin AccessibilityRNA InterferenceConclusion
A Brief Introduction to MicroscopyModern Optical MicroscopyThe Diffraction LimitElectron and Atomic Force MicroscopySuper-Resolution MicroscopyDeep Learning and the Diffraction Limit?Preparing Biological Samples for MicroscopyStainingSample FixationSectioning SamplesFluorescence MicroscopySample Preparation ArtifactsDeep Learning ApplicationsCell CountingCell SegmentationComputational AssaysConclusion
Computer-Aided DiagnosticsProbabilistic Diagnoses with Bayesian NetworksElectronic Health Record DataThe Dangers of Large Patient EHR Databases?Deep RadiologyX-Ray Scans and CT ScansHistologyMRI ScansLearning Models as TherapeuticsDiabetic RetinopathyConclusionEthical ConsiderationsJob LossesSummary
Variational AutoencodersGenerative Adversarial NetworksApplications of Generative Models in the Life SciencesGenerating New Ideas for Lead CompoundsProtein DesignA Tool for Scientific DiscoveryThe Future of Generative ModelingWorking with Generative ModelsAnalyzing the Generative Model’s OutputConclusion

Explaining PredictionsOptimizing InputsPredicting UncertaintyInterpretability, Explainability, and Real-World ConsequencesConclusion
Preparing a Dataset for Predictive ModelingTraining a Predictive ModelPreparing a Dataset for Model PredictionApplying a Predictive ModelConclusion
Medical DiagnosisPersonalized MedicinePharmaceutical DevelopmentBiology ResearchConclusion

Content preview from Deep Learning for the Life Sciences

Chapter 4. Machine Learning for Molecules

This chapter covers the basics of performing machine learning on molecular data. Before we dive into the chapter, it might help for us to briefly discuss why molecular machine learning can be a fruitful subject of study. Much of modern materials science and chemistry is driven by the need to design new molecules that have desired properties. While significant scientific work has gone into new design strategies, much random search is sometimes still needed to construct interesting molecules. The dream of molecular machine learning is to replace such random experimentation with guided search, where machine-learned predictors can propose which new molecules might have desired properties. Such accurate predictors could enable the creation of radically new materials and chemicals with useful properties.

This dream is compelling, but how can we get started on this path? The first step is to construct technical methods for transforming molecules into vectors of numbers that can then be passed to learning algorithms. Such methods are called molecular featurizations. We will cover a number of them in this chapter, and more in the next chapter. Molecules are complex entities, and researchers have developed a host of different techniques for featurizing them. These representations include chemical descriptor vectors, 2D graph representations, 3D electrostatic grid representations, orbital basis function representations, and more.

Once featurized, ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Start your free trial

Practical Simulations for Machine Learning

AI for Healthcare with Keras and Tensorflow 2.0: Design, Develop, and Deploy Machine Learning Models Using Healthcare Data

Anshik

Publisher Resources

ISBN: 9781492039822Errata Page Supplemental Content

Deep Learning for the Life Sciences

by Bharath Ramsundar, Peter Eastman, Pat Walters, Vijay Pande

Chapter 4. Machine Learning for Molecules

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

You might also like

Practical Simulations for Machine Learning

Machine Learning Pocket Reference

Deep Learning Quick Reference

AI for Healthcare with Keras and Tensorflow 2.0: Design, Develop, and Deploy Machine Learning Models Using Healthcare Data

Publisher Resources

Chapter 4. Machine Learning for Molecules

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,and much more.

You might also like

Practical Simulations for Machine Learning

Machine Learning Pocket Reference

Deep Learning Quick Reference

AI for Healthcare with Keras and Tensorflow 2.0: Design, Develop, and Deploy Machine Learning Models Using Healthcare Data

Publisher Resources

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.