Skip to Content
Deep Learning for Biology
book

Deep Learning for Biology

by Charles Ravarani, Natasha Latysheva
July 2025
Intermediate to advanced
436 pages
11h 17m
English
O'Reilly Media, Inc.
Content preview from Deep Learning for Biology

Chapter 3. Learning the Logic of DNA

In this chapter, we’ll build a deep learning model to predict whether a DNA sequence is bound by a class of proteins called transcription factors (TFs). Transcription factors play a central role in gene regulation: they bind to specific DNA sequences and influence whether nearby genes are turned on or off. By recognizing these sequence patterns, we can begin to decode the regulatory logic embedded in the genome.

Unlike the previous chapter—where we used an off-the-shelf protein model from Hugging Face—here we’ll start defining and training our own models from scratch. This gives us more control and helps us better understand how deep learning works on biological data. We’ll explore both convolutional and transformer-based architectures and introduce interpretation techniques to help us understand how our models make predictions.

We will tackle this problem in stages, gradually increasing the complexity:

1. Start simple

First, we’ll train a basic convolutional network to predict whether a DNA sequence binds a single transcription factor called CTCF. Its binding behavior is relatively easy to predict, making it a great first target. We’ll build the full pipeline: loading data, training the model, and checking whether it captures meaningful biological signals.

2. Increase complexity

Next, we’ll scale up to predicting whether a sequence binds any of 10 different TFs. We’ll introduce regularization and normalization, improve our evaluation ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Math for Deep Learning

Math for Deep Learning

Ronald T. Kneusel

Publisher Resources

ISBN: 9781098168025Errata Page