Skip to Content
BLAST
book

BLAST

by Ian Korf, Mark Yandell, Joseph Bedell
July 2003
Intermediate to advanced
368 pages
13h 44m
English
O'Reilly Media, Inc.
Content preview from BLAST
This is the Title of the Book, eMatter Edition
Copyright © 2012 O’Reilly & Associates, Inc. All rights reserved.
20 Tips to Improve Your BLAST Searches
|
119
8.5 Use the Karlin-Altschul Equation
to Design Experiments
The Karlin-Altschul equation is very useful for predicting the outcome of a BLAST
experiment, especially in large search spaces. Suppose you want to find exons in the
human genome by looking for similarities in the pufferfish genome. These genomes
last shared a common ancestor about 450 million years ago. You might assume that
any similarities at this distance must be due to evolutionary conservation.
Recall from Chapter 4 that the number of alignments expected by chance (E) is a
function of the search space (M, N), the normalized score (λS), and a minor con-
stant (K).
The typical cross-species parameters +1/-1 match/mismatch have a target frequency
of 75 percent identity and 0.55 nats per aligned letter on average (H). A 50-bp align-
ment therefore contains about 27.5 nats. Substituting this normalized score into the
Karlin-Altschul equation with K=0.334, M=1.5 GB (assuming half of the human
genome contains repeats), and N=450 MB (the size of the repeat-poor pufferfish
genome), you expect about 230,000 alignments by chance. That’s roughly the same
as the number of exons in the human genome. If you want to look for 50-bp exons,
you’ll have to sift through a lot of false positives.
To
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

INSPIRED

INSPIRED

Marty Cagan
Storytelling with You

Storytelling with You

Cole Nussbaumer Knaflic
Observability Engineering

Observability Engineering

Charity Majors, Liz Fong-Jones, George Miranda

Publisher Resources

ISBN: 0596002998Catalog PageErrata