Python for Bioinformatics

by Jason Kinser
June 2008
Beginner to intermediate
417 pages
10h 41m
English
Jones & Bartlett Learning

9 Tandem Repeats

Some regions of the genome contain repeated segments of DNA. In some cases the repeats are quite simple, for example GATGATGAT. In other cases the repeats are much more complicated: a segment may repeat with minor variations, or repeats may be nested inside other repeats. This chapter explores one method of finding these repeating regions.

9.1 Tandem Repeats

A tandem repeat is a run of consecutive copies of the same segment in a string. Two simple examples are

TCTCTCTC
ATTCATTCATTC
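
The definition can be checked mechanically. The following is a minimal sketch, not the method developed in this chapter: it assumes the whole input string is a candidate repeat and tries every unit length that divides it evenly. The function name `repeat_unit` is hypothetical and used only for illustration.

```python
def repeat_unit(segment):
    """Return (unit, count): the shortest unit whose `count` consecutive
    copies reproduce `segment` exactly.

    Hypothetical helper for illustration; it handles exact repeats only,
    not repeats with variations.
    """
    n = len(segment)
    for size in range(1, n + 1):
        # A unit can only tile the segment if its length divides n.
        if n % size == 0 and segment[:size] * (n // size) == segment:
            return segment[:size], n // size
    # Only reached for the empty string.
    return segment, 1


if __name__ == "__main__":
    for s in ("TCTCTCTC", "ATTCATTCATTC", "GATGATGAT", "ACGT"):
        print(s, repeat_unit(s))
```

A string with no internal repetition, such as ACGT, comes back as a single copy of itself.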

A compact notation for a repeat encloses the repeated substring in parentheses and gives the number of copies as a subscript. For example, TGTGTGTG can be written as (TG)₄.
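
As a rough illustration of this notation, the sketch below scans a sequence from left to right and rewrites each exact tandem repeat in the parenthesized form, writing the count as a plain digit because subscripts are not available in ordinary strings. It is a greedy sketch under stated assumptions, not the algorithm presented in this chapter: the function name `compress_tandem` and the parameters `max_unit` and `min_count` are hypothetical, and repeats with variations are not handled.

```python
def compress_tandem(seq, max_unit=6, min_count=2):
    """Rewrite exact tandem repeats in `seq` as (unit)count.

    Hypothetical greedy sketch: at each position, try unit lengths
    1..max_unit and keep the repeat that covers the most bases
    (ties go to the shorter unit).
    """
    out = []
    i = 0
    while i < len(seq):
        best_unit, best_count = seq[i], 1
        for size in range(1, max_unit + 1):
            unit = seq[i:i + size]
            if len(unit) < size:
                break  # ran off the end of the sequence
            # Count how many consecutive copies of `unit` start at i.
            count = 1
            while seq[i + count * size:i + (count + 1) * size] == unit:
                count += 1
            covered = count * size
            if count >= min_count and covered > len(best_unit) * best_count:
                best_unit, best_count = unit, count
        if best_count >= min_count:
            out.append(f"({best_unit}){best_count}")
        else:
            out.append(best_unit)  # a single base that is not repeated
        i += len(best_unit) * best_count
    return "".join(out)


if __name__ == "__main__":
    print(compress_tandem("TGTGTGTG"))              # (TG)4
    print(compress_tandem("TCTCTCTCATTCATTCATTC"))  # (TC)4(ATTC)3
```

Preferring the shorter unit on ties is what makes TGTGTGTG come out as (TG)4 rather than (TGTG)2.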

Tandem repeats may become much more complex when a region repeats with a minor variation. In an ...


ISBN: 9780763751869