CLAST
Clustering Biological Sequences
Vicente Molieri; Lina J. Karam; Zoé Lacroix School of Electrical, Computer, and Energy Engineering, Arizona State University, Tempe, AZ, USA
Abstract
Clustering sequences is important in a variety of applications, including development of nonredundant databases, function prediction, and identifying patterns of gene expression. Currently, clustering methods rely on a prealignment as supplementary information to guide the construction of clusters. This chapter introduces a novel algorithm to cluster nucleotide and peptide sequences. The algorithm is a no-reference approach that utilizes only the sequences as input. We also introduce a novel metric that is used to describe the relationship ...
Get Emerging Trends in Computational Biology, Bioinformatics, and Systems Biology now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.