June 2008
Beginner to intermediate
417 pages
10h 41m
English
The data generated from experiments in bioinformatics may contain several dimensions and be quite complicated. However, the dimensionality of the data may far exceed its complexity. A reduction in dimensionality often allows simpler algorithms to analyze the data effectively. The most common method of data reduction in bioinformatics is principal component analysis (PCA).
Principal component analysis is an often-used tool that reduces the dimensionality of a problem. Consider a set of vectors that lie in RN space. It is possible that the data is not scattered about but has an organization. When looked at in one view, the data looks scattered, but if viewed from a different orientation the ...