Machine learning for genetic variants

Research has shown that population groups from Asia, Europe, Africa, and the Americas can be separated on the basis of their genomic data. It is harder, however, to accurately predict an individual's haplogroup and continent of origin, that is, their geography, ethnicity, and language. Other research shows that Y chromosome lineages can be geographically localized, which provides evidence for clustering the alleles of human genotypes by geography.

Thus, the clustering of individuals is correlated with geographic origin and ancestry. Since race depends on ancestry as well, the clusters are also correlated with the more traditional concepts of race, but the correlation is not perfect since genetic variation ...
