Next-Generation Sequencing Data Analysis
methods that use this strategy. Among them, scmap [138] projects cells in
a query dataset onto a reference dataset (or combined references) to identify
matching individual cells (the scmap-cell mode) or specific cell types
(the scmap-cluster mode). In this method, the similarity between individual
cells in the query sample and cells or cell types in the reference is measured
using three similarity measures (Pearson correlation, Spearman correlation, and cosine similarity). Some of the
other methods are machine learning based, i.e., they first construct classifier
models from a reference dataset and then use the classifiers to annotate
individual cells or clusters from a query dataset. SingleCellNet (SCN), for
example, uses a random forest ...
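
The similarity-based projection described above for scmap can be illustrated with a small sketch. This is not scmap's actual implementation: the function names, the toy expression values, the majority-vote rule across the three measures, and the 0.7 cutoff are all assumptions made for illustration. It scores one query cell against per-cell-type reference profiles using Pearson correlation, Spearman correlation, and cosine similarity, and leaves the cell unassigned when the measures disagree or the best score is low.

```python
from collections import Counter

import numpy as np


def cosine(a, b):
    """Cosine similarity between two expression vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


def pearson(a, b):
    """Pearson correlation: cosine similarity of the mean-centered vectors."""
    return cosine(a - a.mean(), b - b.mean())


def spearman(a, b):
    """Spearman correlation: Pearson correlation of the ranks.

    Simplification: tied values are not given averaged ranks.
    """
    def rank(v):
        return np.argsort(np.argsort(v)).astype(float)

    return pearson(rank(a), rank(b))


def annotate_cell(query, centroids, threshold=0.7):
    """Assign a query cell to the most similar reference cell type.

    centroids: dict mapping a cell-type label to its reference profile.
    threshold: hypothetical minimum similarity needed to accept a label.
    """
    labels = list(centroids)
    votes = []  # one (best label, best score) pair per similarity measure
    for measure in (pearson, spearman, cosine):
        scores = [measure(query, centroids[label]) for label in labels]
        best = int(np.argmax(scores))
        votes.append((labels[best], scores[best]))

    # Assumed rule: at least two of the three measures must agree.
    winner, n_votes = Counter(label for label, _ in votes).most_common(1)[0]
    top_score = max(score for label, score in votes if label == winner)
    if n_votes < 2 or top_score < threshold:
        return "unassigned"
    return winner


# Toy reference profiles over five genes (made-up values).
centroids = {
    "T_cell": np.array([9.0, 1.0, 0.5, 7.0, 0.2]),
    "B_cell": np.array([0.3, 8.0, 6.0, 0.1, 1.5]),
}
query = np.array([8.0, 1.2, 0.4, 6.5, 0.3])

print(annotate_cell(query, centroids))
```

In practice the reference profiles would be computed over a selected set of informative genes rather than five hand-picked values, and the cutoff would be tuned to the dataset.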