One of the most exciting and practical goals of combining healthcare with technology is to mine large quantities of data to discover what, if anything, has eluded researchers—either through a lack of sufficiently large datasets or a lack of human ability to notice unlikely relationships. Unsupervised learning is a promising avenue for pursuing this goal, because unsupervised machine learning techniques do not require existing human knowledge to generate new insights about structure within datasets. This video, designed for learners with a basic understanding of statistics and computer programming, provides a detailed introduction to three specific types of unsupervised learning: cluster analysis, association analysis, and principal components analysis, as applied to health data sets both at the individual and population levels. Examples will be introduced in both Python and R.
Aileen Nielsen is a software engineer at One Drop, an AI/ML heathtech company working on diabetes-management products. A member of the New York City Bar Association’s Science and Law committee, Aileen holds degrees in anthropology, law, and physics from Princeton, Yale, and Columbia, respectively.