Adding more robustness to heteroscedastic noise with factor analysis

One of the main problems with standard PCA is the intrinsic weakness of such a model in terms of heteroscedastic noise. If you are not familiar with this terminology, it will be helpful to introduce two definitions. A multivariate decorrelated noise term is characterized by a diagonal covariance matrix, C, which can have two different configurations, as follows:

  • C = diag(σ2, σ2, ..., σ2): In this case, the noise is defined as homoscedastic (all of the components have the same variance).
  • C = diag(σ12, σ22, ..., σn2), with σ12 ≠ σ22 ≠ ... ≠ σn2: In this case, the noise is defined as heteroscedastic (every component has its own variance).

It's possible to prove that, when ...

Get Hands-On Unsupervised Learning with Python now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.