Skip to Content
Unstructured Data Analytics
book

Unstructured Data Analytics

by Jean Paul Isson
March 2018
Intermediate to advanced
432 pages
10h 7m
English
Wiley
Content preview from Unstructured Data Analytics

APPENDIX A Tech Corner Details

This Tech Corner section is for readers who are interested in getting more details regarding some linear algebra techniques commonly used in text analysis to reduce document-by-terms matrices and for information retrieval and computer vision. If you are not interested to learn more about these linear algebra techniques feel free to give this entire appendix section a skim. This chapter provides some useful details about linear algebra techniques we recalled throughout the book.

The three powerful techniques we will discuss here for matrices factorization and reduction include:

  • Singular value decomposition (SVD)
  • Principal component analysis (PCA)
  • QR factorization

For each technique we will provide a description of the technique along with a step-by-step example on how to apply the technique throughout an example.

SVD and QR are both matrix decomposition techniques. SVD and QR help to produce a reduced rank approximation of the document matrix, as we need to identify the dependence between rows (documents) and columns (terms).

To easily navigate through some technical details of this section, readers require basic knowledge of linear algebra and matrix calculations are prerequisites, including:

  • Space point
  • Scalars
  • Vectors
  • Matrices
  • Matrix reduction, matrix decomposition
  • Matrix diagonalization
  • Inverse matrices
  • Characteristic equations
  • Eigen-decomposition, eigenvalues, and eigenvectors
  • Orthogonal and orthonormal vectors
  • Gram–Schmidt orthonormalization ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Cleaning Data for Effective Data Science

Cleaning Data for Effective Data Science

David Mertz
Advanced Analytics and Deep Learning Models

Advanced Analytics and Deep Learning Models

Archana Mire, Shaveta Malik, Amit Kumar Tyagi

Publisher Resources

ISBN: 9781119129752Purchase book