July 2017
Intermediate to advanced
796 pages
18h 55m
English
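LDA (Latent Dirichlet Allocation) is a topic model that infers topics from a collection of text documents. It can be thought of as an unsupervised clustering algorithm in which topics play the role of cluster centers and documents play the role of the examples being clustered.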
To use LDA, first import the class:
import org.apache.spark.ml.clustering.LDA
Step 1. Initialize an LDA model with 10 topics and a maximum of 10 iterations:
scala> val lda = new LDA().setK(10).setMaxIter(10) ...
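The following is a minimal sketch of how such an LDA estimator might be fitted end to end. It is not the book's own continuation of the steps. The toy corpus, the column names (id, text, words, features), the object name LdaSketch, the local master setting, and the choice of 2 topics are all assumptions made for illustration. The sketch tokenizes a few documents, turns them into word-count vectors, fits the model, and prints the top terms per topic along with each document's topic mixture.

```scala
import org.apache.spark.ml.clustering.LDA
import org.apache.spark.ml.feature.{CountVectorizer, RegexTokenizer}
import org.apache.spark.sql.SparkSession

object LdaSketch {
  def main(args: Array[String]): Unit = {
    // Local session for experimentation only (assumption: no cluster needed).
    val spark = SparkSession.builder()
      .appName("LdaSketch")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Hypothetical toy corpus; a real job would load documents from storage.
    val docs = Seq(
      (0L, "spark cluster executor driver memory"),
      (1L, "driver executor spark shuffle memory"),
      (2L, "topic model document word distribution"),
      (3L, "word topic document inference model")
    ).toDF("id", "text")

    // Split each document into lowercase word tokens.
    val tokens = new RegexTokenizer()
      .setInputCol("text")
      .setOutputCol("words")
      .setPattern("\\W+")
      .transform(docs)

    // LDA expects a vector of term counts per document.
    val cvModel = new CountVectorizer()
      .setInputCol("words")
      .setOutputCol("features")
      .fit(tokens)
    val vectorized = cvModel.transform(tokens)

    // Same builder pattern as in the text, with fewer topics for the tiny corpus.
    val lda = new LDA()
      .setK(2)
      .setMaxIter(10)
      .setSeed(42L) // fixed seed so repeated runs are comparable
    val model = lda.fit(vectorized)

    // Top 3 term indices and weights per topic; indices point into cvModel.vocabulary.
    model.describeTopics(3).show(false)

    // Per-document topic mixture, added as the "topicDistribution" column.
    model.transform(vectorized).select("id", "topicDistribution").show(false)

    spark.stop()
  }
}
```

The term indices returned by describeTopics can be mapped back to words through cvModel.vocabulary. With a corpus this small the topics are not meaningful; the point is only the order of the calls.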