In this chapter, you learned about StreamingKMeans clustering, which is designed for streaming data. We discussed the two steps of the algorithm, the streaming step and the BallKMeans step, and ran Mahout's StreamingKMeans on the census1990 data. We also discussed the ClusterQualitySummarizer class, which is used to assess the quality of the resulting clusters. In the next chapter, we will discuss one more clustering algorithm implemented in Mahout: spectral clustering.