Data Mining (Under The Hood)
In this part of the book Data Science for Software Engineering: Sharing Data and Models, we offer some tutorial notes on commonly used software engineering applications of data mining, along with some tutorial material on data mining algorithms. Covered topics of SE problems include effort estimation and defect prediction. Covered aspects of data mining include discretization, column pruning (also known as feature selection), row pruning, clustering, contrast set learning, decision learning, and learning for continuous classes.
The last three chapters listed application areas of data mining in software engineering. This chapter discusses the internals of a data miner. In particular, it answers ...