By visualizing the data, it is possible to understand the meaning of the data by positioning it in a visual context. A numerical analysis of data can hide at first sight patterns, that is, trends and correlations that instead represent the basis of data mining. These characteristics are highlighted through graphs and diagrams that describe the nature of the data under analysis.
The first thing we can do is plot a data plot. In this way, we will be able to highlight the characteristics we have already analyzed with the statistical data returned by the describe() function. A boxplot is a graphical representation that's used to describe the distribution of a sample by simple dispersion and position indexes. As we said in ...