A precise, well-formed definition is one of the most critical requirements for shared understanding of data [1]
—ISO/IEC 11179-1
To illustrate some of the practices we learned about in the preceding chapters, let’s have a brief look at the characteristics of a few example datasets. We will examine some well-known datasets that pop up regularly in data analytics courses and textbooks for teaching about visualization, regression modeling, machine learning, and other data exploration methods. There are literally thousands of online datasets hosted by independent researchers, government ...