Exploring data using R

In the previous section, we found several patterns in the data, such as the behavior of the CitySize column. Now we are going to look at a better picture of the columns in the SourceData.Actions table using R. For this data exploration, we will use R Studio with additional libraries RODBC for access to SQL Server and ggplot2 for visualization of data.

When R Studio is started, let's write the following lines of code in R:


The install.packages() function installs both libraries. For the preceding code to succeed, we need access to the internet from the computer running R Studio. The install.packages() function downloads the package with all its prerequisites and ...

Get Hands-On Data Science with SQL Server 2017 now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.