Chapter 15

Using Open Source R for Data Science


check Grasping the basics concepts and vocabulary of R

check Exploring objects in R

check Previewing popular R packages

check Playing with more advanced R packages

R is an open-source, free statistical software system that, like Python, has been widely adopted across the data science sector over the past decade. In fact, a somewhat never-ending squabble takes place among data science types about which programming language is best suited for data science. Practitioners who favor R generally do so because of its advanced statistical programming and data visualization capabilities — capabilities that simply can’t be replicated in Python. When it comes to data science practitioners, specifically, R’s user base is broader than Python’s. (For more on Python, see Chapter 14.)

You can download the R programming language and the packages that support it from

R is not as easy to learn as Python, but R can be more powerful for certain types of advanced statistical analyses. Although R’s learning curve is somewhat steeper than ...

Get Data Science For Dummies, 2nd Edition now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.