Python vs. R for Data Science

by Michael Grogan
June 2018
Beginner to intermediate
3 pages
18m
English
O'Reilly Media, Inc.

Chapter 1. Python vs. R for Data Science

Introduction

Python and R are two of the mainstream languages in data science. Fundamentally, Python is a language for programmers, whereas R is a language for statisticians. In a data science context, the two overlap significantly in their capabilities for regression analysis and machine learning. Your choice of language will depend heavily on the environment in which you are operating. In a production environment, Python integrates with other languages much more seamlessly and is therefore the usual choice. R, however, is much more common in research environments because of its more extensive selection of libraries for statistical analysis.
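To make the overlap in regression analysis concrete, here is a minimal sketch of a one-predictor least-squares fit in plain Python. The function name `fit_line` and the data values are hypothetical, chosen only for illustration; they do not come from the book.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares with one predictor; returns (intercept, slope)."""
    x_bar, y_bar = mean(xs), mean(ys)
    # Slope is the sum of cross-deviations divided by the sum of squared x-deviations.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


# Made-up illustrative data (an assumption, not taken from the book).
xs = [1, 2, 3, 4, 5]
ys = [2.1, 4.0, 6.2, 7.9, 10.1]

intercept, slope = fit_line(xs, ys)
print(f"intercept={intercept:.2f}, slope={slope:.2f}")
```

In R, the same model would typically be fitted with `lm(y ~ x)`, and `coef()` on the result returns the corresponding intercept and slope. In practice, Python users would more likely reach for a library than hand-code the formulas; the sketch only shows that the underlying analysis is the same in both languages.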

Basics

Current version
Python: 3.6
R: 3.4.3

Self-defined as
Python: The official website describes Python as a programming language that lets you work quickly and integrate your systems effectively. This description emphasizes productivity as well as Python's use as a glue language.
R: R is an open source language designed for statistical analysis. As such, it is highly popular within fields such as data science, engineering, and other cognitive disciplines. The R Project for Statistical Computing describes R as an environment for “statistical computing and graphics.”

Strengths
Python: Python has significantly more flexibility in interacting with ...

ISBN: 9781492033943