Skip to Content
Machine Learning with R, the tidyverse, and mlr
book

Machine Learning with R, the tidyverse, and mlr

by Hefin Rhys
April 2020
Intermediate to advanced
536 pages
16h 55m
English
Manning Publications
Content preview from Machine Learning with R, the tidyverse, and mlr

A.1. Data vocabulary

Let’s start with some basic vocabulary we’ll be using to describe data. There are some variations in the way data scientists and statisticians use the terminology, so I’ll try to make it clear which terms are equivalent and which I opt to use throughout the book. In this section, we’ll discuss

  • The difference between a sample and a population
  • What we mean by rows, columns, cases, and variables
  • What the different types of variables are and how they differ

A.1.1. Sample vs. population

In data science and statistics, we’re usually trying to learn something about, or predict something in, the real world. Let’s say we’re interested in the tusk length of hippos. It would be impossible to measure the tusk length of every hippo ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Advanced Machine Learning with R

Advanced Machine Learning with R

Cory Lesmeister, Dr. Sunil Kumar Chinnamgari
R Machine Learning Projects

R Machine Learning Projects

Dr. Sunil Kumar Chinnamgari

Publisher Resources

ISBN: 9781617296574Publisher SupportOtherPublisher WebsiteSupplemental ContentErrata PagePurchase Link