Chapter 5. Logistic Regression

The contributor for this chapter is Brian Dalessandro. Brian works at Media6Degrees as a VP of data science, and he’s active in the research community. He’s also served as cochair of the KDD competition. M6D (also known as Media 6 Degrees) is a startup in New York City in the online advertising space. Figure 5-1 shows Brian’s data science profile—his y-axis is scaled from Clown to Rockstar.

Brian came to talk to the class about logistic regression and evaluation, but he started out with two thought experiments.

Thought Experiments

How would data science differ if we had a “grand unified theory of everything”? Take this to mean a symbolic explanation of how the world works. This one question raises a bunch of other questions:
- Would we even need data science if we had such a theory?
- Is it even theoretically possible to have such a theory? Do such theories lie only in the realm of, say, physics, where we can anticipate the exact return of a comet we see once a century?
- What’s the critical difference between physics and data science that makes such a theory implausible?
- Is it just accuracy? Or more generally, how much we imagine can be explained? Is it because we predict human behavior, which can be affected by our predictions, creating a feedback loop?
  
  It might be useful to think of the sciences as ...

Get Doing Data Science now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Doing Data Science by Cathy O'Neil, Rachel Schutt

Chapter 5. Logistic Regression

Figure 5-1. Brian’s data science profile

Thought Experiments

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly