Causal inference 101: Answering the crucial "why" in your analysis
conference


by Subhasish Misra
February 2020
Beginner to intermediate
36m
English
O'Reilly Media, Inc.
Closed Captioning available in German, English, Spanish, French, Japanese, Korean, Portuguese (Portugal, Brazil), Chinese (Simplified), Chinese (Traditional)

Overview

Causal questions are ubiquitous in data science. For example, you may have questions that are deeply rooted in causality, such as whether changing a feature on a website led to more traffic or whether digital ad exposure led to incremental purchases.

Randomized tests are considered the gold standard for measuring causal effects; however, in many cases experiments are infeasible or unethical. In such cases, you have to rely on observational (nonexperimental) data to derive causal insights. The crucial difference between randomized experiments and observational data is that in the former, test subjects (e.g., customers) are randomly assigned a treatment (e.g., digital advertisement exposure). This helps curb the possibility that user response (e.g., clicking on a link in the ad and purchasing the product) differs between the treated and nontreated groups because of preexisting differences in user characteristics (e.g., demographics or geolocation). In essence, you can then attribute divergences observed posttreatment in key outcomes (e.g., purchase rate) to the causal impact of the treatment. This random assignment mechanism is absent when using observational data.
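To make the contrast concrete, here is a minimal simulation sketch. It is not taken from the session: the `heavy_user` characteristic, the purchase probabilities, and the 0.05 lift are all hypothetical values chosen for illustration. It compares the naive difference in purchase rates between treated and nontreated users when ad exposure is randomly assigned versus when exposure depends on a preexisting characteristic.

```python
import random


def simulate(n, randomized, seed=0):
    """Return (naive difference in purchase rates, true effect).

    All numbers here are illustrative assumptions, not real data.
    """
    rng = random.Random(seed)
    true_effect = 0.05  # assumed lift in purchase probability from ad exposure
    treated, control = [], []
    for _ in range(n):
        # Hypothetical preexisting characteristic that also drives purchases.
        heavy_user = rng.random() < 0.5
        if randomized:
            p_treat = 0.5  # coin flip: exposure is unrelated to the user
        else:
            p_treat = 0.8 if heavy_user else 0.2  # heavy users see more ads
        t = rng.random() < p_treat
        p_buy = 0.10 + (0.20 if heavy_user else 0.0) + (true_effect if t else 0.0)
        y = 1 if rng.random() < p_buy else 0
        (treated if t else control).append(y)
    diff = sum(treated) / len(treated) - sum(control) / len(control)
    return diff, true_effect


if __name__ == "__main__":
    for label, flag in (("randomized", True), ("observational", False)):
        diff, effect = simulate(200_000, randomized=flag)
        print(f"{label:>13}: naive difference = {diff:.3f}, true effect = {effect:.3f}")
```

Under these assumptions, the randomized comparison should land close to the true effect, while the observational comparison overstates it because heavy users are both more likely to be exposed and more likely to purchase.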

Subhasish Misra (Walmart Labs) explores the statistical methods available to ensure you’re able to circumvent this shortcoming and get to causality. You’ll get a practical overview of the aspects of causal inference, including the fundamental tenets of causality and measuring causal effects; the challenges involved in measuring causal effects in real-world situations; distinguishing between randomized and observational measurement approaches; an introduction to measuring causal effects with observational data using matching and its extension of propensity score-based matching, with a focus on the intuition and statistics behind it; tips from the trenches based on Subhasish’s experience with these techniques; and practical limitations of these approaches. Subhasish walks you through an example of how matching was applied to get causal insights regarding the effectiveness of a digital product at Walmart.
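As a rough illustration of propensity score-based matching, the sketch below uses only the Python standard library and synthetic data. It is not the approach used at Walmart or shown in the session: the covariates `x1` and `x2`, the data-generating process, and the helper names (`make_data`, `fit_propensity`, `att_by_matching`, `naive_difference`) are all assumptions made for this example. It fits a logistic regression for the probability of treatment, matches each treated unit to the control with the nearest score (with replacement, no caliper), and averages the outcome differences to estimate the effect on the treated.

```python
import bisect
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def make_data(n, effect, seed=0):
    """Synthetic observational rows of (x1, x2, treated, outcome)."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x1, x2 = rng.gauss(0, 1), rng.gauss(0, 1)
        # Self-selection: covariates influence who gets treated.
        t = 1 if rng.random() < sigmoid(x1 + x2) else 0
        # The same covariates also influence the outcome (confounding).
        y = 2.0 * x1 + 0.5 * x2 + effect * t + rng.gauss(0, 0.5)
        rows.append((x1, x2, t, y))
    return rows


def fit_propensity(rows, steps=150, lr=1.0):
    """Logistic regression of treatment on covariates via gradient ascent."""
    b0 = b1 = b2 = 0.0
    n = len(rows)
    for _ in range(steps):
        g0 = g1 = g2 = 0.0
        for x1, x2, t, _y in rows:
            err = t - sigmoid(b0 + b1 * x1 + b2 * x2)
            g0 += err
            g1 += err * x1
            g2 += err * x2
        b0 += lr * g0 / n
        b1 += lr * g1 / n
        b2 += lr * g2 / n
    return b0, b1, b2


def att_by_matching(rows, coefs):
    """Nearest-neighbor matching on the propensity score, with replacement."""
    b0, b1, b2 = coefs
    scored = [(sigmoid(b0 + b1 * x1 + b2 * x2), t, y) for x1, x2, t, y in rows]
    controls = sorted((s, y) for s, t, y in scored if t == 0)
    c_scores = [s for s, _ in controls]
    diffs = []
    for s, t, y in scored:
        if t == 0:
            continue
        i = bisect.bisect_left(c_scores, s)
        # The nearest control is one of the two neighbors of the insertion point.
        nearest = min(
            (j for j in (i - 1, i) if 0 <= j < len(controls)),
            key=lambda j: abs(c_scores[j] - s),
        )
        diffs.append(y - controls[nearest][1])
    return sum(diffs) / len(diffs)


def naive_difference(rows):
    """Unadjusted difference in mean outcome, treated minus control."""
    treated = [y for _, _, t, y in rows if t == 1]
    control = [y for _, _, t, y in rows if t == 0]
    return sum(treated) / len(treated) - sum(control) / len(control)


if __name__ == "__main__":
    data = make_data(2000, effect=1.0)
    print("naive difference:", round(naive_difference(data), 2))
    print("matched estimate:", round(att_by_matching(data, fit_propensity(data)), 2))
```

In this synthetic setup the matched estimate should sit much closer to the assumed effect than the naive difference does. A real analysis would also check covariate balance after matching and consider a caliper to discard poor matches, and the result still rests on the assumption that all relevant confounders are observed.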

Prerequisite knowledge

  • A basic understanding of statistics and data science

What you'll learn

  • Discover the fundamental nuances of causal inference, along with analytical frameworks and implementation tools to tease out causal effects in the wild when randomization isn't an option
  • Understand the differences between randomized and observational studies and the challenges in getting to causal conclusions for each

This session is from the 2019 O'Reilly Strata Conference in New York, NY.



Publisher Resources

ISBN: 0636920372004