STA 235H - Potential Outcomes
Fall 2023
McCombs School of Business, UT Austin
1 / 46

How? Potential Outcomes Framework

What? Causal Estimands

Why? Causal Questions and Study Design

2 / 46

The "How": Potential outcomes framework

3 / 46

4 / 46

What do you think are the biggest issues here?

5 / 46

6 / 46

Before we start...

Be clear about your language
Be clear about your data
Be clear about your assumptions

7 / 46

What is Causal Inference?

Inferring the effect of one thing on another thing

8 / 46

What is Causal Inference?

Inferring the effect of one thing on another thing

"My headache went away because I took an aspirin".

8 / 46

What is Causal Inference?

Inferring the effect of one thing on another thing

"My headache went away because I took an aspirin".
"The new marketing campaign increased our sales by 20%"

8 / 46

What is Causal Inference?

Inferring the effect of one thing on another thing

"My headache went away because I took an aspirin".
"The new marketing campaign increased our sales by 20%"
"Providing students support when filling out FAFSA forms improves college access and completion."

8 / 46

A world of potential (outcomes)

Under a binary treatment or intervention, there are two potential worlds:

World 1: You take the pill
World 2: You don't take the pill

9 / 46

A world of potential (outcomes)

A potential outcome is the outcome under each of these scenarios or "worlds".
- There will be one for each path!

10 / 46

A world of potential (outcomes)

A potential outcome is the outcome under each of these scenarios or "worlds".
- There will be one for each path!
A priori, each of these scenarios has a potential outcome
A posteriori, I can only observe at most one of the potential outcomes

10 / 46

A world of potential (outcomes)

A potential outcome is the outcome under each of these scenarios or "worlds".
- There will be one for each path!
A priori, each of these scenarios has a potential outcome
A posteriori, I can only observe at most one of the potential outcomes

Fundamental Problem of Causal Inference

10 / 46

What are the potential outcomes for our previous example?

11 / 46

Potential Outcomes Examples"My headache went away because I took an aspirin".
12 / 46

Potential Outcomes Examples

"My headache went away because I took an aspirin".

Headache status if I take an aspirin/ Headache status if I don't take an aspirin

12 / 46

Potential Outcomes Examples

"My headache went away because I took an aspirin".

Headache status if I take an aspirin/ Headache status if I don't take an aspirin

"The new marketing campaign increased our sales by 20%"

12 / 46

Potential Outcomes Examples

"My headache went away because I took an aspirin".

Headache status if I take an aspirin/ Headache status if I don't take an aspirin

"The new marketing campaign increased our sales by 20%"

"Providing students support when filling out FAFSA forms improves college access and completion."

12 / 46

Let's see a specific exampleYou work at a retail company and you are debating on whether to send out an email campaign to boost your sales:
13 / 46

Let's see a specific example

You work at a retail company and you are debating on whether to send out an email campaign to boost your sales:
You are interested in two specific outcomes:

Sales: Whether a customer makes a purchase or not.

Churn: Whether a customer unsubscribes for your mailing list or not.

13 / 46

Potential Outcomes Framework

Let's introduce some notation:

Let $Y_{i}$ be the observed outcome for unit $i$ (e.g. whether a person makes a purchase or not).
Let $Z_{i}$ be the treatment or intervention (e.g. receiving a promotional email (1) or not (0)).
Let $Y_{i} (z)$ be the potential outcome under treatment $Z = z$ . (e.g. whether the person would make a purchase or not if they received treatment z).

14 / 46

Potential Outcomes Framework

Let's introduce some notation:

Let $Y_{i}$ be the observed outcome for unit $i$ (e.g. whether a person makes a purchase or not).
Let $Z_{i}$ be the treatment or intervention (e.g. receiving a promotional email (1) or not (0)).
Let $Y_{i} (z)$ be the potential outcome under treatment $Z = z$ . (e.g. whether the person would make a purchase or not if they received treatment z).

Then, if a person is treated, $Z_{i} = 1$ , then their observed outcome $Y_{i}$ will be the same as their potential outcome under treatment, $Y_{i} (1)$

$Y_{i} | (Z_{i} = 1) \overset{Δ}{=} Y_{i} (1)$

14 / 46

Potential Outcomes Framework

Let's introduce some notation:

Let $Y_{i}$ be the observed outcome for unit $i$ (e.g. whether a person makes a purchase or not).
Let $Z_{i}$ be the treatment or intervention (e.g. receiving a promotional email (1) or not (0)).
Let $Y_{i} (z)$ be the potential outcome under treatment $Z = z$ . (e.g. whether the person would make a purchase or not if they received treatment z).

Then, if a person is treated, $Z_{i} = 1$ , then their observed outcome $Y_{i}$ will be the same as their potential outcome under treatment, $Y_{i} (1)$

$Y_{i} | (Z_{i} = 1) \overset{Δ}{=} Y_{i} (1)$ In the same fashion, if a person is not treated, $Z_{i} = 0$ , then their observed outcome $Y_{i}$ will be the same as their potential outcome under control, $Y_{i} (0)$

$Y_{i} | (Z_{i} = 0) \overset{Δ}{=} Y_{i} (0)$

14 / 46

Potential Outcomes Framework

This means that we can write the observed outcome as a function of the potential outcomes:

$\to Y_{i} = Z_{i} \cdot Y_{i} (1) + (1 - Z_{i}) \cdot Y_{i} (0)$

This definition will be useful because we can see this as a missing data problem.

15 / 46

Causal Effects

Individual Causal Effect

$I C E_{i} = Y_{i} (1) - Y_{i} (0)$

16 / 46

Causal Effects

Individual Causal Effect

$I C E_{i} = Y_{i} (1) - Y_{i} (0)$
Can we ever observe individual causal effects?

17 / 46

Causal Effects

Individual Causal Effect

$I C E_{i} = Y_{i} (1) - Y_{i} (0)$

Can we ever observe individual causal effects?

No!*

18 / 46

Only one realization

Z=1

Z=0

19 / 46

The "What": Causal estimands, estimates, and estimators

20 / 46

Estimands vs Estimates vs Estimators

Estimand

A quantity we want to estimate

Estimate

The result of an estimation

Estimator

A rule for calculating
an estimate based on data

21 / 46

Estimands vs Estimates vs Estimators

Estimand

A quantity we want to estimate

E.g.: Population mean

$μ$

Estimate

The result of an estimation

E.g.: Result of the sample mean
for a given sample S

$\hat{μ}$

Estimator

A rule for calculating
an estimate based on data

E.g.: Sample mean

$\frac{1}{n} \sum_{i} Y_{i}$

22 / 46

Estimands vs Estimates vs Estimators

Source: Deng, 2022

23 / 46

Estimands vs Estimates vs Estimators

Some important estimands that we need to keep in mind:

Average Treatment Effect (ATE)

Average Treatment Effect on the Treated (ATT)

Conditional Average Treatment Effect (CATE)

24 / 46

Estimands vs Estimates vs Estimators

Some important estimands that we need to keep in mind:

ATE: E.g. Average Treatment Effect for all customers

ATT: E.g. Average Treatment Effect for customers that received the email

CATE: E.g. Average Treatmenf Effect for customer under 25 years old

25 / 46

Estimands vs Estimates vs Estimators

Some important estimands that we need to keep in mind:

$A T E = E [Y (1) - Y (0)]$

$A T T = E [Y (1) - Y (0) | Z = 1]$

$C A T E = E [Y (1) - Y (0) | X]$

26 / 46

Getting around the fundamental problem of causal inferenceLet's go back to our original example: Does an email campaign increase sales?
i
Z
Y
Y(1)
Y(0)
Y(1)-Y(0)
1
0
0
?
0
?
2
1
0
0
?
?
3
1
1
1
?
?
4
0
1
?
1
?
5
0
0
?
0
?
6
1
1
1
?
?
27 / 46

Getting around the fundamental problem of causal inferenceWe have a missing data problem
i
Z
Y
Y(1)
Y(0)
Y(1)-Y(0)
1
0
0
?
0
?
2
1
0
0
?
?
3
1
1
1
?
?
4
0
1
?
1
?
5
0
0
?
0
?
6
1
1
1
?
?
28 / 46

Getting around the fundamental problem of causal inferenceCompare those who received the email to the ones did not received the email.
i
Z
Y
Y(1)
Y(0)
Y(1)-Y(0)
1
0
0
?
0
?
2
1
0
0
?
?
3
1
1
1
?
?
4
0
1
?
1
?
5
0
0
?
0
?
6
1
1
1
?
?
29 / 46

Getting around the fundamental problem of causal inferenceCompare those who received the email to the ones did not received the email.
i
Z
Y
Y(1)
Y(0)
Y(1)-Y(0)
1
0
0
?
0
?
2
1
0
0
?
?
3
1
1
1
?
?
4
0
1
?
1
?
5
0
0
?
0
?
6
1
1
1
?
?
30 / 46

Getting around the fundamental problem of causal inference

Compare those who received the email to the ones did not received the email.

$\hat{τ} = \frac{1}{3} \sum_{i \in Z = 1} Y_{i} - \frac{1}{3} \sum_{i \in Z = 0} Y_{i} = 0.333$

31 / 46

Getting around the fundamental problem of causal inference

I we had more data, we could do the same with a simple regression:

$P u r c h a s e = β_{0} + β_{1} E m a i l + ε$

32 / 46

Getting around the fundamental problem of causal inference

I we had more data, we could do the same with a simple regression:

$P u r c h a s e = β_{0} + β_{1} E m a i l + ε$

Imagine you get the following results:

$P u r c h a s e = 0.4 + 0.33 E m a i l + ε$

Interpret the coefficient for Email:

32 / 46

What could be the problem with comparing the sample means?

33 / 46

Let's do a little exercise

34 / 46

Look at your green piece of paper and go to the following website

https://sta235h.click/week4

Would you go to a physician/urgent care?

35 / 46

The "Why": Causal questions and study designs

36 / 46

Under what assumptions is our estimate causal?

We are using: $\hat{τ} = \frac{1}{3} \sum_{i \in Z = 1} Y_{i} - \frac{1}{3} \sum_{i \in Z = 0} Y_{i})$ to estimate:

$τ = E [Y_{i} (1) - Y_{i} (0)]$

37 / 46

Under what assumptions is our estimate causal?

We are using: $\hat{τ} = \frac{1}{3} \sum_{i \in Z = 1} Y_{i} - \frac{1}{3} \sum_{i \in Z = 0} Y_{i})$

to estimate:

$τ = E [Y_{i} (1) - Y_{i} (0)]$

Let's do some math

38 / 46

Under what assumptions is our estimate causal?

$τ = E [Y_{i} (1) - Y_{i} (0)]$ $= E [Y_{i} (1)] - E [Y_{i} (0)]$

39 / 46

Under what assumptions is our estimate causal?

$τ = E [Y_{i} (1) - Y_{i} (0)]$ $= E [Y_{i} (1)] - E [Y_{i} (0)]$ Key assumption:

Ignorability

Ignorability means that the potential outcomes $Y (0)$ and $Y (1)$ are independent of the treatment, e.g. $(Y (0), Y (1)) ⊥ ⊥ Z$ .

$E [Y_{i} (1) | Z = 0] = E [Y_{i} (1) | Z = 1] = E [Y_{i} (1)]$ and

$E [Y_{i} (0) | Z = 0] = E [Y_{i} (0) | Z = 1] = E [Y_{i} (0)]$

39 / 46

Under what assumptions is our estimate causal?

$τ = E [Y_{i} (1) - Y_{i} (0)]$ $= E [Y_{i} (1)] - E [Y_{i} (0)]$

Under ignorability (see previous slide), $E [Y_{i} (1)] = E [Y_{i} (1) | Z = 1] = E [Y_{i} | Z = 1]$ and $E [Y_{i} (0)] = E [Y_{i} (0) | Z = 0] = E [Y_{i} | Z = 0]$ , then:

$τ = E [Y_{i} (1)] - E [Y_{i} (0)] = \underset{Obs. Outcome for T}{\underset{⏟}{E [Y_{i} (1) | Z = 1]}} - \overset{Obs. Outcome for C}{\overset{⏞}{E [Y_{i} (0) | Z = 0]}}$

40 / 46

Ignorability Assumption

We can just "ignore" the missing data problem:

41 / 46

Ignorability Assumption

We can just "ignore" the missing data problem:

42 / 46

Ignorability Assumption

We can just "ignore" the missing data problem:

43 / 46

Main takeaway points

Causal Inference is hard

44 / 46

Main takeaway points

Causal Inference is hard

Think about the causal problem

44 / 46

Main takeaway points

Causal Inference is hard

Think about the causal problem
Check validity of assumptions (Is ignorability plausible? Am I controlling for the right covariates?)

44 / 46

Main takeaway points

Causal Inference is hard

Think about the causal problem
Check validity of assumptions (Is ignorability plausible? Am I controlling for the right covariates?)
Most of this chapter will be spent on looking for exogeneous variation to make the ignorability assumption happen.

44 / 46

Next week

Randomized Controlled Trials:
- Pros and Cons
- Concept of validity
- A/B Testing

45 / 46

References

Angrist, J. & S. Pischke. (2015). "Mastering Metrics". Chapter 1.
Cunningham, S. (2021). "Causal Inference: The Mixtape". Chapter 4: Potential Outcomes Causal Model.
Neil, B. (2020). "Introduction to Causal Inference". Fall 2020 Course

↑, ←, Pg Up, k	Go to previous slide
↓, →, Pg Dn, Space, j	Go to next slide
Home	Go to first slide
End	Go to last slide
Number + Return	Go to specific slide
b / m / f	Toggle blackout / mirrored / fullscreen mode
c	Clone slideshow
p	Toggle presenter mode
t	Restart the presentation timer
?, h	Toggle this help
s	Toggle scribble toolbox

STA 235H - Potential Outcomes

Fall 2023

McCombs School of Business, UT Austin

Before we start...

What is Causal Inference?

What is Causal Inference?

What is Causal Inference?

What is Causal Inference?

A world of potential (outcomes)

A world of potential (outcomes)

A world of potential (outcomes)

A world of potential (outcomes)

Potential Outcomes Examples

Potential Outcomes Examples

Potential Outcomes Examples

Potential Outcomes Examples

Let's see a specific example

Let's see a specific example

Potential Outcomes Framework

Potential Outcomes Framework

Potential Outcomes Framework

Potential Outcomes Framework

Causal Effects

Causal Effects

Causal Effects

Only one realization

Estimands vs Estimates vs Estimators

Estimands vs Estimates vs Estimators

Estimands vs Estimates vs Estimators

Estimands vs Estimates vs Estimators

Estimands vs Estimates vs Estimators

Estimands vs Estimates vs Estimators

Getting around the fundamental problem of causal inference

Getting around the fundamental problem of causal inference

Getting around the fundamental problem of causal inference

Getting around the fundamental problem of causal inference

Getting around the fundamental problem of causal inference

Getting around the fundamental problem of causal inference

Getting around the fundamental problem of causal inference

Under what assumptions is our estimate causal?

Under what assumptions is our estimate causal?

Under what assumptions is our estimate causal?

Under what assumptions is our estimate causal?

Under what assumptions is our estimate causal?

Ignorability Assumption

Ignorability Assumption

Ignorability Assumption

Main takeaway points

Main takeaway points

Main takeaway points

Main takeaway points

Next week

References

Help