POLSCI 4SS3
Winter 2023
No class on March 29
Optional lab that week
Will use flex session on April 12 to catch up
No office hours between March 23-29
March 22 (today): Beyond Experimentation
March 29: NO CLASS
April 5: Quasi-experiments I
April 12: Quasi-experiments II
April 21: Final project due
Field experiments as the gold standard to evaluate policy
Many choices in research design and implementation
Today: How do we learn from experiments?
We want to make statements about causation
To back up those statements, we need to rule out confounding factors
(coming soon!)
ID | Female | Y(0) | Y(1) |
---|---|---|---|
1 | 0 | 0 | 0 |
2 | 0 | 0 | 1 |
3 | 1 | 0 | 1 |
4 | 1 | 1 | 1 |
\(Y(*)\) are the potential outcomes under control (0)
and treatment (1)
, respectively
\(Y(*) = 1\) means person’s life improves, \(Y(*) = 0\) means life stays the same
ID | Female | Y(0) | Y(1) |
---|---|---|---|
1 | 0 | 0 | 0 |
2 | 0 | 0 | 1 |
3 | 1 | 0 | 1 |
4 | 1 | 1 | 1 |
We have:
ID | Female | Y(0) | Y(1) | Z |
---|---|---|---|---|
1 | 0 | 0 | 0 | 0 |
2 | 0 | 0 | 1 | 0 |
3 | 1 | 0 | 1 | 1 |
4 | 1 | 1 | 1 | 1 |
We happened to randomly assign the policy to the two women
We only observe the potential outcomes that corresponds to the treatment status
ID | Female | Y(0) | Y(1) | Z | Y obs |
---|---|---|---|---|---|
1 | 0 | 0 | 0 | 0 | 0 |
2 | 0 | 0 | 1 | 0 | 0 |
3 | 1 | 0 | 1 | 1 | 1 |
4 | 1 | 1 | 1 | 1 | 1 |
\[ATE = E[Y(1)] - E[Y(0)] = 3/4 - 1/4 = 1/2\]
ID | Female | Y(0) | Y(1) | Z | Y obs |
---|---|---|---|---|---|
1 | 0 | 0 | 0 | 0 | 0 |
2 | 0 | 0 | 1 | 0 | 0 |
3 | 1 | 0 | 1 | 1 | 1 |
4 | 1 | 1 | 1 | 1 | 1 |
We can approximate the ATE with \(\widehat{ATE} = 2/2 - 0/2 = 1\)
We are off the mark! What happens if we redo the experiment?
ID | Female | Y(0) | Y(1) | Z | Y obs |
---|---|---|---|---|---|
1 | 0 | 0 | 0 | 1 | 0 |
2 | 0 | 0 | 1 | 0 | 0 |
3 | 1 | 0 | 1 | 1 | 1 |
4 | 1 | 1 | 1 | 0 | 1 |
We still have \(ATE = 1/2\)
But now \(\widehat{ATE} = 1/2 - 1/2 = 0\)
Off the mark in the opposite direction
Experiment 1 |
Experiment 2 |
||||||
---|---|---|---|---|---|---|---|
ID | Female | Y(0) | Y(1) | Z | Y obs | Z | Y obs |
1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 |
3 | 1 | 0 | 1 | 1 | 1 | 1 | 1 |
4 | 1 | 1 | 1 | 1 | 1 | 0 | 1 |
Perhaps men and women react to the policy differently
We want to rule out results depending on whether we assign treatments to men or women
Experiment 1: 2/2 women in treatment and 0/2 in control (imbalanced)
Experiment 2: 1/2 woman in treatment and 1/2 in control (balanced)
Experiments only rule out the role of potential confounders IN EXPECTATION
We can sustain this claim in two ways
With a sufficiently large sample (But how large is large enough?)
By repeating the same experiment multiple times (Nobody does this)
We only know bias, RMSE, and power in our simulations
Need a lot of domain expertise to attribute ATE to policy
This involves explaining why it works
First step toward knowing whether it would work somewhere else
Internal validity: We can (probabilistically)
attribute effect to policy intervention
External validity: Whether effect extrapolates or generalizes
Extrapolation: Whether it works elsewhere
Generalization: Whether it works everywhere
Example: A house burns down because the television was left on
Not all houses with TVs left on burn down, but sometimes they do, perhaps because the wiring was poor
A support factor is one part of the causal pie
Causal pie: A set of causes that are jointly but not separately sufficient for a contribution to an effect (INUS causation)
Analogy: TUP only works if we have good schools
Scaling up: Whether we can apply intervention to broader area
Small scale interventions can become unfeasible or cost-prohibitive in a larger scale
Some policies only work at a small scale!
Drilling down: Can we apply the results of an intervention to individual units?
Just because it works on average, it does not mean that everyone will benefit from it
May waste money on people for whom the policy does not work
This can be unethical
(more or less)
the same policyGoals:
Establish whether a policy is generally advisable (pooling results)
Understand why things work in some places but not others (support factors)
CPRs: Non-excludable, rivalrous
Examples?
Problem: Prone to congestion, overextraction
Country | Resource | Community | Threat |
---|---|---|---|
Brazil | Groundwater | Rural villages | Drought, overuse |
China | Surface water | Urban neighborhoods | Pollution |
Costa Rica | Groundwater | Rural villages | Drought, overuse |
Liberia | Forest | Villages | Overcutting |
Peru | Forest | Indigenous communities | Extraction |
Uganda | Forest | Villages | Overcutting |
Dissemination |
|||||
---|---|---|---|---|---|
Country | Wokshops | Training | Monitoring | Citizens | Management |
Brazil | X | X | X | X | |
China | X | X | X | ||
Costa Rica | X | X | X | X | X |
Liberia | X | X | X | X | X |
Peru | X | X | X | X | X |
Uganda | X | X | X | X | X |
Back on April 5!