POLSCI 4SS3
Winter 2023
Two more weeks left!
Final projects due April 21
Groups need to meet with instructor one more time before April 19
Extra office hour times April 13-19
Every group member needs to be in at least one group meeting to receive group meeting grade
Learning from experiments
Good to understand what works, but not why or where
Need to think about support factors
Scaling up, drilling down
Today: Observational causal data strategies
Data strategy |
||
---|---|---|
Inquiry | Observational | Experimental |
Descriptive | Sample survey | List experiment |
Causal | Quasi-experiment | Survey/field experiment |
Data strategy |
||
---|---|---|
Inquiry | Observational | Experimental |
Descriptive | Sample survey | List experiment |
Causal | Quasi-experiment | Survey/field experiment |
Example
Students who are likely to participate enroll in Political Science courses more often
Example
We believe that more education increases income
But having smart parents increases both education and income
Example
Random assignment avoids this in expectation
Hard to overcome with observational causal data strategies
Need to pretend that we can analyze data as if it was an experiment
Answer strategies that produce data as-if they were drawn from an experiment
Natural experiment: Random assignment outside of the researcher control
Example: Choosing municipalities at random for auditing
Quasi-experiment: Conditions are assigned in a manner that is sufficiently orthogonal to potential outcomes
Benefits eligible only to people under a certain income
Institutional variation based on arbitrary population thresholds
Winning an election by a narrow margin
Being barely inside/outside an administrative border
Unexpected events
Score (running variable)
Cutoff (threshold)
Treatment (at least two conditions)
Question: Effect of higher education on earnings
Challenge: Selection bias
Solution: Focus on attendance at US state flagship university among 28-33 year olds
Outcome: Earnings
Score: SAT test scores
Cutoff: Admission cutoff
Treatment: Attending flagship university
Local randomization
Continuity-based
Potential outcomes are not random because they depend on the score (and other things)
However, around the cutoff, treatment assignment is as good as random
So we can pretend we have an experiment within a bandwidth around the cutoff
1. Continuity
Score is continuous at the cutoff
2. Comparability
Units are comparable at or around the cutoff
Treatment assignment is deterministic at the cutoff
But usually too few or no units at the cutoff
Task: Approximate the gap at the cutoff as best as possible
This becomes a line drawing problem
Intuitive design
Widely applicable
Easy to visualize
Results are highly local
Scale up? Drill down?
Too many moving parts
Software automates most of the decisions
Ideally, results are consistent under different approaches
Focus on: Difference-in-differences design