(Rational) Addictions
I have an addiction. Perhaps more than one. But one for sure. And, so, when this image
In a very early post I discuss the difficulties created with measurement error in an otherwise exogenous binary covariate. Simulations show how an apparently valid instrument does not yield consistent estimates of the true coefficient in this case. The intuition behind this result is that measurement error in a binary variable cannot be classical measurement error; the measurement error must be negatively correlated with the truth (Black et al. 2000). Because the truth is correlated with the measurement error, any instrument that is correlated with the truth will also be correlated with the measurement error, rendering the instrument invalid.
Well, it's one thing to point out a problem. It's another to provide a solution. As I mentioned in the prior post (but did not go into detail), Nguimkeu et al. (2019) provide a solution that applied researchers should get to know. Their estimator allows for both endogeneity in the true binary covariate and (one-sided) measurement error. Even better, it is trivial to estimate in Stata.
Specifically, it assumes the measurement error is one-sided: if D* = 0, then D = 0, but if D* = 1, then D may be either 0 or 1. In other words, the model allows for the presence of false negatives, but not false positives. This may be reasonable in situations where there is stigma associated with D* and, therefore, individuals are only likely to misreport the absence of D* and not its presence. Aside from this, identification requires two instruments: a variable in z that is not included in x and a variable in w that is not in z. Finally, ε needs to be independent of x and z (but can be correlated with w); ν and υ need to be independent of x, z, and w.
Given this setup, estimation is accomplished using a simple two-step approach. In the first step, Poirier's (1980) partial observability model is estimated under the assumption that the errors are jointly normal. This model is similar to a bivariate probit, but with only a single outcome. In this case, we have
D = δD* = I(zθ + ν > 0, wγ + υ > 0)
which is estimable by maximum likelihood. In Stata, this is estimated using the -biprobit, partial- command.
In the second step,
y = xβ + αD*-hat + ε
is estimated by Ordinary Least Squares (OLS) where
D*-hat = Φ(zθ-hat)
and Φ() is the standard normal CDF. Appropriate standard errors can be obtained by bootstrapping the two-step estimator. The estimate of α is consistent under the required assumptions.
Indeed. And, while the authors' do note the strength of the requirements for identification, they state: "To our knowledge, this paper is the first attempt to provide point estimates of treatment effects in the context of endogenous misreporting of a binary treatment variable."
That brings us to another possible solution when dealing with a mismeasured and endogenous binary covariate: partial identification. The general idea of partial identification is also covered in a prior post. Partial identification is my second addiction after measurement error. So, put the two together and you get
To get a rough idea of the partial identification approach, consider the population Average Treatment Effect (ATE) of D* given by
E[Y(1) - Y(0)],
where Y(1), Y(0) are potential outcomes associated with treatment (denoted by D* = 1) and non-treatment (D* = 0). Focus on one aspect of this parameter, E[Y(1)]. Treatment of E[Y(0)] is analogous. E[Y(1)] is equivalent to
E[Y(1) | D* = 1]*Pr(D* = 1) + E[Y(1) | D* = 0]*Pr(D* = 0).
In principle, E[Y(1) | D* = 1], Pr(D* = 1), and Pr(D* = 0) can be observed. However, even in the absence of measurement error, E[Y(1) | D* = 0] can never be observed. This is the problem of the missing counterfactual. One can impose strong assumptions to point identify this quantity (e.g., independence or conditional independence assumptions).
- Probability of a false positive with Y = 1
- Probability of a false negative with Y = 1
- Probability of a false positive with Y = 0
- Probability of a false negative with Y = 0
Partial identification is perfectly suited for assessing the impact of a binary covariate when the variable is endogenous and potentially misreported. Of course, the down side is that partial identification is not point identification. But, point identification under faulty assumptions is not identification either. Researchers must be wary of their own addiction.
To incredible certitude.
References
Manski, C.F. (1990), "Nonparametric Bounds on Treatment Effects," American Economic Review, 80, 319-323
McCarthy, I., D.L. Millimet, and M. Roy (2015), "Bounding Treatment Effects: Stata Command for the Partial Identification of the Average Treatment Effect with Endogenous and Misreported Treatment Assignment," Stata Journal, 15, 411-436