Downloads

Image representing the file: CWP411818.pdf

CWP411818.pdf

PDF | 621.91 KB

We develop a theory for estimation of a high-dimensional sparse parameter 𝛳 defi ned as a minimizer of a population loss function LD(𝛳,g0) which, in addition to 𝛳, depends on a, potentially infi nite dimensional, nuisance parameter g0. Our approach is based on estimating 𝛳 via an l1-regularized minimization of a sample analog of Ls(𝛳,ĝ), plugging in a fi rst-stage estimate ĝ, computed on a hold-out sample. We defi ne a population loss to be (Neyman) orthogonal if the gradient of the loss with respect to 𝛳, has pathwise derivative with respect to g equal to zero, when evaluated at the true parameter and nuisance component. We show that orthogonality implies a second-order impact of the fi rst stage nuisance error on the second stage target parameter estimate. Our approach applies to both convex and non-convex losses, albeit the latter case requires a small adaptation of our method with a preliminary estimation step of the target parameter. Our result enables oracle convergence rates for 𝛳 under assumptions on the first stage rates, typically of the order of n1/4.

We show how such an orthogonal loss can be constructed via a novel orthogonalization process for a general model de fined by conditional moment restrictions. We apply our theory to high-dimensional versions of standard estimation problems in statistics and econometrics, such as: estimation of conditional moment models with missing data, estimation of structural utilities in games of incomplete information and estimation of treatment effects in regression models with non-linear link functions.