Publications by author | Cowles Foundation for Research in Economics

Abstract

In complicated/nonlinear parametric models, it is generally hard to determine whether the model parameters are (globally) point identified. We provide computationally attractive procedures to construct confidence sets (CSs) for identified sets of parameters in econometric models defined through a likelihood or a vector of moments. The CSs for the identified set or for a function of the identified set (such as a subvector) are based on inverting an optimal sample criterion (such as likelihood or continuously updated GMM), where the cutoff values are computed via Monte Carlo simulations directly from a quasi posterior distribution of the criterion. We establish new Bernstein-von Mises type theorems for the posterior distributions of the quasi-likelihood ratio (QLR) and profile QLR statistics in partially identified models, allowing for singularities. These results imply that the Monte Carlo criterion-based CSs have correct frequentist coverage for the identified set as the sample size increases, and that they coincide with Bayesian credible sets based on inverting a LR statistic for point-identified likelihood models. We also show that our Monte Carlo optimal criterion-based CSs are uniformly valid over a class of data generating processes that include both partially- and point-identified models. We demonstrate good finite sample coverage properties of our proposed methods in four non-trivial simulation experiments: missing data, entry game with correlated payoff shocks, Euler equation and finite mixture models. Finally, our proposed procedures are applied in two empirical examples.

Abstract

In complicated/nonlinear parametric models, it is generally hard to know whether the model parameters are point identified. We provide computationally attractive procedures to construct confidence sets (CSs) for identified sets of full parameters and of subvectors in models defined through a likelihood or a vector of moment equalities or inequalities. These CSs are based on level sets of optimal sample criterion functions (such as likelihood or optimally-weighted or continuously-updated GMM criterions). The level sets are constructed using cutoffs that are computed via Monte Carlo (MC) simulations directly from the quasi-posterior distributions of the criterions. We establish new Bernstein-von Mises (or Bayesian Wilks) type theorems for the quasi-posterior distributions of the quasi-likelihood ratio (QLR) and profile QLR in partially-identified regular models and some non-regular models. These results imply that our MC CSs have exact asymptotic frequentist coverage for identified sets of full parameters and of subvectors in partially-identified regular models, and have valid but potentially conservative coverage in models with reduced-form parameters on the boundary. Our MC CSs for identified sets of subvectors are shown to have exact asymptotic coverage in models with singularities. We also provide results on uniform validity of our CSs over classes of DGPs that include point and partially identified models. We demonstrate good finite-sample coverage properties of our procedures in two simulation experiments. Finally, our procedures are applied to two non-trivial empirical examples: an airline entry game and a model of trade flows.
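
The level-set construction described in this abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy model X_i ~ N(theta1 + theta2, 1), in which only the sum is identified; the flat prior on a grid; and weighted resampling from the grid standing in for a proper Monte Carlo sampler. The function name `mc_confidence_set` and its parameters are hypothetical.

```python
import math
import random


def mc_confidence_set(data, grid, alpha=0.05, n_draws=5000, seed=0):
    """Level-set confidence set with a cutoff simulated from the quasi-posterior.

    Toy model (an assumption for illustration): X_i ~ N(theta1 + theta2, 1),
    so only the sum theta1 + theta2 is identified.
    """
    rng = random.Random(seed)
    n = len(data)
    xbar = sum(data) / n
    # Gaussian log-likelihood up to an additive constant.
    loglik = [-0.5 * n * (xbar - (t1 + t2)) ** 2 for t1, t2 in grid]
    # Flat prior on the grid: quasi-posterior weights are proportional to exp(loglik).
    top = max(loglik)
    weights = [math.exp(value - top) for value in loglik]
    draws = rng.choices(range(len(grid)), weights=weights, k=n_draws)
    # Cutoff: the (approximate) alpha-quantile of the criterion across the draws.
    criterion_draws = sorted(loglik[i] for i in draws)
    cutoff = criterion_draws[int(alpha * n_draws)]
    # Confidence set: all grid points whose criterion is at least the cutoff.
    members = [theta for theta, value in zip(grid, loglik) if value >= cutoff]
    return cutoff, members


rng = random.Random(42)
data = [rng.gauss(1.0, 1.0) for _ in range(200)]
steps = [i / 20 for i in range(-40, 41)]  # theta1 and theta2 each range over [-2, 2]
grid = [(t1, t2) for t1 in steps for t2 in steps]
cutoff, members = mc_confidence_set(data, grid)
xbar = sum(data) / len(data)
print(len(members), "grid points in the set; cutoff", round(cutoff, 3))
```

In this toy case the resulting set is a band around the line theta1 + theta2 = sample mean, rather than a neighbourhood of a single point, which is the behaviour one would expect when the parameter is only partially identified.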

Abstract

In complicated/nonlinear parametric models, it is hard to determine whether a parameter of interest is formally point identified. We provide computationally attractive procedures to construct confidence sets (CSs) for identified sets of parameters in econometric models defined through a likelihood or a vector of moments. The CSs for the identified set or for a function of the identified set (such as a subvector) are based on inverting an optimal sample criterion (such as likelihood or continuously updated GMM), where the cutoff values are computed directly from Markov Chain Monte Carlo (MCMC) simulations of a quasi posterior distribution of the criterion. We establish new Bernstein-von Mises type theorems for the posterior distributions of the quasi-likelihood ratio (QLR) and profile QLR statistics in partially identified models, allowing for singularities. These results imply that the MCMC criterion-based CSs have correct frequentist coverage for the identified set as the sample size increases, and that they coincide with Bayesian credible sets based on inverting a LR statistic for point-identified likelihood models. We also show that our MCMC optimal criterion-based CSs are uniformly valid over a class of data generating processes that include both partially- and point-identified models. We demonstrate good finite sample coverage properties of our proposed methods in four non-trivial simulation experiments: missing data, entry game with correlated payoff shocks, Euler equation and finite mixture models.

Abstract

We propose new methods for estimating the bid-ask spread from observed transaction prices alone. Our methods are based on the empirical characteristic function rather than the sample autocovariance function used by the method of Roll (1984). As in Roll (1984), we have a closed form expression for the spread, but it uses only a limited amount of the model-implied identification restrictions. We also provide methods that take account of more identification information. We compare our methods theoretically and numerically with the Roll method as well as with its best-known competitor, the Hasbrouck (2004) method, which uses a Bayesian Gibbs methodology under a Gaussian assumption. Our estimators are competitive with Roll’s and Hasbrouck’s when the latent true fundamental return distribution is Gaussian, and perform much better when this distribution is far from Gaussian. Our methods are applied to the E-mini futures contract on the S&P 500 during the Flash Crash of May 6, 2010. Extensions to models allowing for unbalanced order flow or Hidden Markov trade direction indicators or trade direction indicators having general asymmetric support or adverse selection are also presented, without requiring additional data.
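
For orientation, the autocovariance-based Roll (1984) benchmark mentioned in this abstract can be sketched as follows. This is the classical Roll estimator, not the characteristic-function method the paper proposes. The simulation design (random-walk fundamental value, i.i.d. trade directions, the specific parameter values) and the helper name `simulate_prices` are assumptions made here for illustration.

```python
import math
import random


def roll_spread(prices):
    """Roll (1984) spread estimate: 2 * sqrt(-Cov(dp_t, dp_{t-1})).

    Returns NaN when the sample autocovariance is non-negative,
    in which case the square root is undefined.
    """
    dp = [b - a for a, b in zip(prices, prices[1:])]  # price changes
    cur, lag = dp[1:], dp[:-1]
    mean_cur = sum(cur) / len(cur)
    mean_lag = sum(lag) / len(lag)
    cov = sum((c - mean_cur) * (l - mean_lag) for c, l in zip(cur, lag)) / len(cur)
    return 2.0 * math.sqrt(-cov) if cov < 0 else float("nan")


def simulate_prices(n, spread, sigma, seed=0):
    """Hypothetical data generator: random-walk fundamental value plus
    half the spread times an i.i.d. buy/sell indicator in {-1, +1}."""
    rng = random.Random(seed)
    value = 100.0
    prices = []
    for _ in range(n):
        value += rng.gauss(0.0, sigma)
        direction = rng.choice((-1, 1))
        prices.append(value + 0.5 * spread * direction)
    return prices


prices = simulate_prices(n=20000, spread=0.10, sigma=0.01, seed=1)
estimate = roll_spread(prices)
print(f"true spread 0.10, Roll estimate {estimate:.4f}")
```

The NaN branch reflects a known limitation of the autocovariance approach: when the sample first-order autocovariance of price changes is positive, the formula has no real-valued solution.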

Abstract

This paper reviews recent advances in estimation and inference for nonparametric and semiparametric models with endogeneity. It first describes methods of sieves and penalization for estimating unknown functions identified via conditional moment restrictions. Examples include nonparametric instrumental variables regression (NPIV), nonparametric quantile IV regression and many more semi-nonparametric structural models. Asymptotic properties of the sieve estimators and of the sieve Wald and quasi-likelihood ratio (QLR) hypothesis tests of functionals with nonparametric endogeneity are presented. For sieve NPIV estimation, the rate-adaptive data-driven choices of sieve regularization parameters and the sieve score bootstrap uniform confidence bands are described. Finally, simple sieve variance estimation and an over-identification test for semiparametric two-step GMM are reviewed. Monte Carlo examples are included.

Abstract

This paper considers estimation of semi-nonparametric GARCH filtered copula models in which the individual time series are modelled by semi-nonparametric GARCH and the joint distributions of the multivariate standardized innovations are characterized by parametric copulas with nonparametric marginal distributions. The models extend those of Chen and Fan (2006) to allow for semi-nonparametric conditional means and volatilities, which are estimated via the method of sieves such as splines. The fitted residuals are then used to estimate the copula parameters and the marginal densities of the standardized innovations jointly via the sieve maximum likelihood (SML). We show that, even using nonparametrically filtered data, both our SML and the two-step copula estimator of Chen and Fan (2006) are still root-n consistent and asymptotically normal, and the asymptotic variances of both estimators do not depend on the nonparametric filtering errors. Even more surprisingly, our SML copula estimator using the filtered data achieves the full semiparametric efficiency bound as if the standardized innovations were directly observed. These nice properties lead to simple and more accurate estimation of Value-at-Risk (VaR) for multivariate financial data with flexible dynamics, contemporaneous tail dependence and asymmetric distributions of innovations. Monte Carlo studies demonstrate that our SML estimators of the copula parameters and the marginal distributions of the standardized innovations have smaller variances and smaller mean squared errors compared to those of the two-step estimators in finite samples. A real data application is presented.

Abstract

This paper considers semiparametric two-step GMM estimation and inference with weakly dependent data, where unknown nuisance functions are estimated via sieve extremum estimation in the first step. We show that although the asymptotic variance of the second-step GMM estimator may not have a closed form expression, it can be well approximated by sieve variances that have simple closed form expressions. We present consistent or robust variance estimation, Wald tests and Hansen’s (1982) over-identification tests for the second step GMM that properly reflect the first-step estimated functions and the weak dependence of the data. Our sieve semiparametric two-step GMM inference procedures are shown to be numerically equivalent to the ones computed as if the first step were parametric. A new consistent random-perturbation estimator of the derivative of the expectation of the non-smooth moment function is also provided.

Abstract

In models defined by unconditional moment restrictions, specification tests are possible and estimators can be ranked in terms of efficiency whenever the number of moment restrictions exceeds the number of parameters. We show that a similar relationship between potential refutability of a model and semiparametric efficiency is present in a much broader class of settings. Formally, we show a condition we name local overidentification is required for both specification tests to have power against local alternatives and for the existence of both efficient and inefficient estimators of regular parameters. Our results immediately imply semiparametric conditional moment restriction models are typically locally overidentified, and hence their proper specification is locally testable. We further study nonparametric conditional moment restriction models and obtain a simple characterization of local overidentification in that context. As a result, we are able to determine when nonparametric conditional moment restriction models are locally testable, and when plug-in and two stage estimators of regular parameters are semiparametrically efficient.

Abstract

In the unconditional moment restriction model of Hansen (1982), specification tests and more efficient estimators are both available whenever the number of moment restrictions exceeds the number of parameters of interest. We show a similar relationship between potential refutability of a model and existence of more efficient estimators is present in much broader settings. Specifically, a condition we name local overidentification is shown to be equivalent to both the existence of specification tests with nontrivial local power and the existence of more efficient estimators of some “smooth” parameters in general semi/nonparametric models. Under our notion of local overidentification, various locally nontrivial specification tests such as Hausman tests, incremental Sargan tests (or optimally weighted quasi-likelihood ratio tests) naturally extend to general semi/nonparametric settings. We further obtain simple characterizations of local overidentification for general models of nonparametric conditional moment restrictions with possibly different conditioning sets. The results are applied to determining when semi/nonparametric models with endogeneity are locally testable, and when nonparametric plug-in and semiparametric two-step GMM estimators are semiparametrically efficient. Examples of empirically relevant semi/nonparametric structural models are presented.

Abstract

We show that spline and wavelet series regression estimators for weakly dependent regressors attain the optimal uniform (i.e., sup-norm) convergence rate $(n/\log n)^{-p/(2p+d)}$ of Stone (1982), where d is the number of regressors and p is the smoothness of the regression function. The optimal rate is achieved even for heavy-tailed martingale difference errors with finite (2 + (d/p))th absolute moment for d/p < 2. We also establish the asymptotic normality of t statistics for possibly nonlinear, irregular functionals of the conditional mean function under weak conditions. The results are proved by deriving a new exponential inequality for sums of weakly dependent random matrices, which is of independent interest.
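
As a worked instance of the rate and the moment condition stated in this abstract, take one regressor and a twice-smooth regression function. The values d = 1 and p = 2 are chosen here purely for illustration and do not come from the paper.

```latex
\[
\left.\left(\frac{n}{\log n}\right)^{-p/(2p+d)}\right|_{d=1,\;p=2}
= \left(\frac{n}{\log n}\right)^{-2/5},
\qquad
2 + \frac{d}{p} = 2.5,
\qquad
\frac{d}{p} = \tfrac{1}{2} < 2 .
\]
```

In this case the condition d/p < 2 holds, and the errors would need a finite 2.5th absolute moment for the stated rate to apply.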