Missing data for return predictors is a common problem in cross sectional asset pricing. Most papers do not explicitly discuss how they deal with missing data but conventional treatments focus on the subset of firms with no missing data for any predictor or impute the unconditional mean. Both methods have undesirable properties – they are either inefficient or lead to biased estimators and incorrect inference. We propose a simple and computationally attractive alternative using conditional mean imputations and weighted least squares, cast in a generalized method of moments (GMM) framework. This method allows us to use all observations with observed returns, it results in valid inference, and it can be applied in non-linear and high-dimensional settings. In Monte Carlo simulations, we find that it performs almost as well as the efficient but computationally costly GMM estimator in many cases. We apply our procedure to a large panel of return predictors and find that it leads to improved out-of-sample predictability.

More on this topic

BFI Working Paper·Feb 20, 2025

Non est Disputandum de Generalizability? A Glimpse into The External Validity Trial

John List
Topics: Uncategorized
BFI Working Paper·Feb 18, 2025

How Costly Are Business Cycle Volatility and Inflation? A Vox Populi Approach

Dimitris Georgarakos, Kwang Hwan Kim, Olivier Coibion, Myungkyu Shim, Myunghwan Andrew Lee, Yuriy Gorodnichenko, Geoff Kenny, Seowoo Han, and Michael Weber
Topics: Uncategorized
BFI Working Paper·Feb 14, 2025

Decisions Under Risk are Decisions Under Complexity: Comment

Daniel Banki, Uri Simonsohn, Robert Walatka, and George Wu
Topics: Uncategorized