Volume no :14, Issue no: 1, September (2015)

CONSISTENT VARIABLE SELECTION IN LARGE REGRESSION MODELS

Author's: Erhard Reschenhofer
Pages: [49] - [67]
Received Date: September 1, 2015
Submitted by:
DOI: http://dx.doi.org/10.18642/jsata_7100121536

Abstract

In this paper, conditions for the consistent selection of a subset from a large set of potential regressors are derived. It is assumed that the number of potential regressors increases as the sample size increases and, in addition, that the regressors are orthogonal. Subset-selection criteria are proposed which satisfy these conditions. These criteria do not depend on any tuning parameters. It is also shown that some other criteria, which include AIC and BIC, violate these conditions. Simulation studies with different sample sizes and large sets of orthogonal regressors are conducted to compare the performance of the new criteria with that of conventional model-selection criteria. The results of these simulation studies corroborate the theoretical findings. In large samples, the consistent criteria always make the correct decisions. They include all genuine regressors and exclude the others. In contrast, AIC tends to select always the maximum number of regressors and BIC is also not competitive when the number of potential regressors grows too fast.

Keywords

model selection criteria, asymptotic properties, orthogonal regressors, order statistics.