Source author record

Clifford M. Hurvich

Clifford M. Hurvich appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

math.ST, Statistics Theory, Methodology, Applications, math.PR

Catalog footprint

What is connected

6 works

5 topics

4 close collaborators

preprint, 2021, arXiv

On the Use of Information Criteria for Subset Selection in Least Squares Regression

Least squares (LS)-based subset selection methods are popular in linear regression modeling. Best subset selection (BS) is known to be NP-hard and has a computational cost that grows exponentially with the number of predictors. Recently, Bertsimas et al. (2016) formulated BS as a mixed integer optimization (MIO) problem and largely reduced the computational overhead by using a well-developed optimization solver, but the current methodology is not scalable to very large datasets. In this paper, we propose a novel LS-based method, the best orthogonalized subset selection (BOSS) method, which performs BS upon an orthogonalized basis of ordered predictors and scales easily to large problem sizes. Another challenge in applying LS-based methods in practice is the selection rule to choose the optimal subset size k. Cross-validation (CV) requires fitting a procedure multiple times, and results in a selected k that is random across repeated applications to the same dataset. Compared to CV, information criteria only require fitting a procedure once, but they require knowledge of the effective degrees of freedom for the fitting procedure, which is generally not available analytically for complex methods. Since BOSS uses orthogonalized predictors, we first explore a connection for orthogonal non-random predictors between BS and its Lagrangian formulation (i.e., minimization of the residual sum of squares plus the product of a regularization parameter and k), and based on this connection propose a heuristic degrees of freedom (hdf) for BOSS that can be estimated via an analytically-based expression. We show in both simulations and real data analysis that BOSS using a proposed Kullback-Leibler based information criterion AICc-hdf has the strongest performance of all of the LS-based methods considered and is competitive with regularization methods, with the computational effort of a single ordinary LS fit.
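The link between BS and its Lagrangian formulation mentioned in this abstract is easy to see when the predictors are orthonormal. The sketch below is an illustration only, not the authors' BOSS implementation. It assumes the response has already been projected onto an orthonormal basis, giving coefficients `z`; the function names and the numbers are hypothetical.

```python
def best_subset_orthonormal(z, k):
    """Indices of the best size-k subset when predictors are orthonormal.

    With orthonormal columns, RSS(S) = ||y||^2 - sum_{j in S} z_j^2,
    so the best subset of size k keeps the k largest |z_j|.
    """
    order = sorted(range(len(z)), key=lambda j: abs(z[j]), reverse=True)
    return sorted(order[:k])


def lagrangian_subset_orthonormal(z, lam):
    """Minimizer of RSS + lam * k: keep z_j exactly when z_j^2 > lam."""
    return [j for j, zj in enumerate(z) if zj * zj > lam]


def rss(y_sq_norm, z, subset):
    """Residual sum of squares of the LS fit on the chosen orthonormal columns."""
    return y_sq_norm - sum(z[j] ** 2 for j in subset)


z = [3.0, -0.5, 2.0, 0.1]   # hypothetical projections of y onto the basis
y_sq_norm = 14.0            # hypothetical ||y||^2 (at least the sum of z_j^2)

print(best_subset_orthonormal(z, 2))          # [0, 2]
print(lagrangian_subset_orthonormal(z, 1.0))  # [0, 2]
print(rss(y_sq_norm, z, [0, 2]))              # 1.0
```

In this toy case every penalty value between 0.25 and 4 selects the same size-2 subset, so each subset size k corresponds to an interval of regularization parameters rather than a single value.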

preprint, 2020, arXiv

Selection of Regression Models under Linear Restrictions for Fixed and Random Designs

Many important modeling tasks in linear regression, including variable selection (in which slopes of some predictors are set equal to zero) and simplified models based on sums or differences of predictors (in which slopes of those predictors are set equal to each other, or the negative of each other, respectively), can be viewed as being based on imposing linear restrictions on regression parameters. In this paper, we discuss how such models can be compared using information criteria designed to estimate predictive measures like squared error and Kullback-Leibler (KL) discrepancy, in the presence of either deterministic predictors (fixed-X) or random predictors (random-X). We extend the justifications for existing fixed-X criteria Cp, FPE and AICc, and random-X criteria Sp and RCp, to general linear restrictions. We further propose and justify a KL-based criterion, RAICc, under random-X for variable selection and general linear restrictions. We show in simulations that the use of the KL-based criteria AICc and RAICc results in better predictive performance and sparser solutions than the use of squared error-based criteria, including cross-validation.
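As a rough illustration of comparing a full model with a restricted one by a KL-based criterion, the sketch below uses the commonly quoted fixed-X form of AICc for Gaussian linear regression, n log(RSS/n) + n(n + df)/(n - df - 2). Treating df as the number of free coefficients (the coefficient count minus the number of independent linear restrictions) is an assumption made here, not a formula taken from the abstract, and the RSS values are made up.

```python
import math


def aicc(rss, n, df):
    """AICc for a Gaussian linear model with df free coefficients (assumed form)."""
    if n - df - 2 <= 0:
        raise ValueError("need n > df + 2")
    return n * math.log(rss / n) + n * (n + df) / (n - df - 2)


n = 30
# Hypothetical fits: a full model with 5 coefficients, and a restricted
# model that imposes 2 linear restrictions on it (3 free coefficients).
full = aicc(rss=40.0, n=n, df=5)
restricted = aicc(rss=42.0, n=n, df=3)

print(restricted < full)  # True
```

Here the restricted model fits slightly worse but is preferred, because the drop in the penalty term outweighs the small increase in RSS.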

preprint, 2013, arXiv

Limit Laws in Transaction-Level Asset Price Models

We consider pure-jump transaction-level models for asset prices in continuous time, driven by point processes. In a bivariate model that admits cointegration, we allow for time deformations to account for such effects as intraday seasonal patterns in volatility, and non-trading periods that may be different for the two assets. We also allow for asymmetries (leverage effects). We obtain the asymptotic distribution of the log-price process. We also obtain the asymptotic distribution of the ordinary least-squares estimator of the cointegrating parameter based on data sampled from an equally-spaced discretization of calendar time, in the case of weak fractional cointegration. For this same case, we obtain the asymptotic distribution for a tapered estimator under more

preprint, 2007, arXiv

Asymptotics for Duration-Driven Long Range Dependent Processes

We consider processes with second order long range dependence resulting from heavy tailed durations. We refer to this phenomenon as duration-driven long range dependence (DDLRD), as opposed to the more widely studied linear long range dependence based on fractional differencing of an $iid$ process. We consider in detail two specific processes having DDLRD, originally presented in Taqqu and Levy (1986), and Parke (1999). For these processes, we obtain the limiting distribution of suitably standardized discrete Fourier transforms (DFTs) and sample autocovariances. At low frequencies, the standardized DFTs converge to a stable law, as do the standardized sample autocovariances at fixed lags. Finite collections of standardized sample autocovariances at a fixed set of lags converge to a degenerate distribution. The standardized DFTs at high frequencies converge to a Gaussian law. Our asymptotic results are strikingly similar for the two DDLRD processes studied. We calibrate our asymptotic results with a simulation study which also investigates the properties of the semiparametric log periodogram regression estimator of the memory parameter.
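The log periodogram regression estimator mentioned at the end of this abstract can be sketched in a few lines. This is a minimal standard-library version, not the simulation code from the paper. It regresses log I(λ_j) on -2 log λ_j over the first m Fourier frequencies; another common variant uses -2 log|2 sin(λ_j/2)| as the regressor. The function names are hypothetical.

```python
import cmath
import math


def periodogram(x, m):
    """Periodogram at Fourier frequencies lambda_j = 2*pi*j/n, j = 1..m."""
    n = len(x)
    freqs, values = [], []
    for j in range(1, m + 1):
        lam = 2.0 * math.pi * j / n
        dft = sum(xt * cmath.exp(-1j * lam * t) for t, xt in enumerate(x))
        freqs.append(lam)
        values.append(abs(dft) ** 2 / (2.0 * math.pi * n))
    return freqs, values


def log_periodogram_d(freqs, values):
    """OLS slope of log I(lambda_j) on -2*log(lambda_j): an estimate of d."""
    xs = [-2.0 * math.log(lam) for lam in freqs]
    ys = [math.log(v) for v in values]
    xbar = sum(xs) / len(xs)
    ybar = sum(ys) / len(ys)
    sxy = sum((a - xbar) * (b - ybar) for a, b in zip(xs, ys))
    sxx = sum((a - xbar) ** 2 for a in xs)
    return sxy / sxx


# Sanity check on an exact power law I(lambda) = lambda**(-2d) with d = 0.3.
check_freqs = [2.0 * math.pi * j / 256 for j in range(1, 17)]
check_values = [lam ** (-0.6) for lam in check_freqs]
print(round(log_periodogram_d(check_freqs, check_values), 6))  # 0.3
```

With real data one would call `periodogram(x, m)` first and pass its output to `log_periodogram_d`; the choice of m is a bandwidth decision that this sketch leaves to the user.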

preprint, 2007, arXiv

Semiparametric estimation of fractional cointegrating subspaces

We consider a common-components model for multivariate fractional cointegration, in which the $s\geq1$ components have different memory parameters. The cointegrating rank may exceed 1. We decompose the true cointegrating vectors into orthogonal fractional cointegrating subspaces such that vectors from distinct subspaces yield cointegrating errors with distinct memory parameters. We estimate each cointegrating subspace separately, using appropriate sets of eigenvectors of an averaged periodogram matrix of tapered, differenced observations, based on the first $m$ Fourier frequencies, with $m$ fixed. The angle between the true and estimated cointegrating subspaces is $o_p(1)$. We use the cointegrating residuals corresponding to an estimated cointegrating vector to obtain a consistent and asymptotically normal estimate of the memory parameter for the given cointegrating subspace, using a univariate Gaussian semiparametric estimator with a bandwidth that tends to $\infty$ more slowly than $n$. We use these estimates to test for fractional cointegration and to consistently identify the cointegrating subspaces.

preprint, 2006, arXiv

Propagation of Memory Parameter from Durations to Counts

We establish sufficient conditions on durations that are stationary with finite variance and memory parameter $d \in [0,1/2)$ to ensure that the corresponding counting process $N(t)$ satisfies $\operatorname{Var} N(t) \sim C t^{2d+1}$ ($C>0$) as $t \to \infty$, with the same memory parameter $d \in [0,1/2)$ that was assumed for the durations. Thus, these conditions ensure that the memory in durations propagates to the same memory parameter in counts and therefore in realized volatility. We then show that any Autoregressive Conditional Duration ACD(1,1) model with a sufficient number of finite moments yields short memory in counts, while any Long Memory Stochastic Duration model with $d>0$ and all finite moments yields long memory in counts, with the same $d$.
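The variance scaling in this abstract suggests a simple diagnostic: on a log-log scale, Var N(t) against t has slope 2d + 1 for large t, so d can be read off as (slope - 1)/2. The sketch below is an illustration under that assumption, not a procedure from the paper. The function name is hypothetical and the input variances are generated from an exact power law rather than from data.

```python
import math


def memory_from_count_variance(ts, variances):
    """Estimate d from Var N(t) ~ C * t**(2d + 1) via a log-log OLS slope."""
    xs = [math.log(t) for t in ts]
    ys = [math.log(v) for v in variances]
    xbar = sum(xs) / len(xs)
    ybar = sum(ys) / len(ys)
    sxy = sum((a - xbar) * (b - ybar) for a, b in zip(xs, ys))
    sxx = sum((a - xbar) ** 2 for a in xs)
    slope = sxy / sxx
    return (slope - 1.0) / 2.0


ts = [10.0, 100.0, 1000.0, 10000.0]
variances = [2.5 * t ** 1.5 for t in ts]  # exact power law with d = 0.25

print(round(memory_from_count_variance(ts, variances), 6))  # 0.25
```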