Source author record

Matthew Harding

Matthew Harding appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

econ.EM Machine Learning math.ST Methodology Statistics Theory

Catalog footprint

What is connected

4works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Estimation of a Factor-Augmented Linear Model with Applications Using Student Achievement Data

In many longitudinal settings, economic theory does not guide practitioners on the type of restrictions that must be imposed to solve the rotational indeterminacy of factor-augmented linear models. We study this problem and offer several novel results on identification using internally generated instruments. We propose a new class of estimators and establish large sample results using recent developments on clustered samples and high-dimensional models. We carry out simulation studies which show that the proposed approaches improve the performance of existing methods on the estimation of unknown factors. Lastly, we consider three empirical applications using administrative data of students clustered in different subjects in elementary school, high school and college.

preprint2022arXiv

Managers versus Machines: Do Algorithms Replicate Human Intuition in Credit Ratings?

We use machine learning techniques to investigate whether it is possible to replicate the behavior of bank managers who assess the risk of commercial loans made by a large commercial US bank. Even though a typical bank already relies on an algorithmic scorecard process to evaluate risk, bank managers are given significant latitude in adjusting the risk score in order to account for other holistic factors based on their intuition and experience. We show that it is possible to find machine learning algorithms that can replicate the behavior of the bank managers. The input to the algorithms consists of a combination of standard financials and soft information available to bank managers as part of the typical loan review process. We also document the presence of significant heterogeneity in the adjustment process that can be traced to differences across managers and industries. Our results highlight the effectiveness of machine learning based analytic approaches to banking and the potential challenges to high-skill jobs in the financial sector.

preprint2015arXiv

Scalable Bayesian Non-Negative Tensor Factorization for Massive Count Data

We present a Bayesian non-negative tensor factorization model for count-valued tensor data, and develop scalable inference algorithms (both batch and online) for dealing with massive tensors. Our generative model can handle overdispersed counts as well as infer the rank of the decomposition. Moreover, leveraging a reparameterization of the Poisson distribution as a multinomial facilitates conjugacy in the model and enables simple and efficient Gibbs sampling and variational Bayes (VB) inference updates, with a computational cost that only depends on the number of nonzeros in the tensor. The model also provides a nice interpretability for the factors; in our model, each factor corresponds to a "topic". We develop a set of online inference algorithms that allow further scaling up the model to massive tensors, for which batch inference methods may be infeasible. We apply our framework on diverse real-world applications, such as \emph{multiway} topic modeling on a scientific publications database, analyzing a political science data set, and analyzing a massive household transactions data set.

preprint2015arXiv

Strong limit of the extreme eigenvalues of a symmetrized auto-cross covariance matrix

The auto-cross covariance matrix is defined as \[\mathbf{M}_n=\frac{1} {2T}\sum_{j=1}^T\bigl(\mathbf{e}_j\mathbf{e}_{j+τ}^*+\mathbf{e}_{j+ τ}\mathbf{e}_j^*\bigr),\] where $\mathbf{e}_j$'s are $n$-dimensional vectors of independent standard complex components with a common mean 0, variance $σ^2$, and uniformly bounded $2+η$th moments and $τ$ is the lag. Jin et al. [Ann. Appl. Probab. 24 (2014) 1199-1225] has proved that the LSD of $\mathbf{M}_n$ exists uniquely and nonrandomly, and independent of $τ$ for all $τ\ge 1$. And in addition they gave an analytic expression of the LSD. As a continuation of Jin et al. [Ann. Appl. Probab. 24 (2014) 1199-1225], this paper proved that under the condition of uniformly bounded fourth moments, in any closed interval outside the support of the LSD, with probability 1 there will be no eigenvalues of $\mathbf{M}_n$ for all large $n$. As a consequence of the main theorem, the limits of the largest and smallest eigenvalue of $\mathbf{M}_n$ are also obtained.

Matthew Harding

What is connected

Connect this record

See the researcher in context

Building this map preview

4 published item(s)

Estimation of a Factor-Augmented Linear Model with Applications Using Student Achievement Data

Managers versus Machines: Do Algorithms Replicate Human Intuition in Credit Ratings?

Scalable Bayesian Non-Negative Tensor Factorization for Massive Count Data

Strong limit of the extreme eigenvalues of a symmetrized auto-cross covariance matrix