Researcher profile

Matt Challacombe

Matt Challacombe contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
9works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2015arXiv

A $N$-Body Solver for Square Root Iteration

We develop the Sparse Approximate Matrix Multiply ($\tt SpAMM$) $n$-body solver for first order Newton Schulz iteration of the matrix square root and inverse square root. The solver performs recursive two-sided metric queries on a modified Cauchy-Schwarz criterion, culling negligible sub-volumes of the product-tensor for problems with structured decay in the sub-space metric. These sub-structures are shown to bound the relative error in the matrix-matrix product, and in favorable cases, to enjoy a reduced computational complexity governed by dimensionality reduction of the product volume. A main contribution is demonstration of a new, algebraic locality that develops under contractive identity iteration, with collapse of the metric-subspace onto the identity's plane diagonal, resulting in a stronger $\tt SpAMM$ bound. Also, we carry out a first order {Fréchet} analyses for single and dual channel instances of the square root iteration, and look at bifurcations due to ill-conditioning and a too aggressive $\tt SpAMM$ approximation. Then, we show that extreme $\tt SpAMM$ approximation and contractive identity iteration can be achieved for ill-conditioned systems through regularization, and we demonstrate the potential for acceleration with a scoping, product representation of the inverse factor.

preprint2015arXiv

A N-Body Solver for Free Mesh Interpolation

Factorization of the Gaussian RBF kernel is developed for free-mesh interpolation in the flat, polynomial limit corresponding to Taylor expansion and the Vandermonde basis of geometric moments. With this spectral approximation, a top-down octree-scoping of an interpolant is found by recursively decomposing the residual, similar to the work of Driscoll and Heryudono (2007), except that in the current approach the grid is decoupled from the low rank approximation, allowing partial separation of sampling errors (the mesh) from representation errors (the polynomial order). Then, it is possible to demonstrate roughly 5 orders of magnitude improvement in free-mesh interpolation errors for the three-dimensional Franke function, relative to previous benchmarks. As in related work on $N$-body methods for factorization by square root iteration (Challacombe 2015), some emphasis is placed on resolution of the identity.

preprint2015arXiv

Solvers for $\mathcal{O} (N)$ Electronic Structure in the Strong Scaling Limit

We present a hybrid OpenMP/Charm++ framework for solving the $\mathcal{O} (N)$ Self-Consistent-Field eigenvalue problem with parallelism in the strong scaling regime, $P\gg{N}$, where $P$ is the number of cores, and $N$ a measure of system size, i.e. the number of matrix rows/columns, basis functions, atoms, molecules, etc. This result is achieved with a nested approach to Spectral Projection and the Sparse Approximate Matrix Multiply [Bock and Challacombe, SIAM J.~Sci.~Comput. 35 C72, 2013], and involves a recursive, task-parallel algorithm, often employed by generalized $N$-Body solvers, to occlusion and culling of negligible products in the case of matrices with decay. Employing classic technologies associated with generalized $N$-Body solvers, including over-decomposition, recursive task parallelism, orderings that preserve locality, and persistence-based load balancing, we obtain scaling beyond hundreds of cores per molecule for small water clusters ([H${}_2$O]${}_N$, $N \in \{ 30, 90, 150 \}$, $P/N \approx \{ 819, 273, 164 \}$) and find support for an increasingly strong scalability with increasing system size $N$.

preprint2014arXiv

An N-Body Solution to the Problem of Fock Exchange

We report an N-Body approach to computing the Fock exchange matrix with and without permutational symmetry. The method achieves an O(N lg N) computational complexity through an embedded metric-query, allowing hierarchical application of direct SCF criteria. The advantages of permutational symmetry are found to be 4-fold for small systems, but decreasing with increasing system size and/or more permissive neglect criteria. This work sets the stage for: (1) the introduction of range queries in multi-level multipole schemes for rank reduction, and (2) recursive task parallelism.

preprint2012arXiv

An Optimized Sparse Approximate Matrix Multiply for Matrices with Decay

We present an optimized single-precision implementation of the Sparse Approximate Matrix Multiply (\SpAMM{}) [M. Challacombe and N. Bock, arXiv {\bf 1011.3534} (2010)], a fast algorithm for matrix-matrix multiplication for matrices with decay that achieves an $\mathcal{O} (n \log n)$ computational complexity with respect to matrix dimension $n$. We find that the max norm of the error achieved with a \SpAMM{} tolerance below $2 \times 10^{-8}$ is lower than that of the single-precision {\tt SGEMM} for dense quantum chemical matrices, while outperforming {\tt SGEMM} with a cross-over already for small matrices ($n \sim 1000$). Relative to naive implementations of \SpAMM{} using Intel's Math Kernel Library ({\tt MKL}) or AMD's Core Math Library ({\tt ACML}), our optimized version is found to be significantly faster. Detailed performance comparisons are made for quantum chemical matrices with differently structured sub-blocks. Finally, we discuss the potential of improved hardware prefetch to yield 2--3x speedups.

preprint2010arXiv

Fast Multiplication of Matrices with Decay

A fast algorithm for the approximate multiplication of matrices with decay is introduced; the Sparse Approximate Matrix Multiply (SpAMM) reduces complexity in the product space, a different approach from current methods that economize within the matrix space through truncation or rank reduction. Matrix truncation (element dropping) is compared to SpAMM for quantum chemical matrices with approximate exponential and algebraic decay. For matched errors in the electronic total energy, SpAMM is found to require fewer to far fewer floating point operations relative to dropping. The challenges and opportunities afforded by this new approach are discussed, including the potential for high performance implementations.

preprint2010arXiv

Linear Scaling Solution of the Time-Dependent Self-Consistent-Field Equations

A new approach to solving the Time-Dependent Self-Consistent-Field equations is developed based on the double quotient formulation of Tsiper [J. Phys. B, 34 L401 (2001)]. Dual channel, quasi-independent non-linear optimization of these quotients is found to yield convergence rates approaching those of the best case (single channel) Tamm-Dancoff approximation. This formulation is variational with respect to matrix truncation, admitting linear scaling solution of the matrix-eigenvalue problem, which is demonstrated for bulk excitons in the polyphenylene vinylene oligimer and the (4,3) carbon nanotube segment.

preprint2008arXiv

Molecular-orbital-free algorithm for excited states in time-dependent perturbation theory

A non-linear conjugate gradient optimization scheme is used to obtain excitation energies within the Random Phase Approximation (RPA). The solutions to the RPA eigenvalue equation are located through a variational characterization using a modified Thouless functional, which is based upon an asymmetric Rayleigh quotient, in an orthogonalized atomic orbital representation. In this way, the computational bottleneck of calculating molecular orbitals is avoided. The variational space is reduced to the physically-relevant transitions by projections. The feasibility of an RPA implementation scaling linearly with system size, N, is investigated by monitoring convergence behavior with respect to the quality of initial guess and sensitivity to noise under thresholding, both for well- and ill-conditioned problems. The molecular- orbital-free algorithm is found to be robust and computationally efficient providing a first step toward a large-scale, reduced complexity calculation of time-dependent optical properties and linear response. The algorithm is extensible to other forms of time-dependent perturbation theory including, but not limited to, time-dependent Density Functional theory.

preprint2004arXiv

Linear scaling computation of the Fock matrix. VIII. Periodic boundaries for exact exchange at the $Γ$-point

A translationally invariant formulation of the Hartree-Fock (HF) $Γ$-point approximation is presented. This formulation is achieved through introduction of the Minimum Image Convention (MIC) at the level of primitive two-electron integrals, and implemented in a periodic version of the ONX algorithm [J. Chem. Phys, {\bf 106} 9708 (1997)] for linear scaling computation of the exchange matrix. Convergence of the HF-MIC $Γ$-point model to the HF ${\bf k}$-space limit is demonstrated for fully periodic magnesium oxide, ice and diamond. Computation of the diamond lattice constant using the HF-MIC model together with the hybrid PBE0 density functional [Theochem, {\bf 493} 145 (1999)] yields $a_0=3.569$Åwith the 6-21G* basis set and a $3\times3\times3$ supercell. Linear scaling computation of the HF-MIC exchange matrix is demonstrated for diamond and ice in the condensed phase