Researcher profile

Donald St. P. Richards

Donald St. P. Richards contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2014arXiv

Interpreting the Distance Correlation Results for the COMBO-17 Survey

The accurate classification of galaxies in large-sample astrophysical databases of galaxy clusters depends sensitively on the ability to distinguish between morphological types, especially at higher redshifts. This capability can be enhanced through a new statistical measure of association and correlation, called the {\it distance correlation coefficient}, which has more statistical power to detect associations than does the classical Pearson measure of linear relationships between two variables. The distance correlation measure offers a more precise alternative to the classical measure since it is capable of detecting nonlinear relationships that may appear in astrophysical applications. We showed recently that the comparison between the distance and Pearson correlation coefficients can be used effectively to isolate potential outliers in various galaxy datasets, and this comparison has the ability to confirm the level of accuracy associated with the data. In this work, we elucidate the advantages of distance correlation when applied to large databases. We illustrate how the distance correlation measure can be used effectively as a tool to confirm nonlinear relationships between various variables in the COMBO-17 database, including the lengths of the major and minor axes, and the alternative redshift distribution. For these outlier pairs, the distance correlation coefficient is routinely higher than the Pearson coefficient since it is easier to detect nonlinear relationships with distance correlation. The V-shaped scatterplots of Pearson versus distance correlation coefficients also reveal the patterns with increasing redshift and the contributions of different galaxy types within each redshift range.

preprint2014arXiv

Kurtosis Tests for Multivariate Normality with Monotone Incomplete Data

We consider the problem of testing multivariate normality when the data consists of a random sample of two-step monotone incomplete observations. We define for such data a generalization of Mardia's statistic for measuring kurtosis, derive the asymptotic non-null distribution of the statistic under certain regularity conditions and against a broad class of alternatives, and give an application to a well-known data set on cholesterol measurements.

preprint2014arXiv

Schur Complement Based Analysis of MIMO Zero-Forcing for Rician Fading

For multiple-input/multiple-output (MIMO) spatial multiplexing with zero-forcing detection (ZF), signal-to-noise ratio (SNR) analysis for Rician fading involves the cumbersome noncentral-Wishart distribution (NCWD) of the transmit sample-correlation (Gramian) matrix. An \textsl{approximation} with a \textsl{virtual} CWD previously yielded for the ZF SNR an approximate (virtual) Gamma distribution. However, analytical conditions qualifying the accuracy of the SNR-distribution approximation were unknown. Therefore, we have been attempting to exactly characterize ZF SNR for Rician fading. Our previous attempts succeeded only for the sole Rician-fading stream under Rician--Rayleigh fading, by writing it as scalar Schur complement (SC) in the Gramian. Herein, we pursue a more general, matrix-SC-based analysis to characterize SNRs when several streams may undergo Rician fading. On one hand, for full-Rician fading, the SC distribution is found to be exactly a CWD if and only if a channel-mean--correlation \textsl{condition} holds. Interestingly, this CWD then coincides with the \textsl{virtual} CWD ensuing from the \textsl{approximation}. Thus, under the \textsl{condition}, the actual and virtual SNR-distributions coincide. On the other hand, for Rician--Rayleigh fading, the matrix-SC distribution is characterized in terms of determinant of matrix with elementary-function entries, which also yields a new characterization of the ZF SNR. Average error probability results validate our analysis vs.~simulation.

preprint2013arXiv

Distance Correlation Methods for Discovering Associations in Large Astrophysical Databases

High-dimensional, large-sample astrophysical databases of galaxy clusters, such as the Chandra Deep Field South COMBO-17 database, provide measurements on many variables for thousands of galaxies and a range of redshifts. Current understanding of galaxy formation and evolution rests sensitively on relationships between different astrophysical variables; hence an ability to detect and verify associations or correlations between variables is important in astrophysical research. In this paper, we apply a recently defined statistical measure called the distance correlation coefficient which can be used to identify new associations and correlations between astrophysical variables. The distance correlation coefficient applies to variables of any dimension; it can be used to determine smaller sets of variables that provide equivalent astrophysical information; it is zero only when variables are independent; and it is capable of detecting nonlinear associations that are undetectable by the classical Pearson correlation coefficient. Hence, the distance correlation coefficient provides more information than the Pearson coefficient. We analyze numerous pairs of variables in the COMBO-17 database with the distance correlation method and with the maximal information coefficient. We show that the Pearson coefficient can be estimated with higher accuracy from the corresponding distance correlation coefficient than from the maximal information coefficient. For given values of the Pearson coefficient, the distance correlation method has a greater ability than the maximal information coefficient to resolve astrophysical data into highly concentrated V-shapes, which enhances classification and pattern identification. These results are observed over a range of redshifts beyond the local universe and for galaxies from elliptical to spiral.

preprint2013arXiv

Long-term Variability in the Length of the Solar Cycle

The recent paucity of sunspots and the delay in the expected start of Solar Cycle 24 have drawn attention to the challenges involved in predicting solar activity. Traditional models of the solar cycle usually require information about the starting time and rise time as well as the shape and amplitude of the cycle. With this tutorial, we investigate the variations in the length of the sunspot number cycle and examine whether the variability can be explained in terms of a secular pattern. We identified long-term cycles in archival data from 1610 - 2000 using median trace analyses of the cycle length and power spectrum analyses of the (O-C) residuals of the dates of sunspot minima and maxima. Median trace analyses of data spanning 385 years indicate a cycle length with a period of 183 - 243 years, and a power spectrum analysis identifies a period of 188 $\pm$ 38 years. We also find a correspondence between the times of historic minima and the length of the sunspot cycle, such that the cycle length increases during the time when the number of spots is at a minimum. In particular, the cycle length was growing during the Maunder Minimum when almost no sunspots were visible on the Sun. Our study suggests that the length of the sunspot number cycle should increase gradually, on average, over the next $\sim$75 years, accompanied by a gradual decrease in the number of sunspots. This information should be considered in cycle prediction models to provide better estimates of the starting time of each cycle.

preprint2004arXiv

Algebraic methods toward higher-order probability inequalities, II

Let (L,\preccurlyeq) be a finite distributive lattice, and suppose that the functions f_1,f_2:L\to R are monotone increasing with respect to the partial order \preccurlyeq. Given μa probability measure on L, denote by E(f_i) the average of f_i over L with respect to μ, i=1,2. Then the FKG inequality provides a condition on the measure μunder which the covariance, Cov(f_1,f_2):=E(f_1f_2)-E(f_1)E(f_2), is nonnegative. In this paper we derive a ``third-order'' generalization of the FKG inequality. We also establish fourth- and fifth-order generalizations of the FKG inequality and formulate a conjecture for a general mth-order generalization. For functions and measures on R^n we establish these inequalities by extending the method of diffusion processes. We provide several applications of the third-order inequality, generalizing earlier applications of the FKG inequality. Finally, we remark on some connections between the theory of total positivity and the existence of inequalities of FKG-type within the context of Riemannian manifolds.