Researcher profile

Sofia C. Olhede

Sofia C. Olhede contributes to research discovery and scholarly infrastructure.

Researcher; affiliation not imported; open to collaborate

Trust snapshot

Quick read

Trust 21 (Emerging); Verification L1; unclaimed author
12 works
0 followers
15 topics
4 close collaborators

Published work

12 published items

Preprint, 2022, arXiv

Edge coherence in multiplex networks

This paper introduces a nonparametric framework for the setting where multiple networks are observed on the same set of nodes, also known as multiplex networks. Our objective is to provide a simple parameterization which explicitly captures linear dependence between the different layers of networks. For non-Euclidean observations, such as shapes and graphs, the notion of "linear" must be defined appropriately. Taking inspiration from the representation of stochastic processes and the analogy of the multivariate spectral representation of a stochastic process with joint exchangeability of Bernoulli arrays, we introduce the notion of edge coherence as a measure of linear dependence in the graph limit space. Edge coherence is defined for pairs of edges from any two network layers and is the key novel parameter. We illustrate the utility of our approach by eliciting simple models such as a correlated stochastic blockmodel and a correlated inhomogeneous graph limit model.

Preprint, 2022, arXiv

Networks with Correlated Edge Processes

This article proposes methods to model nonstationary temporal graph processes. This corresponds to modelling the observation of edge variables (relationships between objects) indicating interactions between pairs of nodes (or objects) exhibiting dependence (correlation) and evolution in time over interactions. This article thus blends (integer) time series models with flexible static network models to produce models of temporal graph data, and statistical fitting procedures for time-varying interaction data. We illustrate the power of our proposed fitting method by analysing a hospital contact network, and this shows the high dimensional data challenge of modelling and inferring correlation between a large number of variables.

Preprint, 2022, arXiv

The Debiased Spatial Whittle Likelihood

We provide a computationally and statistically efficient method for estimating the parameters of a stochastic covariance model observed on a regular spatial grid in any number of dimensions. Our proposed method, which we call the Debiased Spatial Whittle likelihood, makes important corrections to the well-known Whittle likelihood to account for large sources of bias caused by boundary effects and aliasing. We generalise the approach to flexibly allow for significant volumes of missing data including those with lower-dimensional substructure, and for irregular sampling boundaries. We build a theoretical framework under relatively weak assumptions which ensures consistency and asymptotic normality in numerous practical settings including missing data and non-Gaussian processes. We also extend our consistency results to multivariate processes. We provide detailed implementation guidelines which ensure the estimation procedure can be conducted in O(n log n) operations, where n is the number of points of the encapsulating rectangular grid, thus keeping the computational scalability of Fourier and Whittle-based methods for large data sets. We validate our procedure over a range of simulated and real-world settings, and compare with state-of-the-art alternatives, demonstrating the enduring practical appeal of Fourier-based methods, provided they are corrected by the procedures developed in this paper.

Preprint, 2020, arXiv

Modeling Network Populations via Graph Distances

This article introduces a new class of models for multiple networks. The core idea is to parametrize a distribution on labelled graphs in terms of a Fréchet mean graph (which depends on a user-specified choice of metric or graph distance) and a parameter that controls the concentration of this distribution about its mean. Entropy is the natural parameter for such control, varying from a point mass concentrated on the Fréchet mean itself to a uniform distribution over all graphs on a given vertex set. We provide a hierarchical Bayesian approach for exploiting this construction, along with straightforward strategies for sampling from the resultant posterior distribution. We conclude by demonstrating the efficacy of our approach via simulation studies and two multiple-network data analysis examples: one drawn from systems biology and the other from neuroscience.

Preprint, 2013, arXiv

Degree-based network models

We derive the sampling properties of random networks based on weights whose pairwise products parameterize independent Bernoulli trials. This enables an understanding of many degree-based network models, in which the structure of realized networks is governed by properties of their degree sequences. We provide exact results and large-sample approximations for power-law networks and other more general forms. This enables us to quantify sampling variability both within and across network populations, and to characterize the limiting extremes of variation achievable through such models. Our results highlight that variation explained through expected degree structure need not be attributed to more complicated generative mechanisms.
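The construction described in this abstract, node weights whose pairwise products give independent Bernoulli edge probabilities, can be illustrated with a minimal sketch. The function names `sample_degree_based_network` and `expected_degrees` are hypothetical and are not taken from the paper; the sketch assumes an undirected graph without self-loops and weights scaled so that every product lies in [0, 1].

```python
import random


def sample_degree_based_network(weights, seed=None):
    """Sample an undirected graph where edge {i, j} is an independent
    Bernoulli trial with success probability weights[i] * weights[j].

    Assumption for this sketch: no self-loops, and every pairwise
    product must lie in [0, 1].
    """
    rng = random.Random(seed)
    n = len(weights)
    adj = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            p = weights[i] * weights[j]
            if not 0.0 <= p <= 1.0:
                raise ValueError(f"edge probability {p} is outside [0, 1]")
            if rng.random() < p:
                adj[i][j] = adj[j][i] = 1
    return adj


def expected_degrees(weights):
    """Expected degree of node i is w_i * (sum of all other weights)."""
    total = sum(weights)
    return [w * (total - w) for w in weights]


if __name__ == "__main__":
    w = [0.5, 0.5, 1.0]
    graph = sample_degree_based_network(w, seed=1)
    print("adjacency:", graph)
    print("expected degrees:", expected_degrees(w))
```

Because each edge is drawn independently, the expected degree of a node depends only on its own weight and the sum of the others, which is one way to see why degree structure alone can account for much of the variation in such models.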

Preprint, 2013, arXiv

Maximum-likelihood estimation of lithospheric flexural rigidity, initial-loading fraction, and load correlation, under isotropy

Topography and gravity are geophysical fields whose joint statistical structure derives from interface-loading processes modulated by the underlying mechanics of isostatic and flexural compensation in the shallow lithosphere. Under this dual statistical-mechanistic viewpoint an estimation problem can be formulated where the knowns are topography and gravity and the principal unknown the elastic flexural rigidity of the lithosphere. In the guise of an equivalent "effective elastic thickness", this important, geographically varying, structural parameter has been the subject of many interpretative studies, but precisely how well it is known or how best it can be found from the data, abundant nonetheless, has remained contentious and unresolved throughout the last few decades of dedicated study. The popular methods whereby admittance or coherence, both spectral measures of the relation between gravity and topography, are inverted for the flexural rigidity, have revealed themselves to have insufficient power to independently constrain both it and the additional unknown initial-loading fraction and load-correlation factors, respectively. Solving this extremely ill-posed inversion problem leads to non-uniqueness and is further complicated by practical considerations such as the choice of regularizing data tapers to render the analysis sufficiently selective both in the spatial and spectral domains. Here, we rewrite the problem in a form amenable to maximum-likelihood estimation theory, which we show yields unbiased, minimum-variance estimates of flexural rigidity, initial-loading fraction and load correlation, each of those separably resolved with little a posteriori correlation between their estimates. We are also able to separately characterize the isotropic spectral shape of the initial loading processes.

Preprint, 2013, arXiv

Nonparametric graphon estimation

We propose a nonparametric framework for the analysis of networks, based on a natural limit object termed a graphon. We prove consistency of graphon estimation under general conditions, giving rates which include the important practical setting of sparse networks. Our results cover dense and sparse stochastic blockmodels with a growing number of classes, under model misspecification. We use profile likelihood methods, and connect our results to approximation theory, nonparametric function estimation, and the theory of graph limits.

Preprint, 2012, arXiv

Covariance of Replicated Modulated Cyclical Time Series

This paper introduces the novel class of modulated cyclostationary processes, a class of non-stationary processes exhibiting frequency coupling, and proposes a method of their estimation from repeated trials. Cyclostationary processes also exhibit frequency correlation but have Loeve spectra whose support lies only on parallel lines in the dual-frequency plane. Such extremely sparse structure does not adequately represent many biological processes. Thus, we propose a model that, in the time domain, modulates the covariance of cyclostationary processes and consequently broadens their frequency support in the dual-frequency plane. The spectra and the cross-coherence of the proposed modulated cyclostationary process are first estimated using multitaper methods. A shrinkage procedure is then applied to each trial-specific estimate to reduce the estimation risk. Multiple trials of each series are observed. When combining information across trials, we carefully take into account the bias that may be introduced by phase misalignment and the fact that the Loeve spectra and cross-coherence across replicates may only be "similar" - but not necessarily identical - across replicates. The application of the inference methods developed for the modulated cyclostationary model to EEG data also demonstrates that the proposed model captures statistically significant cross-frequency interactions, that ought to be further examined by neuroscientists.

Preprint, 2012, arXiv

Order statistics of observed network degrees

This article discusses the properties of extremes of degree sequences calculated from network data. We introduce the notion of a normalized degree, in order to permit a comparison of degree sequences between networks with differing numbers of nodes. We model each normalized degree as a bounded continuous random variable, and determine the properties of the ordered k-maxima and minima of the normalized network degrees when they comprise a random sample from a Beta distribution. In this setting, their means and variances take a simplified form given by their ordering, and we discuss the relation of these quantities to other prescribed decays such as power laws. We verify the derived properties from simulated sets of normalized degrees, and discuss possible extensions to more flexible classes of distributions.
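As a rough illustration of the quantities named in this abstract, the sketch below computes normalized degrees and their k-minima and k-maxima, and estimates the mean of the k-th largest value of a Beta sample by simulation. It rests on assumptions not confirmed by the abstract: the normalized degree is taken to be degree divided by (n - 1), and the helper names are hypothetical. The paper's closed-form means and variances are not reproduced here.

```python
import random


def normalized_degrees(adj):
    """Degree of each node divided by n - 1 (assumed normalization),
    so values lie in [0, 1] for a simple undirected graph."""
    n = len(adj)
    if n < 2:
        raise ValueError("need at least two nodes")
    return [sum(row) / (n - 1) for row in adj]


def k_extremes(values, k):
    """Return (k smallest in ascending order, k largest in descending order)."""
    if not 1 <= k <= len(values):
        raise ValueError("k must be between 1 and len(values)")
    ordered = sorted(values)
    return ordered[:k], ordered[-k:][::-1]


def mean_kth_largest_beta(a, b, m, k, reps=20000, seed=0):
    """Monte Carlo estimate of the mean of the k-th largest value in an
    i.i.d. Beta(a, b) sample of size m."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(reps):
        sample = sorted((rng.betavariate(a, b) for _ in range(m)), reverse=True)
        total += sample[k - 1]
    return total / reps


if __name__ == "__main__":
    # Star graph on four nodes: node 0 is connected to every other node.
    star = [[0, 1, 1, 1], [1, 0, 0, 0], [1, 0, 0, 0], [1, 0, 0, 0]]
    nd = normalized_degrees(star)
    print("normalized degrees:", nd)
    print("1-minima, 1-maxima:", k_extremes(nd, 1))
    print("estimated mean of max, Beta(1, 1), m = 4:",
          mean_kth_largest_beta(1.0, 1.0, 4, 1))
```

In the uniform special case Beta(1, 1), the maximum of m draws has mean m / (m + 1), which gives a quick sanity check on the simulation.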

Preprint, 2011, arXiv

Bivariate Instantaneous Frequency and Bandwidth

The generalizations of instantaneous frequency and instantaneous bandwidth to a bivariate signal are derived. These are uniquely defined whether the signal is represented as a pair of real-valued signals, or as one analytic and one anti-analytic signal. A nonstationary but oscillatory bivariate signal has a natural representation as an ellipse whose properties evolve in time, and this representation provides a simple geometric interpretation for the bivariate instantaneous moments. The bivariate bandwidth is shown to consist of three terms measuring the degree of instability of the time-varying ellipse: amplitude modulation with fixed eccentricity, eccentricity modulation, and orientation modulation or precession. An application to the analysis of data from a free-drifting oceanographic float is presented and discussed.

Preprint, 2011, arXiv

Nonparametric tests of structure for high angular resolution diffusion imaging in Q-space

High angular resolution diffusion imaging data is the observed characteristic function for the local diffusion of water molecules in tissue. This data is used to infer structural information in brain imaging. Nonparametric scalar measures are proposed to summarize such data, and to locally characterize spatial features of the diffusion probability density function (PDF), relying on the geometry of the characteristic function. Summary statistics are defined so that their distributions are, to first-order, both independent of nuisance parameters and also analytically tractable. The dominant direction of the diffusion at a spatial location (voxel) is determined, and a new set of axes are introduced in Fourier space. Variation quantified in these axes determines the local spatial properties of the diffusion density. Nonparametric hypothesis tests for determining whether the diffusion is unimodal, isotropic or multi-modal are proposed. More subtle characteristics of white-matter microstructure, such as the degree of anisotropy of the PDF and symmetry compared with a variety of asymmetric PDF alternatives, may be ascertained directly in the Fourier domain without parametric assumptions on the form of the diffusion PDF. We simulate a set of diffusion processes and characterize their local properties using the newly introduced summaries. We show how complex white-matter structures across multiple voxels exhibit clear ellipsoidal and asymmetric structure in simulation, and assess the performance of the statistics in clinically-acquired magnetic resonance imaging data.

Preprint, 2011, arXiv

On the Analytic Wavelet Transform

An exact and general expression for the analytic wavelet transform of a real-valued signal is constructed, resolving the time-dependent effects of non-negligible amplitude and frequency modulation. The analytic signal is first locally represented as a modulated oscillation, demodulated by its own instantaneous frequency, and then Taylor-expanded at each point in time. The terms in this expansion, called the instantaneous modulation functions, are time-varying functions which quantify, at increasingly higher orders, the local departures of the signal from a uniform sinusoidal oscillation. Closed-form expressions for these functions are found in terms of Bell polynomials and derivatives of the signal's instantaneous frequency and bandwidth. The analytic wavelet transform is shown to depend upon the interaction between the signal's instantaneous modulation functions and frequency-domain derivatives of the wavelet, inducing a hierarchy of departures of the transform away from a perfect representation of the signal. The form of these deviation terms suggests a set of conditions for matching the wavelet properties to suit the variability of the signal, in which case our expressions simplify considerably. One may then quantify the time-varying bias associated with signal estimation via wavelet ridge analysis, and choose wavelets to minimize this bias.