Source author record

Sofia C. Olhede

Sofia C. Olhede appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Methodology math.ST Statistics Theory Applications math.CO physics.ao-ph Social and Information Networks Computation Machine Learning math.FA math.PR physics.data-an physics.geo-ph physics.med-ph physics.soc-ph stat.OT

Catalog footprint

What is connected

17works

16topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Edge coherence in multiplex networks

This paper introduces a nonparametric framework for the setting where multiple networks are observed on the same set of nodes, also known as multiplex networks. Our objective is to provide a simple parameterization which explicitly captures linear dependence between the different layers of networks. For non-Euclidean observations, such as shapes and graphs, the notion of "linear" must be defined appropriately. Taking inspiration from the representation of stochastic processes and the analogy of the multivariate spectral representation of a stochastic process with joint exchangeability of Bernoulli arrays, we introduce the notion of edge coherence as a measure of linear dependence in the graph limit space. Edge coherence is defined for pairs of edges from any two network layers and is the key novel parameter. We illustrate the utility of our approach by eliciting simple models such as a correlated stochastic blockmodel and a correlated inhomogeneous graph limit model.

preprint2022arXiv

Networks with Correlated Edge Processes

This article proposes methods to model nonstationary temporal graph processes. This corresponds to modelling the observation of edge variables (relationships between objects) indicating interactions between pairs of nodes (or objects) exhibiting dependence (correlation) and evolution in time over interactions. This article thus blends (integer) time series models with flexible static network models to produce models of temporal graph data, and statistical fitting procedures for time-varying interaction data. We illustrate the power of our proposed fitting method by analysing a hospital contact network, and this shows the high dimensional data challenge of modelling and inferring correlation between a large number of variables.

preprint2022arXiv

The Debiased Spatial Whittle Likelihood

We provide a computationally and statistically efficient method for estimating the parameters of a stochastic covariance model observed on a regular spatial grid in any number of dimensions. Our proposed method, which we call the Debiased Spatial Whittle likelihood, makes important corrections to the well-known Whittle likelihood to account for large sources of bias caused by boundary effects and aliasing. We generalise the approach to flexibly allow for significant volumes of missing data including those with lower-dimensional substructure, and for irregular sampling boundaries. We build a theoretical framework under relatively weak assumptions which ensures consistency and asymptotic normality in numerous practical settings including missing data and non-Gaussian processes. We also extend our consistency results to multivariate processes. We provide detailed implementation guidelines which ensure the estimation procedure can be conducted in O(n log n) operations, where n is the number of points of the encapsulating rectangular grid, thus keeping the computational scalability of Fourier and Whittle-based methods for large data sets. We validate our procedure over a range of simulated and real-world settings, and compare with state-of-the-art alternatives, demonstrating the enduring practical appeal of Fourier-based methods, provided they are corrected by the procedures developed in this paper.

preprint2020arXiv

Modeling Network Populations via Graph Distances

This article introduces a new class of models for multiple networks. The core idea is to parametrize a distribution on labelled graphs in terms of a Fréchet mean graph (which depends on a user-specified choice of metric or graph distance) and a parameter that controls the concentration of this distribution about its mean. Entropy is the natural parameter for such control, varying from a point mass concentrated on the Fréchet mean itself to a uniform distribution over all graphs on a given vertex set. We provide a hierarchical Bayesian approach for exploiting this construction, along with straightforward strategies for sampling from the resultant posterior distribution. We conclude by demonstrating the efficacy of our approach via simulation studies and two multiple-network data analysis examples: one drawn from systems biology and the other from neuroscience.

preprint2015arXiv

A Power Variance Test for Nonstationarity in Complex-Valued Signals

We propose a novel algorithm for testing the hypothesis of nonstationarity in complex-valued signals. The implementation uses both the bootstrap and the Fast Fourier Transform such that the algorithm can be efficiently implemented in O(NlogN) time, where N is the length of the observed signal. The test procedure examines the second-order structure and contrasts the observed power variance - i.e. the variability of the instantaneous variance over time - with the expected characteristics of stationary signals generated via the bootstrap method. Our algorithmic procedure is capable of learning different types of nonstationarity, such as jumps or strong sinusoidal components. We illustrate the utility of our test and algorithm through application to turbulent flow data from fluid dynamics.

preprint2014arXiv

Network histograms and universality of blockmodel approximation

In this article we introduce the network histogram: a statistical summary of network interactions, to be used as a tool for exploratory data analysis. A network histogram is obtained by fitting a stochastic blockmodel to a single observation of a network dataset. Blocks of edges play the role of histogram bins, and community sizes that of histogram bandwidths or bin sizes. Just as standard histograms allow for varying bandwidths, different blockmodel estimates can all be considered valid representations of an underlying probability model, subject to bandwidth constraints. Here we provide methods for automatic bandwidth selection, by which the network histogram approximates the generating mechanism that gives rise to exchangeable random graphs. This makes the blockmodel a universal network representation for unlabeled graphs. With this insight, we discuss the interpretation of network communities in light of the fact that many different community assignments can all give an equally valid representation of such a network. To demonstrate the fidelity-versus-interpretability tradeoff inherent in considering different numbers and sizes of communities, we analyze two publicly available networks - political weblogs and student friendships - and discuss how to interpret the network histogram when additional information related to node and edge labeling is present.

preprint2013arXiv

Degree-based network models

We derive the sampling properties of random networks based on weights whose pairwise products parameterize independent Bernoulli trials. This enables an understanding of many degree-based network models, in which the structure of realized networks is governed by properties of their degree sequences. We provide exact results and large-sample approximations for power-law networks and other more general forms. This enables us to quantify sampling variability both within and across network populations, and to characterize the limiting extremes of variation achievable through such models. Our results highlight that variation explained through expected degree structure need not be attributed to more complicated generative mechanisms.

preprint2013arXiv

Maximum-likelihood estimation of lithospheric flexural rigidity, initial-loading fraction, and load correlation, under isotropy

Topography and gravity are geophysical fields whose joint statistical structure derives from interface-loading processes modulated by the underlying mechanics of isostatic and flexural compensation in the shallow lithosphere. Under this dual statistical-mechanistic viewpoint an estimation problem can be formulated where the knowns are topography and gravity and the principal unknown the elastic flexural rigidity of the lithosphere. In the guise of an equivalent "effective elastic thickness", this important, geographically varying, structural parameter has been the subject of many interpretative studies, but precisely how well it is known or how best it can be found from the data, abundant nonetheless, has remained contentious and unresolved throughout the last few decades of dedicated study. The popular methods whereby admittance or coherence, both spectral measures of the relation between gravity and topography, are inverted for the flexural rigidity, have revealed themselves to have insufficient power to independently constrain both it and the additional unknown initial-loading fraction and load-correlation fac- tors, respectively. Solving this extremely ill-posed inversion problem leads to non-uniqueness and is further complicated by practical considerations such as the choice of regularizing data tapers to render the analysis sufficiently selective both in the spatial and spectral domains. Here, we rewrite the problem in a form amenable to maximum-likelihood estimation theory, which we show yields unbiased, minimum-variance estimates of flexural rigidity, initial-loading frac- tion and load correlation, each of those separably resolved with little a posteriori correlation between their estimates. We are also able to separately characterize the isotropic spectral shape of the initial loading processes.

preprint2013arXiv

Nonparametric graphon estimation

We propose a nonparametric framework for the analysis of networks, based on a natural limit object termed a graphon. We prove consistency of graphon estimation under general conditions, giving rates which include the important practical setting of sparse networks. Our results cover dense and sparse stochastic blockmodels with a growing number of classes, under model misspecification. We use profile likelihood methods, and connect our results to approximation theory, nonparametric function estimation, and the theory of graph limits.

preprint2012arXiv

Covariance of Replicated Modulated Cyclical Time Series

This paper introduces the novel class of modulated cyclostationary processes, a class of non-stationary processes exhibiting frequency coupling, and proposes a method of their estimation from repeated trials. Cyclostationary processes also exhibit frequency correlation but have Loeve spectra whose support lies only on parallel lines in the dual-frequency plane. Such extremely sparse structure does not adequately represent many biological processes. Thus, we propose a model that, in the time domain, modulates the covariance of cyclostationary processes and consequently broadens their frequency support in the dual-frequency plane. The spectra and the cross-coherence of the proposed modulated cyclostationary process are first estimated using multitaper methods. A shrinkage procedure is then applied to each trial-specific estimate to reduce the estimation risk. Multiple trials of each series are observed. When combining information across trials, we carefully take into account the bias that may be introduced by phase misalignment and the fact that the Loeve spectra and cross-coherence across replicates may only be "similar" - but not necessarily identical - across replicates. The application of the inference methods developed for the modulated cyclostationary model to EEG data also demonstrates that the proposed model captures statistically significant cross-frequency interactions, that ought to be further examined by neuroscientists.

preprint2012arXiv

Generalized Morse Wavelets as a Superfamily of Analytic Wavelets

The generalized Morse wavelets are shown to constitute a superfamily that essentially encompasses all other commonly used analytic wavelets, subsuming eight apparently distinct types of analysis filters into a single common form. This superfamily of analytic wavelets provides a framework for systematically investigating wavelet suitability for various applications. In addition to a parameter controlling the time-domain duration or Fourier-domain bandwidth, the wavelet {\em shape} with fixed bandwidth may be modified by varying a second parameter, called $γ$. For integer values of $γ$, the most symmetric, most nearly Gaussian, and generally most time-frequency concentrated member of the superfamily is found to occur for $γ=3$. These wavelets, known as "Airy wavelets," capture the essential idea of popular Morlet wavelet, while avoiding its deficiencies. They may be recommended as an ideal starting point for general purpose use.

preprint2012arXiv

Order statistics of observed network degrees

This article discusses the properties of extremes of degree sequences calculated from network data. We introduce the notion of a normalized degree, in order to permit a comparison of degree sequences between networks with differing numbers of nodes. We model each normalized degree as a bounded continuous random variable, and determine the properties of the ordered k-maxima and minima of the normalized network degrees when they comprise a random sample from a Beta distribution. In this setting, their means and variances take a simplified form given by their ordering, and we discuss the relation of these quantities to other prescribed decays such as power laws. We verify the derived properties from simulated sets of normalized degrees, and discuss possible extensions to more flexible classes of distributions.

preprint2011arXiv

Analysis of Modulated Multivariate Oscillations

The concept of a common modulated oscillation spanning multiple time series is formalized, a method for the recovery of such a signal from potentially noisy observations is proposed, and the time-varying bias properties of the recovery method are derived. The method, an extension of wavelet ridge analysis to the multivariate case, identifies the common oscillation by seeking, at each point in time, a frequency for which a bandpassed version of the signal obtains a local maximum in power. The lowest-order bias is shown to involve a quantity, termed the instantaneous curvature, which measures the strength of local quadratic modulation of the signal after demodulation by the common oscillation frequency. The bias can be made to be small if the analysis filter, or wavelet, can be chosen such that the signal's instantaneous curvature changes little over the filter time scale. An application is presented to the detection of vortex motions in a set of freely-drifting oceanographic instruments tracking the ocean currents.

preprint2011arXiv

Bivariate Instantaneous Frequency and Bandwidth

The generalizations of instantaneous frequency and instantaneous bandwidth to a bivariate signal are derived. These are uniquely defined whether the signal is represented as a pair of real-valued signals, or as one analytic and one anti-analytic signal. A nonstationary but oscillatory bivariate signal has a natural representation as an ellipse whose properties evolve in time, and this representation provides a simple geometric interpretation for the bivariate instantaneous moments. The bivariate bandwidth is shown to consist of three terms measuring the degree of instability of the time-varying ellipse: amplitude modulation with fixed eccentricity, eccentricity modulation, and orientation modulation or precession. An application to the analysis of data from a free-drifting oceanographic float is presented and discussed.

preprint2011arXiv

Extracting waves and vortices from Lagrangian trajectories

A method for extracting time-varying oscillatory motions from time series records is applied to Lagrangian trajectories from a numerical model of eddies generated by an unstable equivalent barotropic jet on a beta plane. An oscillation in a Lagrangian trajectory is represented mathematically as the signal traced out as a particle orbits a time-varying ellipse, a model which captures wavelike motions as well as the displacement signal of a particle trapped in an evolving vortex. Such oscillatory features can be separated from the turbulent background flow through an analysis founded upon a complex-valued wavelet transform of the trajectory. Application of the method to a set of one hundred modeled trajectories shows that the oscillatory motions of Lagrangian particles orbiting vortex cores appear to be extracted very well by the method, which depends upon only a handful of free parameters and which requires no operator intervention. Furthermore, vortex motions are clearly distinguished from wavelike meandering of the jet---the former are high frequency, nearly circular signals, while the latter are linear in polarization and at much lower frequencies. This suggests that the proposed method can be useful for identifying and studying vortex and wave properties in large Lagrangian datasets. In particular, the eccentricity of the oscillatory displacement signals, a quantity which is not normally considered in Lagrangian studies, emerges as an informative diagnostic for characterizing qualitatively different types of motion.

preprint2011arXiv

Nonparametric tests of structure for high angular resolution diffusion imaging in Q-space

High angular resolution diffusion imaging data is the observed characteristic function for the local diffusion of water molecules in tissue. This data is used to infer structural information in brain imaging. Nonparametric scalar measures are proposed to summarize such data, and to locally characterize spatial features of the diffusion probability density function (PDF), relying on the geometry of the characteristic function. Summary statistics are defined so that their distributions are, to first-order, both independent of nuisance parameters and also analytically tractable. The dominant direction of the diffusion at a spatial location (voxel) is determined, and a new set of axes are introduced in Fourier space. Variation quantified in these axes determines the local spatial properties of the diffusion density. Nonparametric hypothesis tests for determining whether the diffusion is unimodal, isotropic or multi-modal are proposed. More subtle characteristics of white-matter microstructure, such as the degree of anisotropy of the PDF and symmetry compared with a variety of asymmetric PDF alternatives, may be ascertained directly in the Fourier domain without parametric assumptions on the form of the diffusion PDF. We simulate a set of diffusion processes and characterize their local properties using the newly introduced summaries. We show how complex white-matter structures across multiple voxels exhibit clear ellipsoidal and asymmetric structure in simulation, and assess the performance of the statistics in clinically-acquired magnetic resonance imaging data.

preprint2011arXiv

On the Analytic Wavelet Transform

An exact and general expression for the analytic wavelet transform of a real-valued signal is constructed, resolving the time-dependent effects of non-negligible amplitude and frequency modulation. The analytic signal is first locally represented as a modulated oscillation, demodulated by its own instantaneous frequency, and then Taylor-expanded at each point in time. The terms in this expansion, called the instantaneous modulation functions, are time-varying functions which quantify, at increasingly higher orders, the local departures of the signal from a uniform sinusoidal oscillation. Closed-form expressions for these functions are found in terms of Bell polynomials and derivatives of the signal's instantaneous frequency and bandwidth. The analytic wavelet transform is shown to depend upon the interaction between the signal's instantaneous modulation functions and frequency-domain derivatives of the wavelet, inducing a hierarchy of departures of the transform away from a perfect representation of the signal. The form of these deviation terms suggests a set of conditions for matching the wavelet properties to suit the variability of the signal, in which case our expressions simplify considerably. One may then quantify the time-varying bias associated with signal estimation via wavelet ridge analysis, and choose wavelets to minimize this bias.

Sofia C. Olhede

What is connected

Connect this record

See the researcher in context

Building this map preview

17 published item(s)

Edge coherence in multiplex networks

Networks with Correlated Edge Processes

The Debiased Spatial Whittle Likelihood

Modeling Network Populations via Graph Distances

A Power Variance Test for Nonstationarity in Complex-Valued Signals

Network histograms and universality of blockmodel approximation

Degree-based network models

Maximum-likelihood estimation of lithospheric flexural rigidity, initial-loading fraction, and load correlation, under isotropy

Nonparametric graphon estimation

Covariance of Replicated Modulated Cyclical Time Series

Generalized Morse Wavelets as a Superfamily of Analytic Wavelets

Order statistics of observed network degrees

Analysis of Modulated Multivariate Oscillations

Bivariate Instantaneous Frequency and Bandwidth

Extracting waves and vortices from Lagrangian trajectories

Nonparametric tests of structure for high angular resolution diffusion imaging in Q-space

On the Analytic Wavelet Transform