Source author record

Mike Hobson

Mike Hobson appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher | Unclaimed source record

Machine Learning, Applications, astro-ph.EP, astro-ph.IM, Computation, gr-qc, Methodology, physics.data-an

Catalog footprint

What is connected

5 works

8 topics

4 close collaborators

Preprint, 2022, arXiv

Split personalities in Bayesian Neural Networks: the case for full marginalisation

The true posterior distribution of a Bayesian neural network is massively multimodal. Whilst most of these modes are functionally equivalent, we demonstrate that there remains a level of real multimodality that manifests in even the simplest neural network setups. It is only by fully marginalising over all posterior modes, using appropriate Bayesian sampling tools, that we can capture the split personalities of the network. The ability of a network trained in this manner to reason between multiple candidate solutions dramatically improves the generalisability of the model, a feature we contend is not consistently captured by alternative approaches to the training of Bayesian neural networks. We provide a concise minimal example of this, which can provide lessons and a future path forward for correctly utilising the explainability and interpretability of Bayesian neural networks.
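The functionally equivalent modes mentioned in this abstract can be illustrated with the weight-space symmetries of a small network. The following is a minimal sketch, assuming a scalar-output network with one tanh hidden layer; the architecture, the function name `mlp`, and all values are illustrative assumptions and are not taken from the paper. It checks that relabelling hidden units, or flipping the sign of every weight attached to one unit, changes the parameters but not the output.

```python
import math
import random


def mlp(x, W1, b1, W2, b2):
    """Scalar-output network with one tanh hidden layer.

    W1[j] holds the input weights of hidden unit j, b1[j] its bias,
    and W2[j] its output weight; b2 is the output bias.
    """
    hidden = [
        math.tanh(sum(xi * w for xi, w in zip(x, W1[j])) + b1[j])
        for j in range(len(b1))
    ]
    return sum(h * w for h, w in zip(hidden, W2)) + b2


# Hypothetical random parameters for a 2-input, 3-hidden-unit network.
rng = random.Random(0)
n_in, n_hidden = 2, 3
W1 = [[rng.gauss(0, 1) for _ in range(n_in)] for _ in range(n_hidden)]
b1 = [rng.gauss(0, 1) for _ in range(n_hidden)]
W2 = [rng.gauss(0, 1) for _ in range(n_hidden)]
b2 = rng.gauss(0, 1)
x = [0.3, -1.2]

y = mlp(x, W1, b1, W2, b2)

# Permutation symmetry: relabelling the hidden units leaves the output unchanged.
perm = [2, 0, 1]
y_perm = mlp(
    x,
    [W1[j] for j in perm],
    [b1[j] for j in perm],
    [W2[j] for j in perm],
    b2,
)

# Sign-flip symmetry: tanh is odd, so negating every weight into and
# out of hidden unit 0 cancels exactly.
W1_flip = [[-w for w in W1[0]]] + W1[1:]
b1_flip = [-b1[0]] + b1[1:]
W2_flip = [-W2[0]] + W2[1:]
y_flip = mlp(x, W1_flip, b1_flip, W2_flip, b2)

perm_equivalent = math.isclose(y, y_perm, abs_tol=1e-12)
flip_equivalent = math.isclose(y, y_flip, abs_tol=1e-12)
```

Each such transformation maps one posterior mode onto a copy that describes the same function. The multimodality the abstract calls "real" is whatever remains once these copies are accounted for, which this sketch does not attempt to show.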

Preprint, 2020, arXiv

Compromise-free Bayesian neural networks

We conduct a thorough analysis of the relationship between the out-of-sample performance and the Bayesian evidence (marginal likelihood) of Bayesian neural networks (BNNs), as well as looking at the performance of ensembles of BNNs, both using the Boston housing dataset. Using the state-of-the-art in nested sampling, we numerically sample the full (non-Gaussian and multimodal) network posterior and obtain numerical estimates of the Bayesian evidence, considering network models with up to 156 trainable parameters. The networks have between zero and four hidden layers, either $\tanh$ or $ReLU$ activation functions, and with and without hierarchical priors. The ensembles of BNNs are obtained by determining the posterior distribution over networks, from the posterior samples of individual BNNs re-weighted by the associated Bayesian evidence values. There is good correlation between out-of-sample performance and evidence, as well as a remarkable symmetry between the evidence versus model size and out-of-sample performance versus model size planes. Networks with $ReLU$ activation functions have consistently higher evidences than those with $\tanh$ functions, and this is reflected in their out-of-sample performance. Ensembling over architectures acts to further improve performance relative to the individual BNNs.
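The evidence re-weighting described in this abstract can be sketched as follows. This is a simplified illustration, assuming a uniform prior over architectures, so that each network's posterior probability is proportional to its evidence, and assuming each network is summarised by a single posterior-mean prediction rather than by its full set of posterior samples. The function names and all numbers are hypothetical and do not come from the paper.

```python
import math


def model_posterior_weights(log_evidences):
    """Posterior model probabilities from log-evidences, assuming equal model priors.

    Subtracting the maximum before exponentiating avoids underflow when the
    log-evidences are large and negative.
    """
    largest = max(log_evidences)
    unnormalised = [math.exp(log_z - largest) for log_z in log_evidences]
    total = sum(unnormalised)
    return [u / total for u in unnormalised]


def ensemble_prediction(log_evidences, predictions):
    """Evidence-weighted average of per-network predictions."""
    weights = model_posterior_weights(log_evidences)
    return sum(w * p for w, p in zip(weights, predictions))


# Made-up log-evidences and predictions for three hypothetical networks.
log_z = [-250.0, -248.0, -255.0]
preds = [21.0, 23.0, 19.0]

weights = model_posterior_weights(log_z)
y_ens = ensemble_prediction(log_z, preds)
```

Because the weights depend only on differences in log-evidence, a network whose log-evidence is lower by a few units contributes little to the ensemble.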

Preprint, 2013, arXiv

Bayesian analysis of radial velocity data of GJ667C with correlated noise: evidence for only 2 planets

GJ667C is the least massive component of a triple star system which lies at a distance of about 6.8 pc (22.1 light-years) from Earth. GJ667C has received much attention recently due to the claims that it hosts up to seven planets including three super-Earths inside the habitable zone. We present a Bayesian technique for the analysis of radial velocity (RV) data-sets in the presence of a correlated noise component ("red noise") with unknown parameters. We also introduce hyper-parameters in our model in order to deal statistically with under- or over-estimated error bars on measured RVs as well as inconsistencies between different data-sets. By applying this method to the RV data-set of GJ667C, we show that this data-set contains a significant correlated (red) noise component with correlation timescale for HARPS data of order 9 days. Our analysis shows that the data only provides strong evidence for the presence of two planets: GJ667Cb and c with periods 7.19 d and 28.13 d respectively, with some hints towards the presence of a third signal with period 91 d. The planetary nature of this third signal is not clear and additional RV observations are required for its confirmation. Previous claims of the detection of additional planets in this system are due to the erroneous assumption of white noise. Using the standard white noise assumption, our method leads to the detection of up to five signals in this system. We also find that with the red noise model, the measurement uncertainties from HARPS for this system are under-estimated at the level of ~50 per cent.
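To make the red-noise idea in this abstract concrete, here is a minimal sketch of a Gaussian likelihood for RV residuals with a correlated noise term and an error-bar rescaling hyper-parameter. The abstract does not specify the covariance function, so the exponential kernel used here is an assumption, as are the function name `red_noise_loglike` and the parameters `alpha` (error-bar scaling), `amp` (red-noise amplitude) and `tau` (correlation timescale). The residuals are taken to be the measured RVs minus a planetary model, which is not implemented here.

```python
import numpy as np


def red_noise_loglike(t, residuals, sigma, alpha, amp, tau):
    """Gaussian log-likelihood of RV residuals under correlated noise.

    Assumed covariance (illustrative, not taken from the paper):
        C_ij = (alpha * sigma_i)^2 * delta_ij + amp^2 * exp(-|t_i - t_j| / tau)
    """
    dt = np.abs(t[:, None] - t[None, :])
    cov = np.diag((alpha * sigma) ** 2) + amp**2 * np.exp(-dt / tau)

    # The Cholesky factor gives both the whitened residuals and the log-determinant.
    chol = np.linalg.cholesky(cov)
    whitened = np.linalg.solve(chol, residuals)
    log_det = 2.0 * np.sum(np.log(np.diag(chol)))

    n = len(t)
    return -0.5 * (whitened @ whitened + log_det + n * np.log(2.0 * np.pi))


# Made-up observation times (days), residuals and quoted error bars (m/s).
t = np.array([0.0, 1.0, 3.0, 10.0, 11.5])
residuals = np.array([1.0, -0.5, 0.3, 2.0, 1.4])
sigma = np.array([1.0, 2.0, 1.5, 1.0, 1.2])

loglike_white = red_noise_loglike(t, residuals, sigma, alpha=1.0, amp=0.0, tau=9.0)
loglike_red = red_noise_loglike(t, residuals, sigma, alpha=1.5, amp=2.0, tau=9.0)
```

Setting `amp=0` and `alpha=1` recovers the standard white-noise likelihood, so the two noise models can be compared on the same data within one parameterisation.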

Preprint, 2011, arXiv

An investigation into the Multiple Optimised Parameter Estimation and Data compression algorithm

We investigate the use of the Multiple Optimised Parameter Estimation and Data compression algorithm (MOPED) for data compression and faster evaluation of likelihood functions. Since MOPED only guarantees maintaining the Fisher matrix of the likelihood at a chosen point, multimodal and some degenerate distributions will present a problem. We present examples of scenarios in which MOPED does faithfully represent the true likelihood but also cases in which it does not. Through these examples, we aim to define a set of criteria for which MOPED will accurately represent the likelihood and hence may be used to obtain a significant reduction in the time needed to calculate it. These criteria may involve the evaluation of the full likelihood function for comparison.
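The compression step can be sketched for the simplest setting: a Gaussian likelihood whose covariance does not depend on the parameters, with one compression vector per parameter built by a Gram-Schmidt-style procedure at a fiducial point. The function name `moped_vectors` and the random test inputs are illustrative assumptions; this is a sketch of the standard construction under those conditions, not the code used in the paper.

```python
import numpy as np


def moped_vectors(cov, dmu):
    """MOPED-style compression vectors for a Gaussian likelihood.

    cov : (n, n) symmetric data covariance, assumed parameter-independent.
    dmu : (p, n) derivatives of the model mean with respect to each of the
          p parameters, evaluated at the chosen fiducial point.

    Returns a (p, n) array B whose rows satisfy B @ cov @ B.T = identity.
    """
    cinv_dmu = np.linalg.solve(cov, dmu.T).T  # row m is C^{-1} mu_{,m}
    vectors = []
    for m in range(dmu.shape[0]):
        # Projections of the m-th mean derivative onto the earlier vectors.
        proj = np.array([dmu[m] @ b for b in vectors])
        numerator = cinv_dmu[m] - sum(p * b for p, b in zip(proj, vectors))
        norm_sq = dmu[m] @ cinv_dmu[m] - np.sum(proj**2)
        vectors.append(numerator / np.sqrt(norm_sq))
    return np.array(vectors)


# Hypothetical problem: 6 data points compressed to 2 numbers.
rng = np.random.default_rng(1)
A = rng.normal(size=(6, 6))
cov = A @ A.T + 6.0 * np.eye(6)  # symmetric positive-definite covariance
dmu = rng.normal(size=(2, 6))

B = moped_vectors(cov, dmu)
data = rng.normal(size=6)
compressed = B @ data  # two numbers replace the six data points

# Fisher matrix before and after compression, at the fiducial point.
fisher_full = dmu @ np.linalg.solve(cov, dmu.T)
fisher_compressed = (B @ dmu.T).T @ (B @ dmu.T)
```

The two Fisher matrices agree at the fiducial point, which is the only guarantee the abstract attributes to MOPED. Nothing in the construction constrains the likelihood away from that point, which is why multimodal or degenerate distributions can be misrepresented.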

Preprint, 2010, arXiv

The Mock LISA Data Challenges: from Challenge 3 to Challenge 4

The Mock LISA Data Challenges are a program to demonstrate LISA data-analysis capabilities and to encourage their development. Each round of challenges consists of one or more datasets containing simulated instrument noise and gravitational waves from sources of undisclosed parameters. Participants analyze the datasets and report best-fit solutions for the source parameters. Here we present the results of the third challenge, issued in April 2008, which demonstrated the positive recovery of signals from chirping Galactic binaries, from spinning supermassive-black-hole binaries (with optimal SNRs between ~10 and 2000), from simultaneous extreme-mass-ratio inspirals (SNRs of 10-50), from cosmic-string-cusp bursts (SNRs of 10-100), and from a relatively loud isotropic background with $\Omega_{\rm gw}(f) \sim 10^{-11}$, slightly below the LISA instrument noise.