Researcher profile

Daniel Giles

Daniel Giles contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 17 - UnverifiedVerification L1Unclaimed author
4works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

4 published item(s)

preprint2026arXiv

Uncertainty Quantification of Surrogate Models using Conformal Prediction

Data-driven surrogate models offer quick approximations to complex numerical and experimental systems but typically lack uncertainty quantification, limiting their reliability in safety-critical applications. While Bayesian methods provide uncertainty estimates, they offer no statistical guarantees and struggle with high-dimensional spatio-temporal problems due to computational costs. We present a conformal prediction (CP) framework that provides statistically guaranteed marginal coverage for surrogate models in a model-agnostic manner with near-zero computational cost. Our approach handles high-dimensional spatio-temporal outputs by performing cell-wise calibration while preserving the tensorial structure of predictions. Through extensive empirical evaluation across diverse applications including fluid dynamics, magnetohydrodynamics, weather forecasting, and fusion diagnostics, we demonstrate that CP achieves empirical coverage with valid error bars regardless of model architecture, training regime, or output dimensionality. We evaluate three nonconformity scores (conformalised quantile regression, absolute error residual, and standard deviation) for both deterministic and probabilistic models, showing that guaranteed coverage holds even for out-of-distribution predictions where models are deployed on physics regimes different from training data. Calibration requires only seconds to minutes on standard hardware. The framework enables rigorous validation of pre-trained surrogate models for downstream applications without retraining. While CP provides marginal rather than conditional coverage and assumes exchangeability between calibration and test data, our method circumvents the curse of dimensionality inherent in traditional uncertainty quantification approaches, offering a practical tool for trustworthy deployment of machine learning in physical sciences.

preprint2022arXiv

Searching the SETI Ellipsoid with Gaia

The SETI Ellipsoid is a geometric method for prioritizing technosignature observations based on the strategy of receiving signals synchronized to conspicuous astronomical events. Precise distances to nearby stars from Gaia makes constraining Ellipsoid crossing times possible. Here we explore the utility of using the Gaia Catalog of Nearby Stars to select targets on the SN 1987A SETI Ellipsoid, as well the Ellipsoids defined by 278 classical novae. Less than 8% of stars within the 100 pc sample are inside the SN 1987A SETI Ellipsoid, meaning the vast majority of nearby stars are still viable targets for monitoring over time. We find an average of 734 stars per year within the 100 pc volume will intersect the Ellipsoid from SN 1987A, with ~10% of those having distance uncertainties from Gaia better than 0.1 lyr.

preprint2020arXiv

Density Based Outlier Scoring on Kepler Data

In the present era of large scale surveys, big data presents new challenges to the discovery process for anomalous data. Such data can be indicative of systematic errors, extreme (or rare) forms of known phenomena, or most interestingly, truly novel phenomena which exhibit as-of-yet unobserved behaviors. In this work we present an outlier scoring methodology to identify and characterize the most promising unusual sources to facilitate discoveries of such anomalous data. We have developed a data mining method based on k-Nearest Neighbor distance in feature space to efficiently identify the most anomalous lightcurves. We test variations of this method including using principal components of the feature space, removing select features, the effect of the choice of k, and scoring to subset samples. We evaluate the peformance of our scoring on known object classes and find that our scoring consistently scores rare (<1000) object classes higher than common classes. We have applied scoring to all long cadence lightcurves of quarters 1 to 17 of Kepler&#39;s prime mission and present outlier scores for all 2.8 million lightcurves for the roughly 200k objects.

preprint2020arXiv

Modelling with Volna-OP2: Towards tsunami threat reduction

Accurate and efficient tsunami modelling is essential for providing tsunami forecasts and hazard assessments. Volna-OP2 is a finite volume solver of the nonlinear shallow water equations and its capabilities of producing both faster than real time ensembles and high resolution inundation studies are presented here. The code is massively parallelised and can utilise various high performance computing architectures. When an earthquake is detected there is always some uncertainty on the source parameters. Generating a faster than real time ensemble for maximum wave heights which captures this uncertainty would be of great benefit to tsunami warning centres. The 2003 Boumerdes earthquake (Algeria) acts as a test case for showing Volna-OP2&#39;s ability at rapidly forecasting regional maximum wave heights. Drawing on various earthquake sources proposed in the literature and scaling the magnitudes to mimic uncertainty on the source, 20 separate earthquake realisations are simulated for 4 hours real time in 97s on two Nvidia V100 GPUs. Further a reduced ensemble of the Lisbon 1755 tsunami with an emphasis on the effects to the Irish coastline is presented. Where again various earthquake sources have been drawn from the literature and simulated on a regional scale. Finally, a pilot study which builds upon the reduced ensemble results investigates the inundation of a Lisbon tsunami on key sections of the Irish coastline. The results of this pilot study highlight that the inundation is constrained to low-lying areas with maximum run-up heights of $\approx 3.4m$ being found.