Source author record

Daniel Giles

Daniel Giles appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

astro-ph.IM math.OC physics.ao-ph Artificial Intelligence physics.comp-ph physics.plasm-ph

Catalog footprint

What is connected

6works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Uncertainty Quantification of Surrogate Models using Conformal Prediction

Data-driven surrogate models offer quick approximations to complex numerical and experimental systems but typically lack uncertainty quantification, limiting their reliability in safety-critical applications. While Bayesian methods provide uncertainty estimates, they offer no statistical guarantees and struggle with high-dimensional spatio-temporal problems due to computational costs. We present a conformal prediction (CP) framework that provides statistically guaranteed marginal coverage for surrogate models in a model-agnostic manner with near-zero computational cost. Our approach handles high-dimensional spatio-temporal outputs by performing cell-wise calibration while preserving the tensorial structure of predictions. Through extensive empirical evaluation across diverse applications including fluid dynamics, magnetohydrodynamics, weather forecasting, and fusion diagnostics, we demonstrate that CP achieves empirical coverage with valid error bars regardless of model architecture, training regime, or output dimensionality. We evaluate three nonconformity scores (conformalised quantile regression, absolute error residual, and standard deviation) for both deterministic and probabilistic models, showing that guaranteed coverage holds even for out-of-distribution predictions where models are deployed on physics regimes different from training data. Calibration requires only seconds to minutes on standard hardware. The framework enables rigorous validation of pre-trained surrogate models for downstream applications without retraining. While CP provides marginal rather than conditional coverage and assumes exchangeability between calibration and test data, our method circumvents the curse of dimensionality inherent in traditional uncertainty quantification approaches, offering a practical tool for trustworthy deployment of machine learning in physical sciences.

preprint2022arXiv

Searching the SETI Ellipsoid with Gaia

The SETI Ellipsoid is a geometric method for prioritizing technosignature observations based on the strategy of receiving signals synchronized to conspicuous astronomical events. Precise distances to nearby stars from Gaia makes constraining Ellipsoid crossing times possible. Here we explore the utility of using the Gaia Catalog of Nearby Stars to select targets on the SN 1987A SETI Ellipsoid, as well the Ellipsoids defined by 278 classical novae. Less than 8% of stars within the 100 pc sample are inside the SN 1987A SETI Ellipsoid, meaning the vast majority of nearby stars are still viable targets for monitoring over time. We find an average of 734 stars per year within the 100 pc volume will intersect the Ellipsoid from SN 1987A, with ~10% of those having distance uncertainties from Gaia better than 0.1 lyr.

preprint2020arXiv

Density Based Outlier Scoring on Kepler Data

In the present era of large scale surveys, big data presents new challenges to the discovery process for anomalous data. Such data can be indicative of systematic errors, extreme (or rare) forms of known phenomena, or most interestingly, truly novel phenomena which exhibit as-of-yet unobserved behaviors. In this work we present an outlier scoring methodology to identify and characterize the most promising unusual sources to facilitate discoveries of such anomalous data. We have developed a data mining method based on k-Nearest Neighbor distance in feature space to efficiently identify the most anomalous lightcurves. We test variations of this method including using principal components of the feature space, removing select features, the effect of the choice of k, and scoring to subset samples. We evaluate the peformance of our scoring on known object classes and find that our scoring consistently scores rare (<1000) object classes higher than common classes. We have applied scoring to all long cadence lightcurves of quarters 1 to 17 of Kepler's prime mission and present outlier scores for all 2.8 million lightcurves for the roughly 200k objects.

preprint2020arXiv

Modelling with Volna-OP2: Towards tsunami threat reduction

Accurate and efficient tsunami modelling is essential for providing tsunami forecasts and hazard assessments. Volna-OP2 is a finite volume solver of the nonlinear shallow water equations and its capabilities of producing both faster than real time ensembles and high resolution inundation studies are presented here. The code is massively parallelised and can utilise various high performance computing architectures. When an earthquake is detected there is always some uncertainty on the source parameters. Generating a faster than real time ensemble for maximum wave heights which captures this uncertainty would be of great benefit to tsunami warning centres. The 2003 Boumerdes earthquake (Algeria) acts as a test case for showing Volna-OP2's ability at rapidly forecasting regional maximum wave heights. Drawing on various earthquake sources proposed in the literature and scaling the magnitudes to mimic uncertainty on the source, 20 separate earthquake realisations are simulated for 4 hours real time in 97s on two Nvidia V100 GPUs. Further a reduced ensemble of the Lisbon 1755 tsunami with an emphasis on the effects to the Irish coastline is presented. Where again various earthquake sources have been drawn from the literature and simulated on a regional scale. Finally, a pilot study which builds upon the reduced ensemble results investigates the inundation of a Lisbon tsunami on key sections of the Irish coastline. The results of this pilot study highlight that the inundation is constrained to low-lying areas with maximum run-up heights of $\approx 3.4m$ being found.

preprint2016arXiv

Minimizing Differences of Convex Functions and Applications to Facility Location and Clustering

In this paper we develop algorithms to solve generalized weighted Fermat-Torricelli problems with positive and negative weights and multifacility location problems involving distances generated by Minkowski gauges. We also introduce a new model of clustering based on squared distances to convex sets. Using the Nesterov smoothing technique and an algorithm for minimizing differences of convex functions called the DCA introduced by Tao and An, we develop effective algorithms for solving these problems.

preprint2015arXiv

The Log-Exponential Smoothing Technique and Nesterov's Accelerated Gradient Method for Generalized Sylvester Problems

The Sylvester smallest enclosing circle problem involves finding the smallest circle that encloses a finite number of points in the plane. We consider generalized versions of the Sylvester problem in which the points are replaced by sets. Based on the log-exponential smoothing technique and Nesterov's accelerated gradient method, we present an effective numerical algorithm for solving these problems.