Researcher profile

Kevin Burke

Kevin Burke contributes to research discovery and scholarly infrastructure.

Researcher, affiliation not imported, open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
12 works
0 followers
12 topics
4 close collaborators

Published work

12 published items

preprint, 2026, arXiv

Comparison of generalised additive models and neural networks in applications: A systematic review

Neural networks have become a popular tool in predictive modelling, more commonly associated with machine learning and artificial intelligence than with statistics. Generalised Additive Models (GAMs) are flexible non-linear statistical models that retain interpretability. Both are state-of-the-art in their own right, with their respective advantages and disadvantages. This paper analyses how these two model classes have performed on real-world tabular data. Following PRISMA guidelines, we conducted a systematic review of papers that performed empirical comparisons of GAMs and neural networks. Eligible papers were identified, yielding 143 papers, with 430 datasets. Key attributes at both paper and dataset levels were extracted and reported. Beyond summarising comparisons, we analyse reported performance metrics using mixed-effects modelling to investigate potential characteristics that can explain and quantify observed differences, including application area, study year, sample size, number of predictors, and neural network complexity. Across datasets, no consistent evidence of superiority was found for either GAMs or neural networks when considering the most frequently reported metrics (RMSE, $R^2$, and AUC). Neural networks tended to outperform in larger datasets and in those with more predictors, but this advantage narrowed over time. Conversely, GAMs remained competitive, particularly in smaller data settings, while retaining interpretability. Reporting of dataset characteristics and neural network complexity was incomplete in much of the literature, limiting transparency and reproducibility. This review highlights that GAMs and neural networks should be viewed as complementary approaches rather than competitors. For many tabular applications, the performance trade-off is modest, and interpretability may favour GAMs.

preprint, 2023, arXiv

Robust Distributional Regression with Automatic Variable Selection

Datasets with extreme observations and/or heavy-tailed error distributions are commonly encountered and should be analyzed with careful consideration of these features from a statistical perspective. Small deviations from an assumed model, such as the presence of outliers, can cause classical regression procedures to break down, potentially leading to unreliable inferences. Other distributional deviations, such as heteroscedasticity, can be handled by going beyond the mean and modelling the scale parameter in terms of covariates. We propose a method that accounts for heavy tails and heteroscedasticity through the use of a generalized normal distribution (GND). The GND contains a kurtosis-characterizing shape parameter that moves the model smoothly between the normal distribution and the heavier-tailed Laplace distribution - thus covering both classical and robust regression. A key component of statistical inference is determining the set of covariates that influence the response variable. While correctly accounting for kurtosis and heteroscedasticity is crucial to this endeavour, a procedure for variable selection is still required. For this purpose, we use a novel penalized estimation procedure that avoids the typical computationally demanding grid search for tuning parameters. This is particularly valuable in the distributional regression setting where the location and scale parameters depend on covariates, since the standard approach would have multiple tuning parameters (one for each distributional parameter). We achieve this by using a "smooth information criterion" that can be optimized directly, where the tuning parameters are fixed at log(n) in the BIC case.
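The way a single shape parameter moves the generalized normal distribution between the normal and Laplace cases can be checked numerically. The sketch below assumes one common parameterization of the density, f(x) = s / (2σΓ(1/s)) · exp(-(|x - μ|/σ)^s); this form and all function names are illustrative assumptions and may differ from the parameterization used in the paper.

```python
import math

def gnd_pdf(x, mu=0.0, sigma=1.0, shape=2.0):
    """Generalized normal density (assumed parameterization, see lead-in)."""
    z = abs(x - mu) / sigma
    return shape / (2.0 * sigma * math.gamma(1.0 / shape)) * math.exp(-(z ** shape))

def normal_pdf(x, mu=0.0, sd=1.0):
    """Standard normal density with mean mu and standard deviation sd."""
    return math.exp(-0.5 * ((x - mu) / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))

def laplace_pdf(x, mu=0.0, b=1.0):
    """Laplace density with location mu and scale b."""
    return math.exp(-abs(x - mu) / b) / (2.0 * b)

# shape = 2 recovers a normal density with sd = sigma / sqrt(2);
# shape = 1 recovers a Laplace density with scale b = sigma.
for x in (-2.0, -0.5, 0.0, 1.3):
    assert math.isclose(gnd_pdf(x, sigma=1.5, shape=2.0),
                        normal_pdf(x, sd=1.5 / math.sqrt(2.0)))
    assert math.isclose(gnd_pdf(x, sigma=1.5, shape=1.0),
                        laplace_pdf(x, b=1.5))
```

Under this parameterization, shape values between 1 and 2 give tails heavier than the normal but lighter than the Laplace, which is the range the abstract describes.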

preprint, 2023, arXiv

Variable Selection Using a Smooth Information Criterion for Distributional Regression Models

Modern variable selection procedures make use of penalization methods to execute simultaneous model selection and estimation. A popular method is the LASSO (least absolute shrinkage and selection operator), the use of which requires selecting the value of a tuning parameter. This parameter is typically tuned by minimizing the cross-validation error or Bayesian information criterion (BIC) but this can be computationally intensive as it involves fitting an array of different models and selecting the best one. In contrast with this standard approach, we have developed a procedure based on the so-called "smooth IC" (SIC) in which the tuning parameter is automatically selected in one step. We also extend this model selection procedure to the distributional regression framework, which is more flexible than classical regression modelling. Distributional regression, also known as multiparameter regression (MPR), introduces flexibility by taking account of the effect of covariates through multiple distributional parameters simultaneously, e.g., mean and variance. These models are useful in the context of normal linear regression when the process under study exhibits heteroscedastic behaviour. Reformulating the distributional regression estimation problem in terms of penalized likelihood enables us to take advantage of the close relationship between model selection criteria and penalization. Utilizing the SIC is computationally advantageous, as it obviates the issue of having to choose multiple tuning parameters.

preprint, 2022, arXiv

Process Visualization of Manufacturing Execution System (MES) Data

Process visualizations of data from manufacturing execution systems (MESs) provide the ability to generate valuable insights for improved decision-making. Industry 4.0 is awakening a digital transformation where advanced analytics and visualizations are critical. Exploiting MESs with data-driven strategies can have a major impact on business outcomes. The advantages of employing process visualizations are demonstrated through an application to real-world data. Visualizations, such as dashboards, enable the user to examine the performance of a production line at a high level. Furthermore, the addition of interactivity lets the user customize the data they want to observe. Evidence of process variability between shifts and days of the week can be investigated with the goal of optimizing production.

preprint, 2021, arXiv

An age-structured SEIR model for COVID-19 incidence in Dublin, Ireland with framework for evaluating health intervention cost

Strategies adopted globally to mitigate the threat of COVID-19 have primarily involved lockdown measures with substantial economic and social costs and varying degrees of success. Morbidity patterns of COVID-19 variants have a strong association with age, while restrictive lockdown measures are associated with negative mental health outcomes in some age groups. Reduced economic prospects may also afflict some age cohorts more than others. Motivated by this, we propose a model to describe COVID-19 community spread incorporating the role of age-specific social interactions. Through a flexible parameterisation of an age-structured deterministic Susceptible Exposed Infectious Removed (SEIR) model, we provide a means for characterising different forms of lockdown which may impact specific age groups differently. Social interactions are represented through age group to age group contact matrices, which can be trained using available data and are thus locally adapted. This framework is easy to interpret and suitable for describing counterfactual scenarios, which could assist policy makers with regard to minimising morbidity balanced with the costs of prospective suppression strategies. Our work originates from an Irish context and we use disease monitoring data from February 29th 2020 to January 31st 2021 gathered by Irish governmental agencies. We demonstrate how Irish lockdown scenarios can be constructed using the proposed model formulation and show results of retrospective fitting to incidence rates and forward planning with relevant "what if/instead of" lockdown counterfactuals with uncertainty quantification. Our formulation is agnostic to a specific locale, in that lockdown strategies in other regions can be straightforwardly encoded using this model. The methods we describe are made publicly available online through an accessible and easy to use web interface.
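The role of an age group to age group contact matrix in a deterministic SEIR model can be illustrated with a minimal forward-Euler sketch. Everything specific here is hypothetical: the two age groups, the contact matrix, the rates `beta`, `sigma` and `gamma`, and the use of a uniformly scaled contact matrix as a crude stand-in for a lockdown. It is not the paper's parameterisation or fitted model.

```python
def seir_step(state, contacts, beta, sigma, gamma, dt):
    """One forward-Euler step of an age-structured SEIR model.

    state is (S, E, I, R), each a list with one entry per age group;
    contacts[a][b] is the assumed contact rate of group a with group b.
    """
    S, E, I, R = state
    groups = range(len(S))
    N = [S[a] + E[a] + I[a] + R[a] for a in groups]
    S2, E2, I2, R2 = [], [], [], []
    for a in groups:
        # Force of infection on group a, summed over infectious people in every group b
        force = beta * sum(contacts[a][b] * I[b] / N[b] for b in groups)
        infections = force * S[a] * dt
        onsets = sigma * E[a] * dt      # exposed becoming infectious
        removals = gamma * I[a] * dt    # infectious becoming removed
        S2.append(S[a] - infections)
        E2.append(E[a] + infections - onsets)
        I2.append(I[a] + onsets - removals)
        R2.append(R[a] + removals)
    return S2, E2, I2, R2

def simulate(state, contacts, beta, sigma, gamma, dt=0.1, days=100):
    """Iterate seir_step for the given number of days and return the final state."""
    for _ in range(int(round(days / dt))):
        state = seir_step(state, contacts, beta, sigma, gamma, dt)
    return state

# Hypothetical two-group population and contact matrix
initial = ([49_990.0, 29_995.0], [0.0, 0.0], [10.0, 5.0], [0.0, 0.0])
contacts = [[8.0, 3.0], [3.0, 4.0]]
rates = dict(beta=0.05, sigma=1 / 5, gamma=1 / 7)

baseline = simulate(initial, contacts, **rates)
# Counterfactual: halve every contact rate (illustrative "lockdown")
halved = [[0.5 * c for c in row] for row in contacts]
lockdown = simulate(initial, halved, **rates)
```

Comparing `baseline` and `lockdown` gives a simple counterfactual of the "what if" kind described above; scaling only selected rows or columns of the matrix would target specific age groups.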

preprint, 2020, arXiv

A Data-Driven Control-Theoretic Paradigm for Pandemic Mitigation with Application to Covid-19

In this paper, we introduce a new control-theoretic paradigm for mitigating the spread of a virus. To this end, our discrete-time controller aims to reduce the number of new daily deaths, and consequently, the cumulative number of deaths. In contrast to much of the existing literature, we do not rely on a potentially complex virus transmission model whose equations must be customized to the "particulars" of the pandemic at hand. For new viruses such as Covid-19, the epidemiology driving the modelling process may not be well known and model estimation with limited data may be unreliable. With this motivation in mind, the new paradigm described here is data-driven and, to a large extent, we avoid modelling difficulties by concentrating on just two key quantities which are common to pandemics: the doubling time, denoted by $d(k)$, and the peak day, denoted by $\theta(k)$. Our numerical studies to date suggest that our appealingly simple model can provide a reasonable fit to real data. Given that time is of the essence during the ongoing global health crisis, the intent of this paper is to introduce this new paradigm to control practitioners and describe a number of new research directions suggested by our current results.

preprint, 2020, arXiv

A generalised mean-field approximation for the Deffuant opinion dynamics model on networks

When the interactions of agents on a network are assumed to follow the Deffuant opinion dynamics model, the outcomes are known to depend on the structure of the underlying network. This behavior cannot be captured by existing mean-field approximations for the Deffuant model. In this paper, a generalised mean-field approximation is derived that accounts for the effects of network topology on Deffuant dynamics through the degree distribution or community structure of the network. The accuracy of the approximation is examined by comparison with large-scale Monte Carlo simulations on both synthetic and real-world networks.
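For readers unfamiliar with the Deffuant model, the pairwise update that such Monte Carlo simulations iterate can be sketched as follows. This is a plain simulation on an arbitrary adjacency list, not the paper's mean-field approximation; the confidence bound `eps`, the convergence rate `mu`, the ring network and the function name are illustrative assumptions.

```python
import random

def deffuant(neighbors, opinions, eps, mu, steps, seed=0):
    """Monte Carlo simulation of Deffuant dynamics on a network.

    neighbors[i] lists the nodes adjacent to node i. At each step a random
    node and one of its neighbours interact: if their opinions differ by
    less than eps, each moves a fraction mu of the gap towards the other.
    """
    rng = random.Random(seed)
    x = list(opinions)
    nodes = [i for i in range(len(x)) if neighbors[i]]  # skip isolated nodes
    for _ in range(steps):
        i = rng.choice(nodes)
        j = rng.choice(neighbors[i])
        if abs(x[i] - x[j]) < eps:
            shift = mu * (x[j] - x[i])
            x[i] += shift
            x[j] -= shift  # symmetric update keeps the mean opinion fixed
    return x

# Hypothetical example: 50 agents on a ring with uniform initial opinions
n = 50
ring = [[(i - 1) % n, (i + 1) % n] for i in range(n)]
init_rng = random.Random(1)
start = [init_rng.random() for _ in range(n)]
final = deffuant(ring, start, eps=0.3, mu=0.5, steps=20_000)
```

Swapping `ring` for an adjacency list with a different degree distribution or community structure is the kind of change whose effect the approximation in the paper is designed to capture.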

preprint, 2020, arXiv

A Generalization of the Classical Kelly Betting Formula to the Case of Temporal Correlation

For sequential betting games, Kelly's theory, aimed at maximization of the logarithmic growth of one's account value, involves optimization of the so-called betting fraction $K$. In this Letter, we extend the classical formulation to allow for temporal correlation among bets. To demonstrate the potential of this new paradigm, for simplicity of exposition, we mainly address the case of a coin-flipping game with even-money payoff. To this end, we solve a problem with memory depth $m$. By this, we mean that the outcomes of coin flips are no longer assumed to be i.i.d. random variables. Instead, the probability of heads on flip $k$ depends on previous flips $k-1, k-2, \ldots, k-m$. For the simplest case of $n$ flips, with $m = 1$, we obtain a closed form solution $K_n$ for the optimal betting fraction. This generalizes the classical result for the memoryless case. That is, instead of fraction $K^* = 2p-1$ which pervades the literature for a coin with probability of heads $p\geq 1/2$, our new fraction $K_n$ depends on both $n$ and the parameters associated with the temporal correlation. Generalizations of these results for $m > 1$ and numerical simulations are also included. Finally, we indicate how the theory extends to time-varying feedback and alternative payoff distributions.
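The classical memoryless benchmark $K^* = 2p-1$ can be recovered numerically by maximizing the expected logarithmic growth per flip, $g(K) = p\log(1+K) + (1-p)\log(1-K)$, for an even-money coin. The sketch below covers only this i.i.d. case; it does not reproduce the paper's correlated-bet fraction $K_n$, and the function names and grid search are illustrative choices.

```python
import math

def expected_log_growth(K, p):
    """Expected log growth per even-money bet when staking fraction K."""
    return p * math.log(1.0 + K) + (1.0 - p) * math.log(1.0 - K)

def kelly_fraction_numeric(p, grid=10_000):
    """Grid search for the maximizing fraction over K in [0, 1)."""
    candidates = [i / grid for i in range(grid)]
    return max(candidates, key=lambda K: expected_log_growth(K, p))

# The numerical maximizer agrees with the closed form K* = 2p - 1
for p in (0.5, 0.55, 0.6, 0.75):
    assert abs(kelly_fraction_numeric(p) - (2.0 * p - 1.0)) < 1e-3
```

Setting the derivative $p/(1+K) - (1-p)/(1-K)$ to zero gives the same closed form analytically.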

preprint, 2020, arXiv

A Generalized Framework for Simultaneous Long-Short Feedback Trading

We present a generalization of the Simultaneous Long-Short (SLS) trading strategy described in recent control literature wherein we allow for different parameters across the short and long sides of the controller; we refer to this new strategy as Generalized SLS (GSLS). Furthermore, we investigate the conditions under which positive gain can be assured within the GSLS setup for both deterministic stock price evolution and geometric Brownian motion. In contrast to existing literature in this area (which places little emphasis on the practical application of SLS strategies), we suggest optimization procedures for selecting the control parameters based on historical data, and we extensively test these procedures across a large number of real stock price trajectories (495 in total). We find that the implementation of such optimization procedures greatly improves the performance compared with fixing control parameters, and, indeed, the GSLS strategy outperforms the simpler SLS strategy in general.

preprint, 2020, arXiv

A likelihood-based approach for cure regression models

We propose a new likelihood-based approach for estimation, inference and variable selection for parametric cure regression models in time-to-event analysis under random right-censoring. In this context, it often happens that some subjects are "cured", i.e., they will never experience the event of interest. Then, the sample of censored observations is an unlabeled mixture of cured and "susceptible" subjects. Using inverse probability censoring weighting (IPCW), we propose a likelihood-based estimation procedure for the cure regression model without making assumptions about the distribution of survival times for the susceptible subjects. The IPCW approach does require a preliminary estimate of the censoring distribution, for which general parametric, semi- or non-parametric approaches can be used. The incorporation of a penalty term in our estimation procedure is straightforward; in particular, we propose L1-type penalties for variable selection. Our theoretical results are derived under mild assumptions. Simulation experiments and real data analysis illustrate the effectiveness of the new approach.

preprint, 2020, arXiv

Agreement Threshold on Axelrod's model of Cultural Dissemination

Shared opinions are an important feature in the formation of social groups. In this paper, we use the Axelrod model of cultural dissemination to represent opinion-based groups. In the Axelrod model, each agent has a set of features, each of which holds one of a set of nominally related traits. Survey data, for example, has a similar structure, where each participant answers each of a set of items with responses from a fixed list. We present an alternative method of displaying the Axelrod model by representing it as a bipartite graph, i.e., participants and their responses as separate nodes. This allows us to see which feature-trait combinations are selected in the final state. This visualisation is particularly useful when representing survey data as it illustrates the co-evolution of cultures and opinion-based groups in Axelrod's model of cultural diffusion. We also present a modification to the Axelrod model. A standard finding of the Axelrod model with many features is for all agents to fully agree in one cluster. We introduce an agreement threshold and allow nodes to interact only with those neighbours who are within this threshold (i.e., those with similar opinions) rather than those with any opinion. This method reliably yields a large number of clusters for small agreement thresholds and, importantly, does not collapse to a single cluster when the number of features grows large. This potentially provides a method for modelling opinion-based groups where, as opinions are added, the number of clusters increases.

preprint, 2019, arXiv

Multi-Parameter Regression Survival Modelling: An Alternative to Proportional Hazards

It is standard practice for covariates to enter a parametric model through a single distributional parameter of interest, for example, the scale parameter in many standard survival models. Indeed, the well-known proportional hazards model is of this kind. In this paper we discuss a more general approach whereby covariates enter the model through more than one distributional parameter simultaneously (e.g., scale and shape parameters). We refer to this practice as "multi-parameter regression" (MPR) modelling and explore its use in a survival analysis context. We find that multi-parameter regression leads to more flexible models which can offer greater insight into the underlying data generating process. To illustrate the concept, we consider the two-parameter Weibull model which leads to time-dependent hazard ratios, thus relaxing the typical proportional hazards assumption and motivating a new test of proportionality. A novel variable selection strategy is introduced for such multi-parameter regression models. It accounts for the correlation arising between the estimated regression coefficients in two or more linear predictors - a feature which has not been considered by other authors in similar settings. The methods discussed have been implemented in the mpr package in R.
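Why a covariate effect on the Weibull shape parameter produces a time-dependent hazard ratio can be seen in a short numerical sketch. It assumes the hazard form h(t) = λγt^(γ-1) with log links for both the scale λ and the shape γ; this parameterization, the coefficient values and the function names are illustrative assumptions, and the code is not the interface of the mpr package.

```python
import math

def weibull_hazard(t, scale, shape):
    """Weibull hazard h(t) = scale * shape * t**(shape - 1)."""
    return scale * shape * t ** (shape - 1.0)

def hazard_ratio(t, x1, x0, beta, alpha):
    """Hazard ratio at time t between covariate values x1 and x0.

    beta = (b0, b1) drives log(scale); alpha = (a0, a1) drives log(shape).
    """
    def params(x):
        return (math.exp(beta[0] + beta[1] * x),
                math.exp(alpha[0] + alpha[1] * x))
    return weibull_hazard(t, *params(x1)) / weibull_hazard(t, *params(x0))

beta = (-1.0, 0.7)  # hypothetical scale coefficients

# Covariate acts on the scale only: the ratio is exp(b1) at every time,
# which is the proportional hazards case.
ph = [hazard_ratio(t, 1, 0, beta, alpha=(0.2, 0.0)) for t in (0.5, 1.0, 5.0)]
assert all(math.isclose(r, math.exp(0.7)) for r in ph)

# Covariate also acts on the shape: the ratio now changes with time.
mpr = [hazard_ratio(t, 1, 0, beta, alpha=(0.2, 0.3)) for t in (0.5, 1.0, 5.0)]
assert mpr[0] < mpr[1] < mpr[2]
```

Under these assumptions the ratio equals (λ₁γ₁)/(λ₀γ₀) · t^(γ₁-γ₀), so it is constant in time exactly when the shape coefficient on the covariate is zero.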