Source author record

Benjamin Avanzi

Benjamin Avanzi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

q-fin.RM, Applications, math.PR, Machine Learning, math.OC, math.ST, Methodology, q-fin.ST, Statistics Theory

Catalog footprint

What is connected

9 works

9 topics

4 close collaborators

preprint · 2026 · arXiv

Reinforcement Learning for Micro-Level Claims Reserving

Outstanding claim liabilities are revised repeatedly as claims develop, yet most modern reserving models are trained as one-shot predictors and typically learn only from settled claims. We formulate individual claims reserving as a claim-level Markov decision process in which an agent sequentially updates outstanding claim liability (OCL) estimates over development, using continuous actions and a reward design that balances accuracy with stable reserve revisions. A key advantage of this reinforcement learning (RL) approach is that it can learn from all observed claim trajectories, including claims that remain open at valuation, thereby avoiding the reduced sample size and selection effects inherent in supervised methods trained on ultimate outcomes only. We also introduce practical components needed for actuarial use -- initialisation of new claims, temporally consistent tuning via a rolling-settlement scheme, and an importance-weighting mechanism to mitigate portfolio-level underestimation driven by the rarity of large claims. On CAS and SPLICE synthetic general insurance datasets, the proposed Soft Actor-Critic implementation delivers competitive claim-level accuracy and strong aggregate OCL performance, particularly for the immature claim segments that drive most of the liability.

preprint · 2025 · arXiv

On the use of case estimate and transactional payment data in neural networks for individual loss reserving

The use of neural networks trained on individual claims data has become increasingly popular in the actuarial reserving literature. We consider how best to input historical payment data into neural network models. Case estimates are also available as time series, and we extend our analysis to assess their predictive power. In this paper, we compare a feed-forward neural network trained on summarised transactions to a recurrent neural network equipped to analyse a claim's entire payment history and/or case estimate development history. We draw conclusions from training and comparing the performance of the models on multiple comparable, highly complex datasets simulated from SPLICE (Avanzi, Taylor and Wang, 2023). We find evidence that case estimates improve predictions significantly, but that equipping the neural network with memory leads only to meagre improvements. Although the case estimation process and its quality will vary significantly between insurers, we provide a standardised methodology for assessing the value of case estimates.

preprint · 2022 · arXiv

SPLICE: A Synthetic Paid Loss and Incurred Cost Experience Simulator

In this paper, we first introduce a simulator of case estimates of incurred losses, called `SPLICE` (Synthetic Paid Loss and Incurred Cost Experience). In three modules, case estimates are simulated in continuous time, and a record is output for each individual claim. Revisions for the case estimates are also simulated as a sequence over the lifetime of the claim, in a number of different situations. Furthermore, some dependencies in relation to case estimates of incurred losses are incorporated, particularly recognizing certain properties of case estimates that are found in practice. For example, the magnitude of revisions depends on ultimate claim size, as does the distribution of the revisions over time. Some of these revisions occur in response to occurrence of claim payments, and so `SPLICE` requires input of simulated per-claim payment histories. The claim data can be summarized by accident and payment "periods" whose duration is an arbitrary choice (e.g. month, quarter, etc.) available to the user. `SPLICE` is a fully documented R package that is publicly available and open source (on CRAN). It is built on an existing simulator of individual claim experience called `SynthETIC` (Avanzi et al., 2021a,b), which offers flexible modelling of occurrence, notification, as well as the timing and magnitude of individual partial payments. This is in contrast with the incurred losses, which constitute the additional contribution of `SPLICE`. The inclusion of incurred loss estimates provides a facility that almost no other simulators do.

preprint · 2021 · arXiv

Stochastic loss reserving with mixture density neural networks

Neural networks offer a versatile, flexible and accurate approach to loss reserving. However, such applications have focused primarily on the (important) problem of fitting accurate central estimates of the outstanding claims. In practice, properties regarding the variability of outstanding claims are equally important (e.g., quantiles for regulatory purposes). In this paper we fill this gap by applying a Mixture Density Network ("MDN") to loss reserving. The approach combines a neural network architecture with a mixture Gaussian distribution to achieve simultaneously an accurate central estimate along with flexible distributional choice. Model fitting is done using a rolling-origin approach. Our approach consistently outperforms the classical over-dispersed model both for central estimates and quantiles of interest, when applied to a wide range of simulated environments of various complexity and specifications. We further extend the MDN approach by proposing two extensions. Firstly, we present a hybrid GLM-MDN approach called "ResMDN". This hybrid approach balances the tractability and ease of understanding of a traditional GLM model on one hand, with the additional accuracy and distributional flexibility provided by the MDN on the other. We show that it can successfully improve the errors of the baseline ccODP, although there is generally a loss of performance when compared to the MDN in the examples we considered. Secondly, we allow for explicit projection constraints, so that actuarial judgement can be directly incorporated in the modelling process. Throughout, we focus on aggregate loss triangles, and show that our methodologies are tractable, and that they out-perform traditional approaches even with relatively limited amounts of data. We use both simulated data -- to validate properties, and real data -- to illustrate and ascertain practicality of the approaches.
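As a rough illustration of the distributional part of an MDN, the sketch below evaluates the negative log-likelihood of observations under a Gaussian mixture, which is the kind of objective such a network would typically minimise. The function names are hypothetical, and the mixture weights, means and standard deviations are passed in as fixed numbers; in an actual MDN they would be network outputs that vary with the inputs. This is a generic sketch, not the authors' implementation.

```python
import math


def mixture_log_density(y, weights, means, sds):
    """Log density of y under a Gaussian mixture (log-sum-exp for stability)."""
    if not math.isclose(sum(weights), 1.0):
        raise ValueError("mixture weights must sum to 1")
    # Log of each weighted component density; zero-weight components are skipped.
    log_terms = [
        math.log(w) - math.log(s) - 0.5 * math.log(2.0 * math.pi)
        - 0.5 * ((y - m) / s) ** 2
        for w, m, s in zip(weights, means, sds)
        if w > 0.0
    ]
    top = max(log_terms)
    return top + math.log(sum(math.exp(t - top) for t in log_terms))


def negative_log_likelihood(ys, weights, means, sds):
    """Average negative log-likelihood of the observations ys."""
    return -sum(mixture_log_density(y, weights, means, sds) for y in ys) / len(ys)


def mixture_mean(weights, means):
    """Central estimate implied by the mixture: the weighted mean of components."""
    return sum(w * m for w, m in zip(weights, means))
```

Quantiles of the fitted mixture, such as those needed for regulatory purposes, would require inverting the mixture distribution function numerically, which is omitted here.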

preprint · 2020 · arXiv

A counterexample to the central limit theorem for pairwise independent random variables having a common arbitrary margin

The Central Limit Theorem (CLT) is one of the most fundamental results in statistics. It states that the standardized sample mean of a sequence of $n$ mutually independent and identically distributed random variables with finite first and second moments converges in distribution to a standard Gaussian as $n$ goes to infinity. In particular, pairwise independence of the sequence is generally not sufficient for the theorem to hold. We construct explicitly a sequence of pairwise independent random variables having a common but arbitrary marginal distribution $F$ (satisfying very mild conditions) for which the CLT does not hold. We study the extent of this 'failure' of the CLT by obtaining, in closed form, the asymptotic distribution of the sample mean of our sequence. This is illustrated through several theoretical examples, for which we provide associated computing codes in the R language.
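The gap between pairwise and mutual independence can be checked by hand on a standard textbook example, which is not the construction used in the paper: take two independent fair signs and let a third variable be their product. The sketch below enumerates the four equally likely outcomes with exact fractions; all names in it are illustrative.

```python
from fractions import Fraction
from itertools import product

# Four equally likely outcomes of two fair signs; the third sign is their product.
OUTCOMES = [(x1, x2, x1 * x2) for x1, x2 in product((-1, 1), repeat=2)]
WEIGHT = Fraction(1, len(OUTCOMES))


def prob(event):
    """Exact probability of an event, given as a predicate on an outcome."""
    return sum(WEIGHT for outcome in OUTCOMES if event(outcome))


def pairwise_independent():
    """Check P(Xi=a, Xj=b) = P(Xi=a) P(Xj=b) for every pair and every value."""
    for i, j in ((0, 1), (0, 2), (1, 2)):
        for a, b in product((-1, 1), repeat=2):
            joint = prob(lambda o: o[i] == a and o[j] == b)
            if joint != prob(lambda o: o[i] == a) * prob(lambda o: o[j] == b):
                return False
    return True


def mutually_independent():
    """Check that the joint law of all three variables factorises."""
    for a, b, c in product((-1, 1), repeat=3):
        joint = prob(lambda o: o == (a, b, c))
        factorised = (
            prob(lambda o: o[0] == a)
            * prob(lambda o: o[1] == b)
            * prob(lambda o: o[2] == c)
        )
        if joint != factorised:
            return False
    return True
```

Here every pair of variables is independent, yet the triple is not, since the third variable is determined by the first two.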

preprint · 2020 · arXiv

A multivariate evolutionary generalised linear model framework with adaptive estimation for claims reserving

In this paper, we develop a multivariate evolutionary generalised linear model (GLM) framework for claims reserving, which allows for dynamic features of claims activity in conjunction with dependency across business lines to accurately assess claims reserves. We extend the traditional GLM reserving framework on two fronts: GLM fixed factors are allowed to evolve in a recursive manner, and dependence is incorporated in the specification of these factors using a common shock approach. We consider factors that evolve across accident years in conjunction with factors that evolve across calendar years. This two-dimensional evolution of factors is unconventional as a traditional evolutionary model typically considers the evolution in one single time dimension. This creates challenges for the estimation process, which we tackle in this paper. We develop the formulation of a particle filtering algorithm with parameter learning procedure. This is an adaptive estimation approach which updates evolving factors of the framework recursively over time. We implement and illustrate our model with a simulated data set, as well as a set of real data from a Canadian insurer.

preprint · 2020 · arXiv

Modelling and understanding count processes through a Markov-modulated non-homogeneous Poisson process framework

The Markov-modulated Poisson process is utilised for count modelling in a variety of areas such as queueing, reliability, network and insurance claims analysis. In this paper, we extend the Markov-modulated Poisson process framework through the introduction of a flexible frequency perturbation measure. This contribution enables known information of observed event arrivals to be naturally incorporated in a tractable manner, while the hidden Markov chain captures the effect of unobservable drivers of the data. In addition to increases in accuracy and interpretability, this method supplements analysis of the latent factors. Further, this procedure naturally incorporates data features such as over-dispersion and autocorrelation. Additional insights can be generated to assist analysis, including a procedure for iterative model improvement. Implementation difficulties are also addressed with a focus on dealing with large data sets, where latent models are especially advantageous due to the large number of observations facilitating identification of hidden factors. Namely, computational issues such as numerical underflow and high processing cost arise in this context and in this paper, we produce procedures to overcome these problems. This modelling framework is demonstrated using a large insurance data set to illustrate theoretical, practical and computational contributions and an empirical comparison to other count models highlights the advantages of the proposed approach.

preprint · 2020 · arXiv

Optimal periodic dividend strategies for spectrally positive Lévy risk processes with fixed transaction costs

We consider the general class of spectrally positive Lévy risk processes, which are appropriate for businesses with continuous expenses and lump sum gains whose timing and sizes are stochastic. Motivated by the fact that dividends cannot be paid at any time in real life, we study $\textit{periodic}$ dividend strategies whereby dividend decisions are made according to a separate arrival process. In this paper, we investigate the impact of fixed transaction costs on the optimal periodic dividend strategy, and show that a periodic $(b_u,b_l)$ strategy is optimal when decision times arrive according to an independent Poisson process. Such a strategy leads to lump sum dividends that bring the surplus back to $b_l$ as long as it is no less than $b_u$ at a dividend decision time. The expected present value of dividends (net of transaction costs) is provided explicitly with the help of scale functions. Results are illustrated.
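The decision rule of a periodic $(b_u,b_l)$ strategy, as described in the abstract, can be sketched as a single function applied at each dividend decision time. The function name, the ordering assumption $0 \le b_l < b_u$, and the choice to deduct the fixed cost from the gross payment are assumptions made for illustration; the abstract itself only states that the value is computed net of transaction costs.

```python
def periodic_dividend(surplus, b_u, b_l, fixed_cost):
    """Apply a periodic (b_u, b_l) rule at one dividend decision time.

    Returns (dividend net of the fixed cost, surplus after the decision).
    Assumes 0 <= b_l < b_u; this ordering is an illustrative assumption.
    """
    if not 0.0 <= b_l < b_u:
        raise ValueError("expected 0 <= b_l < b_u")
    if surplus >= b_u:
        # Surplus is no less than b_u: pay a lump sum that resets it to b_l.
        gross = surplus - b_l
        return gross - fixed_cost, b_l
    # Otherwise no dividend is paid and the surplus is left unchanged.
    return 0.0, surplus
```

For instance, with $b_u = 8$, $b_l = 3$ and a fixed cost of 1, a surplus of 10 at a decision time yields a net dividend of 6 and resets the surplus to 3, while a surplus of 5 triggers no payment.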

preprint · 2016 · arXiv

On optimal joint reflective and refractive dividend strategies in spectrally positive Lévy models

The expected present value of dividends is one of the classical stability criteria in actuarial risk theory. In this context, numerous papers considered threshold (refractive) and barrier (reflective) dividend strategies. These were shown to be optimal in a number of different contexts for bounded and unbounded payout rates, respectively. In this paper, motivated by the behaviour of some dividend paying stock exchange companies, we determine the optimal dividend strategy when both continuous (refractive) and lump sum (reflective) dividends can be paid at any time, and if they are subject to different transaction rates. We consider the general family of spectrally positive Lévy processes. Using scale functions, we obtain explicit formulas for the expected present value of dividends until ruin, with a penalty at ruin. We develop a verification lemma, and show that a two-layer $(a,b)$ strategy is optimal. Such a strategy pays continuous dividends when the surplus exceeds level $a>0$, and all of the excess over $b>a$ as lump sum dividend payments. Results are illustrated.
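The two-layer rule described in the abstract can be sketched as a state-dependent decision: any excess over the upper level is paid out as a lump sum, and continuous dividends are switched on while the surplus sits above the lower level. The function name and return shape are hypothetical, and the continuous payout rate, which the abstract does not specify, is left out.

```python
def two_layer_decision(surplus, a, b):
    """Apply a two-layer (a, b) rule to the current surplus.

    Returns (lump sum paid, surplus after the lump sum, continuous dividends on?).
    Requires 0 < a < b, as in the abstract.
    """
    if not 0.0 < a < b:
        raise ValueError("expected 0 < a < b")
    # Reflective layer: everything above b is paid immediately as a lump sum.
    lump_sum = max(surplus - b, 0.0)
    surplus_after = surplus - lump_sum
    # Refractive layer: continuous dividends are paid while the surplus exceeds a.
    pays_continuously = surplus_after > a
    return lump_sum, surplus_after, pays_continuously
```

With $a = 2$ and $b = 10$, a surplus of 12 gives a lump sum of 2 and leaves the surplus at 10 with continuous dividends on, whereas a surplus of 1 triggers no dividends of either kind.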
