Source author record

Katrien Antonio

Katrien Antonio appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Applications Machine Learning Cryptography and Security Social and Information Networks

Catalog footprint

What is connected

3works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2020arXiv

Boosting insights in insurance tariff plans with tree-based machine learning methods

Pricing actuaries typically operate within the framework of generalized linear models (GLMs). With the upswing of data analytics, our study puts focus on machine learning methods to develop full tariff plans built from both the frequency and severity of claims. We adapt the loss functions used in the algorithms such that the specific characteristics of insurance data are carefully incorporated: highly unbalanced count data with excess zeros and varying exposure on the frequency side combined with scarce, but potentially long-tailed data on the severity side. A key requirement is the need for transparent and interpretable pricing models which are easily explainable to all stakeholders. We therefore focus on machine learning with decision trees: starting from simple regression trees, we work towards more advanced ensembles such as random forests and boosted trees. We show how to choose the optimal tuning parameters for these models in an elaborate cross-validation scheme, we present visualization tools to obtain insights from the resulting models and the economic value of these new modeling approaches is evaluated. Boosted trees outperform the classical GLMs, allowing the insurer to form profitable portfolios and to guard against potential adverse risk selection.

preprint2020arXiv

Pricing service maintenance contracts using predictive analytics

As more manufacturers shift their focus from selling products to end solutions, full-service maintenance contracts gain traction in the business world. These contracts cover all maintenance related costs during a predetermined horizon in exchange for a fixed service fee and relieve customers from uncertain maintenance costs. To guarantee profitability, the service fees should at least cover the expected costs during the contract horizon. As these expected costs may depend on several machine-dependent characteristics, e.g. operational environment, the service fees should also be differentiated based on these characteristics. If not, customers that are less prone to high maintenance costs will not buy into or renege on the contract. The latter can lead to adverse selection and leave the service provider with a maintenance-heavy portfolio, which may be detrimental to the profitability of the service contracts. We contribute to the literature with a data-driven tariff plan based on the calibration of predictive models that take into account the different machine profiles. This conveys to the service provider which machine profiles should be attracted at which price. We demonstrate the advantage of a differentiated tariff plan and show how it better protects against adverse selection.

preprint2020arXiv

Social network analytics for supervised fraud detection in insurance

Insurance fraud occurs when policyholders file claims that are exaggerated or based on intentional damages. This contribution develops a fraud detection strategy by extracting insightful information from the social network of a claim. First, we construct a network by linking claims with all their involved parties, including the policyholders, brokers, experts, and garages. Next, we establish fraud as a social phenomenon in the network and use the BiRank algorithm with a fraud specific query vector to compute a fraud score for each claim. From the network, we extract features related to the fraud scores as well as the claims' neighborhood structure. Finally, we combine these network features with the claim-specific features and build a supervised model with fraud in motor insurance as the target variable. Although we build a model for only motor insurance, the network includes claims from all available lines of business. Our results show that models with features derived from the network perform well when detecting fraud and even outperform the models using only the classical claim-specific features. Combining network and claim-specific features further improves the performance of supervised learning models to detect fraud. The resulting model flags highly suspicions claims that need to be further investigated. Our approach provides a guided and intelligent selection of claims and contributes to a more effective fraud investigation process.

Katrien Antonio

What is connected

Connect this record

See the researcher in context

Building this map preview

3 published item(s)

Boosting insights in insurance tariff plans with tree-based machine learning methods

Pricing service maintenance contracts using predictive analytics

Social network analytics for supervised fraud detection in insurance