Source author record

Dirk Tasche

Dirk Tasche appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

q-fin.RM, Machine Learning, Applications, cond-mat, math.ST, Statistics Theory, cond-mat.stat-mech, math.PR, physics.soc-ph, q-fin.CP, q-fin.ST

Catalog footprint

19 works

11 topics

4 close collaborators

Preprint, 2022, arXiv

Class Prior Estimation under Covariate Shift: No Problem?

We show that in the context of classification the property of source and target distributions to be related by covariate shift may be lost if the information content captured in the covariates is reduced, for instance by dropping components or mapping into a lower-dimensional or finite space. As a consequence, under covariate shift simple approaches to class prior estimation in the style of classify and count with or without adjustment are infeasible. We prove that transformations of the covariates that preserve the covariate shift property are necessarily sufficient in the statistical sense for the full set of covariates. A probing algorithm as alternative approach to class prior estimation under covariate shift is proposed.

Preprint, 2021, arXiv

Calibrating sufficiently

When probabilistic classifiers are trained and calibrated, the so-called grouping loss component of the calibration loss can easily be overlooked. Grouping loss refers to the gap between observable information and information actually exploited in the calibration exercise. We investigate the relation between grouping loss and the concept of sufficiency, identifying comonotonicity as a useful criterion for sufficiency. We revisit the probing reduction approach of Langford & Zadrozny (2005) and find that it produces an estimator of probabilistic classifiers that reduces grouping loss. Finally, we discuss Brier curves as tools to support training and 'sufficient' calibration of probabilistic classifiers.

Preprint, 2016, arXiv

Does quantification without adjustments work?

Classification is the task of predicting the class labels of objects based on the observation of their features. In contrast, quantification has been defined as the task of determining the prevalences of the different sorts of class labels in a target dataset. The simplest approach to quantification is Classify & Count where a classifier is optimised for classification on a training set and applied to the target dataset for the prediction of class labels. In the case of binary quantification, the number of predicted positive labels is then used as an estimate of the prevalence of the positive class in the target dataset. Since the performance of Classify & Count for quantification is known to be inferior, its results typically are subject to adjustments. However, some researchers recently have suggested that Classify & Count might actually work without adjustments if it is based on a classifier that was specifically trained for quantification. We discuss the theoretical foundation for this claim and explore its potential and limitations with a numerical example based on the binormal model with equal variances. In order to identify an optimal quantifier in the binormal setting, we introduce the concept of local Bayes optimality. As a side remark, we present a complete proof of a theorem by Ye et al. (2012).
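
To illustrate why Classify & Count is usually adjusted, here is a minimal sketch in a binormal setting with equal variances. The class means, the threshold and the target prevalence are illustrative assumptions, not values from the paper. The correction shown is the standard "adjusted count" formula, not the locally Bayes-optimal quantifier the abstract introduces. All quantities are population-level expectations, so no sampling is involved.

```python
import math


def normal_cdf(x, mu=0.0, sigma=1.0):
    """Cumulative distribution function of N(mu, sigma^2)."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def classifier_rates(threshold, mu_neg, mu_pos, sigma):
    """True and false positive rates of the rule 'predict positive if x > threshold'."""
    tpr = 1.0 - normal_cdf(threshold, mu_pos, sigma)
    fpr = 1.0 - normal_cdf(threshold, mu_neg, sigma)
    return tpr, fpr


def classify_and_count(prevalence, tpr, fpr):
    """Expected share of predicted positives in a dataset with the given prevalence."""
    return prevalence * tpr + (1.0 - prevalence) * fpr


def adjusted_count(cc_estimate, tpr, fpr):
    """Invert the Classify & Count relation to recover the prevalence."""
    return (cc_estimate - fpr) / (tpr - fpr)


# Hypothetical setting: balanced training classes, so the error-minimising
# threshold is the midpoint of the two class means.
MU_NEG, MU_POS, SIGMA = 0.0, 2.0, 1.0
THRESHOLD = 1.0
TARGET_PREVALENCE = 0.2  # differs from the balanced training prevalence

tpr, fpr = classifier_rates(THRESHOLD, MU_NEG, MU_POS, SIGMA)
cc = classify_and_count(TARGET_PREVALENCE, tpr, fpr)
ac = adjusted_count(cc, tpr, fpr)
print(f"Classify & Count: {cc:.4f}, adjusted count: {ac:.4f}")
```

In this setting the unadjusted estimate is pulled towards the training prevalence, while the adjusted estimate recovers the target prevalence because the class-conditional score distributions are assumed unchanged.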

Preprint, 2015, arXiv

Fitting a distribution to Value-at-Risk and Expected Shortfall, with an application to covered bonds

Covered bonds are a specific example of senior secured debt. If the issuer of the bonds defaults, the proceeds of the assets in the cover pool are used for their debt service. If in this situation the cover pool proceeds do not suffice for the debt service, the creditors of the bonds have recourse to the issuer's assets and their claims are pari passu with the claims of the creditors of senior unsecured debt. Historically, covered bonds have been very safe investments. During their more than two hundred years of existence, investors never suffered losses due to missed payments from covered bonds. From a risk management perspective, therefore, modelling covered bond losses is mainly of interest for estimating the impact that the asset encumbrance by the cover pool has on the loss characteristics of the issuer's senior unsecured debt. We explore one-period structural modelling approaches for covered bond and senior unsecured debt losses with one and two asset value variables respectively. Obviously, two-asset models with separate values of the cover pool and the issuer's remaining portfolio allow for more realistic modelling. However, we demonstrate that exact calibration of such models may be impossible. We also investigate a one-asset model in which the riskiness of the cover pool is reflected by a risk-based adjustment of the encumbrance ratio of the issuer's assets.

Preprint, 2015, arXiv

The two defaults scenario for stressing credit portfolio loss distributions

The impact of a stress scenario of default events on the loss distribution of a credit portfolio can be assessed by determining the loss distribution conditional on these events. While it is conceptually easy to estimate loss distributions conditional on default events by means of Monte Carlo simulation, it becomes impractical for two or more simultaneous defaults as then the conditioning event is extremely rare. We provide an analytical approach to the calculation of the conditional loss distribution for the CreditRisk+ portfolio model with independent random loss given default distributions. The analytical solution for this case can be used to check the accuracy of an approximation to the conditional loss distribution whereby the unconditional model is run with stressed input probabilities of default (PDs). It turns out that this approximation is unbiased. Numerical examples, however, suggest that the approximation may be seriously inaccurate but that the inaccuracy leads to overestimation of tail losses and hence the approach errs on the conservative side.

Preprint, 2015, arXiv

What is the best risk measure in practice? A comparison of standard measures

Expected Shortfall (ES) has been widely accepted as a risk measure that is conceptually superior to Value-at-Risk (VaR). At the same time, however, it has been criticised for issues relating to backtesting. In particular, ES has been found not to be elicitable, which means that backtesting for ES is less straightforward than, e.g., backtesting for VaR. Expectiles have been suggested as potentially better alternatives to both ES and VaR. In this paper, we revisit commonly accepted desirable properties of risk measures like coherence, comonotonic additivity, robustness and elicitability. We check VaR, ES and Expectiles with regard to whether or not they enjoy these properties, with particular emphasis on Expectiles. We also consider their impact on capital allocation, an important issue in risk management. We find that, despite the caveats that apply to the estimation and backtesting of ES, it can be considered a good risk measure. As a consequence, there is no sufficient evidence to justify an all-inclusive replacement of ES by Expectiles in applications. For backtesting ES, we propose an empirical approach that consists in replacing ES by a set of four quantiles, which should make it possible to use backtesting methods for VaR. Keywords: Backtesting; capital allocation; coherence; diversification; elicitability; expected shortfall; expectile; forecasts; probability integral transform (PIT); risk measure; risk management; robustness; value-at-risk
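
The idea of replacing ES by four quantiles can be sketched as follows. ES at level alpha is the average of the quantile function over (alpha, 1), so averaging a handful of quantiles in that range gives a rough approximation. The abstract does not state which four levels are used; the equally spaced levels below are an assumption made for illustration, and the standard normal loss distribution is likewise just a convenient test case.

```python
from statistics import NormalDist


def es_normal_exact(alpha):
    """Closed-form ES of a standard normal loss: pdf(q_alpha) / (1 - alpha)."""
    std = NormalDist()
    return std.pdf(std.inv_cdf(alpha)) / (1.0 - alpha)


def es_four_quantiles(alpha, quantile):
    """Approximate ES by the mean of four quantiles at assumed, equally spaced levels."""
    # Levels alpha, alpha + (1-alpha)/4, alpha + 2(1-alpha)/4, alpha + 3(1-alpha)/4.
    levels = [alpha + k * (1.0 - alpha) / 4.0 for k in range(4)]
    return sum(quantile(u) for u in levels) / 4.0


ALPHA = 0.975
exact = es_normal_exact(ALPHA)
approx = es_four_quantiles(ALPHA, NormalDist().inv_cdf)
print(f"exact ES: {exact:.4f}, four-quantile approximation: {approx:.4f}")
```

Because each level sits at the left end of its sub-interval and the quantile function is increasing, this particular choice underestimates ES. The practical appeal is that each of the four quantiles can be checked with a standard VaR backtest.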

Preprint, 2014, arXiv

Exact fit of simple finite mixture models

How to forecast next year's portfolio-wide credit default rate based on last year's default observations and the current score distribution? A classical approach to this problem consists of fitting a mixture of the conditional score distributions observed last year to the current score distribution. This is a special (simple) case of a finite mixture model where the mixture components are fixed and only the weights of the components are estimated. The optimum weights provide a forecast of next year's portfolio-wide default rate. We point out that the maximum-likelihood (ML) approach to fitting the mixture distribution not only gives an optimum but even an exact fit if we allow the mixture components to vary but keep their density ratio fixed. From this observation we can conclude that the standard default rate forecast based on last year's conditional default rates will always be located between last year's portfolio-wide default rate and the ML forecast for next year. As an application example, cost quantification is then discussed. We also discuss how the mixture model based estimation methods can be used to forecast total loss. This involves the reinterpretation of an individual classification problem as a collective quantification problem.
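
The simple mixture fit described above, with fixed components and a single weight to estimate, can be sketched as a one-dimensional maximum-likelihood problem. The normal score densities, the sample size and the true default rate below are hypothetical stand-ins for last year's conditional score distributions. Bisection on the likelihood equation is just one convenient solver, chosen here because the log-likelihood is concave in the weight.

```python
import math
import random


def normal_pdf(x, mu, sigma=1.0):
    """Density of N(mu, sigma^2)."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def ml_mixture_weight(scores, pdf_default, pdf_survive, tol=1e-10):
    """Weight p in [0, 1] maximising sum(log(p * f1(x) + (1 - p) * f0(x)))."""
    f1 = [pdf_default(x) for x in scores]
    f0 = [pdf_survive(x) for x in scores]

    def likelihood_equation(p):
        # Derivative of the log-likelihood with respect to p (decreasing in p).
        return sum((a - b) / (p * a + (1.0 - p) * b) for a, b in zip(f1, f0))

    if likelihood_equation(0.0) <= 0.0:
        return 0.0  # maximum sits on the lower boundary
    if likelihood_equation(1.0) >= 0.0:
        return 1.0  # maximum sits on the upper boundary
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if likelihood_equation(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Hypothetical current portfolio: defaulters score lower on average.
TRUE_RATE, N = 0.10, 4000
rng = random.Random(42)
scores = [
    rng.gauss(-2.0, 1.0) if rng.random() < TRUE_RATE else rng.gauss(0.0, 1.0)
    for _ in range(N)
]
estimate = ml_mixture_weight(
    scores,
    pdf_default=lambda x: normal_pdf(x, -2.0),
    pdf_survive=lambda x: normal_pdf(x, 0.0),
)
print(f"ML forecast of the default rate: {estimate:.4f}")
```

Only the current scores enter the fit; no default labels for the current portfolio are needed, which is what makes this a quantification problem and not a classification problem.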

Preprint, 2014, arXiv

The Law of Total Odds

The law of total probability may be deployed in binary classification exercises to estimate the unconditional class probabilities if the class proportions in the training set are not representative of the population class proportions. We argue that this is not a conceptually sound approach and suggest an alternative based on the new law of total odds. We quantify the bias of the total probability estimator of the unconditional class probabilities and show that the total odds estimator is unbiased. The sample version of the total odds estimator is shown to coincide with a maximum-likelihood estimator known from the literature. The law of total odds can also be used for transforming the conditional class probabilities if independent estimates of the unconditional class probabilities of the population are available. Keywords: Total probability, likelihood ratio, Bayes' formula, binary classification, relative odds, unbiased estimator, supervised learning, dataset shift.

Preprint, 2013, arXiv

Bayesian estimation of probabilities of default for low default portfolios

The estimation of probabilities of default (PDs) for low default portfolios by means of upper confidence bounds is a well established procedure in many financial institutions. However, there are often discussions within the institutions or between institutions and supervisors about which confidence level to use for the estimation. The Bayesian estimator for the PD based on the uninformed, uniform prior distribution is an obvious alternative that avoids the choice of a confidence level. In this paper, we demonstrate that in the case of independent default events the upper confidence bounds can be represented as quantiles of a Bayesian posterior distribution based on a prior that is slightly more conservative than the uninformed prior. We then describe how to implement the uninformed and conservative Bayesian estimators in the dependent one- and multi-period default data cases and compare their estimates to the upper confidence bound estimates. The comparison leads us to suggest a constrained version of the uninformed (neutral) Bayesian estimator as an alternative to the upper confidence bound estimators.
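
The relation between upper confidence bounds and Bayesian posteriors is easiest to see in the special case of zero observed defaults among n independent obligors, where everything has a closed form. The sketch below covers only that case. The function names are hypothetical, and describing the "more conservative" prior as an improper prior with density proportional to 1/(1 - p) is an assumption that reproduces the confidence bound here, not necessarily the paper's own parameterisation.

```python
def upper_confidence_bound(n, gamma):
    """Largest PD p with (1 - p)^n >= 1 - gamma, i.e. zero defaults still plausible."""
    return 1.0 - (1.0 - gamma) ** (1.0 / n)


def uniform_prior_mean(n):
    """Posterior mean under a uniform prior: the posterior is Beta(1, n + 1)."""
    return 1.0 / (n + 2.0)


def uniform_prior_quantile(n, gamma):
    """gamma-quantile of Beta(1, n + 1), whose CDF is 1 - (1 - p)^(n + 1)."""
    return 1.0 - (1.0 - gamma) ** (1.0 / (n + 1.0))


def conservative_prior_quantile(n, gamma):
    """gamma-quantile of Beta(1, n), the posterior under the assumed prior 1/(1 - p)."""
    return 1.0 - (1.0 - gamma) ** (1.0 / n)


N_OBLIGORS, GAMMA = 100, 0.9
ucb = upper_confidence_bound(N_OBLIGORS, GAMMA)
print(f"upper confidence bound:       {ucb:.5f}")
print(f"conservative prior quantile:  {conservative_prior_quantile(N_OBLIGORS, GAMMA):.5f}")
print(f"uniform prior quantile:       {uniform_prior_quantile(N_OBLIGORS, GAMMA):.5f}")
print(f"uniform prior posterior mean: {uniform_prior_mean(N_OBLIGORS):.5f}")
```

The posterior mean needs no confidence level at all, which is the practical argument for the Bayesian estimator made in the abstract.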

Preprint, 2013, arXiv

The art of probability-of-default curve calibration

PD curve calibration refers to the transformation of a set of rating grade level probabilities of default (PDs) to another average PD level that is determined by a change of the underlying portfolio-wide PD. This paper presents a framework that makes it possible to explore a variety of calibration approaches and the conditions under which they are fit for purpose. We test the approaches discussed by applying them to publicly available datasets of agency rating and default statistics that can be considered typical for the scope of application of the approaches. We show that the popular 'scaled PDs' approach is theoretically questionable and identify an alternative calibration approach ('scaled likelihood ratio') that is both theoretically sound and performs better on the test datasets. Keywords: Probability of default, calibration, likelihood ratio, Bayes' formula, rating profile, binary classification.

Preprint, 2012, arXiv

Bounds for rating override rates

Overrides of credit ratings are important correctives of ratings that are determined by statistical rating models. Financial institutions and banking regulators agree on this because on the one hand errors with ratings of corporates or banks can have fatal consequences for the lending institutions and on the other hand errors by statistical methods can be minimised but not completely avoided. Nonetheless, rating overrides can be misused in order to conceal the real riskiness of borrowers or even entire portfolios. That is why rating overrides usually are strictly governed and carefully recorded. It is not clear, however, which frequency of overrides is appropriate for a given rating model within a predefined time period. This paper argues that there is a natural error rate associated with a statistical rating model that may be used to inform assessment of whether or not an observed override rate is adequate. The natural error rate is closely related to the rating model's discriminatory power and can readily be calculated.

Preprint, 2012, arXiv

Capital allocation for credit portfolios under normal and stressed market conditions

If the probability of default parameters (PDs) fed as input into a credit portfolio model are estimated as through-the-cycle (TTC) PDs, stressed market conditions have little impact on the results of the capital calculations conducted with the model. At first glance, this is totally different if the PDs are estimated as point-in-time (PIT) PDs. However, it can be argued that the reflection of stressed market conditions in input PDs should correspond to the use of reduced correlation parameters or even the removal of correlations in the model. Additionally, the confidence levels applied for the capital calculations might be made reflective of the changing market conditions. We investigate the interplay of PIT PDs, correlations, and confidence levels in a credit portfolio model in more detail and analyse possible designs of capital-levelling policies. Our findings may be of interest to banks that want to combine their approaches to capital measurement and allocation with active portfolio management that, by its nature, needs to be reflective of current market conditions.

Preprint, 2010, arXiv

Estimating discriminatory power and PD curves when the number of defaults is small

The intention of this paper is to provide all the estimation concepts and techniques that are needed to implement a two-phase approach to the parametric estimation of probability of default (PD) curves. In the first phase of this approach, a raw PD curve is estimated based on parameters that reflect discriminatory power. In the second phase of the approach, the raw PD curve is calibrated to fit a target unconditional PD. The concepts and techniques presented include a discussion of different definitions of area under the curve (AUC) and accuracy ratio (AR), a simulation study on the performance of confidence interval estimators for AUC, a discussion of the one-parametric approach to the estimation of PD curves by van der Burgt (2008) and alternative approaches, as well as a simulation study on the performance of the presented PD curve estimators. The topics are treated in depth in order to provide the full rationale behind them and to produce results that can be implemented immediately.

Preprint, 2006, arXiv

Validation of internal rating systems and PD estimates

This paper elaborates on the validation requirements for rating systems and probabilities of default (PDs) which were introduced with the New Capital Standards (Basel II). We start in Section 2 with some introductory remarks on the topics and approaches that will be discussed later on. We then review the developments in banking regulation that have reinforced public interest in validation techniques. When doing so, we put the main emphasis on the issues with quantitative validation. The techniques discussed here could be used in order to meet the quantitative regulatory requirements. However, their appropriateness will depend on the specific conditions under which they are applied. In order to have a common ground for the description of the different techniques, we introduce in Section 3 a theoretical framework that will be the basis for the further considerations. Intuitively, a good rating system should show higher probabilities of default for the less creditworthy rating grades. Therefore, in Section 4, we discuss how this monotonicity property is reflected in the theoretical framework from Section 3. In Section 5, we study the meaning of discriminatory power and some tools for measuring it in some detail. We will see that there are tools that might be more appropriate than others for the purpose of regulatory validation of discriminatory power. The topic in Section 6 is calibration of rating systems. We introduce some of the tests that can be used for checking correct calibration and discuss the properties of the different tests. We then conclude in Section 7 with some comments on the question of which tools might be most appropriate for quantitative validation of rating systems and probabilities of default.

Preprint, 2003, arXiv

A traffic lights approach to PD validation

As a consequence of the dependence experienced in loan portfolios, the standard binomial test, which is based on the assumption of independence, does not appear appropriate for validating probabilities of default (PDs). The model underlying the new rules for minimum capital requirements (Basle II) is taken as a point of departure for deriving two parametric test procedures that incorporate dependence effects. The first one makes use of the so-called granularity adjustment approach while the second one is based on moment matching.

Preprint, 2002, arXiv

Credit Risk Contributions to Value-at-Risk and Expected Shortfall

This paper presents analytical solutions to the problem of how to calculate sensible VaR (Value-at-Risk) and ES (Expected Shortfall) contributions in the CreditRisk+ methodology. Via the ES contributions, ES itself can be exactly computed in finitely many steps. The methods are illustrated by numerical examples.

Preprint, 2002, arXiv

Expected Shortfall and Beyond

Financial institutions have to allocate so-called "economic capital" in order to guarantee solvency to their clients and counterparties. Mathematically speaking, any methodology of allocating capital is a "risk measure", i.e. a function mapping random variables to the real numbers. Nowadays "value-at-risk", which is defined as a fixed level quantile of the random variable under consideration, is the most popular risk measure. Unfortunately, it fails to reward diversification, as it is not "subadditive". In the search for a suitable alternative to value-at-risk, "Expected Shortfall" (or "conditional value-at-risk" or "tail value-at-risk") has been characterized as the smallest "coherent" and "law invariant" risk measure to dominate value-at-risk. We discuss these and some other properties of Expected Shortfall as well as its generalization to a class of coherent risk measures which can incorporate higher moment effects. Moreover, we suggest a general method on how to attribute Expected Shortfall "risk contributions" to portfolio components. Keywords: Expected Shortfall; Value-at-Risk; Spectral Risk Measure; coherence; risk contribution.
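
The failure of value-at-risk to reward diversification can be made concrete with a small discrete example. The two-loan portfolio below (loss of 100 with probability 4 %, independent defaults, 95 % level) is a textbook-style illustration chosen for this sketch, not an example taken from the paper. Value-at-risk is computed as the lower quantile and Expected Shortfall as the average of the quantile function above the level.

```python
from itertools import product


def value_at_risk(dist, alpha):
    """Smallest loss x with P(L <= x) >= alpha; dist maps loss -> probability."""
    cumulative = 0.0
    for loss in sorted(dist):
        cumulative += dist[loss]
        if cumulative >= alpha:
            return loss
    return max(dist)


def expected_shortfall(dist, alpha):
    """Average of the quantile function over the interval (alpha, 1)."""
    total, lower = 0.0, 0.0
    for loss in sorted(dist):
        upper = min(lower + dist[loss], 1.0)
        # Probability mass of this outcome that lies above the level alpha.
        total += loss * max(0.0, upper - max(lower, alpha))
        lower = upper
    return total / (1.0 - alpha)


def sum_of_independent(dist_a, dist_b):
    """Distribution of the sum of two independent discrete losses."""
    out = {}
    for (loss_a, p_a), (loss_b, p_b) in product(dist_a.items(), dist_b.items()):
        out[loss_a + loss_b] = out.get(loss_a + loss_b, 0.0) + p_a * p_b
    return out


LOAN = {0.0: 0.96, 100.0: 0.04}  # hypothetical single loan
ALPHA = 0.95
PORTFOLIO = sum_of_independent(LOAN, LOAN)

for name, measure in [("VaR", value_at_risk), ("ES", expected_shortfall)]:
    single, combined = measure(LOAN, ALPHA), measure(PORTFOLIO, ALPHA)
    print(f"{name}: single loan {single:.1f}, sum of two {2 * single:.1f}, portfolio {combined:.1f}")
```

Each loan on its own has a value-at-risk of zero at this level, yet the diversified portfolio does not, so value-at-risk penalises the combination. Expected Shortfall of the portfolio stays below the sum of the stand-alone figures.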

Preprint, 2002, arXiv

Remarks on the monotonicity of default probabilities

The consultative papers for the Basel II Accord require rating systems to provide a ranking of obligors in the sense that the rating categories indicate the creditworthiness in terms of default probabilities. As a consequence, the default probabilities ought to present a monotonic function of the ordered rating categories. This requirement appears quite intuitive. In this paper, however, we show that the intuition can be founded on mathematical facts. We prove that, in the closely related context of a continuous score function, monotonicity of the conditional default probabilities is equivalent to optimality of the corresponding decision rules in the test-theoretic sense. As a consequence, the optimality can be checked by inspection of the ordinal dominance graph (also called Receiver Operating Characteristic curve) of the score function: it obtains if and only if the curve is concave. We conclude the paper by exploring the connection between the area under the ordinal dominance graph and the so-called Information Value which is used by some vendors of scoring systems. Keywords: Conditional default probability, score function, most powerful test, Information Value, Accuracy Ratio.
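
The link between monotonic default probabilities and a concave Receiver Operating Characteristic curve can be checked numerically. The paper works with a continuous score function; the sketch below uses a simplified discrete analogue with a handful of rating grades, and the grade shares and PDs are made-up numbers. Each grade contributes one linear segment to the curve, with a slope proportional to PD / (1 - PD), so the segments flatten from left to right exactly when the PDs fall from the riskiest to the safest grade.

```python
def roc_points(grade_shares, grade_pds):
    """ROC vertices for rating grades ordered from riskiest to safest."""
    total_pd = sum(share * pd for share, pd in zip(grade_shares, grade_pds))
    points = [(0.0, 0.0)]
    x = y = 0.0
    for share, pd in zip(grade_shares, grade_pds):
        x += share * (1.0 - pd) / (1.0 - total_pd)  # cumulative share of survivors
        y += share * pd / total_pd                  # cumulative share of defaulters
        points.append((x, y))
    return points


def is_concave(points, tol=1e-12):
    """True if the slopes of consecutive segments never increase."""
    slopes = [
        (y1 - y0) / (x1 - x0)
        for (x0, y0), (x1, y1) in zip(points, points[1:])
    ]
    return all(later <= earlier + tol for earlier, later in zip(slopes, slopes[1:]))


SHARES = [0.1, 0.3, 0.6]           # hypothetical grade frequencies
PDS_MONOTONIC = [0.20, 0.05, 0.01]  # PD falls as the grade improves
PDS_INVERTED = [0.05, 0.20, 0.01]   # first two grades are out of order

print("monotonic PDs, concave ROC:", is_concave(roc_points(SHARES, PDS_MONOTONIC)))
print("inverted PDs, concave ROC: ", is_concave(roc_points(SHARES, PDS_INVERTED)))
```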

Preprint, 2001, arXiv

Expected Shortfall: a natural coherent alternative to Value at Risk

We discuss the coherence properties of Expected Shortfall (ES) as a financial risk measure. This statistic arises in a natural way from the estimation of the "average of the 100p % worst losses" in a sample of returns to a portfolio. Here p is some fixed confidence level. We also compare several alternative representations of ES which turn out to be more appropriate for certain purposes.
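
The sample statistic mentioned above, the average of the 100p % worst losses, can be sketched in a few lines. Here p is treated as the tail fraction, losses are taken as positive numbers (negative returns), and rounding the number of tail observations down to an integer is an assumed convention for this illustration.

```python
import math


def sample_expected_shortfall(losses, p):
    """Average of the 100p % worst (largest) losses in the sample."""
    # Number of tail observations, rounded down but never below one.
    k = max(1, math.floor(len(losses) * p))
    worst = sorted(losses, reverse=True)[:k]
    return sum(worst) / k


# Hypothetical sample: losses of 1, 2, ..., 100.
losses = [float(x) for x in range(1, 101)]
es_5pct = sample_expected_shortfall(losses, 0.05)
print(f"average of the 5 % worst losses: {es_5pct}")
```

With 100 observations and p = 0.05 the estimator averages the five largest losses, 96 to 100. A quantile-based measure at the same level would report only the threshold loss and ignore how large the losses beyond it are.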
