Researcher profile

Dirk Tasche

Dirk Tasche contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
14works
0followers
11topics
3close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

14 published item(s)

preprint2022arXiv

Class Prior Estimation under Covariate Shift: No Problem?

We show that in the context of classification the property of source and target distributions to be related by covariate shift may be lost if the information content captured in the covariates is reduced, for instance by dropping components or mapping into a lower-dimensional or finite space. As a consequence, under covariate shift simple approaches to class prior estimation in the style of classify and count with or without adjustment are infeasible. We prove that transformations of the covariates that preserve the covariate shift property are necessarily sufficient in the statistical sense for the full set of covariates. A probing algorithm as alternative approach to class prior estimation under covariate shift is proposed.

preprint2021arXiv

Calibrating sufficiently

When probabilistic classifiers are trained and calibrated, the so-called grouping loss component of the calibration loss can easily be overlooked. Grouping loss refers to the gap between observable information and information actually exploited in the calibration exercise. We investigate the relation between grouping loss and the concept of sufficiency, identifying comonotonicity as a useful criterion for sufficiency. We revisit the probing reduction approach of Langford & Zadrozny (2005) and find that it produces an estimator of probabilistic classifiers that reduces grouping loss. Finally, we discuss Brier curves as tools to support training and 'sufficient' calibration of probabilistic classifiers.

preprint2014arXiv

The Law of Total Odds

The law of total probability may be deployed in binary classification exercises to estimate the unconditional class probabilities if the class proportions in the training set are not representative of the population class proportions. We argue that this is not a conceptually sound approach and suggest an alternative based on the new law of total odds. We quantify the bias of the total probability estimator of the unconditional class probabilities and show that the total odds estimator is unbiased. The sample version of the total odds estimator is shown to coincide with a maximum-likelihood estimator known from the literature. The law of total odds can also be used for transforming the conditional class probabilities if independent estimates of the unconditional class probabilities of the population are available. Keywords: Total probability, likelihood ratio, Bayes' formula, binary classification, relative odds, unbiased estimator, supervised learning, dataset shift.

preprint2013arXiv

Bayesian estimation of probabilities of default for low default portfolios

The estimation of probabilities of default (PDs) for low default portfolios by means of upper confidence bounds is a well established procedure in many financial institutions. However, there are often discussions within the institutions or between institutions and supervisors about which confidence level to use for the estimation. The Bayesian estimator for the PD based on the uninformed, uniform prior distribution is an obvious alternative that avoids the choice of a confidence level. In this paper, we demonstrate that in the case of independent default events the upper confidence bounds can be represented as quantiles of a Bayesian posterior distribution based on a prior that is slightly more conservative than the uninformed prior. We then describe how to implement the uninformed and conservative Bayesian estimators in the dependent one- and multi-period default data cases and compare their estimates to the upper confidence bound estimates. The comparison leads us to suggest a constrained version of the uninformed (neutral) Bayesian estimator as an alternative to the upper confidence bound estimators.

preprint2013arXiv

The art of probability-of-default curve calibration

PD curve calibration refers to the transformation of a set of rating grade level probabilities of default (PDs) to another average PD level that is determined by a change of the underlying portfolio-wide PD. This paper presents a framework that allows to explore a variety of calibration approaches and the conditions under which they are fit for purpose. We test the approaches discussed by applying them to publicly available datasets of agency rating and default statistics that can be considered typical for the scope of application of the approaches. We show that the popular 'scaled PDs' approach is theoretically questionable and identify an alternative calibration approach ('scaled likelihood ratio') that is both theoretically sound and performs better on the test datasets. Keywords: Probability of default, calibration, likelihood ratio, Bayes' formula, rating profile, binary classification.

preprint2012arXiv

Bounds for rating override rates

Overrides of credit ratings are important correctives of ratings that are determined by statistical rating models. Financial institutions and banking regulators agree on this because on the one hand errors with ratings of corporates or banks can have fatal consequences for the lending institutions and on the other hand errors by statistical methods can be minimised but not completely avoided. Nonetheless, rating overrides can be misused in order to conceal the real riskiness of borrowers or even entire portfolios. That is why rating overrides usually are strictly governed and carefully recorded. It is not clear, however, which frequency of overrides is appropriate for a given rating model within a predefined time period. This paper argues that there is a natural error rate associated with a statistical rating model that may be used to inform assessment of whether or not an observed override rate is adequate. The natural error rate is closely related to the rating model's discriminatory power and can readily be calculated.

preprint2012arXiv

Capital allocation for credit portfolios under normal and stressed market conditions

If the probability of default parameters (PDs) fed as input into a credit portfolio model are estimated as through-the-cycle (TTC) PDs stressed market conditions have little impact on the results of the capital calculations conducted with the model. At first glance, this is totally different if the PDs are estimated as point-in-time (PIT) PDs. However, it can be argued that the reflection of stressed market conditions in input PDs should correspond to the use of reduced correlation parameters or even the removal of correlations in the model. Additionally, the confidence levels applied for the capital calculations might be made reflective of the changing market conditions. We investigate the interplay of PIT PDs, correlations, and confidence levels in a credit portfolio model in more detail and analyse possible designs of capital-levelling policies. Our findings may of interest to banks that want to combine their approaches to capital measurement and allocation with active portfolio management that, by its nature, needs to be reflective of current market conditions.

preprint2010arXiv

Estimating discriminatory power and PD curves when the number of defaults is small

The intention with this paper is to provide all the estimation concepts and techniques that are needed to implement a two-phases approach to the parametric estimation of probability of default (PD) curves. In the first phase of this approach, a raw PD curve is estimated based on parameters that reflect discriminatory power. In the second phase of the approach, the raw PD curve is calibrated to fit a target unconditional PD. The concepts and techniques presented include a discussion of different definitions of area under the curve (AUC) and accuracy ratio (AR), a simulation study on the performance of confidence interval estimators for AUC, a discussion of the one-parametric approach to the estimation of PD curves by van der Burgt (2008) and alternative approaches, as well as a simulation study on the performance of the presented PD curve estimators. The topics are treated in depth in order to provide the full rationale behind them and to produce results that can be implemented immediately.

preprint2006arXiv

Validation of internal rating systems and PD estimates

This paper elaborates on the validation requirements for rating systems and probabilities of default (PDs) which were introduced with the New Capital Standards (Basel II). We start in Section 2 with some introductory remarks on the topics and approaches that will be discussed later on. Then we have a view on the developments in banking regulation that have enforced the interest of the public in validation techniques. When doing so, we put the main emphasis on the issues with quantitative validation. The techniques discussed here could be used in order to meet the quantitative regulatory requirements. However, their appropriateness will depend on the specific conditions under which they are applied. In order to have a common ground for the description of the different techniques, we introduce in Section 3 a theoretical framework that will be the basis for the further considerations. Intuitively, a good rating system should show higher probabilities of default for the less creditworthy rating grades. Therefore, in Section 4, we discuss how this monotonicity property is reflected in the theoretical framework from Section 3. In Section 5, we study the meaning of discriminatory power and some tools for measuring it in some detail. We will see that there are tools that might be more appropriate than others for the purpose of regulatory validation of discriminatory power. The topic in Section 6 is calibration of rating systems. We introduce some of the tests that can be used for checking correct calibration and discuss the properties of the different tests. We then conclude in Section 7 with some comments on the question which tools might be most appropriate for quantitative validation of rating systems and probabilities of default.

preprint2003arXiv

A traffic lights approach to PD validation

As a consequence of the dependence experienced in loan portfolios, the standard binomial test which is based on the assumption of independence does not appear appropriate for validating probabilities of default (PDs). The model underlying the new rules for minimum capital requirements (Basle II) is taken as a point of departure for deriving two parametric test procedures that incorporate dependence effects. The first one makes use of the so-called granularity adjustment approach while the the second one is based on moment matching.

preprint2002arXiv

Expected Shortfall and Beyond

Financial institutions have to allocate so-called "economic capital" in order to guarantee solvency to their clients and counter parties. Mathematically speaking, any methodology of allocating capital is a "risk measure", i.e. a function mapping random variables to the real numbers. Nowadays "value-at-risk", which is defined as a fixed level quantile of the random variable under consideration, is the most popular risk measure. Unfortunately, it fails to reward diversification, as it is not "subadditive". In the search for a suitable alternative to value-at-risk, "Expected Shortfall" (or "conditional value-at-risk" or "tail value-at-risk") has been characterized as the smallest "coherent" and "law invariant" risk measure to dominate value-at-risk. We discuss these and some other properties of Expected Shortfall as well as its generalization to a class of coherent risk measures which can incorporate higher moment effects. Moreover, we suggest a general method on how to attribute Expected Shortfall "risk contributions" to portfolio components. Key words: Expected Shortfall; Value-at-Risk; Spectral Risk Measure; coherence; risk contribution.

preprint2002arXiv

Remarks on the monotonicity of default probabilities

The consultative papers for the Basel II Accord require rating systems to provide a ranking of obligors in the sense that the rating categories indicate the creditworthiness in terms of default probabilities. As a consequence, the default probabilities ought to present a monotonous function of the ordered rating categories. This requirement appears quite intuitive. In this paper, however, we show that the intuition can be founded on mathematical facts. We prove that, in the closely related context of a continuous score function, monotonicity of the conditional default probabilities is equivalent to optimality of the corresponding decision rules in the test-theoretic sense. As a consequence, the optimality can be checked by inspection of the ordinal dominance graph (also called Receiver Operating Characteristic curve) of the score function: it obtains if and only if the curve is concave. We conclude the paper by exploring the connection between the area under the ordinal dominance graph and the so-called Information Value which is used by some vendors of scoring systems. Keywords: Conditional default probability, score function, most powerful test, Information Value, Accuracy Ratio.

preprint2001arXiv

Expected Shortfall: a natural coherent alternative to Value at Risk

We discuss the coherence properties of Expected Shortfall (ES) as a financial risk measure. This statistic arises in a natural way from the estimation of the "average of the 100p % worst losses" in a sample of returns to a portfolio. Here p is some fixed confidence level. We also compare several alternative representations of ES which turn out to be more appropriate for certain purposes.