Source author record

Tim Wirtz

Tim Wirtz appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math-ph math.MP math.ST Statistics Theory Machine Learning cond-mat.stat-mech cs.CY hep-lat hep-th

Catalog footprint

What is connected

12works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Tailored Uncertainty Estimation for Deep Learning Systems

Uncertainty estimation bears the potential to make deep learning (DL) systems more reliable. Standard techniques for uncertainty estimation, however, come along with specific combinations of strengths and weaknesses, e.g., with respect to estimation quality, generalization abilities and computational complexity. To actually harness the potential of uncertainty quantification, estimators are required whose properties closely match the requirements of a given use case. In this work, we propose a framework that, firstly, structures and shapes these requirements, secondly, guides the selection of a suitable uncertainty estimation method and, thirdly, provides strategies to validate this choice and to uncover structural weaknesses. By contributing tailored uncertainty estimation in this sense, our framework helps to foster trustworthy DL systems. Moreover, it anticipates prospective machine learning regulations that require, e.g., in the EU, evidences for the technical appropriateness of machine learning systems. Our framework provides such evidences for system components modeling uncertainty.

preprint2021arXiv

A Novel Regression Loss for Non-Parametric Uncertainty Optimization

Quantification of uncertainty is one of the most promising approaches to establish safe machine learning. Despite its importance, it is far from being generally solved, especially for neural networks. One of the most commonly used approaches so far is Monte Carlo dropout, which is computationally cheap and easy to apply in practice. However, it can underestimate the uncertainty. We propose a new objective, referred to as second-moment loss (SML), to address this issue. While the full network is encouraged to model the mean, the dropout networks are explicitly used to optimize the model variance. We intensively study the performance of the new objective on various UCI regression datasets. Comparing to the state-of-the-art of deep ensembles, SML leads to comparable prediction accuracies and uncertainty estimates while only requiring a single model. Under distribution shift, we observe moderate improvements. As a side result, we introduce an intuitive Wasserstein distance-based uncertainty measure that is non-saturating and thus allows to resolve quality differences between any two uncertainty estimates.

preprint2021arXiv

Approaching Neural Network Uncertainty Realism

Statistical models are inherently uncertain. Quantifying or at least upper-bounding their uncertainties is vital for safety-critical systems such as autonomous vehicles. While standard neural networks do not report this information, several approaches exist to integrate uncertainty estimates into them. Assessing the quality of these uncertainty estimates is not straightforward, as no direct ground truth labels are available. Instead, implicit statistical assessments are required. For regression, we propose to evaluate uncertainty realism -- a strict quality criterion -- with a Mahalanobis distance-based statistical test. An empirical evaluation reveals the need for uncertainty measures that are appropriate to upper-bound heavy-tailed empirical errors. Alongside, we transfer the variational U-Net classification architecture to standard supervised image-to-image tasks. We adopt it to the automotive domain and show that it significantly improves uncertainty realism compared to a plain encoder-decoder model.

preprint2021arXiv

Inspect, Understand, Overcome: A Survey of Practical Methods for AI Safety

The use of deep neural networks (DNNs) in safety-critical applications like mobile health and autonomous driving is challenging due to numerous model-inherent shortcomings. These shortcomings are diverse and range from a lack of generalization over insufficient interpretability to problems with malicious inputs. Cyber-physical systems employing DNNs are therefore likely to suffer from safety concerns. In recent years, a zoo of state-of-the-art techniques aiming to address these safety concerns has emerged. This work provides a structured and broad overview of them. We first identify categories of insufficiencies to then describe research activities aiming at their detection, quantification, or mitigation. Our paper addresses both machine learning experts and safety engineers: The former ones might profit from the broad range of machine learning topics covered and discussions on limitations of recent methods. The latter ones might gain insights into the specifics of modern ML methods. We moreover hope that our contribution fuels discussions on desiderata for ML systems and strategies on how to propel existing approaches accordingly.

preprint2020arXiv

Characteristics of Monte Carlo Dropout in Wide Neural Networks

Monte Carlo (MC) dropout is one of the state-of-the-art approaches for uncertainty estimation in neural networks (NNs). It has been interpreted as approximately performing Bayesian inference. Based on previous work on the approximation of Gaussian processes by wide and deep neural networks with random weights, we study the limiting distribution of wide untrained NNs under dropout more rigorously and prove that they as well converge to Gaussian processes for fixed sets of weights and biases. We sketch an argument that this property might also hold for infinitely wide feed-forward networks that are trained with (full-batch) gradient descent. The theory is contrasted by an empirical analysis in which we find correlations and non-Gaussian behaviour for the pre-activations of finite width NNs. We therefore investigate how (strongly) correlated pre-activations can induce non-Gaussian behavior in NNs with strongly correlated weights.

preprint2015arXiv

The Correlated Jacobi and the Correlated Cauchy-Lorentz ensembles

We calculate the $k$-point generating function of the correlated Jacobi ensemble using supersymmetric methods. We use the result for complex matrices for $k=1$ to derive a closed-form expression for eigenvalue density. For real matrices we obtain the density in terms of a twofold integral that we evaluate numerically. For both expressions we find agreement when comparing with Monte Carlo simulations. Relations between these quantities for the Jacobi and the Cauchy-Lorentz ensemble are derived.

preprint2015arXiv

The Smallest Eigenvalue Distribution in the Real Wishart-Laguerre Ensemble with Even Topology

We consider rectangular random matrices of size $p\times n$ belonging to the real Wishart-Laguerre ensemble also known as the chiral Gaussian orthogonal ensemble. This ensemble appears in many applications like QCD, mesoscopic physics, and time series analysis. We are particularly interested in the distribution of the smallest non-zero eigenvalue and the gap probability to find no eigenvalue in an interval $[0,t]$. While for odd topology $ν=n-p$ explicit closed results are known for finite and infinite matrix size, for even $ν>2$ only recursive expressions in $p$ are available.The smallest eigenvalue distribution as well as the gap probability for general even $ν$ is equivalent to expectation values of characteristic polynomials raised to a half-integer. The computation of such averages is done via a combination of skew-orthogonal polynomials and bosonisation methods. The results are given in terms of Pfaffian determinants both at finite $p$ and in the hard edge scaling limit ($p\to\infty$ and $ν$ fixed) for an arbitrary even topology $ν$. Numerical simulations for the correlated Wishart ensemble illustrate the universality of our results in this particular limit. These simulations point to a validity of the hard edge scaling limit beyond the invariant case.

preprint2014arXiv

Distribution of the Smallest Eigenvalue in Complex and Real Correlated Wishart Ensembles

For the correlated Gaussian Wishart ensemble we compute the distribution of the smallest eigenvalue and a related gap probability.We obtain exact results for the complex (β=2) and for the real case (β=1). For a particular set of empirical correlation matrices we find universality in the spectral density, for both real and complex ensembles and all kinds of rectangularity. We calculate the asymptotic and universal results for the gap probability and the distribution of the smallest eigenvalue. We use the Supersymmetry method, in particular the generalized Hubbard-Stratonovich transformation and superbosonization.

preprint2014arXiv

Eigenvalue Density of the Doubly Correlated Wishart Model: Exact Results

Data sets collected at different times and different observing points can possess correlations at different times $and$ at different positions. The doubly correlated Wishart model takes both into account. We calculate the eigenvalue density of the Wishart correlation matrices using supersymmetry. In the complex case we obtain a new closed form expression which we compare to previous results in the literature. In the more relevant and much more complicated real case we derive an expression for the density in terms of a fourfold integral. Finally, we calculate the density in the limit of large correlation matrices.

preprint2014arXiv

Limiting Statistics of the Largest and Smallest Eigenvalues in the Correlated Wishart Model

The correlated Wishart model provides a standard tool for the analysis of correlations in a rich variety of systems. Although much is known for complex correlation matrices, the empirically much more important real case still poses substantial challenges. We put forward a new approach, which maps arbitrary statistical quantities, depending on invariants only, to invariant Hermitian matrix models. For completeness we also include the quaternion case and deal with all three cases in a unified way. As an important application, we study the statistics of the largest eigenvalue and its limiting distributions in the correlated Wishart model, because they help to estimate the behavior of large complex systems. We show that even for fully correlated Wishart ensembles, the Tracy-Widom distribution can be the limiting distribution of the largest as well as the smallest eigenvalue, provided that a certain scaling of the empirical eigenvalues holds.

preprint2013arXiv

Distribution of the Smallest Eigenvalue in the Correlated Wishart Model

Wishart random matrix theory is of major importance for the analysis of correlated time series. The distribution of the smallest eigenvalue for Wishart correlation matrices is particularly interesting in many applications. In the complex and in the real case, we calculate it exactly for arbitrary empirical eigenvalues, i.e., for fully correlated Gaussian Wishart ensembles. To this end, we derive certain dualities of matrix models in ordinary space. We thereby completely avoid the otherwise unsurmountable problem of computing a highly non-trivial group integral. Our results are compact and much easier to handle than previous ones. Furthermore, we obtain a new universality for the distribution of the smallest eigenvalue on the proper local scale.

preprint2011arXiv

Worldsheet operator product expansions and p-point functions in AdS3/CFT2

We construct the operator product expansions (OPE) of the chiral primary operators in the worldsheet theory for strings on AdS_3 x S^3 x T^4. As an interesting application, we will use the worldsheet OPEs to derive a recursion relation for a particular class of extremal p-point correlators on the sphere. We compare our result with the corresponding recursion relation previously found in the symmetric orbifold theory on the boundary of AdS_3.

Tim Wirtz

What is connected

Connect this record

See the researcher in context

Building this map preview

12 published item(s)

Tailored Uncertainty Estimation for Deep Learning Systems

A Novel Regression Loss for Non-Parametric Uncertainty Optimization

Approaching Neural Network Uncertainty Realism

Inspect, Understand, Overcome: A Survey of Practical Methods for AI Safety

Characteristics of Monte Carlo Dropout in Wide Neural Networks

The Correlated Jacobi and the Correlated Cauchy-Lorentz ensembles

The Smallest Eigenvalue Distribution in the Real Wishart-Laguerre Ensemble with Even Topology

Distribution of the Smallest Eigenvalue in Complex and Real Correlated Wishart Ensembles

Eigenvalue Density of the Doubly Correlated Wishart Model: Exact Results

Limiting Statistics of the Largest and Smallest Eigenvalues in the Correlated Wishart Model

Distribution of the Smallest Eigenvalue in the Correlated Wishart Model

Worldsheet operator product expansions and p-point functions in AdS3/CFT2