Source author record

Ansgar Steland

Ansgar Steland appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Applications, math.ST, Statistics Theory, math.PR, Methodology, Machine Learning, Artificial Intelligence, Computer Vision, cs.CY, eess.SP

Catalog footprint

What is connected

10 works

10 topics

4 close collaborators


Preprint, 2022, arXiv

Sequential Gaussian approximation for nonstationary time series in high dimensions

Gaussian couplings of partial sum processes are derived for the high-dimensional regime $d=o(n^{1/3})$. The coupling is derived for sums of independent random vectors and subsequently extended to nonstationary time series. Our inequalities depend explicitly on the dimension and on a measure of nonstationarity, and are thus also applicable to arrays of random vectors. To enable high-dimensional statistical inference, a feasible Gaussian approximation scheme is proposed. Applications to sequential testing and change-point detection are described.

Preprint, 2021, arXiv

Cross-Validation and Uncertainty Determination for Randomized Neural Networks with Applications to Mobile Sensors

Randomized artificial neural networks such as extreme learning machines provide an attractive and efficient method for supervised learning under limited computing resources and green machine learning. This especially applies when equipping mobile devices (sensors) with weak artificial intelligence. Results on supervised learning with such networks and regression methods are discussed in terms of consistency and bounds for the generalization and prediction error. In particular, some recent results are reviewed that address learning with data sampled by moving sensors, which leads to non-stationary and dependent samples. As randomized networks lead to random out-of-sample performance measures, we study a cross-validation approach to handle the randomness and make use of it to improve out-of-sample performance. Additionally, a computationally efficient approach to determine the resulting uncertainty in terms of a confidence interval for the mean out-of-sample prediction error is discussed based on two-stage estimation. The approach is applied to a prediction problem arising in vehicle integrated photovoltaics.

Preprint, 2020, arXiv

Detecting Changes in the Second Moment Structure of High-Dimensional Sensor-Type Data in a $K$-Sample Setting

The $K$-sample problem for high-dimensional vector time series is studied, focusing especially on sensor data streams, in order to analyze the second moment structure and detect changes across samples and/or across variables using cumulated sum (CUSUM) statistics of bilinear forms of the sample covariance matrix. In this model $K$ independent vector time series $\mathbf{Y}_{T,1},\dots,\mathbf{Y}_{T,K}$ are observed over a time span $[0,T]$, which may correspond to $K$ sensors (locations) yielding $d$-dimensional data as well as $K$ locations where $d$ sensors emit univariate data. Unequal sample sizes are considered, as they arise when the sampling rate of the sensors differs. We provide large sample approximations and two related change-point statistics, a sum of squares and a pooled variance statistic. The resulting procedures are investigated by simulations and illustrated by analyzing a real data set.

Preprint, 2020, arXiv

Is there a role for statistics in artificial intelligence?

The research on and application of artificial intelligence (AI) has triggered a comprehensive scientific, economic, social and political discussion. Here we argue that statistics, as an interdisciplinary scientific field, plays a substantial role both for the theoretical and practical understanding of AI and for its future development. Statistics might even be considered a core element of AI. With its specialist knowledge of data evaluation, starting with the precise formulation of the research question and passing through a study design stage on to analysis and interpretation of the results, statistics is a natural partner for other disciplines in teaching, research and practice. This paper aims at contributing to the current discussion by highlighting the relevance of statistical methodology in the context of AI development. In particular, we discuss contributions of statistics to the field of artificial intelligence concerning methodological development, planning and design of studies, assessment of data quality and data collection, differentiation of causality and associations and assessment of uncertainty in results. Moreover, the paper also deals with the equally necessary and meaningful extension of curricula in schools and universities.

Preprint, 2020, arXiv

Testing and Estimating Change-Points in the Covariance Matrix of a High-Dimensional Time Series

This paper studies methods for testing and estimating change-points in the covariance structure of a high-dimensional linear time series. The assumed framework allows for a large class of multivariate linear processes (including vector autoregressive moving average (VARMA) models) of growing dimension and spiked covariance models. The approach uses bilinear forms of the centered or non-centered sample variance-covariance matrix. Change-point testing and estimation are based on maximally selected weighted cumulated sum (CUSUM) statistics. Large sample approximations under a change-point regime are provided, including a multivariate CUSUM transform of increasing dimension. For the unknown asymptotic variance and covariance parameters associated with (pairs of) CUSUM statistics we propose consistent estimators. Based on weak laws of large numbers for their sequential versions, we also consider stopped sample estimation, where observations until the estimated change-point are used. Finite sample properties of the procedures are investigated by simulations and their application is illustrated by analyzing a real data set from environmetrics.
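To make the idea of a CUSUM statistic built from a bilinear form of the sample covariance matrix more concrete, here is a minimal sketch. It is not the paper's procedure: it assumes already-centered data, uses an unweighted CUSUM, and omits the variance estimation the abstract mentions. The function name `cusum_bilinear` and the vectors `v` and `w` are hypothetical names chosen for illustration.

```python
import numpy as np

def cusum_bilinear(Y, v, w):
    """Unweighted CUSUM of the bilinear form v' S_k w (illustrative sketch).

    Y is an (n, d) array of centered observations; v and w are
    weighting vectors of length d. Returns the CUSUM path and the
    1-based index at which its absolute value is largest.
    """
    Y = np.asarray(Y, dtype=float)
    n = Y.shape[0]
    # (v'Y_t)(w'Y_t) is the time-t contribution to v' S w.
    prods = (Y @ v) * (Y @ w)
    partial = np.cumsum(prods)
    k = np.arange(1, n + 1)
    # Compare each partial sum with its share of the full-sample sum.
    cusum = (partial - k / n * partial[-1]) / np.sqrt(n)
    k_hat = int(np.argmax(np.abs(cusum))) + 1
    return cusum, k_hat

# Toy data: the variance of the first coordinate jumps after t = 50.
Y = np.zeros((100, 2))
Y[:50, 0] = 1.0
Y[50:, 0] = 2.0
path, k_hat = cusum_bilinear(Y, np.array([1.0, 0.0]), np.array([1.0, 0.0]))
print(k_hat)
```

In this sketch the location of the maximal absolute CUSUM value serves as a naive change-point estimate; a weighted version would rescale the path before taking the maximum.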

Preprint, 2018, arXiv

Automatic Processing and Solar Cell Detection in Photovoltaic Electroluminescence Images

Electroluminescence (EL) imaging is a powerful and established technique for assessing the quality of photovoltaic (PV) modules, which consist of many electrically connected solar cells arranged in a grid. The analysis of imperfect real-world images requires reliable methods for preprocessing, detection and extraction of the cells. We propose several methods for those tasks, which, however, can be modified to related imaging problems where similar geometric objects need to be detected accurately. Allowing for images taken under difficult outdoor conditions, we present methods to correct for rotation and perspective distortions. The next important step is the extraction of the solar cells of a PV module, for instance to pass them to a procedure to detect and analyze defects on their surface. We propose a method based on specialized Hough transforms, which allows the cells to be extracted even when the module is surrounded by disturbing background, and a fast method based on cumulated sums (CUSUM) change detection to extract the cell area of single-cell mini-modules, where the correction of perspective distortion is implicitly done. The methods are highly automated to allow for big data analyses. Their application to a large database of EL images substantiates that the methods work reliably on a large scale for real-world images. Simulations show that the approach achieves high accuracy, reliability and robustness. This even holds for low contrast images, as evaluated by comparing the simulated accuracy for a low and a high contrast image.

Preprint, 2018, arXiv

Shrinkage for Covariance Estimation: Asymptotics, Confidence Intervals, Bounds and Applications in Sensor Monitoring and Finance

When shrinking a covariance matrix towards (a multiple of) the identity matrix, the trace of the covariance matrix arises naturally as the optimal scaling factor for the identity target. The trace also appears in other contexts, for example when measuring the size of a matrix or the amount of uncertainty. Of particular interest is the case when the dimension of the covariance matrix is large. Then the problem arises that the sample covariance matrix is singular if the dimension is larger than the sample size. Another issue is that the estimation usually has to be based on correlated time series data. We study the estimation of the trace functional allowing for a high-dimensional time series model, where the dimension is allowed to grow with the sample size without any constraint. Based on a recent result, we investigate a confidence interval for the trace, which also allows us to propose lower and upper bounds for the shrinkage covariance estimator as well as bounds for the variance of projections. In addition, we provide a novel result dealing with shrinkage towards a diagonal target. We investigate the accuracy of the confidence interval by a simulation study, which indicates good performance, and analyze three stock market data sets to illustrate the proposed bounds, where the dimension (number of stocks) ranges between $32$ and $475$. In particular, we apply the results to portfolio optimization and determine bounds for the risk associated with the variance-minimizing portfolio.
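As an illustration of shrinkage towards a multiple of the identity, the following minimal sketch scales the identity target by the average eigenvalue tr(S)/d, a common convention for this kind of estimator. It is only a sketch: the shrinkage weight is a user-supplied, hypothetical parameter here, and the confidence interval and bounds described in the abstract are not reproduced.

```python
import numpy as np

def shrink_to_identity(X, weight):
    """Shrink the sample covariance of X towards (tr(S)/d) * I.

    X is an (n, d) data matrix and weight lies in [0, 1].
    The weight is an assumed input, not an estimated optimal value.
    """
    X = np.asarray(X, dtype=float)
    n, d = X.shape
    S = np.cov(X, rowvar=False, bias=True).reshape(d, d)
    mu = np.trace(S) / d  # scaling factor for the identity target
    return (1.0 - weight) * S + weight * mu * np.eye(d)

# Dimension larger than sample size: the sample covariance is singular,
# while the shrunken estimate is positive definite.
rng = np.random.default_rng(0)
X = rng.standard_normal((20, 50))
S_shrunk = shrink_to_identity(X, weight=0.5)
print(np.linalg.eigvalsh(S_shrunk).min() > 0)
```

With this choice of target the trace of the estimate equals the trace of the sample covariance for any weight, which is one reason the trace functional is a natural object to study in this setting.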

Preprint, 2017, arXiv

Asymptotics for high-dimensional covariance matrices and quadratic forms with applications to the trace functional and shrinkage

We establish large sample approximations for an arbitrary number of bilinear forms of the sample variance-covariance matrix of a high-dimensional vector time series using $\ell_1$-bounded and small $\ell_2$-bounded weighting vectors. Estimation of the asymptotic covariance structure is also discussed. The results hold true without any constraint on the dimension, the number of forms and the sample size or their ratios. Concrete and potential applications are widespread and cover high-dimensional data science problems such as tests for large numbers of covariances, sparse portfolio optimization and projections onto sparse principal components or more general spanning sets as frequently considered, e.g., in classification and dictionary learning. As two specific applications of our results, we study in greater detail the asymptotics of the trace functional and shrinkage estimation of covariance matrices. In shrinkage estimation, it turns out that the asymptotics differs for weighting vectors bounded away from orthogonality and nearly orthogonal ones in the sense that their inner product converges to 0.

Preprint, 2010, arXiv

A Binary Control Chart to Detect Small Jumps

The classic Np chart gives a signal if the number of successes in a sequence of independent binary variables exceeds a control limit. Motivated by engineering applications in industrial image processing and, to some extent, financial statistics, we study a simple modification of this chart, which uses only the most recent observations. Our aim is to construct a control chart for detecting a shift of an unknown size, allowing for an unknown distribution of the error terms. Simulation studies indicate that the proposed chart is superior in terms of out-of-control average run length when one is interested in the detection of very small shifts. We provide a (functional) central limit theorem under a change-point model with local alternatives which explains that unexpected and interesting behavior. Since real observations are often not independent, the question arises whether these results still hold true for the dependent case. Indeed, our asymptotic results work under the fairly general condition that the observations form a martingale difference array. This enlarges the applicability of our results considerably, firstly, to a large class of time series models, and, secondly, to locally dependent image data, as we demonstrate by an example.
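One plausible reading of a chart that "uses only the most recent observations" is a moving-window count of successes compared against a control limit. The sketch below illustrates that reading only; the window length, the control limit and the function name `windowed_binary_chart` are assumptions for illustration, and the chart actually studied in the paper may be defined differently.

```python
from collections import deque

def windowed_binary_chart(observations, window, limit):
    """Signal when the count of ones in the last `window` observations
    exceeds `limit`.

    Returns the 0-based index of the first signal, or None if the
    chart never signals. No signal is given before the window is full.
    """
    recent = deque(maxlen=window)
    count = 0
    for t, x in enumerate(observations):
        if len(recent) == window:
            count -= recent[0]  # the oldest value is about to drop out
        recent.append(x)
        count += x
        if len(recent) == window and count > limit:
            return t
    return None

# The success rate rises in the second half of this toy sequence.
data = [0, 1, 0, 0, 1, 1, 1, 1]
print(windowed_binary_chart(data, window=4, limit=2))
```

Keeping a running count makes each update constant-time, which matters when the chart monitors a long data stream.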

Preprint, 2010, arXiv

Sequentially Updated Residuals and Detection of Stationary Errors in Polynomial Regression Models

The question whether a time series behaves as a random walk or as a stationary process is an important and delicate problem, particularly arising in financial statistics, econometrics, and engineering. This paper studies the problem of detecting sequentially that the error terms in a polynomial regression model no longer behave as a random walk but as a stationary process. We provide the asymptotic distribution theory for a monitoring procedure given by a control chart, i.e., a stopping time, which is related to a well-known unit root test statistic calculated from sequentially updated residuals. We provide a functional central limit theorem for the corresponding stochastic process which implies a central limit theorem for the control chart. The finite sample properties are investigated by a simulation study.
