Source author record

Adam B Kashlak

Adam B Kashlak appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Methodology math.ST Statistics Theory math.PR Computation Machine Learning math.FA

Catalog footprint

What is connected

5works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Topological Hidden Markov Models

The hidden Markov model (HMM) is a classic modeling tool with a wide swath of applications. Its inception considered observations restricted to a finite alphabet, but it was quickly extended to multivariate continuous distributions. In this article, we further extend the HMM from mixtures of normal distributions in $d$-dimensional Euclidean space to general Gaussian measure mixtures in locally convex topological spaces. The main innovation is the use of the Onsager-Machlup functional as a proxy for the probability density function in infinite dimensional spaces. This allows for choice of a Cameron-Martin space suitable for a given application. We demonstrate the versatility of this methodology by applying it to simulated diffusion processes such as Brownian and fractional Brownian sample paths as well as the Ornstein-Uhlenbeck process. Our methodology is applied to the identification of sleep states from overnight polysomnography time series data with the aim of diagnosing Obstructive Sleep Apnea in pediatric patients. It is also applied to a series of annual cumulative snowfall curves from 1940 to 1990 in the city of Edmonton, Alberta.

preprint2021arXiv

A reproducing kernel Hilbert space framework for functional data classification

We encounter a bottleneck when we try to borrow the strength of classical classifiers to classify functional data. The major issue is that functional data are intrinsically infinite dimensional, thus classical classifiers cannot be applied directly or have poor performance due to the curse of dimensionality. To address this concern, we propose to project functional data onto one specific direction, and then a distance-weighted discrimination DWD classifier is built upon the projection score. The projection direction is identified through minimizing an empirical risk function that contains the particular loss function in a DWD classifier, over a reproducing kernel Hilbert space. Hence our proposed classifier can avoid overfitting and enjoy appealing properties of DWD classifiers. This framework is further extended to accommodate functional data classification problems where scalar covariates are involved. In contrast to previous work, we establish a non-asymptotic estimation error bound on the relative misclassification rate. In finite sample case, we demonstrate that the proposed classifiers compare favorably with some commonly used functional classifiers in terms of prediction accuracy through simulation studies and a real-world application.

preprint2021arXiv

Functional Response Designs via the Analytic Permutation Test

Vast literature on experimental design extends from Fisher and Snedecor to the modern day. When data lies beyond the assumption of univariate normality, nonparametric methods including rank based statistics and permutation tests are enlisted. The permutation test is a versatile exact nonparametric significance test that requires drastically fewer assumptions than similar parametric tests. The main downfall of the permutation test is high computational cost making this approach laborious for complex data and sophisticated experimental designs and completely infeasible in any application requiring speedy results such as high throughput streaming data. We rectify this problem through application of concentration inequalities and thus propose a computation free permutation test -- i.e. a permutation-less permutation test. This general framework is applied to multivariate, matrix-valued, and functional data. We improve these concentration bounds via a novel incomplete beta transform. We extend our theory from 2-sample to $k$-sample testing through the use of weakly dependent Rademacher chaoses and modified decoupling inequalities. We test this methodology on classic functional data sets including the Berkeley growth curves and the phoneme dataset. We further consider analysis of spoken vowel sound under two experimental designs: the Latin square and the randomized block design.

preprint2020arXiv

Computation-free Nonparametric testing for Local and Global Spatial Autocorrelation with application to the Canadian Electorate

Measures of local and global spatial association are key tools for exploratory spatial data analysis. Many such measures exist including Moran's $I$, Geary's $C$, and the Getis-Ord $G$ and $G^*$ statistics. A parametric approach to testing for significance relies on strong assumptions, which are often not met by real world data. Alternatively, the most popular nonparametric approach, the permutation test, imposes a large computational burden especially for massive graphical networks. Hence, we propose a computation-free approach to nonparametric permutation testing for local and global measures of spatial autocorrelation stemming from generalizations of the Khintchine inequality from functional analysis and the theory of $L^p$ spaces. Our methodology is demonstrated on the results of the 2019 federal Canadian election in the province of Alberta. We recorded the percentage of the vote gained by the conservative candidate in each riding. This data is not normal, and the sample size is fixed at $n=34$ ridings making the parametric approach invalid. In contrast, running a classic permutation test for every riding, for multiple test statistics, with various neighbourhood structures, and multiple testing correction would require the simulation of millions of permutations. We are able to achieve similar statistical power on this dataset to the permutation test without the need for tedious simulation. We also consider data simulated across the entire electoral map of Canada.

preprint2016arXiv

Improved Rademacher symmetrization through a Wasserstein based measure of asymmetry

We propose of an improved version of the ubiquitous symmetrization inequality making use of the Wasserstein distance between a measure and its reflection in order to quantify the symmetry of the given measure. An empirical bound on this asymmetric correction term is derived through a bootstrap procedure and shown to give tighter results in practical settings than the original uncorrected inequality. Lastly, a wide range of applications are detailed including testing for data symmetry, constructing nonasymptotic high dimensional confidence sets, bounding the variance of an empirical process, and improving constants in Nemirovski style inequalities for Banach space valued random variables.