Source author record

Marc Ditzhaus

Marc Ditzhaus appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Methodology, math.ST, Statistics Theory, Machine Learning

Catalog footprint

What is connected

7 works

4 topics

4 close collaborators

preprint, 2022, arXiv

A Multiple kernel testing procedure for non-proportional hazards in factorial designs

In this paper we propose a multiple kernel testing procedure for inference on survival data when several factors (e.g. different treatment groups, gender, medical history) and their interactions are of interest simultaneously. Our method is able to deal with complex data and can be seen as an alternative to the omnipresent Cox model when assumptions such as proportionality cannot be justified. Our methodology combines well-known concepts from Survival Analysis, Machine Learning and Multiple Testing: differently weighted log-rank tests, kernel methods and multiple contrast tests. In this way, complex hazard alternatives beyond the classical proportional hazard set-up can be detected. Moreover, multiple comparisons are performed by fully exploiting the dependence structure of the single testing procedures to avoid a loss of power. In all, this leads to a flexible and powerful procedure for factorial survival designs whose theoretical validity is proven by martingale arguments and the theory for $V$-statistics. We evaluate the performance of our method in an extensive simulation study and illustrate it by a real data analysis.

preprint, 2022, arXiv

Hypothesis testing for matched pairs with missing data by maximum mean discrepancy: An application to continuous glucose monitoring

A frequent problem in statistical science is how to properly handle missing data in matched paired observations. There is a large body of literature coping with the univariate case. Yet, the ongoing technological progress in measuring biological systems raises the need for addressing more complex data, e.g., graphs, strings and probability distributions, among others. In order to fill this gap, this paper proposes new estimators of the maximum mean discrepancy (MMD) to handle complex matched pairs with missing data. These estimators can detect differences in data distributions under different missingness mechanisms. The validity of this approach is proven and further studied in an extensive simulation study, and results of statistical consistency are provided. Data from continuous glucose monitoring in a longitudinal population-based diabetes study are used to illustrate the application of this approach. By employing the new distributional representations together with cluster analysis, new clinical criteria on how glucose changes vary at the distributional level over five years can be explored.
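For orientation, the sketch below computes the textbook biased (V-statistic) estimate of the squared MMD between two fully observed samples of real numbers. It is not the missing-data estimator proposed in the paper. The Gaussian kernel, its bandwidth and the function names are illustrative assumptions.

```python
import math


def gaussian_kernel(x, y, bandwidth=1.0):
    """Gaussian (RBF) kernel between two real numbers (assumed kernel choice)."""
    return math.exp(-((x - y) ** 2) / (2.0 * bandwidth ** 2))


def mmd_squared(xs, ys, kernel=gaussian_kernel):
    """Biased (V-statistic) estimate of the squared MMD for complete samples."""
    n, m = len(xs), len(ys)
    # Average kernel value within each sample and across the two samples.
    k_xx = sum(kernel(a, b) for a in xs for b in xs) / (n * n)
    k_yy = sum(kernel(a, b) for a in ys for b in ys) / (m * m)
    k_xy = sum(kernel(a, b) for a in xs for b in ys) / (n * m)
    return k_xx + k_yy - 2.0 * k_xy


# Hypothetical toy data: identical samples give 0, shifted samples give a positive value.
same = mmd_squared([0.0, 0.1, 0.2], [0.0, 0.1, 0.2])
shifted = mmd_squared([0.0, 0.1, 0.2], [5.0, 5.1, 5.2])
```

Swapping in another positive definite kernel (for strings, graphs or distributions) only requires passing a different `kernel` function, which is what makes the MMD attractive for complex data.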

preprint, 2021, arXiv

Studentized Permutation Method for Comparing Restricted Mean Survival Times with Small Sample from Randomized Trials

Recent observations, especially in cancer immunotherapy clinical trials with time-to-event outcomes, show that the commonly used proportional hazard assumption is often not justifiable, hampering an appropriate analysis of the data by hazard ratios. An attractive alternative is the restricted mean survival time (RMST), which does not rely on any model assumption and can always be interpreted intuitively. As pointed out recently by Horiguchi and Uno (2020), methods for the RMST based on asymptotic theory suffer from inflated type-I error under small sample sizes. To overcome this problem, they suggested a permutation strategy leading to more convincing results in simulations. However, their proposal requires an exchangeable data set-up between comparison groups, which may be limiting in practice. In addition, it is not possible to invert their testing procedure to obtain valid confidence intervals, which can provide more in-depth information. In this paper, we address these limitations by proposing a studentized permutation test as well as the corresponding permutation-based confidence intervals. In our extensive simulation study, we demonstrate the advantage of our new method, especially in situations with relatively small sample sizes and unbalanced groups. Finally, we illustrate the application of the proposed method by re-analysing data from a recent lung cancer clinical trial.
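As a reminder of what the RMST measures, the sketch below estimates it as the area under the Kaplan-Meier curve on [0, tau]. It covers only the standard point estimate, not the studentized permutation test or the confidence intervals of the paper. The function names and toy data are illustrative assumptions.

```python
def kaplan_meier(times, events):
    """Kaplan-Meier curve as [(t, S(t))] at distinct event times.

    events[i] is 1 for an observed event and 0 for a right-censored time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    surv = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Collect all observations tied at time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths > 0:
            surv *= 1.0 - deaths / at_risk
            curve.append((t, surv))
        at_risk -= removed
    return curve


def rmst(times, events, tau):
    """Restricted mean survival time: area under the KM step function on [0, tau]."""
    area = 0.0
    prev_t, prev_s = 0.0, 1.0
    for t, s in kaplan_meier(times, events):
        if t >= tau:
            break
        area += prev_s * (t - prev_t)
        prev_t, prev_s = t, s
    # Last piece from the final jump before tau up to tau itself.
    area += prev_s * (tau - prev_t)
    return area


# Hypothetical toy data: without censoring the RMST equals the mean of min(T, tau).
rmst_uncensored = rmst([1.0, 2.0, 3.0], [1, 1, 1], tau=3.0)
rmst_censored = rmst([1.0, 2.0, 3.0], [1, 0, 1], tau=3.0)
```

A two-group comparison would take the difference of two such estimates; the small-sample behaviour of that difference is the subject of the abstract above.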

preprint, 2020, arXiv

Inferring median survival differences in general factorial designs via permutation tests

Factorial survival designs with right-censored observations are commonly analyzed by Cox regression and explained by means of hazard ratios. However, in the case of non-proportional hazards, their interpretation can become cumbersome, especially for clinicians. We therefore offer an alternative: median survival times are used to estimate treatment and interaction effects and null hypotheses are formulated in contrasts of their population versions. Permutation-based tests and confidence regions are proposed and shown to be asymptotically valid. Their type-1 error control and power behavior are investigated in extensive simulations, showing the new methods' wide applicability. The latter is complemented by an illustrative data analysis.

preprint, 2020, arXiv

Permutation inference in factorial survival designs with the CASANOVA

We propose inference procedures for general nonparametric factorial survival designs with possibly right-censored data. Similar to additive Aalen models, null hypotheses are formulated in terms of cumulative hazards. Thereby, deviations are measured in terms of quadratic forms in Nelson-Aalen-type integrals. In contrast to existing approaches, this allows working without restrictive model assumptions such as proportional hazards. In particular, crossing survival or hazard curves can be detected without a significant loss of power. For a distribution-free application of the method, a permutation strategy is suggested. The resulting procedures' asymptotic validity as well as their consistency are proven and their small sample performances are analyzed in extensive simulations. Their applicability is finally illustrated by analyzing an oncology data set.
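The cumulative hazards mentioned in this abstract are typically estimated with the Nelson-Aalen estimator. The sketch below shows that basic building block only; it does not implement the CASANOVA quadratic forms or the permutation step. The function name and toy data are illustrative assumptions.

```python
def nelson_aalen(times, events):
    """Nelson-Aalen cumulative hazard as [(t, A(t))] at distinct event times.

    events[i] is 1 for an observed event and 0 for a right-censored time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    cum_hazard = 0.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Collect all observations tied at time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths > 0:
            # Increment: number of events divided by number at risk just before t.
            cum_hazard += deaths / at_risk
            curve.append((t, cum_hazard))
        at_risk -= removed
    return curve


# Hypothetical toy data with and without a censored observation.
curve_full = nelson_aalen([1.0, 2.0, 3.0], [1, 1, 1])
curve_cens = nelson_aalen([1.0, 2.0, 3.0], [1, 0, 1])
```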

preprint, 2020, arXiv

Permutation test for the multivariate coefficient of variation in factorial designs

New inference methods for the multivariate coefficient of variation and its reciprocal, the standardized mean, are presented. While there are various testing procedures for both parameters in the univariate case, it is less known how to do inference in the multivariate setting appropriately. There are some existing procedures but they rely on restrictive assumptions on the underlying distributions. We tackle this problem by applying Wald-type statistics in the context of general, potentially heteroscedastic factorial designs. In addition to the $k$-sample case, higher-way layouts can be incorporated into this framework, allowing the discussion of main and interaction effects. The resulting procedures are shown to be asymptotically valid under the null hypothesis and consistent under general alternatives. To improve the finite sample performance, we suggest permutation versions of the tests and show that the tests' asymptotic properties can be transferred to them. An exhaustive simulation study compares the new tests, their permutation counterparts and existing methods. To further analyse the differences between the tests, we present two illustrative real data examples.

preprint, 2020, arXiv

QANOVA: Quantile-based Permutation Methods For General Factorial Designs

Population means and standard deviations are the most common estimands to quantify effects in factorial layouts. In fact, most statistical procedures in such designs are built towards inferring means or contrasts thereof. For more robust analyses, we consider the population median, the interquartile range (IQR) and more general quantile combinations as estimands in which we formulate null hypotheses and calculate compatible confidence regions. Based upon simultaneous multivariate central limit theorems and corresponding resampling results, we derive asymptotically correct procedures in general, potentially heteroscedastic, factorial designs with univariate endpoints. Special cases cover robust tests for the population median or the IQR in arbitrary crossed one-, two- and higher-way layouts with potentially heteroscedastic error distributions. In extensive simulations we analyze their small sample properties and also conduct an illustrative data analysis comparing children's height and weight from different countries.
