Source author record

Anh-Tuan Hoang

Anh-Tuan Hoang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Catalog footprint

What is connected

2 works

1 topic

1 close collaborator

Preprint, 2020, arXiv

On the usage of randomized p-values in the Schweder-Spjøtvoll estimator

We are concerned with multiple test problems with composite null hypotheses and the estimation of the proportion $π_{0}$ of true null hypotheses. The Schweder-Spjøtvoll estimator $\hat{π}_0$ utilizes marginal $p$-values and only works properly if the $p$-values that correspond to the true null hypotheses are uniformly distributed on $[0,1]$ ($\mathrm{Uni}[0,1]$-distributed). In the case of composite null hypotheses, marginal $p$-values are usually computed under least favorable parameter configurations (LFCs). Thus, they are stochastically larger than $\mathrm{Uni}[0,1]$ under non-LFCs in the null hypotheses. When using these LFC-based $p$-values, $\hat{π}_0$ tends to overestimate $π_{0}$. We introduce a new way of randomizing $p$-values that depends on a tuning parameter $c\in[0,1]$, such that $c=0$ and $c=1$ lead to $\mathrm{Uni}[0,1]$-distributed $p$-values, which are independent of the data, and to the original LFC-based $p$-values, respectively. For a certain value $c=c^{\star}$ the bias of $\hat{π}_0$ is minimized when using our randomized $p$-values. This often also entails a smaller mean squared error of the estimator as compared to the usage of the LFC-based $p$-values. We analyze these points theoretically, and we demonstrate them numerically in computer simulations under various standard statistical models.
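The overestimation described in the abstract can be illustrated with a small simulation. The sketch below is not taken from the paper. It uses the standard form of the Schweder-Spjøtvoll estimator, #{p_i > λ} / (m(1 - λ)), on a one-sided Gaussian test problem chosen here purely for illustration. The `randomize` function is an assumed construction that merely matches the two boundary cases stated in the abstract (c = 0 gives a data-independent uniform variate, c = 1 gives the LFC-based p-value); the paper's actual definition may differ. The names `schweder_spjotvoll`, `randomize` and `one_sided_p`, the choice λ = 0.5, and the value c = 0.5 are all hypothetical.

```python
import math
import random


def schweder_spjotvoll(p_values, lam=0.5):
    """Schweder-Spjotvoll estimate of the proportion of true nulls."""
    if not 0.0 <= lam < 1.0:
        raise ValueError("lam must lie in [0, 1)")
    m = len(p_values)
    # Under uniform null p-values, about m * pi_0 * (1 - lam) exceed lam.
    return sum(p > lam for p in p_values) / (m * (1.0 - lam))


def randomize(p_lfc, u, c):
    """Assumed randomization (not confirmed by the source):
    return the uniform variate u if p_lfc >= c, else p_lfc / c."""
    return u if p_lfc >= c else p_lfc / c


def one_sided_p(z):
    """LFC-based p-value for H0: mu <= 0 when Z ~ N(mu, 1); the LFC is mu = 0."""
    return 0.5 * math.erfc(z / math.sqrt(2.0))


rng = random.Random(0)
m = 20000
# Every null is true (pi_0 = 1), but mu = -1 lies strictly inside the null,
# so the LFC-based p-values are stochastically larger than Uni[0,1].
p_lfc = [one_sided_p(rng.gauss(-1.0, 1.0)) for _ in range(m)]
p_rand = [randomize(p, rng.random(), 0.5) for p in p_lfc]

est_lfc = schweder_spjotvoll(p_lfc)
est_rand = schweder_spjotvoll(p_rand)
print(f"LFC-based estimate:  {est_lfc:.3f}")
print(f"randomized estimate: {est_rand:.3f}")
```

In this toy setting the true proportion is 1. The LFC-based estimate should land well above 1, and the estimate from the randomized p-values should be noticeably closer to it. The value c = 0.5 is arbitrary and is not the bias-minimizing $c^{\star}$ from the abstract.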

Preprint, 2020, arXiv

Randomized p-values for multiple testing and their application in replicability analysis

We are concerned with testing replicability hypotheses for many endpoints simultaneously. This constitutes a multiple test problem with composite null hypotheses. Traditional $p$-values, which are computed under least favourable parameter configurations, are over-conservative in the case of composite null hypotheses. As demonstrated in prior work, this poses severe challenges in the multiple testing context, especially when one goal of the statistical analysis is to estimate the proportion $π_0$ of true null hypotheses. Randomized $p$-values have been proposed to remedy this issue. In the present work, we discuss the application of randomized $p$-values in replicability analysis. In particular, we introduce a general class of statistical models for which valid, randomized $p$-values can be calculated easily. By means of computer simulations, we demonstrate that their usage typically leads to a much more accurate estimation of $π_0$. Finally, we apply our proposed methodology to a real data example from genomics.
