Researcher profile

Jochen Blath

Jochen Blath contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging | Verification L1 | Unclaimed author
10 works
0 followers
3 topics
4 close collaborators


Published work

10 published item(s)

preprint | 2020 | arXiv

Invasion and fixation of microbial dormancy traits under competitive pressure

Microbial dormancy is an evolutionary trait that has emerged independently at various positions across the tree of life. It describes the ability of a microorganism to switch to a metabolically inactive state that can withstand unfavorable conditions. However, maintaining such a trait requires additional resources that could otherwise be used to increase, e.g., reproductive rates. In this paper, we aim to gain a basic understanding of the conditions under which maintaining a seed bank of dormant individuals provides a "fitness advantage" when facing resource limitations and competition for resources among individuals (in an otherwise stable environment). In particular, we wish to understand when an individual with a "dormancy trait" can invade a resident population lacking this trait despite having a lower reproduction rate than the residents. To this end, we follow a stochastic individual-based approach employing birth-and-death processes, where dormancy is triggered by competitive pressure for resources. In the large-population limit, we identify a necessary and sufficient condition under which a complete invasion of mutants has a positive probability. Further, we explicitly determine the limiting probability of invasion and the asymptotic time to fixation of mutants in the case of a successful invasion. In the proofs, we observe the three classical phases of invasion dynamics in the guise of Coron et al. (2017, 2019).

preprint | 2020 | arXiv

Statistical tools for seed bank detection

In this article, we derive statistical tools to analyze and distinguish the patterns of genetic variability produced by classical and recent population genetic models related to seed banks. In particular, we are concerned with models described by the Kingman coalescent (K), models exhibiting so-called weak seed banks described by a time-changed Kingman coalescent (W), models with so-called strong seed banks described by the seed bank coalescent (S) and the classical two-island model by Wright, described by the structured coalescent (TI). As the presence of a (strong) seed bank should stratify a population, we expect it to produce a signal roughly comparable to the presence of population structure. We begin with a brief analysis of Wright's $F_{ST}$, which is a classical but crude measure for population structure, followed by a derivation of the expected site frequency spectrum (SFS) in the infinite sites model based on 'phase-type distribution calculus' as recently discussed by Hobolth et al. (2019). Although both the $F_{ST}$ and the SFS can be readily computed under various population models, they discard statistical signal. Hence we also derive exact likelihoods for the full sampling probabilities, which can be achieved via recursions and a Monte Carlo scheme both in the infinite alleles and the infinite sites model. We employ a pseudo-marginal Metropolis-Hastings algorithm of Andrieu and Roberts (2009) to provide a method for simultaneous model selection and parameter inference under the so-called infinitely-many sites model, which is the most relevant in real applications. It turns out that this full likelihood method can reliably distinguish among the model classes (K, W), (S) and (TI) on the basis of simulated data even from moderate sample sizes. It is also possible to infer mutation rates, and in particular determine whether mutation is taking place in the (strong) seed bank.

preprint | 2014 | arXiv

Genealogy of a Wright Fisher model with strong seed bank component

We investigate the behaviour of the genealogy of a Wright-Fisher population model under the influence of a strong seed-bank effect. More precisely, we consider a simple seed-bank age distribution with two atoms, leading to either classical or long genealogical jumps (the latter modeling the effect of seed-dormancy). We assume that the length of these long jumps scales like a power $N^\beta$ of the original population size $N$, thus giving rise to a 'strong' seed-bank effect. For a certain range of $\beta$, we prove that the ancestral process of a sample of $n$ individuals converges under a non-classical time-scaling to Kingman's $n$-coalescent. Further, for a wider range of parameters, we analyze the time to the most recent common ancestor of two individuals analytically and by simulation.
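The two-atom jump mechanism described in this abstract can be illustrated with a small simulation of the pairwise time to the most recent common ancestor. This is only a sketch under stated assumptions, not the authors' code: the function name `pairwise_tmrca`, the long-jump probability `eps`, and the rounding of $N^\beta$ to an integer jump length are all hypothetical choices made for illustration.

```python
import random


def pairwise_tmrca(N, beta, eps, rng):
    """Generations back to the most recent common ancestor of two individuals.

    Assumed two-atom age distribution (illustrative parametrisation only):
    a lineage jumps back 1 generation with probability 1 - eps and
    round(N**beta) generations with probability eps. The parent is chosen
    uniformly among the N individuals of the target generation.
    """
    long_jump = max(1, round(N ** beta))
    # Two distinct individuals sampled in the present generation (time 0).
    times = [0, 0]
    labels = [0, 1]
    while True:
        # Move the lineage that currently sits closer to the present.
        i = 0 if times[0] <= times[1] else 1
        times[i] += long_jump if rng.random() < eps else 1
        labels[i] = rng.randrange(N)
        # Coalescence: same individual in the same generation.
        if times[0] == times[1] and labels[0] == labels[1]:
            return times[0]


if __name__ == "__main__":
    rng = random.Random(2014)
    samples = [pairwise_tmrca(N=50, beta=0.5, eps=0.1, rng=rng) for _ in range(200)]
    print("mean pairwise TMRCA (generations):", sum(samples) / len(samples))
```

Only the more recent lineage is moved at each step, so a lineage that jumps past the other one simply waits until the other catches up or overtakes it; a merger is recorded only when both occupy the same individual in the same generation.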

preprint | 2014 | arXiv

The largest strongly connected component in Wakeley et al's cyclical pedigree model

We establish a link between Wakeley et al's (2012) cyclical pedigree model from population genetics and a randomized directed configuration model (DCM) considered by Cooper and Frieze (2004). We then exploit this link in combination with asymptotic results for the in-degree distribution of the corresponding DCM to compute the asymptotic size of the largest strongly connected component $S^N$ (where $N$ is the population size) of the DCM and, respectively, of the pedigree. The size of the giant component can be characterized explicitly (amounting to approximately $80 \%$ of the total population size) and thus contributes to a reduced 'pedigree effective population size'. In addition, the second largest strongly connected component is only of size $O(\log N)$. Moreover, we describe the size and structure of the 'domain of attraction' of $S^N$. In particular, we show that with high probability for any individual the shortest ancestral line reaches $S^N$ after $O(\log \log N)$ generations, while almost all other ancestral lines take at most $O(\log N)$ generations.

preprint | 2013 | arXiv

Statistical properties of the site-frequency spectrum associated with Lambda-coalescents

Statistical properties of the site frequency spectrum associated with Lambda-coalescents are our objects of study. In particular, we derive recursions for the expected value, variance, and covariance of the spectrum, extending earlier results of Fu (1995) for the classical Kingman coalescent. Estimating coalescent parameters introduced by certain Lambda-coalescents for datasets too large for full likelihood methods is our focus. The recursions for the expected values we obtain can be used to find the parameter values which give the best fit to the observed frequency spectrum. The expected values are also used to approximate the probability that a (derived) mutation arises on a branch subtending a given number of leaves (DNA sequences), allowing us to apply a pseudo-likelihood inference to estimate coalescence parameters associated with certain subclasses of Lambda-coalescents. The properties of the pseudo-likelihood approach are investigated on simulated as well as real mtDNA datasets for the high fecundity Atlantic cod (Gadus morhua). Our results for two subclasses of Lambda-coalescents show that one can distinguish these subclasses from the Kingman coalescent, as well as between the Lambda-subclasses, even for moderate sample sizes.
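For orientation, the Kingman baseline that the abstract refers to has a closed form: with mutations falling on branches at rate $\theta/2$, the expected number of sites at which the derived allele appears $i$ times in a sample of size $n$ is $\theta/i$ for $i = 1, \dots, n-1$. The sketch below covers only this classical special case, not the Lambda-coalescent recursions of the paper; the function names are hypothetical, and the fit shown is the standard Watterson estimator rather than the paper's pseudo-likelihood method.

```python
def expected_sfs_kingman(n, theta):
    """Expected unfolded site frequency spectrum under Kingman's coalescent.

    Entry i-1 is the expected count of sites where the derived allele
    appears in exactly i of the n sampled sequences: theta / i.
    """
    if n < 2:
        raise ValueError("sample size n must be at least 2")
    return [theta / i for i in range(1, n)]


def watterson_theta(sfs):
    """Watterson's estimator: segregating sites divided by the harmonic number.

    `sfs` is an unfolded spectrum of length n - 1 (observed or expected).
    """
    n = len(sfs) + 1
    harmonic = sum(1.0 / i for i in range(1, n))
    return sum(sfs) / harmonic


if __name__ == "__main__":
    spectrum = expected_sfs_kingman(n=10, theta=4.0)
    print(spectrum)
    print(watterson_theta(spectrum))  # recovers theta for the expected spectrum
```

Comparing an observed spectrum against this $\theta/i$ shape is one simple way to see departures from the Kingman coalescent, such as the excess of singletons typically associated with multiple-merger genealogies.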

preprint | 2013 | arXiv

The ancestral process of long term seed bank models

We present a new model for seed banks, where direct ancestors of individuals may have lived in the near as well as the very far past. The classical Wright-Fisher model, as well as a seed bank model with bounded age distribution considered by Kaj, Krone and Lascoux (2001), are special cases of our model. We discern three parameter regimes of the seed bank age distribution, which lead to substantially different behaviour in terms of genetic variability, in particular with respect to fixation of types and time to the most recent common ancestor. We prove that for age distributions with finite mean, the ancestral process converges to a time-changed Kingman coalescent, while in the case of infinite mean, ancestral lineages might not merge at all with positive probability. Further, we present a construction of the forward in time process in equilibrium. The mathematical methods are based on renewal theory, the urn process introduced by Kaj et al., as well as on a paper by Hammond and Sheffield (2011).

preprint | 2012 | arXiv

An ancestral recombination graph for diploid populations with skewed offspring distribution

A large offspring number diploid biparental multilocus population model of Moran type is our object of study. At each timestep, a pair of diploid individuals drawn uniformly at random contribute offspring to the population. The number of offspring can be large relative to the total population size. Similar `heavily skewed' reproduction mechanisms have been considered by various authors recently. Each diploid parental individual contributes exactly one chromosome to each diploid offspring, and hence ancestral lineages can only coalesce when in distinct individuals. A separation of timescales phenomenon is thus observed. A result of Möhle (1998) is extended to obtain convergence of the ancestral process to an ancestral recombination graph necessarily admitting simultaneous multiple mergers of ancestral lineages. The usual ancestral recombination graph is obtained as a special case of our model when the parents contribute only one offspring to the population each time. Due to diploidy and large offspring numbers, novel effects appear. For example, the marginal genealogy at each locus admits simultaneous multiple mergers in up to four groups, and different loci remain substantially correlated even as the recombination rate grows large. Thus, genealogies for loci far apart on the same chromosome remain correlated. Correlation in coalescence times for two loci is derived and shown to be a function of the coalescence parameters of our model. Extending the observations by Eldon and Wakeley (2008), predictions of linkage disequilibrium are shown to be functions of the reproduction parameters of our model, in addition to the recombination rate. Correlations in ratios of coalescence times between loci can be high, even when the recombination rate is high and sample size is large.

preprint | 2012 | arXiv

Analysis of DNA sequence variation within marine species using Beta-coalescents

We apply recently developed inference methods based on general coalescent processes to DNA sequence data obtained from various marine species. Several of these species are believed to exhibit so-called shallow gene genealogies, potentially due to extreme reproductive behaviour, e.g. via Hedgecock's "reproduction sweepstakes". Besides the data analysis, in particular the inference of mutation rates and the estimation of the (real) time to the most recent common ancestor, we briefly address the question of whether the genealogies might be adequately described by so-called Beta-coalescents (as opposed to Kingman's coalescent), allowing multiple mergers of genealogies. The choice of the underlying coalescent model for the genealogy has drastic implications for the estimation of the above quantities, in particular the real-time embedding of the genealogy.
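To make "multiple mergers" concrete: in a Lambda-coalescent, any given group of $k$ out of $b$ lineages merges at rate $\lambda_{b,k} = \int_0^1 x^{k-2}(1-x)^{b-k}\,\Lambda(dx)$. Assuming the common parametrisation $\Lambda = \mathrm{Beta}(2-\alpha, \alpha)$ with $0 < \alpha < 2$ (the abstract does not state which parametrisation the paper uses), this integral reduces to a ratio of Beta functions, $\lambda_{b,k} = B(k-\alpha,\, b-k+\alpha) / B(2-\alpha,\, \alpha)$. A minimal sketch with hypothetical function names:

```python
from math import exp, lgamma


def log_beta(a, b):
    """Logarithm of the Beta function B(a, b), for a, b > 0."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def beta_coalescent_rate(b, k, alpha):
    """Rate at which a fixed group of k out of b lineages merges.

    Assumes Lambda = Beta(2 - alpha, alpha) with 0 < alpha < 2, giving
    lambda_{b,k} = B(k - alpha, b - k + alpha) / B(2 - alpha, alpha).
    """
    if not 2 <= k <= b:
        raise ValueError("need 2 <= k <= b")
    if not 0.0 < alpha < 2.0:
        raise ValueError("need 0 < alpha < 2")
    return exp(log_beta(k - alpha, b - k + alpha) - log_beta(2 - alpha, alpha))


if __name__ == "__main__":
    # Rates for a fixed group of k lineages among b = 5, with alpha = 1.5.
    for k in range(2, 6):
        print(k, beta_coalescent_rate(5, k, 1.5))
```

As $\alpha$ approaches 2 the rates for $k \ge 3$ vanish and only pairwise mergers remain, which is the Kingman case; smaller $\alpha$ puts more weight on large mergers and hence on shallower genealogies.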

preprint | 2011 | arXiv

Importance sampling for Lambda-coalescents in the infinitely many sites model

We present and discuss new importance sampling schemes for the approximate computation of the sample probability of observed genetic types in the infinitely many sites model from population genetics. More specifically, we extend the 'classical framework', where genealogies are assumed to be governed by Kingman's coalescent, to the more general class of Lambda-coalescents and develop further Hobolth et al.'s (2008) idea of deriving importance sampling schemes based on 'compressed genetrees'. The resulting schemes extend earlier work by Griffiths and Tavaré (1994), Stephens and Donnelly (2000), Birkner and Blath (2008) and Hobolth et al. (2008). We conclude with a performance comparison of classical and new schemes for Beta- and Kingman coalescents.

preprint | 2010 | arXiv

On the moments and the interface of the symbiotic branching model

In this paper we introduce a critical curve separating the asymptotic behavior of the moments of the symbiotic branching model, introduced by Etheridge and Fleischmann [Stochastic Process. Appl. 114 (2004) 127–160], into two regimes. Using arguments based on two different dualities and a classical result of Spitzer [Trans. Amer. Math. Soc. 87 (1958) 187–197] on the exit-time of a planar Brownian motion from a wedge, we prove that the parameter governing the model provides regimes of bounded and exponentially growing moments separated by subexponential growth. The moments turn out to be closely linked to the limiting distribution as time tends to infinity. The limiting distribution can be derived by a self-duality argument extending a result of Dawson and Perkins [Ann. Probab. 26 (1998) 1088–1138] for the mutually catalytic branching model. As an application, we show how a bound on the 35th moment improves the result of Etheridge and Fleischmann [Stochastic Process. Appl. 114 (2004) 127–160] on the speed of the propagation of the interface of the symbiotic branching model.