Researcher profile

Jeremy G Sumner

Jeremy G Sumner contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - Baseline
3works
0followers
6topics
1close collaborators

Actions

Decide how to stay connected

Follow researcher0

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2018arXiv

Systematics and symmetry in molecular phylogenetic modelling: perspectives from physics

The aim of this review is to present and analyze the probabilistic models of mathematical phylogenetics which have been intensively used in recent years in biology as the cornerstone of attempts to infer and reconstruct the ancestral relationships between species. We outline the development of theoretical phylogenetics, from the earliest studies based on morphological characters, through to the use of molecular data in a wide variety of forms. We bring the lens of mathematical physics to bear on the formulation of theoretical models, focussing on the applicability of many methods from the toolkit of that tradition -- techniques of groups and representations to guide model specification and to exploit the multilinear setting of the models in the presence of underlying symmetries; extensions to coalgebraic properties of the generators associated to rate matrices underlying the models, in relation to the graphical structures (trees and networks) which form the search space for inferring evolutionary trees. Aspects presented, include relating model classes to relevant matrix Lie algebras, as well as manipulations with group characters to enumerate various natural polynomial invariants, for identifying robust, low-parameter quantities for use in inference. Above all, we wish to emphasize the many features of multipartite entanglement which are shared between descriptions of quantum states on the physics side, and the multi-way tensor probability arrays arising in phylogenetics. In some instances, well-known objects such as the Cayley hyperdeterminant (the `tangle') can be directly imported into the formalism -- for models with binary character traits, and triplets of taxa. In other cases new objects appear, such as the remarkable quintic `squangle' invariants for quartet tree discrimination and DNA data, with their own unique interpretation in the phylogenetic modeling context.

preprint2016arXiv

Dimensional reduction for the general Markov model on phylogenetic trees

We present a method of dimensional reduction for the general Markov model of sequence evolution on a phylogenetic tree. We show that taking certain linear combinations of the associated random variables (site pattern counts) reduces the dimensionality of the model from exponential in the number of extant taxa, to quadratic in the number of taxa, while retaining the ability to statistically identify phylogenetic divergence events. A key feature is the identification of an invariant subspace which depends only bilinearly on the model parameters, in contrast to the usual multi-linear dependence in the full space. We discuss potential applications including the computation of split (edge) weights on phylogenetic trees from observed sequence data.

preprint2014arXiv

Matrix group structure and Markov invariants in the strand symmetric phylogenetic substitution model

We consider the continuous-time presentation of the strand symmetric phylogenetic substitution model (in which rate parameters are unchanged under nucleotide permutations given by Watson-Crick base conjugation). Algebraic analysis of the model's underlying structure as a matrix group leads to a change of basis where the rate generator matrix is given by a two-part block decomposition. We apply representation theoretic techniques and, for any (fixed) number of phylogenetic taxa $L$ and polynomial degree $D$ of interest, provide the means to classify and enumerate the associated Markov invariants. In particular, in the quadratic and cubic cases we prove there are precisely 1/3$(3^L+(-1)^L)$ and $6^{L-1}$ linearly independent Markov invariants, respectively. Additionally, we give the explicit polynomial forms of the Markov invariants for (i) the quadratic case with any number of taxa $L$, and (ii) the cubic case in the special case of a three-taxa phylogenetic tree. We close by showing our results are of practical interest since the quadratic Markov invariants provide independent estimates of phylogenetic distances based on (i) substitution rates within Watson-Crick conjugate pairs, and (ii) substitution rates across conjugate base pairs.