Source author record

Thomas J. X. Li

Thomas J. X. Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

9works
10topics
4close collaborators

Actions

Connect this record

Log in to claim

Research graph

See the researcher in context

Open full explorer

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2022arXiv

A computational framework for weighted simplicial homology

We provide a bottom up construction of torsion generators for weighted homology of a weighted complex over a discrete valuation ring $R=\mathbb{F}[[π]]$. This is achieved by starting from a basis for classical homology of the $n$-th skeleton for the underlying complex with coefficients in the residue field $\mathbb{F}$ and then lifting it to a basis for the weighted homology with coefficients in the ring $R$. Using the latter, a bijection is established between $n+1$ and $n$ dimensional simplices whose weight ratios provide the exponents of the $π$-monomials that generate each torsion summand in the structure theorem of the weighted homology modules over $R$. We present algorithms that subsume the torsion computation by reducing it to normalization over the residue field of $R$, and describe a Python package we implemented that takes advantage of this reduction and performs the computation efficiently.

preprint2022arXiv

On Weighted Simplicial Homology

We develop a framework for computing the homology of weighted simplicial complexes with coefficients in a discrete valuation ring. A weighted simplicial complex, $(X,v)$, introduced by Dawson [Cah. Topol. Géom. Différ. Catég. 31 (1990), pp. 229--243], is a simplicial complex, $X$, together with an integer-valued function, $v$, assigning weights to simplices, such that the weight of any of faces are monotonously increasing. In addition, weighted homology, $H_n^v(X)$, features a new boundary operator, $\partial_n^v$. In difference to Dawson, our approach is centered at a natural homomorphism $θ$ of weighted chain complexes. The key object is $H^v_{n}(X/θ)$, the weighted homology of a quotient of chain complexes induced by $θ$, appearing in a long exact sequence linking weighted homologies with different weights. We shall construct bases for the kernel and image of the weighted boundary map, identifying $n$-simplices as either $κ_n$- or $μ_n$-vertices. Long exact sequences of weighted homology groups and the bases, allow us to prove a structure theorem for the weighted simplicial homology with coefficients in a ring of formal power series $R=\mathbb{F}[[π]]$, where $\mathbb{F}$ is a field. Relative to simplicial homology new torsion arises and we shall show that the torsion modules are connected to a pairing between distinguished $κ_n$ and $μ_{n+1}$ simplices.

preprint2016arXiv

RNA secondary structures having a compatible sequence of certain nucleotide ratios

Given a random RNA secondary structure, $S$, we study RNA sequences having fixed ratios of nuclotides that are compatible with $S$. We perform this analysis for RNA secondary structures subject to various base pairing rules and minimum arc- and stack-length restrictions. Our main result reads as follows: in the simplex of the nucleotide ratios there exists a convex region in which, in the limit of long sequences, a random structure a.a.s.~has compatible sequence with these ratios and outside of which a.a.s.~a random structure has no such compatible sequence. We localize this region for RNA secondary structures subject to various base pairing rules and minimum arc- and stack-length restrictions. In particular, for {\bf GC}-sequences having a ratio of {\bf G} nucleotides smaller than $1/3$, a random RNA secondary structure without any minimum arc- and stack-length restrictions has a.a.s.~no such compatible sequence. For sequences having a ratio of {\bf G} nucleotides larger than $1/3$, a random RNA secondary structure has a.a.s. such compatible sequences. We discuss our results in the context of various families of RNA structures.

preprint2016arXiv

Statistics of topological RNA structures

In this paper we study properties of topological RNA structures, i.e.~RNA contact structures with cross-serial interactions that are filtered by their topological genus. RNA secondary structures within this framework are topological structures having genus zero. We derive a new bivariate generating function whose singular expansion allows us to analyze the distributions of arcs, stacks, hairpin- , interior- and multi-loops. We then extend this analysis to H-type pseudoknots, kissing hairpins as well as $3$-knots and compute their respective expectation values. Finally we discuss our results and put them into context with data obtained by uniform sampling structures of fixed genus.

preprint2014arXiv

A combinatorial interpretation of the $κ^{\star}_{g}(n)$ coefficients

Studying the virtual Euler characteristic of the moduli space of curves, Harer and Zagier compute the generating function $C_g(z)$ of unicellular maps of genus $g$. They furthermore identify coefficients, $κ^{\star}_{g}(n)$, which fully determine the series $C_g(z)$. The main result of this paper is a combinatorial interpretation of $κ^{\star}_{g}(n)$. We show that these enumerate a class of unicellular maps, which correspond $1$-to-$2^{2g}$ to a specific type of trees, referred to as O-trees. O-trees are a variant of the C-decorated trees introduced by Chapuy, Féray and Fusy. We exhaustively enumerate the number $s_{g}(n)$ of shapes of genus $g$ with $n$ edges, which is a specific class of unicellular maps with vertex degree at least three. Furthermore we give combinatorial proofs for expressing the generating functions $C_g(z)$ and $S_g(z)$ for unicellular maps and shapes in terms of $κ^{\star}_{g}(n)$, respectively. We then prove a two term recursion for $κ^{\star}_{g}(n)$ and that for any fixed $g$, the sequence $\{κ_{g,t}\}_{t=0}^g$ is log-concave, where $κ^{\star}_{g}(n)= κ_{g,t}$, for $n=2g+t-1$.

preprint2013arXiv

Combinatorics of $γ$-structures

In this paper we study canonical $γ$-structures, a class of RNA pseudoknot structures that plays a key role in the context of polynomial time folding of RNA pseudoknot structures. A $γ$-structure is composed by specific building blocks, that have topological genus less than or equal to $γ$, where composition means concatenation and nesting of such blocks. Our main result is the derivation of the generating function of $γ$-structures via symbolic enumeration using so called irreducible shadows. We furthermore recursively compute the generating polynomials of irreducible shadows of genus $\le γ$. $γ$-structures are constructed via $γ$-matchings. For $1\le γ\le 10$, we compute Puiseux-expansions at the unique, dominant singularities, allowing us to derive simple asymptotic formulas for the number of $γ$-structures.

preprint2012arXiv

The topological filtration of $γ$-structures

In this paper we study $γ$-structures filtered by topological genus. $γ$-structures are a class of RNA pseudoknot structures that plays a key role in the context of polynomial time folding of RNA pseudoknot structures. A $γ$-structure is composed by specific building blocks, that have topological genus less than or equal to $γ$, where composition means concatenation and nesting of such blocks. Our main results are the derivation of a new bivariate generating function for $γ$-structures via symbolic methods, the singularity analysis of the solutions and a central limit theorem for the distribution of topological genus in $γ$-structures of given length. In our derivation specific bivariate polynomials play a central role. Their coefficients count particular motifs of fixed topological genus and they are of relevance in the context of genus recursion and novel folding algorithms.

preprint2010arXiv

Combinatorial analysis of interacting RNA molecules

Recently several minimum free energy (MFE) folding algorithms for predicting the joint structure of two interacting RNA molecules have been proposed. Their folding targets are interaction structures, that can be represented as diagrams with two backbones drawn horizontally on top of each other such that (1) intramolecular and intermolecular bonds are noncrossing and (2) there is no "zig-zag" configuration. This paper studies joint structures with arc-length at least four in which both, interior and exterior stack-lengths are at least two (no isolated arcs). The key idea in this paper is to consider a new type of shape, based on which joint structures can be derived via symbolic enumeration. Our results imply simple asymptotic formulas for the number of joint structures with surprisingly small exponential growth rates. They are of interest in the context of designing prediction algorithms for RNA-RNA interactions.

preprint2010arXiv

Combinatorics of RNA-RNA interaction

RNA-RNA binding is an important phenomenon observed for many classes of non-coding RNAs and plays a crucial role in a number of regulatory processes. Recently several MFE folding algorithms for predicting the joint structure of two interacting RNA molecules have been proposed. Here joint structure means that in a diagram representation the intramolecular bonds of each partner are pseudoknot-free, that the intermolecular binding pairs are noncrossing, and that there is no so-called ``zig-zag'' configuration. This paper presents the combinatorics of RNA interaction structures including their generating function, singularity analysis as well as explicit recurrence relations. In particular, our results imply simple asymptotic formulas for the number of joint structures.