Source author record

Liang Hong

Liang Hong appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


math.PR; math.FA; math.ST; Statistics Theory; Biological Physics; Computational Engineering, Finance, and Science; Computer Vision; cond-mat.mes-hall; cond-mat.mtrl-sci; cond-mat.soft; Machine Learning; math.GN; math.GR; math.OC; physics.chem-ph; physics.comp-ph; Quantitative Methods

Catalog footprint

What is connected

17 works

17 topics

4 close collaborators


preprint · 2026 · arXiv

The true detection probability versus the subjective detection probability of a uniformly optimal search plan

This article investigates the difference between the true detection probability and the subjective detection probability of a uniformly optimal search plan. It makes four contributions. First, it provides a set of examples showing that, in terms of the true detection probability, the uniformly optimal search plan may or may not be optimal. Second, it establishes that the true detection probability of the uniformly optimal search plan based on a composite prior can be less than that of the composite uniformly optimal search plan based on different priors. Third, it argues that an open problem is unsolvable. Finally, it shows that the true detection probability of the uniformly optimal search plan converges to one as the search time approaches infinity.
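The gap between the two probabilities can be illustrated with a small sketch. It assumes a textbook setting that the abstract does not spell out: a stationary target in one of finitely many cells, a strictly positive subjective prior, and an exponential detection function $1 - e^{-z}$ for effort $z$. The function names (`optimal_allocation`, `subjective_detection`, `true_detection`) are hypothetical, and "true detection probability" is read here as the detection probability given the cell that actually contains the target; the paper's definitions may differ.

```python
import numpy as np

def optimal_allocation(prior, effort):
    """Effort z_i maximizing sum_i prior_i * (1 - exp(-z_i)) with sum_i z_i = effort.

    Assumes every prior_i > 0 and an exponential detection function.
    """
    p = np.asarray(prior, dtype=float)
    order = np.argsort(-p)            # cells from most to least likely
    logp = np.log(p[order])
    z = np.zeros_like(p)
    for k in range(len(p), 0, -1):
        # Log of the Lagrange multiplier if only the k most likely cells are searched.
        level = (logp[:k].sum() - effort) / k
        if logp[k - 1] - level >= 0:  # the least likely searched cell gets effort >= 0
            z[order[:k]] = logp[:k] - level
            break
    return z

def subjective_detection(prior, z):
    """Detection probability averaged over the searcher's prior."""
    return float(np.dot(prior, 1.0 - np.exp(-np.asarray(z))))

def true_detection(z, target_cell):
    """Detection probability given the cell that actually holds the target."""
    return float(1.0 - np.exp(-z[target_cell]))

prior = [0.6, 0.3, 0.1]
z_short = optimal_allocation(prior, effort=1.0)
z_long = optimal_allocation(prior, effort=50.0)
print(z_short, subjective_detection(prior, z_short), true_detection(z_short, 2))
print([true_detection(z_long, j) for j in range(3)])
```

With a small effort budget the least likely cell receives no effort, so the true detection probability is zero if the target is there even though the subjective probability is positive. With a large budget every cell is searched and the true detection probability approaches one, in line with the last claim of the abstract for this special case.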

preprint · 2026 · arXiv

Towards A Generative Protein Evolution Machine with DPLM-Evo

Proteins are shaped by gradual evolution under biophysical and functional constraints. Protein language models learn rich evolutionary constraints from large-scale sequences, and discrete diffusion-based protein language models (e.g., DPLMs) are promising for both understanding and generation. However, existing DPLMs typically rely on masking-based absorbing diffusion that contradicts a simple biological intuition: proteins evolve through accumulated edits, not by emerging from masks. Consequently, these frameworks lack explicit pretraining objectives for substitution and insertion/deletion (indel) operations, limiting both optimization-style post-editing and flexible guided generation. To address these limitations, we present DPLM-Evo, an evolutionary discrete diffusion framework that explicitly predicts substitution, insertion, and deletion operations during denoising. DPLM-Evo decouples an upsampled-length latent alignment space from the variable-length observed sequence space, which makes indel-aware generation tractable and enables adaptive scaffold growth throughout the process with negligible computational overhead. To better align substitutions with real evolution, we further introduce a contextualized evolutionary noising kernel that produces biologically informed, context-dependent mutation patterns. Across tasks, DPLM-Evo improves sequence understanding and achieves state-of-the-art mutation effect prediction performance on ProteinGym in the single-sequence setting. It also enables variable-length simulated evolution, and post-editing/optimization of existing proteins via explicit edit trajectories.

preprint · 2022 · arXiv

Interpretable RNA Foundation Model from Unannotated Data for Highly Accurate RNA Structure and Function Predictions

Non-coding RNA structure and function are essential to understanding various biological processes, such as cell signaling, gene expression, and post-transcriptional regulation. These are all among the core problems in the RNA field. With the rapid growth of sequencing technology, we have accumulated a massive amount of unannotated RNA sequences. On the other hand, costly experimental observation yields only a limited amount of annotated data and 3D structures. Hence, it is still challenging to design computational methods for predicting their structures and functions. The lack of annotated data and systematic study causes inferior performance. To resolve the issue, we propose a novel RNA foundation model (RNA-FM) to take advantage of all the 23 million non-coding RNA sequences through self-supervised learning. Within this approach, we discover that the pre-trained RNA-FM could infer sequential and evolutionary information of non-coding RNAs without using any labels. Furthermore, we demonstrate RNA-FM's effectiveness by applying it to the downstream secondary/3D structure prediction, SARS-CoV-2 genome structure and evolution prediction, protein-RNA binding preference modeling, and gene expression regulation modeling. The comprehensive experiments show that the proposed method improves the RNA structural and functional modelling results significantly and consistently. Despite only being trained with unlabelled data, RNA-FM can serve as the foundational model for the field.

preprint · 2022 · arXiv

The $ω^3$ scaling of the vibrational density of states in quasi-2D nanoconfined solids

Atomic vibrations play a vital role in the functions of various physical, chemical, and biological systems. The vibrational properties and the specific heat of crystalline bulk materials are well described by Debye theory, which successfully predicts the quadratic $ω^{2}$ low-frequency scaling of the vibrational density of states (VDOS) in bulk ordered solids from a few fundamental assumptions. However, the analogous framework for nanoconfined materials with fewer degrees of freedom has been far less well explored. Using inelastic neutron scattering, we characterize the VDOS of amorphous ice confined to a thickness of $\approx 1$ nm inside graphene oxide membranes and we observe a crossover from the Debye $ω^2$ scaling to an anomalous $ω^3$ behaviour upon reducing the confinement size $L$. Additionally, using molecular dynamics simulations, we confirm the experimental findings and also prove that such a scaling of the VDOS appears in both crystalline and amorphous solids under slab-confinement. We theoretically demonstrate that this low-frequency $ω^3$ law results from the geometric constraints on the momentum phase space induced by confinement along one spatial direction. Finally, we predict that the Debye scaling reappears at a characteristic frequency $ω_\times= v L/2π$, with $v$ the speed of sound of the material, and we confirm this quantitative estimate with simulations. This new physical phenomenon, revealed by combining theoretical, experimental and simulation results, is relevant to a myriad of systems both in synthetic and biological contexts and it could impact various technological applications for systems under confinement such as nano-devices or thin films.

preprint · 2021 · arXiv

Superscalability of the random batch Ewald method

Coulomb interaction, following an inverse-square force law, quantifies the force between two stationary, electrically charged particles. The long-range nature of Coulomb interactions poses a major challenge to molecular dynamics simulations, which are major tools for problems at the nano-/micro-scale. Various algorithms have been developed to calculate the pairwise Coulomb interactions with linear scaling, but their poor scalability limits the size of simulated systems. Here, we implement an efficient molecular dynamics algorithm with the random batch Ewald method on all-atom systems, where the complete Fourier components in the Coulomb interaction are replaced by randomly selected mini-batches. By simulating $N$-body systems of up to 100 million particles using $10$ thousand CPU cores, we show that this algorithm furnishes $O(N)$ complexity, almost perfect scalability and an order of magnitude faster computational speed when compared to the existing state-of-the-art algorithms. Further examinations of our algorithm on distinct systems, including pure water, micro-phase-separated electrolyte and protein solution, demonstrate that the spatiotemporal information on all time and length scales investigated and thermodynamic quantities derived from our algorithm are in perfect agreement with those obtained from the existing algorithms. Therefore, our algorithm provides a breakthrough solution on scalability of computing the Coulomb interaction. It is particularly useful and cost-effective for simulating ultra-large systems, which was either impossible or very costly using existing algorithms, and should thus benefit the broad scientific community.
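The idea of replacing the full Fourier sum by a random mini-batch can be sketched on the reciprocal-space Ewald energy. This is only an illustration under stated assumptions, not the authors' implementation: it uses a cubic box, Gaussian units, a finite cutoff `n_max` on the reciprocal lattice, and estimates the energy rather than the forces used in molecular dynamics. All function names and parameter values are hypothetical. Wave vectors are drawn with probability proportional to the Gaussian factor $e^{-k^2/(4\alpha)}$, which makes the batch average an unbiased estimate of the truncated sum.

```python
import numpy as np

def k_vectors(box, n_max):
    """Nonzero reciprocal vectors 2*pi*n/box with integer components |n_i| <= n_max."""
    r = np.arange(-n_max, n_max + 1)
    n = np.array(np.meshgrid(r, r, r, indexing="ij")).reshape(3, -1).T
    n = n[np.any(n != 0, axis=1)]
    return 2.0 * np.pi * n / box

def gaussian_weights(k, alpha):
    """Ewald damping factor exp(-k^2 / (4 alpha)) for each wave vector."""
    return np.exp(-np.sum(k**2, axis=1) / (4.0 * alpha))

def unweighted_terms(k, pos, q, box):
    """(2 pi / V) |rho(k)|^2 / k^2, with rho(k) = sum_i q_i exp(i k . r_i)."""
    rho = np.exp(1j * (k @ pos.T)) @ q
    return (2.0 * np.pi / box**3) * np.abs(rho) ** 2 / np.sum(k**2, axis=1)

def full_energy(k, pos, q, box, alpha):
    """Reciprocal-space energy summed over every wave vector in k."""
    return float(np.sum(gaussian_weights(k, alpha) * unweighted_terms(k, pos, q, box)))

def sample_batch(k, alpha, batch_size, rng):
    """Draw wave-vector indices with probability proportional to the Gaussian factor."""
    w = gaussian_weights(k, alpha)
    return rng.choice(len(k), size=batch_size, p=w / w.sum())

def batch_energy(k, pos, q, box, alpha, idx):
    """Importance-sampling estimate that evaluates rho(k) only on the sampled batch."""
    total_weight = gaussian_weights(k, alpha).sum()
    return float(total_weight * np.mean(unweighted_terms(k[idx], pos, q, box)))

rng = np.random.default_rng(0)
box, alpha = 5.0, 1.0
pos = rng.uniform(0.0, box, size=(8, 3))   # toy configuration, not a physical system
q = np.array([1.0, -1.0] * 4)              # charge-neutral
k = k_vectors(box, n_max=3)
idx = sample_batch(k, alpha, batch_size=50, rng=rng)
print(full_energy(k, pos, q, box, alpha), batch_energy(k, pos, q, box, alpha, idx))
```

The cost of the batch estimate scales with the batch size times the number of particles, independent of how many wave vectors the cutoff admits, which is the property the abstract attributes to the method.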

preprint · 2020 · arXiv

SCAttNet: Semantic Segmentation Network with Spatial and Channel Attention Mechanism for High-Resolution Remote Sensing Images

High-resolution remote sensing images (HRRSIs) contain substantial ground object information, such as texture, shape, and spatial location. Semantic segmentation, which is an important task for element extraction, has been widely used in processing mass HRRSIs. However, HRRSIs often exhibit large intraclass variance and small interclass variance due to the diversity and complexity of ground objects, thereby bringing great challenges to a semantic segmentation task. In this paper, we propose a new end-to-end semantic segmentation network, which integrates lightweight spatial and channel attention modules that can refine features adaptively. We compare our method with several classic methods on the ISPRS Vaihingen and Potsdam datasets. Experimental results show that our method can achieve better semantic segmentation results. The source codes are available at https://github.com/lehaifeng/SCAttNet.

preprint · 2016 · arXiv

On the Riesz decomposition property and the interpolation property of stopping times

It is known that random variables have the Riesz decomposition property and the interpolation property. These properties are not only interesting in their own right; they have been applied to quantitative finance and actuarial mathematics. One would naturally ask whether the same holds for stopping times. We give an affirmative answer in this paper. We also point out that optional times possess these two properties too.
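For orientation, the two properties can be written in their classical Riesz space form, which every Riesz space $E$ satisfies. The abstract does not give the exact formulation used for stopping times, so the last display is only an illustrative check of the interpolation property, assuming a filtration $(\mathcal{F}_t)$, stopping times $\sigma_1, \sigma_2, \tau_1, \tau_2$ and the pointwise order.

```latex
% Riesz decomposition property
\[
0 \le x \le y_1 + y_2,\quad y_1, y_2 \ge 0
\;\Longrightarrow\;
\exists\, x_1, x_2 \in E:\quad x = x_1 + x_2,\quad 0 \le x_i \le y_i \ (i = 1, 2).
\]
% Interpolation property
\[
x_i \le y_j \ \text{for all } i, j \in \{1, 2\}
\;\Longrightarrow\;
\exists\, z \in E:\quad x_i \le z \le y_j \ \text{for all } i, j \in \{1, 2\}.
\]
% Illustrative check for stopping times with \sigma_i \le \tau_j:
% the pointwise maximum is again a stopping time and lies between the two pairs.
\[
\{\sigma_1 \vee \sigma_2 \le t\} = \{\sigma_1 \le t\} \cap \{\sigma_2 \le t\} \in \mathcal{F}_t,
\qquad
\sigma_i \le \sigma_1 \vee \sigma_2 \le \tau_j .
\]
```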

preprint · 2015 · arXiv

Fuzzy Riesz subspaces, fuzzy ideals, fuzzy bands and fuzzy band projections

Fuzzy ordered linear spaces, Riesz spaces, fuzzy Archimedean spaces and $σ$-complete fuzzy Riesz spaces were defined and studied in several works. Following the efforts along this line, we define fuzzy Riesz subspaces, fuzzy ideals, fuzzy bands and fuzzy band projections and establish their fundamental properties.

preprint · 2015 · arXiv

Locally solid topological lattice-ordered groups

Locally solid Riesz spaces have been widely investigated in the past several decades; but locally solid topological lattice-ordered groups seem to be largely unexplored. The paper is an attempt to initiate a relatively systematic study of locally solid topological lattice-ordered groups. We give both Robert-Namioka-type characterization and Fremlin-type characterization of locally solid topological lattice-ordered groups. In particular, we show that a group topology on a lattice-ordered group is locally solid if and only if it is generated by a family of translation-invariant lattice pseudometrics. We also investigate (1) the basic properties of lattice group homomorphism on locally solid topological lattice-ordered groups; (2) the relationship between order-bounded subsets and topologically bounded subsets in locally solid topological lattice-ordered groups; (3) the Hausdorff completion of locally solid topological lattice-ordered groups.

preprint · 2015 · arXiv

On order-bounded subsets of locally solid Riesz spaces

In a topological Riesz space there are two types of bounded subsets: order bounded subsets and topologically bounded subsets. It is natural to ask (1) whether an order bounded subset is topologically bounded and (2) whether a topologically bounded subset is order bounded. A classical result gives a partial answer to (1) by saying that an order bounded subset of a locally solid Riesz space is topologically bounded. This paper attempts to further investigate these two questions. In particular, we show that (i) there exists a non-locally solid topological Riesz space in which every order bounded subset is topologically bounded; (ii) if a topological Riesz space is not locally solid, an order bounded subset need not be topologically bounded; (iii) a topologically bounded subset need not be order bounded even in a locally convex-solid Riesz space. Next, we show that (iv) if a locally solid Riesz space has an order bounded topological neighborhood of zero, then every topologically bounded subset is order bounded; (v) however, a locally convex-solid Riesz space may not possess an order bounded topological neighborhood of zero even if every topologically bounded subset is order bounded; (vi) a pseudometrizable locally solid Riesz space need not have an order bounded topological neighborhood of zero. In addition, we give some results about the relationship between order bounded subsets and positive homogeneous operators.

preprint · 2014 · arXiv

Andô-Douglas type characterization of generalized conditional expectations, optional projections and predictable projections

Generalized conditional expectations, optional projections and predictable projections of stochastic processes play important roles in the general theory of stochastic processes, semimartingale theory and stochastic calculus. They share some important properties with ordinary conditional expectations. While the characterization of ordinary conditional expectations has been studied by several authors, no similar work seems to have been done for these three concepts. This paper aims at undertaking this task by giving an Andô-Douglas type characterization theorem for each of them.

preprint · 2014 · arXiv

The linear topology associated with weak convergence of probability measures

This expository note aims at illustrating weak convergence of probability measures from a broader view than a previously published paper. Though the results are standard for functional analysts, this approach is little known to statisticians, and our presentation gives an alternative view to that of most standard probability textbooks. In particular, this functional approach clarifies the underlying topological structure of weak convergence. We hope this short note is helpful for those who are interested in weak convergence as well as instructors of measure theoretic probability.

preprint · 2013 · arXiv

A note on Bayesian convergence rates under local prior support conditions

Bounds on Bayesian posterior convergence rates, assuming the prior satisfies both local and global support conditions, are now readily available. In this paper we explore, in the context of density estimation, Bayesian convergence rates assuming only local prior support conditions. Our results give optimal rates under minimal conditions using very simple arguments.

preprint · 2013 · arXiv

Long-Time Mean Square Displacements in Proteins

We propose a method for obtaining the intrinsic, long time mean square displacement (MSD) of atoms and molecules in proteins from finite time molecular dynamics (MD) simulations. Typical data from simulations are limited to times of 1 to 10 ns and over this time period the calculated MSD continues to increase without a clear limiting value. The proposed method consists of fitting a model to MD simulation-derived values of the incoherent intermediate neutron scattering function, $I_{inc}(Q,t)$, for finite times. The infinite time MSD, $\langle r^2 \rangle$, appears as a parameter in the model and is determined by fits of the model to the finite time $I_{inc}(Q,t)$. Specifically, the $\langle r^2 \rangle$ is defined in the usual way in terms of the Debye-Waller factor as $I(Q,t = \infty) = \exp(- Q^2 \langle r^2 \rangle /3)$. The method is illustrated by obtaining the intrinsic MSD $\langle r^2 \rangle$ of hydrated lysozyme powder (h = 0.4 g water/g protein) over a wide temperature range. The intrinsic $\langle r^2 \rangle$ obtained from data out to 1 ns and to 10 ns is found to be the same. The intrinsic $\langle r^2 \rangle$ is approximately twice the value of the MSD that is reached in simulations after times of 1 ns which correspond to those observed using neutron instruments that have an energy resolution width of 1 μeV.
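The Debye-Waller relation quoted above can be inverted directly once a long-time plateau value of the scattering function is available. The sketch below shows only that inversion, on synthetic values. It does not reproduce the paper's fitting model for finite-time $I_{inc}(Q,t)$, which the abstract does not specify, and the function names and the numbers in the example are hypothetical.

```python
import numpy as np

def msd_from_plateau(Q, I_inf):
    """Invert I(Q, t=inf) = exp(-Q^2 <r^2> / 3) for <r^2> at a single Q."""
    return -3.0 * np.log(I_inf) / Q**2

def msd_from_q_scan(Q, I_inf):
    """Estimate <r^2> from several Q values via the slope of ln I against Q^2."""
    slope, _intercept = np.polyfit(np.asarray(Q) ** 2, np.log(I_inf), 1)
    return -3.0 * slope

# Synthetic plateau values generated from an assumed <r^2> (arbitrary units).
Q = np.array([0.5, 1.0, 1.5, 2.0])
r2_assumed = 0.8
I_inf = np.exp(-Q**2 * r2_assumed / 3.0)
print(msd_from_plateau(Q[1], I_inf[1]), msd_from_q_scan(Q, I_inf))
```

Using several $Q$ values and a straight-line fit is one simple way to reduce sensitivity to noise in any single plateau value; it is a design choice of this sketch rather than something stated in the abstract.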

preprint · 2013 · arXiv

On the interpolation property and dominated decomposition property of quasimartingales

For a quasimartingale majorized by another quasimartingale, it is natural to ask whether a third quasimartingale can be inserted between them. In this paper, we give an affirmative answer to this problem. We also establish a dominated decomposition property of quasimartingales. In addition, we show that a weak interpolation property holds for supermartingales and local supermartingales. Our approach also yields the interpolation property and dominated decomposition property for Markov chains.

preprint · 2013 · arXiv

Weak convergence of probability measures: a topological vector space point of view

Weak convergence of probability measures is one of the most important topics in the field of probability and statistics. In this survey paper, we look at weak convergence of probability measures from the topological vector space point of view. We start from the key concepts and results about weak topology and weak convergence under the general framework of topological vector spaces. Then we restrict our attention to the space of probability measures and see how the general results specialize to those in probability theory. Finally, we review some important facts about the metrizability of weak topology. We hope the general approach reviewed in this paper can provide an alternative view and some insights.

preprint · 2012 · arXiv

On convergence rates of Bayesian predictive densities and posterior distributions

Frequentist-style large-sample properties of Bayesian posterior distributions, such as consistency and convergence rates, are important considerations in nonparametric problems. In this paper we give an analysis of Bayesian asymptotics based primarily on predictive densities. Our analysis is unified in the sense that essentially the same approach can be taken to develop convergence rate results in iid, mis-specified iid, independent non-iid, and dependent data cases.
