Researcher profile

Mingzhe Li

Mingzhe Li contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
14works
0followers
14topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

14 published item(s)

preprint2026arXiv

A multi-mesh adaptive finite element method for solving the Gross-Pitaevskii equation

It is found that the wave functions of the Gross-Pitaevskii equation (GPE) often vary significantly in different spatial regions, with some components exhibiting sharp variations while others remain smooth. Solving the GPE on a single mesh, even with adaptive refinement, can lead to excessive computational costs due to the need to accommodate the most oscillatory solution. To address this issue, we present a multi-mesh adaptive finite element method for solving the GPE. To this end, we first convert it into a time-dependent equation through the imaginary time propagation method. Then the equation is discretized by the backward Euler method temporally and the multi-mesh adaptive finite element method spatially. The proposed method is compared with the single-mesh adaptive method through a series of numerical experiments, which demonstrate that the multi-mesh adaptive method can achieve the same numerical accuracy with less computational consumption.

preprint2026arXiv

Offline Policy Optimization with Posterior Sampling

A fundamental challenge in model-based offline reinforcement learning (RL) lies in the trade-off between generalization and robustness against exploitation errors in out-of-distribution (OOD) regions. While OOD samples may capture valid underlying physical dynamics, they also introduce the risk of model exploitation. Existing methods typically address this risk through excessive pessimistic regularization, which ensures robustness but often sacrifices generalization. To overcome this limitation, we propose Posterior Sampling-based Policy Optimization (PSPO), which formulates dynamics modeling as a Bayesian inference process to derive a posterior that explicitly quantifies model fidelity. Through the integration of posterior sampling and constrained policy optimization, our method leverages dynamics-consistent OOD transitions for generalization while ensuring robustness against model exploitation. Theoretically, we formulate Q-value estimation under posterior sampling as a stochastic approximation problem and establish its convergence. We decompose policy optimization into a sequence of constrained subproblems, demonstrating that solving these subproblems guarantees monotonic improvement until convergence. Experiments on standard benchmarks validate that PSPO achieves superior performance compared to state-of-the-art baselines.

preprint2026arXiv

Source or It Didn't Happen: A Multi-Agent Framework for Citation Hallucination Detection

Large language models are increasingly used in scientific writing, yet they can fabricate citation-shaped references that appear plausible but fail bibliographic verification. Existing detectors often reduce verification to binary found/not-found decisions and rely on brittle parsing or incomplete retrieval, offering little field-level signal to auditors. We reframe citation hallucination detection as taxonomy-aligned field-level adjudication and introduce a 12-code taxonomy spanning Real, Potential, and Hallucinated citations. Based on this taxonomy, we build CiteTracer, a cascading multi-agent detector that extracts structured citations from PDF and BibTeX, retrieves evidence through cache lookup, URL fetch, scholar connectors, and web search, applies deterministic field matching, and routes ambiguous cases to class-specialist judgers. We release a benchmark of 2,450 synthetic citations built from real seeds with controlled LLM mutations, paired with 957 real-world fabricated citations drawn from ICLR 2026 and an anonymous conference desk-rejected submissions. CiteTracer reaches 97.1% accuracy on the synthetic benchmark, with class-level F1 scores of 97.0, 95.8, and 98.5 for Real, Potential, and Hallucinated, respectively, and detects 97.1% of fabrications on the real-world set without abstaining. Code: https://github.com/aaFrostnova/CiteTracer.

preprint2026arXiv

Spatiotemporal Detection and Uncertainty Visualization of Atmospheric Blocking Events

Atmospheric blocking events are quasi-stationary high-pressure systems that disrupt the typical paths of polar and subtropical air currents, often producing prolonged extreme weather events such as summer heat waves or winter cold spells. Despite their critical role in shaping mid-latitude weather, accurately modeling and analyzing blocking events in long meteorological records remains a significant challenge. To address this challenge, we present an uncertainty visualization framework for detecting and characterizing atmospheric blocking events. First, we introduce a geometry-based detection and tracking method, evaluated on both pre-industrial climate model simulations (UKESM) and reanalysis data (ERA5), which represent historical Earth observations assimilated from satellite and station measurements onto regular numerical grids using weather models. Second, we propose a suite of uncertainty-aware summaries: contour boxplots that capture representative boundaries and their variability, frequency heatmaps that encode occurrences, and 3D temporal stacks that situate these patterns in time. Third, we demonstrate our framework in a case study of the 2003 European heatwave, mapping the spatiotemporal occurrences of blocking events using these summaries. Collectively, these uncertainty visualizations reveal where blocking events are most likely to occur and how their spatial footprints evolve over time. We envision our framework as a valuable tool for climate scientists and meteorologists: by analyzing how blocking frequency, duration, and intensity vary across regions and climate scenarios, it supports both the study of historical blocking events and the assessment of scenario-dependent climate risks associated with changes in extreme weather linked to blocking.

preprint2026arXiv

STAR-S: Improving Safety Alignment through Self-Taught Reasoning on Safety Rules

Defending against jailbreak attacks is crucial for the safe deployment of Large Language Models (LLMs). Recent research has attempted to improve safety by training models to reason over safety rules before responding. However, a key issue lies in determining what form of safety reasoning effectively defends against jailbreak attacks, which is difficult to explicitly design or directly obtain. To address this, we propose \textbf{STAR-S} (\textbf{S}elf-\textbf{TA}ught \textbf{R}easoning based on \textbf{S}afety rules), a framework that integrates the learning of safety rule reasoning into a self-taught loop. The core of STAR-S involves eliciting reasoning and reflection guided by safety rules, then leveraging fine-tuning to enhance safety reasoning. Repeating this process creates a synergistic cycle. Improvements in the model's reasoning and interpretation of safety rules allow it to produce better reasoning data under safety rule prompts, which is then utilized for further training. Experiments show that STAR-S effectively defends against jailbreak attacks, outperforming baselines. Code is available at: https://github.com/pikepokenew/STAR_S.git.

preprint2023arXiv

EZInterviewer: To Improve Job Interview Performance with Mock Interview Generator

Interview has been regarded as one of the most crucial step for recruitment. To fully prepare for the interview with the recruiters, job seekers usually practice with mock interviews between each other. However, such a mock interview with peers is generally far away from the real interview experience: the mock interviewers are not guaranteed to be professional and are not likely to behave like a real interviewer. Due to the rapid growth of online recruitment in recent years, recruiters tend to have online interviews, which makes it possible to collect real interview data from real interviewers. In this paper, we propose a novel application named EZInterviewer, which aims to learn from the online interview data and provides mock interview services to the job seekers. The task is challenging in two ways: (1) the interview data are now available but still of low-resource; (2) to generate meaningful and relevant interview dialogs requires thorough understanding of both resumes and job descriptions. To address the low-resource challenge, EZInterviewer is trained on a very small set of interview dialogs. The key idea is to reduce the number of parameters that rely on interview dialogs by disentangling the knowledge selector and dialog generator so that most parameters can be trained with ungrounded dialogs as well as the resume data that are not low-resource. Evaluation results on a real-world job interview dialog dataset indicate that we achieve promising results to generate mock interviews. With the help of EZInterviewer, we hope to make mock interview practice become easier for job seekers.

preprint2023arXiv

Follow the Timeline! Generating Abstractive and Extractive Timeline Summary in Chronological Order

Nowadays, time-stamped web documents related to a general news query floods spread throughout the Internet, and timeline summarization targets concisely summarizing the evolution trajectory of events along the timeline. Unlike traditional document summarization, timeline summarization needs to model the time series information of the input events and summarize important events in chronological order. To tackle this challenge, in this paper, we propose a Unified Timeline Summarizer (UTS) that can generate abstractive and extractive timeline summaries in time order. Concretely, in the encoder part, we propose a graph-based event encoder that relates multiple events according to their content dependency and learns a global representation of each event. In the decoder part, to ensure the chronological order of the abstractive summary, we propose to extract the feature of event-level attention in its generation process with sequential information remained and use it to simulate the evolutionary attention of the ground truth summary. The event-level attention can also be used to assist in extracting summary, where the extracted summary also comes in time sequence. We augment the previous Chinese large-scale timeline summarization dataset and collect a new English timeline dataset. Extensive experiments conducted on these datasets and on the out-of-domain Timeline 17 dataset show that UTS achieves state-of-the-art performance in terms of both automatic and human evaluations.

preprint2022arXiv

A simple parity violating model in the symmetric teleparallel gravity and its cosmological perturbations

The parity violating gravity models based on the symmetric teleparallel gravity have been considered in the literature, but their applications in cosmology and especially the modifications to cosmological perturbations have not been fully explored. In this paper we consider such a simple model which modifies general relativity by a parity non-conserved coupling, within the framework of the symmetric teleparallel gravity. We study in detail its cosmological applications and focus on its cosmological perturbation theory. Besides the already known parity violation in the tensor perturbations, we find that the vector perturbations in this model are promoted to be dynamical degrees of freedom, and the left- and right-handed vector modes propagate with different velocities. More importantly, we find that one of the vector modes is a ghost at high momentum scales, which will give rise to the problem of vacuum instability in the quantum theory of cosmological perturbations.

preprint2022arXiv

Ghost instability in the teleparallel gravity model with parity violations

In this paper we consider the parity violating gravity model within the framework of teleparallel gravity. The parity violations are caused by the couplings of a scalar field to the scalar invariants which are parity-odd and quadratic in the torsion tensor. Totally there are two such type independent invariants, and one of them is the Nieh-Yan density. Through investigations on the cosmological perturbations of this model, we find that in general it suffers from the difficulties of ghost instability in the scalar and vector perturbations. But in the special case only the coupling to the Nieh-Yan density exists, this model is ghost free and reduces to the Nieh-Yan modified Teleparallel Gravity model. We also analyze the severity of the ghost instability by studying the perturbations around the Minkowski background.

preprint2022arXiv

Possible consistent model of parity violations in the symmetric teleparallel gravity

Many parity violating gravity models suffer from the ghost instability problem. In this paper, we consider a symmetric teleparallel gravity model that extends the general relativity equivalent model by several parity violating interactions between the gravitational field and a scalar field. These interactions exclude higher derivatives and are quadratic in the nonmetricity tensor. Through investigations on the linear cosmological perturbations, our results show that, in general, this model suffers from the difficulty caused by the existence of ghost mode in the vector perturbations. However, in cases where a special condition is imposed on the coefficients of these parity violating interactions, this model can be ghost free.

preprint2022arXiv

Target-aware Abstractive Related Work Generation with Contrastive Learning

The related work section is an important component of a scientific paper, which highlights the contribution of the target paper in the context of the reference papers. Authors can save their time and effort by using the automatically generated related work section as a draft to complete the final related work. Most of the existing related work section generation methods rely on extracting off-the-shelf sentences to make a comparative discussion about the target work and the reference papers. However, such sentences need to be written in advance and are hard to obtain in practice. Hence, in this paper, we propose an abstractive target-aware related work generator (TAG), which can generate related work sections consisting of new sentences. Concretely, we first propose a target-aware graph encoder, which models the relationships between reference papers and the target paper with target-centered attention mechanisms. In the decoding process, we propose a hierarchical decoder that attends to the nodes of different levels in the graph with keyphrases as semantic indicators. Finally, to generate a more informative related work, we propose multi-level contrastive optimization objectives, which aim to maximize the mutual information between the generated related work with the references and minimize that with non-references. Extensive experiments on two public scholar datasets show that the proposed model brings substantial improvements over several strong baselines in terms of automatic and tailored human evaluations.

preprint2021arXiv

The design of the Ali CMB Polarization Telescope receiver

Ali CMB Polarization Telescope (AliCPT-1) is the first CMB degree-scale polarimeter to be deployed on the Tibetan plateau at 5,250m above sea level. AliCPT-1 is a 90/150 GHz 72 cm aperture, two-lens refracting telescope cooled down to 4 K. Alumina lenses, 800mm in diameter, image the CMB in a 33.4° field of view on a 636mm wide focal plane. The modularized focal plane consists of dichroic polarization-sensitive Transition-Edge Sensors (TESes). Each module includes 1,704 optically active TESes fabricated on a 150mm diameter silicon wafer. Each TES array is read out with a microwave multiplexing readout system capable of a multiplexing factor up to 2,048. Such a large multiplexing factor has allowed the practical deployment of tens of thousands of detectors, enabling the design of a receiver that can operate up to 19 TES arrays for a total of 32,376 TESes. AliCPT-1 leverages the technological advancements in the detector design from multiple generations of previously successful feedhorn-coupled polarimeters, and in the instrument design from BICEP-3, but applied on a larger scale. The cryostat receiver is currently under integration and testing. During the first deployment year, the focal plane will be populated with up to 4 TES arrays. Further TES arrays will be deployed in the following years, fully populating the focal plane with 19 arrays on the fourth deployment year. Here we present the AliCPT-1 receiver design, and how the design has been optimized to meet the experimental requirements.

preprint2020arXiv

The effects on CMB power spectra and bispectra from the polarization rotation and its correlations with temperature and E-polarization

The Chern-Simons term, through which the cosmic Axion-like field couples to the electromagnetic field, has the effect to rotate CMB polarization directions and to break the CPT symmetry. This rotation will change the CMB power spectra, no matter isotropic or anisotropic the rotation angle is. In this paper we revisit this issue by further considering the correlations between the (anisotropic) rotation angle $α$ and the CMB temperature and (unrotated) $E$ polarization fields. These correlations could be generated in the Axion-like models with nonzero potential under the adiabatic initial condition. We first investigate how these correlations contribute further modifications to the CMB power spectra, then calculate the CMB bispectra for the temperature and rotated polarization fields. These bispectra would vanish if the $Tα$ and $Eα$ correlations are absent. So, they are useful in searching for CPT violation and the $Tα$ and $Eα$ correlations arisen in the Axion-like models.

preprint2019arXiv

Joint constraint on primordial gravitational waves and polarization rotation angle with current CMB polarization data

Cosmological CPT violation will rotate the polarized direction of CMB photons, convert partial CMB E mode into B mode and vice versa. It will generate non-zero EB, TB spectra and change the EE, BB, TE spectra. This phenomenon gives us a way to detect the CPT-violation signature from CMB observations, and also provides a new mechanism to produce B mode polarization. In this paper, we perform a global analysis on tensor-to-scalar ratio $r$ and polarization rotation angles based on current CMB datasets with both low $\ell$ (Planck, BICEP2/Keck Array) and high $\ell$ (POLARBEAR, SPTpol, ACTPol). Benefited from the high precision of CMB data, we obtain the isotropic rotation angle $\barα = -0.01^\circ \pm 0.37^\circ $ at 68% C.L., the variance of the anisotropic rotation angles $C^α(0)<0.0032\,\mathrm{rad}^2$, the scale invariant power spectrum $D^{αα}_{\ell \in [2, 350]}<4.71\times 10^{-5} \,\mathrm{rad}^2$ and $r<0.057$ at 95% C.L.. Our result shows that with the polarization rotation effect, the 95% upper limit on $r$ gets tightened by 17%.