Researcher profile

Jiayu Li

Jiayu Li contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
18works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

18 published item(s)

preprint2026arXiv

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Multimodal Large Language Models (MLLMs) have significantly advanced document understanding, yet current Doc-VQA evaluations score only the final answer and leave the supporting evidence unchecked. This answer-only approach masks a critical failure mode: a model can land on the correct answer while grounding it in the wrong passage -- a critical risk in high-stakes domains like law, finance, and medicine, where every conclusion must be traceable to a specific source region. To address this, we introduce CiteVQA, a benchmark that requires models to return element-level bounding-box citations alongside each answer, evaluating both jointly. CiteVQA comprises 1,897 questions across 711 PDFs spanning seven domains and two languages, averaging 40.6 pages per document. To ensure fidelity and scalability, the ground-truth citations are generated by an automated pipeline-which identifies crucial evidence via masking ablation-and are subsequently validated through expert review. At the core of our evaluation is Strict Attributed Accuracy (SAA), which credits a prediction only when the answer and the cited region are both correct. Auditing 20 MLLMs reveals a pervasive Attribution Hallucination: models frequently produce the right answer while citing the wrong region. The strongest system (Gemini-3.1-Pro-Preview) achieves an SAA of only 76.0, and the strongest open-source MLLM reaches just 22.5. Ultimately, towards trustworthy document intelligence, CiteVQA exposes a reliability gap that answer-only evaluations overlook, providing the instrumentation needed to close it. Our repository is available at https://github.com/opendatalab/CiteVQA.

preprint2026arXiv

Dual-Channel Tensor Neural Networks: Finite-Sample Theory and Conformal Structure Selection

Tensor-valued data arise naturally in neuroimaging, genomics, climate science, and spatiotemporal networks, where multilinear dependencies across modes carry information that is destroyed under vectorization. Existing approaches either impose a single low-rank structure, which can miss localized signal, or treat the tensor as a long vector, which discards its multiway geometry. We propose a *Dual-Channel Tensor Neural Network* (DC-TNN) that decomposes each tensor input into a low-rank core and a sparse refinement, and processes the two components through coupled neural channels. The framework is structure-agnostic and accommodates CP, Tucker, and tensor-train cores within a single architecture. For estimation, we establish non-asymptotic risk bounds for the DC-TNN estimator that decompose into network approximation, core estimation, and refinement-selection terms, and show that the effective dimension is determined jointly by the core rank and refinement sparsity rather than by the ambient tensor size. For inference, we develop a *structure-aware conformal ROC* procedure that calibrates within the core-refinement latent space and produces ROC and AUC confidence bands with finite-sample, distribution-free coverage. Building on this, we propose a *conformal structure selector* that, to our knowledge, is the *first distribution-free procedure* for choosing among candidate tensor decompositions with finite-sample validity. Simulations and an analysis of a protein dataset demonstrate competitive predictive accuracy, reliable uncertainty quantification, and consistent recovery of the tensor structure.

preprint2024arXiv

LEFormer: A Hybrid CNN-Transformer Architecture for Accurate Lake Extraction from Remote Sensing Imagery

Lake extraction from remote sensing images is challenging due to the complex lake shapes and inherent data noises. Existing methods suffer from blurred segmentation boundaries and poor foreground modeling. This paper proposes a hybrid CNN-Transformer architecture, called LEFormer, for accurate lake extraction. LEFormer contains three main modules: CNN encoder, Transformer encoder, and cross-encoder fusion. The CNN encoder effectively recovers local spatial information and improves fine-scale details. Simultaneously, the Transformer encoder captures long-range dependencies between sequences of any length, allowing them to obtain global features and context information. The cross-encoder fusion module integrates the local and global features to improve mask prediction. Experimental results show that LEFormer consistently achieves state-of-the-art performance and efficiency on the Surface Water and the Qinghai-Tibet Plateau Lake datasets. Specifically, LEFormer achieves 90.86% and 97.42% mIoU on two datasets with a parameter count of 3.61M, respectively, while being 20 minor than the previous best lake extraction method. The source code is available at https://github.com/BastianChen/LEFormer.

preprint2023arXiv

Linguistic-style-aware Neural Networks for Fake News Detection

We propose the hierarchical recursive neural network (HERO) to predict fake news by learning its linguistic style, which is distinguishable from the truth, as psychological theories reveal. We first generate the hierarchical linguistic tree of news documents; by doing so, we translate each news document's linguistic style into its writer's usage of words and how these words are recursively structured as phrases, sentences, paragraphs, and, ultimately, the document. By integrating the hierarchical linguistic tree with the neural network, the proposed method learns and classifies the representation of news documents by capturing their locally sequential and globally recursive structures that are linguistically meaningful. It is the first work offering the hierarchical linguistic tree and the neural network preserving the tree information to our best knowledge. Experimental results based on public real-world datasets demonstrate the proposed method's effectiveness, which can outperform state-of-the-art techniques in classifying short and long news documents. We also examine the differential linguistic style of fake news and the truth and observe some patterns of fake news. The code and data have been publicly available.

preprint2022arXiv

3D Morphology of Open Clusters in the Solar Neighborhood with Gaia EDR3 II: Hierarchical Star Formation Revealed by Spatial and Kinematic Substructures

We identify members of 65 open clusters in the solar neighborhood using the machine-learning algorithm StarGO based on Gaia EDR3 data. After adding members of twenty clusters from previous studies (Pang et al. 2021a,b; Li et al. 2021) we obtain 85 clusters, and study their morphology and kinematics. We classify the substructures outside the tidal radius into four categories: filamentary (f1) and fractal (f2) for clusters $<100$ Myr, and halo (h) and tidal-tail (t) for clusters $>100$ Myr. The kinematical substructures of f1-type clusters are elongated; these resemble the disrupted cluster Group X. Kinematic tails are distinct in t-type clusters, especially Pleiades. We identify 29 hierarchical groups in four young regions (Alessi 20, IC 348, LP 2373, LP 2442); ten among these are new. The hierarchical groups form filament networks. Two regions (Alessi 20, LP 2373) exhibit global &#34;orthogonal&#34; expansion (stellar motion perpendicular to the filament), which might cause complete dispersal. Infalling-like flows (stellar motion along the filament) are found in UBC 31 and related hierarchical groups in the IC 348 region. Stellar groups in the LP 2442 region (LP 2442 gp 1-5) are spatially well-mixed but kinematically coherent. A merging process might be ongoing in the LP 2442 subgroups. For younger systems ($\lesssim30$ Myr), the mean axis ratio, cluster mass and half-mass radius tend to increase with age values. These correlations between structural parameters may imply two dynamical processes occurring in the hierarchical formation scenario in young stellar groups: (1) filament dissolution and (2) sub-group mergers.

preprint2022arXiv

A blow-up formula for stationary quaternionic maps

Let $(M, J^α, α=1,2,3)$ and $(N, {\cal J}^α, α=1,2,3)$ be Hyperkähler manifolds. Suppose that $u_k$ is a sequence of stationary quaternionic maps and converges weakly to $u$ in $H^{1,2}(M,N)$, we derive a blow-up formula for $\lim_{k\to\infty}d(u_k^*{\cal J}^α)$, for $α=1,2,3$, in the weak sense. As a corollary, we show that the maps constructed by Chen-Li [CL2] and by Foscolo [F] can not be tangent maps (c.f [LT], Theorem 3.1) of a stationary quaternionic map satisfing $d(u^*{\cal J}^α)=0$.

preprint2022arXiv

Active control of thermal emission by graphene-nanowire coupled plasmonic metasurfaces

Metasurfaces, together with graphene plasmonics, have become prominent for the emissivity control in thermal engineering, both passively through changing the geometric parameters and packing density of the metasurfaces, and actively through graphene gating or doping. We demonstrate a graphene-nanowire coupled plasmonic metasurface utilizing the hybrid localized surface plasmon modes of the nanowire array and graphene. The nanowire array makes the hybrid surface plasmon mode localized, allowing a free-space excitation. The single layer graphene, via the gating between the underneath mirror and a top electrode, can actively tune the spectral emissivity by almost 90%. In addition, the hybrid plasmon mode provides an extra degree of freedom to modulate the p-polarized emissivity with a five-fold enhancement, especially for large emission angles.

preprint2022arXiv

Boundary value problem for the mean field equation on a compact Riemann surface

Let $(Σ,g)$ be a compact Riemann surface with smooth boundary $\partialΣ$, $Δ_g$ be the Laplace-Beltrami operator, and $h$ be a positive smooth function. Using a min-max scheme introduced by Djadli-Malchiodi (2006) and Djadli (2008), we prove that if $Σ$ is non-contractible, then for any $ρ\in(8kπ,8(k+1)π)$ with $k\in\mathbb{N}^\ast$, the mean field equation $$\left\{\begin{array}{lll} Δ_g u=ρ\frac{he^u}{\int_Σhe^udv_g}&{\rm in}&Σ\\[1.5ex] u=0&{\rm on}&\partialΣ\end{array}\right.$$ has a solution. This generalizes earlier existence results of Ding-Jost-Li-Wang (1999) and Chen-Lin (2003) in the Euclidean domain. Also we consider the corresponding Neumann boundary value problem. If $h$ is a positive smooth function, then for any $ρ\in(4kπ,4(k+1)π)$ with $k\in\mathbb{N}^\ast$, the mean field equation $$\left\{\begin{array}{lll} Δ_g u=ρ\left(\frac{he^u}{\int_Σhe^udv_g}-\frac{1}{|Σ|}\right)&{\rm in}&Σ\\[1.5ex] \partial u/\partial{\mathbf{v}}=0&{\rm on}&\partialΣ\end{array}\right.$$ has a solution, where $\mathbf{v}$ denotes the unit normal outward vector on $\partialΣ$. Note that in this case we do not require the surface to be non-contractible.

preprint2022arXiv

Designing light-element materials with large effective spin-orbit coupling

Spin-orbit coupling (SOC), the core of numerous condensed-matter phenomena such as nontrivial band gap, magnetocrystalline anisotropy, etc, is generally considered to be appreciable only in heavy elements, detrimental to the synthetization and application of functional materials. Therefore, amplifying the SOC effect in light elements is of great importance. Here, focusing on 3d and 4d systems, we demonstrate that the interplay between crystal symmetry and electron correlation can dramatically enhance the SOC effect in certain partially occupied orbital multiplets, through the self-consistently reinforced orbital polarization as a pivot. We then provide design principles and comprehensive databases, in which we list all the Wyckoff positions and site symmetries, in all two-dimensional (2D) and three-dimensional crystals that potentially have such enhanced SOC effect. As an important demonstration, we predict nine material candidates from our selected 2D material pool as high-temperature quantum anomalous Hall insulators with large nontrivial band gaps of hundreds of meV. Our work provides an efficient and straightforward way to predict promising SOC-active materials, releasing the burden of requiring heavy elements for next-generation spin-orbitronic materials and devices.

preprint2022arXiv

Role of hidden spin polarization in non-reciprocal transport of antiferromagnets

The discovery of hidden spin polarization (HSP) in centrosymmetric nonmagnetic crystals, i.e., spatially distributed spin polarization originated from local symmetry breaking, has promised an expanded material pool for future spintronics. However, the measurements of such exotic effects have been limited to subtle space- and momentum-resolved techniques, unfortunately hindering their applications. Here, we theoretically predict macroscopic non-reciprocal transports induced by HSP when coupling another spatially distributed quantity, such as staggered local moments in a PT-symmetric anti-ferromagnet. By using a four-band model Hamiltonian, we demonstrate that HSP plays a crucial role in determining the asymmetric bands with respect to opposite momenta. Such band asymmetry leads to non-reciprocal nonlinear conductivity, exemplified by tetragonal CuMnAs via first-principles calculations. We further provide the material design principles for large nonlinear conductivity, including two-dimensional nature, multiple band crossings near the Fermi level, and symmetry protected HSP. Our work not only reveals direct spintronic applications of HSP (such as Néel order detection), but also sheds light on finding observables of other &#39;&#39;hidden effects&#39;&#39;, such as hidden optical polarization and hidden Berry curvature.

preprint2022arXiv

The $C^0$-convergence at the Neumann boundary for Liouville equations

In this paper, we study the blow-up analysis for a sequence of solutions to the Liouville type equation with exponential Neumann boundary condition. For interior case, i.e. the blow-up point is an interior point, Li \cite{Li} gave a uniform asymptotic estimate. Later, Zhang \cite{Zhang} and Gluck \cite{Gluck} improved Li&#39;s estimate in the sense of $C^0$-convergence by using the method of moving planes or classification of solutions of the linearized version of Liouville equation. If the sequence blows up at a boundary point, Bao-Wang-Zhou \cite{Bao-Wang-Zhou} proved a similar asymptotic estimate of Li \cite{Li}. In this paper, we will prove a $C^0$-convergence result in this boundary blow-up process. Our method is different from \cite{Zhang,Gluck}.

preprint2022arXiv

Topological Magnetoelectric Response in Ferromagnetic Axion Insulators

Topological magnetoelectric effect (TME) is a hallmark response of the topological field theory, which provides a paradigm shift in the study of emergent topological phenomena. However, its direct observation is yet to be realized due to the demanding magnetic configuration required to gap all the surface states. Here, we theoretically propose that the axion insulators with a simple ferromagnetic configuration, such as MnBi2Te4/(Bi2Te3)n family, provide an ideal playground to realize TME. In a designed triangular prism geometry, all the surface states are magnetically gapped. Under a vertical electric field, the surface Hall currents give rise to a nearly half-quantized orbital moment, accompanied with a gapless chiral hinge mode circulating parallelly. Thus, the orbital magnetization from the two topological origins can be easily distinguished by reversing the electric field. Our work paves a new avenue towards the direct observation of TME in realistic axion-insulator materials.

preprint2020arXiv

Distinct Topological Surface States on the Two Terminations of MnBi$_4$Te$_7$

The recent discovered intrinsic magnetic topological insulator MnBi2Te4 have been met with unusual success in hosting emergent phenomena such as the quantum anomalous Hall effect and the axion insulator states. However, the surface-bulk correspondence of the Mn-Bi-Te family, composed by the superlattice-like MnBi2Te4/(Bi2Te3)n (n = 0, 1, 2, 3 ...) layered structure, remains intriguing but elusive. Here, by using scanning tunneling microscopy (STM) and angle-resolved photoemission spectroscopy (ARPES) techniques, we unambiguously assign the two distinct surface states of MnBi4Te7 (n = 1) to the quintuple-layer (QL) Bi2Te3 termination and the septuple-layer (SL) MnBi2Te4 termination, respectively. A comparison of the experimental observations with theoretical calculations reveals the diverging topological behaviors, especially the hybridization effect between magnetic and nonmagnetic layers, on the two terminations: a gap on the QL termination originating from the topological surface states of the QL hybridizing with the bands of the beneath SL, and a gapless Dirac-cone band structure on the SL termination with time-reversal symmetry. The quasi-particle interference patterns further confirm the topological nature of the surface states for both terminations, continuing far above the Fermi energy. The QL termination carries a spin-helical Dirac state with hexagonal warping, while at the SL termination, a strongly canted helical state from the surface lies between a pair of Rashba-split states from its neighboring layer. Our work elucidates an unprecedented hybridization effect between the building blocks of the topological surface states, and also reveals the termination-dependent time-reversal symmetry breaking in a magnetic topological insulator, rendering an ideal platform to realize the half-integer quantum Hall effect and relevant quantum phenomena.

preprint2020arXiv

Enhanced Secrecy Rate Maximization for Directional Modulation Networks via IRS

Intelligent reflecting surface (IRS) is of low-cost and energy-efficiency and will be a promising technology for the future wireless communications like sixth generation. To address the problem of conventional directional modulation (DM) that Alice only transmits single confidential bit stream (CBS) to Bob with multiple antennas in a line-of-sight channel, IRS is proposed to create friendly multipaths for DM such that two CBSs can be transmitted from Alice to Bob. This will significantly enhance the secrecy rate (SR) of DM. To maximize the SR (Max-SR), a general non-convex optimization problem is formulated with the unit-modulus constraint of IRS phase-shift matrix (PSM), and the general alternating iterative (GAI) algorithm is proposed to jointly obtain the transmit beamforming vectors (TBVs) and PSM by alternately optimizing one and fixing another. To reduce its high complexity, a low-complexity iterative algorithm for Max-SR is proposed by placing the constraint of null-space (NS) on the TBVs, called NS projection (NSP). Here, each CBS is transmitted separately in the NSs of other CBS and AN channels. Simulation results show that the SRs of the proposed GAI and NSP can approximately double that of IRS-based DM with single CBS for massive IRS in the high signal-to-noise ratio region.

preprint2020arXiv

Half-Magnetic Topological Insulator

Topological magnets are a new family of quantum materials providing great potential to realize emergent phenomena, such as quantum anomalous Hall effect and axion-insulator state. Here we present our discovery that stoichiometric ferromagnet MnBi8Te13 with natural heterostructure MnBi2Te4-(Bi2Te3)3 is an unprecedented half-magnetic topological insulator, with the magnetization existing at the MnBi2Te4 surface but not at the opposite surface terminated by triple Bi2Te3 layers. Our angle-resolved photoemission spectroscopy measurements unveil a massive Dirac gap at the MnBi2Te4 surface, and gapless Dirac cone on the other side. Remarkably, the Dirac gap (~28 meV) at MnBi2Te4 surface decreases monotonically with increasing temperature and closes right at the Curie temperature, thereby representing the first smoking-gun spectroscopic evidence of magnetization-induced topological surface gap among all known magnetic topological materials. We further demonstrate theoretically that the half-magnetic topological insulator is desirable to realize the half-quantized surface anomalous Hall effect, which serves as a direct proof of the general concept of axion electrodynamics in condensed matter systems.

preprint2020arXiv

Realization and transport investigation of a single layer-twisted bilayer graphene junction

We report on low-temperature transport study of a single layer graphene (SLG)-twisted bilayer graphene (tBLG) junction device. The SLG-tBLG junction in the device is grown by chemical vapor deposition and the device is fabricated in a Hall-bar configuration on Si/SiO$_2$ substrate. The longitudinal resistances across the SLG-tBLG junction (cross-junction resistances) on the two sides of the Hall bar and the Hall resistances of SLG and tBLG in the device are measured. In the quantum Hall regime, the measurements show that the measured cross-junction resistances exhibit a series of new quantized plateaus and the appearance of these resistance plateaus can be attributed to the presence of the well-defined edge-channel transport along the SLG-tBLG junction interface. The measurements also show that the difference between the cross-junction resistances measured on the two sides of the Hall-bar provides a sensitive measure to the edge channel transport characteristics in the two graphene layers that constitute the SLG-tBLG junction and to degeneracy lifting of the Landau levels in the tBLG layer. Temperature dependent measurements of the cross-junction resistance in the quantum Hall regime are also carried out and the influence of the transverse transport of the bulk Landau levels on the edge channel transport along the SLG-tBLG junction interface are extracted. These results enrich the understanding of the charge transport across interfaces in graphene hybrid structures and open up new opportunities for probing exotic quantum phenomena in graphene devices.

preprint2020arXiv

Secure Multigroup Multicast Communication Systems via Intelligent Reflecting Surface

This paper considers a secure multigroup multicast multiple-input single-output (MISO) communication system aided by an intelligent reflecting surface (IRS). Specifically, we aim to minimize the transmit power at the Alice via jointly optimizing the transmit beamformer, AN vector and phase shifts at the IRS subject to the secrecy rate constraints as well as the unit modulus constraints of IRS phase shifts. However, the optimization problem is non-convex and directly solving it is intractable. To tackle the optimization problem, we first transform it into a semidefinite relaxation (SDR) problem, and then alternately update the transmit beamformer and AN matrix as well as the phase shifts at the IRS. In order to reduce the high computational complexity, we further propose a low-complexity algorithm based on second-order cone programming (SOCP). We decouple the optimization problem into two sub-problems and optimize the transmit beamformer, AN vector and the phase shifts alternately by solving two corresponding SOCP sub-problem. Simulation results show that the proposed SDR and SOCP schemes require half or less transmit power than the scheme without IRS, which demonstrates the advantages of introducing IRS and the effectiveness of the proposed methods.