Researcher profile

Lu Wei

Lu Wei contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
14topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

How well can off-the-shelf LLMs elucidate molecular structures from mass spectra using chain-of-thought reasoning?

Mass spectrometry (MS) is a powerful analytical technique for identifying small molecules, yet determining complete molecular structures directly from tandem mass spectra (MS/MS) remains a long-standing challenge due to complex fragmentation patterns and the vast diversity of chemical space. Recent progress in large language models (LLMs) has shown promise for reasoning-intensive scientific tasks, but their capability for chemical interpretation is still unclear. In this work, we introduce a Chain-of-Thought (CoT) prompting framework and benchmark that evaluate how LLMs reason about mass spectral data to predict molecular structures. We formalize expert chemists' reasoning steps-such as double bond equivalent (DBE) analysis, neutral loss identification, and fragment assembly-into structured prompts and assess multiple state-of-the-art LLMs (Claude-3.5-Sonnet, GPT-4o-mini, and Llama-3 series) in a zero-shot setting using the MassSpecGym dataset. Our evaluation across metrics of SMILES validity, formula consistency, and structural similarity reveals that while LLMs can produce syntactically valid and partially plausible structures, they fail to achieve chemical accuracy or link reasoning to correct molecular predictions. These findings highlight both the interpretive potential and the current limitations of LLM-based reasoning for molecular elucidation, providing a foundation for future work that combines domain knowledge and reinforcement learning to achieve chemically grounded AI reasoning.

preprint2022arXiv

Modeling and Analysis of Intermittent Federated Learning Over Cellular-Connected UAV Networks

Federated learning (FL) is a promising distributed learning technique particularly suitable for wireless learning scenarios since it can accomplish a learning task without raw data transportation so as to preserve data privacy and lower network resource consumption. However, current works on FL over wireless networks do not profoundly study the fundamental performance of FL over wireless networks that suffers from communication outage due to channel impairment and network interference. To accurately exploit the performance of FL over wireless networks, this paper proposes a novel intermittent FL model over a cellular-connected unmanned aerial vehicle (UAV) network, which characterizes communication outage from UAV (clients) to their server and data heterogeneity among the datasets at UAVs. We propose an analytically tractable framework to derive the uplink outage probability and use it to devise a simulation-based approach so as to evaluate the performance of the proposed intermittent FL model. Our findings reveal how the intermittent FL model is impacted by uplink communication outage and UAV deployment. Extensive numerical simulations are provided to show the consistency between the simulated and analytical performances of the proposed intermittent FL model.

preprint2022arXiv

Spatio-Temporal Federated Learning for Massive Wireless Edge Networks

This paper presents a novel approach to conduct highly efficient federated learning (FL) over a massive wireless edge network, where an edge server and numerous mobile devices (clients) jointly learn a global model without transporting the huge amount of data collected by the mobile devices to the edge server. The proposed FL approach is referred to as spatio-temporal FL (STFL), which jointly exploits the spatial and temporal correlations between the learning updates from different mobile devices scheduled to join STFL in various training epochs. The STFL model not only represents the realistic intermittent learning behavior from the edge server to the mobile devices due to data delivery outage, but also features a mechanism of compensating loss learning updates in order to mitigate the impacts of intermittent learning. An analytical framework of STFL is proposed and employed to study the learning capability of STFL via its convergence performance. In particular, we have assessed the impact of data delivery outage, intermittent learning mitigation, and statistical heterogeneity of datasets on the convergence performance of STFL. The results provide crucial insights into the design and analysis of STFL-based wireless networks.

preprint2022arXiv

Toward Ubiquitous and Flexible Coverage of UAV-IRS-Assisted NOMA Networks

This paper studies how to achieve a high and flexible coverage performance of a large-scale cellular network that enables unmanned aerial vehicles (UAVs) for non-orthogonal multiple access (NOMA) transmission to simultaneously serve multiple users. The considered cellular network consists of a tier of base stations and a tier of UAVs. Each UAV is mounted with an intelligent reflecting surface (IRS) in order to serve as an aerial IRS reflecting signals between a base station and a user in the network. All the UAVs in the network are deployed based on a newly proposed three-dimensional (3D) point process that leads to a tractable and accurate analysis of the association statistics, which is traditionally difficult to analyze due to the mobility of UAVs. In light of this, we are able to analyze the downlink coverage of UAV-IRS-assisted NOMA transmission for two users and derive the corresponding coverage probabilities. Our coverage analyses shed light on the optimal allocations of transmit power between NOMA users and UAVs to accomplish the goal of ubiquitous and flexible NOMA transmission. We also conduct numerical simulations to validate our coverage analytical results while demonstrating the improved coverage performance achieved by aerial IRSs.

preprint2021arXiv

Second-order statistics of fermionic Gaussian states

We study the statistical behavior of entanglement in quantum bipartite systems over fermionic Gaussian states as measured by von Neumann entropy and entanglement capacity. The focus is on the variance of von Neumann entropy and the mean entanglement capacity that belong to the so-defined second-order statistics. The main results are the exact yet explicit formulas of the two considered second-order statistics for fixed subsystem dimension differences. We also conjecture the exact variance of von Neumann entropy valid for arbitrary subsystem dimensions. Based on the obtained results, we analytically study the numerically observed phenomena of Gaussianity of von Neumann entropy and linear growth of average capacity.

preprint2020arXiv

Entanglement Area Law for Shallow and Deep Quantum Neural Network States

A study of the artificial neural network representation of quantum many-body states is presented. The locality and entanglement properties of states for shallow and deep quantum neural networks are investigated in detail. By introducing the notion of local quasi-product states, for which the locally connected shallow feed-forward neural network states and restricted Boltzmann machine states are special cases, we show that Rényi entanglement entropies of all these states obey the entanglement area law. Besides, we also investigate the entanglement features of deep Boltzmann machine states and show that locality constraints imposed on the neural networks make the states obey the entanglement area law. Finally, as an application, we apply the notion of Rényi entanglement entropy to understanding the power of neural networks and show that image classification problems which can be efficiently solved must obey the area law.

preprint2020arXiv

Exact variance of von Neumann entanglement entropy over the Bures-Hall measure

The Bures-Hall distance metric between quantum states is a unique measure that satisfies various useful properties for quantum information processing. In this work, we study the statistical behavior of quantum entanglement over the Bures-Hall ensemble as measured by von Neumann entropy. The average von Neumann entropy over such an ensemble has been recently obtained, whereas the main result of this work is an explicit expression of the corresponding variance that specifies the fluctuation around its average. The starting point of the calculations is the connection between correlation functions of the Bures-Hall ensemble and these of the Cauchy-Laguerre ensemble. The derived variance formula, together with the known mean formula, leads to a simple but accurate Gaussian approximation to the distribution of von Neumann entropy of finite-size systems. This Gaussian approximation is also conjectured to be the limiting distribution for large dimensional systems.

preprint2020arXiv

Proof of Sarkar-Kumar's Conjectures on Average Entanglement Entropies over the Bures-Hall Ensemble

Sarkar and Kumar recently conjectured [J. Phys. A: Math. Theor. $\textbf{52}$, 295203 (2019)] that for a bipartite system of Hilbert dimension $mn$, the mean values of quantum purity and von Neumann entropy of a subsystem of dimension $m\leq n$ over the Bures-Hall measure are given by \begin{equation*} \frac{2n(2n+m)-m^{2}+1}{2n(2mn-m^2+2)} \end{equation*} and \begin{equation*} ψ_{0}\left(mn-\frac{m^2}{2}+1\right)-ψ_{0}\left(n+\frac{1}{2}\right), \end{equation*} respectively, where $ψ_{0}(\cdot)$ is the digamma function. We prove the above conjectured formulas in this work. A key ingredient of the proofs is Forrester and Kieburg's discovery on the connection between the Bures-Hall ensemble and the Cauchy-Laguerre biorthogonal ensemble studied by Bertola, Gekhtman, and Szmigielski.

preprint2019arXiv

Skewness of von Neumann entanglement entropy

We study quantum bipartite systems in a random pure state, where von Neumann entropy is considered as a measure of the entanglement. Expressions of the first and second exact cumulants of von Neumann entropy, relevant respectively to the average and fluctuation behavior, are known in the literature. The focus of this paper is on its skewness that specifies the degree of asymmetry of the distribution. Computing the skewness requires additionally the third cumulant, an exact formula of which is the main result of this work. In proving the main result, we obtain as a byproduct various summation identities involving polygamma and related functions. The derived third cumulant also leads to an improved approximation to the distribution of von Neumann entropy.

preprint2018arXiv

On the exact variance of Tsallis entropy in a random pure state

Tsallis entropy is a useful one-parameter generalization of the standard von Neumann entropy in information theory. We study the variance of Tsallis entropy of bipartite quantum systems in a random pure state. The main result is an exact variance formula of Tsallis entropy that involves finite sums of some terminating hypergeometric functions. In the special cases of quadratic entropy and small subsystem dimensions, the main result is further simplified to explicit variance expressions. As a byproduct, we find an independent proof of the recently proved variance formula of von Neumann entropy based on the derived moment relation to the Tsallis entropy.