Source author record

Zhiyuan Wang

Zhiyuan Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

19works

22topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

$A^3$-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation

Scientific reasoning relies not only on logical inference but also on activating prior knowledge and experiential structures. Memory can efficiently reuse knowledge and enhance reasoning consistency and stability. However, existing benchmarks mainly evaluate final answers or step-by-step coherence, overlooking the \textit{memory-driven} mechanisms that underlie human reasoning, which involves activating anchors and attractors, then integrating them into multi-step inference. To address this gap, we propose $A^3$-Bench~ https://a3-bench.github.io, a benchmark designed to evaluate scientific reasoning through dual-scale memory-driven activation, grounded in Anchor and Attractor Activation. First, we annotate 2,198 science reasoning problems across domains using the SAPM process(subject, anchor & attractor, problem, and memory developing). Second, we introduce a dual-scale memory evaluation framework utilizing anchors and attractors, along with the AAUI(Anchor--Attractor Utilization Index) metric to measure memory activation rates. Finally, through experiments with various base models and paradigms, we validate $A^3$-Bench and analyze how memory activation impacts reasoning performance, providing insights into memory-driven scientific reasoning.

preprint2026arXiv

BalanceRAG: Joint Risk Calibration for Cascaded Retrieval-Augmented Generation

Large language models (LLMs) can enhance factuality via retrieval-augmented generation (RAG), but applying RAG to every query is unnecessary when the model-only answer is reliable. This motivates cascaded RAG: each query is first handled by an LLM-only branch, escalated to a RAG fallback only if the primary branch is uncertain, and abstained from when neither branch is sufficiently trustworthy. However, calibrating such cascades stage by stage may be conservative, since the final utility depends on joint uncertainty thresholding of LLM-only and RAG. In this work, we develop BalanceRAG to certify threshold pairs at a target risk level. Given uncertainty scores from the two branches, BalanceRAG frames each threshold pair as an operating point on a two-dimensional lattice and identifies safe operating points using sequential graphical testing. This enables risk-adaptive threshold calibration, controlling the system-level error rate among accepted points, while retaining more examples. Furthermore, BalanceRAG extends to multi-risk calibration, allowing retrieval usage to be bounded together with the selection-conditioned risk. Experiments on three open-domain question answering (QA) benchmarks across multiple LLM backbones demonstrate that BalanceRAG meets prescribed risk levels, preserves higher coverage and more accepted correct examples, and reduces unnecessary retrieval calls compared with always-on RAG.

preprint2026arXiv

Decoupling Semantics and Fingerprints: A Universal Representation for AI-Generated Image Detection

Detecting AI-generated images across unseen architectures remains challenging, as existing models often overfit to generator-specific fingerprints and semantic content rather than learning universal forgery traces. We attribute this failure to feature entanglement: detectors learn these factors as a single entangled representation, where universal forgery traces are inextricably confounded with both generator-specific fingerprints and semantic content. Crucially, our spectral analysis reveals that this entanglement is avoidable: distinct generator-specific fingerprints (e.g., GAN stripes vs. Diffusion Model spots) occupy disjoint frequency subspaces and coexist as independent superpositions. Leveraging this physical orthogonality, we propose the Orthogonal Decomposition and Purification Network (ODP-Net) to structurally disentangle these factors. Specifically, ODP-Net employs (1) Instance-aware Orthogonal Decomposition to project features into mutually exclusive subspaces: universal forgery traces, generator-specific fingerprints, and semantic content; (2) Perturbation-based Purification to enforce semantic invariance via cross-sample feature injection; and (3) Manifold Alignment to bridge domain gaps. By explicitly decoupling universal forgery traces from generator-specific fingerprints and semantic content, ODP-Net achieves state-of-the-art performance on unseen architectures (e.g., Stable Diffusion 3), validating that structural disentanglement is key to generalization.

preprint2026arXiv

MAXS: Meta-Adaptive Exploration with LLM Agents

Large Language Model (LLM) Agents exhibit inherent reasoning abilities through the collaboration of multiple tools. However, during agent inference, existing methods often suffer from (i) locally myopic generation, due to the absence of lookahead, and (ii) trajectory instability, where minor early errors can escalate into divergent reasoning paths. These issues make it difficult to balance global effectiveness and computational efficiency. To address these two issues, we propose meta-adaptive exploration with LLM agents https://github.com/exoskeletonzj/MAXS, a meta-adaptive reasoning framework based on LLM Agents that flexibly integrates tool execution and reasoning planning. MAXS employs a lookahead strategy to extend reasoning paths a few steps ahead, estimating the advantage value of tool usage, and combines step consistency variance and inter-step trend slopes to jointly select stable, consistent, and high-value reasoning steps. Additionally, we introduce a trajectory convergence mechanism that controls computational cost by halting further rollouts once path consistency is achieved, enabling a balance between resource efficiency and global effectiveness in multi-tool reasoning. We conduct extensive empirical studies across three base models (MiMo-VL-7B, Qwen2.5-VL-7B, Qwen2.5-VL-32B) and five datasets, demonstrating that MAXS consistently outperforms existing methods in both performance and inference efficiency. Further analysis confirms the effectiveness of our lookahead strategy and tool usage.

preprint2026arXiv

PULSE: Agentic Investigation with Passive Sensing for Proactive Intervention in Cancer Survivorship

Cancer survivors face elevated rates of depression, anxiety, and general emotional distress, yet the precise moments they most need support are often the moments when self-report is sparse, a phenomenon we term the diary paradox. Passive smartphone sensing offers a continuous, unobtrusive alternative, but prior sensing-based affect prediction has been limited by an accuracy ceiling, suggesting a bottleneck not only in available data, but in how behavioral signals are interpreted. We present PULSE, a system that shifts from fixed feature pipelines to agentic sensing investigation: LLM agents equipped with eight purpose-built tools autonomously query smartphone sensing data, compare current behavior against personalized baselines, and calibrate inferences through retrieval-augmented population-level comparisons. Rather than receiving pre-formatted feature summaries, agents decide which modalities to inspect, how far back to look, and how deeply to investigate, mirroring hypothesis-driven clinical reasoning. We evaluate PULSE through a 2*2 factorial design crossing reasoning architecture (structured vs. agentic) with data modality (sensing-only vs. with diary) on 50 cancer survivors from a longitudinal study of cancer survivors. Agentic reasoning is the primary driver of performance: agentic multimodal agent achieves balanced accuracy of 0.743 for emotion regulation desire with diary and sensing data, while agentic agents predict intervention availability at 0.713 with passive sensing data only. These results suggest that agentic investigation may be a cornerstone for unlocking the clinical value of passive sensing, advancing the feasibility of proactive just-in-time mental health support.

preprint2026arXiv

Subjective-Objective Median-based Importance Technique (SOMIT) to Aid Multi-Criteria Renewable Energy Evaluation

Accelerating the renewable energy transition requires informed decision-making that accounts for the diverse financial, technical, environmental, and social trade-offs across different renewable energy technologies. A critical step in this multi-criteria decision-making (MCDM) process is the determination of appropriate criteria weights. However, deriving these weights often solely involves either subjective assessment from decision-makers or objective weighting methods, each of which has limitations in terms of cognitive burden, potential bias, and insufficient contextual relevance. This study proposes the subjective-objective median-based importance technique (SOMIT), a novel hybrid approach for determining criteria weights in MCDM. By tailoring SOMIT to renewable energy evaluation, the method directly supports applied energy system planning, policy analysis, and technology prioritization under carbon neutrality goals. The practical utility of SOMIT is demonstrated through two MCDM case studies on renewable energy decision-making in India and Saudi Arabia. Using the derived weights from SOMIT, the TOPSIS method ranks the renewable energy alternatives, with solar power achieving the highest performance scores in both cases. The main contributions of this work are five-fold: 1) the proposed SOMIT reduces the number of required subjective comparisons from the conventional quadratic order to a linear order; 2) SOMIT is more robust to outliers in the alternatives-criteria matrix (ACM); 3) SOMIT balances subjective expert knowledge with objective data-driven insights, thereby mitigating bias; 4) SOMIT is inherently modular, allowing both its individual parts and the complete approach to be seamlessly coupled with a wide range of MCDM methods commonly applied in energy systems and policy analysis; 5) a dedicated Python library, pysomit, is developed for SOMIT.

preprint2022arXiv

AACC: Asymmetric Actor-Critic in Contextual Reinforcement Learning

Reinforcement Learning (RL) techniques have drawn great attention in many challenging tasks, but their performance deteriorates dramatically when applied to real-world problems. Various methods, such as domain randomization, have been proposed to deal with such situations by training agents under different environmental setups, and therefore they can be generalized to different environments during deployment. However, they usually do not incorporate the underlying environmental factor information that the agents interact with properly and thus can be overly conservative when facing changes in the surroundings. In this paper, we first formalize the task of adapting to changing environmental dynamics in RL as a generalization problem using Contextual Markov Decision Processes (CMDPs). We then propose the Asymmetric Actor-Critic in Contextual RL (AACC) as an end-to-end actor-critic method to deal with such generalization tasks. We demonstrate the essential improvements in the performance of AACC over existing baselines experimentally in a range of simulated environments.

preprint2022arXiv

BKP Hierarchy, Affine Coordinates, and a Formula for Connected Bosonic $N$-Point Functions

We derive a formula for the connected $n$-point functions of a tau-function of the BKP hierarchy in terms of its affine coordinates. This is a BKP-analogue of a formula for KP tau-functions proved by Zhou in [arXiv:1507.01679]. Moreover, we prove a simple relation between the KP-affine coordinates of a tau-function $τ(\mathbf{t})$ of the KdV hierarchy and the BKP-affine coordinates of $τ(\mathbf{t}/2)$. As applications, we present a new algorithm to compute the free energies of the Witten-Kontsevich tau-function and the Brézin-Gross-Witten tau-function.

preprint2022arXiv

From Personalized Medicine to Population Health: A Survey of mHealth Sensing Techniques

Mobile Sensing Apps have been widely used as a practical approach to collect behavioral and health-related information from individuals and provide timely intervention to promote health and well-beings, such as mental health and chronic cares. As the objectives of mobile sensing could be either \emph{(a) personalized medicine for individuals} or \emph{(b) public health for populations}, in this work we review the design of these mobile sensing apps, and propose to categorize the design of these apps/systems in two paradigms -- \emph{(i) Personal Sensing} and \emph{(ii) Crowd Sensing} paradigms. While both sensing paradigms might incorporate with common ubiquitous sensing technologies, such as wearable sensors, mobility monitoring, mobile data offloading, and/or cloud-based data analytics to collect and process sensing data from individuals, we present a novel taxonomy system with two major components that can specify and classify apps/systems from aspects of the life-cycle of mHealth Sensing: \emph{(1) Sensing Task Creation \& Participation}, \emph{(2) Health Surveillance \& Data Collection}, and \emph{(3) Data Analysis \& Knowledge Discovery}. With respect to different goals of the two paradigms, this work systematically reviews this field, and summarizes the design of typical apps/systems in the view of the configurations and interactions between these two components. In addition to summarization, the proposed taxonomy system also helps figure out the potential directions of mobile sensing for health from both personalized medicines and population health perspectives.

preprint2022arXiv

Graph Neural Networks in IoT: A Survey

The Internet of Things (IoT) boom has revolutionized almost every corner of people's daily lives: healthcare, home, transportation, manufacturing, supply chain, and so on. With the recent development of sensor and communication technologies, IoT devices including smart wearables, cameras, smartwatches, and autonomous vehicles can accurately measure and perceive their surrounding environment. Continuous sensing generates massive amounts of data and presents challenges for machine learning. Deep learning models (e.g., convolution neural networks and recurrent neural networks) have been extensively employed in solving IoT tasks by learning patterns from multi-modal sensory data. Graph Neural Networks (GNNs), an emerging and fast-growing family of neural network models, can capture complex interactions within sensor topology and have been demonstrated to achieve state-of-the-art results in numerous IoT learning tasks. In this survey, we present a comprehensive review of recent advances in the application of GNNs to the IoT field, including a deep dive analysis of GNN design in various IoT sensing environments, an overarching list of public data and source code from the collected publications, and future research directions. To keep track of newly published works, we collect representative papers and their open-source implementations and create a Github repository at https://github.com/GuiminDong/GNN4IoT.

preprint2022arXiv

Optical trapping of high-aspect-ratio NaYF hexagonal prisms for kHz-MHz gravitational wave detectors

We present experimental results on optical trapping of Yb-doped $β-$NaYF sub-wavelength-thickness high-aspect-ratio hexagonal prisms with a micron-scale radius. The prisms are trapped in vacuum using an optical standing wave, oriented with the normal vector to their face along the beam propagation direction, and exhibit characteristic modes of three translational and two torsional degrees of freedom. The measured motional spectra are compared with numerical simulations. This plate-like geometry simultaneously enables trapping with low photon-recoil-heating, high mass, and high trap frequency, potentially leading to advances in high frequency gravitational wave searches in the Levitated Sensor Detector (LSD), currently under construction. The material used here has previously been shown to exhibit internal cooling via laser refrigeration when optically trapped and illuminated with light of suitable wavelength. Employing such laser refrigeration methods in the context of our work may enable higher trapping intensity thus and higher trap frequencies for gravitational wave searches approaching the several hundred kHz range.

preprint2022arXiv

Topological correlations in three dimensional classical Ising models: an exact solution with a continuous phase transition

We study a three-dimensional (3D) classical Ising model that is exactly solvable when some coupling constants take certain imaginary values. The solution combines and generalizes the Onsager-Kaufman solution of the 2D Ising model and the solution of Kitaev's honeycomb model, leading to a three-parameter phase diagram with a third order phase transition between two distinct phases. Interestingly, the phases of this model are distinguished by topological features: the expectation value of a certain family of loop observables depend only on the topology of the loop (whether the loop is contractible), and are quantized at rational values that differ in the two phases. We show that a related exactly solvable 3D classical statistical model with real coupling constants also shows the topological features of one of these phases. Furthermore, even in the model with complex parameters, the partition function has some physical relevance, as it can be interpreted as the transition amplitude of a quantum dynamical process and may shed light on dynamical quantum phase transitions.

preprint2020arXiv

Enhance the performance of navigation: A two-stage machine learning approach

Real time traffic navigation is an important capability in smart transportation technologies, which has been extensively studied these years. Due to the vast development of edge devices, collecting real time traffic data is no longer a problem. However, real traffic navigation is still considered to be a particularly challenging problem because of the time-varying patterns of the traffic flow and unpredictable accidents/congestion. To give accurate and reliable navigation results, predicting the future traffic flow(speed,congestion,volume,etc) in a fast and accurate way is of great importance. In this paper, we adopt the ideas of ensemble learning and develop a two-stage machine learning model to give accurate navigation results. We model the traffic flow as a time series and apply XGBoost algorithm to get accurate predictions on future traffic conditions(1st stage). We then apply the Top K Dijkstra algorithm to find a set of shortest paths from the give start point to the destination as the candidates of the output optimal path. With the prediction results in the 1st stage, we find one optimal path from the candidates as the output of the navigation algorithm. We show that our navigation algorithm can be greatly improved via EOPF(Enhanced Optimal Path Finding), which is based on neural network(2nd stage). We show that our method can be over 7% better than the method without EOPF in many situations, which indicates the effectiveness of our model.

preprint2020arXiv

Monetizing Edge Service in Mobile Internet Ecosystem

In mobile Internet ecosystem, Mobile Users (MUs) purchase wireless data services from Internet Service Provider (ISP) to access to Internet and acquire the interested content services (e.g., online game) from Content Provider (CP). The popularity of intelligent functions (e.g., AI and 3D modeling) increases the computation-intensity of the content services, leading to a growing computation pressure for the MUs' resource-limited devices. To this end, edge computing service is emerging as a promising approach to alleviate the MUs' computation pressure while keeping their quality-of-service, via offloading some computation tasks of MUs to edge (computing) servers deployed at the local network edge. Thus, Edge Service Provider (ESP), who deploys the edge servers and offers the edge computing service, becomes an upcoming new stakeholder in the ecosystem. In this work, we study the economic interactions of MUs, ISP, CP, and ESP in the new ecosystem with edge computing service, where MUs can acquire the computation-intensive content services (offered by CP) and offload some computation tasks, together with the necessary raw input data, to edge servers (deployed by ESP) through ISP. We first study the MU's Joint Content Acquisition and Task Offloading (J-CATO) problem, which aims to maximize his long-term payoff. We derive the off-line solution with crucial insights, based on which we design an online strategy with provable performance. Then, we study the ESP's edge service monetization problem. We propose a pricing policy that can achieve a constant fraction of the ex-post optimal revenue with an extra constant loss for the ESP. Numerical results show that the edge computing service can stimulate the MUs' content acquisition and improve the payoffs of MUs, ISP, and CP.

preprint2020arXiv

PointPWC-Net: A Coarse-to-Fine Network for Supervised and Self-Supervised Scene Flow Estimation on 3D Point Clouds

We propose a novel end-to-end deep scene flow model, called PointPWC-Net, on 3D point clouds in a coarse-to-fine fashion. Flow computed at the coarse level is upsampled and warped to a finer level, enabling the algorithm to accommodate for large motion without a prohibitive search space. We introduce novel cost volume, upsampling, and warping layers to efficiently handle 3D point cloud data. Unlike traditional cost volumes that require exhaustively computing all the cost values on a high-dimensional grid, our point-based formulation discretizes the cost volume onto input 3D points, and a PointConv operation efficiently computes convolutions on the cost volume. Experiment results on FlyingThings3D outperform the state-of-the-art by a large margin. We further explore novel self-supervised losses to train our model and achieve comparable results to state-of-the-art trained with supervised loss. Without any fine-tuning, our method also shows great generalization ability on KITTI Scene Flow 2015 dataset, outperforming all previous methods.

preprint2020arXiv

Probing topological spin structures using light-polarization and magnetic microscopy

We present an imaging modality that enables detection of magnetic moments and their resulting stray magnetic fields. We use wide-field magnetic imaging that employs a diamond-based magnetometer and has combined magneto-optic detection (e.g. magneto-optic Kerr effect) capabilities. We employ such an instrument to image magnetic (stripe) domains in multilayered ferromagnetic structures.

preprint2020arXiv

Tightening the Lieb-Robinson Bound in Locally-Interacting Systems

The Lieb-Robinson (LR) bound rigorously shows that in quantum systems with short-range interactions, the maximum amount of information that travels beyond an effective "light cone" decays exponentially with distance from the light-cone front, which expands at finite velocity. Despite being a fundamental result, existing bounds are often extremely loose, limiting their applications. We introduce a method that dramatically and qualitatively improves LR bounds in models with finite-range interactions. Most prominently, in systems with a large local Hilbert space dimension $D$, our method gives an LR velocity that grows much slower than previous bounds with $D$ as $D\to \infty$. For example, in the Heisenberg model with spin $S$, we find $v\leq$ const. compared to the previous $v\propto S$ which diverges at large $S$, and in multiorbital Hubbard models with $N$ orbitals, we find $v\propto \sqrt{N}$ instead of previous $v\propto N$, and similarly in the $N$-state truncated Bose-Hubbard model and Wen's quantum rotor model. Our bounds also scale qualitatively better in some systems when the spatial dimension or certain model parameters become large, for example in the $d$-dimensional quantum Ising model and perturbed toric code models. Even in spin-1/2 Ising and Fermi-Hubbard models, our method improves the LR velocity by an order of magnitude with typical model parameters, and significantly improves the LR bound at large distance and early time.

preprint2019arXiv

Towards Simultaneous Observation of Path and Interference of Single Photon in a Modified Mach-Zehnder Interferometer

Classical wisdom of wave-particle duality says that it is impossible to observe simultaneously the wave and particle nature of microscopic object. Mathematically the principle requests that the interference visibility V and which-path distinguishability D satisfy an orthodox limit of square(V)+square(D)<=1. This work presents a new wave-particle duality test experiment with single photon in a modified Mach-Zehnder interferometer and convincingly show the possibility of breaking the limit. The key element of the interferometer is a weakly-scattering total-internal reflection prism surface, which exhibits pronounced single-photon interference with a visibility up to 0.97 and simultaneously provides path distinguishability of 0.83. Apparently square(V)+square(D)=1.63 far exceeds the orthodox limit set by the principle of wave-particle duality for single photon. It is expected that more delicate experiments in future should be able to demonstrate the ultimate regime of square(V)+square(D) approaching 2 and shed new light on the foundations of contemporary quantum mechanics.

preprint2016arXiv

A unified approach to hadron phenomenology at zero and finite temperatures in a hard-wall AdS/QCD model

We propose a unified approach to study meson, nucleon and $Δ$-baryon properties at zero and finite temperatures in the context of hard-wall AdS/QCD model. We first combine some previous works dealing with mesons and baryons separately, and introduce a new parameter~$ξ$ so that the model could give a universal description of spectrum and couplings of both sectors in a self-consistent way. All observables calculated numerically show reasonable agreement with experimental data. We then study these observables at nonzero temperature by modifying the AdS space-time into AdS-Schwartzchild space-time. Numerically solving the model, we find an interesting temperature dependence of the spectrum and the couplings. We also make a prediction on the finite temperature decay width of some nucleon and $Δ$ excited states.

Zhiyuan Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

19 published item(s)

$A^3$-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation

BalanceRAG: Joint Risk Calibration for Cascaded Retrieval-Augmented Generation

Decoupling Semantics and Fingerprints: A Universal Representation for AI-Generated Image Detection

MAXS: Meta-Adaptive Exploration with LLM Agents

PULSE: Agentic Investigation with Passive Sensing for Proactive Intervention in Cancer Survivorship

Subjective-Objective Median-based Importance Technique (SOMIT) to Aid Multi-Criteria Renewable Energy Evaluation

AACC: Asymmetric Actor-Critic in Contextual Reinforcement Learning

BKP Hierarchy, Affine Coordinates, and a Formula for Connected Bosonic $N$-Point Functions

From Personalized Medicine to Population Health: A Survey of mHealth Sensing Techniques

Graph Neural Networks in IoT: A Survey

Optical trapping of high-aspect-ratio NaYF hexagonal prisms for kHz-MHz gravitational wave detectors

Topological correlations in three dimensional classical Ising models: an exact solution with a continuous phase transition

Enhance the performance of navigation: A two-stage machine learning approach

Monetizing Edge Service in Mobile Internet Ecosystem

PointPWC-Net: A Coarse-to-Fine Network for Supervised and Self-Supervised Scene Flow Estimation on 3D Point Clouds

Probing topological spin structures using light-polarization and magnetic microscopy

Tightening the Lieb-Robinson Bound in Locally-Interacting Systems

Towards Simultaneous Observation of Path and Interference of Single Photon in a Modified Mach-Zehnder Interferometer

A unified approach to hadron phenomenology at zero and finite temperatures in a hard-wall AdS/QCD model