Researcher profile

Zhen Wu

Zhen Wu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
11works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

11 published item(s)

preprint2026arXiv

Causal Evidence for Attention Head Imbalance in Modality Conflict Hallucination

Modality-conflict hallucination occurs when multimodal large language models (MLLMs) prioritize erroneous textual premises over contradictory visual evidence. To understand why visual evidence fails to prevail during generation, we take a mechanistic perspective and examine which internal components drive or resist this failure. We perform head-level causal analysis using path patching across five open-source MLLMs and identify two groups of attention heads with opposing causal roles: hallucination-driving heads and hallucination-resisting heads. We find a consistent asymmetry: driving effects are more broadly distributed and carry greater aggregate weight, whereas resisting effects concentrate in a small number of high-importance heads. Ablation experiments further confirm that these groups exert opposing effects during generation: distributed driving influence and localized resistance together form an imbalanced routing structure that biases generation toward the erroneous premise. Motivated by this finding, we propose MACI (Modality-conflict-Aware Causal Intervention), a conditional intervention that suppresses causally identified hallucination-driving heads only when conflict is detected. Across five MLLMs, MACI achieves the largest hallucination reduction among compared inference-time baselines on the MMMC benchmark with a favorable hallucination-accuracy trade-off, and transfers zero-shot to the SCI-SemanticConflict test.

preprint2026arXiv

Locomotion Beyond Feet

Most locomotion methods for humanoid robots focus on leg-based gaits, yet natural bipeds frequently rely on hands, knees, and elbows to establish additional contacts for stability and support in complex environments. This paper introduces Locomotion Beyond Feet, a comprehensive system for whole-body humanoid locomotion across extremely challenging terrains, including low-clearance spaces under chairs, knee-high walls, knee-high platforms, and steep ascending and descending stairs. Our approach addresses two key challenges: contact-rich motion planning and generalization across diverse terrains. To this end, we combine physics-grounded keyframe animation with reinforcement learning. Keyframes encode human knowledge of motor skills, are embodiment-specific, and can be readily validated in simulation or on hardware, while reinforcement learning transforms these references into robust, physically accurate motions. We further employ a hierarchical framework consisting of terrain-specific motion-tracking policies, failure recovery mechanisms, and a vision-based skill planner. Real-world experiments demonstrate that Locomotion Beyond Feet achieves robust whole-body locomotion and generalizes across obstacle sizes, obstacle instances, and terrain sequences.

preprint2026arXiv

The spatiotemporal Born rule is quasiprobabilistic

Contrary to general relativity, quantum theory treats space and time in fundamentally different ways. In particular, while joint probabilities associated with spacelike separated measurements are defined in terms of the Born rule, joint probabilities associated with measurements performed in sequence are defined in terms of the state-update rule. In this work, we show that one obtains a more unified perspective of space and time in quantum theory by embracing a quasiprobabilistic description of sequential measurements. More precisely, we show that there exists a unique \emph{pseudo}-density operator encoding canonical quasiprobabilities associated with sequential measurements in precisely the same manner that a density operator encodes joint probabilities associated with spacelike separated measurements, thus providing a natural extension of the Born rule into the temporal domain. As an application, we show how such a spatiotemporal Born rule combined in conjunction with a quantum Bayes' rule yields an operational notion of time-reversal symmetry for sequential measurements on an \emph{open} quantum system.

preprint2023arXiv

Linear-Quadratic Delayed Mean-Field Social Optimization

A linear quadratic (LQ) stochastic optimization problem with delay involving weakly-coupled large population is investigated in this paper. Different to classic mean field (MF) game, here agents cooperate with each other to minimize the so-called \emph{social} objective. With the aid of \emph{delayed person-by-person optimality} principle, one arrives at an auxiliary LQ delayed control problem by decentralized information. A decentralized strategy is obtained by feat of an MF type anticipated forward-backward stochastic differential delay equation (AFBSDDE) consistency condition. The discounting method with delay feature is employed to solve the consistency condition system. Finally, by some estimates of AFBSDDEs we derive the asymptotic social optimality.

preprint2022arXiv

Continuous-time mean-variance portfolio selection under non-Markovian regime-switching model with random horizon

In this paper, we consider a continuous-time mean-variance portfolio selection with regime-switching and random horizon. Unlike previous works, the dynamic of assets are described by non-Markovian regime-switching models in the sense that all the market parameters are predictable with respect to the filtration generated jointly by Markov chain and Brownian motion. We formulate this problem as a constrained stochastic linear-quadratic optimal control problem. The Markov chain is assumed to be independent of the Brownian motion. So the market is incomplete. We derive closed-form expressions for both the optimal portfolios and the efficient frontier. All the results are different from those in the problem with fixed time horizon.

preprint2022arXiv

Exploiting Unlabeled Data for Target-Oriented Opinion Words Extraction

Target-oriented Opinion Words Extraction (TOWE) is a fine-grained sentiment analysis task that aims to extract the corresponding opinion words of a given opinion target from the sentence. Recently, deep learning approaches have made remarkable progress on this task. Nevertheless, the TOWE task still suffers from the scarcity of training data due to the expensive data annotation process. Limited labeled data increase the risk of distribution shift between test data and training data. In this paper, we propose exploiting massive unlabeled data to reduce the risk by increasing the exposure of the model to varying distribution shifts. Specifically, we propose a novel Multi-Grained Consistency Regularization (MGCR) method to make use of unlabeled data and design two filters specifically for TOWE to filter noisy data at different granularity. Extensive experimental results on four TOWE benchmark datasets indicate the superiority of MGCR compared with current state-of-the-art methods. The in-depth analysis also demonstrates the effectiveness of the different-granularity filters. Our codes are available at https://github.com/TOWESSL/TOWESSL.

preprint2022arXiv

Linear-Quadratic Large-Population Problem with Partial Information: Hamiltonian Approach and Riccati Approach

This paper studies a class of partial information linear-quadratic mean-field game problems. A general stochastic large-population system is considered, where the diffusion term of the dynamic of each agent can depend on the state and control. We study both the control constrained case and unconstrained case. In control constrained case, by using Hamiltonian approach and convex analysis, the explicit decentralized strategies can be obtained through projection operator. The corresponding Hamiltonian type consistency condition system is derived, which turns out to be a nonlinear mean-field forward-backward stochastic differential equation with projection operator. The well-posedness of such kind of equations is proved by using discounting method. Moreover, the corresponding $\varepsilon$-Nash equilibrium property is verified. In control unconstrained case, the decentralized strategies can be further represented explicitly as the feedback of filtered state through Riccati approach. The existence and uniqueness of a solution to a new Riccati type consistency condition system is also discussed. As an application, a general inter-bank borrowing and lending problem is studied to illustrate that the effect of partial information cannot be ignored.

preprint2022arXiv

Linear-Quadratic Mean Field Games of Controls with Non-Monotone Data

In this paper, we study a class of linear-quadratic (LQ) mean field games of controls with common noises and their corresponding $N$-player games. The theory of mean field game of controls considers a class of mean field games where the interaction is via the joint law of both the state and control. By the stochastic maximum principle, we first analyze the limiting behavior of the representative player and obtain his/her optimal control in a feedback form with the given distributional flow of the population and its control. The mean field equilibrium is determined by the Nash certainty equivalence (NCE) system. Thanks to the common noise, we do not require any monotonicity conditions for the solvability of the NCE system. We also study the master equation arising from LQ mean field games of controls, which is a finite-dimensional second-order parabolic equation. It can be shown that the master equation admits a unique classical solution over an arbitrary time horizon without any monotonicity conditions. Beyond that, we can solve the $N$-player games directly by further assuming the non-degeneracy of the idiosyncratic noises. As byproducts, we prove the quantitative convergence results from the $N$-player game to the mean field game and the propagation of chaos property for the related optimal trajectories.

preprint2022arXiv

Two Equivalent Families of Linear Fully Coupled Forward Backward Stochastic Differential Equations

In this paper, we investigate two families of fully coupled linear Forward-Backward Stochastic Differential Equations (FBSDE). Within these families, one could get the same well-posedness of FBSDEs with totally different structures. The first family of FBSDEs are proved to be equivalent with respect to the Unified Approach. Thus one could get the well-posedness of the whole family if one member exists a unique solution. Another equivalent family of FBSDEs are investigated by introducing a linear transformation method. By reason of the fully coupling structure between the forward and backward equations, it leads to a highly interdependence in solutions. We are able to lower the coupling of FBSDEs, by virtue of the idea of transformation, without losing the well-posedness. Moreover, owing to the non-degeneracy of the transformation matrix, the solution to original FBSDE is totally determined by solutions of FBSDE after transformation. In addition, an example of optimal Linear Quadratic (LQ) problem is presented to illustrate.

preprint2020arXiv

Dynamic Programming Principle for Backward Doubly Stochastic Recursive Optimal Control Problem and Sobolev Weak Solution of The Stochastic Hamilton-Bellman Equation

In this paper, we study backward doubly stochastic recursive optimal control problem where the cost function is described by the solution of a backward doubly stochastic differential equation. We give the dynamical programming principle for this kind of optimal control problem and show that the value function is the unique Sobolev weak solution for the corresponding stochastic Hamilton-Jacobi-Bellman equation.

preprint2020arXiv

Latent Opinions Transfer Network for Target-Oriented Opinion Words Extraction

Target-oriented opinion words extraction (TOWE) is a new subtask of ABSA, which aims to extract the corresponding opinion words for a given opinion target in a sentence. Recently, neural network methods have been applied to this task and achieve promising results. However, the difficulty of annotation causes the datasets of TOWE to be insufficient, which heavily limits the performance of neural models. By contrast, abundant review sentiment classification data are easily available at online review sites. These reviews contain substantial latent opinions information and semantic patterns. In this paper, we propose a novel model to transfer these opinions knowledge from resource-rich review sentiment classification datasets to low-resource task TOWE. To address the challenges in the transfer process, we design an effective transformation method to obtain latent opinions, then integrate them into TOWE. Extensive experimental results show that our model achieves better performance compared to other state-of-the-art methods and significantly outperforms the base model without transferring opinions knowledge. Further analysis validates the effectiveness of our model.