Researcher profile

Mengqi Zhang

Mengqi Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

Uncovering Entity Identity Confusion in Multimodal Knowledge Editing

Multimodal knowledge editing (MKE) aims to correct the internal knowledge of large vision-language models after deployment, yet the behavioral patterns of post-edit models remain underexplored. In this paper, we identify a systemic failure mode in edited models, termed Entity Identity Confusion (EIC): edited models exhibit an absurd behavior where text-only queries about the original entity's identity unexpectedly return information about the new entity. To rigorously investigate EIC, we construct EC-Bench, a diagnostic benchmark that directly probes how image-entity bindings shift before and after editing. Our analysis reveals that EIC stems from existing methods failing to distinguish between Image-Entity (I-E) binding and Entity-Entity (E-E) relational knowledge in the model, causing models to overfit E-E associations as a shortcut: the image is still perceived as the original entity, with the new entity's name serving only as a spurious identity label. We further explore potential mitigation strategies, showing that constraining edits to the model's I-E processing stage encourages edits to act more faithfully on I-E binding, thereby substantially reducing EIC. Based on these findings, we discuss principled desiderata for faithful MKE and provide methodological guidance for future research.

preprint2025arXiv

A Diffuse-Interface Marangoni Instability

We investigate a novel Marangoni-induced instability that arises exclusively in diffuse fluid interfaces, absent in classical sharp-interface models. Using a validated phase-field Navier-Stokes-Allen-Cahn framework, we linearize the governing equations to analyze the onset and development of interfacial instability driven by solute-induced surface tension gradients. A critical interfacial thickness scaling inversely with the Marangoni number, $δ_\mathrm{cr} \sim Ma^{-1}$, emerges from the balance between advective and diffusive transport. Unlike sharp-interface scenarios where matched viscosity and diffusivity stabilize the interface, finite thickness induces asymmetric solute distributions and tangential velocity shifts that destabilize the system. We identify universal power-law scalings of velocity and concentration offsets with a modified Marangoni number $Ma^δ$, independent of capillary number and interfacial mobility. A critical crossover at $Ma^δ\approx 590$ distinguishes diffusion-dominated stabilization from advection-driven destabilization. These findings highlight the importance of diffuse-interface effects in multiphase flows, with implications for miscible fluids, soft matter, and microfluidics where interfacial thickness and coupled transport phenomena are non-negligible.

preprint2022arXiv

A three-dimensional dynamic mode decomposition analysis of wind farm flow aerodynamics

High-fidelity large-eddy simulations are suitable to obtain insight into the complex flow dynamics in extended wind farms. In order to better understand these flow dynamics, we use dynamic mode decomposition (DMD) to analyze and reconstruct the flow field in large-scale numerically simulated wind farms by large-eddy simulations (LES). Different wind farm layouts are considered, and we find that a combination of horizontal and vertical staggering leads to improved wind farm performance compared to traditional horizontal staggering. We analyze the wind farm flows using the amplitude selection (AP) and sparsity-promoting (SP method) DMD approach. We find that the AP method tends to select modes with a small length scale and a high frequency, while the SP method selects large coherent structures with low frequency. The latter are somewhat reminiscent of modes obtained using proper orthogonal decomposition (POD). We find that a relatively limited number of SP-DMD modes is sufficient to accurately reconstruct the flow field in the entire wind farm, whereas the AP-DMD method requires more modes to achieve an accurate reconstruction. Thus, the SP-DMD method has a smaller performance loss compared to the AP-DMD method in terms of the reconstruction of the flow field.

preprint2022arXiv

Deep Contrastive Multiview Network Embedding

Multiview network embedding aims at projecting nodes in the network to low-dimensional vectors, while preserving their multiple relations and attribute information. Contrastive learning approaches have shown promising performance in this task. However, they neglect the semantic consistency between fused and view representations and have difficulty in modeling complementary information between different views. To deal with these deficiencies, this work presents a novel Contrastive leaRning framEwork for Multiview network Embedding (CREME). In our work, different views can be obtained based on the various relations among nodes. Then, we generate view embeddings via proper view encoders and utilize an attentive multiview aggregator to fuse these representations. Particularly, we design two collaborative contrastive objectives, view fusion InfoMax and inter-view InfoMin, to train the model in a self-supervised manner. The former objective distills information from embeddings generated from different views, while the latter captures complementary information among views to promote distinctive view embeddings. We also show that the two objectives can be unified into one objective for model training. Extensive experiments on three real-world datasets demonstrate that our proposed CREME is able to consistently outperform state-of-the-art methods.

preprint2022arXiv

Latent Structure Mining with Contrastive Modality Fusion for Multimedia Recommendation

Recent years have witnessed growing interests in multimedia recommendation, which aims to predict whether a user will interact with an item with multimodal contents. Previous studies focus on modeling user-item interactions with multimodal features included as side information. However, this scheme is not well-designed for multimedia recommendation. Firstly, only collaborative item-item relationships are implicitly modeled through high-order item-user-item co-occurrences. We argue that the latent semantic item-item structures underlying these multimodal contents could be beneficial for learning better item representations and assist the recommender models to comprehensively discover candidate items. Secondly, previous studies disregard the fine-grained multimodal fusion. Although having access to multiple modalities might allow us to capture rich information, we argue that the simple coarse-grained fusion by linear combination or concatenation in previous work is insufficient to fully understand content information and item relationships.To this end, we propose a latent structure MIning with ContRastive mOdality fusion method (MICRO for brevity). To be specific, we devise a novel modality-aware structure learning module, which learns item-item relationships for each modality. Based on the learned modality-aware latent item relationships, we perform graph convolutions that explicitly inject item affinities to modality-aware item representations. Then, we design a novel contrastive method to fuse multimodal features. These enriched item representations can be plugged into existing collaborative filtering methods to make more accurate recommendations. Extensive experiments on real-world datasets demonstrate the superiority of our method over state-of-the-art baselines.

preprint2022arXiv

Nonlinear spatiotemporal instabilities in two-dimensional electroconvective flows

This work studies the effects of a through-flow on two-dimensional electrohydrodynamic (EHD) flows of a dielectric liquid confined between two plane plates, as a model problem to further our understanding of the fluid mechanics in the presence of an electric field. The liquid is subjected to a strong unipolar charge injection from the bottom plate and a pressure gradient along the streamwise direction. Highly-accurate numerical simulations and weakly nonlinear stability analyses based on multiple-scale expansion and amplitude expansion methods are used to unravel the nonlinear spatiotemporal instability mechanisms in this combined flow. We found that the through-flow makes the hysteresis loop in the EHD flow narrower. In the numerical simulation of an impulse response, the leading and trailing edges of the wavepacket within the nonlinear regime are consistent with the linear ones, a result which we also verified against that in natural convection. In addition, as the bifurcation in EHD-Poiseuille flows is of a subcritical nature, nonlinear finite-amplitude solutions exist in the subcritical regime, and our calculation indicates that they are convectively unstable. The validity of the Ginzburg-Landau equation (GLE), derived from the weakly nonlinear expansion of Navier-Stokes equations and the Maxwell's equations in the quasi-electrostatic limit, serving as a physical reduced-order model for probing the spatiotemporal dynamics in this flow, has also been investigated. We found that the coefficients in the GLE calculated using amplitude expansion method can predict the absolute growth rates even when the parameters are away from the linear critical conditions, compared favourably with the local dispersion relation, whereas the validity range of the GLE derived from the multiple-scale expansion method is confined to the vicinity of the linear critical conditions.

preprint2022arXiv

On the large-Weissenberg-number scaling laws in viscoelastic pipe flows

This work explains a scaling law of the first Landau coefficient of the derived Ginzburg-Landau equation (GLE) in the weakly nonlinear analysis of axisymmetric viscoelastic pipe flows in the large-Weissenberg-number ($Wi$) limit, recently reported in Wan et al. J. Fluid Mech. (2021), vol. 929, A16. Using an asymptotic method, we derive a reduced system, which captures the characteristics of the linear centre-mode instability near the critical condition in the large-$Wi$ limit. Based on the reduced system we then conduct a weakly nonlinear analysis using a multiple-scale expansion method, which readily explains the aforementioned scaling law of the Landau coefficient and some other scaling laws. Particularly, the equilibrium amplitude of disturbance near linear critical conditions is found to scale as $Wi^{-1/2}$, which may be of interest to experimentalists. The current analysis reduces the numbers of parameters and unknowns and exemplifies an approach to studying the viscoelastic flow at large $Wi$, which could shed new light on the understanding of its nonlinear dynamics.

preprint2022arXiv

Reinforcement-learning-based control of convectively-unstable flows

This work reports the application of a model-free deep-reinforcement-learning-based (DRL) flow control strategy to suppress perturbations evolving in the 1-D linearised Kuramoto-Sivashinsky (KS) equation and 2-D boundary layer flows. The former is commonly used to model the disturbance developing in flat-plate boundary layer flows. These flow systems are convectively unstable, being able to amplify the upstream disturbance, and are thus difficult to control. The control action is implemented through a volumetric force at a fixed position and the control performance is evaluated by the reduction of perturbation amplitude downstream. We first demonstrate the effectiveness of the DRL-based control in the KS system subjected to a random upstream noise. The amplitude of perturbation monitored downstream is significantly reduced and the learnt policy is shown to be robust to both measurement and external noise. One of our focuses is to optimally place sensors in the DRL control using the gradient-free particle swarm optimisation algorithm. After the optimisation process for different numbers of sensors, a specific eight-sensor placement is found to yield the best control performance. The optimised sensor placement in the KS equation is applied directly to control 2-D Blasius boundary layer flows and can efficiently reduce the downstream perturbation energy. Via flow analyses, the control mechanism found by DRL is the opposition control. Besides, it is found that when the flow instability information is embedded in the reward function of DRL to penalise the instability, the control performance can be further improved in this convectively-unstable flow.

preprint2021arXiv

Dynamic Graph Collaborative Filtering

Dynamic recommendation is essential for modern recommender systems to provide real-time predictions based on sequential data. In real-world scenarios, the popularity of items and interests of users change over time. Based on this assumption, many previous works focus on interaction sequences and learn evolutionary embeddings of users and items. However, we argue that sequence-based models are not able to capture collaborative information among users and items directly. Here we propose Dynamic Graph Collaborative Filtering (DGCF), a novel framework leveraging dynamic graphs to capture collaborative and sequential relations of both items and users at the same time. We propose three update mechanisms: zero-order 'inheritance', first-order 'propagation', and second-order 'aggregation', to represent the impact on a user or item when a new interaction occurs. Based on them, we update related user and item embeddings simultaneously when interactions occur in turn, and then use the latest embeddings to make recommendations. Extensive experiments conducted on three public datasets show that DGCF significantly outperforms the state-of-the-art dynamic recommendation methods up to 30. Our approach achieves higher performance when the dataset contains less action repetition, indicating the effectiveness of integrating dynamic collaborative information.

preprint2021arXiv

Personalized Graph Neural Networks with Attention Mechanism for Session-Aware Recommendation

The problem of session-aware recommendation aims to predict users' next click based on their current session and historical sessions. Existing session-aware recommendation methods have defects in capturing complex item transition relationships. Other than that, most of them fail to explicitly distinguish the effects of different historical sessions on the current session. To this end, we propose a novel method, named Personalized Graph Neural Networks with Attention Mechanism (A-PGNN) for brevity. A-PGNN mainly consists of two components: one is Personalized Graph Neural Network (PGNN), which is used to extract the personalized structural information in each user behavior graph, compared with the traditional Graph Neural Network (GNN) model, which considers the role of the user when the node embeddding is updated. The other is Dot-Product Attention mechanism, which draws on the Transformer net to explicitly model the effect of historical sessions on the current session. Extensive experiments conducted on two real-world data sets show that A-PGNN evidently outperforms the state-of-the-art personalized session-aware recommendation methods.