Researcher profile

Sheng Lu

Sheng Lu contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 15 - Unverified | Verification L1 | Unclaimed author
3 works
0 followers
4 topics
4 close collaborators

Published work

3 published items

Preprint | 2025 | arXiv

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Multimodal Large Language Models (MLLMs) have made remarkable progress in video understanding. However, they suffer from a critical vulnerability: an over-reliance on language priors, which can lead to visually ungrounded hallucinations, especially when processing counterfactual videos that defy common sense. This limitation, stemming from the intrinsic data imbalance between text and video, is challenging to address due to the substantial cost of collecting and annotating counterfactual data. To address this, we introduce DualityForge, a novel counterfactual data synthesis framework that employs controllable, diffusion-based video editing to transform real-world videos into counterfactual scenarios. By embedding structured contextual information into the video editing and QA generation processes, the framework automatically produces high-quality QA pairs together with original-edited video pairs for contrastive training. Based on this, we build DualityVidQA, a large-scale video dataset designed to reduce MLLM hallucinations. In addition, to fully exploit the contrastive nature of our paired data, we propose Duality-Normalized Advantage Training (DNA-Train), a two-stage SFT-RL training regime in which the RL phase applies pair-wise $\ell_1$ advantage normalization, enabling more stable and efficient policy optimization. Experiments on DualityVidQA-Test demonstrate that our method substantially reduces model hallucinations on counterfactual videos, yielding a relative improvement of 24.0% over the Qwen2.5-VL-7B baseline. Moreover, our approach achieves significant gains across both hallucination and general-purpose benchmarks, indicating strong generalization capability. We will open-source our dataset and code.

Preprint | 2022 | arXiv

The local-global principle for divisibility in CM elliptic curves

We consider the local-global principle for divisibility in the Mordell-Weil group of a CM elliptic curve defined over a number field. For each prime $p$ we give sharp lower bounds on the degree $d$ of a number field over which there exists a CM elliptic curve which gives a counterexample to the local-global principle for divisibility by a power of $p$. As a corollary we deduce that there are at most finitely many elliptic curves (with or without CM) which are counterexamples with $p > 2d+1$. We also deduce that the local-global principle for divisibility by powers of $7$ holds over quadratic fields.

Preprint | 2010 | arXiv

Biharmonic maps in two dimensions

Biharmonic maps between surfaces are studied in this paper. We compute the bitension field of a map between surfaces with conformal metrics in complex coordinates. As applications, we show that a linear map from the Euclidean plane into $(\mathbb{R}^2, \sigma^2\, dw\, d\bar{w})$ is always biharmonic if the conformal factor $\sigma$ is bi-analytic; we construct a family of such $\sigma$, and we give a classification of linear biharmonic maps between $2$-spheres. We also study biharmonic maps between surfaces with warped product metrics. This includes a classification of linear biharmonic maps between hyperbolic planes and constructions of many proper biharmonic maps into a circular cone or a helicoid.