Researcher profile

Mengqi Huang

Mengqi Huang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2026arXiv

Lance: Unified Multimodal Modeling by Multi-Task Synergy

We present Lance, a lightweight native unified model supporting multimodal understanding, generation, and editing for both images and videos. Rather than relying on model capacity scaling or text-image-dominant designs, Lance explores a practical paradigm for unified multimodal modeling via collaborative multi-task training. It is grounded in two core principles: unified context modeling and decoupled capability pathways. Specifically, Lance is trained from scratch and employs a dual-stream mixture-of-experts architecture on shared interleaved multimodal sequences, enabling joint context learning while decoupling the pathways for understanding and generation. We further introduce modality-aware rotary positional encoding to mitigate interference among heterogeneous visual tokens and boost cross-task alignment. During training, Lance adopts a staged multi-task training paradigm with capability-oriented objectives and adaptive data scheduling to strengthen both semantic comprehension and visual generation performance. Experimental results demonstrate that Lance substantially outperforms existing open-source unified models in image and video generation, while retaining strong multimodal understanding capabilities. The homepage is available at https://lance-project.github.io.

preprint2026arXiv

Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation

Distillation-based acceleration has become foundational for making autoregressive streaming video diffusion models practical, with distribution matching distillation (DMD) as the de facto choice. Existing methods, however, train the student to match the teacher's output indiscriminately, treating every rollout, frame, and pixel as equally reliable supervision. We argue that this caps distilled quality, since it overlooks two complementary axes of variance in DMD supervision: Inter-Reliability across student rollouts whose supervision varies in reliability, and Intra-Perplexity across spatial regions and temporal frames that contribute unequally to where quality can still be improved. The objective thus conflates two questions under a uniform weight: whether to learn from each rollout, and where to concentrate optimization within it. To address this, we propose Stream-R1, a Reliability-Perplexity Aware Reward Distillation framework that adaptively reweights the distillation objective at both rollout and spatiotemporal-element levels through a single shared reward-guided mechanism. At the Inter-Reliability level, Stream-R1 rescales each rollout's loss by an exponential of a pretrained video reward score, so that rollouts with reliable supervision dominate optimization. At the Intra-Perplexity level, it back-propagates the same reward model to extract per-pixel gradient saliency, which is factored into spatial and temporal weights that concentrate optimization pressure on regions and frames where refinement yields the largest expected gain. An adaptive balancing mechanism prevents any single quality axis from dominating across visual quality, motion quality, and text alignment. Stream-R1 attains consistent improvements on all three dimensions over distillation baselines on standard streaming video generation benchmarks, without architectural modification or additional inference cost.

preprint2026arXiv

Stream-T1: Test-Time Scaling for Streaming Video Generation

While Test-Time Scaling (TTS) offers a promising direction to enhance video generation without the surging costs of training, current test-time video generation methods based on diffusion models suffer from exorbitant candidate exploration costs and lack temporal guidance. To address these structural bottlenecks, we propose shifting the focus to streaming video generation. We identify that its chunk-level synthesis and few denoising steps are intrinsically suited for TTS, significantly lowering computational overhead while enabling fine-grained temporal control. Driven by this insight, we introduced Stream-T1, a pioneering comprehensive TTS framework exclusively tailored for streaming video generation. Specifically, Stream-T1 is composed of three units: (1) Stream -Scaled Noise Propagation, which actively refines the initial latent noise of the generating chunk using historically proven, high-quality previous chunk noise, effectively establishes temporal dependency and utilizing the historical Gaussian prior to guide the current generation; (2) Stream -Scaled Reward Pruning, which comprehensively evaluates generated candidates to strike an optimal balance between local spatial aesthetics and global temporal coherence by integrating immediate short-term assessments with sliding-window-based long-term evaluations; (3) Stream-Scaled Memory Sinking, which dynamically routes the context evicted from KV-cache into distinct updating pathways guided by the reward feedback, ensuring that previously generated visual information effectively anchors and guides the subsequent video stream. Evaluated on both 5s and 30s comprehensive video benchmarks, Stream-T1 demonstrates profound superiority, significantly improving temporal consistency, motion smoothness, and frame-level visual quality.

preprint2022arXiv

Electric-Field-Induced Coherent Control of Nitrogen Vacancy Centers

Enabling scalable and energy-efficient control of spin defects in solid-state media is desirable for realizing transformative quantum information technologies. Exploiting voltage-controlled magnetic anisotropy, we report coherent manipulation of nitrogen-vacancy (NV) centers by the spatially confined magnetic stray fields produced by a proximate resonant magnetic tunnel junction (MTJ). Remarkably, the coherent coupling between NV centers and the MTJ can be systematically controlled by a DC bias voltage, allowing for appreciable electrical tunability in the presented hybrid system. In comparison with current state-of-the-art techniques, the demonstrated NV-based quantum operational platform exhibits significant advantages in scalability, device compatibility, and energy-efficiency, further expanding the role of NV centers in a broad range of quantum computing, sensing, and communications applications.

preprint2022arXiv

Quantum sensing and imaging of spin-orbit-torque-driven spin dynamics in noncollinear antiferromagnet Mn3Sn

Novel noncollinear antiferromagnets with spontaneous time-reversal symmetry breaking, nontrivial band topology, and unconventional transport properties have received immense research interest over the past decade due to their rich physics and enormous promise in technological applications. One of the central focuses in this emerging field is exploring the relationship between the microscopic magnetic structure and exotic material properties. Here, the nanoscale imaging of both spin-orbit-torque-induced deterministic magnetic switching and chiral spin rotation in noncollinear antiferromagnet Mn3Sn films using nitrogen-vacancy (NV) centers is reported. Direct evidence of the off-resonance dipole-dipole coupling between the spin dynamics in Mn3Sn and proximate NV centers is also demonstrated with NV relaxometry measurements. These results demonstrate the unique capabilities of NV centers in accessing the local information of the magnetic order and dynamics in these emergent quantum materials and suggest new opportunities for investigating the interplay between topology and magnetism in a broad range of topological magnets.

preprint2021arXiv

Quantum Imaging of Magnetic Phase Transitions and Spin Fluctuations in Intrinsic Magnetic Topological Nanoflakes

Topological materials featuring exotic band structures, unconventional current flow patterns, and emergent organizing principles offer attractive platforms for the development of next-generation transformative quantum electronic technologies. The family of MnBi2Te4 (Bi2Te3)n materials is naturally relevant in this context due to their nontrivial band topology, tunable magnetism, and recently discovered extraordinary quantum transport behaviors. Despite numerous pioneering studies, to date, the local magnetic properties of MnBi2Te4 (Bi2Te3)n remain an open question, hindering a comprehensive understanding of their fundamental material properties. Exploiting nitrogen-vacancy (NV) centers in diamond, we report nanoscale quantum imaging of magnetic phase transitions and spin fluctuations in exfoliated MnBi2Te4 (Bi2Te3)n flakes, revealing the underlying spin transport physics and magnetic domains at the nanoscale. Our results highlight the unique advantage of NV centers in exploring the magnetic properties of emergent quantum materials, opening new opportunities for investigating the interplay between topology and magnetism.

preprint2021arXiv

Wide Field Imaging of van der Waals Ferromagnet Fe3GeTe2 by Spin Defects in Hexagonal Boron Nitride

Emergent color centers with accessible spins hosted by van der Waals materials have attracted substantial interest in recent years due to their significant potential for implementing transformative quantum sensing technologies. Hexagonal boron nitride (hBN) is naturally relevant in this context due to its remarkable ease of integration into devices consisting of low-dimensional materials. Taking advantage of boron vacancy spin defects in hBN, we report nanoscale quantum imaging of low-dimensional ferromagnetism sustained in Fe3GeTe2/hBN van der Waals heterostructures. Exploiting spin relaxometry methods, we have further observed spatially varying magnetic fluctuations in the exfoliated Fe3GeTe2 flake, whose magnitude reaches a peak value around the Curie temperature. Our results demonstrate the capability of spin defects in hBN of investigating local magnetic properties of layered materials in an accessible and precise way, which can be extended readily to a broad range of miniaturized van der Waals heterostructure systems.

preprint2020arXiv

Quantum Sensing of Spin Transport Properties of an Antiferromagnetic Insulator

Antiferromagnetic insulators (AFIs) are of significant interest due to their potential to develop next-generation spintronic devices. One major effort in this emerging field is to harness AFIs for long-range spin information communication and storage. Here, we report a non-invasive method to optically access the intrinsic spin transport properties of an archetypical AFI α-Fe2O3 via nitrogen-vacancy (NV) quantum spin sensors. By NV relaxometry measurements, we successfully detect the time-dependent fluctuations of the longitudinal spin density of α-Fe2O3. The observed frequency dependence of the NV relaxation rate is in agreement with a theoretical model, from which an intrinsic spin diffusion constant of α-Fe2O3 is experimentally measured in the absence of external spin biases. Our results highlight the significant opportunity offered by NV centers in diagnosing the underlying spin transport properties in a broad range of high-frequency magnetic materials, which are challenging to access by more conventional measurement techniques.