Researcher profile

Haibo Qiu

Haibo Qiu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2026arXiv

MobileDreamer: Generative Sketch World Model for GUI Agent

Mobile GUI agents have shown strong potential in real-world automation and practical applications. However, most existing agents remain reactive, making decisions mainly from current screen, which limits their performance on long-horizon tasks. Building a world model from repeated interactions enables forecasting action outcomes and supports better decision making for mobile GUI agents. This is challenging because the model must predict post-action states with spatial awareness while remaining efficient enough for practical deployment. In this paper, we propose MobileDreamer, an efficient world-model-based lookahead framework to equip the GUI agents based on the future imagination provided by the world model. It consists of textual sketch world model and rollout imagination for GUI agent. Textual sketch world model forecasts post-action states through a learning process to transform digital images into key task-related sketches, and designs a novel order-invariant learning strategy to preserve the spatial information of GUI elements. The rollout imagination strategy for GUI agent optimizes the action-selection process by leveraging the prediction capability of world model. Experiments on Android World show that MobileDreamer achieves state-of-the-art performance and improves task success by 5.25%. World model evaluations further verify that our textual sketch modeling accurately forecasts key GUI elements.

preprint2026arXiv

Seirênes: Adversarial Self-Play with Evolving Distractions for LLM Reasoning

We present Seirênes, a self-play RL framework that transforms contextual interference from a failure mode of LLM reasoning into an internal training signal for co-evolving more resilient reasoners. While RL with verifiable rewards has significantly advanced reasoning capabilities, models can still exhibit fragility when encountering non-idealized contexts: scenarios characterized by superfluous information, tangential instructions, or incidental correlations that differ from the clean distributions typical of standard benchmarks. Seirênes harnesses this vulnerability through a parameter-shared and adversarial self-play loop. Within this framework, a single model is trained to both construct plausible yet distracting contexts that expose its own reasoning blind spots, and solve problems by discerning the essential task from these perturbations to recover the core underlying logic. By pitting these competing objectives against each other, Seirênes compels the model to move beyond superficial pattern matching and anchors its capabilities in robust underlying reasoning. This continuous interaction sustains an informative co-evolutionary curriculum as the model improves. Across seven mathematical reasoning benchmarks and model scales from 4B to 30B, Seirênes achieves average gains of +10.2, +9.1, and +7.2 points. Besides, distracting contexts produced by the 4B Seirênes model reduce the accuracy of top-tier closed-source models (GPT and Gemini) by roughly 4--5 points, revealing Seirênes' general ability to uncover reasoning models' blind spots.

preprint2025arXiv

Reading or Reasoning? Format Decoupled Reinforcement Learning for Document OCR

Reading text from images or scanned documents via OCR models has been a longstanding focus of researchers. Intuitively, text reading is perceived as a straightforward perceptual task, and existing work primarily focuses on constructing enriched data engineering to enhance SFT capabilities. In this work, we observe that even advanced OCR models exhibit significantly higher entropy in formatted text (\emph{e.g.}, formula, table, etc.) compared to plain text, often by an order of magnitude. These statistical patterns reveal that advanced OCR models struggle with high output uncertainty when dealing with format sensitive document, suggesting that reasoning over diverse reading pathways may improve OCR performance. To address this, we propose format decoupled reinforcement learning (FD-RL), which leverages high-entropy patterns for targeted optimization. Our approach employs entropy-based data filtration strategy to identify format-intensive instances, and adopt format decoupled rewards tailored to different format types, enabling format-level validation rather than token-level memorization. FD-RL achieves an average score of 90.41 on OmniDocBench, setting a new record for end-to-end models on this highly popular benchmark. More importantly, we conduct comprehensive ablation studies over data, training, filtering, and rewarding strategies, thoroughly validating their effectiveness.

preprint2022arXiv

End2End Occluded Face Recognition by Masking Corrupted Features

With the recent advancement of deep convolutional neural networks, significant progress has been made in general face recognition. However, the state-of-the-art general face recognition models do not generalize well to occluded face images, which are exactly the common cases in real-world scenarios. The potential reasons are the absences of large-scale occluded face data for training and specific designs for tackling corrupted features brought by occlusions. This paper presents a novel face recognition method that is robust to occlusions based on a single end-to-end deep neural network. Our approach, named FROM (Face Recognition with Occlusion Masks), learns to discover the corrupted features from the deep convolutional neural networks, and clean them by the dynamically learned masks. In addition, we construct massive occluded face images to train FROM effectively and efficiently. FROM is simple yet powerful compared to the existing methods that either rely on external detectors to discover the occlusions or employ shallow models which are less discriminative. Experimental results on the LFW, Megaface challenge 1, RMF2, AR dataset and other simulated occluded/masked datasets confirm that FROM dramatically improves the accuracy under occlusions, and generalizes well on general face recognition. Code is available at https://github.com/haibo-qiu/FROM

preprint2021arXiv

Stripe and junction-vortex phases in linearly coupled Bose-Einstein condensates

Soon after its theoretical prediction, striped-density states in the presence of synthetic spin-orbit coupling were realized in Bose-Einstein condensates of ultracold neutral atoms [J.-R. Li et al., Nature \textbf{543}, 91 (2017)]. The achievement opens avenues to explore the interplay of superfluidity and crystalline order in the search for supersolid features and materials. The system considered is essentially made of two linearly coupled Bose-Einstein condensates, that is a pseudo-spin-$1/2$ system, subject to a spin-dependent gauge field $σ_z \hbar k_\ell$. Under these conditions the stripe phase is achieved when the linear coupling $\hbarΩ/2$ is small against the gauge energy $mΩ/\hbar k_\ell^2<1$ . The resulting density stripes have been interpreted as a standing-wave, interference pattern with approximate wavenumber $2k_\ell$. Here, we show that the emergence of the stripe phase is induced by an array of Josephson vortices living in the junction defined by the linear coupling. As happens in superconducting junctions subject to external magnetic fields, a vortex array is the natural response of the superfluid system to the presence of a gauge field. Also similar to superconductors, the Josephson currents and their associated vortices can be present as a metastable state in the absence of gauge field. We provide closed-form solutions to the 1D mean field equations that account for such vortex arrays. The underlying Josephson currents coincide with the analytical solutions to the sine-Gordon equation for the relative phase of superconducting junctions [C. Owen and D. Scalapino, Phys. Rev. \textbf{164}, 538 (1967)].

preprint2020arXiv

Unlocked-relative-phase states in arrays of Bose-Einstein condensates

Phase engineering techniques are used to control the dynamics of long-bosonic-Josephson-junction arrays built by linearly coupling Bose-Einstein condensates. Just at the middle point of the underlying discrete energy band of the system, unlocked-relative-phase states are shown to be stationary along with the locked-relative-phase Bloch waves. In finite, experimentally-feasible systems, such states find ranges of dynamical stability that depend on the ratio of coupling to interaction energy. The same ratio determines different decay regimes, which include the recurrence of staggered-soliton trains in the condensates around Josephson loop currents at the junctions. These transient solitons are also found in their stationary configurations, which provide striped-density states by means of either dark-soliton or bright-soliton trains. Additionally, the preparation of maximally out-of-phase (or splay) states is demonstrated to evolve into an oscillation of the uniform density of the condensates that keeps constant the total density of the system and robust against noise at low coupling.