Source author record

Haibo Qiu

Haibo Qiu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.quant-gas Artificial Intelligence Computer Vision nlin.PS quant-ph Computation and Language nlin.CD

Catalog footprint

What is connected

9works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

MobileDreamer: Generative Sketch World Model for GUI Agent

Mobile GUI agents have shown strong potential in real-world automation and practical applications. However, most existing agents remain reactive, making decisions mainly from current screen, which limits their performance on long-horizon tasks. Building a world model from repeated interactions enables forecasting action outcomes and supports better decision making for mobile GUI agents. This is challenging because the model must predict post-action states with spatial awareness while remaining efficient enough for practical deployment. In this paper, we propose MobileDreamer, an efficient world-model-based lookahead framework to equip the GUI agents based on the future imagination provided by the world model. It consists of textual sketch world model and rollout imagination for GUI agent. Textual sketch world model forecasts post-action states through a learning process to transform digital images into key task-related sketches, and designs a novel order-invariant learning strategy to preserve the spatial information of GUI elements. The rollout imagination strategy for GUI agent optimizes the action-selection process by leveraging the prediction capability of world model. Experiments on Android World show that MobileDreamer achieves state-of-the-art performance and improves task success by 5.25%. World model evaluations further verify that our textual sketch modeling accurately forecasts key GUI elements.

preprint2026arXiv

Seirênes: Adversarial Self-Play with Evolving Distractions for LLM Reasoning

We present Seirênes, a self-play RL framework that transforms contextual interference from a failure mode of LLM reasoning into an internal training signal for co-evolving more resilient reasoners. While RL with verifiable rewards has significantly advanced reasoning capabilities, models can still exhibit fragility when encountering non-idealized contexts: scenarios characterized by superfluous information, tangential instructions, or incidental correlations that differ from the clean distributions typical of standard benchmarks. Seirênes harnesses this vulnerability through a parameter-shared and adversarial self-play loop. Within this framework, a single model is trained to both construct plausible yet distracting contexts that expose its own reasoning blind spots, and solve problems by discerning the essential task from these perturbations to recover the core underlying logic. By pitting these competing objectives against each other, Seirênes compels the model to move beyond superficial pattern matching and anchors its capabilities in robust underlying reasoning. This continuous interaction sustains an informative co-evolutionary curriculum as the model improves. Across seven mathematical reasoning benchmarks and model scales from 4B to 30B, Seirênes achieves average gains of +10.2, +9.1, and +7.2 points. Besides, distracting contexts produced by the 4B Seirênes model reduce the accuracy of top-tier closed-source models (GPT and Gemini) by roughly 4--5 points, revealing Seirênes' general ability to uncover reasoning models' blind spots.

preprint2025arXiv

Reading or Reasoning? Format Decoupled Reinforcement Learning for Document OCR

Reading text from images or scanned documents via OCR models has been a longstanding focus of researchers. Intuitively, text reading is perceived as a straightforward perceptual task, and existing work primarily focuses on constructing enriched data engineering to enhance SFT capabilities. In this work, we observe that even advanced OCR models exhibit significantly higher entropy in formatted text (\emph{e.g.}, formula, table, etc.) compared to plain text, often by an order of magnitude. These statistical patterns reveal that advanced OCR models struggle with high output uncertainty when dealing with format sensitive document, suggesting that reasoning over diverse reading pathways may improve OCR performance. To address this, we propose format decoupled reinforcement learning (FD-RL), which leverages high-entropy patterns for targeted optimization. Our approach employs entropy-based data filtration strategy to identify format-intensive instances, and adopt format decoupled rewards tailored to different format types, enabling format-level validation rather than token-level memorization. FD-RL achieves an average score of 90.41 on OmniDocBench, setting a new record for end-to-end models on this highly popular benchmark. More importantly, we conduct comprehensive ablation studies over data, training, filtering, and rewarding strategies, thoroughly validating their effectiveness.

preprint2022arXiv

End2End Occluded Face Recognition by Masking Corrupted Features

With the recent advancement of deep convolutional neural networks, significant progress has been made in general face recognition. However, the state-of-the-art general face recognition models do not generalize well to occluded face images, which are exactly the common cases in real-world scenarios. The potential reasons are the absences of large-scale occluded face data for training and specific designs for tackling corrupted features brought by occlusions. This paper presents a novel face recognition method that is robust to occlusions based on a single end-to-end deep neural network. Our approach, named FROM (Face Recognition with Occlusion Masks), learns to discover the corrupted features from the deep convolutional neural networks, and clean them by the dynamically learned masks. In addition, we construct massive occluded face images to train FROM effectively and efficiently. FROM is simple yet powerful compared to the existing methods that either rely on external detectors to discover the occlusions or employ shallow models which are less discriminative. Experimental results on the LFW, Megaface challenge 1, RMF2, AR dataset and other simulated occluded/masked datasets confirm that FROM dramatically improves the accuracy under occlusions, and generalizes well on general face recognition. Code is available at https://github.com/haibo-qiu/FROM

preprint2021arXiv

Stripe and junction-vortex phases in linearly coupled Bose-Einstein condensates

Soon after its theoretical prediction, striped-density states in the presence of synthetic spin-orbit coupling were realized in Bose-Einstein condensates of ultracold neutral atoms [J.-R. Li et al., Nature \textbf{543}, 91 (2017)]. The achievement opens avenues to explore the interplay of superfluidity and crystalline order in the search for supersolid features and materials. The system considered is essentially made of two linearly coupled Bose-Einstein condensates, that is a pseudo-spin-$1/2$ system, subject to a spin-dependent gauge field $σ_z \hbar k_\ell$. Under these conditions the stripe phase is achieved when the linear coupling $\hbarΩ/2$ is small against the gauge energy $mΩ/\hbar k_\ell^2<1$ . The resulting density stripes have been interpreted as a standing-wave, interference pattern with approximate wavenumber $2k_\ell$. Here, we show that the emergence of the stripe phase is induced by an array of Josephson vortices living in the junction defined by the linear coupling. As happens in superconducting junctions subject to external magnetic fields, a vortex array is the natural response of the superfluid system to the presence of a gauge field. Also similar to superconductors, the Josephson currents and their associated vortices can be present as a metastable state in the absence of gauge field. We provide closed-form solutions to the 1D mean field equations that account for such vortex arrays. The underlying Josephson currents coincide with the analytical solutions to the sine-Gordon equation for the relative phase of superconducting junctions [C. Owen and D. Scalapino, Phys. Rev. \textbf{164}, 538 (1967)].

preprint2020arXiv

Unlocked-relative-phase states in arrays of Bose-Einstein condensates

Phase engineering techniques are used to control the dynamics of long-bosonic-Josephson-junction arrays built by linearly coupling Bose-Einstein condensates. Just at the middle point of the underlying discrete energy band of the system, unlocked-relative-phase states are shown to be stationary along with the locked-relative-phase Bloch waves. In finite, experimentally-feasible systems, such states find ranges of dynamical stability that depend on the ratio of coupling to interaction energy. The same ratio determines different decay regimes, which include the recurrence of staggered-soliton trains in the condensates around Josephson loop currents at the junctions. These transient solitons are also found in their stationary configurations, which provide striped-density states by means of either dark-soliton or bright-soliton trains. Additionally, the preparation of maximally out-of-phase (or splay) states is demonstrated to evolve into an oscillation of the uniform density of the condensates that keeps constant the total density of the system and robust against noise at low coupling.

preprint2015arXiv

Hybrid synchronization in coupled ultracold atomic gases

We study the time evolution of two coupled many-body quantum systems one of which is assumed to be Bose condensed. Specifically, we consider two ultracold atomic clouds populating each two localized single-particle states, i.e. a two-component Bosonic Josephson junction. The cold atoms cloud can retain its coherence when coupled to the condensate and displays synchronization with the latter, differing from usual entrainment. We term this effect among the ultracold and the condensed clouds as {\it hybrid synchronization}. The onset of synchronization, which we observe in the evolution of average properties of both gases when increasing their coupling, is found to be related to the many-body properties of the quantum gas, e.g. condensed fraction, quantum fluctuations of the particle number differences. We discuss the effects of different initial preparations, the influence of unequal particle numbers for the two clouds, and explore the dependence on the initial quantum state, e.g. coherent state, squeezed state and Fock state, finding essentially the same phenomenology in all cases.

preprint2014arXiv

Measure synchronization in quantum many-body systems

The concept of measure synchronization between two coupled quantum many-body systems is presented. In general terms we consider two quantum many-body systems whose dynamics gets coupled through the contact particle-particle interaction. This coupling is shown to produce measure synchronization, a generalization of synchrony to a large class of systems which takes place in absence of dissipation. We find that in quantum measure synchronization, the many-body quantum properties for the two subsystems, e.g. condensed fractions and particle fluctuations, behave in a coordinated way. To illustrate the concept we consider a simple case of two species of bosons occupying two distinct quantum states. Measure synchronization can be readily explored with state-of-the-art techniques in ultracold atomic gases and, if propertly controlled, be employed to share quantum correlations between different degrees of freedom.

preprint2013arXiv

Measure synchronization in a two-species bosonic Josephson junction

Measure synchronization (MS) in a two-species bosonic Josephson junction (BJJ) is studied based on semi-classical theory. Six different scenarios for MS, including two in the Josephson oscillation regime (0 phase mode) and four in the self-trapping regime ($π$ phase mode), have been clearly shown. Systematic investigations of the common features behind these different scenarios have been performed. We show that the average energies of the two species merge at the MS transition point. The scaling of the power law near the MS transition has been verified, and the critical exponent is 1/2 for all of the different scenarios for MS. We also illustrate MS in a three-dimensional phase space; from this illustration, more detailed information on the dynamical process can be obtained. Particularly, by analyzing the Poincare sections with changing interspecies interactions, we find that the two-species BJJ exhibits separatrix crossing behavior at MS transition point, and such behavior depicts the general mechanism behind the different scenarios for the MS transitions. The new critical behavior found in a two-species BJJ is expected to be found in real systems of atomic Bose gases.