Researcher profile

Zhe Cui

Zhe Cui contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
7 works
0 followers
8 topics
4 close collaborators

Published work

7 published items

preprint · 2026 · arXiv

GUITester: Enabling GUI Agents for Exploratory Defect Discovery

Exploratory GUI testing is essential for software quality but suffers from high manual costs. While Multi-modal Large Language Model (MLLM) agents excel in navigation, they fail to autonomously discover defects due to two core challenges: Goal-Oriented Masking, where agents prioritize task completion over reporting anomalies, and Execution-Bias Attribution, where system defects are misidentified as agent errors. To address these, we first introduce GUITestBench, the first interactive benchmark for this task, featuring 143 tasks across 26 defects. We then propose GUITester, a multi-agent framework that decouples navigation from verification via two modules: (i) a Planning-Execution Module (PEM) that proactively probes for defects via embedded testing intents, and (ii) a Hierarchical Reflection Module (HRM) that resolves attribution ambiguity through interaction history analysis. GUITester achieves an F1-score of 48.90% (Pass@3) on GUITestBench, outperforming state-of-the-art baselines (33.35%). Our work demonstrates the feasibility of autonomous exploratory testing and provides a robust foundation for future GUI quality assurance. Our code is available at https://github.com/ADaM-BJTU/GUITestBench.
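
For readers unfamiliar with the metric quoted in this abstract, the F1-score is the harmonic mean of precision and recall. The sketch below computes it from defect-report counts. The function name and the example counts are illustrative assumptions; they are not taken from GUITestBench, and the benchmark's Pass@3 aggregation is not modeled here.

```python
def f1_score(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Harmonic mean of precision and recall; returns 0.0 when undefined."""
    predicted = true_positives + false_positives
    actual = true_positives + false_negatives
    if predicted == 0 or actual == 0:
        return 0.0
    precision = true_positives / predicted
    recall = true_positives / actual
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts: 10 correctly reported defects, 5 false alarms, 15 missed defects.
print(f"{f1_score(10, 5, 15):.2f}")  # prints 0.50
```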

preprint · 2026 · arXiv

Visual Merit or Linguistic Crutch? A Close Look at DeepSeek-OCR

DeepSeek-OCR utilizes an optical 2D mapping approach to achieve high-ratio vision-text compression, claiming to decode text tokens exceeding ten times the input visual tokens. While this suggests a promising solution for the LLM long-context bottleneck, we investigate a critical question: "Visual merit or linguistic crutch - which drives DeepSeek-OCR's performance?" By employing sentence-level and word-level semantic corruption, we isolate the model's intrinsic OCR capabilities from its language priors. Results demonstrate that without linguistic support, DeepSeek-OCR's performance plummets from approximately 90% to 20%. Comparative benchmarking against 13 baseline models reveals that traditional pipeline OCR methods exhibit significantly higher robustness to such semantic perturbations than end-to-end methods. Furthermore, we find that lower visual token counts correlate with increased reliance on priors, exacerbating hallucination risks. Context stress testing also reveals a total model collapse around 10,000 text tokens, suggesting that current optical compression techniques may paradoxically aggravate the long-context bottleneck. This study empirically defines DeepSeek-OCR's capability boundaries and offers essential insights for future optimizations of the vision-text compression paradigm. We release all data, results and scripts used in this study at https://github.com/dududuck00/DeepSeekOCR.
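
The abstract does not spell out how the sentence-level and word-level corruption is performed. As a rough illustration only, one plausible way to strip linguistic cues while keeping the same glyphs on the page is to shuffle letters within words or words within a sentence. Both functions below are hypothetical and are not the authors' released scripts.

```python
import random


def corrupt_words(text: str, seed: int = 0) -> str:
    """Shuffle the letters inside each word so most words become unreadable."""
    rng = random.Random(seed)
    corrupted = []
    for word in text.split():
        letters = list(word)
        rng.shuffle(letters)
        corrupted.append("".join(letters))
    return " ".join(corrupted)


def corrupt_sentence(text: str, seed: int = 0) -> str:
    """Shuffle word order so each word stays intact but the sentence does not."""
    rng = random.Random(seed)
    words = text.split()
    rng.shuffle(words)
    return " ".join(words)


sample = "optical compression relies on language priors"
print(corrupt_words(sample))
print(corrupt_sentence(sample))
```

Because the character inventory is unchanged, a model that reads pixels faithfully should transcribe the corrupted text about as well as the original, while a model leaning on language priors should degrade.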

preprint · 2022 · arXiv

Lodestar: Supporting Independent Learning and Rapid Experimentation Through Data-Driven Analysis Recommendations

Keeping abreast of current trends, technologies, and best practices in visualization and data analysis is becoming increasingly difficult, especially for fledgling data scientists. In this paper, we propose Lodestar, an interactive computational notebook that allows users to quickly explore and construct new data science workflows by selecting from a list of automated analysis recommendations. We derive our recommendations from directed graphs of known analysis states, with two input sources: one manually curated from online data science tutorials, and another extracted through semi-automatic analysis of a corpus of over 6,000 Jupyter notebooks. We evaluate Lodestar in a formative study guiding our next set of improvements to the tool. Our results suggest that users find Lodestar useful for rapidly creating data science workflows.
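
The idea of recommending next steps from a directed graph of analysis states can be sketched as follows. This is a minimal illustration under assumptions: the step names, the frequency-based ranking, and both function names are hypothetical, and the abstract does not describe Lodestar's actual ranking logic.

```python
from collections import Counter


def build_transition_graph(workflows):
    """Count how often each analysis step is directly followed by another."""
    graph = {}
    for steps in workflows:
        for current, following in zip(steps, steps[1:]):
            graph.setdefault(current, Counter())[following] += 1
    return graph


def recommend_next(graph, current_step, k=3):
    """Return up to k of the most frequent successors of current_step."""
    successors = graph.get(current_step, Counter())
    return [step for step, _ in successors.most_common(k)]


# Hypothetical workflows, e.g. step sequences extracted from notebooks.
workflows = [
    ["load_csv", "describe", "plot_histogram"],
    ["load_csv", "describe", "drop_na"],
    ["load_csv", "plot_histogram"],
    ["load_csv", "describe", "plot_histogram"],
]
graph = build_transition_graph(workflows)
print(recommend_next(graph, "describe"))  # prints ['plot_histogram', 'drop_na']
```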

preprint · 2022 · arXiv

Multi-similarity based Hyperrelation Network for few-shot segmentation

Few-shot semantic segmentation aims at recognizing the object regions of unseen categories with only a few annotated examples as supervision. The key to few-shot segmentation is to establish a robust semantic relationship between the support and query images and to prevent overfitting. In this paper, we propose an effective Multi-similarity Hyperrelation Network (MSHNet) to tackle the few-shot semantic segmentation problem. In MSHNet, we propose a new Generative Prototype Similarity (GPS), which together with cosine similarity can establish a strong semantic relation between the support and query images. The locally generated prototype similarity based on global feature is logically complementary to the global cosine similarity based on local feature, and the relationship between the query image and the support image can be expressed more comprehensively by using the two similarities simultaneously. In addition, we propose a Symmetric Merging Block (SMB) in MSHNet to efficiently merge multi-layer, multi-shot and multi-similarity hyperrelational features. MSHNet is built on the basis of similarity rather than specific category features, which can achieve more general unity and effectively reduce overfitting. On two benchmark semantic segmentation datasets, Pascal-5i and COCO-20i, MSHNet achieves new state-of-the-art performances on 1-shot and 5-shot semantic segmentation tasks.
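
To make the cosine-similarity ingredient concrete, the sketch below builds a class prototype from masked support features and scores each query position against it. This is a common few-shot segmentation baseline shown under assumptions: it is not the paper's Generative Prototype Similarity or Symmetric Merging Block, and the toy feature vectors and function names are hypothetical.

```python
import math


def masked_average(features, mask):
    """Average the feature vectors at positions where the mask is 1."""
    selected = [f for f, m in zip(features, mask) if m == 1]
    if not selected:
        raise ValueError("mask selects no positions")
    dim = len(selected[0])
    return [sum(f[d] for f in selected) / len(selected) for d in range(dim)]


def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def similarity_map(query_features, prototype):
    """Score every query position against the support prototype."""
    return [cosine(q, prototype) for q in query_features]


# Toy 2-D features: the first two support positions belong to the object.
support = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
prototype = masked_average(support, mask=[1, 1, 0])
scores = similarity_map([[1.0, 0.0], [0.0, 1.0]], prototype)
print([round(s, 3) for s in scores])
```

Thresholding such a score map gives a coarse foreground estimate for the query image.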

preprint · 2021 · arXiv

A Study of Magnetized White Dwarf + Helium Star Binary Evolution to Type Ia Supernovae

The white dwarf (WD) + helium (He) star binary channel plays an important role in the single degenerate scenario for the progenitors of type Ia supernovae (SNe Ia). Previous studies on the WD + main sequence star evolution have shown that the magnetic fields of WDs may significantly influence their accretion and nuclear burning processes. In this work we focus on the evolution of magnetized WD + He star binaries with detailed stellar evolution and binary population synthesis (BPS) calculations. In the case of magnetized WDs, the magnetic fields may disrupt the inner regions of the accretion disk, funnel the accretion flow onto the polar caps, and even confine helium burning within the caps. We find that, for WDs with sufficiently strong magnetic fields, the parameter space of the potential SN Ia progenitor systems shrinks toward shorter orbital periods and lower donor masses compared with that in the non-magnetized WD case. The reason is that the magnetic confinement usually works with relatively high mass transfer rates, which can trigger strong wind mass loss from the WD, thus limiting the He-rich mass accumulation efficiency. The surviving companion stars are likely of low mass at the moment of the SN explosions, which can be regarded as a possible explanation for the non-detection of surviving companions after the SNe or inside the SN remnants. However, the corresponding birthrate of Galactic SNe Ia in our high-magnetic models is estimated to be ~(0.08-0.13) × 10^-3 yr^-1 (~(0.17-0.28) × 10^-3 yr^-1 for the non-magnetic models), significantly lower than the observed Galactic SN Ia birthrate.

preprint · 2021 · arXiv

Are there magnetars in high-mass X-ray binaries?

Magnetars form a special population of neutron stars with strong magnetic fields and long spin periods. The roughly 30 magnetars and magnetar candidates currently known are probably isolated, but the possibility that magnetars exist in binaries has not been excluded. In this work, we model the spin evolution of neutron stars with different magnetic fields in wind-fed high-mass X-ray binaries and compare the resulting spin period distribution with observations, aiming to find magnetars in binaries. Our simulation shows that some of the neutron stars, namely those that have long spin periods or are in wide-separation systems, need strong magnetic fields to explain their spin evolution. This implies that there are probably magnetars in high-mass X-ray binaries. Moreover, this can provide a theoretical basis for some unclear astronomical phenomena, such as the possible origin of periodic fast radio bursts from magnetars in binary systems.

preprint · 2020 · arXiv

Dense Registration and Mosaicking of Fingerprints by Training an End-to-End Network

Dense registration of fingerprints is a challenging task due to elastic skin distortion, low image quality, and self-similarity of ridge patterns. To overcome the limitations of handcrafted features, we propose to train an end-to-end network to directly output the pixel-wise displacement field between two fingerprints. The proposed network includes a siamese network for feature embedding, followed by an encoder-decoder network for regressing the displacement field. By applying displacement fields reliably estimated by tracing high quality fingerprint videos to challenging fingerprints, we synthesize a large number of training fingerprint pairs with ground truth displacement fields. In addition, based on the proposed registration algorithm, we propose a fingerprint mosaicking method based on optimal seam selection. Registration and matching experiments on FVC2004 databases, Tsinghua Distorted Fingerprint (TDF) database, and NIST SD27 latent fingerprint database show that our registration method outperforms previous dense registration methods in accuracy and efficiency. A mosaicking experiment on FVC2004 DB1 demonstrates that the proposed algorithm produces higher quality fingerprints than other algorithms, which also validates the performance of our registration algorithm.
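
A pixel-wise displacement field assigns an offset to every pixel, and registration then resamples one image through that field. The sketch below shows this on a tiny grid, assuming a backward-warping convention with nearest-neighbor sampling. The convention, the function name, and the toy data are assumptions for illustration; the abstract does not specify how the network's output is applied.

```python
def warp_image(image, dx, dy, fill=0):
    """Backward warp: output[y][x] samples image at (x + dx[y][x], y + dy[y][x])."""
    height, width = len(image), len(image[0])
    output = [[fill] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            # Round to the nearest source pixel (nearest-neighbor sampling).
            src_x = round(x + dx[y][x])
            src_y = round(y + dy[y][x])
            if 0 <= src_x < width and 0 <= src_y < height:
                output[y][x] = image[src_y][src_x]
    return output


# Toy 2x3 image and a uniform field that samples one pixel to the right.
image = [[1, 2, 3],
         [4, 5, 6]]
dx = [[1, 1, 1], [1, 1, 1]]
dy = [[0, 0, 0], [0, 0, 0]]
print(warp_image(image, dx, dy))  # prints [[2, 3, 0], [5, 6, 0]]
```

A real dense field varies from pixel to pixel, which is what lets it model elastic skin distortion rather than a single rigid shift.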