Researcher profile

Xiyu Wang

Xiyu Wang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2026arXiv

Early Semantic Grounding in Image Editing Models for Zero-Shot Referring Image Segmentation

Instruction-based image editing (IIE) models have recently demonstrated strong capability in modifying specific image regions according to natural language instructions, which implicitly requires identifying where an edit should be applied. This indicates that such models inherently perform language-conditioned visual semantic grounding. In this work, we investigate whether this implicit grounding can be leveraged for zero-shot referring image segmentation (RIS), a task that requires pixel-level localization of objects described by natural language expressions. Through systematic analysis, we reveal that strong foreground-background separability emerges in the internal representations of these models at the earliest denoising timestep, well before any visible image transformation occurs. Building on this insight, we propose a training-free framework that repurposes pretrained image editing models for RIS by exploiting their intermediate representations. Our approach decomposes localization into two complementary components: attention-based spatial priors that estimate where to focus, and feature-based semantic discrimination that determines what to segment. By leveraging feature-space separability, the framework produces accurate segmentation masks using only a single denoising step, without requiring full image synthesis. Extensive experiments on RefCOCO, RefCOCO+, and RefCOCOg demonstrate that our method achieves superior performance over existing zero-shot baselines.

preprint2023arXiv

Async-fork: Mitigating Query Latency Spikes Incurred by the Fork-based Snapshot Mechanism from the OS Level

In-memory key-value stores (IMKVSes) serve many online applications because of their efficiency. To support data backup, popular industrial IMKVSes periodically take a point-in-time snapshot of the in-memory data with the system call fork. However, this mechanism can result in latency spikes for queries arriving during the snapshot period because fork leads the engine into the kernel mode in which the engine is out-of-service for queries. In contrast to existing research focusing on optimizing snapshot algorithms, we optimize the fork operation to address the latency spikes problem from the operating system (OS) level, while keeping the data persistent mechanism in IMKVSes unchanged. Specifically, we first conduct an in-depth study to reveal the impact of the fork operation as well as the optimization techniques on query latency. Based on findings in the study, we propose Async-fork to offload the work of copying the page table from the engine (the parent process) to the child process as copying the page table dominates the execution time of fork. To keep data consistent between the parent and the child, we design the proactive synchronization strategy. Async-fork is implemented in the Linux kernel and deployed into the online Redis database in public clouds. Our experiment results show that compared with the default fork method in OS, Async-fork reduces the tail latency of queries arriving during the snapshot period by 81.76% on an 8GB instance and 99.84% on a 64GB instance.

preprint2022arXiv

Ambient backscatter communications using LTE cell specific reference signals

Long Term Evolution (LTE) systems provide ubiquitous coverage for mobile communications, which makes it a promising candidate to be used as a signal source in the ambient backscatter communications. In this paper, we propose a system in which a backscatter device modulates the ambient LTE signal by changing its reflection coefficient and the receiver uses the LTE Cell Specific Reference Signals (CRS) to estimate the channel and demodulates the backscattered signal from the obtained channel impulse response estimates. We first outline the overall system, discuss the receiver operation, and then provide experimental evidence on the practicality of the proposed system.

preprint2022arXiv

Calibrating Class Weights with Multi-Modal Information for Partial Video Domain Adaptation

Assuming the source label space subsumes the target one, Partial Video Domain Adaptation (PVDA) is a more general and practical scenario for cross-domain video classification problems. The key challenge of PVDA is to mitigate the negative transfer caused by the source-only outlier classes. To tackle this challenge, a crucial step is to aggregate target predictions to assign class weights by up-weighing target classes and down-weighing outlier classes. However, the incorrect predictions of class weights can mislead the network and lead to negative transfer. Previous works improve the class weight accuracy by utilizing temporal features and attention mechanisms, but these methods may fall short when trying to generate accurate class weight when domain shifts are significant, as in most real-world scenarios. To deal with these challenges, we propose the Multi-modality Cluster-calibrated partial Adversarial Network (MCAN). MCAN enhances video feature extraction with multi-modal features from multiple temporal scales to form more robust overall features. It utilizes a novel class weight calibration method to alleviate the negative transfer caused by incorrect class weights. The calibration method tries to identify and weigh correct and incorrect predictions using distributional information implied by unsupervised clustering. Extensive experiments are conducted on prevailing PVDA benchmarks, and the proposed MCAN achieves significant improvements when compared to state-of-the-art PVDA methods.

preprint2022arXiv

Parametric Euler Sums of Harmonic Numbers

We define a parametric variant of generalized Euler sums and construct contour integration to give some explicit evaluations of these parametric Euler sums. In particular, we establish several explicit formulas of (Hurwitz) zeta functions, linear and quadratic parametric Euler sums. Furthermore, we also give an explicit evaluation of alternating double zeta values $\ze(\overline{2j},2m+1)$ in terms of a combination of alternating Riemann zeta values by using the parametric Euler sums.

preprint2020arXiv

Coherent Multi-antenna Receiver for BPSK-modulated Ambient Backscatter Tags

Ambient Backscatter Communication (AmBC) is an emerging communication technology that can enable green Internet-of-Things deployments. The widespread acceptance of this paradigm is limited by low Signal-to-Interference-Plus-Noise Ratio (SINR) of the signal impinging on the receiver antenna due to the strong direct path interference and unknown ambient signal. The adverse impact of these two factors can be mitigated by using non-coherent multi-antenna receivers, which is known to require higher SINR to reach Bit-Error-Rate (BER) performance of coherent receivers. However, in literature, coherent receivers for AmBC systems are little-studied because of unknown ambient signal, unknown location of AmBC tags, and varying channel conditions. In this paper, a coherent multi-antenna receiver, which does not require a prior information of the ambient signal, for decoding Binary-Phase-shift-Keying (BPSK) modulated signal is presented. The performance of the proposed receiver is compared with the ideal coherent receiver that has a perfect phase information, and also with the performance of non-coherent receiver, which assumes distributions for ambient signal and phase offset caused by excess length of the backscatter path. Comparative simulation results show the designed receiver can achieve the same BER-performance of the ideal coherent receiver with 1-dB more SINR, which corresponds to 5-dB or more gain with respect to non-coherent reception of On-Off-Keying modulated signals. Variation of the detection performance with the tag location shows that the coverage area is in the close vicinity of the transmitter and a larger region around the receiver, which is consistent with the theoretical results.

preprint2020arXiv

Optimum Multi-Antenna Ambient Backscatter Receiver for General Binary-Modulated Signal

Ambient backscatter communication (AmBC) is becoming increasingly popular for enabling green communication amidst the continual development of the Internet-of-things paradigm. Efforts have been put into backscatter signal detection as the detection performance is limited by the low signal-to-interference-plus-noise ratio (SINR) of the signal at the receiver. The low SINR can be improved by adopting a multi-antenna receiver. In this paper, the optimum multi-antenna receiver that does not impose any constraints on the types of binary modulation performed by the backscatter device and the waveform used by the ambient source system is studied. The proposed receiver owns a simple structure formed by two beamformers. Bit error rate (BER) performances of the optimum receiver are derived under constant-amplitude ambient signal and Gaussian-distributed ambient signal. Moreover, to facilitate the implementation of the optimum receiver, a simplified receiver is proposed and practical approximations to required beamformers are provided. The derived optimum receiver avoids the complex direct path interference cancellation and coherent reception, but exploits the fact that backscatter signal changes the composite channel impinging at the receiver and the directivity of receiver antenna array. Comparative simulation results show that the performance of the optimum receiver achieves the same performance as the coherent receiver even though it realizes non-coherent reception. The studied receivers provide high flexibility for implementing simple and low-cost receivers in different AmBC systems.