Researcher profile

Yu Zeng

Yu Zeng contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
11works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

11 published item(s)

preprint2026arXiv

Flow-OPD: On-Policy Distillation for Flow Matching Models

Existing Flow Matching (FM) text-to-image models suffer from two critical bottlenecks under multi-task alignment: the reward sparsity induced by scalar-valued rewards, and the gradient interference arising from jointly optimizing heterogeneous objectives, which together give rise to a 'seesaw effect' of competing metrics and pervasive reward hacking. Inspired by the success of On-Policy Distillation (OPD) in the large language model community, we propose Flow-OPD, the first unified post-training framework that integrates on-policy distillation into Flow Matching models. Flow-OPD adopts a two-stage alignment strategy: it first cultivates domain-specialized teacher models via single-reward GRPO fine-tuning, allowing each expert to reach its performance ceiling in isolation; it then establishes a robust initial policy through a Flow-based Cold-Start scheme and seamlessly consolidates heterogeneous expertise into a single student via a three-step orchestration of on-policy sampling, task-routing labeling, and dense trajectory-level supervision. We further introduce Manifold Anchor Regularization (MAR), which leverages a task-agnostic teacher to provide full-data supervision that anchors generation to a high-quality manifold, effectively mitigating the aesthetic degradation commonly observed in purely RL-driven alignment. Built upon Stable Diffusion 3.5 Medium, Flow-OPD raises the GenEval score from 63 to 92 and the OCR accuracy from 59 to 94, yielding an overall improvement of roughly 10 points over vanilla GRPO, while preserving image fidelity and human-preference alignment and exhibiting an emergent 'teacher-surpassing' effect. These results establish Flow-OPD as a scalable alignment paradigm for building generalist text-to-image models. The codes and weights will be released in: https://github.com/CostaliyA/Flow-OPD .

preprint2022arXiv

Shape-guided Object Inpainting

Previous works on image inpainting mainly focus on inpainting background or partially missing objects, while the problem of inpainting an entire missing object remains unexplored. This work studies a new image inpainting task, i.e. shape-guided object inpainting. Given an incomplete input image, the goal is to fill in the hole by generating an object based on the context and implicit guidance given by the hole shape. Since previous methods for image inpainting are mainly designed for background inpainting, they are not suitable for this task. Therefore, we propose a new data preparation method and a novel Contextual Object Generator (CogNet) for the object inpainting task. On the data side, we incorporate object priors into training data by using object instances as holes. The CogNet has a two-stream architecture that combines the standard bottom-up image completion process with a top-down object generation process. A predictive class embedding module bridges the two streams by predicting the class of the missing object from the bottom-up features, from which a semantic object map is derived as the input of the top-down stream. Experiments demonstrate that the proposed method can generate realistic objects that fit the context in terms of both visual appearance and semantic meanings. Code can be found at the project page \url{https://zengxianyu.github.io/objpaint}

preprint2021arXiv

Multipartite entanglement of the topologically ordered state in a perturbed toric code

We demonstrate that multipartite entanglement, witnessed by the quantum Fisher information (QFI), can characterize topological quantum phase transitions in the spin-$\frac{1}{2}$ toric code model on a square lattice with external fields. We show that the QFI density of the ground state can be written in terms of the expectation values of gauge-invariant Wilson loops for different sizes of square regions and identify $\mathbb{Z}_2$ topological order by its scaling behavior. Furthermore, we use this multipartite entanglement witness to investigate thermalization and disorder-assisted stabilization of topological order after a quantum quench. Moreover, with an upper bound of the QFI, we demonstrate the absence of finite-temperature topological order in the 2D toric code model in the thermodynamic limit. Our results provide insights to topological phases, which are robust against external disturbances, and are candidates for topologically protected quantum computation.

preprint2020arXiv

Delay Adaptation Method for Relay Assisted Optical Wireless Systems

In this paper, we investigate optical wireless repeaters as relay terminals between a transmitter and a user in an Infrared Optical Wireless Communication (IROWC) system. A delay adaptation method is introduced to solve the problem of irregular signal arrival time from different relay terminals. Three different relay terminal deployment scenarios were investigated in a typical two-phase relay IROWC system with the proposed delay adaptation method. The simulation results indicate that the proposed system has better impulse response compared to the conventional system and that the root-mean-square delay spread of the relay system with the delay adaptation method is on average 30% less than the conventional system.

preprint2020arXiv

Diagonal entropy in many-body systems: Volume effect and quantum phase transitions

We investigate the diagonal entropy(DE) of the ground state for quantum many-body systems, including the XY model and the Ising model with next nearest neighbour interactions. We focus on the DE of a subsystem of L continuous spins. We show that the DE in many-body systems, regardless of integrability, can be represented as a volume term plus a logarithmic correction and a constant offset. Quantum phase transition points can be explicitly identified by the three coefficients thereof. Besides, by combining entanglement entropy and the relative entropy of quantum coherence, as two celebrated representatives of quantumness, we simply obtain the DE, which naturally has the potential to reveal the information of quantumness. More importantly, the DE is concerning only the diagonal form of the ground state reduced density matrix, making it feasible to measure in real experiments, and therefore it has immediate applications in demonstrating quantum supremacy on state-of-the-art quantum simulators.

preprint2020arXiv

High-Resolution Image Inpainting with Iterative Confidence Feedback and Guided Upsampling

Existing image inpainting methods often produce artifacts when dealing with large holes in real applications. To address this challenge, we propose an iterative inpainting method with a feedback mechanism. Specifically, we introduce a deep generative model which not only outputs an inpainting result but also a corresponding confidence map. Using this map as feedback, it progressively fills the hole by trusting only high-confidence pixels inside the hole at each iteration and focuses on the remaining pixels in the next iteration. As it reuses partial predictions from the previous iterations as known pixels, this process gradually improves the result. In addition, we propose a guided upsampling network to enable generation of high-resolution inpainting results. We achieve this by extending the Contextual Attention module to borrow high-resolution feature patches in the input image. Furthermore, to mimic real object removal scenarios, we collect a large object mask dataset and synthesize more realistic training data that better simulates user inputs. Experiments show that our method significantly outperforms existing methods in both quantitative and qualitative evaluations. More results and Web APP are available at https://zengxianyu.github.io/iic.