Source author record

Xu Zheng

Xu Zheng appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence physics.app-ph cond-mat.soft cond-mat.stat-mech Machine Learning Networking and Internet Architecture physics.optics Software Engineering

Catalog footprint

What is connected

8works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses

Large language models (LLMs) make reward design in reinforcement learning substantially more scalable, but generated rewards are not automatically reliable training objectives. Existing work has focused primarily on generating, evolving, or selecting reward candidates, while paying less attention to when such candidates can be verified and deployed during policy optimization. We study this deployment-time problem by treating generated rewards as reward hypotheses whose utility depends on the competence of the current policy and the phase of training. We propose \textsc{RHyVE}, a competence-aware verification and phase-aware deployment protocol that compares small sets of reward hypotheses from shared policy checkpoints using short-horizon fork verification. Our experiments show that reward rankings are unreliable at low competence but become informative after task-dependent thresholds. On a sparse manipulation task, phase-aware deployment improves peak and retained performance under a locked protocol. Updated LLM-generated reward-candidate experiments show candidate-family-dependent behavior: generated pools can exhibit phase-dependent winner changes, but no fixed warm-up schedule is universally optimal. Held-out schedule selection, conservative selector baselines, compute-matched controls, and scale controls further show that \textsc{RHyVE} is best understood as a verification-informed deployment protocol rather than a universal scheduler. Dense and all-failure boundary experiments delimit the scope of the method. Together, these results suggest that reward generation and reward deployment should be studied as coupled problems: generated rewards must be verified and deployed under changing policy competence.

preprint2022arXiv

Synthesising Electronic Health Records: Cystic Fibrosis Patient Group

Class imbalance can often degrade predictive performance of supervised learning algorithms. Balanced classes can be obtained by oversampling exact copies, with noise, or interpolation between nearest neighbours (as in traditional SMOTE methods). Oversampling tabular data using augmentation, as is typical in computer vision tasks, can be achieved with deep generative models. Deep generative models are effective data synthesisers due to their ability to capture complex underlying distributions. Synthetic data in healthcare can enhance interoperability between healthcare providers by ensuring patient privacy. Equipped with large synthetic datasets which do well to represent small patient groups, machine learning in healthcare can address the current challenges of bias and generalisability. This paper evaluates synthetic data generators ability to synthesise patient electronic health records. We test the utility of synthetic data for patient outcome classification, observing increased predictive performance when augmenting imbalanced datasets with synthetic data.

preprint2021arXiv

Reciprocity of thermal diffusion in time-modulated systems

The reciprocity principle governs the symmetry in transmission of electromagnetic and acoustic waves, as well as the diffusion of heat between two points in space, with important consequences for thermal management and energy harvesting. There has been significant recent interest in materials with time-modulated properties, which have been shown to efficiently break reciprocity for light, sound, and even charge diffusion. Quite surprisingly, here we show that, from a practical point of view, time modulation cannot generally be used to break reciprocity for thermal diffusion. We establish a theoretical framework to accurately describe the behavior of diffusive processes under time modulation, and prove that thermal reciprocity in dynamic materials is generally preserved by the continuity equation, unless some external bias or special material is considered. We then experimentally demonstrate reciprocal heat transfer in a time-modulated device. Our findings correct previous misconceptions regarding reciprocity breaking for thermal diffusion, revealing the generality of symmetry constraints in heat transfer, and clarifying its differences from other transport processes in what concerns the principles of reciprocity and microscopic reversibility.

preprint2020arXiv

A Systematic Literature Review of Modern Software Visualization

We report on the state-of-the-art of software visualization. To ensure reproducibility, we adopted the Systematic Literature Review methodology. That is, we analyzed 1440 entries from IEEE Xplore and ACM Digital Library databases. We selected 105 relevant full papers published in 2013-2019, which we classified based on the aspect of the software system that is supported (i.e., structure, behavior, and evolution). For each paper, we extracted main dimensions that characterize software visualizations, such as software engineering tasks, roles of users, information visualization techniques, and media used to display visualizations. We provide researchers in the field an overview of the state-of-the-art in software visualization and highlight research opportunities. We also help developers to identify suitable visualizations for their particular context by matching software visualizations to development concerns and concrete details to obtain available visualization tools.

preprint2020arXiv

Effect of Interfacial Thermal Resistance in Thermal Cloak

When heat transfers through interface between two different materials, it will encounter an interfacial thermal resistance (ITR) that makes the temperature discontinuous. This effect has been totally neglected so far in the research of thermal cloak, in particular when the thermal cloak is built with multilayer structures. In this paper, we investigate the effect of ITR on the performance of the thermal cloak by using both analytical and numerical method. Our results show that the existence of ITR will distort the external field, thus destroy the cloak. Moreover, we found that the effect of ITR can be quantified by a parameter called characteristic length.

preprint2016arXiv

Technique Report: Scheduling Flows with Multiple Service Frequency Constraints

With the fast development of wireless technologies, wireless applications have invaded various areas in people's lives with a wide range of capabilities. Guaranteeing Quality-of-Service (QoS) is the key to the success of those applications. One of the QoS requirements, service frequency, is very important for tasks including multimedia transmission in the Internet of Things. A service frequency constraint denotes the length of time period during which a link can transmit at least once. Unfortunately, it has not been well addressed yet. Therefore, this paper proposes a new framework to schedule multi transmitting flows in wireless networks considering service frequency constraint for each link. In our model, the constraints for flows are heterogeneous due to the diversity of users' behaviors. We first introduce a new definition for network stability with service frequency constraints and demonstrate that the novel scheduling policy is throughput-optimal in one fundamental category of network models. After that, we discuss the performance of a wireless network with service frequency constraints from the views of capacity region and total queue length. Finally, a series of evaluations indicate the proposed scheduling policy can guarantee service frequency and achieve a good performance on the aspect of queue length of each flow.

preprint2015arXiv

Evidence of near-infrared partial photonic bandgap in polymeric rod-connected diamond structures

We present the simulation, fabrication, and optical characterization of low-index polymeric rod-connected diamond (RCD) structures. Such complex three-dimensional photonic crystal structures are created via direct laser writing by two-photon polymerization. To our knowledge, this is the first measurement at near-infrared wavelengths, showing partial photonic bandgaps for this structure. We characterize structures in transmission and reflection using angular resolved Fourier image spectroscopy to visualize the band structure. Comparison of the numerical simulations of such structures with the experimentally measured data show good agreement for both P- and S-polarizations.

preprint2013arXiv

Non-Gaussian statistics for the motion of self-propelled Janus particles: experiment versus theory

Spherical Janus particles are one of the most prominent examples for active Brownian objects. Here, we study the diffusiophoretic motion of such microswimmers in experiment and in theory. Three stages are found: simple Brownian motion at short times, super-diffusion at intermediate times, and finally diffusive behavior again at long times. These three regimes observed in the experiments are compared with a theoretical model for the Langevin dynamics of self-propelled particles with coupled translational and rotational motion. Besides the mean square displacement also higher displacement moments are addressed. In particular, theoretical predictions regarding the non-Gaussian behavior of self-propelled particles are verified in the experiments. Furthermore, the full displacement probability distribution is analyzed, where in agreement with Brownian dynamics simulations either an extremely broadened peak or a pronounced double-peak structure is found depending on the experimental conditions.