Researcher profile

Liang Zhu

Liang Zhu contributes to research discovery and scholarly infrastructure.

Researcher - Affiliation not imported - Open to collaborate

Trust snapshot

Trust 21 - Emerging
Verification L1
Unclaimed author
11 works
0 followers
12 topics
4 close collaborators

Published work

11 published items

Preprint, 2026, arXiv

Act-Adaptive Margin: Dynamically Calibrating Reward Models for Subjective Ambiguity

Currently, most reinforcement learning tasks focus on domains such as mathematics and programming, where verification is relatively straightforward. In subjective tasks such as role-playing, however, alignment techniques struggle to make progress, primarily because subjective reward modeling with the Bradley-Terry model faces significant challenges when preferences are ambiguous. To improve reward modeling in subjective tasks, this paper proposes AAM (Act-Adaptive Margin), which enhances reward modeling by dynamically calibrating preference margins using the model's internal parameter knowledge. We design two versions of AAM that efficiently generate contextually appropriate preference gaps without additional human annotation. This approach fundamentally improves how reward models handle subjective rewards by better integrating generative understanding with preference scoring. To validate AAM's effectiveness in subjective reward modeling, we conduct evaluations on RewardBench, JudgeBench, and challenging role-playing tasks. Results show that AAM significantly improves subjective reward modeling performance, enhancing Bradley-Terry reward models by 2.95% on general tasks and 4.85% on subjective role-playing tasks. Furthermore, reward models trained with AAM can help downstream alignment tasks achieve better results. Our test results show that applying rewards generated by an AAM-augmented RM to preference learning techniques (e.g., GRPO) achieves state-of-the-art results on CharacterEval and Charm. Code and dataset are available at https://github.com/calubkk/AAM.
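
As a rough illustration of what a preference margin does in a Bradley-Terry reward model, the sketch below computes the pairwise loss with an additive margin. It is a minimal sketch, not the AAM method: the abstract does not say how AAM derives its margins, so `margin` is simply an input here, and the function name `bt_margin_loss` is a hypothetical one chosen for this example.

```python
import math


def bt_margin_loss(reward_chosen: float, reward_rejected: float, margin: float = 0.0) -> float:
    """Negative log-likelihood of a Bradley-Terry preference with an additive margin.

    The loss is -log(sigmoid(reward_chosen - reward_rejected - margin)).
    How the margin is chosen is an assumption left to the caller.
    """
    z = reward_chosen - reward_rejected - margin
    # -log(sigmoid(z)) equals log(1 + exp(-z)); branch on the sign of z for numerical stability.
    if z >= 0:
        return math.log1p(math.exp(-z))
    return -z + math.log1p(math.exp(z))


if __name__ == "__main__":
    # A larger margin demands a larger reward gap before the loss becomes small.
    for m in (0.0, 0.5, 2.0):
        print(m, bt_margin_loss(2.0, 1.0, margin=m))
```

With a margin of zero this reduces to the standard Bradley-Terry loss; a larger margin penalizes pairs whose reward gap is small, which is why choosing a fixed margin is hard when the preference itself is ambiguous.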

Preprint, 2022, arXiv

Dative epitaxy of commensurate monocrystalline covalent-van der Waals moiré supercrystal

The realization of van der Waals (vdW) epitaxy in the 1980s was a breakthrough that circumvented the stringent lattice matching and processing compatibility requirements of conventional covalent heteroepitaxy. However, because vdW interactions are weak, the substrate offers little control over film quality. Typically, discrete domains with a spread of misorientation angles are formed, limiting the applicability of vdW epitaxy. Here we report the epitaxial growth of monocrystalline, covalent Cr5Te8 2D crystals on monolayer vdW WSe2 by chemical vapor deposition, driven by interfacial dative bond formation. The lattice of Cr5Te8, with a lateral dimension of a few tens of microns, is fully commensurate with that of WSe2 via 3 × 3 (Cr5Te8)-7 × 7 (WSe2) supercell matching, forming a single-crystalline moiré superlattice. Our work establishes a conceptually distinct paradigm of thin film epitaxy termed dative epitaxy, which takes full advantage of covalent epitaxy, with chemical bonding fixing the atomic registry and crystal orientation, while circumventing its stringent lattice matching and processing compatibility requirements; conversely, it retains the full flexibility of vdW epitaxy while avoiding its poor orientation control. Cr5Te8 2D crystals grown by dative epitaxy exhibit square magnetic hysteresis, suggesting minimized interfacial defects that can serve as pinning sites.

Preprint, 2022, arXiv

Joint 3-D Positioning and Power Allocation for UAV Relay Aided by Geographic Information

In this paper, we study the use of geographic information to address the blockage problem of air-to-ground links between a UAV and terrestrial nodes. In particular, a UAV relay is deployed to establish communication links from a ground base station to multiple ground users. To improve communication capacity, we first model the blockage effect caused by buildings according to three-dimensional (3-D) geographic information. Then, an optimization problem is formulated to maximize the minimum capacity among users by jointly optimizing the 3-D position and power allocation of the UAV relay, under constraints on link capacity, maximum transmit power, and blockage. To solve this complex non-convex problem, a two-loop optimization framework is developed based on Lagrangian relaxation. The outer loop aims to obtain proper Lagrangian multipliers so that the solution of the Lagrangian problem converges to the tightest upper bound on the original problem. The inner loop solves the Lagrangian problem by applying the block coordinate descent (BCD) and successive convex approximation (SCA) techniques, where UAV 3-D positioning and power allocation are alternately optimized in each iteration. Simulation results confirm that the proposed solution significantly outperforms two benchmark schemes and achieves performance close to the upper bound on the UAV relay system.
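
To make the max-min objective concrete, the sketch below solves a heavily simplified version of the power-allocation step: the UAV position is held fixed, each user has a known noise-normalized channel gain, and only a total transmit power budget is enforced. These simplifications, the capacity model log2(1 + p*g), and the function name `max_min_power_allocation` are assumptions made for illustration; the paper's two-loop Lagrangian framework with BCD and SCA, the base-station link, and the blockage constraints are not reproduced.

```python
import math


def max_min_power_allocation(gains, total_power):
    """Maximize the minimum of log2(1 + p_k * g_k) subject to sum(p_k) = total_power.

    Because each capacity increases with its own power, the optimum equalizes
    p_k * g_k across users, which gives a closed-form split of the budget.
    """
    inv_sum = sum(1.0 / g for g in gains)
    common_snr = total_power / inv_sum          # shared value of p_k * g_k
    powers = [common_snr / g for g in gains]    # weaker channels receive more power
    capacity = math.log2(1.0 + common_snr)      # capacity reached by every user
    return powers, capacity


if __name__ == "__main__":
    powers, capacity = max_min_power_allocation([1.0, 0.25, 0.5], total_power=7.0)
    print(powers, capacity)
```

In an alternating scheme of the kind the abstract describes, a step like this would be repeated after each position update, since moving the UAV changes the channel gains.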

Preprint, 2020, arXiv

An improved sample size calculation method for score tests in generalized linear models

Self and Mauritsen (1988) developed a sample size determination procedure for score tests in generalized linear models under contiguous alternatives. Its performance may deteriorate when the effect size is large. We propose a modification of the Self-Mauritsen method that takes into account the variance of the score statistic under both the null and alternative hypotheses, and extend the method to noninferiority trials. The modified approach is employed to calculate the sample size for logistic regression and negative binomial regression in superiority and noninferiority trials. We further explain why the formulae recently derived by Zhu and Lakkis tend to underestimate the required sample size for negative binomial regression. Numerical examples are used to demonstrate the accuracy of the proposed method.
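
The idea of using the variance under both hypotheses can be illustrated with the generic normal-approximation sample size formula n = (z_{1-alpha/2} * sqrt(V0) + z_{power} * sqrt(V1))^2 / delta^2. The sketch below implements only this textbook form; it is not the modified Self-Mauritsen procedure from the paper, and the function name and the per-observation variances `var_null` and `var_alt` are assumptions for illustration.

```python
import math
from statistics import NormalDist


def sample_size(delta, var_null, var_alt, alpha=0.05, power=0.9):
    """Two-sided sample size from a normal approximation.

    Assumes the test statistic has mean 0 and variance var_null / n under the
    null hypothesis, and mean delta and variance var_alt / n under the alternative.
    """
    z_alpha = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    z_power = NormalDist().inv_cdf(power)
    n = (z_alpha * math.sqrt(var_null) + z_power * math.sqrt(var_alt)) ** 2 / delta ** 2
    return math.ceil(n)


if __name__ == "__main__":
    # Treating the alternative variance as equal to the null variance can understate n
    # when the true alternative variance is larger.
    print(sample_size(0.5, var_null=1.0, var_alt=1.0))
    print(sample_size(0.5, var_null=1.0, var_alt=1.5))
```

When the two variances differ noticeably, which tends to happen for large effects, a formula that uses only one of them can give a misleading sample size.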

Preprint, 2020, arXiv

MOTS: Multiple Object Tracking for General Categories Based On Few-Shot Method

Most modern Multi-Object Tracking (MOT) systems apply a REID-based paradigm to balance computational efficiency and performance. In the past few years, numerous attempts have been made to refine these systems. Although they achieve favorable performance, they are constrained to tracking a specified category. Drawing on ideas from few-shot methods, we pioneer a new multi-target tracking system, named MOTS, which is metric-based but not limited to tracking a specific category. It contains two stages in series. In the first stage, we design the self-Adaptive-matching module to perform simple target matching, which can complete 88.76% of assignments without sacrificing performance on the MOT16 training set. In the second stage, a Fine-match Network is designed for the remaining unmatched targets. With a newly built TRACK-REID dataset, the Fine-match Network can match targets from 31 categories and even generalizes to unseen categories.

Preprint, 2020, arXiv

Tailored pore gradient in phenolic membranes for adjustable permselectivity by leveraging different poloxamers

Affordable phenolic membranes with gradient nanostructures can be readily synthesized from resol oligomers in the presence of ZnCl2 and poloxamers. The gradient nanostructures are formed by stacking phenolic nanoparticles whose diameters gradually increase with distance from the upper surface. The use of poloxamers to create the gelation environment is important for controlling the growth of phenolic nanoparticles, which in turn dictates the performance of the resulting phenolic membranes. A study of how the poloxamer species affects the preparation of phenolic membranes is therefore needed, since such robust membranes have considerable potential to be scaled up for mass production. Herein, the poloxamer Pluronic F127 (EO106-PO70-EO106; EO = ethylene oxide, PO = propylene oxide) was introduced into the membrane-forming formulations. In contrast to P123 (EO20-PO70-EO20), which we used previously, F127 has extended PEO chains and can delay gelation during membrane formation. Hence, the phenolic nuclei are able to grow for longer, leading to more distinct gradient nanostructures in the phenolic membranes. Enhanced permeance can then be realized with F127-derived phenolic membranes. We also demonstrate that L31 (EO1-PO22-EO1), which has only a single terminal EO unit at each end of the PPO block, can be used to prepare gradient phenolic membranes. This work not only deepens the understanding of how to design structural gradients in phenolic membranes, but also sheds light on the development of such structures for water purification.

Preprint, 2020, arXiv

Towards in-store multi-person tracking using head detection and track heatmaps

Computer vision algorithms are being implemented across a breadth of industries to enable technological innovations. In this paper, we study the problem of computer-vision-based customer tracking in the retail industry. To this end, we introduce a dataset collected from a camera in an office environment where participants mimic various behaviors of customers in a supermarket. In addition, we describe an illustrative example of using this dataset to track participants with a head tracking model, in an effort to minimize errors due to occlusion. Furthermore, we propose a model for distinguishing customers from staff based on their movement patterns. The model is evaluated on a real-world dataset collected in a supermarket over a 24-hour period, achieving 98% accuracy during training and 93% accuracy during evaluation.

Preprint, 2020, arXiv

Tunable THz generation and enhanced nonlinear effects with active and passive graphene hyperbolic metamaterials

The active and nonlinear properties of graphene are limited by the weak light-matter interaction between the ultrathin graphene and the incident light. In this work, we present enhanced nonlinear effects in the low terahertz (THz) range by designing a new patterned graphene hyperbolic metamaterial (GHMM). More specifically, we demonstrate that third harmonic generation (THG) can be significantly enhanced by the proposed GHMM, owing to the field enhancement at resonance as well as the supported slow-light response, which fosters strong light-matter interaction.

Preprint, 2016, arXiv

Multilayer network analysis of nuclear reactions

The nuclear reaction network is usually studied via precise calculation of sets of differential equations, and much research interest has focused on the characteristics of nuclides, such as half-life and size limit. In this paper, however, we adopt methods from both multilayer and reaction networks, and obtain a distinctive view by mapping all the nuclear reactions in the JINA REACLIB database into a directed network with 4 layers: neutron, proton, $^4$He and the remainder. The layer names correspond to reaction types, determined by the currency particles consumed. This combined approach reveals that, in the remainder layer, $β$-stability is highly correlated with node degree difference and overlapping coefficient. Moreover, when reaction rates are considered as node strength, we find that, at lower temperatures, nuclide half-life scales reciprocally with its out-strength. The connection between physical properties and topological characteristics may help to explore the boundary of the nuclide chart.
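
The mapping from reactions to a layered directed network can be sketched with a few plain dictionaries. The example below is illustrative only: it uses five well-known reactions rather than the JINA REACLIB database, the rate values are made-up numbers, and the helper names `build_layers`, `out_degree`, and `out_strength` are hypothetical.

```python
from collections import defaultdict

# (reactant nuclide, product nuclide, layer, rate); the rates are invented for illustration.
REACTIONS = [
    ("C12", "N13", "proton", 2.0),     # C12 captures a proton
    ("C13", "N14", "proton", 1.0),     # C13 captures a proton
    ("C12", "O16", "He4", 0.5),        # C12 captures an alpha particle
    ("Fe56", "Fe57", "neutron", 3.0),  # Fe56 captures a neutron
    ("N13", "C13", "remainder", 0.1),  # beta decay, no currency particle consumed
]


def build_layers(reactions):
    """Return layers[layer][source][target] = rate for a directed multilayer network."""
    layers = defaultdict(lambda: defaultdict(dict))
    for source, target, layer, rate in reactions:
        layers[layer][source][target] = rate
    return layers


def out_degree(layers, layer, node):
    """Number of outgoing edges of a node within one layer."""
    return len(layers[layer].get(node, {}))


def out_strength(layers, node):
    """Sum of outgoing reaction rates of a node over all layers."""
    return sum(sum(layers[name].get(node, {}).values()) for name in list(layers))


if __name__ == "__main__":
    layers = build_layers(REACTIONS)
    print(out_degree(layers, "proton", "C12"), out_strength(layers, "C12"))
```

Treating the summed outgoing rates as node strength matches the abstract's description of reaction rates as node strength, although the exact definition used in the paper may differ.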

Preprint, 2016, arXiv

Photoluminescence of InGaAs/GaAsBi/InGaAs type-II quantum well grown by gas source molecular beam epitaxy

InGaAs/GaAsBi/InGaAs quantum wells (QWs) were grown on GaAs substrates by gas source molecular beam epitaxy to realize a type-II band-edge line-up. Both type-I and type-II transitions were observed in the Bi-containing W QWs, and the photoluminescence intensity was enhanced in the sample with a high Bi content, mainly due to improved carrier confinement. A blue shift of the type-II transitions at high excitation power density was observed and ascribed to the band-bending effect. The transition energies calculated with an 8-band k·p model fit the experimental results well. The experimental and theoretical results show that the type-II QW design is a promising new candidate for realizing long-wavelength GaAs-based light-emitting devices near 1.3 μm.

Preprint, 2013, arXiv

Study on the Energy Dependence of the Radii of Jets by the HBT Correlation Method in e+e- collisions

The energy dependence of jet radii is studied in detail by the HBT correlation method, using the Monte Carlo simulation generator Jetset7.4 to produce 40,000,000 events of e$^+$e$^-$ collisions at $\sqrt s =30$, 50, 70, 91.2, 110, 130, 150 and 170 GeV. The radii of jets are measured using the HBT correlation method, which relies on the indistinguishability of identical final-state pions. It is found that the average radii of quark-jets and gluon-jets are independent of the c.m. energy of the e$^+$e$^-$ collisions. The average radius of quark-jets is clearly larger than that of gluon-jets. The energy-independent average radii of quark-jets and gluon-jets in e$^+$e$^-$ collisions are obtained at the end of parton evolution.
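
A source radius is commonly extracted from HBT measurements by fitting the two-pion correlation function with the Gaussian form C2(Q) = 1 + lambda * exp(-R^2 Q^2). The sketch below evaluates that form and inverts it for R. It assumes this common parametrization, which the abstract does not confirm as the one used in the paper, and the function names are hypothetical.

```python
import math

HBARC_GEV_FM = 0.1973269804  # hbar * c in GeV * fm, converts R * Q to a dimensionless number


def c2(q_gev, radius_fm, lam=1.0):
    """Gaussian two-particle correlation: 1 + lam * exp(-(R * Q / hbar c)^2)."""
    x = radius_fm * q_gev / HBARC_GEV_FM
    return 1.0 + lam * math.exp(-x * x)


def radius_from_c2(q_gev, c2_value, lam=1.0):
    """Invert the Gaussian form for the radius in fm (requires 1 < c2_value < 1 + lam)."""
    x_squared = -math.log((c2_value - 1.0) / lam)
    return math.sqrt(x_squared) * HBARC_GEV_FM / q_gev


if __name__ == "__main__":
    value = c2(0.2, 0.8)
    print(value, radius_from_c2(0.2, value))
```

In practice the radius comes from a fit over many Q bins rather than from a single point; the single-point inversion is shown only to make the relation between C2 and R explicit.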