Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
17works
0followers
15topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

17 published item(s)

preprint2026arXiv

Intelligent Elastic Feature Fading: Enabling Model Retrain-Free Feature Efficiency Rollouts at Scale

Large-scale ranking systems depend on thousands of features derived from user behavior across multiple time horizons. Typically requires model retraining -- resulting in long iteration cycles (3--6 months), substantial GPU resource consumption, and limited rollout throughput. We introduce Intelligent Elastic Feature Fading (IEFF), a production infrastructure system that enables retrain-free feature efficiency rollouts by elastically controlling feature coverage and distribution at serving time. IEFF supports incremental feature coverage adjustments while models adapt through recurring training, eliminating dependencies on explicit retraining cycles. The system incorporates strict safety guardrails, reversibility mechanisms, and comprehensive monitoring to ensure stability at scale. Across multiple production use cases, IEFF accelerates efficiency-related rollouts by 5$\times$, eliminates retraining-related GPU overhead, and enables faster capacity recycling. Extensive offline and online experiments demonstrate that gradual feature fading prevents 50--55\% of online performance degradation compared to abrupt feature removal, while maintaining stable model behavior. These results establish elastic, system-level feature fading as a practical and scalable approach for managing feature efficiency in modern industrial ranking systems.

preprint2026arXiv

Learning to Dock: Geometric Deep Learning for Predicting Supramolecular Host-Guest Complexes

Predicting non-covalent host-guest recognition remains challenging due to the complex interplay of electrostatics, dispersion, and steric effects, and the limited transferability of existing docking approaches to synthetic supramolecular systems. Here we present DeepHostGuest, a geometric deep-learning framework that learns generalizable recognition principles directly from experimentally resolved host-guest structures. Hosts are encoded as electrostatic surfaces and guests as molecular graphs, enabling transferable learning across diverse supramolecular systems. DeepHostGuest achieves high-accuracy predictions (RMSD $\leq 2$ Angstrom for 80.8% of test cases), substantially outperforming classical docking without case-specific tuning. Notably, the method generalizes beyond its training domain to crystalline sponge systems, accurately capturing the binding of large amphiphilic molecules within metal-organic cages. Beyond predicting binding conformations, the structures generated by DeepHostGuest serve as a reliable basis for accurate binding free-energy calculations. Density Functional Theory (DFT)-calculated affinities correlate well with experiment, enabling structure-property relationships across 876 host-guest complexes spanning 34 host families. Interpretable feature analysis reveals that binding strength arises from a cooperative interplay of host polarity, guest hydrophobicity, and geometric complementarity, with distinct design regimes across supramolecular classes. Together, these results establish data-driven molecular recognition as a practical route to predictive supramolecular design, enabling high-throughput virtual screening and rational optimization of functional host-guest systems.

preprint2026arXiv

Multiple Consistent 2D-3D Mappings for Robust Zero-Shot 3D Visual Grounding

Zero-shot 3D Visual Grounding (3DVG) is a critical capability for open-world embodied AI. However, existing methods are fundamentally bottlenecked by the poor quality of open-vocabulary 3D proposals, suffering from inaccurate categories and imprecise geometries, as well as the spatial redundancy of exhaustive multi-view reasoning. To address these challenges, we propose MCM-VG, a novel framework that achieves robust zero-shot 3DVG by explicitly establishing Multiple Consistent 2D-3D Mappings. Instead of passively relying on noisy 3D segments, MCM-VG enforces 2D-3D consistency across three fundamental dimensions to achieve precise target localization and reliable reasoning. First, a Semantic Alignment module corrects category mismatches via LLM-driven query parsing and coarse-to-fine 2D-3D matching. Second, an Instance Rectification module leverages VLM-guided 2D segmentations to reconstruct missing targets, back-projecting these reliable visual priors to establish accurate 3D geometries. Finally, to eliminate spatial redundancy, a Viewpoint Distillation module clusters 3D camera directions to extract optimal frames. By pairing these optimal RGB frames with Bird's Eye View maps into concise visual prompt sets, we formulate the final target disambiguation as a multiple-choice reasoning task for Vision-Language Models. Extensive evaluations on ScanRefer and Nr3D benchmarks demonstrate that MCM-VG sets a new state-of-the-art for zero-shot 3D visual grounding. Remarkably, it achieves 62.0\% and 53.6\% in Acc@0.25 and Acc@0.5 on ScanRefer, outperforming previous baselines by substantial margins of 6.4\% and 4.0\%.

preprint2026arXiv

RecruitScope: A Visual Analytics System for Multidimensional Recruitment Data Analysis

Online recruitment platforms have become the dominant channel for modern hiring, yet most platforms offer only basic filtering capabilities, such as job title, keyword, and salary range. This hinders comprehensive analysis of multi-attribute relationships and job market patterns across different scales. We present RecruitScope, a visual analytics system designed to support multidimensional and cross-level exploration of recruitment data for job seekers and employers, particularly HR specialists. Through coordinated visualizations, RecruitScope enables users to analyze job positions and salary patterns from multiple perspectives, interpret industry dynamics at the macro level, and identify emerging positions at the micro level. We demonstrate the effectiveness of RecruitScope through case studies that reveal regional salary distribution patterns, characterize industry growth trajectories, and discover high-demand emerging roles in the job market.

preprint2022arXiv

A new way to explore cosmological tensions using gravitational waves and strong gravitational lensing

In recent years, a crisis in the standard cosmology has been caused by inconsistencies in the measurements of some key cosmological parameters, Hubble constant $H_0$ and cosmic curvature parameter $Ω_K$ for example. It is necessary to remeasure them with the cosmological model-independent methods. In this paper, based on the distance sum rule, we present such a way to constrain $H_0$ and $Ω_K$ simultaneously in the late universe from strong gravitational lensing time delay (SGLTD) data and gravitational wave (GW) standard siren data simulated from the future observation of the Einstein Telescope (ET). Based on the currently 6 observed SGLTD data, we find that the constraint precision of $H_0$ from the combined 100 GW events can be comparable with the measurement from SH0ES collaboration. As the number of GW events increases to 700, the constraint precision of $H_0$ will exceed that of the \textit{Planck} 2018 results. Considering 1000 GW events as the conservative estimation of ET in ten-year observation, we obtain $H_0=73.69\pm 0.36 \mathrm{~km~s^{-1}~Mpc^{-1}}$ with a 0.5\% uncertainty and $Ω_K=0.076^{+0.068}_{-0.087}$. In addition, we simulate 55 SGL systems with 6.6\% uncertainty for the measurement of time-delay distance. By combining with 1000 GWs, we infer that $H_0=73.65\pm0.35 \mathrm{~km~s^{-1}~Mpc^{-1}}$ and $Ω_K=0.008\pm0.048$. Our results suggest that this approach can play an important role in exploring cosmological tensions.

preprint2022arXiv

Constrained Reinforcement Learning for Short Video Recommendation

The wide popularity of short videos on social media poses new opportunities and challenges to optimize recommender systems on the video-sharing platforms. Users provide complex and multi-faceted responses towards recommendations, including watch time and various types of interactions with videos. As a result, established recommendation algorithms that concern a single objective are not adequate to meet this new demand of optimizing comprehensive user experiences. In this paper, we formulate the problem of short video recommendation as a constrained Markov Decision Process (MDP), where platforms want to optimize the main goal of user watch time in long term, with the constraint of accommodating the auxiliary responses of user interactions such as sharing/downloading videos. To solve the constrained MDP, we propose a two-stage reinforcement learning approach based on actor-critic framework. At stage one, we learn individual policies to optimize each auxiliary response. At stage two, we learn a policy to (i) optimize the main response and (ii) stay close to policies learned at the first stage, which effectively guarantees the performance of this main policy on the auxiliaries. Through extensive simulations, we demonstrate effectiveness of our approach over alternatives in both optimizing the main goal as well as balancing the others. We further show the advantage of our approach in live experiments of short video recommendations, where it significantly outperforms other baselines in terms of watch time and interactions from video views. Our approach has been fully launched in the production system to optimize user experiences on the platform.

preprint2022arXiv

Detecting and Monitoring Tidal Dissipation of Hot Jupiters in the Era of SiTian

Transit Timing Variation (TTV) of hot Jupiters provides direct observational evidence of planet tidal dissipation. Detecting tidal dissipation through TTV needs high precision transit timings and long timing baselines. In this work, we predict and discuss the potential scientific contribution of SiTian Survey in detecting and analyzing exoplanet TTV. We develop a tidal dissipation detection pipeline for SiTian Survey that aims at time-domain astronomy with 72 1-meter optical telescopes. The pipeline includes the modules of light curve deblending, transit timing obtaining, and TTV modeling. SiTian is capable to detect more than 25,000 exoplanets among which we expect $\sim$50 sources showing evidence of tidal dissipation. We present detection and analysis of tidal dissipating targets, based on simulated SiTian light curves of XO-3b and WASP-161b. The transit light curve modeling gives consistent results within 1$σ$ to input values of simulated light curves. Also, the parameter uncertainties predicted by Monte-Carlo Markov Chain are consistent with the distribution obtained from simulating and modeling the light curve 1000 times. The timing precision of SiTian observations is $\sim$ 0.5 minutes with one transit visit. We show that differences between TTV origins, e.g., tidal dissipation, apsidal precession, multiple planets, would be significant, considering the timing precision and baseline. The detection rate of tidal dissipating hot Jupiters would answer a crucial question of whether the planet migrates at an early formation stage or random stages due to perturbations, e.g., planet scattering, secular interaction. SiTian identified targets would be constructive given that the sample would extend tenfold.

preprint2022arXiv

Exploring the Impact of Negative Samples of Contrastive Learning: A Case Study of Sentence Embedding

Contrastive learning is emerging as a powerful technique for extracting knowledge from unlabeled data. This technique requires a balanced mixture of two ingredients: positive (similar) and negative (dissimilar) samples. This is typically achieved by maintaining a queue of negative samples during training. Prior works in the area typically uses a fixed-length negative sample queue, but how the negative sample size affects the model performance remains unclear. The opaque impact of the number of negative samples on performance when employing contrastive learning aroused our in-depth exploration. This paper presents a momentum contrastive learning model with negative sample queue for sentence embedding, namely MoCoSE. We add the prediction layer to the online branch to make the model asymmetric and together with EMA update mechanism of the target branch to prevent the model from collapsing. We define a maximum traceable distance metric, through which we learn to what extent the text contrastive learning benefits from the historical information of negative samples. Our experiments find that the best results are obtained when the maximum traceable distance is at a certain range, demonstrating that there is an optimal range of historical information for a negative sample queue. We evaluate the proposed unsupervised MoCoSE on the semantic text similarity (STS) task and obtain an average Spearman's correlation of $77.27\%$. Source code is available at https://github.com/xbdxwyh/mocose.

preprint2022arXiv

Finding Quasars behind the Galactic Plane. II. Spectroscopic Identifications of 204 Quasars at $|b|< 20°$

Quasars behind the Galactic plane (GPQs) are important astrometric references and valuable probes of Galactic gas, yet the search for GPQs is difficult due to severe extinction and source crowding in the Galactic plane. In this paper, we present a sample of 204 spectroscopically confirmed GPQs at |b|<20°, 191 of which are new discoveries. This GPQ sample covers a wide redshift range from 0.069 to 4.487. For the subset of 230 observed GPQ candidates, the lower limit of the purity of quasars is 85.2%, and the lower limit of the fraction of stellar contaminants is 6.1%. Using a multicomponent spectral fitting, we measure the emission line and continuum flux of the GPQs, and estimate their single-epoch virial black hole masses. Due to selection effects raised from Galactic extinction and target magnitude, these GPQs have higher black hole masses and continuum luminosities in comparison to the SDSS DR7 quasar sample. The spectral-fitting results and black hole mass estimates are compiled into a main spectral catalog, and an extended spectral catalog of GPQs. The successful identifications prove the reliability of both our GPQ selection methods and the GPQ candidate catalog, shedding light on the astrometric and astrophysical programs that make use of a large sample of GPQs in the future.

preprint2022arXiv

Investigating the dynamical models of cosmology with recent observations and upcoming gravitational-wave data

We explore and compare the capabilities of the recent observations of standard cosmological probes and the future observations of gravitational-wave (GW) standard sirens on constraining cosmological parameters. It is carried out in the frameworks of two typical dynamical models of cosmology, i.e., the $ω_0ω_a$CDM model with $ω(z) = ω_0 +ω_a*z/(1+z)$, and the $ξ$-index model with $ρ_X\proptoρ_ma^ξ$, where $ω(z)$ is the dark energy equation of state, and $ρ_X$ and $ρ_m$ are the energy densities of dark energy and matter, respectively. In the cosmological analysis, the employed data sets include the recent observations of the standard cosmological probes, i.e., Type Ia supernovae (SNe Ia), baryon acoustic oscillation (BAO) and cosmic microwave background (CMB), and also the mock GW standard siren sample with 1000 merging neutron star events anticipated from the third-generation detectors. In the scenarios of both $ω_0ω_a$CDM and $ξ$-index models, it turns out that the mock GW sample can reduce the uncertainty of the Hubble constant $H_0$ by about 50\% relative to that from the joint SNe+BAO+CMB sample; nevertheless, the SNe+BAO+CMB sample demonstrates better performance on limiting other parameters. Furthermore, the Bayesian evidence is applied to compare the dynamical models with the $Λ$CDM model. The Bayesian evidences computed from the SNe+BAO+CMB sample reveal that the $Λ$CDM model is the most supported one; moreover, the $ω_0ω_a$CDM model is more competitive than the $ξ$-index model.

preprint2022arXiv

Revisiting Chaplygin gas cosmologies with the recent observations of high-redshfit quasars

In this paper, we use the latest observations of quasars covering the redshift range of $0.04<z<5.1$ to investigate a series of Chaplygin gas models as candidates for unified dark matter and dark energy. Based on different combinations of available standard candle and standard ruler data, we put constraints on the generalized Chaplygin gas (GCG), modified Chaplygin gas (MCG), new generalized Chaplygin gas (NGCG) and viscous generalized Chaplygin gas (VGCG) models. Moreover, we apply Jensen-Shannon divergence (JSD), statefinder diagnostics, and the deviance information criterion (DIC) to distinguish these CG models, based on the statistical results derived from Markov chain Monte Carlo method. The results show that (1) The standard ruler data could provide more stringent constraints on the cosmological parameters of different CG models considered in this analysis. Interestingly, the matter density parameter $Ω_{m}$ and Hubble constant $H_{0}$ derived from the available data are well consistent with those from the Planck 2018 results; (2) Based on the statistical criteria JSD, our findings demonstrate the well consistency between Chaplygin gas and the concordance $Λ$CDM model. However, in the framework of statefinder diagnostics, the GCG and NGCG models cannot be distinguished from $Λ$CDM, while MCG and VGCG models show significant deviation from $Λ$CDM in the present epoch; (3) According to the the statistical criteria DIC, we show that the MCG and VGCG models have substantial observational support from high-redshfit quasars, whereas the GCG and NGCG models miss out on the less observational support category but can not be ruled out.

preprint2020arXiv

Granular Segregation Mechanisms by Cyclic Shear

We present an X-ray tomography study of the segregation mechanisms of tracer particles in a three-dimensional cyclically sheared bi-disperse granular medium. Big tracers are dragged by convection to rise to the top surface and then remain trapped there due to the small downward convection cross-section, which leads to segregation. Additionally, we also find that the local structural up-down asymmetry due to arching effect around big tracers will induce the tracers to have a net upward displacement against its smaller neighbors, which is another mechanism for segregation.

preprint2020arXiv

Level statistics and Anderson delocalization in two-dimensional granular materials

Contrary to the theoretical predictions that all waves in two-dimensional disordered materials are localized, Anderson localization is observed only for sufficiently high frequencies in an isotropically jammed two-dimensional disordered granular packing of photoelastic disks. More specifically, we have performed an experiment in analyzing the level statistics of normal mode vibrations. We observe delocalized modes in the low-frequency boson-peak regime and localized modes in the high frequency regime with the crossover frequency just below the Debye frequency. We find that the level-distance distribution obeys Gaussian-Orthogonal-Ensemble (GOE) statistics, i.e. Wigner-Dyson distribution, in the boson-peak regime, whereas those in the high-frequency regime Poisson statistics is observed. The scenario is found to coincide with that of harmonic vibrational excitations in three-dimensional disordered solids.

preprint2020arXiv

Smart, Adaptive Energy Optimization for Mobile Web Interactions

Web technology underpins many interactive mobile applications. However, energy-efficient mobile web interactions is an outstanding challenge. Given the increasing diversity and complexity of mobile hardware, any practical optimization scheme must work for a wide range of users, mobile platforms and web workloads. This paper presents CAMEL , a novel energy optimization system for mobile web interactions. CAMEL leverages machine learning techniques to develop a smart, adaptive scheme to judiciously trade performance for reduced power consumption. Unlike prior work, C AMEL directly models how a given web content affects the user expectation and uses this to guide energy optimization. It goes further by employing transfer learning and conformal predictions to tune a previously learned model in the end-user environment and improve it over time. We apply CAMEL to Chromium and evaluate it on four distinct mobile systems involving 1,000 testing webpages and 30 users. Compared to four state-of-the-art web-event optimizers, CAMEL delivers 22% more energy savings, but with 49% fewer violations on the quality of user experience, and exhibits orders of magnitudes less overhead when targeting a new computing environment.

preprint2020arXiv

The backbone-residual model. Accurately characterising the instrumental profile of a fibre-fed echelle spectrograph

Context: Instrumental profile (IP) is the basic property of a spectrograph. Accurate IP characterisation is the prerequisite of accurate wavelength solution. It also facilitates new spectral acquisition methods such as the forward modeling and deconvolution. Aims: We investigate an IP modeling method for the fibre-fed echelle spectrograph with the emission lines of the ThAr lamp, and explore the method to evaluate the accuracy of IP characterisation. Methods: The backbone-residual (BR) model is put forward and tested on the fibre-fed High Resolution Spectrograph (HRS) at the Chinese Xinglong 2.16-m Telescope, which is the sum of the backbone function and the residual function. The backbone function is a bell-shaped function to describe the main component and the spatial variation of IP. The residual function, which is expressed as the cubic spline function, accounts for the difference between the bell-shaped function and the actual IP. The method of evaluating the accuracy of IP characterisation is based on the spectral reconstruction and Monte Carlo simulation. Results: The IP of HRS is characterised with the BR model, and the accuracy of the characterised IP reaches 0.006 of the peak value of the backbone function. This result demonstrates that the accurate IP characterisation has been achieved on HRS with the BR model, and the BR model is an excellent choice for accurate IP characterisation of fibre-fed echelle spectrographs.

preprint2020arXiv

X-ray tomography investigation of cyclically sheared granular materials

We perform combined X-ray tomography and shear force measurements on a cyclically sheared granular system with highly transient behaviors, and obtain the evolution of microscopic structures and the macroscopic shear force during the shear cycle. We explain the macroscopic behaviors of the system based on microscopic processes, including the particle level structural rearrangement and frictional contact variation. Specifically, we show how contact friction can induce large structural fluctuations and cause significant shear dilatancy effect for granular materials, and we also construct an empirical constitutive relationship for the macroscopic shear force.

preprint2019arXiv

Initial Sampling in Symmetrical Quasiclassical Dynamics Based on Li-Miller Mapping Hamiltonian

A symmetrical quasiclassical (SQC) dynamics approach based on the Li-Miller (LM) mapping Hamiltonian (SQC-LM) was employed to describe nonadiabatic dynamics. In principle, the different initial sampling procedures may be applied in the SQC-LM dynamics, and the results may be dependent on the initial sampling. We provided various initial sampling approaches and checked their influence. We selected two groups of models including site-exciton models for exciton dynamics and linear vibronic coupling models for conical intersections to test the performance of SQC-LM dynamics with the different initial sampling methods. The results were examined with respect to those of the accurate multilayer multiconfigurational time-dependent Hartree (ML-MCTDH) quantum dynamics. For both two models, the SQC-LM method more-or-less gives a reasonable description of the population dynamics, while the influence of the initial sampling approaches on the final results is noticeable. It seems that the proper initial sampling methods should be determined by the system under study. This indicates that the combination of the SQC-LM method with a suitable sampling approach may be a potential method in the description of nonadiabatic dynamics.