Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
20works
0followers
21topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

20 published item(s)

preprint2026arXiv

An Embarrassingly Simple Graph Heuristic Reveals Shortcut-Solvable Benchmarks for Sequential Recommendation

Sequential recommendation has increasingly shifted toward generative recommenders that combine sequential patterns with semantic item information. Yet these methods are often evaluated on a small set of widely used benchmarks, raising a key question: do these benchmarks actually require the advanced modeling capabilities that modern generative recommenders claim to provide? We conduct a benchmark audit with an intentionally simple graph heuristic. Starting from only the last one or two interacted items, it retrieves candidates from a few-hop item-transition graph and ranks them by item-feature similarity. Despite using no sequence encoder, generative objective, or training, this heuristic matches or outperforms many modern baselines, with relative NDCG@10 improvements of 38.10% and 44.18% over the best competing baseline on Amazon Review Sports and CDs. We show that this behavior reflects shortcut solvability rather than an artifact of one heuristic. We identify three shortcut structures that can make next-item prediction easier than expected: low-branching local transitions, feature-smooth transitions, and limited dependence on long user histories. These shortcuts need not appear together; even one or two strong signals can make simple local retrieval highly competitive, while weakening them makes the benefits of more sophisticated models clearer. Across 14 datasets, model rankings vary substantially with dataset properties, yet the heuristic remains competitive on 10 of them. Our findings suggest that strong performance on standard benchmarks does not always demonstrate advanced sequential, semantic, or generative modeling ability. We call for more careful dataset selection and dataset-level diagnostic analysis when using benchmarks to support claims about new recommendation models.

preprint2026arXiv

FlexiVoice: Enabling Flexible Style Control in Zero-Shot TTS with Natural Language Instructions

This study proposes FlexiVoice, a text-to-speech (TTS) synthesis system capable of flexible style control with zero-shot voice cloning. The speaking style is controlled by a natural-language instruction and the voice timbre is provided by a speech reference in zero-shot manner. FlexiVoice is built with an LLM core, which takes text as input, and also takes an optional natural language instruction and an optional speech reference to control style and timbre, respectively. FlexiVoice is equipped with a novel Progressive Post-Training (PPT) scheme that progressively unlocks accurate and flexible controllability. In particular, it first employs Direct Preference Optimization (DPO) to enable FlexiVoice to accurately follow both natural language instruction and speech reference simultaneously. It then uses a multi-objective Group Relative Policy Optimization (GRPO) to disentangle style instruction, reference timbre, and textual content. Finally, it adapts instruction GRPO for more advanced instruction following. Experimental results show that FlexiVoice surpasses competing baselines and demonstrates strong capability in decoupling control factors. Human evaluations further confirm its naturalness, controllability, and robustness. Audio samples are available at https://flexi-voice.github.io.

preprint2026arXiv

Interference-governed electromagnetic-thermal coupling and heat transport in pulse EUV-irradiated multilayer nanofilms

Mo-Si multilayer mirrors are central to extreme ultraviolet lithography, where nanoscale optical interference and heat accumulation together constrain reflectivity and operational stability. Here we develop an analytical electromagnetic-thermal coupling model that directly links transfer-matrix-based interference-controlled energy deposition with transient heat conduction in EUV-irradiated multilayers. The model reveals a fundamental trade-off whereby increasing the multilayer period number enhances reflectivity but simultaneously elevates temperature by impeding heat dissipation. Interference-driven volumetric absorption further gives rise to pronounced axial temperature gradients and a post-pulse downward migration of the heat-flux maximum, a delayed-heating effect inaccessible to conventional surface-flux-based models. Systematic analysis establishes scaling laws connecting interfacial thermal resistance, beam size, and incident energy density to thermal confinement and temperature rise. By incorporating interfacial compaction kinetics, the model enables a quantitative assessment of mirror lifetime. This work offers a theoretical tool for thermal-optical co-design of multilayer nanostructures including EUV mirrors under pulsed irradiation across a wide spectral range.

preprint2026arXiv

PEAR: Planner-Executor Agent Robustness Benchmark

Large Language Model (LLM)-based Multi-Agent Systems (MAS) have emerged as a powerful paradigm for tackling complex, multi-step tasks across diverse domains. However, despite their impressive capabilities, MAS remain susceptible to adversarial manipulation. Existing studies typically examine isolated attack surfaces or specific scenarios, leaving a lack of holistic understanding of MAS vulnerabilities. To bridge this gap, we introduce PEAR, a benchmark for systematically evaluating both the utility and vulnerability of planner-executor MAS. While compatible with various MAS architectures, our benchmark focuses on the planner-executor structure, which is a practical and widely adopted design. Through extensive experiments, we find that (1) a weak planner degrades overall clean task performance more severely than a weak executor; (2) while a memory module is essential for the planner, having a memory module for the executor does not impact the clean task performance; (3) there exists a trade-off between task performance and robustness; and (4) attacks targeting the planner are particularly effective at misleading the system. These findings offer actionable insights for enhancing the robustness of MAS and lay the groundwork for principled defenses in multi-agent settings.

preprint2024arXiv

Formation of collisional ring galaxies in Milgromian dynamics

Ring galaxies are rare in the Universe. A head-on or off-centre collision between an intruder galaxy and a disc galaxy leads to a collisional ring galaxy (CRG) when the intruder-to-target mass ratio (ITMR) is over 0.1 in Newtonian dynamics. Within the framework of Milgromian dynamics, the strong equivalence principle is violated due to the presence of an external field. When two galaxies collide towards each other, the dynamical mass of the phantom dark halo in a galaxy is suppressed by the external field induced by the other galaxy. As a consequence of such suppression, the gravitational perturbation for the target galaxy introduced by a collision is weakened. In this case, a minor collision may not be capable of generating a CRG. In this work, we address this issue and perform a series of numerical simulations of collisions by tuning the values of ITMR. We find that the critical ITMR is 0.5 in MOND, which is much larger than that in Newtonian dynamics. The observed massive ring galaxies, such as Arp 147, can be effectively interpreted by CRGs in MOND. This interpretation does not necessitate the presence of dark matter halos for either the target or intruder galaxies. Moreover, for a larger inclination angle or a larger impact parameter, the off-centred ring structure is fainter. The larger critical ITMR indicates that it is harder to form a CRG in Milgrom's Modified Newtonian Dynamics (MOND). To account for the observed ring structures of the NGC 922-like galaxies in MOND, it is necessary to invoke other scenarios than a dry minor collision.

preprint2022arXiv

Deblur-NeRF: Neural Radiance Fields from Blurry Images

Neural Radiance Field (NeRF) has gained considerable attention recently for 3D scene reconstruction and novel view synthesis due to its remarkable synthesis quality. However, image blurriness caused by defocus or motion, which often occurs when capturing scenes in the wild, significantly degrades its reconstruction quality. To address this problem, We propose Deblur-NeRF, the first method that can recover a sharp NeRF from blurry input. We adopt an analysis-by-synthesis approach that reconstructs blurry views by simulating the blurring process, thus making NeRF robust to blurry inputs. The core of this simulation is a novel Deformable Sparse Kernel (DSK) module that models spatially-varying blur kernels by deforming a canonical sparse kernel at each spatial location. The ray origin of each kernel point is jointly optimized, inspired by the physical blurring process. This module is parameterized as an MLP that has the ability to be generalized to various blur types. Jointly optimizing the NeRF and the DSK module allows us to restore a sharp NeRF. We demonstrate that our method can be used on both camera motion blur and defocus blur: the two most common types of blur in real scenes. Evaluation results on both synthetic and real-world data show that our method outperforms several baselines. The synthetic and real datasets along with the source code is publicly available at https://limacv.github.io/deblurnerf/

preprint2022arXiv

High-throughput decoder of quasi-cyclic LDPC codes with limited precision for continuous-variable quantum key distribution systems

More than Mbps secret key rate was demonstrated for continuous-variable quantum key distribution (CV-QKD) systems, but real-time postprocessing is not allowed, which is restricted by the throughput of the error correction decoding in postprocessing. In this paper, a high-throughput FPGA-based quasi-cyclic LDPC decoder is proposed and implemented to support Mbps real-time secret key rate generation for CV-QKD for the first time. A residual bit error correction algorithm is used to solve the problem of high frame errors rate (FER) caused by the limited precision of the decoder. Specifically, real-time high-speed decoding for CV-QKD systems with typical code rates 0.2 and 0.1 is implemented on a commercial FPGA, and two throughputs of 360.92Mbps and 194.65Mbps are achieved, respectively, which can support 17.97 Mbps and 2.48 Mbps real-time generation of secret key rates under typical transmission distances of 25km and 50km, correspondingly. The proposed method paves the way for high-rate real-time CV-QKD deployment in secure metropolitan area network.

preprint2022arXiv

Microbiome subcommunity learning with logistic-tree normal latent Dirichlet allocation

Mixed-membership (MM) models such as Latent Dirichlet Allocation (LDA) have been applied to microbiome compositional data to identify latent subcommunities of microbial species. These subcommunities are informative for understanding the biological interplay of microbes and for predicting health outcomes. However, microbiome compositions typically display substantial cross-sample heterogeneities in subcommunity compositions -- that is, the variability in the proportions of microbes in shared subcommunities across samples -- which is not accounted for in prior analyses. As a result, LDA can produce inference which is highly sensitive to the specification of the number of subcommunities and often divides a single subcommunity into multiple artificial ones. To address this limitation, we incorporate the logistic-tree normal (LTN) model into LDA to form a new MM model. This model allows cross-sample variation in the composition of each subcommunity around some "centroid" composition that defines the subcommunity. Incorporation of auxiliary Pólya-Gamma variables enables a computationally efficient collapsed blocked Gibbs sampler to carry out Bayesian inference under this model. By accounting for such heterogeneity, our new model restores the robustness of the inference in the specification of the number of subcommunities and allows meaningful subcommunities to be identified.

preprint2022arXiv

Secure two-way fiber-optic time transfer against sub-ns asymmetric delay attack

Two-way fiber-optic time transfer is a promising precise time synchronization technique with sub-nanosecond accuracy. However, asymmetric delay attack is a serious threat which cannot be prevent by any encryption method. In this paper, a dynamic model based scheme is proposed to defense the sub-nanosecond asymmetric delay attack. A threshold is set according to the estimated time difference by a two-state clock model where the fixed frequency difference is excluded from the time difference to detect the asymmetric delay attack which is smaller than the time difference induced by the fixed frequency difference. Theoretical simulation and experimental demonstration are implemented to prove the feasibility of the scheme. A two-way fiber-optic time transfer system with time stability with 24.5ps, 3.98ps, and 2.95ps at 1s, 10s, and 100s averaging time is shown under sub-ns asymmetric time delay attack experimentally. The proposed method provides a promising secure sub-ns precise time synchronization technique against asymmetric delay attack.

preprint2022arXiv

TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing

As convolution has empowered many smart applications, dynamic convolution further equips it with the ability to adapt to diverse inputs. However, the static and dynamic convolutions are either layout-agnostic or computation-heavy, making it inappropriate for layout-specific applications, e.g., face recognition and medical image segmentation. We observe that these applications naturally exhibit the characteristics of large intra-image (spatial) variance and small cross-image variance. This observation motivates our efficient translation variant convolution (TVConv) for layout-aware visual processing. Technically, TVConv is composed of affinity maps and a weight-generating block. While affinity maps depict pixel-paired relationships gracefully, the weight-generating block can be explicitly overparameterized for better training while maintaining efficient inference. Although conceptually simple, TVConv significantly improves the efficiency of the convolution and can be readily plugged into various network architectures. Extensive experiments on face recognition show that TVConv reduces the computational cost by up to 3.1x and improves the corresponding throughput by 2.3x while maintaining a high accuracy compared to the depthwise convolution. Moreover, for the same computation cost, we boost the mean accuracy by up to 4.21%. We also conduct experiments on the optic disc/cup segmentation task and obtain better generalization performance, which helps mitigate the critical data scarcity issue. Code is available at https://github.com/JierunChen/TVConv.

preprint2021arXiv

Distillation based Multi-task Learning: A Candidate Generation Model for Improving Reading Duration

In feeds recommendation, the first step is candidate generation. Most of the candidate generation models are based on CTR estimation, which do not consider user's satisfaction with the clicked item. Items with low quality but attractive title (i.e., click baits) may be recommended to the user, which worsens the user experience. One solution to this problem is to model the click and the reading duration simultaneously under the multi-task learning (MTL) framework. There are two challenges in the modeling. The first one is how to deal with the zero duration of the negative samples, which does not necessarily indicate dislikes. The second one is how to perform multi-task learning in the candidate generation model with double tower structure that can only model one single task. In this paper, we propose an distillation based multi-task learning (DMTL) approach to tackle these two challenges. We model duration by considering its dependency of click in the MTL, and then transfer the knowledge learned from the MTL teacher model to the student candidate generation model by distillation. Experiments conducted on dataset gathered from traffic logs of Tencent Kandian's recommender system show that the proposed approach outperforms the competitors significantly in modeling duration, which demonstrates the effectiveness of the proposed candidate generation model.

preprint2021arXiv

Global Yamabe flow on asymptotically flat manifolds

In this paper, we study the existence of global Yamabe flow on asymptotically flat (in short, AF or ALE) manifolds. Note that the ADM mass is preserved in dimensions 3,4 and 5. We present a new general local existence of Yamabe flow on a complete Riemannian manifold with the initial metric quasi-isometric to a background metric of bounded scalar curvature. Asymptotic behaviour of the Yamabe flow on ALE manifolds is also addressed provided the initial scalar curvature is non-negative and there is a bounded subsolution to the corresponding Poisson equation. We also present a maximum principle for a very general parabolic equations on the complete Riemannian manifolds.

preprint2021arXiv

New Yamabe-type flow in a compact Riemannian manifold

In this paper, we set up a new Yamabe type flow on a compact Riemannian manifold $(M,g)$ of dimension $n\geq 3$. Let $ψ(x)$ be any smooth function on $M$. Let $p=\frac{n+2}{n-2}$ and $c_n=\frac{4(n-1)}{n-2}$. We study the Yamabe-type flow $u=u(t)$ satisfying {u_t}=u^{1-p}(c_nΔu-ψ(x)u)+r(t)u, \ \ in \ M\times (0,T),\ T>0 with r(t)=\int_M(c_n|\nabla u|^2+ψ(x)u^2)dv/ \int_Mu^{p+1}, which preserves the $L^{p+1}(M)$-norm and we can show that for any initial metric $u_0>0$, the flow exists globally. We also show that in some cases, the global solution converges to a smooth solution to the equation c_nΔu-ψ(x)u+r(\infty)u^{p}=0, \ \ on \ M and our result may be considered as a generalization of the result of T.Aubin, Proposition in p.131 in \cite{A82}.

preprint2020arXiv

Coarsened mixtures of hierarchical skew normal kernels for flow cytometry analyses

Flow cytometry (FCM) is the standard multi-parameter assay for measuring single cell phenotype and functionality. It is commonly used for quantifying the relative frequencies of cell subsets in blood and disaggregated tissues. A typical analysis of FCM data involves cell classification---that is, the identification of cell subgroups in the sample---and comparisons of the cell subgroups across samples or conditions. While modern experiments often necessitate the collection and processing of samples in multiple batches, analysis of FCM data across batches is challenging because differences across samples may occur due to either true biological variation or technical reasons such as antibody lot effects or instrument optics across batches. Thus a critical step in comparative analyses of multi-sample FCM data---yet missing in existing automated methods for analyzing such data---is cross-sample calibration, whose goal is to align corresponding cell subsets across multiple samples in the presence of technical variations, so that biological variations can be meaningfully compared. We introduce a Bayesian nonparametric hierarchical modeling approach for accomplishing both calibration and cell classification simultaneously in a unified probabilistic manner. Three important features of our method make it particularly effective for analyzing multi-sample FCM data: a nonparametric mixture avoids prespecifying the number of cell clusters; a hierarchical skew normal kernel that allows flexibility in the shapes of the cell subsets and cross-sample variation in their locations; and finally the "coarsening" strategy makes inference robust to departures from the model such as heavy-tailness not captured by the skew normal kernels. We demonstrate the merits of our approach in simulated examples and carry out a case study in the analysis of two multi-sample FCM data sets.

preprint2020arXiv

Constants and heat flow on graphs

In this article, we first introduce the concepts of vector fields and their divergence, and we recall the concepts of the gradient, Laplacian operator, Cheeger constants, eigenvalues, and heat kernels on a locally finite graph $V$. We give a projective characteristic of the eigenvalues. We also give an extension of Barta Theorem. Then we introduce the mini-max value of a function on a locally finite and locally connected graph. We show that for a coercive function on on a locally finite and locally connected graph, there is a mini-max value of the function provided it has two strict local minima values. We consider the discrete Morse flow for the heat flow on a finite graph in the locally finite graph $V$. We show that under suitable assumptions on the graph one has a weak discrete Morse flow for the heat flow on $S$ on any time interval. We also study the heat flow with time-variable potential and its discrete Morse flow. We propose the concepts of harmonic maps from a graph to a Riemannian manifold and pose some open questions.

preprint2020arXiv

High-throughput GPU layered decoder of multi-edge type low density parity check codes in continuous-variable quantum key distribution systems

The decoding throughput in the postprocessing is one of the bottlenecks for a continuous-variable quantum key distribution (CV-QKD) system. In this paper, we propose a layered decoder to decode quasi-cyclic multi-edge type LDPC (QC-METLDPC) codes based on graphic processing unit (GPU) in continuous-variable quantum key distribution (CV-QKD) systems. We optimize the storage method of the parity check matrix, merge the sub-matrices which are unrelated, and decode multiple codewords in parallel on GPU. Simulation results demonstrate that the average decoding speed of LDPC codes with three typical code rates, i.e., 0.1, 0.05 and 0.02, is up to 64.11Mbits/s, 48.65Mbits/s and 39.51Mbits/s, respectively, when decoding 128 codewords of length 106 simultaneously without early termination.

preprint2020arXiv

Passivation mechanisms and pre-oxidation effects on model surfaces of FeCrNi austenitic stainless steel

Passivation mechanisms were investigated on (100)-oriented Fe-18Cr-13Ni surfaces with direct transfer between surface preparation and analysis by X-ray photoelectron spectroscopy and scanning tunneling microscopy and electrochemical characterization. Starting from oxide-free surfaces, pre-oxidation at saturation under ultra-low pressure (ULP) oxygen markedly promotes the oxide film Cr(III) enrichment and hinders/delays subsequent iron oxidation in water-containing environment. Exposure to sulfuric acid at open circuit potential causes preferential dissolution of oxidized iron species. Anodic passivation forces oxide film re-growth, Cr(III) dehydroxylation and further enrichment. ULP pre-oxidation promotes Cr(III) hydroxide formation at open circuit potential, compactness of the nanogranular oxide film and corrosion protection.

preprint2020arXiv

Privacy-preserving collaborative machine learning on genomic data using TensorFlow

Machine learning (ML) methods have been widely used in genomic studies. However, genomic data are often held by different stakeholders (e.g. hospitals, universities, and healthcare companies) who consider the data as sensitive information, even though they desire to collaborate. To address this issue, recent works have proposed solutions using Secure Multi-party Computation (MPC), which train on the decentralized data in a way that the participants could learn nothing from each other beyond the final trained model. We design and implement several MPC-friendly ML primitives, including class weight adjustment and parallelizable approximation of activation function. In addition, we develop the solution as an extension to TF Encrypted~\citep{dahl2018private}, enabling us to quickly experiment with enhancements of both machine learning techniques and cryptographic protocols while leveraging the advantages of TensorFlow's optimizations. Our implementation compares favorably with state-of-the-art methods, winning first place in Track IV of the iDASH2019 secure genome analysis competition.

preprint2020arXiv

The gamma-ray and Optical Variability Analysis of the BL Lac Object 3FGL J0449.4-4350

We have assembled the historical light curves of the BL Lac Object 3FGL J0449.4-4350 at optical and gamma-ray bands, the time spanning about 10 years, analyzed the periodic variability of the light curves by using four different methods (Lomb-Scargle periodogram, REDFIT38, Jurkevich and DACF). We detected a marginally possible quasi-periodic oscillation (QPO) of ~450 days. Assuming it originates from the helical motion jet in a supermassive binary black hole (SMBBH) system undergoing major merger, we estimate the primary black hole mass M~7.7*10^{9} M_sun. To explore the origin of the gamma-ray, we investigated the optical-gamma-ray correlations using discrete correlation function (DCF) method, and found that the correlation between the two bands is very significant. This strong correlation tends to imply lepton self-synchro-Compton (LSSC) model to produce the gamma-ray.

preprint2020arXiv

Tritonlike molecules of three identical baryons

In nuclear physics, triton and helium-3 nucleus can be understood as three-body hadronic molecules. Analogous to the loosely bound structures for the triton and helium-3 nucleus, whether there is a bound state formed by three hadrons leaves us an open issue. Based on the one-boson exchange model as well as the adoption of the variational approach, we make a comprehensive investigation on the tritonlike systems of three identical baryons $NNN$, $ΛΛΛ$, $ΞΞΞ$ and $ΣΣΣ$. We predict that the three-body molecular states for the systems of three identical hadrons of baryon octet are probably existent as long as their two-body subsystems have bound states. The numerical results of this work may be helpful for the theoretical and experimental researches on the tri-hadron molecules in future.