Source author record

Wei Cheng

Wei Cheng appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

40works

27topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

The rapid evolution of Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs) has driven major gains in reasoning, perception, and generation across language and vision, yet whether these advances translate into comparable improvements in safety remains unclear, partly due to fragmented evaluations that focus on isolated modalities or threat models. In this report, we present an integrated safety evaluation of six frontier models--GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5--assessing each across language, vision-language, and image generation using a unified protocol that combines benchmark, adversarial, multilingual, and compliance evaluations. By aggregating results into safety leaderboards and model profiles, we reveal a highly uneven safety landscape: while GPT-5.2 demonstrates consistently strong and balanced performance, other models exhibit clear trade-offs across benchmark safety, adversarial robustness, multilingual generalization, and regulatory compliance. Despite strong results under standard benchmarks, all models remain highly vulnerable under adversarial testing, with worst-case safety rates dropping below 6%. Text-to-image models show slightly stronger alignment in regulated visual risk categories, yet remain fragile when faced with adversarial or semantically ambiguous prompts. Overall, these findings highlight that safety in frontier models is inherently multidimensional--shaped by modality, language, and evaluation design--underscoring the need for standardized, holistic safety assessments to better reflect real-world risk and guide responsible deployment.

preprint2026arXiv

Improving Code Translation with Syntax-Guided and Semantic-aware Preference Optimization

LLMs have shown immense potential for code translation, yet they often struggle to ensure both syntactic correctness and semantic consistency. While preference-based learning offers a promising alignment strategy, it is hindered by unreliable semantic rewards derived from sparse test cases or restrictive reference translations. We argue that a robust semantic reward for code translation must be derived directly from the source code. In this paper, we propose CTO to improve code translation with syntax-guided and semantic-aware preference optimization. Through contrastive learning, we train a cross-lingual semantic model to directly assess functional equivalence between source and translated code. By formulating code translation as a multi-objective optimization problem, this robust semantic signal is seamlessly unified with compiler-based syntactic feedback within the direct preference optimization framework. Extensive experiments on C++, Java, and Python translations demonstrate that CTO significantly outperforms existing baselines and alternative preference optimization strategies.

preprint2026arXiv

To Diff or Not to Diff? Structure-Aware and Adaptive Output Formats for Efficient LLM-based Code Editing

Large Language Models (LLMs) are increasingly used for code editing, yet the prevalent full-code generation paradigm suffers from severe efficiency bottlenecks, posing challenges for interactive coding assistants that demand low latency and cost. Despite the predominant focus on scaling model capabilities, the edit format itself has been largely overlooked in model training. In this paper, we begin with a systematic study of conventional diff formats and reveal that fragile offsets and fragmented hunks make generation highly unnatural for LLMs. To address it, we introduce BlockDiff and FuncDiff, two structure-aware diff formats that represent changes as block-level rewrites of syntactically coherent units such as control structures and functions. Furthermore, we propose AdaEdit, a general adaptive edit strategy that trains LLMs to dynamically choose the most token-efficient format between a given diff format and full code. Extensive experiments demonstrate that AdaEdit paired with structure-aware diff formats consistently matches the accuracy of full-code generation, while reducing both latency and cost by over 30% on long-code editing tasks.

preprint2025arXiv

Observation of hierarchy of Hilbert space ergodicities in the quantum dynamics of a single spin

Ergodicity, the property that all allowed configurations are explored over time, plays a pivotal role in explaining the equilibrium behavior of classical dynamical systems. Yet, such a property is typically precluded in quantum systems owing to the presence of energy eigenstates, which are stationary states in dynamics. However, recent theoretical works have argued that ergodic explorations of the Hilbert space, occurring at varying levels as measured by statistical pseudorandomness of the time-evolved quantum states, may be exhibited for quantum systems driven by Hamiltonians with aperiodic time dependencies, which do not face such obstacles. Here, we experimentally investigate the hierarchy of Hilbert-space ergodicities (HSE) achievable in the dynamics of a single quantum spin realized by a solid-state defect in diamond, upon subjecting it to various time-dependent modulations. Through continuous monitoring of spin trajectories with full state tomography, different degrees of HSE were observed, ranging from no HSE in a time-periodic (Floquet) drive, to partial HSE in a smoothly kicked time-quasiperiodic drive, to complete HSE in a drive composed of a sequence of kicks generated by the Fibonacci word. We formulate a theoretical understanding of the increasing levels of HSE observed by attributing them to increasing levels of complexities associated with the drive sequences, whose notions we elucidate. Our work constitutes the first unambiguous experimental evidence of Hilbert space ergodicity and promotes deeper investigations into the mechanisms and fine-grained levels with which closed quantum systems reach equilibrium.

preprint2022arXiv

$B_c \to J/ψ$ helicity form factors and the $B_c^+ \to J/ψ+(P, V, \ell^+ ν_\ell)$ decays

In this paper, we calculate the $B_c\to J/ψ$ helicity form factors (HFFs) up to twist-4 accuracy by using the light-cone sum rules (LCSR) approach. After extrapolating those HFFs to the physically allowable $q^2$ region, we investigate the $B^+_c$-meson two-body decays and semi-leptonic decays $B_c^+ \to J/ψ+(P, V, \ell^+ ν_\ell)$ with $P/V$ stands for light pseudoscalar/vector meson, respectively. The branching fractions can be derived by using the CKM matrix element and the $B_c$ lifetime from the Particle Data Group, and we obtain ${\cal B}(B_c^+ \to J/ψπ^+)=(0.136^{+0.002}_{-0.002})\%$, ${\cal B}(B_c^+ \to J/ψK^+)=(0.010^{+0.000}_{-0.000})\%$, ${\cal B}(B_c^+ \to J/ψρ^+) =(0.768^{+0.029}_{-0.033})\%$, ${\cal B}(B_c^+ \to J/ψK^{\ast +})=(0.043^{+0.001}_{-0.001})\%$, ${\cal B}(B_c^+ \to J/ψμ^+ν_μ)=(2.802^{+0.526}_{-0.675})\%$ and ${\cal B}(B_c^+ \to J/ψτ^+ν_τ)=(0.559^{+0.131}_{-0.170})\%$. We then obtain ${\cal R}_{π^+/μ^+ν_μ} = 0.048^{+ 0.009}_{-0.012}$ and ${\cal R}_{K^+ / π^+} = 0.075^{+0.005}_{-0.005}$, which agree with the LHCb measured value within $1σ$-error. We also obtain ${\cal R}_{J/ψ}=0.199^{+ 0.060}_{-0.077}$, which like other theoretical predictions, is consistent with the LHCb measured value within $2σ$-error. Those imply that the HFFs under the LCSR approach are also applicable to the $B^+_c$ meson two-body decays and semi-leptonic decays $B_c^+ \to J/ψ+(P, V, \ell^+ ν_\ell)$, and the HFFs obtained by using LCSR in a new way implies that there may be new physics in the $B_c\to J/ψ\ell^+ ν_\ell$ semi-leptonic decays.

preprint2022arXiv

Attacking Masked Cryptographic Implementations: Information-Theoretic Bounds

Measuring the information leakage is critical for evaluating the practical security of cryptographic devices against side-channel analysis. Information-theoretic measures can be used (along with Fano's inequality) to derive upper bounds on the success rate of any possible attack in terms of the number of side-channel measurements. Equivalently, this gives lower bounds on the number of queries for a given success probability of attack. In this paper, we consider cryptographic implementations protected by (first-order) masking schemes, and derive several information-theoretic bounds on the efficiency of any (second-order) attack. The obtained bounds are generic in that they do not depend on a specific attack but only on the leakage and masking models, through the mutual information between side-channel measurements and the secret key. Numerical evaluations confirm that our bounds reflect the practical performance of optimal maximum likelihood attacks.

preprint2022arXiv

Conflict-aware Inference of Python Compatible Runtime Environments with Domain Knowledge Graph

Code sharing and reuse is a widespread use practice in software engineering. Although a vast amount of open-source Python code is accessible on many online platforms, programmers often find it difficult to restore a successful runtime environment. Previous studies validated automatic inference of Python dependencies using pre-built knowledge bases. However, these studies do not cover sufficient knowledge to accurately match the Python code and also ignore the potential conflicts between their inferred dependencies, thus resulting in a low success rate of inference. In this paper, we propose PyCRE, a new approach to automatically inferring Python compatible runtime environments with domain knowledge graph (KG). Specifically, we design a domain-specific ontology for Python third-party packages and construct KGs for over 10,000 popular packages in Python 2 and Python 3. PyCRE discovers candidate libraries by measuring the matching degree between the known libraries and the third-party resources used in target code. For the NP-complete problem of dependency solving, we propose a heuristic graph traversal algorithm to efficiently guarantee the compatibility between packages. PyCRE achieves superior performance on a real-world dataset and efficiently resolves nearly half more import errors than previous methods.

preprint2022arXiv

Deep Federated Anomaly Detection for Multivariate Time Series Data

Despite the fact that many anomaly detection approaches have been developed for multivariate time series data, limited effort has been made on federated settings in which multivariate time series data are heterogeneously distributed among different edge devices while data sharing is prohibited. In this paper, we investigate the problem of federated unsupervised anomaly detection and present a Federated Exemplar-based Deep Neural Network (Fed-ExDNN) to conduct anomaly detection for multivariate time series data on different edge devices. Specifically, we first design an Exemplar-based Deep Neural network (ExDNN) to learn local time series representations based on their compatibility with an exemplar module which consists of hidden parameters learned to capture varieties of normal patterns on each edge device. Next, a constrained clustering mechanism (FedCC) is employed on the centralized server to align and aggregate the parameters of different local exemplar modules to obtain a unified global exemplar module. Finally, the global exemplar module is deployed together with a shared feature encoder to each edge device and anomaly detection is conducted by examining the compatibility of testing data to the exemplar module. Fed-ExDNN captures local normal time series patterns with ExDNN and aggregates these patterns by FedCC, and thus can handle the heterogeneous data distributed over different edge devices simultaneously. Thoroughly empirical studies on six public datasets show that ExDNN and Fed-ExDNN can outperform state-of-the-art anomaly detection algorithms and federated learning techniques.

preprint2022arXiv

Direct measurement of the distribution of dark matter with strongly lensed gravitational waves

In this Letter, we present a new idea of probing the distribution of dark matter exhibiting elastic and velocity-independent self-interactions. These interactions might be revealed in multiple measurements of strongly lensed gravitational waves, which can be observationally explored to determine the strength of self-scatterings. Specifically, each individual galactic-scale strong-lensing system whose source is a coalescing compact binary emitting gravitational waves will provide a model-independent measurement of the shear viscosity of dark matter along the line of sight. These individual measurements could be a probe of large-scale distribution of dark matter and its properties. Our results indicate that with 10-1000 strongly lensed gravitational waves from ET and DECIGO, robust constraints on the large-scale distribution of self-interacting dark matter might be produced. More stringent limits on the dark matter scattering cross-section per unit mass ($σ_χ/m_χ$) relevant to galaxy and cluster scales are also expected, compared with the conservative estimates obtained in the electromagnetic domain. Finally, we discuss the effectiveness of our method in the context of self-interacting dark matter particle physics.

preprint2022arXiv

Experimental violation of the Leggett-Garg inequality with a single-spin system

Investigation the boundary between quantum mechanical description and classical realistic view is of fundamental importance. The Leggett-Garg inequality provides a criterion to distinguish between quantum systems and classical systems, and can be used to prove the macroscopic superposition state. A larger upper bound of the LG function can be obtained in a multi-level system. Here, we present an experimental violation of the Leggett-Garg inequality in a three-level system using nitrogen-vacancy center in diamond by ideal negative result measurement. The experimental maximum value of Leggett-Garg function is $K_{3}^{exp}=1.625\pm0.022$ which exceeds the Lüders bound with a $5σ$ level of confidence.

preprint2022arXiv

Generalizable Neural Performer: Learning Robust Radiance Fields for Human Novel View Synthesis

This work targets at using a general deep learning framework to synthesize free-viewpoint images of arbitrary human performers, only requiring a sparse number of camera views as inputs and skirting per-case fine-tuning. The large variation of geometry and appearance, caused by articulated body poses, shapes and clothing types, are the key bottlenecks of this task. To overcome these challenges, we present a simple yet powerful framework, named Generalizable Neural Performer (GNR), that learns a generalizable and robust neural body representation over various geometry and appearance. Specifically, we compress the light fields for novel view human rendering as conditional implicit neural radiance fields from both geometry and appearance aspects. We first introduce an Implicit Geometric Body Embedding strategy to enhance the robustness based on both parametric 3D human body model and multi-view images hints. We further propose a Screen-Space Occlusion-Aware Appearance Blending technique to preserve the high-quality appearance, through interpolating source view appearance to the radiance fields with a relax but approximate geometric guidance. To evaluate our method, we present our ongoing effort of constructing a dataset with remarkable complexity and diversity. The dataset GeneBody-1.0, includes over 360M frames of 370 subjects under multi-view cameras capturing, performing a large variety of pose actions, along with diverse body shapes, clothing, accessories and hairdos. Experiments on GeneBody-1.0 and ZJU-Mocap show better robustness of our methods than recent state-of-the-art generalizable methods among all cross-dataset, unseen subjects and unseen poses settings. We also demonstrate the competitiveness of our model compared with cutting-edge case-specific ones. Dataset, code and model will be made publicly available.

preprint2022arXiv

High precision measurement of cosmic curvature: from gravitational waves and cosmic chronometer

Although the spatial curvature has been measured with very high precision, it still suffers from the well known cosmic curvature tension. In this paper, we propose an improved method to determine the cosmic curvature, by using the simulated data of binary neutron star mergers observed by the second generation space-based DECi-hertz Interferometer Gravitational-wave Observatory (DECIGO). By applying the Hubble parameter observations of cosmic chronometers to the DECIGO standard sirens, we explore different possibilities of making measurements of the cosmic curvature referring to a distant past: one is to reconstruct the Hubble parameters through the Gaussian process without the influence of hypothetical models, and the other is deriving constraints on $Ω_K$ in the framework of non-flat $Λ$ cold dark matter model. It is shown that in the improved method DECIGO could provide a reliable and stringent constraint on the cosmic curvature ($Ω_{K} = -0.007\pm0.016$), while we could only expect the zero cosmic curvature to be established at the precision of $ΔΩ_K=0.12$ in the second model-dependent method. Therefore, our results indicate that in the framework of methodology proposed in this paper, the increasing number of well-measured standard sirens in DECIGO could significantly reduce the bias of estimations for cosmic curvature. Such constraint is also comparable to the precision of Planck 2018 results with the newest cosmic microwave background (CMB) observations ($ΔΩ_{K} \approx 0.018$), based on the concordance $Λ$CDM model.

preprint2022arXiv

Inference in High-dimensional Multivariate Response Regression with Hidden Variables

This paper studies the inference of the regression coefficient matrix under multivariate response linear regressions in the presence of hidden variables. A novel procedure for constructing confidence intervals of entries of the coefficient matrix is proposed. Our method first utilizes the multivariate nature of the responses by estimating and adjusting the hidden effect to construct an initial estimator of the coefficient matrix. By further deploying a low-dimensional projection procedure to reduce the bias introduced by the regularization in the previous step, a refined estimator is proposed and shown to be asymptotically normal. The asymptotic variance of the resulting estimator is derived with closed-form expression and can be consistently estimated. In addition, we propose a testing procedure for the existence of hidden effects and provide its theoretical justification. Both our procedures and their analyses are valid even when the feature dimension and the number of responses exceed the sample size. Our results are further backed up via extensive simulations and a real data analysis.

preprint2022arXiv

Local strict singular characteristics II: existence for stationary equation on $\mathbb{R}^2$

The notion of strict singular characteristics is important in the wellposedness issue of singular dynamics on the cut locus of the viscosity solutions. We provide an intuitive and rigorous proof of the existence of the strict singular characteristics of Hamilton-Jacobi equation $H(x,Du(x),u(x))=0$ in two dimensional case. We also proved if $\mathbf{x}$ is a strict singular characteristic, then we really have the right-differentiability of $\mathbf{x}$ and the right-continuity of $\dot{\mathbf{x}}^+(t)$ for every $t$. Such a strict singular characteristic must give a selection $p(t)\in D^+u(\mathbf{x}(t))$ such that $p(t)=\arg\min_{p\in D^+u(\mathbf{x}(t))}H(\mathbf{x}(t),p,u(\mathbf{x}(t)))$.

preprint2022arXiv

Reheating constraints on modified single-field Natural Inflation models

In this paper, we discuss three modified single-field natural inflation models in detail, including Special generalized Natural Inflation model(SNI), Extended Natural Inflation model(ENI) and Natural Inflation inspired model(NII). We derive the analytical expression of the tensor-to-scalar ratio $r$ and the spectral index $n_s$ for those models. Then the reheating temperature $T_{re}$ and reheating duration $N_{re}$ are analytically derived. Moreover, considering the CMB constraints, the feasible space of the SNI model in $(n_s, r)$ plane is almost covered by that of the NII, which means the NII is more general than the SNI. In addition, there is no overlapping space between the ENI and the other two models in $(n_s, r)$ plane, which indicates that the ENI and the other two models exclude each other, and more accurate experiments can verify them. Furthermore, the reheating brings tighter constraints to the inflation models, but they still work for a different reheating universe. Considering the constraints of $n_s$, $r$, $N_k$ and choosing $T_{re}$ near the electroweak energy scale, one can find that the decay constants of the three models have no overlapping area and the effective equations of state $ω_{re}$ should be within $\frac{1}{4}\lesssim ω_{re} \lesssim \frac{4}{5}$ for the three models.

preprint2022arXiv

SEED: Sound Event Early Detection via Evidential Uncertainty

Sound Event Early Detection (SEED) is an essential task in recognizing the acoustic environments and soundscapes. However, most of the existing methods focus on the offline sound event detection, which suffers from the over-confidence issue of early-stage event detection and usually yield unreliable results. To solve the problem, we propose a novel Polyphonic Evidential Neural Network (PENet) to model the evidential uncertainty of the class probability with Beta distribution. Specifically, we use a Beta distribution to model the distribution of class probabilities, and the evidential uncertainty enriches uncertainty representation with evidence information, which plays a central role in reliable prediction. To further improve the event detection performance, we design the backtrack inference method that utilizes both the forward and backward audio features of an ongoing event. Experiments on the DESED database show that the proposed method can simultaneously improve 13.0\% and 3.8\% in time delay and detection F1 score compared to the state-of-the-art methods.

preprint2022arXiv

The ratio $\mathcal{R}(D_s)$ for $B_s \to D_s \ellν_\ell$ by using the QCD light-cone sum rules within the framework of heavy quark effective field theory

In the paper, we study the $B_s\to D_s$ transition form factors by using the light-cone sum rules within the framework of heavy quark effective field theory. We adopt a chiral current correlation function to do the calculation, the resultant transition form factors $f_+^{B_s\to D_s}(q^2)$ and $f_0^{B_s\to D_s}(q^2)$ are dominated by the contribution of $D_s$-meson leading-twist distribution amplitude, while the contributions from less certain $D_s$-meson twist-3 distribution amplitudes are greatly suppressed. At the largest recoil point, we obtain $f_{+,0}^{B_s \to D_s}(0)=0.533^{+0.112}_{-0.094}$. By further extrapolating the transition form factors into all the physically allowable $q^2$ region with the help of the $z$-series parametrization approach, we calculate the branching fractions $\mathcal{B}(B_s \to D_s \ell^\prime ν_{\ell^\prime})$ with $(\ell^\prime= e,μ)$ and $\mathcal{B}(B_s \to D_s τν_τ)$, which gives $\mathcal{R}(D_s)=0.334\pm 0.017$.

preprint2022arXiv

Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-sentence Dependency Graph

We target the task of cross-lingual Machine Reading Comprehension (MRC) in the direct zero-shot setting, by incorporating syntactic features from Universal Dependencies (UD), and the key features we use are the syntactic relations within each sentence. While previous work has demonstrated effective syntax-guided MRC models, we propose to adopt the inter-sentence syntactic relations, in addition to the rudimentary intra-sentence relations, to further utilize the syntactic dependencies in the multi-sentence input of the MRC task. In our approach, we build the Inter-Sentence Dependency Graph (ISDG) connecting dependency trees to form global syntactic relations across sentences. We then propose the ISDG encoder that encodes the global dependency graph, addressing the inter-sentence relations via both one-hop and multi-hop dependency paths explicitly. Experiments on three multilingual MRC datasets (XQuAD, MLQA, TyDiQA-GoldP) show that our encoder that is only trained on English is able to improve the zero-shot performance on all 14 test sets covering 8 languages, with up to 3.8 F1 / 5.2 EM improvement on-average, and 5.2 F1 / 11.2 EM on certain languages. Further analysis shows the improvement can be attributed to the attention on the cross-linguistically consistent syntactic path.

preprint2021arXiv

$η^{(\prime)}$-meson twist-2 distribution amplitude within QCD sum rule approach and its application to the semi-leptonic decay $ D_s^+ \toη^{(\prime)}\ell^+ ν_\ell$

In this paper, we make a detailed discussion on the $η$ and $η'$-meson leading-twist light-cone distribution amplitude $ϕ_{2;η^{(\prime)}}(u,μ)$ by using QCD sum rules approach under the background field theory. Taking both the non-perturbative condensates up to dimension-six and NLO QCD corrections to the perturbative part, its first three moments $\langleξ^n_{2;η^{(\prime)}}\rangle|_{μ_0} $ with $n = (2,4,6)$ at initial scale $μ_0 = 1$ GeV can be determined. e.g. $\langleξ_{2;η}^2\rangle|_{μ_0} =0.231_{-0.013}^{+0.010}$, $\langleξ_{2;η}^4 \rangle|_{μ_0} =0.109_{-0.007}^{+0.007}$, and $\langleξ_{2;η}^6 \rangle|_{μ_0} =0.066_{-0.006}^{+0.006}$ for $η$-meson, $\langleξ_{2;η'}^2\rangle|_{μ_0} =0.211_{-0.017}^{+0.015}$, $\langleξ_{2;η'}^4 \rangle|_{μ_0} =0.093_{-0.009}^{+0.009}$, and $\langleξ_{2;η'}^6 \rangle|_{μ_0} =0.054_{-0.008}^{+0.008}$ for $η'$-meson. Next, we calculate $D_s\toη^{(\prime)}$ TFFs $f^{η^{(\prime)}}_+(q^2)$ within QCD light-cone sum rules approach up to NLO level. The values at large recoil region are $f^η_+(0) = 0.476_{-0.036}^{+0.040}$ and $f^{η'}_+(0) = 0.544_{-0.042}^{+0.046}$. After extrapolating TFFs to the allowable physical regions within the series expansion, we obtain the branching fractions of the semi-leptonic decay, i.e. $D_s^+\toη^{(\prime)}\ell^+ ν_\ell$, i.e. ${\cal B}(D_s^+\toη^{(\prime)} e^+ν_e)=2.346_{-0.331}^{+0.418}(0.792_{-0.118}^{+0.141})\times10^{-2}$ and ${\cal B}(D_s^+\toη^{(\prime)} μ^+ν_μ)=2.320_{-0.327}^{+0.413}(0.773_{-0.115}^{+0.138})\times10^{-2}$ for $\ell = (e, μ)$ channels respectively. And in addition to that, the mixing angle for $η-η'$ with $φ$ and ratio for the different decay channels ${\cal R}_{η'/η}^\ell$ are given, which show good agreement with the recent BESIII measurements.

preprint2021arXiv

Cumulant Expansion of Mutual Information for Quantifying Leakage of a Protected Secret

The information leakage of a cryptographic implementation with a given degree of protection is evaluated in a typical situation when the signal-to-noise ratio is small. This is solved by expanding Kullback-Leibler divergence, entropy, and mutual information in terms of moments/cumulants.

preprint2021arXiv

Dynamic Gaussian Mixture based Deep Generative Model For Robust Forecasting on Sparse Multivariate Time Series

Forecasting on sparse multivariate time series (MTS) aims to model the predictors of future values of time series given their incomplete past, which is important for many emerging applications. However, most existing methods process MTS's individually, and do not leverage the dynamic distributions underlying the MTS's, leading to sub-optimal results when the sparsity is high. To address this challenge, we propose a novel generative model, which tracks the transition of latent clusters, instead of isolated feature representations, to achieve robust modeling. It is characterized by a newly designed dynamic Gaussian mixture distribution, which captures the dynamics of clustering structures, and is used for emitting timeseries. The generative model is parameterized by neural networks. A structured inference network is also designed for enabling inductive analysis. A gating mechanism is further introduced to dynamically tune the Gaussian mixture distributions. Extensive experimental results on a variety of real-life datasets demonstrate the effectiveness of our method.

preprint2021arXiv

Singularities of solutions of Hamilton-Jacobi equations

This is a survey paper on the quantitative analysis of the propagation of singularities for the viscosity solutions to Hamilton-Jacobi equations in the past decades. We also review further applications of the theory to various fields such as Riemannian geometry, Hamiltonian dynamical systems and partial differential equations.

preprint2020arXiv

$D \to P(π,K)$ helicity form factors within light-cone sum rule approach

In this paper, the $D\to P(π, K)$ helicity form factors (HFFs) are studied by applying the QCD light-cone sum rule (LCSR) approach. The calculation accuracy is up to next-to-leading order (NLO) gluon radiation correction of twist-(2,3) distribution amplitude. The resultant HFFs at large recoil point are ${\cal P}_{t,0}^π(0) = 0.688^{+0.020}_{-0.024}$, ${\cal P}_{t,0}^K(0)=0.780^{+0.024}_{-0.029}$. In which, the contributions from three particles of the leading order (LO) are so small that can be safely neglected, and the maximal contribution of the NLO gluon radiation correction for ${\cal P}_{t,0}^{π,K}(0)$ is less than $3\%$. After extrapolating the LCSR predictions for these HFFs to whole $q^2$-region, we obtain the decay widths for semileptonic decay processes $D\to P\ellν_\ell$, which are consistent with BES-III collaboration predictions within errors. After considering the $D^{+}/D^{0}$-meson lifetime, we give the branching fractions of $D\to P\ellν_\ell$ with $\ell = e, μ$, our predictions also agree with BES-III collaboration within errors, especially for $D\to π\ellν_\ell$ decay process. Finally, we present the forward-backward asymmetry ${\cal A}_{\rm FB}^\ell(q^2)$ and lepton convexity parameter ${\cal C}_F^\ell(q^2)$, and further calculate the mean value of these two observations $\langle{\cal A}_{\rm FB}^\ell\rangle$ and $\langle{\cal C}_F^\ell\rangle$, which may provide a way to test those HFFs in future experiments.

preprint2020arXiv

Generalizing Variational Autoencoders with Hierarchical Empirical Bayes

Variational Autoencoders (VAEs) have experienced recent success as data-generating models by using simple architectures that do not require significant fine-tuning of hyperparameters. However, VAEs are known to suffer from over-regularization which can lead to failure to escape local maxima. This phenomenon, known as posterior collapse, prevents learning a meaningful latent encoding of the data. Recent methods have mitigated this issue by deterministically moment-matching an aggregated posterior distribution to an aggregate prior. However, abandoning a probabilistic framework (and thus relying on point estimates) can both lead to a discontinuous latent space and generate unrealistic samples. Here we present Hierarchical Empirical Bayes Autoencoder (HEBAE), a computationally stable framework for probabilistic generative models. Our key contributions are two-fold. First, we make gains by placing a hierarchical prior over the encoding distribution, enabling us to adaptively balance the trade-off between minimizing the reconstruction loss function and avoiding over-regularization. Second, we show that assuming a general dependency structure between variables in the latent space produces better convergence onto the mean-field assumption for improved posterior inference. Overall, HEBAE is more robust to a wide-range of hyperparameter initializations than an analogous VAE. Using data from MNIST and CelebA, we illustrate the ability of HEBAE to generate higher quality samples based on FID score than existing autoencoder-based approaches.

preprint2020arXiv

Local singular characteristics on $\mathbb{R}^2$

The singular set of a viscosity solution to a Hamilton-Jacobi equation is known to propagate, from any noncritical singular point, along singular characteristics which are curves satisfying certain differential inclusions. In the literature, different notions of singular characteristics were introduced. However, a general uniqueness criterion for singular characteristics, not restricted to mechanical systems or problems in one space dimension, is missing at the moment. In this paper, we prove that, for a Tonelli Hamiltonian on $\mathbb{R}^2$, two different notions of singular characteristics coincide up to a bi-Lipschitz reparameterization. As a significant consequence, we obtain a uniqueness result for the class of singular characteristics that was introduced by Khanin and Sobolevski in the paper [On dynamics of Lagrangian trajectories for Hamilton-Jacobi equations. Arch. Ration. Mech. Anal., 219(2):861-885, 2016].

preprint2020arXiv

Monotone iterative schemes for positive solutions of a fractional differential system with integral boundary conditions on an infinite interval

In this paper, using the monotone iterative technique and the Banach contraction mapping principle, we study a class of fractional differential system with integral boundary on an infinite interval. Some explicit monotone iterative schemes for approximating the extreme positive solutions and the unique positive solution are constructed.

preprint2020arXiv

The $D\to ρ$ semileptonic and radiative decays within the light-cone sum rules

The measured branching ratio of the $D$ meson semileptonic decay $D \to ρe^+ ν_e$, which is based on the $0.82~{\rm fb^{-1}}$ CLEO data taken at the peak of $ψ(3770)$ resonance, disagrees with the traditional SVZ sum rules analysis by about three times. In the paper, we show that this discrepancy can be eliminated by applying the QCD light-cone sum rules (LCSR) approach to calculate the $D\to ρ$ transition form factors $A_{1,2}(q^2)$ and $V(q^2)$. After extrapolating the LCSR predictions of these TFFs to whole $q^2$-region, we obtain $1/|V_{\rm cd}|^2 \times Γ(D \to ρe ν_e) =(55.45^{+13.34}_{-9.41})\times 10^{-15}~{\rm GeV}$. Using the CKM matrix element and the $D^0(D^+)$ lifetime from the Particle Data Group, we obtain ${\cal B} (D^0\to ρ^- e^+ ν_e) = (1.749^{+0.421}_{-0.297}\pm 0.006)\times 10^{-3}$ and ${\cal B} (D^+ \to ρ^0 e^+ ν_e) = (2.217^{+0.534}_{-0.376}\pm 0.015)\times 10^{-3}$, which agree with the CLEO measurements within errors. We also calculate the branching ratios of the two $D$ meson radiative processes and obtain ${\cal B}(D^0\to ρ^0 γ)= (1.744^{+0.598}_{-0.704})\times 10^{-5}$ and ${\cal B}(D^+ \to ρ^+ γ) = (5.034^{+0.939}_{-0.958})\times 10^{-5}$, which also agree with the Belle measurements within errors. Thus we think the LCSR approach is applicable for dealing with the $D$ meson decays.

preprint2019arXiv

Asymmetrical Hierarchical Networks with Attentive Interactions for Interpretable Review-Based Recommendation

Recently, recommender systems have been able to emit substantially improved recommendations by leveraging user-provided reviews. Existing methods typically merge all reviews of a given user or item into a long document, and then process user and item documents in the same manner. In practice, however, these two sets of reviews are notably different: users' reviews reflect a variety of items that they have bought and are hence very heterogeneous in their topics, while an item's reviews pertain only to that single item and are thus topically homogeneous. In this work, we develop a novel neural network model that properly accounts for this important difference by means of asymmetric attentive modules. The user module learns to attend to only those signals that are relevant with respect to the target item, whereas the item module learns to extract the most salient contents with regard to properties of the item. Our multi-hierarchical paradigm accounts for the fact that neither are all reviews equally useful, nor are all sentences within each review equally pertinent. Extensive experimental results on a variety of real datasets demonstrate the effectiveness of our method.

preprint2018arXiv

Dynamic and asymptotic behavior of singularities of certain weak KAM solutions on the torus

For mechanical Hamiltonian systems on the torus, we study the dynamical properties of the generalized characteristics semiflows associated with certain Hamilton-Jacobi equations, and build the relation between the $ω$-limit set of this semiflow and the projected Aubry set.

preprint2018arXiv

Vanishing contact structure problem and convergence of the viscosity solutions

This paper is devoted to study the vanishing contact structure problem which is a generalization of the vanishing discount problem. Let $H^λ(x,p,u)$ be a family of Hamiltonians of contact type with parameter $λ>0$ and converges to $G(x,p)$. For the contact type Hamilton-Jacobi equation with respect to $H^λ$, we prove that, under mild assumptions, the associated viscosity solution $u^λ$ converges to a specific viscosity solution $u^0$ of the vanished contact equation. As applications, we give some convergence results for the nonlinear vanishing discount problem.

preprint2016arXiv

FlyCap: Markerless Motion Capture Using Multiple Autonomous Flying Cameras

Aiming at automatic, convenient and non-instrusive motion capture, this paper presents a new generation markerless motion capture technique, the FlyCap system, to capture surface motions of moving characters using multiple autonomous flying cameras (autonomous unmanned aerial vehicles(UAV) each integrated with an RGBD video camera). During data capture, three cooperative flying cameras automatically track and follow the moving target who performs large scale motions in a wide space. We propose a novel non-rigid surface registration method to track and fuse the depth of the three flying cameras for surface motion tracking of the moving target, and simultaneously calculate the pose of each flying camera. We leverage the using of visual-odometry information provided by the UAV platform, and formulate the surface tracking problem in a non-linear objective function that can be linearized and effectively minimized through a Gaussian-Newton method. Quantitative and qualitative experimental results demonstrate the competent and plausible surface and motion reconstruction results

preprint2016arXiv

Generalized characteristics and Lax-Oleinik operators: global theory

For autonomous Tonelli systems on $\R^n$, we develop an intrinsic proof of the existence of generalized characteristics using sup-convolutions. This approach, together with convexity estimates for the fundamental solution, leads to new results such as the global propagation of singularities along generalized characteristics.

preprint2016arXiv

Particle Swarm Optimized Power Consumption of Trilateration

Trilateration-based localization (TBL) has become a corner stone of modern technology. This study formulates the concern on how wireless sensor networks can take advantage of the computational intelligent techniques using both single- and multi-objective particle swarm optimization (PSO) with an overall aim of concurrently minimizing the required time for localization, minimizing energy consumed during localization, and maximizing the number of nodes fully localized through the adjustment of wireless sensor transmission ranges while using TBL process. A parameter-study of the applied PSO variants is performed, leading to results that show algorithmic improvements of up to 32% in the evaluated objectives.

preprint2015arXiv

$B\to K^$ Transition Form Factors and the Semi-leptonic Decay $B \to K^ μ^+ μ^-$

We present a detailed calculation on the $B\to K^*$ transition form factors (TFFs), $A_{0,1,2}$, $V$ and $T_{1,2,3}$, within the QCD light-cone sum rules (LCSR). To suppress the contributions from high-twist light-cone distribution amplitudes, we adopt a right-handed chiral correlator to do the LCSR calculation. In the resultant LCSRs for the TFFs, the transverse leading-twist distribution amplitude $ϕ_{2;K^*}^\bot$ provides over $90\%$ contribution, thus those TFFs provide good platforms for testing the property of $ϕ_{2;K^*}^\bot$. We suggest a model for $ϕ_{2;K^*}^\bot$, in which two parameters $B_{2;K^*}^\bot$ and $C_{2;K^*}^\bot$ dominantly control its longitudinal distribution. With a proper choice of $B_{2;K^*}^\bot$ and $C_{2;K^*}^\bot$, our predictions on $B\to K^*$ TFFs are consistent with those of lattice QCD predictions. As an application, we also calculate the branching fraction of the $B$-meson rare decay $B \to K^* μ^+ μ^-$. The predicted differential branching fraction $d{\cal B}/dq^2(B\to K^{*}μ^+μ^-)$ is consistent with the LHCb and Belle measurements within errors. After integrating over the allowable $q^2$-region, we get the branching fraction, ${\cal B}(B\to K^*μ^+μ^-) = \left(1.088^{+0.261}_{-0.205} \right)\times 10^{-6}$, where the errors are squared average of the mentioned error sources. When the precision of experimental measurements or the other source of theoretical uncertainties for this process have been further improved in the future, we may get a definite conclusion on the behavior of $ϕ_{2;K^*}^\bot$.

preprint2015arXiv

Lasry-Lions, Lax-Oleinik and Generalized characteristics

In the recent works \cite{Cannarsa-Chen-Cheng} and \cite{Cannarsa-Cheng3}, an intrinsic approach of the propagation of singularities along the generalized characteristics was obtained, even in global case, by a procedure of sup-convolution with the kernel the fundamental solutions of the associated Hamilton-Jacobi equations. In the present paper, we exploit the relations among Lasry-Lions regularization, Lax-Oleinik operators (or inf/sup-convolution) and generalized characteristics, which are discussed in the context of the variational setting of Tonelli Hamiltonian dynamics, such as Mather theory and weak KAM theory.

preprint2014arXiv

Homoclinic orbits and critical points of barrier functions

We interpret the close link between the critical points of Mather's barrier functions and minimal homoclinic orbits with respect to the Aubry sets on $\mathbb{T}^n$. We also prove a critical point theorem for barrier functions, and the existence of such homoclinic orbits on $\mathbb{T}^2$ as an application.

preprint2013arXiv

Propagation of singularities for weak KAM solutions and barrier functions

This paper studies the structure of the singular set (points of nondifferentiability) of viscosity solutions to Hamilton-Jacobi equations associated with general mechanical systems on the n-torus. First, using the level set method, we characterize the propagation of singularities along generalized characteristics. Then, we obtain a local propagation result for singularities of weak KAM solutions in the supercritical case. Finally, we apply such a result to study the propagation of singularities for barrier functions.

preprint2012arXiv

Identifying Brain Image Level Endophenotypes in Epilepsy

A brain wide association study (BWAS) based on the logistic regression was first developed and applied to a large population of epilepsy patients (168) and healthy controls (136). It was found that the most significant links associated with epilepsy are those bilateral links with regions mainly belonging to the default mode network and subcortex, such as amygdala, fusiform gyrus, inferior temporal gyrus, hippocampus, temporal pole, parahippocampal gyrus, insula, middle occipital gyrus, cuneus. These links were found to have much higher odd ratios than other links, and all of them showed reduced functional couplings in patients compared with controls. Interestingly, with the increasing of the seizure onset frequency or duration of illness, the functional connection between these bilateral regions became further reduced. On the other hand, as a functional compensation and brain plasticity, connections of these bilateral regions to other brain regions were abnormally enhanced and became even much stronger with the increase of the seizure onset frequency. Furthermore, patients had higher network efficiencies than healthy controls, and the seizure onset frequency was found to be positively correlated with the network efficiency. A negative correlation between the bilateral connection and the network efficiency was also observed. To further validate our findings, we then employed our BWAS results in discriminating patients from healthy controls and the leave-one-out accuracy was around 78%. Given the fact that a genome-wide association study with a large cohort has failed to identify any significant association between genes and epilepsy, our study could provide us with a set of endophenotypes for further study.

preprint2011arXiv

Increased Coupling in the Saliency Network is the main cause/effect of Attention Deficit Hyperactivity Disorder

To uncover the underlying mechanisms of mental disorders such as attention deficit hyperactivity disorder (ADHD) for improving both early diagnosis and therapy, it is increasingly recognized that we need a better understanding of how the brain's functional connections are altered. A new brain wide association study (BWAS) has been developed and used to investigate functional connectivity changes in the brains of patients suffering from ADHD using resting state fMRI data. To reliably find out the most significantly altered functional connectivity links and associate them with ADHD, a meta-analysis on a cohort of ever reported largest population comprising 249 patients and 253 healthy controls is carried out. The greatest change in ADHD patients was the increased coupling of the saliency network involving the anterior cingulate gyrus and anterior insula. A voxel-based morphometry analysis was also carried out but this revealed no evidence in the ADHD patients for altered grey matter volumes in the regions showing altered functional connectivity. This is the first evidence for the involvement of the saliency network in ADHD and it suggests that this may reflect increased sensitivity over the integration of the incoming sensory information and his/her own thoughts and the network as a switch is bias towards to the central executive network.

preprint2011arXiv

Photonic bandgap plasmonic waveguides

A novel type of a plasmonic waveguide has been proposed featuring an "open" design that is easy to manufacture, simple to excite and that offers a convenient access to a plasmonic mode. Optical properties of photonic bandgap (PBG) plasmonic waveguides are investigated experimentally by leakage radiation microscopy and numerically using the finite element method confirming photonic bandgap guidance in a broad spectral range. Propagation and localization characteristics of a PBG plasmonic waveguide have been discussed as a function of the wavelength of operation, waveguide core size and the number of ridges in the periodic reflector for fundamental and higher order plasmonic modes of the waveguide.

Wei Cheng

What is connected

Connect this record

See the researcher in context

Building this map preview

40 published item(s)

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

Improving Code Translation with Syntax-Guided and Semantic-aware Preference Optimization

To Diff or Not to Diff? Structure-Aware and Adaptive Output Formats for Efficient LLM-based Code Editing

Observation of hierarchy of Hilbert space ergodicities in the quantum dynamics of a single spin

$B_c \to J/ψ$ helicity form factors and the $B_c^+ \to J/ψ+(P, V, \ell^+ ν_\ell)$ decays

Attacking Masked Cryptographic Implementations: Information-Theoretic Bounds

Conflict-aware Inference of Python Compatible Runtime Environments with Domain Knowledge Graph

Deep Federated Anomaly Detection for Multivariate Time Series Data

Direct measurement of the distribution of dark matter with strongly lensed gravitational waves

Experimental violation of the Leggett-Garg inequality with a single-spin system

Generalizable Neural Performer: Learning Robust Radiance Fields for Human Novel View Synthesis

High precision measurement of cosmic curvature: from gravitational waves and cosmic chronometer

Inference in High-dimensional Multivariate Response Regression with Hidden Variables

Local strict singular characteristics II: existence for stationary equation on $\mathbb{R}^2$

Reheating constraints on modified single-field Natural Inflation models

SEED: Sound Event Early Detection via Evidential Uncertainty

The ratio $\mathcal{R}(D_s)$ for $B_s \to D_s \ellν_\ell$ by using the QCD light-cone sum rules within the framework of heavy quark effective field theory

Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-sentence Dependency Graph

$η^{(\prime)}$-meson twist-2 distribution amplitude within QCD sum rule approach and its application to the semi-leptonic decay $ D_s^+ \toη^{(\prime)}\ell^+ ν_\ell$

Cumulant Expansion of Mutual Information for Quantifying Leakage of a Protected Secret

Dynamic Gaussian Mixture based Deep Generative Model For Robust Forecasting on Sparse Multivariate Time Series

Singularities of solutions of Hamilton-Jacobi equations

$D \to P(π,K)$ helicity form factors within light-cone sum rule approach

Generalizing Variational Autoencoders with Hierarchical Empirical Bayes

Local singular characteristics on $\mathbb{R}^2$

Monotone iterative schemes for positive solutions of a fractional differential system with integral boundary conditions on an infinite interval

The $D\to ρ$ semileptonic and radiative decays within the light-cone sum rules

Asymmetrical Hierarchical Networks with Attentive Interactions for Interpretable Review-Based Recommendation

Dynamic and asymptotic behavior of singularities of certain weak KAM solutions on the torus

Vanishing contact structure problem and convergence of the viscosity solutions

FlyCap: Markerless Motion Capture Using Multiple Autonomous Flying Cameras

Generalized characteristics and Lax-Oleinik operators: global theory

Particle Swarm Optimized Power Consumption of Trilateration

$B\to K^*$ Transition Form Factors and the Semi-leptonic Decay $B \to K^* μ^+ μ^-$

Lasry-Lions, Lax-Oleinik and Generalized characteristics

Homoclinic orbits and critical points of barrier functions

Propagation of singularities for weak KAM solutions and barrier functions

Identifying Brain Image Level Endophenotypes in Epilepsy

Increased Coupling in the Saliency Network is the main cause/effect of Attention Deficit Hyperactivity Disorder

Photonic bandgap plasmonic waveguides

$B\to K^$ Transition Form Factors and the Semi-leptonic Decay $B \to K^ μ^+ μ^-$