Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
70works
0followers
38topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

70 published item(s)

preprint2026arXiv

1/2 order convergence rate of Euler-type methods for time-changed stochastic differential equations with super-linearly growing drift and diffusion coefficients

This paper investigates the strong convergence properties of two Euler-type methods for a class of time-changed stochastic differential equations (TCSDEs) with super-linearly growing drift and diffusion coefficients. Building upon existing research, we propose a backward Euler method (BEM) and introduce its explicit counterpart -- the projected Euler method (PEM). We prove that both methods converge strongly in the $L_2$-sense at the optimal rate of 1/2. This result extends the applicability of both the BEM and the PEM to a broader class of TCSDEs. Moreover, the two methods offer complementary strengths: while BEM possesses wide applicability, PEM is computationally more efficient. Numerical simulations confirm our theoretical findings and illustrate practical performance of both schemes.

preprint2026arXiv

Beyond "I cannot fulfill this request": Alleviating Rigid Rejection in LLMs via Label Enhancement

Large Language Models (LLMs) rely on safety alignment to obey safe requests while refusing harmful ones. However, traditional refusal mechanisms often lead to "rigid rejection," where a general template (e.g., "I cannot fulfill this request") indiscriminately triggers refusals and severely undermines the naturalness of interactions between humans and LLMs. To address this issue, LANCE is proposed in this paper to ensure safe yet flexible and natural responses via label enhancement. Specifically, LANCE employs variational inference to perform label enhancement, predicting a continuous distribution across multiple rejection categories. These fine-grained rejection distributions provide multi-way textual gradients for a refinement model to neutralize the hazardous elements in the prompt, so that the LLMs could generate safe responses that avoid rigid rejections while preserving the naturalness of interactions. Experiments demonstrate that LANCE significantly alleviates the rigid rejection problem while maintaining high security standards, significantly outperforming existing baseline models in terms of helpfulness and naturalness of responses.

preprint2026arXiv

Beyond Immediate Activation: Temporally Decoupled Backdoor Attacks on Time Series Forecasting

Existing backdoor attacks on multivariate time series (MTS) forecasting enforce strict temporal and dimensional coupling between triggers and target patterns, requiring synchronous activation at fixed positions across variables. However, realistic scenarios often demand delayed and variable-specific activation. We identify this critical unmet need and propose TDBA, a temporally decoupled backdoor attack framework for MTS forecasting. By injecting triggers that encode the expected location of the target pattern, TDBA enables the activation of the target pattern at any positions within the forecasted data, with the activation position flexibly varying across different variable dimensions. TDBA introduces two core modules: (1) a position-guided trigger generation mechanism that leverages smoothed Gaussian priors to generate triggers that are position-related to the predefined target pattern; and (2) a position-aware optimization module that assigns soft weights based on trigger completeness, pattern coverage, and temporal offset, facilitating targeted and stealthy attack optimization. Extensive experiments on real-world datasets show that TDBA consistently outperforms existing baselines in effectiveness while maintaining good stealthiness. Ablation studies confirm the controllability and robustness of its design.

preprint2026arXiv

EXG: Self-Evolving Agents with Experience Graphs

Large language model (LLM)-based agents have demonstrated strong capabilities in complex reasoning and problem solving through multi-step interactions, yet most deployed agents remain behaviorally static, with knowledge acquired during execution rarely translating into systematic improvement over time. In response, a growing line of work on self-evolving agents explores how agents can improve through experience during deployment, but most existing approaches either rely on ad hoc reflection limited to single-task correction or adopt unstructured memory that accumulates fragmented experience with delayed usability. To address this limitation, we introduce EXG, an experience graph framework for self-evolving agents that explicitly organizes accumulated successes and failures into a structured, relational representation. EXG is the first experience graph designed for self-evolving agents, supporting both online, real-time graph growth during execution for immediate cross-task experience reuse, and offline reuse of a consolidated experience graph as an external memory module. This design also enables EXG to serve as a plug-and-play component for existing self-evolving agents, organizing prior experience into a unified experience graph and improving both solution quality and resource efficiency as deployment progresses. Extensive experiments across code generation and reasoning benchmarks show that EXG attains more favorable performance-efficiency trade-offs than reflection- and memory-based baselines in both online and offline evaluations. Our results suggest that structuring experience as a graph provides a principled foundation for scalable and transferable self-evolving agent behavior.

preprint2026arXiv

HyperJoin: LLM-augmented Hypergraph Link Prediction for Joinable Table Discovery

As a pivotal task in data lake management, joinable table discovery has attracted widespread interest. While existing language model-based methods achieve remarkable performance by combining offline column representation learning with online ranking, their design insufficiently accounts for the underlying structural interactions: (1) offline, they directly model tables into isolated or pairwise columns, thereby struggling to capture the rich inter-table and intra-table structural information; and (2) online, they rank candidate columns based solely on query-candidate similarity, ignoring the mutual interactions among the candidates, leading to incoherent result sets. To address these limitations, we propose HyperJoin, a large language model (LLM)-augmented Hypergraph framework for Joinable table discovery. Specifically, we first construct a hypergraph to model tables using both the intra-table hyperedges and the LLM-augmented inter-table hyperedges. Consequently, the task of joinable table discovery is formulated as link prediction on this constructed hypergraph. We then design HIN, a Hierarchical Interaction Network that learns expressive column representations through bidirectional message passing over columns and hyperedges. To strengthen coherence and internal consistency in the result columns, we cast online ranking as a coherence-aware top-k column selection problem. We then introduce a reranking module that leverages a maximum spanning tree algorithm to prune noisy connections and maximize coherence. Experiments demonstrate the superiority of HyperJoin, achieving average improvements of 21.4% (Precision@15) and 17.2% (Recall@15) over the best baseline.

preprint2026arXiv

MedVL-SAM2: A unified 3D medical vision-language model for multimodal reasoning and prompt-driven segmentation

Recent progress in medical vision-language models (VLMs) has achieved strong performance on image-level text-centric tasks such as report generation and visual question answering (VQA). However, achieving fine-grained visual grounding and volumetric spatial reasoning in 3D medical VLMs remains challenging, particularly when aiming to unify these capabilities within a single, generalizable framework. To address this challenge, we proposed MedVL-SAM2, a unified 3D medical multimodal model that concurrently supports report generation, VQA, and multi-paradigm segmentation, including semantic, referring, and interactive segmentation. MedVL-SAM2 integrates image-level reasoning and pixel-level perception through a cohesive architecture tailored for 3D medical imaging, and incorporates a SAM2-based volumetric segmentation module to enable precise multi-granular spatial reasoning. The model is trained in a multi-stage pipeline: it is first pre-trained on a large-scale corpus of 3D CT image-text pairs to align volumetric visual features with radiology-language embeddings. It is then jointly optimized with both language-understanding and segmentation objectives using a comprehensive 3D CT segmentation dataset. This joint training enables flexible interaction via language, point, or box prompts, thereby unifying high-level visual reasoning with spatially precise localization. Our unified architecture delivers state-of-the-art performance across report generation, VQA, and multiple 3D segmentation tasks. Extensive analyses further show that the model provides reliable 3D visual grounding, controllable interactive segmentation, and robust cross-modal reasoning, demonstrating that high-level semantic reasoning and precise 3D localization can be jointly achieved within a unified 3D medical VLM.

preprint2026arXiv

Overeager Coding Agents: Measuring Out-of-Scope Actions on Benign Tasks

Coding agents now run autonomously with shell, file, and network privileges. When a user issues a benign request, the agent sometimes does more than asked: it deletes unrelated files, wipes a stale credentials backup, or rewrites configuration the user never mentioned. We call these scope expansions overeager actions, an authorization problem distinct from capability failures, prompt injection, or sandbox escapes. We present OverEager-Gen, a benchmark dedicated to overeager behavior on benign tasks. Building it surfaces a measurement-validity issue: if a benchmark spells out the authorized scope inside the prompt, the agent stops inferring boundaries and starts pattern-matching declaration text. On Claude Code, stripping the consent declaration alone raises the overeager rate from 0.0% to 17.1% on paired scenarios (McNemar exact p = 2.4 x 10^-4). OverEager-Gen therefore certifies each scenario's discriminative power before admission via a behavioral-gradient validator, audits internal tool calls through a dual-channel stack (PATH-injected shim plus per-agent event streams), and ships byte-identical consent_kept and consent_stripped variants. OverEager-Bench contains 500 validated scenarios and ~7,500 runs across four agent products (Claude Code, OpenHands, Codex CLI, Gemini CLI) and six base models; a 50-sample re-annotation gives Cohen's kappa = 0.73 and rule-judge recall = 1.00. Stripping consent multiplies the overeager rate on every shared base model (Delta in [11.9, 17.2] pp). The framework axis dominates effect size: a permissive cluster (Claude Code, Codex CLI, Gemini CLI) runs at 5.4-27.7% while the ask-to-continue framework (OpenHands) sits at 0.2-4.5% (Fisher p <= 10^-5). Within-framework base-model variance reaches 15.9 pp, indicating that model-layer alignment does not fully propagate through permissive permission gating.

preprint2026arXiv

Quantum Latin squares of order $6m$ with all possible cardinalities

A quantum Latin square of order $n$ (denoted as QLS$(n)$) is an $n\times n$ array whose entries are unit column vectors from the $n$-dimensional Hilbert space $\mathcal{H}_n$, such that each row and column forms an orthonormal basis. Two unit vectors $|u\rangle, |v\rangle\in \mathcal{H}_n$ are regarded as identical if there exists a real number $θ$ such that $|u\rangle=e^{iθ}|v\rangle$; otherwise, they are considered distinct. The cardinality $c$ of a QLS$(n)$ is the number of distinct vectors in the array. In this note,we use sub-QLS$(6)$ to prove that for any integer $m\geq 2$ and any $c\in [6m,36m^2]\setminus \{6m+1\}$, there is a QLS$(6m)$ with cardinality $c$.

preprint2026arXiv

Why is the $Z_c(3900)$ absent in the $h_cπ$ final state?

In this work, we perform a comprehensive phenomenological analysis of the exotic hadronic states $Z_c(3900)$, $Z_c(4020)$, $Z_b(10610)$ and $Z_b(10650)$ within the framework of Heavy Quark Spin Symmetry (HQSS) and its violation. By constructing S-wave contact interactions between elastic ($D\bar{D}^*/D^*\bar{D}^*$ or $B\bar{B}^*/B^*\bar{B}^*$) and inelastic ($J/ψπ, h_cπ$ or $Υπ$, $h_bπ$) channels, we solve the Lippmann-Schwinger equation to obtain physical production amplitudes and perform a global fit to experimental invariant-mass spectra. Our results demonstrate a striking difference between the charm and bottom sectors: HQSS violation is negligible in the bottom system, leading to comparable peak structures for both $Z_b$ states in all hidden-bottom decay channels. In contrast, significant HQSS breaking is required to describe the $Z_c$ system, where the violation is predominantly concentrated in the elastic interactions. This explains the observed selectivity: $Z_c(3900)$ appears prominently only in $J/ψπ$, while $Z_c(4020)$ appears only in $h_cπ$. Pole analysis confirms the molecular nature of the states, with the $Z_c(4020)$ likely arising from a threshold cusp effect. The model&#39;s robustness is verified against variations of the form factor and cutoff, showing stable results.

preprint2025arXiv

Analyzing Communication Predictability in LLM Training

Effective communication is essential in distributed training, with predictability being one of its most significant characteristics. However, existing studies primarily focus on exploiting predictability through online profiling for runtime optimization, without a systematic understanding of it. In this work, we aim to systematically formulate communication predictability in distributed training, particularly in Large Language Models (LLMs) that utilize hybrid parallelism. Our analysis focuses on both traffic patterns and communication overhead. Specifically, we investigate predictable traffic patterns in typical LLMs and evaluate how various factors influence GPU utilization and effective bandwidth (two critical variables affecting communication overhead). Furthermore, we develop an analytical formulation to estimate communication overhead in LLM training, which is validated with high accuracy against empirical data. Leveraging this formulation, we propose a configuration tuning tool, ConfigTuner, to optimize training performance. Compared to Megatron-LM, the training configurations optimized by ConfigTuner demonstrate up to a 1.36$\times$ increase in throughput. Compared to Alpa, ConfigTuner generates the same configuration suggestion while significantly reducing the search complexity.

preprint2023arXiv

Audio2Gestures: Generating Diverse Gestures from Audio

People may perform diverse gestures affected by various mental and physical factors when speaking the same sentences. This inherent one-to-many relationship makes co-speech gesture generation from audio particularly challenging. Conventional CNNs/RNNs assume one-to-one mapping, and thus tend to predict the average of all possible target motions, easily resulting in plain/boring motions during inference. So we propose to explicitly model the one-to-many audio-to-motion mapping by splitting the cross-modal latent code into shared code and motion-specific code. The shared code is expected to be responsible for the motion component that is more correlated to the audio while the motion-specific code is expected to capture diverse motion information that is more independent of the audio. However, splitting the latent code into two parts poses extra training difficulties. Several crucial training losses/strategies, including relaxed motion loss, bicycle constraint, and diversity loss, are designed to better train the VAE. Experiments on both 3D and 2D motion datasets verify that our method generates more realistic and diverse motions than previous state-of-the-art methods, quantitatively and qualitatively. Besides, our formulation is compatible with discrete cosine transformation (DCT) modeling and other popular backbones (\textit{i.e.} RNN, Transformer). As for motion losses and quantitative motion evaluation, we find structured losses/metrics (\textit{e.g.} STFT) that consider temporal and/or spatial context complement the most commonly used point-wise losses (\textit{e.g.} PCK), resulting in better motion dynamics and more nuanced motion details. Finally, we demonstrate that our method can be readily used to generate motion sequences with user-specified motion clips on the timeline.

preprint2023arXiv

Illegal Intelligent Reflecting Surface Based Active Channel Aging: When Jammer Can Attack Without Power and CSI

Illegal intelligent reflecting surfaces (I-IRSs), i.e., the illegal deployment and utilization of IRSs, impose serious harmful impacts on wireless networks. The existing I-IRS-based illegal jammer (IJ) requires channel state information (CSI) or extra power or both, and therefore, the I-IRS-based IJ seems to be difficult to implement in practical wireless networks. To raise concerns about significant potential threats posed by I-IRSs, we propose an alternative method to jam legitimate users (LUs) without relying on the CSI. By using an I-IRS to actively change wireless channels, the orthogonality of multi-user beamforming vectors and the co-user channels is destroyed, and significant inter-user interference is then caused, which is referred to as active channel aging. Such a fully-passive jammer (FPJ) can launch jamming attacks on multi-user multiple-input single-output (MU-MISO) systems via inter-user interference caused by active channel aging, where the IJ requires no additional transmit power and instantaneous CSI. The simulation results show the effectiveness of the proposed FPJ scheme. Moreover, we also investigate how the transmit power and the number of quantization phase shift bits influence the jamming performance.

preprint2022arXiv

%CRTFASTGEEPWR: a SAS macro for power of the generalized estimating equations of multi-period cluster randomized trials with application to stepped wedge designs

Multi-period cluster randomized trials (CRTs) are increasingly used for the evaluation of interventions delivered at the group level. While generalized estimating equations (GEE) are commonly used to provide population-averaged inference in CRTs, there is a gap of general methods and statistical software tools for power calculation based on multi-parameter, within-cluster correlation structures suitable for multi-period CRTs that can accommodate both complete and incomplete designs. A computationally fast, non-simulation procedure for determining statistical power is described for the GEE analysis of complete and incomplete multi-period cluster randomized trials. The procedure is implemented via a SAS macro, \%CRTFASTGEEPWR, which is applicable to binary, count and continuous responses and several correlation structures in multi-period CRTs. The SAS macro is illustrated in the power calculation of two complete and two incomplete stepped wedge cluster randomized trial scenarios under different specifications of marginal mean model and within-cluster correlation structure. The proposed GEE power method is quite general as demonstrated in the SAS macro with numerous input options. The power procedure and macro can also be used in the planning of parallel and crossover CRTs in addition to cross-sectional and closed cohort stepped wedge trials.

preprint2022arXiv

AFD Types Sparse Representations vs. the Karhunen-Loeve Expansion for Decomposing Stochastic Processes

This article introduces adaptive Fourier decomposition (AFD) type methods, emphasizing on those that can be applied to stochastic processes and random fields, mainly including stochastic adaptive Fourier decomposition and stochastic pre-orthogonal adaptive Fourier decomposition. We establish their algorithms based on the covariant function and prove that they enjoy the same convergence rate as the Karhunen-Loève (KL) decomposition. The AFD type methods are compared with the KL decomposition. In contrast with the latter, the AFD type methods do not need to compute eigenvalues and eigenfunctions of the kernel-integral operator induced by the covariance function, and thus considerably reduce the computation complexity and computer consumes. Various kinds of dictionaries offer AFD flexibility to solve problems of a great variety, including different types of deterministic and stochastic equations. The conducted experiments show, besides the numerical convenience and fast convergence, that the AFD type decompositions outperform the KL type in describing local details, in spite of the proven global optimality of the latter.

preprint2022arXiv

Carbide: Highly Reliable Networks Through Real-Time Multiple Control Plane Composition

Achieving highly reliable networks is essential for network operators to ensure proper packet delivery in the event of software errors or hardware failures. Networks must ensure reachability and routing correctness, such as subnet isolation and waypoint traversal. Existing work in network verification relies on centralized computation at the cost of fault tolerance, while other approaches either build an over-engineered, complex control plane, or compose multiple control planes without providing any guarantee on correctness. This paper presents Carbide, a novel system to achieve high reliability in networks through distributed verification and multiple control plane composition. The core of Carbide is a simple, generic, efficient distributed verification framework that transforms a generic network verification problem to a reachability verification problem on a directed acyclic graph (DAG), and solves the latter via an efficient distributed verification protocol (DV-protocol). Equipped with verification results, Carbide allows the systematic composition of multiple control planes and realization of operator-specified consistency. Carbide is fully implemented. Extensive experiments show that (1) Carbide reduces downtime by 43% over the most reliable individual underlying control plane, while enforcing correctness requirements on all traffic; and (2) by systematically decomposing computation to devices and pruning unnecessary messaging between devices during verification, Carbide scales to a production data center network.

preprint2022arXiv

Continuum Skyrme Hartree-Fock-Bogoliubov theory with Green&#39;s function method for neutron-rich Ca, Ni, Zr, Sn isotopes

The possible exotic nuclear properties in the neutron-rich Ca, Ni, Zr, and Sn isotopes are explored with the continuum Skyrme Hartree-Fock-Bogoliubov theory formulated with the Green&#39;s function method. The available experimental two-neutron separation energies $S_{\rm 2n}$ and one-neutron separation energies $S_{\rm n}$ are well reproduced. Much shorter drip lines predicted by $S_{\rm n}$ are obtained compared with those by $S_{\rm 2n}$. The systematic studies of the neutron pairing energies $-E_{\rm pair}$ shown that values of the odd-$A$ nuclei are much smaller in comparison with those of the neighboring even-even nuclei due to the absent contribution of pairing energy by the unpaired odd neutron. By investigating the single-particle structures, the rms radii and the density distributions, the possible halo structures in the neutron-rich Ca, Ni, and Sn isotopes are predicted, in which the sharp increases of rms radii with significant deviations from the traditional $r\varpropto A^{1/3}$ rule and very diffuse spatial distributions in densities are observed. By analyzing the contributions of different partial waves to the total neutron density $ρ_{lj}(r)/ρ(r)$, the orbitals locating around the Fermi surface especially those with low angular momenta are found the main reason causing the extended nuclear density and large rms radii. Finally, the numbers of the neutrons $N_λ$~($N_0$) occupied above the Fermi surface $λ_n$~(in the continuum) are discussed, the behaviors of which are basically consistent with those of pairing energy, supporting the key role of the pairing correlations in the halo phenomena.

preprint2022arXiv

Detailed Balance for Particle Models of Reversible Reactions in Bounded Domains

In particle-based stochastic reaction-diffusion models, reaction rate and placement kernels are used to decide the probability per time a reaction can occur between reactant particles, and to decide where product particles should be placed. When choosing kernels to use in reversible reactions, a key constraint is to ensure that detailed balance of spatial reaction-fluxes holds at all points at equilibrium. In this work we formulate a general partial-integral differential equation model that encompasses several of the commonly used contact reactivity (e.g. Smoluchowski-Collins-Kimball) and volume reactivity (e.g. Doi) particle models. From these equations we derive a detailed balance condition for the reversible $\textrm{A} + \textrm{B} \leftrightarrows \textrm{C}$ reaction. In bounded domains with no-flux boundary conditions, when choosing unbinding kernels consistent with several commonly used binding kernels, we show that preserving detailed balance of spatial reaction-fluxes at all points requires spatially varying unbinding rate functions near the domain boundary. Brownian Dynamics simulation algorithms can realize such varying rates through ignoring domain boundaries during unbinding and rejecting unbinding events that result in product particles being placed outside the domain.

preprint2022arXiv

Emotional Conversation Generation with Heterogeneous Graph Neural Network

The successful emotional conversation system depends on sufficient perception and appropriate expression of emotions. In a real-life conversation, humans firstly instinctively perceive emotions from multi-source information, including the emotion flow hidden in dialogue history, facial expressions, audio, and personalities of speakers. Then, they convey suitable emotions according to their personalities, but these multiple types of information are insufficiently exploited in emotional conversation fields. To address this issue, in this paper, we propose a heterogeneous graph-based model for emotional conversation generation. Firstly, we design a Heterogeneous Graph-Based Encoder to represent the conversation content (i.e., the dialogue history, its emotion flow, facial expressions, audio, and speakers&#39; personalities) with a heterogeneous graph neural network, and then predict suitable emotions for feedback. Secondly, we employ an Emotion-Personality-Aware Decoder to generate a response relevant to the conversation context as well as with appropriate emotions, through taking the encoded graph representations, the predicted emotions by the encoder and the personality of the current speaker as inputs. Experiments on both automatic and human evaluation show that our method can effectively perceive emotions from multi-source knowledge and generate a satisfactory response. Furthermore, based on the up-to-date text generator BART, our model still can achieve consistent improvement, which significantly outperforms some existing state-of-the-art models.

preprint2022arXiv

Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review

The intelligent dialogue system, aiming at communicating with humans harmoniously with natural language, is brilliant for promoting the advancement of human-machine interaction in the era of artificial intelligence. With the gradually complex human-computer interaction requirements (e.g., multimodal inputs, time sensitivity), it is difficult for traditional text-based dialogue system to meet the demands for more vivid and convenient interaction. Consequently, Visual Context Augmented Dialogue System (VAD), which has the potential to communicate with humans by perceiving and understanding multimodal information (i.e., visual context in images or videos, textual dialogue history), has become a predominant research paradigm. Benefiting from the consistency and complementarity between visual and textual context, VAD possesses the potential to generate engaging and context-aware responses. For depicting the development of VAD, we first characterize the concepts and unique features of VAD, and then present its generic system architecture to illustrate the system workflow. Subsequently, several research challenges and representative works are detailed investigated, followed by the summary of authoritative benchmarks. We conclude this paper by putting forward some open issues and promising research trends for VAD, e.g., the cognitive mechanisms of human-machine dialogue under cross-modal dialogue context, and knowledge-enhanced cross-modal semantic interaction.

preprint2022arXiv

Example-Based Vulnerability Detection and Repair in Java Code

The Java libraries JCA and JSSE offer cryptographic APIs to facilitate secure coding. When developers misuse some of the APIs, their code becomes vulnerable to cyber-attacks. To eliminate such vulnerabilities, people built tools to detect security-API misuses via pattern matching. However, most tools do not (1) fix misuses or (2) allow users to extend tools&#39; pattern sets. To overcome both limitations, we created Seader-an example-based approach to detect and repair security-API misuses. Given an exemplar <insecure, secure>code pair, Seader compares the snippets to infer any API-misuse template and corresponding fixing edit. Based on the inferred info, given a program, Seader performs inter-procedural static analysis to search for security-API misuses and to propose customized fixes. For evaluation, we applied Seader to 28 <insecure, secure> codepairs; Seader successfully inferred 21 unique API-misuse templates and related fixes. With these <vulnerability, fix> patterns, we applied SEADER to a program benchmark that has 86 known vulnerabilities. Seader detected vulnerabilities with 95% precision, 72% recall, and82% F-score. We also applied Seader to 100 open-source projects and manually checked 77 suggested repairs; 76 of the repairs were correct. Seader can help developers correctly use security APIs.

preprint2022arXiv

Exploiting full Resolution Feature Context for Liver Tumor and Vessel Segmentation via Integrate Framework: Application to Liver Tumor and Vessel 3D Reconstruction under embedded microprocessor

Liver cancer is one of the most common malignant diseases in the world. Segmentation and labeling of liver tumors and blood vessels in CT images can provide convenience for doctors in liver tumor diagnosis and surgical intervention. In the past decades, many state-of-the-art medical image segmentation algorithms appeared during this period. With the development of embedded devices, embedded deployment for medical segmentation and automatic reconstruction brings prospects for future automated surgical tasks. Yet, most of the existing segmentation methods mostly care about the spatial feature context and have a perception defect in the semantic relevance of medical images, which significantly affects the segmentation accuracy of liver tumors and blood vessels. Deploying large and complex models into embedded devices requires a reasonable trade-off between model accuracy, reasoning speed and model capacity. Given these problems, we introduce a multi-scale feature fusion network called TransFusionNet based on Transformer. This network achieved very competitive performance for liver vessel and liver tumor segmentation tasks, meanwhile it can improve the recognition of morphologic margins of liver tumors by exploiting the global information of CT images. Experiments show that in vessel segmentation task TransFusionNet achieved mean Dice coefficients of 0.899 and in liver tumor segmentation task TransFusionNet achieved mean Dice coefficients of 0.961. Compared with the state-of-the-art framework, our model achieves the best segmentation result. In addition, we deployed the model into an embedded micro-structure and constructed an integrated model for liver tumor vascular segmentation and reconstruction. This proprietary structure will be the exclusive component of the future medical field.

preprint2022arXiv

Hot 2DHG states in tellurium

Element semiconductor Te is very popular in both fundamental electronic structure study, and device fabrication research area due to its unique band structure. Specifically, in low temperatures, Te possesses strong quantum oscillations with magnetic field applied in basal plane, either following Shubnikov-de Haas (SdH) oscillation rule or following log-periodic oscillation rule. With magnetic field applied along the [001] direction, the SdH oscillations are attributed to the two-dimensional hole gas (2DHG) surface states. Here we reported an interesting SdH oscillation in Te-based single crystals, with the magnetic field applied along the [001] direction of the crystals, showing the maximum oscillation intensity at ~ 75 K, and still traceable at 200 K, which indicates a rather hot 2DHG state. The nontrivial Berry phase can be also obtained from the oscillations, implying the contribution from topological states. More importantly, the high temperature SdH oscillation phenomena are observed in different Te single crystals samples, and Te single crystals with nonmagnetic/magnetic dopants, showing robustness to bulk defects. Therefore, the oscillation may be contributed by the bulk symmetry protected hot 2DHG states, which will offer a new platform for high-temperature quantum transport studies.

preprint2022arXiv

Marginal Regression on Transient State Occupation Probabilities with Clustered Multistate Process Data

Clustered multistate process data are commonly encountered in multicenter observational studies and clinical trials. A clinically important estimand with such data is the marginal probability of being in a particular transient state as a function of time. However, there is currently no method for nonparametric marginal regression analysis of these probabilities with clustered multistate process data. To address this problem, we propose a weighted functional generalized estimating equations approach which does not impose Markov assumptions or assumptions regarding the structure of the within-cluster dependence, and allows for informative cluster size (ICS). The asymptotic properties of the proposed estimators for the functional regression coefficients are rigorously established and a nonparametric hypothesis testing procedure for covariate effects is proposed. Simulation studies show that the proposed method performs well even with a small number of clusters, and that ignoring the within-cluster dependence and the ICS leads to invalid inferences. The proposed method is used to analyze data from a multicenter clinical trial on recurrent or metastatic squamous-cell carcinoma of the head and neck with a stratified randomization design.

preprint2022arXiv

Mix-up Self-Supervised Learning for Contrast-agnostic Applications

Contrastive self-supervised learning has attracted significant research attention recently. It learns effective visual representations from unlabeled data by embedding augmented views of the same image close to each other while pushing away embeddings of different images. Despite its great success on ImageNet classification, COCO object detection, etc., its performance degrades on contrast-agnostic applications, e.g., medical image classification, where all images are visually similar to each other. This creates difficulties in optimizing the embedding space as the distance between images is rather small. To solve this issue, we present the first mix-up self-supervised learning framework for contrast-agnostic applications. We address the low variance across images based on cross-domain mix-up and build the pretext task based on two synergistic objectives: image reconstruction and transparency prediction. Experimental results on two benchmark datasets validate the effectiveness of our method, where an improvement of 2.5% ~ 7.4% in top-1 accuracy was obtained compared to existing self-supervised learning methods.

preprint2022arXiv

Multi-grained Label Refinement Network with Dependency Structures for Joint Intent Detection and Slot Filling

Slot filling and intent detection are two fundamental tasks in the field of natural language understanding. Due to the strong correlation between these two tasks, previous studies make efforts on modeling them with multi-task learning or designing feature interaction modules to improve the performance of each task. However, none of the existing approaches consider the relevance between the structural information of sentences and the label semantics of two tasks. The intent and semantic components of a utterance are dependent on the syntactic elements of a sentence. In this paper, we investigate a multi-grained label refinement network, which utilizes dependency structures and label semantic embeddings. Considering to enhance syntactic representations, we introduce the dependency structures of sentences into our model by graph attention layer. To capture the semantic dependency between the syntactic information and task labels, we combine the task specific features with corresponding label embeddings by attention mechanism. The experimental results demonstrate that our model achieves the competitive performance on two public datasets.

preprint2022arXiv

Probabilistic Models for Manufacturing Lead Times

In this study, we utilize Gaussian processes, probabilistic neural network, natural gradient boosting, and quantile regression augmented gradient boosting to model lead times of laser manufacturing processes. We introduce probabilistic modelling in the domain and compare the models in terms of different abilities. While providing a comparison between the models in real-life data, our work has many use cases and substantial business value. Our results indicate that all of the models beat the company estimation benchmark that uses domain experience and have good calibration with the empirical frequencies.

preprint2022arXiv

Reinforcement Learning Based Query Vertex Ordering Model for Subgraph Matching

Subgraph matching is a fundamental problem in various fields that use graph structured data. Subgraph matching algorithms enumerate all isomorphic embeddings of a query graph q in a data graph G. An important branch of matching algorithms exploit the backtracking search approach which recursively extends intermediate results following a matching order of query vertices. It has been shown that the matching order plays a critical role in time efficiency of these backtracking based subgraph matching algorithms. In recent years, many advanced techniques for query vertex ordering (i.e., matching order generation) have been proposed to reduce the unpromising intermediate results according to the preset heuristic rules. In this paper, for the first time we apply the Reinforcement Learning (RL) and Graph Neural Networks (GNNs) techniques to generate the high-quality matching order for subgraph matching algorithms. Instead of using the fixed heuristics to generate the matching order, our model could capture and make full use of the graph information, and thus determine the query vertex order with the adaptive learning-based rule that could significantly reduces the number of redundant enumerations. With the help of the reinforcement learning framework, our model is able to consider the long-term benefits rather than only consider the local information at current ordering step.Extensive experiments on six real-life data graphs demonstrate that our proposed matching order generation technique could reduce up to two orders of magnitude of query processing time compared to the state-of-the-art algorithms.

preprint2022arXiv

Semi-supervised segmentation of tooth from 3D Scanned Dental Arches

Teeth segmentation is an important topic in dental restorations that is essential for crown generation, diagnosis, and treatment planning. In the dental field, the variability of input data is high and there are no publicly available 3D dental arch datasets. Although there has been improvement in the field provided by recent deep learning architectures on 3D data, there still exists some problems such as properly identifying missing teeth in an arch. We propose to use spectral clustering as a self-supervisory signal to joint-train neural networks for segmentation of 3D arches. Our approach is motivated by the observation that K-means clustering provides cues to capture margin lines related to human perception. The main idea is to automatically generate training data by decomposing unlabeled 3D arches into segments relying solely on geometric information. The network is then trained using a joint loss that combines a supervised loss of annotated input and a self-supervised loss of non-labeled input. Our collected data has a variety of arches including arches with missing teeth. Our experimental results show improvement over the fully supervised state-of-the-art MeshSegNet when using semi-supervised learning. Finally, we contribute code and a dataset.

preprint2022arXiv

Sparse Representations of Solutions to a class of Random Boundary Value Problems

We introduce certain sparse representation methods, named as stochastic pre-orthogonal adaptive Fourier decomposition 1 and 2 (SPOAFD1 and SPOAFD2) to solve the Dirichlet boundary value problem and the Cauchy initial value problem of random data. To solve the stochastic boundary value problems the sparse representation is, as the initial step, applied to the random boundary data. Due to the semigroup property of the Poisson and the heat kernel, each entry of the expanding series can be lifted up to compose a solution of the Dirichlet and the Cauchy initial value problem, respectively. The sparse representation gives rise to analytic as well as numerical solutions to the problems with high efficiency.

preprint2022arXiv

Testing for Treatment Effect Twice Using Internal and External Controls in Clinical Trials

Leveraging external controls -- relevant individual patient data under control from external trials or real-world data -- has the potential to reduce the cost of randomized controlled trials (RCTs) while increasing the proportion of trial patients given access to novel treatments. However, due to lack of randomization, RCT patients and external controls may differ with respect to covariates that may or may not have been measured. Hence, after controlling for measured covariates, for instance by matching, testing for treatment effect using external controls may still be subject to unmeasured biases. In this paper, we propose a sensitivity analysis approach to quantify the magnitude of unmeasured bias that would be needed to alter the study conclusion that presumed no unmeasured biases are introduced by employing external controls. Whether leveraging external controls increases power or not depends on the interplay between sample sizes and the magnitude of treatment effect and unmeasured biases, which may be difficult to anticipate. This motivates a combined testing procedure that performs two highly correlated analyses, one with and one without external controls, with a small correction for multiple testing using the joint distribution of the two test statistics. The combined test provides a new method of sensitivity analysis designed for data fusion problems, which anchors at the unbiased analysis based on RCT only and spends a small proportion of the type I error to also test using the external controls. In this way, if leveraging external controls increases power, the power gain compared to the analysis based on RCT only can be substantial; if not, the power loss is small. The proposed method is evaluated in theory and power calculations, and applied to a real trial.

preprint2022arXiv

The Hadron-Quark Crossover in Neutron Star within Gaussian Process Regression Method

The equations of state of the neutron star at the hadron-quark crossover region are interpolated with the Gaussian process regression (GPR) method, which can reduce the randomness of present interpolation schemes. The relativistic mean-field (RMF) model and Nambu-Jona-Lasinio (NJL) model are employed to describe the hadronic phase and quark phase, respectively. In the RMF model, the coupling term between $ω$ and $ρ$ mesons is considered to control the density-dependent behaviors of symmetry energy, i.e. the slope of symmetry energy, $L$. Furthermore, the vector interaction between quarks is included in the NJL model to obtain the additional repulsive contributions. Their coupling strengths and the crossover windows are discussed in the present framework under the constraints on the neutron star from gravitational wave detections, massive neutron star measurements, mass-radius simultaneous observation of NICER collaboration, and the neutron skin thickness of $^{208}$Pb from PREX-II. It is found that the slope of symmetry energy, $L$ should be around $50-90$ MeV and the crossover window is $(0.3,~0.6)~\rm fm^{-3}$ with these observables. Furthermore, the uncertainties of neutron star masses and radii in the hadron-quark crossover regions are also predicted by the GPR method.

preprint2022arXiv

The hyperonic star in relativistic mean-field model

The neutron star as a supernova remnant is attracting high attention recently due to the gravitation wave detection and precise measurements about its mass and radius. In the inner core region of the neutron star, the strangeness degrees of freedom, such as the hyperons, can be present, which is also named as a hyperonic star. In this work, the neutron star consisting of nucleons and leptons, and the hyperonic star including the hyperons will be reviewed in the framework of the relativistic mean-field (RMF) model. The popular non-linear and density-dependent RMF parametrizations in the market will be adopted to investigate the role of strangeness baryons in a hyperonic star on its mass, radius, tidal deformability, and other properties. Finally, the magnitudes of the coupling strengths between mesons and hyperons also will be discussed, which can generate the massive hyperonic star with present RMF parameter sets, when the vector coupling constants are strong.

preprint2022arXiv

The lineshape of the compact fully heavy tetraquark

Hadrons and their distributions are the most direct observables in experiment, which would shed light on the non-perturbative mystery of quantum chromodynamics (QCD). As the result, any new hadron will challenge our current knowlege on the one hand, and provide additional inputs on the other hand. The fully heavy $cc\bar{c}\bar{c}$ system observed by LHCb recently opens a new era for hadron physics. We first extract the internal structure of the fully heavy tetraquarks directly from the experimental data, within the compact tetraquark picture. By fitting to the di-$J/ψ$ lineshape, we find that the $X(6900)$ is only cusp effect from the $J/ψψ(3770)$ channel. In addition, there is also a cusp slightly below $6.8~\mathrm{GeV}$ stemming from the $J/ψψ^\prime$ channel. The two $0^{++}$ tetraquarks behave as two resonances above the di-$η_c$ and di-$J/ψ$ threshold, respectively. The $2^{++}$ state is a bound state below the di-$J/ψ$ threshold. Furthermore, we find that the $X_{0^{++}}(6035)$ shows a significant structure in the di-$η_c$ lineshape even after the coupled channel effect. This is an unique feature which can distinguish compact $cc\bar{c}\bar{c}$ tetraquark from the loosely hadronic molecules.

preprint2022arXiv

The Minimal Flavor Structure from Decomposition of the Fermion Mass Matrix

The minimal flavor structures for both quarks and leptons are proposed to address fermion mass hierarchy and flavor mixings by bi-unitary decomposition of the fermion mass matrix. The real matrix ${\bf M}_0^f$ is completely responsive to family mass hierarchy, which is expressed by a close-to-flat matrix structure. The left-handed unitary phase ${\bf F}_L^f$ provides the origin of CP violation in quark and lepton mixings, which can be explained as a quantum effect between Yukawa interaction states and weak gauge states. The minimal flavor structure is realized by just 10 parameters without any redundancy, corresponding to 6 fermion masses, 3 mixing angles and 1 CP violation in the quark/lepton sector. This approach provides a general flavor structure independent of the specific quark or lepton flavor data. We verify the validation of the flavor structure by reproducing quark/lepton masses and mixings. Some possible scenarios that yield the flavor structure are also discussed.

preprint2022arXiv

The nuclear symmetry energy from relativistic Brueckner-Hartree-Fock model

The microscopic mechanisms of the symmetry energy in nuclear matter are investigated in the framework of the relativistic Brueckner-Hartree-Fock (RBHF) model with a high-precision realistic nuclear potential, pvCDBonn A. The kinetic energy and potential contributions to symmetry energy are decomposed. They are explicitly expressed by the nucleon self-energies, which are obtained through projecting the $G$-matrices from the RBHF model into the terms of Lorentz covariants. The nuclear medium effects on the nucleon self-energy and nucleon-nucleon interaction in symmetry energy are discussed by comparing the results from the RBHF model and those from Hartree-Fock and relativistic Hartree-Fock models. It is found that the nucleon self-energy including the nuclear medium effect on the single-nucleon wave function provides a largely positive contribution to the symmetry energy, while {the nuclear medium effect on the nucleon-nucleon interaction, i.e., the effective $G$-matrices generates the negative contribution}. The tensor force plays an essential role in the symmetry energy around the density. The scalar and vector covariant amplitudes of nucleon-nucleon interaction dominate the potential component of the symmetry energy. Furthermore, the isoscalar and isovector terms in the optical potential are extracted from the RBHF model. The isoscalar part is consistent with the results from the analysis of global optical potential, while the isovector one has obvious differences at higher incident energy due to the relativistic effect.

preprint2022arXiv

Towards Higher-order Topological Consistency for Unsupervised Network Alignment

Network alignment task, which aims to identify corresponding nodes in different networks, is of great significance for many subsequent applications. Without the need for labeled anchor links, unsupervised alignment methods have been attracting more and more attention. However, the topological consistency assumptions defined by existing methods are generally low-order and less accurate because only the edge-indiscriminative topological pattern is considered, which is especially risky in an unsupervised setting. To reposition the focus of the alignment process from low-order to higher-order topological consistency, in this paper, we propose a fully unsupervised network alignment framework named HTC. The proposed higher-order topological consistency is formulated based on edge orbits, which is merged into the information aggregation process of a graph convolutional network so that the alignment consistencies are transformed into the similarity of node embeddings. Furthermore, the encoder is trained to be multi-orbit-aware and then be refined to identify more trusted anchor links. Node correspondence is comprehensively evaluated by integrating all different orders of consistency. {In addition to sound theoretical analysis, the superiority of the proposed method is also empirically demonstrated through extensive experimental evaluation. On three pairs of real-world datasets and two pairs of synthetic datasets, our HTC consistently outperforms a wide variety of unsupervised and supervised methods with the least or comparable time consumption. It also exhibits robustness to structural noise as a result of our multi-orbit-aware training mechanism.

preprint2022arXiv

Towards Real-Time Counting Shortest Cycles on Dynamic Graphs: A Hub Labeling Approach

With the ever-increasing prevalence of graph data in a wide spectrum of applications, it becomes essential to analyze structural trends in dynamic graphs on a continual basis. The shortest cycle is a fundamental pattern in graph analytics. In this paper, we investigate the problem of shortest cycle counting for a given vertex in dynamic graphs in light of its applicability to problems such as fraud detection. To address such queries efficiently, we propose a 2-hop labeling based algorithm called Counting Shortest Cycle (CSC for short). Additionally, techniques for dynamically updating the CSC index are explored. Comprehensive experiments are conducted to demonstrate the efficiency and effectiveness of our method. In particular, CSC enables query evaluation in a few hundreds of microseconds for graphs with millions of edges, and improves query efficiency by two orders of magnitude when compared to the baseline solutions. Also, the update algorithm could efficiently cope with edge insertions (deletions).

preprint2022arXiv

Traffic Context Aware Data Augmentation for Rare Object Detection in Autonomous Driving

Detection of rare objects (e.g., traffic cones, traffic barrels and traffic warning triangles) is an important perception task to improve the safety of autonomous driving. Training of such models typically requires a large number of annotated data which is expensive and time consuming to obtain. To address the above problem, an emerging approach is to apply data augmentation to automatically generate cost-free training samples. In this work, we propose a systematic study on simple Copy-Paste data augmentation for rare object detection in autonomous driving. Specifically, local adaptive instance-level image transformation is introduced to generate realistic rare object masks from source domain to the target domain. Moreover, traffic scene context is utilized to guide the placement of masks of rare objects. To this end, our data augmentation generates training data with high quality and realistic characteristics by leveraging both local and global consistency. In addition, we build a new dataset, Rare Object Dataset (ROD), consisting 10k training images, 4k validation images and the corresponding labels with a diverse range of scenarios in autonomous driving. Experiments on ROD show that our method achieves promising results on rare object detection. We also present a thorough study to illustrate the effectiveness of our local-adaptive and global constraints based Copy-Paste data augmentation for rare object detection. The data, development kit and more information of ROD are available online at: \url{https://nullmax-vision.github.io}.

preprint2021arXiv

A Unified Yukawa Interaction for the Standard Model of Quarks and Leptons

To address fermion mass hierarchy and flavor mixings in the quark and lepton sectors, a minimal flavor structure without any redundant parameters beyond phenomenological observables is proposed via decomposition of the Standard Model Yukawa mass matrix into a bi-unitary form. After reviewing the roles and parameterization of the factorized matrix ${\bf M}_0^f$ and ${\bf F}_L^f$ in fermion masses and mixings, we generalize the mechanism to up- and down-type fermions to unify them into a universal quark/lepton Yukawa interaction. In the same way, a unified form of the description of the quark and lepton Yukawa interactions is also proposed, which shows a similar picture as the unification of gauge interactions.

preprint2021arXiv

Adaptive Load Shedding for Grid Emergency Control via Deep Reinforcement Learning

Emergency control, typically such as under-voltage load shedding (UVLS), is broadly used to grapple with low voltage and voltage instability issues in practical power systems under contingencies. However, existing emergency control schemes are rule-based and cannot be adaptively applied to uncertain and floating operating conditions. This paper proposes an adaptive UVLS algorithm for emergency control via deep reinforcement learning (DRL) and expert systems. We first construct dynamic components for picturing the power system operation as the environment. The transient voltage recovery criteria, which poses time-varying requirements to UVLS, is integrated into the states and reward function to advise the learning of deep neural networks. The proposed approach has no tuning issue of coefficients in reward functions, and this issue was regarded as a deficiency in the existing DRL-based algorithms. Extensive case studies illustrate that the proposed method outperforms the traditional UVLS relay in both the timeliness and efficacy for emergency control.

preprint2021arXiv

Consensus-Aware Visual-Semantic Embedding for Image-Text Matching

Image-text matching plays a central role in bridging vision and language. Most existing approaches only rely on the image-text instance pair to learn their representations, thereby exploiting their matching relationships and making the corresponding alignments. Such approaches only exploit the superficial associations contained in the instance pairwise data, with no consideration of any external commonsense knowledge, which may hinder their capabilities to reason the higher-level relationships between image and text. In this paper, we propose a Consensus-aware Visual-Semantic Embedding (CVSE) model to incorporate the consensus information, namely the commonsense knowledge shared between both modalities, into image-text matching. Specifically, the consensus information is exploited by computing the statistical co-occurrence correlations between the semantic concepts from the image captioning corpus and deploying the constructed concept correlation graph to yield the consensus-aware concept (CAC) representations. Afterwards, CVSE learns the associations and alignments between image and text based on the exploited consensus as well as the instance-level representations for both modalities. Extensive experiments conducted on two public datasets verify that the exploited consensus makes significant contributions to constructing more meaningful visual-semantic embeddings, with the superior performances over the state-of-the-art approaches on the bidirectional image and text retrieval task. Our code of this paper is available at: https://github.com/BruceW91/CVSE.

preprint2021arXiv

Data-Driven Vulnerability Detection and Repair in Java Code

Java platform provides various APIs to facilitate secure coding. However, correctly using security APIs is usually challenging for developers who lack cybersecurity training. Prior work shows that many developers misuse security APIs; such misuses can introduce vulnerabilities into software, void security protections, and present security exploits to hackers. To eliminate such API-related vulnerabilities, this paper presents SEADER -- our new approach that detects and repairs security API misuses. Given an exemplar, insecure code snippet, and its secure counterpart, SEADER compares the snippets and conducts data dependence analysis to infer the security API misuse templates and corresponding fixing operations. Based on the inferred information, given a program, SEADER performs inter-procedural static analysis to search for any security API misuse and to propose customized fixing suggestions for those vulnerabilities. To evaluate SEADER, we applied it to 25 <insecure, secure> code pairs, and SEADER successfully inferred 18 unique API misuse templates and related fixes. With these vulnerability repair patterns, we further applied SEADER to 10 open-source projects that contain in total 32 known vulnerabilities. Our experiment shows that SEADER detected vulnerabilities with 100% precision, 84% recall, and 91% accuracy. Additionally, we applied SEADER to 100 Apache open-source projects and detected 988 vulnerabilities; SEADER always customized repair suggestions correctly. Based on SEADER&#39;s outputs, we filed 60 pull requests. Up till now, developers of 18 projects have offered positive feedbacks on SEADER&#39;s suggestions. Our results indicate that SEADER can effectively help developers detect and fix security API misuses. Whereas prior work either detects API misuses or suggests simple fixes, SEADER is the first tool to do both for nontrivial vulnerability repairs.

preprint2021arXiv

Detected the steerability bounds of the generalized Werner states via BackPropagation neural network

We use error BackPropagation (BP) neural network to determine whether an arbitrary two-qubit quantum state is steerable and optimize the steerability bounds of the generalized Werner state. The results show that no matter how we choose the features for the quantum states, we can use the BP neural network to construct several models to realize high-performance quantum steering classifiers compared with the support vector machine (SVM). In addition, we predict the steerability bounds of the generalized Werner states by using the classifiers which are newly constructed by the BP neural network, that is, the predicted steerability bounds are closer to the theoretical bounds. In particular, high-performance classifiers with partial information of the quantum states which we only need to measure in three fixed measurement directions are obtained.

preprint2021arXiv

On stochastic gradient Langevin dynamics with dependent data streams: the fully non-convex case

We consider the problem of sampling from a target distribution, which is \emph {not necessarily logconcave}, in the context of empirical risk minimization and stochastic optimization as presented in Raginsky et al. (2017). Non-asymptotic analysis results are established in the $L^1$-Wasserstein distance for the behaviour of Stochastic Gradient Langevin Dynamics (SGLD) algorithms. We allow the estimation of gradients to be performed even in the presence of \emph{dependent} data streams. Our convergence estimates are sharper and \emph{uniform} in the number of iterations, in contrast to those in previous studies.

preprint2021arXiv

Quantifying measurement-induced nonbilocal correlation

In the paper, we devote to defining an available measure to quantify the nonbilocal correlation in the entanglement-swapping experiment. Then we obtain analytical formulas to calculate the quantifier when the inputs are pure states. For the case of mixed inputs, we discuss the computational properties of the quantifier. Finally, we derive a tight upper bound to the nonbilocality quantifier.

preprint2021arXiv

Similarity Reasoning and Filtration for Image-Text Matching

Image-text matching plays a critical role in bridging the vision and language, and great progress has been made by exploiting the global alignment between image and sentence, or local alignments between regions and words. However, how to make the most of these alignments to infer more accurate matching scores is still underexplored. In this paper, we propose a novel Similarity Graph Reasoning and Attention Filtration (SGRAF) network for image-text matching. Specifically, the vector-based similarity representations are firstly learned to characterize the local and global alignments in a more comprehensive manner, and then the Similarity Graph Reasoning (SGR) module relying on one graph convolutional neural network is introduced to infer relation-aware similarities with both the local and global alignments. The Similarity Attention Filtration (SAF) module is further developed to integrate these alignments effectively by selectively attending on the significant and representative alignments and meanwhile casting aside the interferences of non-meaningful alignments. We demonstrate the superiority of the proposed method with achieving state-of-the-art performances on the Flickr30K and MSCOCO datasets, and the good interpretability of SGR and SAF modules with extensive qualitative experiments and analyses.

preprint2021arXiv

The $ΞN$ interaction constrained by recent $Ξ^-$ hypernuclei experiments

The $ΞN$ interaction is investigated in the quark mean-field (QMF) model based on recent observables of the $Ξ^-+^{14}\rm{N}$ ($_{Ξ^-}^{15}\rm{C}$) system. The experimental data about the binding energy of $1p$-state $Ξ^-$ hyperon in $_{Ξ^-}^{15}\rm{C}$ hypernuclei at KISO, IBUKI, E07-T011, E176-14-03-35 events are conflated as $B_{Ξ^-}(1p)=1.14\pm0.11$ MeV. With this constraint, the coupling strengths between the vector meson and $Ξ$ hyperon are fixed in three QMF parameter sets. Meanwhile, the $Ξ^-$ binding energy of $1s$ state in $_{Ξ^-}^{15}\rm{C}$ is predicted as $B_{Ξ^-}(1s)=5.66\pm0.38$ MeV with the same interactions, which are completely consistent with the data from the KINKA and IRRAWADDY events. Finally, the single $ΞN$ potential is calculated in the symmetric nuclear matter in the framework of QMF models. It is $U_{ΞN}=-11.96\pm 0.85$ MeV at nuclear saturation density, which will contribute to the study on the strangeness degree of freedom in compact star.

preprint2020arXiv

A fully data-driven approach to minimizing CVaR for portfolio of assets via SGLD with discontinuous updating

A new approach in stochastic optimization via the use of stochastic gradient Langevin dynamics (SGLD) algorithms, which is a variant of stochastic gradient decent (SGD) methods, allows us to efficiently approximate global minimizers of possibly complicated, high-dimensional landscapes. With this in mind, we extend here the non-asymptotic analysis of SGLD to the case of discontinuous stochastic gradients. We are thus able to provide theoretical guarantees for the algorithm&#39;s convergence in (standard) Wasserstein distances for both convex and non-convex objective functions. We also provide explicit upper estimates of the expected excess risk associated with the approximation of global minimizers of these objective functions. All these findings allow us to devise and present a fully data-driven approach for the optimal allocation of weights for the minimization of CVaR of portfolio of assets with complete theoretical guarantees for its performance. Numerical results illustrate our main findings.

preprint2020arXiv

A Unified Approach to Scalable Spectral Sparsification of Directed Graphs

Recent spectral graph sparsification research allows constructing nearly-linear-sized subgraphs that can well preserve the spectral (structural) properties of the original graph, such as the first few eigenvalues and eigenvectors of the graph Laplacian, leading to the development of a variety of nearly-linear time numerical and graph algorithms. However, there is not a unified approach that allows for truly-scalable spectral sparsification of both directed and undirected graphs. In this work, we prove the existence of linear-sized spectral sparsifiers for general directed graphs and introduce a practically-efficient and unified spectral graph sparsification approach that allows sparsifying real-world, large-scale directed and undirected graphs with guaranteed preservation of the original graph spectra. By exploiting a highly-scalable (nearly-linear complexity) spectral matrix perturbation analysis framework for constructing nearly-linear sized (directed) subgraphs, it enables us to well preserve the key eigenvalues and eigenvectors of the original (directed) graph Laplacians. The proposed method has been validated using various kinds of directed graphs obtained from public domain sparse matrix collections, showing promising results for solving directed graph Laplacians, spectral embedding, and partitioning of general directed graphs, as well as approximately computing (personalized) PageRank vectors.

preprint2020arXiv

AOT: Pushing the Efficiency Boundary of Main-memory Triangle Listing

Triangle listing is an important topic significant in many practical applications. Efficient algorithms exist for the task of triangle listing. Recent algorithms leverage an orientation framework, which can be thought of as mapping an undirected graph to a directed acylic graph, namely oriented graph, with respect to any global vertex order. In this paper, we propose an adaptive orientation technique that satisfies the orientation technique but refines it by traversing carefully based on the out-degree of the vertices in the oriented graph during the computation of triangles. Based on this adaptive orientation technique, we design a new algorithm, namely aot, to enhance the edge-iterator listing paradigm. We also make improvements to the performance of aot by exploiting the local order within the adjacent list of the vertices. We show that aot is the first work which can achieve best performance in terms of both practical performance and theoretical time complexity. Our comprehensive experiments over $16$ real-life large graphs show a superior performance of our \aot algorithm when compared against the state-of-the-art, especially for massive graphs with billions of edges. Theoretically, we show that our proposed algorithm has a time complexity of $Θ(\sum_{ \langle u,v \rangle \in \vec{E} } \min\{ deg^{+}(u),deg^{+}(v)\}))$, where $\vec{E}$ and $deg^{+}(x)$ denote the set of directed edges in an oriented graph and the out-degree of vertex $x$ respectively. As to our best knowledge, this is the best time complexity among in-memory triangle listing algorithms.

preprint2020arXiv

Attack Identification and Correction for PMU GPS Spoofing in Unbalanced Distribution Systems

Due to the vulnerability of civilian global positioning system (GPS) signals, the accuracy of phasor measurement units (PMUs) can be greatly compromised by GPS spoofing attacks (GSAs), which introduce phase shifts into true phase angle measurements. Focusing on simultaneous GSAs for multiple PMU locations, this paper proposes a novel identification and correction algorithm in distribution systems. A sensitivity analysis of state estimation residuals on a single GSA phase angle is firstly implemented. An identification algorithm using a probing technique is proposed to determine the locations of spoofed PMUs and the ranges of GSA phase shifts. Based on the identification results, these GSA phase shifts are determined via an estimation algorithm that minimizes the mismatch between measurements and system states. Further, with the attacked PMU data corrected, the system states are recovered. Simulations in unbalanced IEEE 34-bus and 123-bus distribution systems demonstrates the efficiency and accuracy of the proposed method.

preprint2020arXiv

Deep Reinforcement Learning Based Volt-VAR Optimization in Smart Distribution Systems

This paper develops a model-free volt-VAR optimization (VVO) algorithm via multi-agent deep reinforcement learning (MADRL) in unbalanced distribution systems. This method is novel since we cast the VVO problem in unbalanced distribution networks to an intelligent deep Q-network (DQN) framework, which avoids solving a specific optimization model directly when facing time-varying operating conditions of the systems. We consider statuses/ratios of switchable capacitors, voltage regulators, and smart inverters installed at distributed generators as the action variables of the DQN agents. A delicately designed reward function guides these agents to interact with the distribution system, in the direction of reinforcing voltage regulation and power loss reduction simultaneously. The forward-backward sweep method for radial three-phase distribution systems provides accurate power flow results within a few iterations to the DQN environment. Finally, the proposed multi-objective MADRL method realizes the dual goals for VVO. We test this algorithm on the unbalanced IEEE 13-bus and 123-bus systems. Numerical simulations validate the excellent performance of this method in voltage regulation and power loss reduction.

preprint2020arXiv

Effects of symmetry energy on the radius and tidal deformability of neutron stars in relativistic mean-field model

The radii and tidal deformabilities of neutron stars are investigated in the framework of relativistic mean-field (RMF) model with different density-dependent behaviors of symmetry energy. To study the effects of symmetry energy on the properties of neutron stars, an $ω$ meson and $ρ$ meson coupling term is included in a popular RMF Lagrangian, i.e. the TM1 parameter set, which is used for the widely used supernova equation of state (EoS) table. The coupling constants relevant to the vector-isovector meson, $ρ$, are refitted by a fixed symmetry energy at subsaturation density and its slope at saturation density, while other coupling constants remain the same as the original ones in TM1 so as to update the supernova EoS table. The radius and mass of maximum neutron stars are not so sensitive to the symmetry energy in these family TM1 parameterizations. However, the radii at intermediate mass region are strongly correlated with the slope of symmetry energy. Furthermore, the dimensionless tidal deformabilities of neutron stars are also calculated within the associated Love number. We find that its value at $1.4 M_\odot$ has a linear correlation to the slope of symmetry energy being different from the previous studied. With the latest constraints of tidal deformabilities from GW170817 event, the slope of symmetry energy at nuclear saturation density should be smaller than $60$ MeV in the family TM1 parameterizations. This fact supports the usage of lower symmetry energy slope for the update supernova EoS, which is applicable to simulations of neutron star merger. Furthermore, the analogous analysis are also done within the family IUFSU parameter sets. It is found that the correlations between the symmetry energy slope with the radius and tidal deformability at $1.4 M_\odot$ have very similar linear relations in these RMF models.

preprint2020arXiv

Efficient Bitruss Decomposition for Large-scale Bipartite Graphs

Cohesive subgraph mining in bipartite graphs becomes a popular research topic recently. An important structure k-bitruss is the maximal cohesive subgraph where each edge is contained in at least k butterflies (i.e., (2, 2)-bicliques). In this paper, we study the bitruss decomposition problem which aims to find all the k-bitrusses for k >= 0. The existing bottom-up techniques need to iteratively peel the edges with the lowest butterfly support. In this peeling process, these techniques are time-consuming to enumerate all the supporting butterflies for each edge. To relax this issue, we first propose a novel online index -- the BE-Index which compresses butterflies into k-blooms (i.e., (2, k)-bicliques). Based on the BE-Index, the new bitruss decomposition algorithm BiT-BU is proposed, along with two batch-based optimizations, to accomplish the butterfly enumeration of the peeling process in an efficient way. Furthermore, the BiT-PC algorithm is devised which is more efficient against handling the edges with high butterfly supports. We theoretically show that our new algorithms significantly reduce the time complexities of the existing algorithms. Also, we conduct extensive experiments on real datasets and the result demonstrates that our new techniques can speed up the state-of-the-art techniques by up to two orders of magnitude.

preprint2020arXiv

Efficient Matrix Factorization on Heterogeneous CPU-GPU Systems

Matrix Factorization (MF) has been widely applied in machine learning and data mining. A large number of algorithms have been studied to factorize matrices. Among them, stochastic gradient descent (SGD) is a commonly used method. Heterogeneous systems with multi-core CPUs and GPUs have become more and more promising recently due to the prevalence of GPUs in general-purpose data-parallel applications. Due to the large computational cost of MF, we aim to improve the efficiency of SGD-based MF computation by utilizing the massive parallel processing power of heterogeneous multiprocessors. The main challenge in parallel SGD algorithms on heterogeneous CPU-GPU systems lies in the granularity of the matrix division and the strategy to assign tasks. We design a novel strategy to divide the matrix into a set of blocks by considering two aspects. First, we observe that the matrix should be divided nonuniformly, and relatively large blocks should be assigned to GPUs to saturate the computing power of GPUs. In addition to exploiting the characteristics of hardware, the workloads assigned to two types of hardware should be balanced. Aiming at the final division strategy, we design a cost model tailored for our problem to accurately estimate the performance of hardware on different data sizes. A dynamic scheduling policy is also used to further balance workloads in practice. Extensive experiments show that our proposed algorithm achieves high efficiency with a high quality of training quality.

preprint2020arXiv

Exploring Cohesive Subgraphs with Vertex Engagement and Tie Strength in Bipartite Graphs

We propose a novel cohesive subgraph model called $τ$-strengthened $(α,β)$-core (denoted as $(α,β)_τ$-core), which is the first to consider both tie strength and vertex engagement on bipartite graphs. An edge is a strong tie if contained in at least $τ$ butterflies ($2\times2$-bicliques). $(α,β)_τ$-core requires each vertex on the upper or lower level to have at least $α$ or $β$ strong ties, given strength level $τ$. To retrieve the vertices of $(α,β)_τ$-core optimally, we construct index $I_{α,β,τ}$ to store all $(α,β)_τ$-cores. Effective optimization techniques are proposed to improve index construction. To make our idea practical on large graphs, we propose 2D-indexes $I_{α,β}, I_{β,τ}$, and $I_{α,τ}$ that selectively store the vertices of $(α,β)_τ$-core for some $α,β$, and $τ$. The 2D-indexes are more space-efficient and require less construction time, each of which can support $(α,β)_τ$-core queries. As query efficiency depends on input parameters and the choice of 2D-index, we propose a learning-based hybrid computation paradigm by training a feed-forward neural network to predict the optimal choice of 2D-index that minimizes the query time. Extensive experiments show that ($1$) $(α,β)_τ$-core is an effective model capturing unique and important cohesive subgraphs; ($2$) the proposed techniques significantly improve the efficiency of index construction and query processing.

preprint2020arXiv

Graph-based Faulted Line Identification Using Micro-PMU Data in Distribution Systems

Motivated by increasing penetration of distributed generators (DGs) and fast development of micro-phasor measurement units (μPMUs), this paper proposes a novel graph-based faulted line identification algorithm using a limited number of μPMUs in distribution networks. The core of the proposed method is to apply advanced distribution system state estimation (DSSE) techniques integrating μPMU data to the fault location. We propose a distributed DSSE algorithm to efficiently restrict the searching region for the fault source in the feeder between two adjacent μPMUs. Based on the graph model of the feeder in the reduced searching region, we further perform the DSSE in a hierarchical structure and identify the location of the fault source. Also, the proposed approach captures the impact of DGs on distribution system operation and remains robust against high-level noises in measurements. Numerical simulations verify the accuracy and efficiency of the proposed method under various fault scenarios covering multiple fault types and fault impedances.

preprint2020arXiv

Graph3S: A Simple, Speedy and Scalable Distributed Graph Processing System

Graph is a ubiquitous structure in many domains. The rapidly increasing data volume calls for efficient and scalable graph data processing. In recent years, designing distributed graph processing systems has been an increasingly important area to fulfil the demands of processing big graphs in a distributed environment. Though a variety of distributed graph processing systems have been developed, very little attention has been paid to achieving a good combinational system performance in terms of usage simplicity, efficiency and scalability. To contribute to the study of distributed graph processing system, this work tries to fill this gap by designing a simple, speedy and scalable system. Our observation is that enforcing the communication flexibility of a system leads to the gains of both system efficiency and scalability as well as simple usage. We realize our idea in a system Graph3S and conduct extensive experiments with diverse algorithms over big graphs from different domains to test its performance. The results show that, besides simple usage, our system has outstanding performance over various graph algorithms and can even reach up to two orders of magnitude speedup over existing in-memory systems when applying to some algorithms. Also, its scalability is competitive to disk-based systems and even better when less machines are used.

preprint2020arXiv

Interval State Estimation with Uncertainty of Distributed Generation and Line Parameters in Unbalanced Distribution Systems

Distribution system state estimation (DSSE), which provides critical information for system monitoring and control, is being challenged by multiple sources of uncertainties such as random meter errors, stochastic power output of distributed generation (DG), and imprecise network parameters. This paper originally proposes a general interval state estimation (ISE) model to simultaneously formulate these uncertainties in unbalanced distribution systems by interval arithmetic. Moreover, this model can accommodate partially available measurements of DG outputs and inaccurate line parameters. Further, a modified Krawczyk-operator (MKO) algorithm is proposed to solve the general ISE model efficiently, and effectively provides the upper and lower bounds of state variables under coordinated impacts of these uncertainties. The proposed algorithm is tested on unbalanced IEEE 13-bus and 123-bus systems. Comparison with various methods including Monte Carlo simulation indicates that the proposed algorithm is many orders of magnitude faster and encloses tighter boundaries of state variables.

preprint2020arXiv

Machine learning on DNA-encoded libraries: A new paradigm for hit-finding

DNA-encoded small molecule libraries (DELs) have enabled discovery of novel inhibitors for many distinct protein targets of therapeutic value through screening of libraries with up to billions of unique small molecules. We demonstrate a new approach applying machine learning to DEL selection data by identifying active molecules from a large commercial collection and a virtual library of easily synthesizable compounds. We train models using only DEL selection data and apply automated or automatable filters with chemist review restricted to the removal of molecules with potential for instability or reactivity. We validate this approach with a large prospective study (nearly 2000 compounds tested) across three diverse protein targets: sEH (a hydrolase), ERα (a nuclear receptor), and c-KIT (a kinase). The approach is effective, with an overall hit rate of {\sim}30% at 30 {\textmu}M and discovery of potent compounds (IC50 <10 nM) for every target. The model makes useful predictions even for molecules dissimilar to the original DEL and the compounds identified are diverse, predominantly drug-like, and different from known ligands. Collectively, the quality and quantity of DEL selection data; the power of modern machine learning methods; and access to large, inexpensive, commercially-available libraries creates a powerful new approach for hit finding.

preprint2020arXiv

Properties of neutron star described by a relativistic $ab~ initio$ model

Properties of neutron star are investigated by an available relativistic $ab~ initio$ method, i.e., the relativistic Brueckner-Hartree-Fock (RBHF) model, with the latest high-precision relativistic charge-dependent potentials, pvCD-Bonn A, B, C. The neutron star matter is solved within the beta equilibrium and charge neutrality conditions in the framework of RBHF model. Comparing to the conventional treatment, where the chemical potential of lepton was approximately represented by the symmetry energy of nuclear matter, the equation of state (EOS) of neutron star matter in the present self-consistent calculation with pvCD-Bonn B has striking difference above the baryon number density $n_b=0.55$ fm$^{-3}$. However, these differences influence the global properties of neutron star only about $1\%\sim2\%$. Then, three two-body potentials pvCD-Bonn A, B, C, with different tensor components, are systematically applied in RBHF model to calculate the properties of neutron star. It is found that the maximum masses of neutron star are around $2.21\sim2.30M_\odot$ and the corresponding radii are $R =11.18\sim11.72$ km. The radii of $1.4M_\odot$ neutron star are predicated as $R_{1.4} = 12.34\sim12.91$ km and their dimensionless tidal deformabilities are $Λ_{1.4} = 485\sim 626$. Furthermore, the direct URCA process in neutron star cooling will happen from $n_b=0.414\sim0.530$ fm$^{-3}$ with the proton fractions, $Y_p=0.136\sim0.138$. All of the results obtained from RBHF model only with two-body pvCD-Bonn potentials completely satisfy various constraints from recent astronomical observations of massive neutron stars, gravitational wave detection (GW 170817), and mass-radius simultaneous measurement (NICER).

preprint2020arXiv

Semiparametric regression and risk prediction with competing risks data under missing cause of failure

The cause of failure in cohort studies that involve competing risks is frequently incompletely observed. To address this, several methods have been proposed for the semiparametric proportional cause-specific hazards model under a missing at random assumption. However, these proposals provide inference for the regression coefficients only, and do not consider the infinite dimensional parameters, such as the covariate-specific cumulative incidence function. Nevertheless, the latter quantity is essential for risk prediction in modern medicine. In this paper we propose a unified framework for inference about both the regression coefficients of the proportional cause-specific hazards model and the covariate-specific cumulative incidence functions under missing at random cause of failure. Our approach is based on a novel computationally efficient maximum pseudo-partial-likelihood estimation method for the semiparametric proportional cause-specific hazards model. Using modern empirical process theory we derive the asymptotic properties of the proposed estimators for the regression coefficients and the covariate-specific cumulative incidence functions, and provide methodology for constructing simultaneous confidence bands for the latter. Simulation studies show that our estimators perform well even in the presence of a large fraction of missing cause of failures, and that the regression coefficient estimator can be substantially more efficient compared to the previously proposed augmented inverse probability weighting estimator. The method is applied using data from an HIV cohort study and a bladder cancer clinical trial.

preprint2020arXiv

SF-GRASS: Solver-Free Graph Spectral Sparsification

Recent spectral graph sparsification techniques have shown promising performance in accelerating many numerical and graph algorithms, such as iterative methods for solving large sparse matrices, spectral partitioning of undirected graphs, vectorless verification of power/thermal grids, representation learning of large graphs, etc. However, prior spectral graph sparsification methods rely on fast Laplacian matrix solvers that are usually challenging to implement in practice. This work, for the first time, introduces a solver-free approach (SF-GRASS) for spectral graph sparsification by leveraging emerging spectral graph coarsening and graph signal processing (GSP) techniques. We introduce a local spectral embedding scheme for efficiently identifying spectrally-critical edges that are key to preserving graph spectral properties, such as the first few Laplacian eigenvalues and eigenvectors. Since the key kernel functions in SF-GRASS can be efficiently implemented using sparse-matrix-vector-multiplications (SpMVs), the proposed spectral approach is simple to implement and inherently parallel friendly. Our extensive experimental results show that the proposed method can produce a hierarchy of high-quality spectral sparsifiers in nearly-linear time for a variety of real-world, large-scale graphs and circuit networks when compared with the prior state-of-the-art spectral method.

preprint2020arXiv

The common origin of the family mass hierarchy and CP violation from flavour-dependent vacuum for quarks and leptons

We rewrite the Yukawa interactions of the standard model in terms of flavour-dependent vacuum structure to address the family mass hierarchy and CP violation in both the quark sector and lepton sector. It is realized for the first time that the Lagrangian only includes the same number of degrees of freedom as phenomenological observables with no requirement of extra particles or any new symmetry. The quark and lepton mass hierarchy arises as a natural result of the close-to-flat vacuum in flavour space. The CP violation in CKM and PMNS is explained as a general quantum phase between weak gauge eigenstates and Yukawa interaction states. The mechanism is proven by reproduction of all current quark/lepton mass data and the CKM/PMNS flavour mixings.

preprint2020arXiv

The one-pion-exchange potential with contact terms from lattice QCD simulations

The pion-mass-dependent nucleon-nucleon ($NN$) potentials in term of one-pion exchange and contact terms are obtained from the latest lattice QCD simulations of two-nucleon system, which employ the forms of leading order (LO) $NN$ potential from the chiral effective field theory and thus are named as the LO chiral potential in this work. We extract the coefficients of contact terms and cut-off momenta in these potentials, for the first time, by fitting the phase shifts of $^1S_0$ and $^3S_1$ channels generated by the results of HALQCD collaboration with various pion masses from $468.6$ MeV to $1170.9$ MeV. The low-energy constants in the $^1S_0$ and $^3S_1$ channels become weaker and approach each other for larger pion masses. These LO chiral potentials are applied to symmetric nuclear and pure neutron matter within the Brueckner-Hartree-Fock method. At this moment, however, we do not have yet the information of the $P$-wave $NN$ interaction to be provided by the lattice QCD simulations for full description of nuclear matter. Our results will enhance the development of nuclear structure and nuclear matter by controlling the contribution of the pionic effect and illuminate the role of chiral symmetry of the strong interaction in complex system.

preprint2020arXiv

Towards Highly Efficient State Estimation with Nonlinear Measurements in Distribution Systems

This letter proposes a novel and highly efficient distribution system state estimation (DSSE) algorithm with nonlinear measurements from supervisory control and data acquisition (SCADA) systems. Conventional DSSE, i.e., a weighted least square (WLS)-based method, requires multiple Monte Carlo simulations for overall accuracy evaluation and has high calculation cost due to the nonlinear iterative process. The proposed method uses the Taylor series of voltages for constructing a linear DSSE model in the interval form and then solve this model by interval arithmetic. Compared with the nonlinear WLS-based method, the proposed method obtains accurate and robust estimates via a single random sampling of measurements and is computationally efficient. The comparative analysis of the IEEE 34-bus distribution system points to improved estimation results relative to the WLS-based method.

preprint2020arXiv

Understanding and adjusting the selection bias from a proof-of-concept study to a more confirmatory study

It has long been noticed that the efficacy observed in small early phase studies is generally better than that observed in later larger studies. Historically, the inflation of the efficacy results from early proof-of-concept studies is either ignored, or adjusted empirically using a frequentist or Bayesian approach. In this article, we systematically explained the underlying reason for the inflation of efficacy results in small early phase studies from the perspectives of measurement error models and selection bias. A systematic method was built to adjust the early phase study results from both frequentist and Bayesian perspectives. A hierarchical model was proposed to estimate the distribution of the efficacy for a portfolio of compounds, which can serve as the prior distribution for the Bayesian approach. We showed through theory that the systematic adjustment provides an unbiased estimator for the true mean efficacy for a portfolio of compounds. The adjustment was applied to paired data for the efficacy in early small and later larger studies for a set of compounds in diabetes and immunology. After the adjustment, the bias in the early phase small studies seems to be diminished.

preprint2020arXiv

What Can We Learn from the Travelers Data in Detecting Disease Outbreaks -- A Case Study of the COVID-19 Epidemic

Background: Travel is a potent force in the emergence of disease. We discussed how the traveler case reports could aid in a timely detection of a disease outbreak. Methods: Using the traveler data, we estimated a few indicators of the epidemic that affected decision making and policy, including the exponential growth rate, the doubling time, and the probability of severe cases exceeding the hospital capacity, in the initial phase of the COVID-19 epidemic in multiple countries. We imputed the arrival dates when they were missing. We compared the estimates from the traveler data to the ones from domestic data. We quantitatively evaluated the influence of each case report and knowing the arrival date on the estimation. Findings: We estimated the travel origin&#39;s daily exponential growth rate and examined the date from which the growth rate was consistently above 0.1 (equivalent to doubling time < 7 days). We found those dates were very close to the dates that critical decisions were made such as city lock-downs and national emergency announcement. Using only the traveler data, if the assumed epidemic start date was relatively accurate and the traveler sample was representative of the general population, the growth rate estimated from the traveler data was consistent with the domestic data. We also discussed situations that the traveler data could lead to biased estimates. From the data influence study, we found more recent travel cases had a larger influence on each day&#39;s estimate, and the influence of each case report got smaller as more cases became available. We provided the minimum number of exported cases needed to determine whether the local epidemic growth rate was above a certain level, and developed a user-friendly Shiny App to accommodate various scenarios.

preprint2019arXiv

Bayesian truncation errors in equations of state of nuclear matter with chiral nucleon-nucleon potentials

The truncation errors in equations of state (EOSs) of nuclear matter derived from the chiral nucleon-nucleon ($NN$) potentials at different expansion orders are analyzed by a Bayesian model. These EOSs are expanded as functions of a dimensionless parameter, $Q$, which is determined by Fermi momentum, $k_F$ and breakdown scale, $Λ_b$. The degree-of-belief (DoB) intervals predicted by the chiral effective field theory are calculated within the corresponding expansion coefficients and the specific prior probability distribution functions in terms of Bayes theorem. The truncation errors of EOSs, generated by the DoB intervals, exhibit good order-by-order convergences with different chiral expansion order potentials. When DoB is considered as $1σ$ credibility, i.e., $68.27\%$ confidence interval, the truncation errors of binding energy per nucleon in symmetric nuclear matter and pure neutron matter are consistent with the results given by a simple error analysis method proposed by Epelbaum, {\it et al.} Finally, the reasonable values of the breakdown scale $Λ_b$ in nuclear matter are discussed through consistency checks of Bayesian method based on the calculations of success rate.

preprint2019arXiv

The charge-dependent Bonn potentials with pseudovector pion-nucleon coupling

To apply the high-precision realistic nucleon-nucleon ($NN$) potentials on the investigations of relativistic many-body methods, the new versions of charge-dependent Bonn (CD-Bonn) $NN$ potential are constructed within the pseudovector pion-nucleon coupling instead of the pseudoscalar type in the original CD-Bonn potential worked out by Machleidt [Phys. Rev. C 63, 024001 (2001)]. Two effective scalar mesons are introduced, whose coupling constants with nucleon are independently determined at each partial wave for total angular momentum $J\leq 4$, to describe the charge dependence of $NN$ scattering data precisely, while the coupling constants between vector, pseudovector mesons and nucleon are identical in all channels. Three revised CD-Bonn potentials adopting the pseudovector pion-nucleon couplings (pvCD-Bonn) are generated by fitting the Nijmegen PWA phase shift data and deuteron binding energy with different pion-nucleon coupling strengths, which can reproduce the phase shifts at spin-single channels and low-energy $NN$ scattering parameters very well, and provide the significantly different mixing parameters at spin-triplet channels. Furthermore, the $D$-state probabilities of deuteron from these potentials range from $4.22\%$ to $6.05\%$. It demonstrates that these potentials contain different components of tensor force, which will be useful to discuss the roles of tensor force in nuclear few-body and many-body systems.