Source author record

Ying Zhang

Ying Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

130works

62topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

1/2 order convergence rate of Euler-type methods for time-changed stochastic differential equations with super-linearly growing drift and diffusion coefficients

This paper investigates the strong convergence properties of two Euler-type methods for a class of time-changed stochastic differential equations (TCSDEs) with super-linearly growing drift and diffusion coefficients. Building upon existing research, we propose a backward Euler method (BEM) and introduce its explicit counterpart -- the projected Euler method (PEM). We prove that both methods converge strongly in the $L_2$-sense at the optimal rate of 1/2. This result extends the applicability of both the BEM and the PEM to a broader class of TCSDEs. Moreover, the two methods offer complementary strengths: while BEM possesses wide applicability, PEM is computationally more efficient. Numerical simulations confirm our theoretical findings and illustrate practical performance of both schemes.

preprint2026arXiv

Beyond "I cannot fulfill this request": Alleviating Rigid Rejection in LLMs via Label Enhancement

Large Language Models (LLMs) rely on safety alignment to obey safe requests while refusing harmful ones. However, traditional refusal mechanisms often lead to "rigid rejection," where a general template (e.g., "I cannot fulfill this request") indiscriminately triggers refusals and severely undermines the naturalness of interactions between humans and LLMs. To address this issue, LANCE is proposed in this paper to ensure safe yet flexible and natural responses via label enhancement. Specifically, LANCE employs variational inference to perform label enhancement, predicting a continuous distribution across multiple rejection categories. These fine-grained rejection distributions provide multi-way textual gradients for a refinement model to neutralize the hazardous elements in the prompt, so that the LLMs could generate safe responses that avoid rigid rejections while preserving the naturalness of interactions. Experiments demonstrate that LANCE significantly alleviates the rigid rejection problem while maintaining high security standards, significantly outperforming existing baseline models in terms of helpfulness and naturalness of responses.

preprint2026arXiv

Beyond Immediate Activation: Temporally Decoupled Backdoor Attacks on Time Series Forecasting

Existing backdoor attacks on multivariate time series (MTS) forecasting enforce strict temporal and dimensional coupling between triggers and target patterns, requiring synchronous activation at fixed positions across variables. However, realistic scenarios often demand delayed and variable-specific activation. We identify this critical unmet need and propose TDBA, a temporally decoupled backdoor attack framework for MTS forecasting. By injecting triggers that encode the expected location of the target pattern, TDBA enables the activation of the target pattern at any positions within the forecasted data, with the activation position flexibly varying across different variable dimensions. TDBA introduces two core modules: (1) a position-guided trigger generation mechanism that leverages smoothed Gaussian priors to generate triggers that are position-related to the predefined target pattern; and (2) a position-aware optimization module that assigns soft weights based on trigger completeness, pattern coverage, and temporal offset, facilitating targeted and stealthy attack optimization. Extensive experiments on real-world datasets show that TDBA consistently outperforms existing baselines in effectiveness while maintaining good stealthiness. Ablation studies confirm the controllability and robustness of its design.

preprint2026arXiv

EXG: Self-Evolving Agents with Experience Graphs

Large language model (LLM)-based agents have demonstrated strong capabilities in complex reasoning and problem solving through multi-step interactions, yet most deployed agents remain behaviorally static, with knowledge acquired during execution rarely translating into systematic improvement over time. In response, a growing line of work on self-evolving agents explores how agents can improve through experience during deployment, but most existing approaches either rely on ad hoc reflection limited to single-task correction or adopt unstructured memory that accumulates fragmented experience with delayed usability. To address this limitation, we introduce EXG, an experience graph framework for self-evolving agents that explicitly organizes accumulated successes and failures into a structured, relational representation. EXG is the first experience graph designed for self-evolving agents, supporting both online, real-time graph growth during execution for immediate cross-task experience reuse, and offline reuse of a consolidated experience graph as an external memory module. This design also enables EXG to serve as a plug-and-play component for existing self-evolving agents, organizing prior experience into a unified experience graph and improving both solution quality and resource efficiency as deployment progresses. Extensive experiments across code generation and reasoning benchmarks show that EXG attains more favorable performance-efficiency trade-offs than reflection- and memory-based baselines in both online and offline evaluations. Our results suggest that structuring experience as a graph provides a principled foundation for scalable and transferable self-evolving agent behavior.

preprint2026arXiv

HyperJoin: LLM-augmented Hypergraph Link Prediction for Joinable Table Discovery

As a pivotal task in data lake management, joinable table discovery has attracted widespread interest. While existing language model-based methods achieve remarkable performance by combining offline column representation learning with online ranking, their design insufficiently accounts for the underlying structural interactions: (1) offline, they directly model tables into isolated or pairwise columns, thereby struggling to capture the rich inter-table and intra-table structural information; and (2) online, they rank candidate columns based solely on query-candidate similarity, ignoring the mutual interactions among the candidates, leading to incoherent result sets. To address these limitations, we propose HyperJoin, a large language model (LLM)-augmented Hypergraph framework for Joinable table discovery. Specifically, we first construct a hypergraph to model tables using both the intra-table hyperedges and the LLM-augmented inter-table hyperedges. Consequently, the task of joinable table discovery is formulated as link prediction on this constructed hypergraph. We then design HIN, a Hierarchical Interaction Network that learns expressive column representations through bidirectional message passing over columns and hyperedges. To strengthen coherence and internal consistency in the result columns, we cast online ranking as a coherence-aware top-k column selection problem. We then introduce a reranking module that leverages a maximum spanning tree algorithm to prune noisy connections and maximize coherence. Experiments demonstrate the superiority of HyperJoin, achieving average improvements of 21.4% (Precision@15) and 17.2% (Recall@15) over the best baseline.

preprint2026arXiv

MedVL-SAM2: A unified 3D medical vision-language model for multimodal reasoning and prompt-driven segmentation

Recent progress in medical vision-language models (VLMs) has achieved strong performance on image-level text-centric tasks such as report generation and visual question answering (VQA). However, achieving fine-grained visual grounding and volumetric spatial reasoning in 3D medical VLMs remains challenging, particularly when aiming to unify these capabilities within a single, generalizable framework. To address this challenge, we proposed MedVL-SAM2, a unified 3D medical multimodal model that concurrently supports report generation, VQA, and multi-paradigm segmentation, including semantic, referring, and interactive segmentation. MedVL-SAM2 integrates image-level reasoning and pixel-level perception through a cohesive architecture tailored for 3D medical imaging, and incorporates a SAM2-based volumetric segmentation module to enable precise multi-granular spatial reasoning. The model is trained in a multi-stage pipeline: it is first pre-trained on a large-scale corpus of 3D CT image-text pairs to align volumetric visual features with radiology-language embeddings. It is then jointly optimized with both language-understanding and segmentation objectives using a comprehensive 3D CT segmentation dataset. This joint training enables flexible interaction via language, point, or box prompts, thereby unifying high-level visual reasoning with spatially precise localization. Our unified architecture delivers state-of-the-art performance across report generation, VQA, and multiple 3D segmentation tasks. Extensive analyses further show that the model provides reliable 3D visual grounding, controllable interactive segmentation, and robust cross-modal reasoning, demonstrating that high-level semantic reasoning and precise 3D localization can be jointly achieved within a unified 3D medical VLM.

preprint2026arXiv

Overeager Coding Agents: Measuring Out-of-Scope Actions on Benign Tasks

Coding agents now run autonomously with shell, file, and network privileges. When a user issues a benign request, the agent sometimes does more than asked: it deletes unrelated files, wipes a stale credentials backup, or rewrites configuration the user never mentioned. We call these scope expansions overeager actions, an authorization problem distinct from capability failures, prompt injection, or sandbox escapes. We present OverEager-Gen, a benchmark dedicated to overeager behavior on benign tasks. Building it surfaces a measurement-validity issue: if a benchmark spells out the authorized scope inside the prompt, the agent stops inferring boundaries and starts pattern-matching declaration text. On Claude Code, stripping the consent declaration alone raises the overeager rate from 0.0% to 17.1% on paired scenarios (McNemar exact p = 2.4 x 10^-4). OverEager-Gen therefore certifies each scenario's discriminative power before admission via a behavioral-gradient validator, audits internal tool calls through a dual-channel stack (PATH-injected shim plus per-agent event streams), and ships byte-identical consent_kept and consent_stripped variants. OverEager-Bench contains 500 validated scenarios and ~7,500 runs across four agent products (Claude Code, OpenHands, Codex CLI, Gemini CLI) and six base models; a 50-sample re-annotation gives Cohen's kappa = 0.73 and rule-judge recall = 1.00. Stripping consent multiplies the overeager rate on every shared base model (Delta in [11.9, 17.2] pp). The framework axis dominates effect size: a permissive cluster (Claude Code, Codex CLI, Gemini CLI) runs at 5.4-27.7% while the ask-to-continue framework (OpenHands) sits at 0.2-4.5% (Fisher p <= 10^-5). Within-framework base-model variance reaches 15.9 pp, indicating that model-layer alignment does not fully propagate through permissive permission gating.

preprint2026arXiv

Quantum Latin squares of order $6m$ with all possible cardinalities

A quantum Latin square of order $n$ (denoted as QLS$(n)$) is an $n\times n$ array whose entries are unit column vectors from the $n$-dimensional Hilbert space $\mathcal{H}_n$, such that each row and column forms an orthonormal basis. Two unit vectors $|u\rangle, |v\rangle\in \mathcal{H}_n$ are regarded as identical if there exists a real number $θ$ such that $|u\rangle=e^{iθ}|v\rangle$; otherwise, they are considered distinct. The cardinality $c$ of a QLS$(n)$ is the number of distinct vectors in the array. In this note,we use sub-QLS$(6)$ to prove that for any integer $m\geq 2$ and any $c\in [6m,36m^2]\setminus \{6m+1\}$, there is a QLS$(6m)$ with cardinality $c$.

preprint2026arXiv

Why is the $Z_c(3900)$ absent in the $h_cπ$ final state?

In this work, we perform a comprehensive phenomenological analysis of the exotic hadronic states $Z_c(3900)$, $Z_c(4020)$, $Z_b(10610)$ and $Z_b(10650)$ within the framework of Heavy Quark Spin Symmetry (HQSS) and its violation. By constructing S-wave contact interactions between elastic ($D\bar{D}^*/D^*\bar{D}^*$ or $B\bar{B}^*/B^*\bar{B}^*$) and inelastic ($J/ψπ, h_cπ$ or $Υπ$, $h_bπ$) channels, we solve the Lippmann-Schwinger equation to obtain physical production amplitudes and perform a global fit to experimental invariant-mass spectra. Our results demonstrate a striking difference between the charm and bottom sectors: HQSS violation is negligible in the bottom system, leading to comparable peak structures for both $Z_b$ states in all hidden-bottom decay channels. In contrast, significant HQSS breaking is required to describe the $Z_c$ system, where the violation is predominantly concentrated in the elastic interactions. This explains the observed selectivity: $Z_c(3900)$ appears prominently only in $J/ψπ$, while $Z_c(4020)$ appears only in $h_cπ$. Pole analysis confirms the molecular nature of the states, with the $Z_c(4020)$ likely arising from a threshold cusp effect. The model's robustness is verified against variations of the form factor and cutoff, showing stable results.

preprint2025arXiv

Analyzing Communication Predictability in LLM Training

Effective communication is essential in distributed training, with predictability being one of its most significant characteristics. However, existing studies primarily focus on exploiting predictability through online profiling for runtime optimization, without a systematic understanding of it. In this work, we aim to systematically formulate communication predictability in distributed training, particularly in Large Language Models (LLMs) that utilize hybrid parallelism. Our analysis focuses on both traffic patterns and communication overhead. Specifically, we investigate predictable traffic patterns in typical LLMs and evaluate how various factors influence GPU utilization and effective bandwidth (two critical variables affecting communication overhead). Furthermore, we develop an analytical formulation to estimate communication overhead in LLM training, which is validated with high accuracy against empirical data. Leveraging this formulation, we propose a configuration tuning tool, ConfigTuner, to optimize training performance. Compared to Megatron-LM, the training configurations optimized by ConfigTuner demonstrate up to a 1.36$\times$ increase in throughput. Compared to Alpa, ConfigTuner generates the same configuration suggestion while significantly reducing the search complexity.

preprint2023arXiv

Audio2Gestures: Generating Diverse Gestures from Audio

People may perform diverse gestures affected by various mental and physical factors when speaking the same sentences. This inherent one-to-many relationship makes co-speech gesture generation from audio particularly challenging. Conventional CNNs/RNNs assume one-to-one mapping, and thus tend to predict the average of all possible target motions, easily resulting in plain/boring motions during inference. So we propose to explicitly model the one-to-many audio-to-motion mapping by splitting the cross-modal latent code into shared code and motion-specific code. The shared code is expected to be responsible for the motion component that is more correlated to the audio while the motion-specific code is expected to capture diverse motion information that is more independent of the audio. However, splitting the latent code into two parts poses extra training difficulties. Several crucial training losses/strategies, including relaxed motion loss, bicycle constraint, and diversity loss, are designed to better train the VAE. Experiments on both 3D and 2D motion datasets verify that our method generates more realistic and diverse motions than previous state-of-the-art methods, quantitatively and qualitatively. Besides, our formulation is compatible with discrete cosine transformation (DCT) modeling and other popular backbones (\textit{i.e.} RNN, Transformer). As for motion losses and quantitative motion evaluation, we find structured losses/metrics (\textit{e.g.} STFT) that consider temporal and/or spatial context complement the most commonly used point-wise losses (\textit{e.g.} PCK), resulting in better motion dynamics and more nuanced motion details. Finally, we demonstrate that our method can be readily used to generate motion sequences with user-specified motion clips on the timeline.

preprint2023arXiv

Illegal Intelligent Reflecting Surface Based Active Channel Aging: When Jammer Can Attack Without Power and CSI

Illegal intelligent reflecting surfaces (I-IRSs), i.e., the illegal deployment and utilization of IRSs, impose serious harmful impacts on wireless networks. The existing I-IRS-based illegal jammer (IJ) requires channel state information (CSI) or extra power or both, and therefore, the I-IRS-based IJ seems to be difficult to implement in practical wireless networks. To raise concerns about significant potential threats posed by I-IRSs, we propose an alternative method to jam legitimate users (LUs) without relying on the CSI. By using an I-IRS to actively change wireless channels, the orthogonality of multi-user beamforming vectors and the co-user channels is destroyed, and significant inter-user interference is then caused, which is referred to as active channel aging. Such a fully-passive jammer (FPJ) can launch jamming attacks on multi-user multiple-input single-output (MU-MISO) systems via inter-user interference caused by active channel aging, where the IJ requires no additional transmit power and instantaneous CSI. The simulation results show the effectiveness of the proposed FPJ scheme. Moreover, we also investigate how the transmit power and the number of quantization phase shift bits influence the jamming performance.

preprint2022arXiv

%CRTFASTGEEPWR: a SAS macro for power of the generalized estimating equations of multi-period cluster randomized trials with application to stepped wedge designs

Multi-period cluster randomized trials (CRTs) are increasingly used for the evaluation of interventions delivered at the group level. While generalized estimating equations (GEE) are commonly used to provide population-averaged inference in CRTs, there is a gap of general methods and statistical software tools for power calculation based on multi-parameter, within-cluster correlation structures suitable for multi-period CRTs that can accommodate both complete and incomplete designs. A computationally fast, non-simulation procedure for determining statistical power is described for the GEE analysis of complete and incomplete multi-period cluster randomized trials. The procedure is implemented via a SAS macro, \%CRTFASTGEEPWR, which is applicable to binary, count and continuous responses and several correlation structures in multi-period CRTs. The SAS macro is illustrated in the power calculation of two complete and two incomplete stepped wedge cluster randomized trial scenarios under different specifications of marginal mean model and within-cluster correlation structure. The proposed GEE power method is quite general as demonstrated in the SAS macro with numerous input options. The power procedure and macro can also be used in the planning of parallel and crossover CRTs in addition to cross-sectional and closed cohort stepped wedge trials.

preprint2022arXiv

AFD Types Sparse Representations vs. the Karhunen-Loeve Expansion for Decomposing Stochastic Processes

This article introduces adaptive Fourier decomposition (AFD) type methods, emphasizing on those that can be applied to stochastic processes and random fields, mainly including stochastic adaptive Fourier decomposition and stochastic pre-orthogonal adaptive Fourier decomposition. We establish their algorithms based on the covariant function and prove that they enjoy the same convergence rate as the Karhunen-Loève (KL) decomposition. The AFD type methods are compared with the KL decomposition. In contrast with the latter, the AFD type methods do not need to compute eigenvalues and eigenfunctions of the kernel-integral operator induced by the covariance function, and thus considerably reduce the computation complexity and computer consumes. Various kinds of dictionaries offer AFD flexibility to solve problems of a great variety, including different types of deterministic and stochastic equations. The conducted experiments show, besides the numerical convenience and fast convergence, that the AFD type decompositions outperform the KL type in describing local details, in spite of the proven global optimality of the latter.

preprint2022arXiv

Carbide: Highly Reliable Networks Through Real-Time Multiple Control Plane Composition

Achieving highly reliable networks is essential for network operators to ensure proper packet delivery in the event of software errors or hardware failures. Networks must ensure reachability and routing correctness, such as subnet isolation and waypoint traversal. Existing work in network verification relies on centralized computation at the cost of fault tolerance, while other approaches either build an over-engineered, complex control plane, or compose multiple control planes without providing any guarantee on correctness. This paper presents Carbide, a novel system to achieve high reliability in networks through distributed verification and multiple control plane composition. The core of Carbide is a simple, generic, efficient distributed verification framework that transforms a generic network verification problem to a reachability verification problem on a directed acyclic graph (DAG), and solves the latter via an efficient distributed verification protocol (DV-protocol). Equipped with verification results, Carbide allows the systematic composition of multiple control planes and realization of operator-specified consistency. Carbide is fully implemented. Extensive experiments show that (1) Carbide reduces downtime by 43% over the most reliable individual underlying control plane, while enforcing correctness requirements on all traffic; and (2) by systematically decomposing computation to devices and pruning unnecessary messaging between devices during verification, Carbide scales to a production data center network.

preprint2022arXiv

Continuum Skyrme Hartree-Fock-Bogoliubov theory with Green's function method for neutron-rich Ca, Ni, Zr, Sn isotopes

The possible exotic nuclear properties in the neutron-rich Ca, Ni, Zr, and Sn isotopes are explored with the continuum Skyrme Hartree-Fock-Bogoliubov theory formulated with the Green's function method. The available experimental two-neutron separation energies $S_{\rm 2n}$ and one-neutron separation energies $S_{\rm n}$ are well reproduced. Much shorter drip lines predicted by $S_{\rm n}$ are obtained compared with those by $S_{\rm 2n}$. The systematic studies of the neutron pairing energies $-E_{\rm pair}$ shown that values of the odd-$A$ nuclei are much smaller in comparison with those of the neighboring even-even nuclei due to the absent contribution of pairing energy by the unpaired odd neutron. By investigating the single-particle structures, the rms radii and the density distributions, the possible halo structures in the neutron-rich Ca, Ni, and Sn isotopes are predicted, in which the sharp increases of rms radii with significant deviations from the traditional $r\varpropto A^{1/3}$ rule and very diffuse spatial distributions in densities are observed. By analyzing the contributions of different partial waves to the total neutron density $ρ_{lj}(r)/ρ(r)$, the orbitals locating around the Fermi surface especially those with low angular momenta are found the main reason causing the extended nuclear density and large rms radii. Finally, the numbers of the neutrons $N_λ$~($N_0$) occupied above the Fermi surface $λ_n$~(in the continuum) are discussed, the behaviors of which are basically consistent with those of pairing energy, supporting the key role of the pairing correlations in the halo phenomena.

preprint2022arXiv

Detailed Balance for Particle Models of Reversible Reactions in Bounded Domains

In particle-based stochastic reaction-diffusion models, reaction rate and placement kernels are used to decide the probability per time a reaction can occur between reactant particles, and to decide where product particles should be placed. When choosing kernels to use in reversible reactions, a key constraint is to ensure that detailed balance of spatial reaction-fluxes holds at all points at equilibrium. In this work we formulate a general partial-integral differential equation model that encompasses several of the commonly used contact reactivity (e.g. Smoluchowski-Collins-Kimball) and volume reactivity (e.g. Doi) particle models. From these equations we derive a detailed balance condition for the reversible $\textrm{A} + \textrm{B} \leftrightarrows \textrm{C}$ reaction. In bounded domains with no-flux boundary conditions, when choosing unbinding kernels consistent with several commonly used binding kernels, we show that preserving detailed balance of spatial reaction-fluxes at all points requires spatially varying unbinding rate functions near the domain boundary. Brownian Dynamics simulation algorithms can realize such varying rates through ignoring domain boundaries during unbinding and rejecting unbinding events that result in product particles being placed outside the domain.

preprint2022arXiv

Emotional Conversation Generation with Heterogeneous Graph Neural Network

The successful emotional conversation system depends on sufficient perception and appropriate expression of emotions. In a real-life conversation, humans firstly instinctively perceive emotions from multi-source information, including the emotion flow hidden in dialogue history, facial expressions, audio, and personalities of speakers. Then, they convey suitable emotions according to their personalities, but these multiple types of information are insufficiently exploited in emotional conversation fields. To address this issue, in this paper, we propose a heterogeneous graph-based model for emotional conversation generation. Firstly, we design a Heterogeneous Graph-Based Encoder to represent the conversation content (i.e., the dialogue history, its emotion flow, facial expressions, audio, and speakers' personalities) with a heterogeneous graph neural network, and then predict suitable emotions for feedback. Secondly, we employ an Emotion-Personality-Aware Decoder to generate a response relevant to the conversation context as well as with appropriate emotions, through taking the encoded graph representations, the predicted emotions by the encoder and the personality of the current speaker as inputs. Experiments on both automatic and human evaluation show that our method can effectively perceive emotions from multi-source knowledge and generate a satisfactory response. Furthermore, based on the up-to-date text generator BART, our model still can achieve consistent improvement, which significantly outperforms some existing state-of-the-art models.

preprint2022arXiv

Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review

The intelligent dialogue system, aiming at communicating with humans harmoniously with natural language, is brilliant for promoting the advancement of human-machine interaction in the era of artificial intelligence. With the gradually complex human-computer interaction requirements (e.g., multimodal inputs, time sensitivity), it is difficult for traditional text-based dialogue system to meet the demands for more vivid and convenient interaction. Consequently, Visual Context Augmented Dialogue System (VAD), which has the potential to communicate with humans by perceiving and understanding multimodal information (i.e., visual context in images or videos, textual dialogue history), has become a predominant research paradigm. Benefiting from the consistency and complementarity between visual and textual context, VAD possesses the potential to generate engaging and context-aware responses. For depicting the development of VAD, we first characterize the concepts and unique features of VAD, and then present its generic system architecture to illustrate the system workflow. Subsequently, several research challenges and representative works are detailed investigated, followed by the summary of authoritative benchmarks. We conclude this paper by putting forward some open issues and promising research trends for VAD, e.g., the cognitive mechanisms of human-machine dialogue under cross-modal dialogue context, and knowledge-enhanced cross-modal semantic interaction.

preprint2022arXiv

Example-Based Vulnerability Detection and Repair in Java Code

The Java libraries JCA and JSSE offer cryptographic APIs to facilitate secure coding. When developers misuse some of the APIs, their code becomes vulnerable to cyber-attacks. To eliminate such vulnerabilities, people built tools to detect security-API misuses via pattern matching. However, most tools do not (1) fix misuses or (2) allow users to extend tools' pattern sets. To overcome both limitations, we created Seader-an example-based approach to detect and repair security-API misuses. Given an exemplar <insecure, secure>code pair, Seader compares the snippets to infer any API-misuse template and corresponding fixing edit. Based on the inferred info, given a program, Seader performs inter-procedural static analysis to search for security-API misuses and to propose customized fixes. For evaluation, we applied Seader to 28 <insecure, secure> codepairs; Seader successfully inferred 21 unique API-misuse templates and related fixes. With these <vulnerability, fix> patterns, we applied SEADER to a program benchmark that has 86 known vulnerabilities. Seader detected vulnerabilities with 95% precision, 72% recall, and82% F-score. We also applied Seader to 100 open-source projects and manually checked 77 suggested repairs; 76 of the repairs were correct. Seader can help developers correctly use security APIs.

preprint2022arXiv

Exploiting full Resolution Feature Context for Liver Tumor and Vessel Segmentation via Integrate Framework: Application to Liver Tumor and Vessel 3D Reconstruction under embedded microprocessor

Liver cancer is one of the most common malignant diseases in the world. Segmentation and labeling of liver tumors and blood vessels in CT images can provide convenience for doctors in liver tumor diagnosis and surgical intervention. In the past decades, many state-of-the-art medical image segmentation algorithms appeared during this period. With the development of embedded devices, embedded deployment for medical segmentation and automatic reconstruction brings prospects for future automated surgical tasks. Yet, most of the existing segmentation methods mostly care about the spatial feature context and have a perception defect in the semantic relevance of medical images, which significantly affects the segmentation accuracy of liver tumors and blood vessels. Deploying large and complex models into embedded devices requires a reasonable trade-off between model accuracy, reasoning speed and model capacity. Given these problems, we introduce a multi-scale feature fusion network called TransFusionNet based on Transformer. This network achieved very competitive performance for liver vessel and liver tumor segmentation tasks, meanwhile it can improve the recognition of morphologic margins of liver tumors by exploiting the global information of CT images. Experiments show that in vessel segmentation task TransFusionNet achieved mean Dice coefficients of 0.899 and in liver tumor segmentation task TransFusionNet achieved mean Dice coefficients of 0.961. Compared with the state-of-the-art framework, our model achieves the best segmentation result. In addition, we deployed the model into an embedded micro-structure and constructed an integrated model for liver tumor vascular segmentation and reconstruction. This proprietary structure will be the exclusive component of the future medical field.

preprint2022arXiv

Hot 2DHG states in tellurium

Element semiconductor Te is very popular in both fundamental electronic structure study, and device fabrication research area due to its unique band structure. Specifically, in low temperatures, Te possesses strong quantum oscillations with magnetic field applied in basal plane, either following Shubnikov-de Haas (SdH) oscillation rule or following log-periodic oscillation rule. With magnetic field applied along the [001] direction, the SdH oscillations are attributed to the two-dimensional hole gas (2DHG) surface states. Here we reported an interesting SdH oscillation in Te-based single crystals, with the magnetic field applied along the [001] direction of the crystals, showing the maximum oscillation intensity at ~ 75 K, and still traceable at 200 K, which indicates a rather hot 2DHG state. The nontrivial Berry phase can be also obtained from the oscillations, implying the contribution from topological states. More importantly, the high temperature SdH oscillation phenomena are observed in different Te single crystals samples, and Te single crystals with nonmagnetic/magnetic dopants, showing robustness to bulk defects. Therefore, the oscillation may be contributed by the bulk symmetry protected hot 2DHG states, which will offer a new platform for high-temperature quantum transport studies.

preprint2022arXiv

Marginal Regression on Transient State Occupation Probabilities with Clustered Multistate Process Data

Clustered multistate process data are commonly encountered in multicenter observational studies and clinical trials. A clinically important estimand with such data is the marginal probability of being in a particular transient state as a function of time. However, there is currently no method for nonparametric marginal regression analysis of these probabilities with clustered multistate process data. To address this problem, we propose a weighted functional generalized estimating equations approach which does not impose Markov assumptions or assumptions regarding the structure of the within-cluster dependence, and allows for informative cluster size (ICS). The asymptotic properties of the proposed estimators for the functional regression coefficients are rigorously established and a nonparametric hypothesis testing procedure for covariate effects is proposed. Simulation studies show that the proposed method performs well even with a small number of clusters, and that ignoring the within-cluster dependence and the ICS leads to invalid inferences. The proposed method is used to analyze data from a multicenter clinical trial on recurrent or metastatic squamous-cell carcinoma of the head and neck with a stratified randomization design.

preprint2022arXiv

Mix-up Self-Supervised Learning for Contrast-agnostic Applications

Contrastive self-supervised learning has attracted significant research attention recently. It learns effective visual representations from unlabeled data by embedding augmented views of the same image close to each other while pushing away embeddings of different images. Despite its great success on ImageNet classification, COCO object detection, etc., its performance degrades on contrast-agnostic applications, e.g., medical image classification, where all images are visually similar to each other. This creates difficulties in optimizing the embedding space as the distance between images is rather small. To solve this issue, we present the first mix-up self-supervised learning framework for contrast-agnostic applications. We address the low variance across images based on cross-domain mix-up and build the pretext task based on two synergistic objectives: image reconstruction and transparency prediction. Experimental results on two benchmark datasets validate the effectiveness of our method, where an improvement of 2.5% ~ 7.4% in top-1 accuracy was obtained compared to existing self-supervised learning methods.

preprint2022arXiv

Multi-grained Label Refinement Network with Dependency Structures for Joint Intent Detection and Slot Filling

Slot filling and intent detection are two fundamental tasks in the field of natural language understanding. Due to the strong correlation between these two tasks, previous studies make efforts on modeling them with multi-task learning or designing feature interaction modules to improve the performance of each task. However, none of the existing approaches consider the relevance between the structural information of sentences and the label semantics of two tasks. The intent and semantic components of a utterance are dependent on the syntactic elements of a sentence. In this paper, we investigate a multi-grained label refinement network, which utilizes dependency structures and label semantic embeddings. Considering to enhance syntactic representations, we introduce the dependency structures of sentences into our model by graph attention layer. To capture the semantic dependency between the syntactic information and task labels, we combine the task specific features with corresponding label embeddings by attention mechanism. The experimental results demonstrate that our model achieves the competitive performance on two public datasets.

preprint2022arXiv

Probabilistic Models for Manufacturing Lead Times

In this study, we utilize Gaussian processes, probabilistic neural network, natural gradient boosting, and quantile regression augmented gradient boosting to model lead times of laser manufacturing processes. We introduce probabilistic modelling in the domain and compare the models in terms of different abilities. While providing a comparison between the models in real-life data, our work has many use cases and substantial business value. Our results indicate that all of the models beat the company estimation benchmark that uses domain experience and have good calibration with the empirical frequencies.

preprint2022arXiv

Reinforcement Learning Based Query Vertex Ordering Model for Subgraph Matching

Subgraph matching is a fundamental problem in various fields that use graph structured data. Subgraph matching algorithms enumerate all isomorphic embeddings of a query graph q in a data graph G. An important branch of matching algorithms exploit the backtracking search approach which recursively extends intermediate results following a matching order of query vertices. It has been shown that the matching order plays a critical role in time efficiency of these backtracking based subgraph matching algorithms. In recent years, many advanced techniques for query vertex ordering (i.e., matching order generation) have been proposed to reduce the unpromising intermediate results according to the preset heuristic rules. In this paper, for the first time we apply the Reinforcement Learning (RL) and Graph Neural Networks (GNNs) techniques to generate the high-quality matching order for subgraph matching algorithms. Instead of using the fixed heuristics to generate the matching order, our model could capture and make full use of the graph information, and thus determine the query vertex order with the adaptive learning-based rule that could significantly reduces the number of redundant enumerations. With the help of the reinforcement learning framework, our model is able to consider the long-term benefits rather than only consider the local information at current ordering step.Extensive experiments on six real-life data graphs demonstrate that our proposed matching order generation technique could reduce up to two orders of magnitude of query processing time compared to the state-of-the-art algorithms.

preprint2022arXiv

Semi-supervised segmentation of tooth from 3D Scanned Dental Arches

Teeth segmentation is an important topic in dental restorations that is essential for crown generation, diagnosis, and treatment planning. In the dental field, the variability of input data is high and there are no publicly available 3D dental arch datasets. Although there has been improvement in the field provided by recent deep learning architectures on 3D data, there still exists some problems such as properly identifying missing teeth in an arch. We propose to use spectral clustering as a self-supervisory signal to joint-train neural networks for segmentation of 3D arches. Our approach is motivated by the observation that K-means clustering provides cues to capture margin lines related to human perception. The main idea is to automatically generate training data by decomposing unlabeled 3D arches into segments relying solely on geometric information. The network is then trained using a joint loss that combines a supervised loss of annotated input and a self-supervised loss of non-labeled input. Our collected data has a variety of arches including arches with missing teeth. Our experimental results show improvement over the fully supervised state-of-the-art MeshSegNet when using semi-supervised learning. Finally, we contribute code and a dataset.

preprint2022arXiv

Sparse Representations of Solutions to a class of Random Boundary Value Problems

We introduce certain sparse representation methods, named as stochastic pre-orthogonal adaptive Fourier decomposition 1 and 2 (SPOAFD1 and SPOAFD2) to solve the Dirichlet boundary value problem and the Cauchy initial value problem of random data. To solve the stochastic boundary value problems the sparse representation is, as the initial step, applied to the random boundary data. Due to the semigroup property of the Poisson and the heat kernel, each entry of the expanding series can be lifted up to compose a solution of the Dirichlet and the Cauchy initial value problem, respectively. The sparse representation gives rise to analytic as well as numerical solutions to the problems with high efficiency.

preprint2022arXiv

Testing for Treatment Effect Twice Using Internal and External Controls in Clinical Trials

Leveraging external controls -- relevant individual patient data under control from external trials or real-world data -- has the potential to reduce the cost of randomized controlled trials (RCTs) while increasing the proportion of trial patients given access to novel treatments. However, due to lack of randomization, RCT patients and external controls may differ with respect to covariates that may or may not have been measured. Hence, after controlling for measured covariates, for instance by matching, testing for treatment effect using external controls may still be subject to unmeasured biases. In this paper, we propose a sensitivity analysis approach to quantify the magnitude of unmeasured bias that would be needed to alter the study conclusion that presumed no unmeasured biases are introduced by employing external controls. Whether leveraging external controls increases power or not depends on the interplay between sample sizes and the magnitude of treatment effect and unmeasured biases, which may be difficult to anticipate. This motivates a combined testing procedure that performs two highly correlated analyses, one with and one without external controls, with a small correction for multiple testing using the joint distribution of the two test statistics. The combined test provides a new method of sensitivity analysis designed for data fusion problems, which anchors at the unbiased analysis based on RCT only and spends a small proportion of the type I error to also test using the external controls. In this way, if leveraging external controls increases power, the power gain compared to the analysis based on RCT only can be substantial; if not, the power loss is small. The proposed method is evaluated in theory and power calculations, and applied to a real trial.

preprint2022arXiv

The Hadron-Quark Crossover in Neutron Star within Gaussian Process Regression Method

The equations of state of the neutron star at the hadron-quark crossover region are interpolated with the Gaussian process regression (GPR) method, which can reduce the randomness of present interpolation schemes. The relativistic mean-field (RMF) model and Nambu-Jona-Lasinio (NJL) model are employed to describe the hadronic phase and quark phase, respectively. In the RMF model, the coupling term between $ω$ and $ρ$ mesons is considered to control the density-dependent behaviors of symmetry energy, i.e. the slope of symmetry energy, $L$. Furthermore, the vector interaction between quarks is included in the NJL model to obtain the additional repulsive contributions. Their coupling strengths and the crossover windows are discussed in the present framework under the constraints on the neutron star from gravitational wave detections, massive neutron star measurements, mass-radius simultaneous observation of NICER collaboration, and the neutron skin thickness of $^{208}$Pb from PREX-II. It is found that the slope of symmetry energy, $L$ should be around $50-90$ MeV and the crossover window is $(0.3,~0.6)~\rm fm^{-3}$ with these observables. Furthermore, the uncertainties of neutron star masses and radii in the hadron-quark crossover regions are also predicted by the GPR method.

preprint2022arXiv

The hyperonic star in relativistic mean-field model

The neutron star as a supernova remnant is attracting high attention recently due to the gravitation wave detection and precise measurements about its mass and radius. In the inner core region of the neutron star, the strangeness degrees of freedom, such as the hyperons, can be present, which is also named as a hyperonic star. In this work, the neutron star consisting of nucleons and leptons, and the hyperonic star including the hyperons will be reviewed in the framework of the relativistic mean-field (RMF) model. The popular non-linear and density-dependent RMF parametrizations in the market will be adopted to investigate the role of strangeness baryons in a hyperonic star on its mass, radius, tidal deformability, and other properties. Finally, the magnitudes of the coupling strengths between mesons and hyperons also will be discussed, which can generate the massive hyperonic star with present RMF parameter sets, when the vector coupling constants are strong.

preprint2022arXiv

The lineshape of the compact fully heavy tetraquark

Hadrons and their distributions are the most direct observables in experiment, which would shed light on the non-perturbative mystery of quantum chromodynamics (QCD). As the result, any new hadron will challenge our current knowlege on the one hand, and provide additional inputs on the other hand. The fully heavy $cc\bar{c}\bar{c}$ system observed by LHCb recently opens a new era for hadron physics. We first extract the internal structure of the fully heavy tetraquarks directly from the experimental data, within the compact tetraquark picture. By fitting to the di-$J/ψ$ lineshape, we find that the $X(6900)$ is only cusp effect from the $J/ψψ(3770)$ channel. In addition, there is also a cusp slightly below $6.8~\mathrm{GeV}$ stemming from the $J/ψψ^\prime$ channel. The two $0^{++}$ tetraquarks behave as two resonances above the di-$η_c$ and di-$J/ψ$ threshold, respectively. The $2^{++}$ state is a bound state below the di-$J/ψ$ threshold. Furthermore, we find that the $X_{0^{++}}(6035)$ shows a significant structure in the di-$η_c$ lineshape even after the coupled channel effect. This is an unique feature which can distinguish compact $cc\bar{c}\bar{c}$ tetraquark from the loosely hadronic molecules.

preprint2022arXiv

The Minimal Flavor Structure from Decomposition of the Fermion Mass Matrix

The minimal flavor structures for both quarks and leptons are proposed to address fermion mass hierarchy and flavor mixings by bi-unitary decomposition of the fermion mass matrix. The real matrix ${\bf M}_0^f$ is completely responsive to family mass hierarchy, which is expressed by a close-to-flat matrix structure. The left-handed unitary phase ${\bf F}_L^f$ provides the origin of CP violation in quark and lepton mixings, which can be explained as a quantum effect between Yukawa interaction states and weak gauge states. The minimal flavor structure is realized by just 10 parameters without any redundancy, corresponding to 6 fermion masses, 3 mixing angles and 1 CP violation in the quark/lepton sector. This approach provides a general flavor structure independent of the specific quark or lepton flavor data. We verify the validation of the flavor structure by reproducing quark/lepton masses and mixings. Some possible scenarios that yield the flavor structure are also discussed.

preprint2022arXiv

The nuclear symmetry energy from relativistic Brueckner-Hartree-Fock model

The microscopic mechanisms of the symmetry energy in nuclear matter are investigated in the framework of the relativistic Brueckner-Hartree-Fock (RBHF) model with a high-precision realistic nuclear potential, pvCDBonn A. The kinetic energy and potential contributions to symmetry energy are decomposed. They are explicitly expressed by the nucleon self-energies, which are obtained through projecting the $G$-matrices from the RBHF model into the terms of Lorentz covariants. The nuclear medium effects on the nucleon self-energy and nucleon-nucleon interaction in symmetry energy are discussed by comparing the results from the RBHF model and those from Hartree-Fock and relativistic Hartree-Fock models. It is found that the nucleon self-energy including the nuclear medium effect on the single-nucleon wave function provides a largely positive contribution to the symmetry energy, while {the nuclear medium effect on the nucleon-nucleon interaction, i.e., the effective $G$-matrices generates the negative contribution}. The tensor force plays an essential role in the symmetry energy around the density. The scalar and vector covariant amplitudes of nucleon-nucleon interaction dominate the potential component of the symmetry energy. Furthermore, the isoscalar and isovector terms in the optical potential are extracted from the RBHF model. The isoscalar part is consistent with the results from the analysis of global optical potential, while the isovector one has obvious differences at higher incident energy due to the relativistic effect.

preprint2022arXiv

Towards Higher-order Topological Consistency for Unsupervised Network Alignment

Network alignment task, which aims to identify corresponding nodes in different networks, is of great significance for many subsequent applications. Without the need for labeled anchor links, unsupervised alignment methods have been attracting more and more attention. However, the topological consistency assumptions defined by existing methods are generally low-order and less accurate because only the edge-indiscriminative topological pattern is considered, which is especially risky in an unsupervised setting. To reposition the focus of the alignment process from low-order to higher-order topological consistency, in this paper, we propose a fully unsupervised network alignment framework named HTC. The proposed higher-order topological consistency is formulated based on edge orbits, which is merged into the information aggregation process of a graph convolutional network so that the alignment consistencies are transformed into the similarity of node embeddings. Furthermore, the encoder is trained to be multi-orbit-aware and then be refined to identify more trusted anchor links. Node correspondence is comprehensively evaluated by integrating all different orders of consistency. {In addition to sound theoretical analysis, the superiority of the proposed method is also empirically demonstrated through extensive experimental evaluation. On three pairs of real-world datasets and two pairs of synthetic datasets, our HTC consistently outperforms a wide variety of unsupervised and supervised methods with the least or comparable time consumption. It also exhibits robustness to structural noise as a result of our multi-orbit-aware training mechanism.

preprint2022arXiv

Towards Real-Time Counting Shortest Cycles on Dynamic Graphs: A Hub Labeling Approach

With the ever-increasing prevalence of graph data in a wide spectrum of applications, it becomes essential to analyze structural trends in dynamic graphs on a continual basis. The shortest cycle is a fundamental pattern in graph analytics. In this paper, we investigate the problem of shortest cycle counting for a given vertex in dynamic graphs in light of its applicability to problems such as fraud detection. To address such queries efficiently, we propose a 2-hop labeling based algorithm called Counting Shortest Cycle (CSC for short). Additionally, techniques for dynamically updating the CSC index are explored. Comprehensive experiments are conducted to demonstrate the efficiency and effectiveness of our method. In particular, CSC enables query evaluation in a few hundreds of microseconds for graphs with millions of edges, and improves query efficiency by two orders of magnitude when compared to the baseline solutions. Also, the update algorithm could efficiently cope with edge insertions (deletions).

preprint2022arXiv

Traffic Context Aware Data Augmentation for Rare Object Detection in Autonomous Driving

Detection of rare objects (e.g., traffic cones, traffic barrels and traffic warning triangles) is an important perception task to improve the safety of autonomous driving. Training of such models typically requires a large number of annotated data which is expensive and time consuming to obtain. To address the above problem, an emerging approach is to apply data augmentation to automatically generate cost-free training samples. In this work, we propose a systematic study on simple Copy-Paste data augmentation for rare object detection in autonomous driving. Specifically, local adaptive instance-level image transformation is introduced to generate realistic rare object masks from source domain to the target domain. Moreover, traffic scene context is utilized to guide the placement of masks of rare objects. To this end, our data augmentation generates training data with high quality and realistic characteristics by leveraging both local and global consistency. In addition, we build a new dataset, Rare Object Dataset (ROD), consisting 10k training images, 4k validation images and the corresponding labels with a diverse range of scenarios in autonomous driving. Experiments on ROD show that our method achieves promising results on rare object detection. We also present a thorough study to illustrate the effectiveness of our local-adaptive and global constraints based Copy-Paste data augmentation for rare object detection. The data, development kit and more information of ROD are available online at: \url{https://nullmax-vision.github.io}.

preprint2021arXiv

A Unified Yukawa Interaction for the Standard Model of Quarks and Leptons

To address fermion mass hierarchy and flavor mixings in the quark and lepton sectors, a minimal flavor structure without any redundant parameters beyond phenomenological observables is proposed via decomposition of the Standard Model Yukawa mass matrix into a bi-unitary form. After reviewing the roles and parameterization of the factorized matrix ${\bf M}_0^f$ and ${\bf F}_L^f$ in fermion masses and mixings, we generalize the mechanism to up- and down-type fermions to unify them into a universal quark/lepton Yukawa interaction. In the same way, a unified form of the description of the quark and lepton Yukawa interactions is also proposed, which shows a similar picture as the unification of gauge interactions.

preprint2021arXiv

Adaptive Load Shedding for Grid Emergency Control via Deep Reinforcement Learning

Emergency control, typically such as under-voltage load shedding (UVLS), is broadly used to grapple with low voltage and voltage instability issues in practical power systems under contingencies. However, existing emergency control schemes are rule-based and cannot be adaptively applied to uncertain and floating operating conditions. This paper proposes an adaptive UVLS algorithm for emergency control via deep reinforcement learning (DRL) and expert systems. We first construct dynamic components for picturing the power system operation as the environment. The transient voltage recovery criteria, which poses time-varying requirements to UVLS, is integrated into the states and reward function to advise the learning of deep neural networks. The proposed approach has no tuning issue of coefficients in reward functions, and this issue was regarded as a deficiency in the existing DRL-based algorithms. Extensive case studies illustrate that the proposed method outperforms the traditional UVLS relay in both the timeliness and efficacy for emergency control.

preprint2021arXiv

Consensus-Aware Visual-Semantic Embedding for Image-Text Matching

Image-text matching plays a central role in bridging vision and language. Most existing approaches only rely on the image-text instance pair to learn their representations, thereby exploiting their matching relationships and making the corresponding alignments. Such approaches only exploit the superficial associations contained in the instance pairwise data, with no consideration of any external commonsense knowledge, which may hinder their capabilities to reason the higher-level relationships between image and text. In this paper, we propose a Consensus-aware Visual-Semantic Embedding (CVSE) model to incorporate the consensus information, namely the commonsense knowledge shared between both modalities, into image-text matching. Specifically, the consensus information is exploited by computing the statistical co-occurrence correlations between the semantic concepts from the image captioning corpus and deploying the constructed concept correlation graph to yield the consensus-aware concept (CAC) representations. Afterwards, CVSE learns the associations and alignments between image and text based on the exploited consensus as well as the instance-level representations for both modalities. Extensive experiments conducted on two public datasets verify that the exploited consensus makes significant contributions to constructing more meaningful visual-semantic embeddings, with the superior performances over the state-of-the-art approaches on the bidirectional image and text retrieval task. Our code of this paper is available at: https://github.com/BruceW91/CVSE.

preprint2021arXiv

Data-Driven Vulnerability Detection and Repair in Java Code

Java platform provides various APIs to facilitate secure coding. However, correctly using security APIs is usually challenging for developers who lack cybersecurity training. Prior work shows that many developers misuse security APIs; such misuses can introduce vulnerabilities into software, void security protections, and present security exploits to hackers. To eliminate such API-related vulnerabilities, this paper presents SEADER -- our new approach that detects and repairs security API misuses. Given an exemplar, insecure code snippet, and its secure counterpart, SEADER compares the snippets and conducts data dependence analysis to infer the security API misuse templates and corresponding fixing operations. Based on the inferred information, given a program, SEADER performs inter-procedural static analysis to search for any security API misuse and to propose customized fixing suggestions for those vulnerabilities. To evaluate SEADER, we applied it to 25 <insecure, secure> code pairs, and SEADER successfully inferred 18 unique API misuse templates and related fixes. With these vulnerability repair patterns, we further applied SEADER to 10 open-source projects that contain in total 32 known vulnerabilities. Our experiment shows that SEADER detected vulnerabilities with 100% precision, 84% recall, and 91% accuracy. Additionally, we applied SEADER to 100 Apache open-source projects and detected 988 vulnerabilities; SEADER always customized repair suggestions correctly. Based on SEADER's outputs, we filed 60 pull requests. Up till now, developers of 18 projects have offered positive feedbacks on SEADER's suggestions. Our results indicate that SEADER can effectively help developers detect and fix security API misuses. Whereas prior work either detects API misuses or suggests simple fixes, SEADER is the first tool to do both for nontrivial vulnerability repairs.

preprint2021arXiv

Detected the steerability bounds of the generalized Werner states via BackPropagation neural network

We use error BackPropagation (BP) neural network to determine whether an arbitrary two-qubit quantum state is steerable and optimize the steerability bounds of the generalized Werner state. The results show that no matter how we choose the features for the quantum states, we can use the BP neural network to construct several models to realize high-performance quantum steering classifiers compared with the support vector machine (SVM). In addition, we predict the steerability bounds of the generalized Werner states by using the classifiers which are newly constructed by the BP neural network, that is, the predicted steerability bounds are closer to the theoretical bounds. In particular, high-performance classifiers with partial information of the quantum states which we only need to measure in three fixed measurement directions are obtained.

preprint2021arXiv

On stochastic gradient Langevin dynamics with dependent data streams: the fully non-convex case

We consider the problem of sampling from a target distribution, which is \emph {not necessarily logconcave}, in the context of empirical risk minimization and stochastic optimization as presented in Raginsky et al. (2017). Non-asymptotic analysis results are established in the $L^1$-Wasserstein distance for the behaviour of Stochastic Gradient Langevin Dynamics (SGLD) algorithms. We allow the estimation of gradients to be performed even in the presence of \emph{dependent} data streams. Our convergence estimates are sharper and \emph{uniform} in the number of iterations, in contrast to those in previous studies.

preprint2021arXiv

Quantifying measurement-induced nonbilocal correlation

In the paper, we devote to defining an available measure to quantify the nonbilocal correlation in the entanglement-swapping experiment. Then we obtain analytical formulas to calculate the quantifier when the inputs are pure states. For the case of mixed inputs, we discuss the computational properties of the quantifier. Finally, we derive a tight upper bound to the nonbilocality quantifier.

preprint2021arXiv

Similarity Reasoning and Filtration for Image-Text Matching

Image-text matching plays a critical role in bridging the vision and language, and great progress has been made by exploiting the global alignment between image and sentence, or local alignments between regions and words. However, how to make the most of these alignments to infer more accurate matching scores is still underexplored. In this paper, we propose a novel Similarity Graph Reasoning and Attention Filtration (SGRAF) network for image-text matching. Specifically, the vector-based similarity representations are firstly learned to characterize the local and global alignments in a more comprehensive manner, and then the Similarity Graph Reasoning (SGR) module relying on one graph convolutional neural network is introduced to infer relation-aware similarities with both the local and global alignments. The Similarity Attention Filtration (SAF) module is further developed to integrate these alignments effectively by selectively attending on the significant and representative alignments and meanwhile casting aside the interferences of non-meaningful alignments. We demonstrate the superiority of the proposed method with achieving state-of-the-art performances on the Flickr30K and MSCOCO datasets, and the good interpretability of SGR and SAF modules with extensive qualitative experiments and analyses.

preprint2021arXiv

The $ΞN$ interaction constrained by recent $Ξ^-$ hypernuclei experiments

The $ΞN$ interaction is investigated in the quark mean-field (QMF) model based on recent observables of the $Ξ^-+^{14}\rm{N}$ ($_{Ξ^-}^{15}\rm{C}$) system. The experimental data about the binding energy of $1p$-state $Ξ^-$ hyperon in $_{Ξ^-}^{15}\rm{C}$ hypernuclei at KISO, IBUKI, E07-T011, E176-14-03-35 events are conflated as $B_{Ξ^-}(1p)=1.14\pm0.11$ MeV. With this constraint, the coupling strengths between the vector meson and $Ξ$ hyperon are fixed in three QMF parameter sets. Meanwhile, the $Ξ^-$ binding energy of $1s$ state in $_{Ξ^-}^{15}\rm{C}$ is predicted as $B_{Ξ^-}(1s)=5.66\pm0.38$ MeV with the same interactions, which are completely consistent with the data from the KINKA and IRRAWADDY events. Finally, the single $ΞN$ potential is calculated in the symmetric nuclear matter in the framework of QMF models. It is $U_{ΞN}=-11.96\pm 0.85$ MeV at nuclear saturation density, which will contribute to the study on the strangeness degree of freedom in compact star.

preprint2020arXiv

A fully data-driven approach to minimizing CVaR for portfolio of assets via SGLD with discontinuous updating

A new approach in stochastic optimization via the use of stochastic gradient Langevin dynamics (SGLD) algorithms, which is a variant of stochastic gradient decent (SGD) methods, allows us to efficiently approximate global minimizers of possibly complicated, high-dimensional landscapes. With this in mind, we extend here the non-asymptotic analysis of SGLD to the case of discontinuous stochastic gradients. We are thus able to provide theoretical guarantees for the algorithm's convergence in (standard) Wasserstein distances for both convex and non-convex objective functions. We also provide explicit upper estimates of the expected excess risk associated with the approximation of global minimizers of these objective functions. All these findings allow us to devise and present a fully data-driven approach for the optimal allocation of weights for the minimization of CVaR of portfolio of assets with complete theoretical guarantees for its performance. Numerical results illustrate our main findings.

preprint2020arXiv

A Unified Approach to Scalable Spectral Sparsification of Directed Graphs

Recent spectral graph sparsification research allows constructing nearly-linear-sized subgraphs that can well preserve the spectral (structural) properties of the original graph, such as the first few eigenvalues and eigenvectors of the graph Laplacian, leading to the development of a variety of nearly-linear time numerical and graph algorithms. However, there is not a unified approach that allows for truly-scalable spectral sparsification of both directed and undirected graphs. In this work, we prove the existence of linear-sized spectral sparsifiers for general directed graphs and introduce a practically-efficient and unified spectral graph sparsification approach that allows sparsifying real-world, large-scale directed and undirected graphs with guaranteed preservation of the original graph spectra. By exploiting a highly-scalable (nearly-linear complexity) spectral matrix perturbation analysis framework for constructing nearly-linear sized (directed) subgraphs, it enables us to well preserve the key eigenvalues and eigenvectors of the original (directed) graph Laplacians. The proposed method has been validated using various kinds of directed graphs obtained from public domain sparse matrix collections, showing promising results for solving directed graph Laplacians, spectral embedding, and partitioning of general directed graphs, as well as approximately computing (personalized) PageRank vectors.

preprint2020arXiv

AOT: Pushing the Efficiency Boundary of Main-memory Triangle Listing

Triangle listing is an important topic significant in many practical applications. Efficient algorithms exist for the task of triangle listing. Recent algorithms leverage an orientation framework, which can be thought of as mapping an undirected graph to a directed acylic graph, namely oriented graph, with respect to any global vertex order. In this paper, we propose an adaptive orientation technique that satisfies the orientation technique but refines it by traversing carefully based on the out-degree of the vertices in the oriented graph during the computation of triangles. Based on this adaptive orientation technique, we design a new algorithm, namely aot, to enhance the edge-iterator listing paradigm. We also make improvements to the performance of aot by exploiting the local order within the adjacent list of the vertices. We show that aot is the first work which can achieve best performance in terms of both practical performance and theoretical time complexity. Our comprehensive experiments over $16$ real-life large graphs show a superior performance of our \aot algorithm when compared against the state-of-the-art, especially for massive graphs with billions of edges. Theoretically, we show that our proposed algorithm has a time complexity of $Θ(\sum_{ \langle u,v \rangle \in \vec{E} } \min\{ deg^{+}(u),deg^{+}(v)\}))$, where $\vec{E}$ and $deg^{+}(x)$ denote the set of directed edges in an oriented graph and the out-degree of vertex $x$ respectively. As to our best knowledge, this is the best time complexity among in-memory triangle listing algorithms.

preprint2020arXiv

Attack Identification and Correction for PMU GPS Spoofing in Unbalanced Distribution Systems

Due to the vulnerability of civilian global positioning system (GPS) signals, the accuracy of phasor measurement units (PMUs) can be greatly compromised by GPS spoofing attacks (GSAs), which introduce phase shifts into true phase angle measurements. Focusing on simultaneous GSAs for multiple PMU locations, this paper proposes a novel identification and correction algorithm in distribution systems. A sensitivity analysis of state estimation residuals on a single GSA phase angle is firstly implemented. An identification algorithm using a probing technique is proposed to determine the locations of spoofed PMUs and the ranges of GSA phase shifts. Based on the identification results, these GSA phase shifts are determined via an estimation algorithm that minimizes the mismatch between measurements and system states. Further, with the attacked PMU data corrected, the system states are recovered. Simulations in unbalanced IEEE 34-bus and 123-bus distribution systems demonstrates the efficiency and accuracy of the proposed method.

preprint2020arXiv

Deep Reinforcement Learning Based Volt-VAR Optimization in Smart Distribution Systems

This paper develops a model-free volt-VAR optimization (VVO) algorithm via multi-agent deep reinforcement learning (MADRL) in unbalanced distribution systems. This method is novel since we cast the VVO problem in unbalanced distribution networks to an intelligent deep Q-network (DQN) framework, which avoids solving a specific optimization model directly when facing time-varying operating conditions of the systems. We consider statuses/ratios of switchable capacitors, voltage regulators, and smart inverters installed at distributed generators as the action variables of the DQN agents. A delicately designed reward function guides these agents to interact with the distribution system, in the direction of reinforcing voltage regulation and power loss reduction simultaneously. The forward-backward sweep method for radial three-phase distribution systems provides accurate power flow results within a few iterations to the DQN environment. Finally, the proposed multi-objective MADRL method realizes the dual goals for VVO. We test this algorithm on the unbalanced IEEE 13-bus and 123-bus systems. Numerical simulations validate the excellent performance of this method in voltage regulation and power loss reduction.

preprint2020arXiv

Effects of symmetry energy on the radius and tidal deformability of neutron stars in relativistic mean-field model

The radii and tidal deformabilities of neutron stars are investigated in the framework of relativistic mean-field (RMF) model with different density-dependent behaviors of symmetry energy. To study the effects of symmetry energy on the properties of neutron stars, an $ω$ meson and $ρ$ meson coupling term is included in a popular RMF Lagrangian, i.e. the TM1 parameter set, which is used for the widely used supernova equation of state (EoS) table. The coupling constants relevant to the vector-isovector meson, $ρ$, are refitted by a fixed symmetry energy at subsaturation density and its slope at saturation density, while other coupling constants remain the same as the original ones in TM1 so as to update the supernova EoS table. The radius and mass of maximum neutron stars are not so sensitive to the symmetry energy in these family TM1 parameterizations. However, the radii at intermediate mass region are strongly correlated with the slope of symmetry energy. Furthermore, the dimensionless tidal deformabilities of neutron stars are also calculated within the associated Love number. We find that its value at $1.4 M_\odot$ has a linear correlation to the slope of symmetry energy being different from the previous studied. With the latest constraints of tidal deformabilities from GW170817 event, the slope of symmetry energy at nuclear saturation density should be smaller than $60$ MeV in the family TM1 parameterizations. This fact supports the usage of lower symmetry energy slope for the update supernova EoS, which is applicable to simulations of neutron star merger. Furthermore, the analogous analysis are also done within the family IUFSU parameter sets. It is found that the correlations between the symmetry energy slope with the radius and tidal deformability at $1.4 M_\odot$ have very similar linear relations in these RMF models.

preprint2020arXiv

Efficient Bitruss Decomposition for Large-scale Bipartite Graphs

Cohesive subgraph mining in bipartite graphs becomes a popular research topic recently. An important structure k-bitruss is the maximal cohesive subgraph where each edge is contained in at least k butterflies (i.e., (2, 2)-bicliques). In this paper, we study the bitruss decomposition problem which aims to find all the k-bitrusses for k >= 0. The existing bottom-up techniques need to iteratively peel the edges with the lowest butterfly support. In this peeling process, these techniques are time-consuming to enumerate all the supporting butterflies for each edge. To relax this issue, we first propose a novel online index -- the BE-Index which compresses butterflies into k-blooms (i.e., (2, k)-bicliques). Based on the BE-Index, the new bitruss decomposition algorithm BiT-BU is proposed, along with two batch-based optimizations, to accomplish the butterfly enumeration of the peeling process in an efficient way. Furthermore, the BiT-PC algorithm is devised which is more efficient against handling the edges with high butterfly supports. We theoretically show that our new algorithms significantly reduce the time complexities of the existing algorithms. Also, we conduct extensive experiments on real datasets and the result demonstrates that our new techniques can speed up the state-of-the-art techniques by up to two orders of magnitude.

preprint2020arXiv

Efficient Matrix Factorization on Heterogeneous CPU-GPU Systems

Matrix Factorization (MF) has been widely applied in machine learning and data mining. A large number of algorithms have been studied to factorize matrices. Among them, stochastic gradient descent (SGD) is a commonly used method. Heterogeneous systems with multi-core CPUs and GPUs have become more and more promising recently due to the prevalence of GPUs in general-purpose data-parallel applications. Due to the large computational cost of MF, we aim to improve the efficiency of SGD-based MF computation by utilizing the massive parallel processing power of heterogeneous multiprocessors. The main challenge in parallel SGD algorithms on heterogeneous CPU-GPU systems lies in the granularity of the matrix division and the strategy to assign tasks. We design a novel strategy to divide the matrix into a set of blocks by considering two aspects. First, we observe that the matrix should be divided nonuniformly, and relatively large blocks should be assigned to GPUs to saturate the computing power of GPUs. In addition to exploiting the characteristics of hardware, the workloads assigned to two types of hardware should be balanced. Aiming at the final division strategy, we design a cost model tailored for our problem to accurately estimate the performance of hardware on different data sizes. A dynamic scheduling policy is also used to further balance workloads in practice. Extensive experiments show that our proposed algorithm achieves high efficiency with a high quality of training quality.

preprint2020arXiv

Exploring Cohesive Subgraphs with Vertex Engagement and Tie Strength in Bipartite Graphs

We propose a novel cohesive subgraph model called $τ$-strengthened $(α,β)$-core (denoted as $(α,β)_τ$-core), which is the first to consider both tie strength and vertex engagement on bipartite graphs. An edge is a strong tie if contained in at least $τ$ butterflies ($2\times2$-bicliques). $(α,β)_τ$-core requires each vertex on the upper or lower level to have at least $α$ or $β$ strong ties, given strength level $τ$. To retrieve the vertices of $(α,β)_τ$-core optimally, we construct index $I_{α,β,τ}$ to store all $(α,β)_τ$-cores. Effective optimization techniques are proposed to improve index construction. To make our idea practical on large graphs, we propose 2D-indexes $I_{α,β}, I_{β,τ}$, and $I_{α,τ}$ that selectively store the vertices of $(α,β)_τ$-core for some $α,β$, and $τ$. The 2D-indexes are more space-efficient and require less construction time, each of which can support $(α,β)_τ$-core queries. As query efficiency depends on input parameters and the choice of 2D-index, we propose a learning-based hybrid computation paradigm by training a feed-forward neural network to predict the optimal choice of 2D-index that minimizes the query time. Extensive experiments show that ($1$) $(α,β)_τ$-core is an effective model capturing unique and important cohesive subgraphs; ($2$) the proposed techniques significantly improve the efficiency of index construction and query processing.

preprint2020arXiv

Graph-based Faulted Line Identification Using Micro-PMU Data in Distribution Systems

Motivated by increasing penetration of distributed generators (DGs) and fast development of micro-phasor measurement units (μPMUs), this paper proposes a novel graph-based faulted line identification algorithm using a limited number of μPMUs in distribution networks. The core of the proposed method is to apply advanced distribution system state estimation (DSSE) techniques integrating μPMU data to the fault location. We propose a distributed DSSE algorithm to efficiently restrict the searching region for the fault source in the feeder between two adjacent μPMUs. Based on the graph model of the feeder in the reduced searching region, we further perform the DSSE in a hierarchical structure and identify the location of the fault source. Also, the proposed approach captures the impact of DGs on distribution system operation and remains robust against high-level noises in measurements. Numerical simulations verify the accuracy and efficiency of the proposed method under various fault scenarios covering multiple fault types and fault impedances.

preprint2020arXiv

Graph3S: A Simple, Speedy and Scalable Distributed Graph Processing System

Graph is a ubiquitous structure in many domains. The rapidly increasing data volume calls for efficient and scalable graph data processing. In recent years, designing distributed graph processing systems has been an increasingly important area to fulfil the demands of processing big graphs in a distributed environment. Though a variety of distributed graph processing systems have been developed, very little attention has been paid to achieving a good combinational system performance in terms of usage simplicity, efficiency and scalability. To contribute to the study of distributed graph processing system, this work tries to fill this gap by designing a simple, speedy and scalable system. Our observation is that enforcing the communication flexibility of a system leads to the gains of both system efficiency and scalability as well as simple usage. We realize our idea in a system Graph3S and conduct extensive experiments with diverse algorithms over big graphs from different domains to test its performance. The results show that, besides simple usage, our system has outstanding performance over various graph algorithms and can even reach up to two orders of magnitude speedup over existing in-memory systems when applying to some algorithms. Also, its scalability is competitive to disk-based systems and even better when less machines are used.

preprint2020arXiv

Interval State Estimation with Uncertainty of Distributed Generation and Line Parameters in Unbalanced Distribution Systems

Distribution system state estimation (DSSE), which provides critical information for system monitoring and control, is being challenged by multiple sources of uncertainties such as random meter errors, stochastic power output of distributed generation (DG), and imprecise network parameters. This paper originally proposes a general interval state estimation (ISE) model to simultaneously formulate these uncertainties in unbalanced distribution systems by interval arithmetic. Moreover, this model can accommodate partially available measurements of DG outputs and inaccurate line parameters. Further, a modified Krawczyk-operator (MKO) algorithm is proposed to solve the general ISE model efficiently, and effectively provides the upper and lower bounds of state variables under coordinated impacts of these uncertainties. The proposed algorithm is tested on unbalanced IEEE 13-bus and 123-bus systems. Comparison with various methods including Monte Carlo simulation indicates that the proposed algorithm is many orders of magnitude faster and encloses tighter boundaries of state variables.

preprint2020arXiv

Machine learning on DNA-encoded libraries: A new paradigm for hit-finding

DNA-encoded small molecule libraries (DELs) have enabled discovery of novel inhibitors for many distinct protein targets of therapeutic value through screening of libraries with up to billions of unique small molecules. We demonstrate a new approach applying machine learning to DEL selection data by identifying active molecules from a large commercial collection and a virtual library of easily synthesizable compounds. We train models using only DEL selection data and apply automated or automatable filters with chemist review restricted to the removal of molecules with potential for instability or reactivity. We validate this approach with a large prospective study (nearly 2000 compounds tested) across three diverse protein targets: sEH (a hydrolase), ERα (a nuclear receptor), and c-KIT (a kinase). The approach is effective, with an overall hit rate of {\sim}30% at 30 {\textmu}M and discovery of potent compounds (IC50 <10 nM) for every target. The model makes useful predictions even for molecules dissimilar to the original DEL and the compounds identified are diverse, predominantly drug-like, and different from known ligands. Collectively, the quality and quantity of DEL selection data; the power of modern machine learning methods; and access to large, inexpensive, commercially-available libraries creates a powerful new approach for hit finding.

preprint2020arXiv

Properties of neutron star described by a relativistic $ab~ initio$ model

Properties of neutron star are investigated by an available relativistic $ab~ initio$ method, i.e., the relativistic Brueckner-Hartree-Fock (RBHF) model, with the latest high-precision relativistic charge-dependent potentials, pvCD-Bonn A, B, C. The neutron star matter is solved within the beta equilibrium and charge neutrality conditions in the framework of RBHF model. Comparing to the conventional treatment, where the chemical potential of lepton was approximately represented by the symmetry energy of nuclear matter, the equation of state (EOS) of neutron star matter in the present self-consistent calculation with pvCD-Bonn B has striking difference above the baryon number density $n_b=0.55$ fm$^{-3}$. However, these differences influence the global properties of neutron star only about $1\%\sim2\%$. Then, three two-body potentials pvCD-Bonn A, B, C, with different tensor components, are systematically applied in RBHF model to calculate the properties of neutron star. It is found that the maximum masses of neutron star are around $2.21\sim2.30M_\odot$ and the corresponding radii are $R =11.18\sim11.72$ km. The radii of $1.4M_\odot$ neutron star are predicated as $R_{1.4} = 12.34\sim12.91$ km and their dimensionless tidal deformabilities are $Λ_{1.4} = 485\sim 626$. Furthermore, the direct URCA process in neutron star cooling will happen from $n_b=0.414\sim0.530$ fm$^{-3}$ with the proton fractions, $Y_p=0.136\sim0.138$. All of the results obtained from RBHF model only with two-body pvCD-Bonn potentials completely satisfy various constraints from recent astronomical observations of massive neutron stars, gravitational wave detection (GW 170817), and mass-radius simultaneous measurement (NICER).

preprint2020arXiv

Semiparametric regression and risk prediction with competing risks data under missing cause of failure

The cause of failure in cohort studies that involve competing risks is frequently incompletely observed. To address this, several methods have been proposed for the semiparametric proportional cause-specific hazards model under a missing at random assumption. However, these proposals provide inference for the regression coefficients only, and do not consider the infinite dimensional parameters, such as the covariate-specific cumulative incidence function. Nevertheless, the latter quantity is essential for risk prediction in modern medicine. In this paper we propose a unified framework for inference about both the regression coefficients of the proportional cause-specific hazards model and the covariate-specific cumulative incidence functions under missing at random cause of failure. Our approach is based on a novel computationally efficient maximum pseudo-partial-likelihood estimation method for the semiparametric proportional cause-specific hazards model. Using modern empirical process theory we derive the asymptotic properties of the proposed estimators for the regression coefficients and the covariate-specific cumulative incidence functions, and provide methodology for constructing simultaneous confidence bands for the latter. Simulation studies show that our estimators perform well even in the presence of a large fraction of missing cause of failures, and that the regression coefficient estimator can be substantially more efficient compared to the previously proposed augmented inverse probability weighting estimator. The method is applied using data from an HIV cohort study and a bladder cancer clinical trial.

preprint2020arXiv

SF-GRASS: Solver-Free Graph Spectral Sparsification

Recent spectral graph sparsification techniques have shown promising performance in accelerating many numerical and graph algorithms, such as iterative methods for solving large sparse matrices, spectral partitioning of undirected graphs, vectorless verification of power/thermal grids, representation learning of large graphs, etc. However, prior spectral graph sparsification methods rely on fast Laplacian matrix solvers that are usually challenging to implement in practice. This work, for the first time, introduces a solver-free approach (SF-GRASS) for spectral graph sparsification by leveraging emerging spectral graph coarsening and graph signal processing (GSP) techniques. We introduce a local spectral embedding scheme for efficiently identifying spectrally-critical edges that are key to preserving graph spectral properties, such as the first few Laplacian eigenvalues and eigenvectors. Since the key kernel functions in SF-GRASS can be efficiently implemented using sparse-matrix-vector-multiplications (SpMVs), the proposed spectral approach is simple to implement and inherently parallel friendly. Our extensive experimental results show that the proposed method can produce a hierarchy of high-quality spectral sparsifiers in nearly-linear time for a variety of real-world, large-scale graphs and circuit networks when compared with the prior state-of-the-art spectral method.

preprint2020arXiv

The common origin of the family mass hierarchy and CP violation from flavour-dependent vacuum for quarks and leptons

We rewrite the Yukawa interactions of the standard model in terms of flavour-dependent vacuum structure to address the family mass hierarchy and CP violation in both the quark sector and lepton sector. It is realized for the first time that the Lagrangian only includes the same number of degrees of freedom as phenomenological observables with no requirement of extra particles or any new symmetry. The quark and lepton mass hierarchy arises as a natural result of the close-to-flat vacuum in flavour space. The CP violation in CKM and PMNS is explained as a general quantum phase between weak gauge eigenstates and Yukawa interaction states. The mechanism is proven by reproduction of all current quark/lepton mass data and the CKM/PMNS flavour mixings.

preprint2020arXiv

The one-pion-exchange potential with contact terms from lattice QCD simulations

The pion-mass-dependent nucleon-nucleon ($NN$) potentials in term of one-pion exchange and contact terms are obtained from the latest lattice QCD simulations of two-nucleon system, which employ the forms of leading order (LO) $NN$ potential from the chiral effective field theory and thus are named as the LO chiral potential in this work. We extract the coefficients of contact terms and cut-off momenta in these potentials, for the first time, by fitting the phase shifts of $^1S_0$ and $^3S_1$ channels generated by the results of HALQCD collaboration with various pion masses from $468.6$ MeV to $1170.9$ MeV. The low-energy constants in the $^1S_0$ and $^3S_1$ channels become weaker and approach each other for larger pion masses. These LO chiral potentials are applied to symmetric nuclear and pure neutron matter within the Brueckner-Hartree-Fock method. At this moment, however, we do not have yet the information of the $P$-wave $NN$ interaction to be provided by the lattice QCD simulations for full description of nuclear matter. Our results will enhance the development of nuclear structure and nuclear matter by controlling the contribution of the pionic effect and illuminate the role of chiral symmetry of the strong interaction in complex system.

preprint2020arXiv

Towards Highly Efficient State Estimation with Nonlinear Measurements in Distribution Systems

This letter proposes a novel and highly efficient distribution system state estimation (DSSE) algorithm with nonlinear measurements from supervisory control and data acquisition (SCADA) systems. Conventional DSSE, i.e., a weighted least square (WLS)-based method, requires multiple Monte Carlo simulations for overall accuracy evaluation and has high calculation cost due to the nonlinear iterative process. The proposed method uses the Taylor series of voltages for constructing a linear DSSE model in the interval form and then solve this model by interval arithmetic. Compared with the nonlinear WLS-based method, the proposed method obtains accurate and robust estimates via a single random sampling of measurements and is computationally efficient. The comparative analysis of the IEEE 34-bus distribution system points to improved estimation results relative to the WLS-based method.

preprint2020arXiv

Understanding and adjusting the selection bias from a proof-of-concept study to a more confirmatory study

It has long been noticed that the efficacy observed in small early phase studies is generally better than that observed in later larger studies. Historically, the inflation of the efficacy results from early proof-of-concept studies is either ignored, or adjusted empirically using a frequentist or Bayesian approach. In this article, we systematically explained the underlying reason for the inflation of efficacy results in small early phase studies from the perspectives of measurement error models and selection bias. A systematic method was built to adjust the early phase study results from both frequentist and Bayesian perspectives. A hierarchical model was proposed to estimate the distribution of the efficacy for a portfolio of compounds, which can serve as the prior distribution for the Bayesian approach. We showed through theory that the systematic adjustment provides an unbiased estimator for the true mean efficacy for a portfolio of compounds. The adjustment was applied to paired data for the efficacy in early small and later larger studies for a set of compounds in diabetes and immunology. After the adjustment, the bias in the early phase small studies seems to be diminished.

preprint2020arXiv

What Can We Learn from the Travelers Data in Detecting Disease Outbreaks -- A Case Study of the COVID-19 Epidemic

Background: Travel is a potent force in the emergence of disease. We discussed how the traveler case reports could aid in a timely detection of a disease outbreak. Methods: Using the traveler data, we estimated a few indicators of the epidemic that affected decision making and policy, including the exponential growth rate, the doubling time, and the probability of severe cases exceeding the hospital capacity, in the initial phase of the COVID-19 epidemic in multiple countries. We imputed the arrival dates when they were missing. We compared the estimates from the traveler data to the ones from domestic data. We quantitatively evaluated the influence of each case report and knowing the arrival date on the estimation. Findings: We estimated the travel origin's daily exponential growth rate and examined the date from which the growth rate was consistently above 0.1 (equivalent to doubling time < 7 days). We found those dates were very close to the dates that critical decisions were made such as city lock-downs and national emergency announcement. Using only the traveler data, if the assumed epidemic start date was relatively accurate and the traveler sample was representative of the general population, the growth rate estimated from the traveler data was consistent with the domestic data. We also discussed situations that the traveler data could lead to biased estimates. From the data influence study, we found more recent travel cases had a larger influence on each day's estimate, and the influence of each case report got smaller as more cases became available. We provided the minimum number of exported cases needed to determine whether the local epidemic growth rate was above a certain level, and developed a user-friendly Shiny App to accommodate various scenarios.

preprint2019arXiv

Bayesian truncation errors in equations of state of nuclear matter with chiral nucleon-nucleon potentials

The truncation errors in equations of state (EOSs) of nuclear matter derived from the chiral nucleon-nucleon ($NN$) potentials at different expansion orders are analyzed by a Bayesian model. These EOSs are expanded as functions of a dimensionless parameter, $Q$, which is determined by Fermi momentum, $k_F$ and breakdown scale, $Λ_b$. The degree-of-belief (DoB) intervals predicted by the chiral effective field theory are calculated within the corresponding expansion coefficients and the specific prior probability distribution functions in terms of Bayes theorem. The truncation errors of EOSs, generated by the DoB intervals, exhibit good order-by-order convergences with different chiral expansion order potentials. When DoB is considered as $1σ$ credibility, i.e., $68.27\%$ confidence interval, the truncation errors of binding energy per nucleon in symmetric nuclear matter and pure neutron matter are consistent with the results given by a simple error analysis method proposed by Epelbaum, {\it et al.} Finally, the reasonable values of the breakdown scale $Λ_b$ in nuclear matter are discussed through consistency checks of Bayesian method based on the calculations of success rate.

preprint2019arXiv

The charge-dependent Bonn potentials with pseudovector pion-nucleon coupling

To apply the high-precision realistic nucleon-nucleon ($NN$) potentials on the investigations of relativistic many-body methods, the new versions of charge-dependent Bonn (CD-Bonn) $NN$ potential are constructed within the pseudovector pion-nucleon coupling instead of the pseudoscalar type in the original CD-Bonn potential worked out by Machleidt [Phys. Rev. C 63, 024001 (2001)]. Two effective scalar mesons are introduced, whose coupling constants with nucleon are independently determined at each partial wave for total angular momentum $J\leq 4$, to describe the charge dependence of $NN$ scattering data precisely, while the coupling constants between vector, pseudovector mesons and nucleon are identical in all channels. Three revised CD-Bonn potentials adopting the pseudovector pion-nucleon couplings (pvCD-Bonn) are generated by fitting the Nijmegen PWA phase shift data and deuteron binding energy with different pion-nucleon coupling strengths, which can reproduce the phase shifts at spin-single channels and low-energy $NN$ scattering parameters very well, and provide the significantly different mixing parameters at spin-triplet channels. Furthermore, the $D$-state probabilities of deuteron from these potentials range from $4.22\%$ to $6.05\%$. It demonstrates that these potentials contain different components of tensor force, which will be useful to discuss the roles of tensor force in nuclear few-body and many-body systems.

preprint2016arXiv

A centrosymmetric hexagonal magnet with superstable biskyrmion magnetic nanodomains in a wide temperature range of 100K to 340K

Here, we report on the experimental discovery of biskyrmion magnetic nanodomains at RT and the observation of a biskyrmion-derived topological Hall effect (THE) in the centrosymmetric hexagonal MnNiGa magnet. Using a phase reconstruction technique based on a transport-of-intensity equation (TIE), we established the texture of the biskyrmion spin. Results from Lorentz transmission electron microscopy (TEM) and the topological Hall effect revealed that the biskyrmion phase is stable over a much wider temperature range (100 K to ~340K) and a larger magnetic field range in our material than in skyrmion-hosting bulk crystals reported previously[1-3,6]. The super-wide operating temperature and the broad range of material options indicate important progress toward the realization of skyrmion-based spintronic devices.

preprint2016arXiv

A Fast Order-Based Approach for Core Maintenance

Graphs have been widely used in many applications such as social networks, collaboration networks, and biological networks. One important graph analytics is to explore cohesive subgraphs in a large graph. Among several cohesive subgraphs studied, k-core is one that can be computed in linear time for a static graph. Since graphs are evolving in real applications, in this paper, we study core maintenance which is to reduce the computational cost to compute k-cores for a graph when graphs are updated from time to time dynamically. We identify drawbacks of the existing efficient algorithm, which needs a large search space to find the vertices that need to be updated, and has high overhead to maintain the index built, when a graph is updated. We propose a new order-based approach to maintain an order, called k-order, among vertices, while a graph is updated. Our new algorithm can significantly outperform the state-of-the-art algorithm up to 3 orders of magnitude for the 11 large real graphs tested. We report our findings in this paper.

preprint2016arXiv

A geometric iterated function system on triangles

We construct a converging geometric iterated function system on the moduli space of ordered triangles, for which the involved functions have geometric meanings and contain a non-contraction map under the natural metric.

preprint2016arXiv

Approximate Nearest Neighbor Search on High Dimensional Data --- Experiments, Analyses, and Improvement (v1.0)

Approximate Nearest neighbor search (ANNS) is fundamental and essential operation in applications from many domains, such as databases, machine learning, multimedia, and computer vision. Although many algorithms have been continuously proposed in the literature in the above domains each year, there is no comprehensive evaluation and analysis of their performances. In this paper, we conduct a comprehensive experimental evaluation of many state-of-the-art methods for approximate nearest neighbor search. Our study (1) is cross-disciplinary (i.e., including 16 algorithms in different domains, and from practitioners) and (2) has evaluated a diverse range of settings, including 20 datasets, several evaluation metrics, and different query workloads. The experimental results are carefully reported and analyzed to understand the performance results. Furthermore, we propose a new method that achieves both high query efficiency and high recall empirically on majority of the datasets under a wide range of settings.

preprint2016arXiv

Computer Algebra Derivation of the Bias of Burg Estimators

A symbolic method is discussed which can be used to obtain the asymptotic bias and variance to order $O(1/n)$ for estimators in stationary time series. Using this method the bias to $O(1/n)$ of the Burg estimator in AR(1) and AR(2) models is shown to be equal to that of the least squares estimators in both the known and unknown mean cases. Previous researchers have only been able to obtain simulation results for this bias because this problem is too intractable without using computer algebra.

preprint2016arXiv

Developments in Maximum Likelihood Unit Root Tests

The exact maximum likelihood estimate (MLE) provides a test statistic for the unit root test that is more powerful \citep[p. 577]{Fuller96} than the usual least squares approach. In this paper a new derivation is given for the asymptotic distribution of this test statistic that is simpler and more direct than the previous method. The response surface regression method is used to obtain a fast algorithm that computes accurate finite-sample critical values. This algorithm is available in the R package {\tt mleur} that is available on CRAN. The empirical power of the new test is shown to be much better than the usual test not only in the normal case but also for innovations generated from an infinite variance stable distribution as well as for innovations generated from a GARCH$(1,1)$ process.

preprint2016arXiv

Dramatic increase of the onset critical temperature and critical field of elemental Sn in the form of thin nanowires

Sn is a well-known classical superconductor on the border between type I and type II with critical temperature of 3.722K and critical field of 0.031T. We show by means of specific heat and electric magneto-transport data that its critical parameters can be dramatically increased if it is brought in the form of loosely bound bundles of thin nanowires. The specific heat displays a pronounced double phase transition at 3.7K and 5.5K, which we attribute to the inner 'bulk' contribution of the nanowires and to the surface contribution, respectively. The latter is visible only because of the large volume fraction of the surface layer in relation to their bulk inner volume. The upper transition coincides with the onset of the resistive transition, while zero resistance is gradually approached below the lower transition. The large coherence length of 230nm at 0K likely actuates a Josephson coupling between adjacent neighboring nanowires and thus suppresses the effect of 1D phase fluctuations along the nanowires, and stabilizes 3D phase coherence throughout the entire network with zero resistance. A magnetic field of more than 3T is required to restore the normal state, which means that the critical field is enhanced by about two orders of magnitude with respect to Sn in its bulk form.

preprint2016arXiv

End-to-End Data Visualization by Metric Learning and Coordinate Transformation

This paper presents a deep nonlinear metric learning framework for data visualization on an image dataset. We propose the Triangular Similarity and prove its equivalence to the Cosine Similarity in measuring a data pair. Based on this novel similarity, a geometrically motivated loss function - the triangular loss - is then developed for optimizing a metric learning system comprising two identical CNNs. It is shown that this deep nonlinear system can be efficiently trained by a hybrid algorithm based on the conventional backpropagation algorithm. More interestingly, benefiting from classical manifold learning theories, the proposed system offers two different views to visualize the outputs, the second of which provides better classification results than the state-of-the-art methods in the visualizable spaces.

preprint2016arXiv

Faster ARMA maximum likelihood estimation

A new likelihood based AR approximation is given for ARMA models. The usual algorithms for the computation of the likelihood of an ARMA model require $O(n)$ flops per function evaluation. Using our new approximation, an algorithm is developed which requires only $O(1)$ flops in repeated likelihood evaluations. In most cases, the new algorithm gives results identical to or very close to the exact maximum likelihood estimate (MLE). This algorithm is easily implemented in high level Quantitative Programming Environments (QPEs) such as {\it Mathematica\/}, MatLab and R. In order to obtain reasonable speed, previous ARMA maximum likelihood algorithms are usually implemented in C or some other machine efficient language. With our algorithm it is easy to do maximum likelihood estimation for long time series directly in the QPE of your choice. The new algorithm is extended to obtain the MLE for the mean parameter. Simulation experiments which illustrate the effectiveness of the new algorithm are discussed. {\it Mathematica\/} and R packages which implement the algorithm discussed in this paper are available (McLeod and Zhang, 2007). Based on these package implementations, it is expected that the interested researcher would be able to implement this algorithm in other QPE's.

preprint2016arXiv

Fitting MA(q) Models in the Closed Invertible Region

The use of reparameterization in the maximization of the likelihood function of the MA(q) model is discussed. A general method for testing for the presence of a parameter estimate on the boundary of an MA(q) model is presented. This test is illustrated with a brief simulation experiment for the MA(q) for q=1,2,3,4 in which it is shown that the probability of an estimate being on the boundary increases with q.

preprint2016arXiv

On Multiplicative Integration with Recurrent Neural Networks

We introduce a general and simple structural design called Multiplicative Integration (MI) to improve recurrent neural networks (RNNs). MI changes the way in which information from difference sources flows and is integrated in the computational building block of an RNN, while introducing almost no extra parameters. The new structure can be easily embedded into many popular RNN models, including LSTMs and GRUs. We empirically analyze its learning behaviour and conduct evaluations on several tasks using different RNN models. Our experimental results demonstrate that Multiplicative Integration can provide a substantial performance boost over many of the existing RNN models.

preprint2016arXiv

Partial autocorrelation parameterization for subset autoregression

A new version of the partial autocorrelation plot and a new family of subset autoregressive models are introduced. A comprehensive approach to model identification, estimation and diagnostic checking is developed for these models. These models are better suited to efficient model building of high-order autoregressions with long time series. Several illustrative examples are given.

preprint2016arXiv

Performance of new 8-inch photomultiplier tube used for the Tibet muon-detector array

A new hybrid experiment has been constructed to measure the chemical composition of cosmic rays around the "knee" in the wide energy range by the Tibet AS$γ$ collaboration at Tibet, China, since 2014. They consist of a high-energy air-shower-core array (YAC-II), a high-density air-shower array (Tibet-III) and a large underground water-Cherenkov muon-detector array (MD). In order to obtain the primary proton, helium and iron spectra and their "knee" positions in the energy range lower than $10^{16}$ eV, each of PMTs equipped to the MD cell is required to measure the number of photons capable of covering a wide dynamic range of 100 - $10^{6}$ photoelectrons (PEs) according to Monte Carlo simulations. In this paper, we firstly compare the characteristic features between R5912-PMT made by Japan Hamamatsu and CR365-PMT made by Beijing Hamamatsu. This is the first comparison between R5912-PMT and CR365-PMT. If there exists no serious difference, we will then add two 8-inch-in-diameter PMTs to meet our requirements in each MD cell, which are responsible for the range of 100 - 10000 PEs and 2000 - 1000000 PEs, respectively. That is, MD cell is expected to be able to measure the number of muons over 6 orders of magnitude.

preprint2016arXiv

Professor Forcing: A New Algorithm for Training Recurrent Networks

The Teacher Forcing algorithm trains recurrent networks by supplying observed sequence values as inputs during training and using the network's own one-step-ahead predictions to do multi-step sampling. We introduce the Professor Forcing algorithm, which uses adversarial domain adaptation to encourage the dynamics of the recurrent network to be the same when training the network and when sampling from the network over multiple time steps. We apply Professor Forcing to language modeling, vocal synthesis on raw waveforms, handwriting generation, and image generation. Empirically we find that Professor Forcing acts as a regularizer, improving test likelihood on character level Penn Treebank and sequential MNIST. We also find that the model qualitatively improves samples, especially when sampling for a large number of time steps. This is supported by human evaluation of sample quality. Trade-offs between Professor Forcing and Scheduled Sampling are discussed. We produce T-SNEs showing that Professor Forcing successfully makes the dynamics of the network during training and sampling more similar.

preprint2016arXiv

Sensitivity of YAC to measure the light-component spectrum of primary cosmic rays at the "knee" energies

A new air-shower core-detector array (YAC : Yangbajing Air-shower Core-detector array) has been developed to measure the primary cosmic-ray composition at the "knee" energies in Tibet, China, focusing mainly on the light components. The prototype experiment (YAC-I) consisting of 16 detectors has been constructed and operated at Yangbajing (4300 m a.s.l.) in Tibet since May 2009. YAC-I is installed in the Tibet-III AS array and operates together. In this paper, we performed a Monte Carlo simulation to check the sensitivity of YAC-I+Tibet-III array to the cosmic-ray light component of cosmic rays around the knee energies, taking account of the observation conditions of actual YAC-I+Tibet-III array. The selection of light component from others was made by use of an artificial neural network (ANN). The simulation shows that the light-component spectrum estimated by our methods can well reproduce the input ones within 10\% error, and there will be about 30\% systematic errors mostly induced by the primary and interaction models used. It is found that the full-scale YAC and the Tibet-III array is powerful to study the cosmic-ray composition, in particular, to obtain the energy spectra of protons and helium nuclei around the knee energies.

preprint2016arXiv

Theano: A Python framework for fast computation of mathematical expressions

Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, multiple frameworks have been built on top of it and it has been used to produce many state-of-the-art machine learning models. The present article is structured as follows. Section I provides an overview of the Theano software and its community. Section II presents the principal features of Theano and how to use them, and compares them with other similar projects. Section III focuses on recently-introduced functionalities and improvements. Section IV compares the performance of Theano against Torch7 and TensorFlow on several machine learning models. Section V discusses current limitations of Theano and potential ways of improving it.

preprint2016arXiv

Top-k Spatial-keyword Publish/Subscribe Over Sliding Window

With the prevalence of social media and GPS-enabled devices, a massive amount of geo-textual data has been generated in a stream fashion, leading to a variety of applications such as location-based recommendation and information dissemination. In this paper, we investigate a novel real-time top-k monitoring problem over sliding window of streaming data; that is, we continuously maintain the top-k most relevant geo-textual messages (e.g., geo-tagged tweets) for a large number of spatial-keyword subscriptions (e.g., registered users interested in local events) simultaneously. To provide the most recent information under controllable memory cost, sliding window model is employed on the streaming geo-textual data. To the best of our knowledge, this is the first work to study top-k spatial-keyword publish/subscribe over sliding window. A novel centralized system, called Skype (Topk Spatial-keyword Publish/Subscribe), is proposed in this paper. In Skype, to continuously maintain top-k results for massive subscriptions, we devise a novel indexing structure upon subscriptions such that each incoming message can be immediately delivered on its arrival. To reduce the expensive top-k re-evaluation cost triggered by message expiration, we develop a novel cost-based k-skyband technique to reduce the number of re-evaluations in a cost-effective way. Extensive experiments verify the great efficiency and effectiveness of our proposed techniques. Furthermore, to support better scalability and higher throughput, we propose a distributed version of Skype, namely, DSkype, on top of Storm, which is a popular distributed stream processing system. With the help of fine-tuned subscription/message distribution mechanisms, DSkype can achieve orders of magnitude speed-up than its centralized version.

preprint2016arXiv

When Engagement Meets Similarity: Efficient (k,r)-Core Computation on Social Networks

In this paper, we investigate the problem of (k,r)-core which intends to find cohesive subgraphs on social networks considering both user engagement and similarity perspectives. In particular, we adopt the popular concept of k-core to guarantee the engagement of the users (vertices) in a group (subgraph) where each vertex in a (k,r)-core connects to at least k other vertices. Meanwhile, we also consider the pairwise similarity between users based on their profiles. For a given similarity metric and a similarity threshold r, the similarity between any two vertices in a (k,r)-core is ensured not less than r. Efficient algorithms are proposed to enumerate all maximal (k,r)-cores and find the maximum (k,r)-core, where both problems are shown to be NP-hard. Effective pruning techniques significantly reduce the search space of two algorithms and a novel (k,k')-core based (k,r)-core size upper bound enhances performance of the maximum (k,r)-core computation. We also devise effective search orders to accommodate the different nature of two mining algorithms. Comprehensive experiments on real-life data demonstrate that the maximal/maximum (k,r)-cores enable us to find interesting cohesive subgraphs, and performance of two mining algorithms is significantly improved by proposed techniques.

preprint2015arXiv

A Simple and General Problem and its Optimal Randomized Online Algorithm Design with Competitive Analysis

The online algorithm design was proposed to handle the caching problem when the future information is unknown. And currently, it draws more and more attentions from the researchers from the areas of microgrid, where the production of renewables are unpredictable. In this note, we present a framework of randomized online algorithm design for the \textit{simple and tractable} problem. This framework hopes to provide a tractable design to design a randomized online algorithm, which can be proved to achieve the best competitive ratio by \textit{Yao's Principle}.

preprint2015arXiv

Batch Normalized Recurrent Neural Networks

Recurrent Neural Networks (RNNs) are powerful models for sequential data that have the potential to learn long-term dependencies. However, they are computationally expensive to train and difficult to parallelize. Recent work has shown that normalizing intermediate representations of neural networks can significantly improve convergence rates in feedforward neural networks . In particular, batch normalization, which uses mini-batch statistics to standardize features, was shown to significantly reduce training time. In this paper, we show that applying batch normalization to the hidden-to-hidden transitions of our RNNs doesn't help the training procedure. We also show that when applied to the input-to-hidden transitions, batch normalization can lead to a faster convergence of the training criterion but doesn't seem to improve the generalization performance on both our language modelling and speech recognition tasks. All in all, applying batch normalization to RNNs turns out to be more challenging than applying it to feedforward networks, but certain variants of it can still be beneficial.

preprint2015arXiv

Combined Analysis of the CP Properties of The Higgs Boson in Effective Higgs Lagrangian

CP violation effects of the Higgs stem from not only CP-violation interactions but also an ambiguous defined CP state. The two CPV sources are coherently studied based on an effective Higgs Lagrangian. The constraints from unitarity limits for $WW$ and $ZZ$ scatterings are proposed to restrict Higgs couplings to weak gauge bosons. Five interesting cases are classified in terms of CPV sources to investigate the Higgs CP properties. The allowed ranges are shown from fitting results to the signal strengths of the Higgs measured by ATLAS and CMS.

preprint2015arXiv

Development of Yangbajing Air shower Core detector array for a new EAS hybrid Experiment

Aiming at the observation of cosmic-ray chemical composition at the "knee" energy region, we have been developinga new type air-shower core detector (YAC, Yangbajing Air shower Core detector array) to be set up at Yangbajing (90.522$^\circ$ E, 30.102$^\circ$ N, 4300 m above sea level, atmospheric depth: 606 g/m$^2$) in Tibet, China. YAC works together with the Tibet air-shower array (Tibet-III) and an underground water cherenkov muon detector array (MD) as a hybrid experiment. Each YAC detector unit consists of lead plates of 3.5 cm thick and a scintillation counter which detects the burst size induced by high energy particles in the air-shower cores. The burst size can be measured from 1 MIP (Minimum Ionization Particle) to $10^{6}$ MIPs. The first phase of this experiment, named "YAC-I", consists of 16 YAC detectors each having the size 40 cm $\times$ 50 cm and distributing in a grid with an effective area of 10 m$^{2}$. YAC-I is used to check hadronic interaction models. The second phase of the experiment, called "YAC-II", consists of 124 YAC detectors with coverage about 500 m$^2$. The inner 100 detectors of 80 cm $\times $ 50 cm each are deployed in a 10 $\times$ 10 matrix from with a 1.9 m separation and the outer 24 detectors of 100 cm $\times$ 50 cm each are distributed around them to reject non-core events whose shower cores are far from the YAC-II array. YAC-II is used to study the primary cosmic-ray composition, in particular, to obtain the energy spectra of proton, helium and iron nuclei between 5$\times$$10^{13}$ eV and $10^{16}$ eV covering the "knee" and also being connected with direct observations at energies around 100 TeV. We present the design and performance of YAC-II in this paper.

preprint2015arXiv

I/O Efficient Core Graph Decomposition at Web Scale

Core decomposition is a fundamental graph problem with a large number of applications. Most existing approaches for core decomposition assume that the graph is kept in memory of a machine. Nevertheless, many real-world graphs are big and may not reside in memory. In the literature, there is only one work for I/O efficient core decomposition that avoids loading the whole graph in memory. However, this approach is not scalable to handle big graphs because it cannot bound the memory size and may load most parts of the graph in memory. In addition, this approach can hardly handle graph updates. In this paper, we study I/O efficient core decomposition following a semi-external model, which only allows node information to be loaded in memory. This model works well in many web-scale graphs. We propose a semi-external algorithm and two optimized algorithms for I/O efficient core decomposition using very simple structures and data access model. To handle dynamic graph updates, we show that our algorithm can be naturally extended to handle edge deletion. We also propose an I/O efficient core maintenance algorithm to handle edge insertion, and an improved algorithm to further reduce I/O and CPU cost by investigating some new graph properties. We conduct extensive experiments on 12 real large graphs. Our optimal algorithm significantly outperform the existing I/O efficient algorithm in terms of both processing time and memory consumption. In many memory-resident graphs, our algorithms for both core decomposition and maintenance can even outperform the in-memory algorithm due to the simple structures and data access model used. Our algorithms are very scalable to handle web-scale graphs. As an example, we are the first to handle a web graph with 978.5 million nodes and 42.6 billion edges using less than 4.2 GB memory.

preprint2015arXiv

Peak-Aware Online Economic Dispatching for Microgrids

By employing local renewable energy sources and power generation units while connected to the central grid, microgrid can usher in great benefits in terms of cost efficiency, power reliability, and environmental awareness. Economic dispatching is a central problem in microgrid operation, which aims at effectively scheduling various energy sources to minimize the operating cost while satisfying the electricity demand. Designing intelligent economic dispatching strategies for microgrids, however, is drastically different from that for conventional central grids, due to two unique challenges. First, the erratic renewable energy emphasizes the need for online algorithms. Second, the widely-adopted peak-based pricing scheme brings out the need for new peak-aware strategy design. In this paper, we tackle these critical challenges and devise peak-aware online economic dispatching algorithms. For microgrids with fast-responding generators, we prove that our deterministic and randomized algorithms achieve the best possible competitive ratios $2-β$ and $e/(e-1+β)$, respectively, where $β\in[0,1]$ is the ratio between the minimum grid spot price and the local-generation price. Our results characterize the fundamental \emph{price of uncertainty} of the problem. For microgrids with slow-responding generators, we first show that a large competitive ratio is inevitable. Then we leverage limited prediction of electricity demand and renewable generation to improve the competitiveness of the algorithms. By extensive empirical evaluations using real-world traces, we show that our online algorithms achieve near offline-optimal performance. In a representative scenario, our algorithm achieves $17.5\%$ and $9.24\%$ cost reduction as compared to the case without local generation units and the case using peak-oblivious algorithms, respectively.

preprint2015arXiv

Polynomial automorphisms of C^n preserving the Markoff-Hurwitz polynomial

We study the action of the group of polynomial automorphisms of C^n (n>2) which preserve the Markoff-Hurwitz polynomial H(x):= x_1^2 + x_2^2 + ... + x_n^2 - x_1 x_2 ... x_n. Our main results include the determination of the group, the description of a non-empty open subset of C^n on which the group acts properly discontinuously (domain of discontinuity), and identities for the orbit of points in the domain of discontinuity.

preprint2015arXiv

Restricted Low-Rank Approximation via ADMM

The matrix low-rank approximation problem with additional convex constraints can find many applications and has been extensively studied before. However, this problem is shown to be nonconvex and NP-hard; most of the existing solutions are heuristic and application-dependent. In this paper, we show that, other than tons of application in current literature, this problem can be used to recover a feasible solution for SDP relaxation. By some sophisticated tricks, it can be equivalently posed in an appropriate form for the Alternating Direction Method of Multipliers (ADMM) to solve. The two updates of ADMM include the basic matrix low-rank approximation and projection onto a convex set. Different from the general non-convex problems, the sub-problems in each step of ADMM can be solved exactly and efficiently in spite of their non-convexity. Moreover, the algorithm will converge exponentially under proper conditions. The simulation results confirm its superiority over existing solutions. We believe that the results in this paper provide a useful tool for this important problem and will help to extend the application of ADMM to the non-convex regime.

preprint2015arXiv

Statistical Constraints on the Error of the Leptonic CP Violation of Neutrinos

A constraint on the error of leptonic CP violation, which require the phase $δ_{CP}$ to be less than $π/4$ for it to be distinguishable on a $2π$ cycle, is presented. Under this constraint, the effects of neutrino detector 's distance, beam energy, and energy resolution are discussed with reference to the present values of these parameters in experiments. Although an optimized detector performances can minimize the deviation to yield a larger distinguishable range of the leptonic CP phase on a $2π$ cycle, it is not possible to determine an arbitrary leptonic CP phase in the range of $2π$ with the statistics from a single detector because of the existence of two singular points. An efficiency factor $η$ is defined to characterize the distinguishable range of $δ_{CP}$. To cover the entire possible $δ_{CP}$ range, a combined efficiency factor $η^*$ corresponding to multiple sets of detection parameters with different neutrino beam energies and distances is proposed. The combined efficiency factors $η^*$ of various major experiments are also presented.

preprint2014arXiv

A new identity for SL(2,C)-characters of the once punctured torus group

We obtain new variations of the original McShane identity for those SL(2,C)-representations of the once punctured torus group which satisfy the Bowditch conditions, and also for those fixed up to conjugacy by an Anosov mapping class of the torus and satisfying the relative Bowditch conditions.

preprint2014arXiv

A new multivariate dependence measure based on comonotonicity

In this paper we introduce a new multivariate dependence measure based on comonotonicity by means of product moment which motivated by the recent papers of Koch and Schepper (ASTIN Bulletin 41 (2011) 191-213) and Dhaene et al. (Journal of Computational and Applied Mathematics 263 (2014) 78-87). Some differences and relations between the new dependence measure and other multivariate measures are an- alyzed. We also give several characteristics of this measure and estimations based on the definitions and its property are presented.

preprint2014arXiv

Application of signal processing techniques in the assessment of clinical risks in preterm infants

Preterm infants with very low birth weight suffer from a high risk of intra-ventricular hemorrhage(IVH) and other serious diseases. To improve the clinical risk assessment of preterm infants and develop potential clinically makers for the adverse outcome, the first part of the paper develops the frequency spectral analysis on the non-invasively measured heart rate variability, blood pressure variability and cerebral near-infrared spectroscopy measures. Moderate and high correlations with the clinical risk index for babies were identified from various spectral measures of arterial baroreflex and cerebral autoregulation functions. It was also observed that the cross-spectral transfer function analysis of cerebral NIRS and arterial blood pressure was able to provide a number of parameters that were potentially useful for distinguishing between preterm infants with or without IVH. Furthermore, the detrended fluctuation analysis that quantifies the fractal correlation properties of physiological signals has been examined, to determine whether it could derive markers for the identification of preterm infants with IVH. Cardiac output(CO) and total peripheral resistance(TPR) are two important parameters of the cardiovascular system. Measurement of these two parameters can provide valuable information for the assessment and management of patients needing intensive care, including preterm infants in the neonatal intensive care unit. To further assess the changes in CO and TPR in the preterm infants, the multivariate regression model based on the useful features from ABP method was used to improve the accuracy and robustness of the estimation. The combination of signal analysis and multivariate regression model in estimation of CO has produced some outcomes, and in the future, more effort should be involved in this kind of research to improve the prediction of serious diseases in preterm infants.

preprint2014arXiv

Effects of carbon nanotubes and graphene oxide absorbers on the noise of mode-locked fiber lasers

Phase noise is very important for the ultrafast pulse application in telecommunication, ultrafast diagnose, material science, and biology. In this paper, two types of carbon nano-materials, single-wall carbon nanotube and graphene oxide, are investigated for noise suppression in ultrafast photonics. Various properties of the wall-paper SAs, such as saturable intensity, optical absorption and degree of purity, are found to be key factors determining the phase noise of the ultrafast pulses. A reduced-noise femtosecond fiber laser is experimentally demonstrated by optimizing the above parameters of carbon material based SAs. The phase noise reduction more than 10 dB at 10 kHz can be obtained in the experiments. To our knowledge, this is the first time that the relationship between different carbon material based SAs and the phase noise of mode-locked lasers has been investigated. This work will pave the way to get a high-quality ultrashort pulse in passively mode-locked fiber lasers.

preprint2014arXiv

Higgsphobic and fermiophobic Z' as a single dark matter candidate

A spin-1 Z' particle as a single dark matter candidate is investigated by assuming that it does not directly couple to the Higgs boson and standard model fermions and does not mix with the photon and Z boson. The remaining dominant vertices are quartic Z'Z'ZZ and Z'Z'W+W-, which can induce effective Z'Z'q\bar{q} couplings through standard-model gauge-boson loops. We discuss constraints from the cosmological thermal relic density, and direct and indirect-detection experiments, and find that a dark Z' can only exist above the W boson mass threshold, and the effective quartic coupling of Z'Z'VV is bounded in the region of 10^{-3}~10^{-2}.

preprint2014arXiv

Presentations for quaternionic $S$-unit groups

The purpose of this paper is to give presentations for projective $S$-unit groups of the Hurwitz order in Hamilton's quaternions over the rational field $\mathbb{Q}$. To our knowledge, this provides the first explicit presentations of an $S$-arithmetic lattice in a semisimple Lie group with $S$ large. In particular, we give presentations for groups acting irreducibly and cocompactly on a product of Bruhat--Tits trees. We also include some discussion and experimentation related to the congruence subgroup problem, which is open when $S$ contains at least two odd primes. In the appendix, we provide code that allows the reader to compute presentations for an arbitrary finite set $S$.

preprint2014arXiv

Symplectic Cohomology and the Stability of J-Anti-Invariant Cohomology

In this paper, we investigate the relationship between J-anti-invariant cohomology of a closed symplectic 4-manifold introduced by T.-J. Li and W. Zhang and new symplectic cohomologies introduced by L.-S. Tseng and S.-T. Yau. We also prove that the dimension of J-anti-invariant cohomology is constant for almost structures J which are compatible with a fixed symplectic form.

preprint2014arXiv

Virtual Machine Migration Planning in Software-Defined Networks

In this paper, we examine the problem of how to schedule the migrations and how to allocate network resources for migration when multiple VMs need to be migrated at the same time. We consider the problem in the Software-defined Network (SDN) context since it provides flexible control on routing. More specifically, we propose a method that computes the optimal migration sequence and network bandwidth used for each migration. We formulate this problem as a mixed integer programming, which is NP-hard. To make it computationally feasible for large scale data centers, we propose an approximation scheme via linear approximation plus fully polynomial time approximation, and obtain its theoretical performance bound. Through extensive simulations, we demonstrate that our fully polynomial time approximation (FPTA) algorithm has a good performance compared with the optimal solution and two state of-the-art algorithms. That is, our proposed FPTA algorithm approaches to the optimal solution with less than 10% variation and much less computation time. Meanwhile, it reduces the total migration time and the service downtime by up to 40% and 20% compared with the state-of-the-art algorithms, respectively.

preprint2013arXiv

A Monte Carlo study to check the hadronic interaction models by a new EAS hybrid experiment in Tibet

A new EAS hybrid experiment has been designed by constructing a YAC (Yangbajing Air shower Core) detector array inside the existing Tibet-III air shower array. The first step of YAC, called "YAC-I", consists of 16 plastic scintillator units (4 rows times 4 columns) each with an area of 40 cm * 50 cm which is used to check hadronic interaction models used in AS simulations. A Monte Carlo study shows that YAC-I can record high energy electromagnetic component in the core region of air showers induced by primary particles of several tens TeV energies where the primary composition is directly measured by space experiments. It may provide a direct check of the hadronic interaction models currently used in the air shower simulations in the corresponding energy region. In present paper, the method of the observation and the sensitivity of the characteristics of the observed events to the different interaction models are discussed.

preprint2013arXiv

A Monte Carlo study to measure the energy spectra of the primary cosmic-ray components at the knee using a new Tibet AS core detector array

A new hybrid experiment has been started by ASγ experiment at Tibet, China, since August 2011, which consists of a low threshold burst-detector-grid (YAC-II, Yangbajing Air shower Core array), the Tibet air-shower array (Tibet-III) and a large underground water Cherenkov muon detector (MD). In this paper, the capability of the measurement of the chemical components (proton, helium and iron) with use of the (Tibet-III+YAC-II) is investigated by means of an extensive Monte Carlo simulation in which the secondary particles are propagated through the (Tibet-III+YAC-II) array and an artificial neural network (ANN) method is applied for the primary mass separation. Our simulation shows that the new installation is powerful to study the chemical compositions, in particular, to obtain the primary energy spectrum of the major component at the knee.

preprint2013arXiv

Engineering Topological Surface States and Giant Rashba Spin Splitting in BiTeI/Bi$_2$Te$_3$ Heterostructures

The search for strongly inversion asymmetric topological insulators is an active research field because these materials possess distinct properties compared with the inversion symmetric ones. In particular, it is desirable to realize a large Rashba spin-splitting (RSS) in such materials, which combined with the topological surface states (TSS) could lead to useful spintronics applications. In this report, based on first principles calculations, we predict that the heterostructure of BiTeI/Bi$_{2}$Te$_{3}$ is a strong topological insulator with a giant RSS. The coexistence of TSS and RSS in the current system is native and stable. More importantly, we find that both the $\mathbb{Z}_{2}$ invariants and the Rashba energy can be controlled by engineering the layer geometries of the heterostructure, and the Rashba energy can be made even larger than that of bulk BiTeI. Our work opens a new route for designing topological spintronics devices based on inversion asymmetric heterostructures.

preprint2013arXiv

On cohomology of almost complex 4-manifolds

Based on recent work of T. Draghici, T.-J. Li and W. Zhang, we further investigate properties of the dimension h_J of the J-anti-invariant cohomology subgroup H_J of a closed almost Hermitian 4-manifold (M, g, J, F) using metric compatible and symplectic 2-form compatible almost complex structures. We prove that h_J = 0 for generic almost complex structures J on M.

preprint2013arXiv

Probe of the Solar Magnetic Field Using the "Cosmic-Ray Shadow" of the Sun

We report on a clear solar-cycle variation of the Sun's shadow in the 10 TeV cosmic-ray flux observed by the Tibet air shower array during a full solar cycle from 1996 to 2009. In order to clarify the physical implications of the observed solar cycle variation, we develop numerical simulations of the Sun's shadow, using the Potential Field Source Surface (PFSS) model and the Current Sheet Source Surface (CSSS) model for the coronal magnetic field. We find that the intensity deficit in the simulated Sun's shadow is very sensitive to the coronal magnetic field structure, and the observed variation of the Sun's shadow is better reproduced by the CSSS model. This is the first successful attempt to evaluate the coronal magnetic field models by using the Sun's shadow observed in the TeV cosmic-ray flux.

preprint2013arXiv

Pseudospin symmetry: Recent progress with supersymmetric quantum mechanics

It is an interesting and open problem to trace the origin of the pseudospin symmetry in nuclear single-particle spectra and its symmetry breaking mechanism in actual nuclei. In this report, we mainly focus on our recent progress on this topic by combining the similarity renormalization group technique, supersymmetric quantum mechanics, and perturbation theory. We found that it is a promising direction to understand the pseudospin symmetry in a quantitative way.

preprint2013arXiv

Survival of New Physics: An Anomaly-free Neutral Gauge Boson at the LHC

An anomaly-free $U(1)'$ effective Lagrangian as the most common new physics beyond the standard model is proposed to survey the maximal parameter space constrained by electroweak precise measurements at the LEP and direct detection in dilepton decay channel at the LHC at $\sqrt{s}=7$ TeV. By the global fit of effective couplings of $Z$ boson to the SM fermions, $Δ_{11},Δ_{21},g_2Δ_{31}$ related to mixings and $r$ related to $U(1)'$ charge assignment are bounded. The allowed areas are plotted in the not only $r$-$g_2$ but also $m_{Z'}$-$g_2$ planes, which show that a sub-TeV $Z'$ is still permissible as long as the coupling $g_2\sim 0.01$. The results provides a prime requirement to an extra $U(1)'$ gauge boson and hinds the direction of possible new physics beyond the standard model. The possible signal in dilepton decay channel at LHC at $\sqrt{s}=14$ TeV is also provided.

preprint2013arXiv

Tailoring Magnetic Doping in the Topological Insulator Bi2Se3

We theoretically investigate the possibility of establishing ferromagnetism in the topological insulator Bi2Se3 via magnetic doping of 3d transition metal elements. The formation energies, charge states, band structures, and magnetic properties of doped Bi2Se3 are studied using first-principles calculations within density functional theory. Our results show that Bi substitutional sites are energetically more favorable than interstitial sites for single impurities. Detailed electronic structure analysis reveals that Cr and Fe doped materials are still insulating in the bulk but the intrinsic band gap of Bi2Se3 is substantially reduced due to the strong hybridization between the d states of the dopants and the p states of the neighboring Se atoms. The calculated magnetic coupling suggests that Cr doped Bi2Se3 is possible to be both ferromagnetic and insulating, while Fe doped Bi2Se3 tends to be weakly antiferromagnetic.

preprint2012arXiv

Every Binary Self-Dual Code Arises From Hilbert Symbols

In this paper we construct binary self-dual codes using the étale cohomology of $\mathbb{Z}/2$ on the spectra of rings of $S$-integers of global fields. We will show that up to equivalence, all self-dual codes of length at least 4 arise from Hilbert pairings on rings of $S$-integers of $\Q$. This is an arithmetic counterpart of a result of Kreck and Puppe, who used cobordism theory to show that all self-dual codes arise from Poincaré duality on real three manifolds.

preprint2012arXiv

Partially monotone tensor spline estimation of the joint distribution function with bivariate current status data

The analysis of the joint cumulative distribution function (CDF) with bivariate event time data is a challenging problem both theoretically and numerically. This paper develops a tensor spline-based sieve maximum likelihood estimation method to estimate the joint CDF with bivariate current status data. The I-splines are used to approximate the joint CDF in order to simplify the numerical computation of a constrained maximum likelihood estimation problem. The generalized gradient projection algorithm is used to compute the constrained optimization problem. Based on the properties of B-spline basis functions it is shown that the proposed tensor spline-based nonparametric sieve maximum likelihood estimator is consistent with a rate of convergence potentially better than $n^{1/3}$ under some mild regularity conditions. The simulation studies with moderate sample sizes are carried out to demonstrate that the finite sample performance of the proposed estimator is generally satisfactory.

preprint2012arXiv

The Global Electroweak Fit and its Implication to Z-prime

Among the $Z$-pole observables, $A_{FB}^{(0,b)}$ and $A_e$ suffer moderately-large standard deviations from the Standard Model predictions. Fine-tuning of the unknown Higgs mass only reduces the deviation of one of them at the expense of increasing the deviation of the other observable. If we take this fact seriously, the result can be interpreted as independent experimental evidence of existing new physics beyond SM, even if a 125GeV Higgs on the LHC is finally confirmed to be SM Higgs. We show in this paper that the existence of a $Z'$ boson mixing with $Z$ and $γ$ and with generation-dependent anomaly-free charge assignments, helps to suppress $A_{FB}^{(0,b)}$ and $A_e$ at $Z$-pole simultaneously and dose reduce the largest deviation from $2.7σ$ in SM predictions to $1.2σ$ in our scenario. The global electroweak fit does not prefer $Z'$ coupling to the second-generation quarks and the first-generation left-handed lepton and right-handed electron. The fitting result also supports a negative contribution to the $S$ parameter from the $Z'$ boson.

preprint2012arXiv

Thermodynamic Principles in Social Collaborations

A thermodynamic framework is presented to characterize the evolution of efficiency, order, and quality in social content production systems, and this framework is applied to the analysis of Wikipedia. Contributing editors are characterized by their (creative) energy levels in terms of number of edits. We develop a definition of entropy that can be used to analyze the efficiency of the system as a whole, and relate it to the evolution of power-law distributions and a metric of quality. The concept is applied to the analysis of eight years of Wikipedia editing data and results show that (1) Wikipedia has become more efficient during its evolution and (2) the entropy-based efficiency metric has high correlation with observed readership of Wikipedia pages.

preprint2011arXiv

$Z'$ Phenomenology: A Model-independent Analysis and Fit in Combination of Chiral Effective Theory and Anomaly Cancellation

To investigate $Z'$ phenomenology model-independent, we combine chiral effective theory with anomaly cancellation conditions without any other model input. We focus on $Z'$ mixings with $γ-Z$ in both mass and kinetic parts and calculate contributions to oblique $S,T,U$. The three sets of anomaly-free fermion $U(1)'$ charges parameterize the $Z'$ interactions with fermions. The cancellation of the $[U(1)']^3$ anomaly and mixing gravitational-gauge anomaly determines the number of right-handed neutrinos. We also find a novel relation between the charge assignments and Stueckelberg coupling in terms of the renormalized electromagnetic current. A global fit to the electroweak precise observables shows that typical values for the mixing parameters are of order $10^{-3}$. In spite of this strict limit, we obtain a negative $S$ parameter contribution.

preprint2011arXiv

Delambre-Gauss Formulas for Augmented, Right-Angled Hexagons in Hyperbolic 4-Space

We study the geometry of oriented right-angled hexagons in H^4, the hyperbolic 4-space, via Clifford numbers or quaternions. We show how to augment alternate sides of such a hexagon so that for the non-augmented sides, we can define quaternion half side-lengths whose angular parts are obtained from half the Euler angles associated to a certain orientation-preserving isometry of the Euclidean 3-space. This generalizes the complex half side-lengths of oriented right-angled hexagons in H^3. We also define appropriate complex half side-lengths for the augmented sides of the hexagon. We further explain how to geometrically read off the quaternion half side-lengths for a given oriented,augmented, right-angled hexagon in H^4. Our main result is a set of generalized Delambre-Gauss formulas for oriented, augmented, right-angled hexagons in H^4, involving the quaternion half side-lengths and the complex half side-lengths. We also show in the appendix how the same method gives Delambre-Gauss formulas for oriented right-angled hexagons in H^3, from which the well-known sine and cosine laws can be deduced. These formulas generalize the classical Delambre-Gauss formulas for spherical/hyperbolic triangles.

preprint2011arXiv

Parameterization of General Z-gamma-Z' Mixings in an Electroweak Chiral Theory

A new general parameterization with eight mixing parameters among Z, gamma and an extra neutral gauge boson Z' is proposed and subjected to phenomenological analysis. We show that in addition to the conventional Weinberg angle theta_W, there are seven other phenomenological parameters G', xi, eta, theta_l, theta_r, r, l for the most general Z-gamma-Z' mixings, in which parameter G' arises due to the presence of an extra Stueckelberg-type mass coupling. Combined with the conventional Z-Z' mass mixing angle theta', the remaining six parameters xi, eta, theta_l-theta', theta_r-theta', r, l are caused by general kinetic mixings. In all the eight phenomenological parameters theta_W, G', xi, eta, theta_l, theta_r, r, l, we can determine the Z-Z' mass mixing angle theta' and the mass ratio M_Z/M_Z'. The Z-gamma-Z' mixings we discuss are based on the model-independent description of the extended electroweak chiral Lagrangian (EWCL) previous proposed by us. In addition, we show that there are eight corresponding independent theoretical coefficients in our EWCL which are fully fixed by our eight phenomenological mixing parameters. We further find that the experimental measurability of these eight parameters does not rely on the extended neutral current for Z', but depends on the Z-Z' mass ratio.

preprint2011arXiv

Perturbative interpretation of relativistic symmetries in nuclei

Perturbation theory is used systematically to investigate the symmetries of the Dirac Hamiltonian and their breaking in atomic nuclei. Using the perturbation corrections to the single-particle energies and wave functions, the link between the single-particle states in realistic nuclei and their counterparts in the symmetry limits is discussed. It is shown that the limit of S-V=const and relativistic harmonic oscillator (RHO) potentials can be connected to the actual Dirac Hamiltonian by the perturbation method, while the limit of S+V=const cannot, where S and V are the scalar and vector potentials, respectively. This indicates that the realistic system can be treated as a perturbation of spin-symmetric Hamiltonians, and the energy splitting of the pseudospin doublets can be regarded as a result of small perturbation around the Hamiltonian with RHO potentials, where the pseudospin doublets are quasidegenerate.

preprint2011arXiv

Representations of Finite Dimensional Pointed Hopf Algebras over $\mathbb{z}_n$

In this paper, we study the representations of the new finite-dimensional pointed Hopf algebras in positive characteristic given in \cite{Cib09}. We find that these Hopf algebras are symmetric algebras. We determine the simple modules and their projective covers over these Hopf algebras. We show that these Hopf algebras are of wild representation type.

preprint2010arXiv

Half-Heusler Topological Insulators: A First-Principle Study with the Tran-Blaha Modified Becke-Johnson Density Functional

We systematically investigate the topological band structures of half-Heusler compounds using first-principles calculations. The modified Becke-Johnson exchange potential together with local density approximation for the correlation potential (MBJLDA) has been used here to obtain accurate band inversion strength and band order. Our results show that a large number of half-Heusler compounds are candidates for three-dimensional topological insulators. The difference between band structures obtained using the local density approximation (LDA) and MBJLDA potential is also discussed.

preprint2010arXiv

On Temporal Variations of the Multi-TeV Cosmic Ray Anisotropy using the Tibet III Air Shower Array

We analyze the large-scale two-dimensional sidereal anisotropy of multi-TeV cosmic rays by Tibet Air Shower Array, with the data taken from 1999 November to 2008 December. To explore temporal variations of the anisotropy, the data set is divided into nine intervals, each in a time span of about one year. The sidereal anisotropy of magnitude about 0.1% appears fairly stable from year to year over the entire observation period of nine years. This indicates that the anisotropy of TeV Galactic cosmic rays remains insensitive to solar activities since the observation period covers more than a half of the 23rd solar cycle.

preprint2010arXiv

Persistent contribution of unbound quasiparticles to the pair correlation in continuum Skyrme-Hartree-Fock-Bogoliubov approach

The neutron pair correlation in nuclei near the neutron drip-line is investigated using the selfconsistent continuum Skyrme-Hartree-Fock-Bogoliubov theory formulated with the coordinate-space Green's function technique. Numerical analysis is performed for even-even N = 86 isotones in the Mo-Sn region, where the 3p3/2 and 3p1/2 orbits lying near the Fermi energy are either weakly bound or unbound. The quasiparticle states originating from the l = 1 orbits form resonances with large widths, which are due to the low barrier height and the strong continuum coupling caused by the pair potential. Analyzing in detail the pairing properties and roles of the quasiparticle resonances, we found that the l = 1 broad quasiparticle resonances persist to feel the pair potential and contribute to the pair correlation even when their widths are comparable with the resonance energy.

preprint2009arXiv

Avoid the Tsunami of the Dirac sea in the Imaginary Time Step method

The discrete single-particle spectra in both the Fermi and Dirac sea have been calculated by the imaginary time step (ITS) method for the Schrödinger-like equation after avoiding the "tsunami" of the Dirac sea, i.e., the diving behavior of the single-particle level into the Dirac sea in the direct application of the ITS method for the Dirac equation. It is found that by the transform from the Dirac equation to the Schrödinger-like equation, the single-particle spectra, which extend from the positive to the negative infinity, can be separately obtained by the ITS evolution in either the Fermi sea or the Dirac sea. Identical results with those in the conventional shooting method have been obtained via the ITS evolution for the equivalent Schrödinger-like equation, which demonstrates the feasibility, practicality and reliability of the present algorithm and dispels the doubts on the ITS method in the relativistic system.

preprint2009arXiv

Computation of the p6 order chiral Lagrangian coefficients from the underlying theory of QCD

We present results of computing the p6 order low energy constants in the normal part of chiral Lagrangian both for two and three flavor pseudo-scalar mesons. This is a generalization of our previous work on calculating the p4 order coefficients of the chiral Lagrangian in terms of the quark self energy Sigma(p2) approximately from QCD. We show that most of our results are consistent with those we can find in the literature.

preprint2009arXiv

Observation of TeV Gamma Rays from the Fermi Bright Galactic Sources with the Tibet Air Shower Array

Using the Tibet-III air shower array, we search for TeV gamma-rays from 27 potential Galactic sources in the early list of bright sources obtained by the Fermi Large Area Telescope at energies above 100 MeV. Among them, we observe 7 sources instead of the expected 0.61 sources at a significance of 2 sigma or more excess. The chance probability from Poisson statistics would be estimated to be 3.8 x 10^-6. If the excess distribution observed by the Tibet-III array has a density gradient toward the Galactic plane, the expected number of sources may be enhanced in chance association. Then, the chance probability rises slightly, to 1.2 x 10^-5, based on a simple Monte Carlo simulation. These low chance probabilities clearly show that the Fermi bright Galactic sources have statistically significant correlations with TeV gamma-ray excesses. We also find that all 7 sources are associated with pulsars, and 6 of them are coincident with sources detected by the Milagro experiment at a significance of 3 sigma or more at the representative energy of 35 TeV. The significance maps observed by the Tibet-III air shower array around the Fermi sources, which are coincident with the Milagro >=3sigma sources, are consistent with the Milagro observations. This is the first result of the northern sky survey of the Fermi bright Galactic sources in the TeV region.

preprint2009arXiv

Solving the Dirac equation with nonlocal potential by Imaginary Time Step method

The Imaginary Time Step (ITS) method is applied to solve the Dirac equation with the nonlocal potential in coordinate space by the ITS evolution for the corresponding Schrödinger-like equation for the upper component. It is demonstrated that the ITS evolution can be equivalently performed for the Schrödinger-like equation with or without localization. The latter algorithm is recommended in the application for the reason of simplicity and efficiency. The feasibility and reliability of this algorithm are also illustrated by taking the nucleus $^{16}$O as an example, where the same results as the shooting method for the Dirac equation with localized effective potentials are obtained.

preprint2007arXiv

Gauge and Lorentz Covariant Schwinger-Dyson Equation for Fermion Propagator in Arbitrary External Gauge Field

A formalism of gauge and Lorentz covariant Schwinger-Dyson equation is built up for fermion propagator in presence of arbitrary external gauge field within ladder approximation. Different type external electromagnetic field dependent trial solutions are investigated which spontaneously breaking chiral symmetry.

Ying Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

130 published item(s)

1/2 order convergence rate of Euler-type methods for time-changed stochastic differential equations with super-linearly growing drift and diffusion coefficients

Beyond "I cannot fulfill this request": Alleviating Rigid Rejection in LLMs via Label Enhancement

Beyond Immediate Activation: Temporally Decoupled Backdoor Attacks on Time Series Forecasting

EXG: Self-Evolving Agents with Experience Graphs

HyperJoin: LLM-augmented Hypergraph Link Prediction for Joinable Table Discovery

MedVL-SAM2: A unified 3D medical vision-language model for multimodal reasoning and prompt-driven segmentation

Overeager Coding Agents: Measuring Out-of-Scope Actions on Benign Tasks

Quantum Latin squares of order $6m$ with all possible cardinalities

Why is the $Z_c(3900)$ absent in the $h_cπ$ final state?

Analyzing Communication Predictability in LLM Training

Audio2Gestures: Generating Diverse Gestures from Audio

Illegal Intelligent Reflecting Surface Based Active Channel Aging: When Jammer Can Attack Without Power and CSI

%CRTFASTGEEPWR: a SAS macro for power of the generalized estimating equations of multi-period cluster randomized trials with application to stepped wedge designs

AFD Types Sparse Representations vs. the Karhunen-Loeve Expansion for Decomposing Stochastic Processes

Carbide: Highly Reliable Networks Through Real-Time Multiple Control Plane Composition

Continuum Skyrme Hartree-Fock-Bogoliubov theory with Green's function method for neutron-rich Ca, Ni, Zr, Sn isotopes

Detailed Balance for Particle Models of Reversible Reactions in Bounded Domains

Emotional Conversation Generation with Heterogeneous Graph Neural Network

Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review

Example-Based Vulnerability Detection and Repair in Java Code

Exploiting full Resolution Feature Context for Liver Tumor and Vessel Segmentation via Integrate Framework: Application to Liver Tumor and Vessel 3D Reconstruction under embedded microprocessor

Hot 2DHG states in tellurium

Marginal Regression on Transient State Occupation Probabilities with Clustered Multistate Process Data

Mix-up Self-Supervised Learning for Contrast-agnostic Applications

Multi-grained Label Refinement Network with Dependency Structures for Joint Intent Detection and Slot Filling

Probabilistic Models for Manufacturing Lead Times

Reinforcement Learning Based Query Vertex Ordering Model for Subgraph Matching

Semi-supervised segmentation of tooth from 3D Scanned Dental Arches

Sparse Representations of Solutions to a class of Random Boundary Value Problems

Testing for Treatment Effect Twice Using Internal and External Controls in Clinical Trials

The Hadron-Quark Crossover in Neutron Star within Gaussian Process Regression Method

The hyperonic star in relativistic mean-field model

The lineshape of the compact fully heavy tetraquark

The Minimal Flavor Structure from Decomposition of the Fermion Mass Matrix

The nuclear symmetry energy from relativistic Brueckner-Hartree-Fock model

Towards Higher-order Topological Consistency for Unsupervised Network Alignment

Towards Real-Time Counting Shortest Cycles on Dynamic Graphs: A Hub Labeling Approach

Traffic Context Aware Data Augmentation for Rare Object Detection in Autonomous Driving

A Unified Yukawa Interaction for the Standard Model of Quarks and Leptons

Adaptive Load Shedding for Grid Emergency Control via Deep Reinforcement Learning

Consensus-Aware Visual-Semantic Embedding for Image-Text Matching

Data-Driven Vulnerability Detection and Repair in Java Code

Detected the steerability bounds of the generalized Werner states via BackPropagation neural network

On stochastic gradient Langevin dynamics with dependent data streams: the fully non-convex case

Quantifying measurement-induced nonbilocal correlation

Similarity Reasoning and Filtration for Image-Text Matching

The $ΞN$ interaction constrained by recent $Ξ^-$ hypernuclei experiments

A fully data-driven approach to minimizing CVaR for portfolio of assets via SGLD with discontinuous updating

A Unified Approach to Scalable Spectral Sparsification of Directed Graphs

AOT: Pushing the Efficiency Boundary of Main-memory Triangle Listing

Attack Identification and Correction for PMU GPS Spoofing in Unbalanced Distribution Systems

Deep Reinforcement Learning Based Volt-VAR Optimization in Smart Distribution Systems

Effects of symmetry energy on the radius and tidal deformability of neutron stars in relativistic mean-field model

Efficient Bitruss Decomposition for Large-scale Bipartite Graphs

Efficient Matrix Factorization on Heterogeneous CPU-GPU Systems

Exploring Cohesive Subgraphs with Vertex Engagement and Tie Strength in Bipartite Graphs

Graph-based Faulted Line Identification Using Micro-PMU Data in Distribution Systems

Graph3S: A Simple, Speedy and Scalable Distributed Graph Processing System

Interval State Estimation with Uncertainty of Distributed Generation and Line Parameters in Unbalanced Distribution Systems

Machine learning on DNA-encoded libraries: A new paradigm for hit-finding

Properties of neutron star described by a relativistic $ab~ initio$ model

Semiparametric regression and risk prediction with competing risks data under missing cause of failure

SF-GRASS: Solver-Free Graph Spectral Sparsification

The common origin of the family mass hierarchy and CP violation from flavour-dependent vacuum for quarks and leptons

The one-pion-exchange potential with contact terms from lattice QCD simulations

Towards Highly Efficient State Estimation with Nonlinear Measurements in Distribution Systems

Understanding and adjusting the selection bias from a proof-of-concept study to a more confirmatory study

What Can We Learn from the Travelers Data in Detecting Disease Outbreaks -- A Case Study of the COVID-19 Epidemic

Bayesian truncation errors in equations of state of nuclear matter with chiral nucleon-nucleon potentials

The charge-dependent Bonn potentials with pseudovector pion-nucleon coupling

A centrosymmetric hexagonal magnet with superstable biskyrmion magnetic nanodomains in a wide temperature range of 100K to 340K

A Fast Order-Based Approach for Core Maintenance

A geometric iterated function system on triangles

Approximate Nearest Neighbor Search on High Dimensional Data --- Experiments, Analyses, and Improvement (v1.0)