Source author record

Lu Zhang

Lu Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Catalog footprint

What is connected

88 works

47 topics

4 close collaborators


preprint 2026 arXiv

Retrieving Any Relevant Moments: Benchmark and Models for Generalized Moment Retrieval

Video Moment Retrieval (VMR) aims to localize temporal segments in videos that correspond to a natural language query, but typically assumes only a single matching moment for each query. This assumption does not always hold in real-world scenarios, where queries may correspond to multiple or no moments. Thus, we formulate Generalized Moment Retrieval (GMR), a unified setting that requires retrieving the complete set of relevant moments or predicting an empty set. To enable systematic study of GMR, we introduce Soccer-GMR, a large-scale benchmark built on challenging soccer videos that reflect general GMR scenarios, with realistic negative and positive queries. The benchmark is constructed via a duration-flexible semi-automated pipeline with human verification, enabling scalable data generation while maintaining high annotation quality. We further design a unified evaluation protocol with complementary metrics tailored for null-set rejection, positive-query localization, and end-to-end GMR performance. Finally, we establish strong baselines across two modeling paradigms: a lightweight plug-and-play GMR adapter for discriminative VMR models, and a GMR-tailored GRPO reward for fine-tuning multimodal large language models (MLLMs). Extensive experiments show consistent gains across all metrics and expose key limitations of current methods, positioning GMR as a more realistic and challenging benchmark for video-language understanding.

preprint 2025 arXiv

Daily Land Surface Temperature Reconstruction in Landsat Cross-Track Areas Using Deep Ensemble Learning With Uncertainty Quantification

Many real-world applications rely on land surface temperature (LST) data at high spatiotemporal resolution. In complex urban areas, LST exhibits significant variations, fluctuating dramatically within and across city blocks. Landsat provides high spatial resolution data at 100 meters but is limited by long revisit time, with cloud cover further disrupting data collection. Here, we propose DELAG, a deep ensemble learning method that integrates annual temperature cycles and Gaussian processes, to reconstruct Landsat LST in complex urban areas. Leveraging the cross-track characteristics and dual-satellite operation of Landsat since 2021, we further enhance data availability to 4 scenes every 16 days. We select New York City, London and Hong Kong from three different continents as study areas. Experiments show that DELAG successfully reconstructed LST in the three cities under clear-sky (RMSE = 0.73-0.96 K) and heavily-cloudy (RMSE = 0.84-1.62 K) situations, superior to existing methods. Additionally, DELAG can quantify uncertainty that enhances LST reconstruction reliability. We further tested the reconstructed LST to estimate near-surface air temperature, achieving results (RMSE = 1.48-2.11 K) comparable to those derived from clear-sky LST (RMSE = 1.63-2.02 K). The results demonstrate the successful reconstruction through DELAG and highlight the broader applications of LST reconstruction for estimating accurate air temperature. Our study thus provides a novel and practical method for Landsat LST reconstruction, particularly suited for complex urban areas within Landsat cross-track areas, taking one step toward addressing complex climate events at high spatiotemporal resolution. Code and data are available at https://skrisliu.com/delag

preprint 2023 arXiv

Fixed-Domain Asymptotics Under Vecchia's Approximation of Spatial Process Likelihoods

Statistical modeling for massive spatial data sets has generated a substantial literature on scalable spatial processes based upon Vecchia's approximation. Vecchia's approximation for Gaussian process models enables fast evaluation of the likelihood by restricting dependencies at a location to its neighbors. We establish inferential properties of microergodic spatial covariance parameters within the paradigm of fixed-domain asymptotics when they are estimated using Vecchia's approximation. The conditions required to formally establish these properties are explored, theoretically and empirically, and the effectiveness of Vecchia's approximation is further corroborated from the standpoint of fixed-domain asymptotics.
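As a reading aid, the standard form of Vecchia's approximation can be written as follows. This is a sketch of the general technique under assumed notation, not a formula quoted from the paper: $y_1, \dots, y_n$ are the ordered observations, $N(i)$ is the neighbor set of location $i$, and $m$ is an assumed cap on the neighbor-set size.

$$
p(y_1, \dots, y_n) = \prod_{i=1}^{n} p(y_i \mid y_1, \dots, y_{i-1}) \approx \prod_{i=1}^{n} p\big(y_i \mid y_{N(i)}\big), \qquad N(i) \subseteq \{1, \dots, i-1\}, \quad |N(i)| \le m .
$$

Each factor on the right is a Gaussian conditional that involves at most $m + 1$ observations, which is what makes likelihood evaluation fast when $m$ is much smaller than $n$.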

preprint 2022 arXiv

A discontinuous Galerkin method for nonlinear biharmonic Schrödinger equations

This paper proposes and analyzes a fully discrete scheme that discretizes space with an ultra-weak local discontinuous Galerkin scheme and time with the Crank-Nicolson method for the nonlinear biharmonic Schrödinger equation. We first rewrite the problem into a system with a second-order spatial derivative and then apply the ultra-weak discontinuous Galerkin method to the system. The proposed scheme is more computationally efficient compared with the local discontinuous Galerkin method because of fewer auxiliary variables, and unconditionally stable without any penalty terms; it also preserves the mass and Hamiltonian conservation that are important properties of the nonlinear biharmonic Schrödinger equation. We also derive optimal $L^2$-error estimates of the semi-discrete scheme that measure both the solution and the auxiliary variable with general nonlinear terms. Several numerical studies demonstrate and support our theoretical findings.

preprint 2022 arXiv

A high order finite difference method for the elastic wave equation in bounded domains with nonconforming interfaces

We develop a stable finite difference method for the elastic wave equation in bounded media, where the material properties can be discontinuous at curved interfaces. The governing equation is discretized in second order form by a fourth or sixth order accurate summation-by-parts operator. The mesh size is determined by the velocity structure of the material, resulting in nonconforming grid interfaces with hanging nodes. We use order-preserving interpolation and the ghost point technique to couple adjacent mesh blocks in an energy-conserving manner, which is supported by a fully discrete stability analysis. In our previous work for the wave equation, two pairs of order-preserving interpolation operators are needed when imposing the interface conditions weakly by a penalty technique. Here, we only use one pair in the ghost point method. In numerical experiments, we demonstrate that the convergence rate is optimal, and is the same as when a globally uniform mesh is used in a single domain. In addition, with a predictor-corrector time integration method, we obtain time stepping stability with stepsize almost the same as given by the usual Courant-Friedrichs-Lewy condition.

preprint 2022 arXiv

A local energy-based discontinuous Galerkin method for fourth order semilinear wave equations

This paper generalizes the earlier work on the energy-based discontinuous Galerkin method for second-order wave equations to fourth-order semilinear wave equations. We first rewrite the problem into a system with a second-order spatial derivative, then apply the energy-based discontinuous Galerkin method to the system. The proposed scheme, on the one hand, is more computationally efficient compared with the local discontinuous Galerkin method because of fewer auxiliary variables. On the other hand, it is unconditionally stable without adding any penalty terms, and admits optimal convergence in the $L^2$ norm for both solution and auxiliary variables. In addition, the energy-dissipating or energy-conserving property of the scheme follows from simple, mesh-independent choices of the interelement fluxes. We also present a stability and convergence analysis along with numerical experiments to demonstrate optimal convergence for certain choices of the interelement fluxes.

preprint 2022 arXiv

A Syntax-Guided Edit Decoder for Neural Program Repair

Automated Program Repair (APR) helps improve the efficiency of software development and maintenance. Recent APR techniques use deep learning, particularly the encoder-decoder architecture, to generate patches. Though existing DL-based APR approaches have proposed different encoder architectures, the decoder remains the standard one, which generates a sequence of tokens one by one to replace the faulty statement. This decoder has multiple limitations: 1) allowing syntactically incorrect programs to be generated, 2) inefficiently representing small edits, and 3) not being able to generate project-specific identifiers. In this paper, we propose Recoder, a syntax-guided edit decoder with placeholder generation. Recoder is novel in multiple aspects: 1) Recoder generates edits rather than modified code, allowing efficient representation of small edits; 2) Recoder is syntax-guided, with the novel provider/decider architecture to ensure the syntactic correctness of the patched program and accurate generation; 3) Recoder generates placeholders that could be instantiated as project-specific identifiers later. We conduct experiments to evaluate Recoder on 395 bugs from Defects4J v1.2 and 420 additional bugs from Defects4J v2.0. Our results show that Recoder repairs 53 bugs on Defects4J v1.2, which achieves 21.4% improvement over the previous state-of-the-art approach for single-hunk bugs (TBar). Importantly, to our knowledge, Recoder is the first DL-based APR approach that has outperformed the traditional APR approaches on this dataset. Furthermore, Recoder also repairs 19 bugs on the additional bugs from Defects4J v2.0, which is 137.5% more than TBar (8 bugs) and 850% more than SimFix (2 bugs). This result suggests that Recoder has better generalizability than existing APR approaches.

preprint 2022 arXiv

A Unified and Biologically-Plausible Relational Graph Representation of Vision Transformers

Vision transformer (ViT) and its variants have achieved remarkable successes in various visual tasks. The key characteristic of these ViT models is to adopt different aggregation strategies of spatial patch information within the artificial neural networks (ANNs). However, there is still a key lack of unified representation of different ViT architectures for systematic understanding and assessment of model representation performance. Moreover, how those well-performing ViT ANNs are similar to real biological neural networks (BNNs) is largely unexplored. To answer these fundamental questions, we, for the first time, propose a unified and biologically-plausible relational graph representation of ViT models. Specifically, the proposed relational graph representation consists of two key sub-graphs: aggregation graph and affine graph. The former one considers ViT tokens as nodes and describes their spatial interaction, while the latter one regards network channels as nodes and reflects the information communication between channels. Using this unified relational graph representation, we found that: a) a sweet spot of the aggregation graph leads to ViTs with significantly improved predictive performance; b) the graph measures of clustering coefficient and average path length are two effective indicators of model prediction performance, especially when applying on the datasets with small samples; c) our findings are consistent across various ViT architectures and multiple datasets; d) the proposed relational graph representation of ViT has high similarity with real BNNs derived from brain science data. Overall, our work provides a novel unified and biologically-plausible paradigm for more interpretable and effective representation of ViT ANNs.

preprint 2022 arXiv

Achieving Long-Term Fairness in Sequential Decision Making

In this paper, we propose a framework for achieving long-term fair sequential decision making. By conducting both the hard and soft interventions, we propose to take path-specific effects on the time-lagged causal graph as a quantitative tool for measuring long-term fairness. The problem of fair sequential decision making is then formulated as a constrained optimization problem with the utility as the objective and the long-term and short-term fairness as constraints. We show that such an optimization problem can be converted to a performative risk optimization. Finally, repeated risk minimization (RRM) is used for model training, and the convergence of RRM is theoretically analyzed. The empirical evaluation shows the effectiveness of the proposed algorithm on synthetic and semi-synthetic temporal datasets.

preprint 2022 arXiv

AGA: An Accelerated Greedy Additional Algorithm for Test Case Prioritization

In recent years, many test case prioritization (TCP) techniques have been proposed to speed up the process of fault detection. However, little work has taken the efficiency problem of these techniques into account. In this paper, we target the Greedy Additional (GA) algorithm, which has been widely recognized to be effective but less efficient, and try to improve its efficiency while preserving effectiveness. In our Accelerated GA (AGA) algorithm, we use some extra data structures to reduce redundant data accesses in the GA algorithm and thus the time complexity is reduced from $\mathcal{O}(m^2n)$ to $\mathcal{O}(kmn)$ when $n > m$, where $m$ is the number of test cases, $n$ is the number of program elements, and $k$ is the iteration number. Moreover, we observe the impact of iteration numbers on prioritization efficiency on our dataset and propose to use a specific iteration number in the AGA algorithm to further improve the efficiency. We conducted experiments on 55 open-source subjects. In particular, we implemented each TCP algorithm with two kinds of widely-used input formats, adjacency matrix and adjacency list. Since a TCP algorithm with adjacency matrix is less efficient than the algorithm with adjacency list, the result analysis is mainly conducted based on TCP algorithms with adjacency list. The results show that AGA achieves 5.95X speedup ratio over GA on average, while it achieves the same average effectiveness as GA in terms of Average Percentage of Fault Detected (APFD). Moreover, we conducted an industrial case study on 22 subjects, collected from Baidu, and find that the average speedup ratio of AGA over GA is 44.27X, which indicates the practical usage of AGA in real-world scenarios.
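For orientation, the baseline Greedy Additional strategy that the abstract starts from can be sketched as below. This is a minimal illustration of the plain GA idea only, not the accelerated AGA algorithm and not the authors' implementation; the function name `greedy_additional`, the dict-of-sets input format, the alphabetical tie-breaking, and the reset-on-zero-gain rule are assumptions made for this sketch.

```python
def greedy_additional(coverage):
    """Order tests with a plain Greedy Additional strategy (illustrative sketch).

    coverage: dict mapping a test name to the set of program elements it covers.
    Returns a list of test names in prioritized order.
    """
    remaining = dict(coverage)  # tests not yet placed in the order
    covered = set()             # elements covered since the last reset
    order = []
    while remaining:
        # Pick the test that adds the most not-yet-covered elements.
        # Iterating over sorted names makes ties resolve alphabetically.
        best = max(sorted(remaining), key=lambda t: len(remaining[t] - covered))
        gain = len(remaining[best] - covered)
        if gain == 0 and covered:
            # No remaining test adds coverage: reset and re-rank the rest.
            covered = set()
            continue
        order.append(best)
        covered |= remaining.pop(best)
    return order


if __name__ == "__main__":
    demo = {"t1": {1, 2, 3}, "t2": {3, 4}, "t3": {1, 2}, "t4": {5}}
    print(greedy_additional(demo))
```

Every iteration rescans all remaining tests and recomputes set differences, which is the kind of redundant data access the abstract says AGA reduces with extra data structures.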

preprint 2022 arXiv

An Energy-Based Discontinuous Galerkin Method with Tame CFL Numbers for the Wave Equation

We extend and analyze the energy-based discontinuous Galerkin method for second order wave equations on staggered and structured meshes. By combining spatial staggering with local time-stepping near boundaries, the method overcomes the typical numerical stiffness associated with high order piecewise polynomial approximations. In one space dimension with periodic boundary conditions and suitably chosen numerical fluxes, we prove bounds on the spatial operators that establish stability for CFL numbers $c \frac{\Delta t}{h} < C$ independent of order when stability-enhanced explicit time-stepping schemes of matching order are used. For problems on bounded domains and in higher dimensions we demonstrate numerically that one can march explicitly with large time steps at high order temporal and spatial accuracy.

preprint 2022 arXiv

Analysis on the composite nature of the light scalar mesons $f_{0}(980)$ and $a_0(980)$

We study the weight or compositeness of the $\pi\pi$-$K\bar{K}$ and $\pi\eta$-$K\bar{K}$ in the composition of the $f_0(980)$ and $a_0(980)$ resonances, respectively. Either we use the saturation of the total width and compositeness, or we use a Flatté parameterization taking also into account the spectral function of a near-threshold resonance. We make connections and compare between these two methods. We take input values for the pole mass and width from several determinations in the literature. In addition, we take as third input either the total compositeness or the decay-width branching ratio to the lighter channel for each resonance. It turns out that for the poles considered the meson-meson components are dominant for the $f_0(980)$, while for the $a_0(980)$ resonance they are subdominant. We also provide partial decay widths and partial compositeness coefficients, so that the $K\bar{K}$ component is the most important one for the $f_0(980)$. Additionally, this study stresses the need to distinguish between the bare and dressed couplings and widths in a Flatté parameterization. We elaborate on the connection between the partial-decay widths calculated in terms of the dressed couplings and the actual measured ones. Due to the coupled-channel dynamics when the pole lies near the heavier threshold in the second Riemann sheet some changes are needed with respect to standard relations.

preprint 2022 arXiv

Automatically Discovering Novel Visual Categories with Self-supervised Prototype Learning

This paper tackles the problem of novel category discovery (NCD), which aims to discriminate unknown categories in large-scale image collections. The NCD task is challenging due to the closeness to the real-world scenarios, where we have only encountered some partial classes and images. Unlike other works on the NCD, we leverage the prototypes to emphasize the importance of category discrimination and alleviate the issue of missing annotations of novel classes. Concretely, we propose a novel adaptive prototype learning method consisting of two main stages: prototypical representation learning and prototypical self-training. In the first stage, we obtain a robust feature extractor, which could serve for all images with base and novel categories. This ability of instance and category discrimination of the feature extractor is boosted by self-supervised learning and adaptive prototypes. In the second stage, we utilize the prototypes again to rectify offline pseudo labels and train a final parametric classifier for category clustering. We conduct extensive experiments on four benchmark datasets and demonstrate the effectiveness and robustness of the proposed method with state-of-the-art performance.

preprint 2022 arXiv

Composite nature of $Z_b$ states from data analysis

We use a near-threshold parameterization with explicit inclusion of the Castillejo-Dalitz-Dyson poles, which is more general than the effective range expansion, to study the bottomonium-like states $Z_b(10610)$ and $Z_b(10650)$. In terms of the partial-wave amplitude, we fit the event number distribution of the $B^{(*)}\bar B^*$ system to the experimental data for these resonances from the Belle Collaboration. The data could be described very well in our method, which supports the molecular interpretation. Then the relevant physical quantities are obtained, including the $B^{(*)}\bar{B}^*$ scattering length ($a$), effective range ($r$), and residue squared ($\gamma_s^2$) of the pole in the complex plane. In particular, we find the compositeness can range from about 0.4 up to 1 for the $B\bar B^*$ ($B^*\bar B^*$) component in the resonance $Z_b(10610)$ ($Z_b(10650)$).

preprint 2022 arXiv

Coupling Visual Semantics of Artificial Neural Networks and Human Brain Function via Synchronized Activations

Artificial neural networks (ANNs), originally inspired by biological neural networks (BNNs), have achieved remarkable successes in many tasks such as visual representation learning. However, whether there exist semantic correlations/connections between the visual representations in ANNs and those in BNNs remains largely unexplored due to both the lack of an effective tool to link and couple two different domains, and the lack of a general and effective framework for representing the visual semantics in BNNs such as human functional brain networks (FBNs). To answer this question, we propose a novel computational framework, Synchronized Activations (Sync-ACT), to couple the visual representation spaces and semantics between ANNs and BNNs in the human brain based on naturalistic functional magnetic resonance imaging (nfMRI) data. With this approach, we are able to semantically annotate the neurons in ANNs with biologically meaningful descriptions derived from human brain imaging for the first time. We evaluated the Sync-ACT framework on two publicly available movie-watching nfMRI datasets. The experiments demonstrate a) the significant correlation and similarity of the semantics between the visual representations in FBNs and those in a variety of convolutional neural network (CNN) models; b) the close relationship between a CNN's visual representation similarity to BNNs and its performance in image classification tasks. Overall, our study introduces a general and effective paradigm to couple the ANNs and BNNs and provides novel insights for future studies such as brain-inspired artificial intelligence.

preprint 2022 arXiv

Disentangling Spatial-Temporal Functional Brain Networks via Twin-Transformers

How to identify and characterize functional brain networks (BN) is fundamental to gain system-level insights into the mechanisms of brain organizational architecture. Current functional magnetic resonance imaging (fMRI) analysis relies heavily on prior knowledge of specific patterns in either the spatial (e.g., resting-state network) or temporal (e.g., task stimulus) domain. In addition, most approaches aim to find group-wise common functional networks, while individual-specific functional networks have rarely been studied. In this work, we propose a novel Twin-Transformers framework to simultaneously infer common and individual functional networks in both spatial and temporal space, in a self-supervised manner. The first transformer takes space-divided information as input and generates spatial features, while the second transformer takes time-related information as input and outputs temporal features. The spatial and temporal features are further separated into common and individual ones via interactions (weights sharing) and constraints between the two transformers. We applied our Twin-Transformers to the Human Connectome Project (HCP) motor task-fMRI dataset and identified multiple common brain networks, including both task-related and resting-state networks (e.g., default mode network). Interestingly, we also successfully recovered a set of individual-specific networks that are not related to task stimulus and only exist at the individual level.

preprint 2022 arXiv

DrugOOD: Out-of-Distribution (OOD) Dataset Curator and Benchmark for AI-aided Drug Discovery -- A Focus on Affinity Prediction Problems with Noise Annotations

AI-aided drug discovery (AIDD) is gaining increasing popularity due to its promise of making the search for new pharmaceuticals quicker, cheaper and more efficient. In spite of its extensive use in many fields, such as ADMET prediction, virtual screening, protein folding and generative chemistry, little has been explored in terms of the out-of-distribution (OOD) learning problem with noise, which is inevitable in real world AIDD applications. In this work, we present DrugOOD, a systematic OOD dataset curator and benchmark for AI-aided drug discovery, which comes with an open-source Python package that fully automates the data curation and OOD benchmarking processes. We focus on one of the most crucial problems in AIDD: drug target binding affinity prediction, which involves both macromolecule (protein target) and small-molecule (drug compound). In contrast to only providing fixed datasets, DrugOOD offers an automated dataset curator with user-friendly customization scripts, rich domain annotations aligned with biochemistry knowledge, realistic noise annotations and rigorous benchmarking of state-of-the-art OOD algorithms. Since the molecular data is often modeled as irregular graphs using graph neural network (GNN) backbones, DrugOOD also serves as a valuable testbed for graph OOD learning problems. Extensive empirical studies have shown a significant performance gap between in-distribution and out-of-distribution experiments, which highlights the need to develop better schemes that can allow for OOD generalization under noise for AIDD.

preprint 2022 arXiv

EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification

Recent works have empirically shown the effectiveness of data augmentation (DA) in NLP tasks, especially for those suffering from data scarcity. Intuitively, given the size of generated data, their diversity and quality are crucial to the performance of targeted tasks. However, to the best of our knowledge, most existing methods consider only either the diversity or the quality of augmented data, thus cannot fully mine the potential of DA for NLP. In this paper, we present an easy and plug-in data augmentation framework EPiDA to support effective text classification. EPiDA employs two mechanisms: relative entropy maximization (REM) and conditional entropy minimization (CEM) to control data generation, where REM is designed to enhance the diversity of augmented data while CEM is exploited to ensure their semantic consistency. EPiDA can support efficient and continuous data generation for effective classifier training. Extensive experiments show that EPiDA outperforms existing SOTA methods in most cases, though not using any agent networks or pre-trained generation networks, and it works well with various DA algorithms and classification models. Code is available at https://github.com/zhaominyiz/EPiDA.

preprint 2022 arXiv

Eye-gaze-guided Vision Transformer for Rectifying Shortcut Learning

Learning harmful shortcuts such as spurious correlations and biases prevents deep neural networks from learning the meaningful and useful representations, thus jeopardizing the generalizability and interpretability of the learned representation. The situation becomes even more serious in medical imaging, where the clinical data (e.g., MR images with pathology) are limited and scarce while the reliability, generalizability and transparency of the learned model are highly required. To address this problem, we propose to infuse human experts' intelligence and domain knowledge into the training of deep neural networks. The core idea is that we infuse the visual attention information from expert radiologists to proactively guide the deep model to focus on regions with potential pathology and avoid being trapped in learning harmful shortcuts. To do so, we propose a novel eye-gaze-guided vision transformer (EG-ViT) for diagnosis with limited medical image data. We mask the input image patches that are out of the radiologists' interest and add an additional residual connection in the last encoder layer of EG-ViT to maintain the correlations of all patches. The experiments on two public datasets of INbreast and SIIM-ACR demonstrate our EG-ViT model can effectively learn/transfer experts' domain knowledge and achieve much better performance than baselines. Meanwhile, it successfully rectifies the harmful shortcut learning and significantly improves the EG-ViT model's interpretability. In general, EG-ViT takes the advantages of both human expert's prior knowledge and the power of deep neural networks. This work opens new avenues for advancing current artificial intelligence paradigms by infusing human intelligence.

preprint 2022 arXiv

FB-MSTCN: A Full-Band Single-Channel Speech Enhancement Method Based on Multi-Scale Temporal Convolutional Network

In recent years, deep learning-based approaches have significantly improved the performance of single-channel speech enhancement. However, due to the limitation of training data and computational complexity, real-time enhancement of full-band (48 kHz) speech signals is still very challenging. Because of the low energy of spectral information in the high-frequency part, it is more difficult to directly model and enhance the full-band spectrum using neural networks. To solve this problem, this paper proposes a two-stage real-time speech enhancement model with an extraction-interpolation mechanism for a full-band signal. The 48 kHz full-band time-domain signal is divided into three sub-channels by extraction, and a two-stage processing scheme of 'masking + compensation' is proposed to enhance the signal in the complex domain. After the two-stage enhancement, the enhanced full-band speech signal is restored by interval interpolation. In the subjective listening and word accuracy test, our proposed model achieves superior performance and outperforms the baseline model overall by 0.59 MOS and 4.0% WAcc for the non-personalized speech denoising task.

preprint 2022 arXiv

Floodgate: inference for model-free variable importance

Many modern applications seek to understand the relationship between an outcome variable $Y$ and a covariate $X$ in the presence of a (possibly high-dimensional) confounding variable $Z$. Although much attention has been paid to testing whether $Y$ depends on $X$ given $Z$, in this paper we seek to go beyond testing by inferring the strength of that dependence. We first define our estimand, the minimum mean squared error (mMSE) gap, which quantifies the conditional relationship between $Y$ and $X$ in a way that is deterministic, model-free, interpretable, and sensitive to nonlinearities and interactions. We then propose a new inferential approach called floodgate that can leverage any working regression function chosen by the user (allowing, e.g., it to be fitted by a state-of-the-art machine learning algorithm or be derived from qualitative domain knowledge) to construct asymptotic confidence bounds, and we apply it to the mMSE gap. We additionally show that floodgate's accuracy (distance from confidence bound to estimand) is adaptive to the error of the working regression function. We then show we can apply the same floodgate principle to a different measure of variable importance when $Y$ is binary. Finally, we demonstrate floodgate's performance in a series of simulations and apply it to data from the UK Biobank to infer the strengths of dependence of platelet count on various groups of genetic mutations.
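One way to read the mMSE gap named in the abstract is as the drop in the best achievable mean squared error for predicting $Y$ when $X$ is added to $Z$. The display below is a sketch inferred from the estimand's name, not a formula quoted from the paper, and the symbol $\mathcal{I}^2$ is an assumed notation.

$$
\mathcal{I}^2 = \mathbb{E}\big[(Y - \mathbb{E}[Y \mid Z])^2\big] - \mathbb{E}\big[(Y - \mathbb{E}[Y \mid X, Z])^2\big]
$$

Under this reading the gap is nonnegative, and it is zero exactly when $\mathbb{E}[Y \mid X, Z] = \mathbb{E}[Y \mid Z]$ almost surely, that is, when $X$ does not improve the conditional mean prediction of $Y$ beyond $Z$.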

preprint 2022 arXiv

GDsmith: Detecting Bugs in Graph Database Engines

Graph database engines stand out in the era of big data for their efficiency in modeling and processing linked data. There is a strong need for testing graph database engines. However, random testing, the most practical way of automated test generation, faces the challenges of semantic validity, non-empty results, and behavior diversity when used to detect bugs in graph database engines. To address these challenges, in this paper, we propose GDsmith, the first black-box approach for testing graph database engines. It ensures that each randomly generated Cypher query satisfies the semantic requirements via skeleton generation and completion. GDsmith includes our technique to increase the probability of producing Cypher queries that return non-empty results by leveraging three types of structural mutation strategies. GDsmith also includes our technique to improve the behavior diversity of the generated Cypher queries by selecting property keys according to their previous frequencies when generating new queries. Our evaluation results demonstrate that GDsmith is effective and efficient for automated query generation and substantially outperforms the baseline. GDsmith successfully detects 27 previously unknown bugs on the released versions of three popular open-source graph database engines and has received positive feedback from their developers.

preprint2022arXiv

Generalized Equivariance and Preferential Labeling for GNN Node Classification

Existing graph neural networks (GNNs) largely rely on node embeddings, which represent a node as a vector by its identity, type, or content. However, graphs with unattributed nodes widely exist in real-world applications (e.g., anonymized social networks). Previous GNNs either assign random labels to nodes (which introduces artefacts to the GNN) or assign one embedding to all nodes (which fails to explicitly distinguish one node from another). Further, when these GNNs are applied to unattributed node classification problems, they have an undesired equivariance property and are therefore fundamentally unable to handle data with multiple possible outputs. In this paper, we analyze the limitations of existing approaches to node classification problems. Inspired by our analysis, we propose a generalized equivariance property and a Preferential Labeling technique that satisfies the desired property asymptotically. Experimental results show that we achieve high performance in several unattributed node classification tasks.

preprint2022arXiv

Hyperspectral Imaging for cherry tomato

Cherry tomato (Solanum lycopersicum) is popular with consumers around the world due to its special flavor. Soluble solids content (SSC) and firmness are two key metrics for evaluating product quality. In this work, we develop non-destructive testing techniques for SSC and fruit firmness based on hyperspectral images and a corresponding deep learning regression model. Hyperspectral reflectance images of over 200 tomato fruits are acquired with a spectrum ranging from 400 to 1000 nm. The acquired hyperspectral images are corrected and the spectral information is extracted. A novel one-dimensional (1D) convolutional ResNet (Con1dResNet) based regression model is proposed and compared with state-of-the-art techniques. Experimental results show that, with a relatively large number of samples, our technique is 26.4% better than the state-of-the-art technique for SSC and 33.7% better for firmness. The results of this study indicate the application potential of the hyperspectral imaging technique in SSC and firmness detection, which provides a new option for non-destructive testing of cherry tomato fruit quality in the future.

preprint2022arXiv

Intra-Modal Constraint Loss For Image-Text Retrieval

Cross-modal retrieval has drawn much attention in both computer vision and natural language processing domains. With the development of convolutional and recurrent neural networks, the bottleneck of retrieval across image-text modalities is no longer the extraction of image and text features but the learning of an efficient loss function in the embedding space. Many loss functions try to bring pairwise features from heterogeneous modalities closer together. This paper proposes a method for learning a joint embedding of images and texts using an intra-modal constraint loss function to reduce the violation of negative pairs from the same homogeneous modality. Experimental results show that our approach outperforms state-of-the-art bi-directional image-text retrieval methods on Flickr30K and Microsoft COCO datasets. Our code is publicly available: https://github.com/CanonChen/IMC.

preprint2022arXiv

Mask-guided Vision Transformer (MG-ViT) for Few-Shot Learning

Learning with little data is challenging but often inevitable in various application scenarios where the labeled data is limited and costly. Recently, few-shot learning (FSL) has gained increasing attention because of its ability to generalize prior knowledge to new tasks that contain only a few samples. However, for data-intensive models such as the vision transformer (ViT), current fine-tuning based FSL approaches are inefficient in knowledge generalization and thus degrade downstream task performance. In this paper, we propose a novel mask-guided vision transformer (MG-ViT) to achieve effective and efficient FSL on the ViT model. The key idea is to apply a mask on image patches to screen out the task-irrelevant ones and to guide the ViT to focus on task-relevant and discriminative patches during FSL. In particular, MG-ViT only introduces an additional mask operation and a residual connection, enabling the inheritance of parameters from a pre-trained ViT without any other cost. To optimally select representative few-shot samples, we also include an active learning based sample selection method to further improve the generalizability of MG-ViT based FSL. We evaluate the proposed MG-ViT on both the Agri-ImageNet classification task and the ACFR apple detection task with gradient-weighted class activation mapping (Grad-CAM) as the mask. The experimental results show that the MG-ViT model significantly improves the performance when compared with general fine-tuning based ViT models, providing novel insights and a concrete approach towards generalizing data-intensive and large-scale deep learning models for FSL.

preprint2022arXiv

MetaNOR: A Meta-Learnt Nonlocal Operator Regression Approach for Metamaterial Modeling

We propose MetaNOR, a meta-learnt approach for transfer-learning operators based on the nonlocal operator regression. The overall goal is to efficiently provide surrogate models for new and unknown material-learning tasks with different microstructures. The algorithm consists of two phases: (1) learning a common nonlocal kernel representation from existing tasks; (2) transferring the learned knowledge and rapidly learning surrogate operators for unseen tasks with a different material, where only a few test samples are required. We apply MetaNOR to model the wave propagation within 1D metamaterials, showing substantial improvements on the sampling efficiency for new materials.

preprint2022arXiv

On the hyper-singular boundary integral equation methods for dynamic poroelasticity: three dimensional case

In our previous work [SIAM J. Sci. Comput. 43(3) (2021) B784-B810], an accurate hyper-singular boundary integral equation method for dynamic poroelasticity in two dimensions has been developed. This work is devoted to studying the more complex and difficult three-dimensional problems with Neumann boundary condition and both the direct and indirect methods are adopted to construct combined boundary integral equations. The strongly-singular and hyper-singular integral operators are reformulated into compositions of weakly-singular integral operators and tangential-derivative operators, which allow us to prove the jump relations associated with the poroelastic layer potentials and boundary integral operators in a simple manner. Relying on both the investigated spectral properties of the strongly-singular operators, which indicate that the corresponding eigenvalues accumulate at three points whose values are only dependent on two Lamé constants, and the spectral properties of the Calderón relations of the poroelasticity, we propose low-GMRES-iteration regularized integral equations. Numerical examples are presented to demonstrate the accuracy and efficiency of the proposed methodology by means of a Chebyshev-based rectangular-polar solver.

preprint2022arXiv

Pathfinder: Parallel quasi-Newton variational inference

We propose Pathfinder, a variational method for approximately sampling from differentiable log densities. Starting from a random initialization, Pathfinder locates normal approximations to the target density along a quasi-Newton optimization path, with local covariance estimated using the inverse Hessian estimates produced by the optimizer. Pathfinder returns draws from the approximation with the lowest estimated Kullback-Leibler (KL) divergence to the true posterior. We evaluate Pathfinder on a wide range of posterior distributions, demonstrating that its approximate draws are better than those from automatic differentiation variational inference (ADVI) and comparable to those produced by short chains of dynamic Hamiltonian Monte Carlo (HMC), as measured by 1-Wasserstein distance. Compared to ADVI and short dynamic HMC runs, Pathfinder requires one to two orders of magnitude fewer log density and gradient evaluations, with greater reductions for more challenging posteriors. Importance resampling over multiple runs of Pathfinder improves the diversity of approximate draws, reducing 1-Wasserstein distance further and providing a measure of robustness to optimization failures on plateaus, saddle points, or in minor modes. The Monte Carlo KL divergence estimates are embarrassingly parallelizable in the core Pathfinder algorithm, as are multiple runs in the resampling version, further increasing Pathfinder's speed advantage with multiple cores.

preprint2022arXiv

Phase transition of eigenvalues in deformed Ginibre ensembles

Consider a random matrix of size $N$ as an additive deformation of the complex Ginibre ensemble under a deterministic matrix $X_0$ with a finite rank, independent of $N$. When some eigenvalues of $X_0$ separate from the unit disk, outlier eigenvalues may appear asymptotically in the same locations, and their fluctuations exhibit surprising phenomena that highly depend on the Jordan canonical form of $X_0$. These findings are largely due to Benaych-Georges and Rochet \cite{BR}, Bordenave and Capitaine \cite{BC16}, and Tao \cite{Ta13}. When all eigenvalues of $X_0$ lie inside the unit disk, we prove that local eigenvalue statistics at the spectral edge form a new class of determinantal point processes, for which correlation kernels are characterized in terms of the repeated erfc integrals. This thus completes a non-Hermitian analogue of the BBP phase transition in Random Matrix Theory. Similar results hold for the deformed real quaternion Ginibre ensemble.

preprint2022arXiv

Representing Brain Anatomical Regularity and Variability by Few-Shot Embedding

Effective representation of brain anatomical architecture is fundamental in understanding brain regularity and variability. Despite numerous efforts, it is still difficult to infer reliable anatomical correspondence at a finer scale, given the tremendous individual variability in cortical folding patterns. It is even more challenging to disentangle common and individual patterns when comparing brains at different neuro-developmental stages. In this work, we developed a novel learning-based few-shot embedding framework to encode the cortical folding patterns into a latent space represented by a group of anatomically meaningful embedding vectors. Specifically, we adopted the 3-hinge (3HG) network as the substrate and designed an autoencoder-based embedding framework to learn a common embedding vector for each 3HG's multi-hop feature: each 3HG can be represented as a combination of these feature embeddings via a set of individual-specific coefficients to characterize individualized anatomical information. That is, the regularity of folding patterns is encoded into the embeddings, while the individual variations are preserved by the multi-hop combination coefficients. To effectively learn the embeddings for the population with very limited samples, few-shot learning was adopted. We applied our method on adult HCP and pediatric datasets with 1,000+ brains (from 34 gestational weeks to young adult). Our experimental results show that: 1) the learned embedding vectors can quantitatively encode the commonality and individuality of cortical folding patterns; 2) with the embeddings we can robustly infer the complicated many-to-many anatomical correspondences among different brains; and 3) our model can be successfully transferred to new populations with very limited training samples.

preprint2022arXiv

Revisiting Linearized Bregman Iterations under Lipschitz-like Convexity Condition

The linearized Bregman iterations (LBreI) and its variants have received considerable attention in signal/image processing and compressed sensing. Recently, LBreI has been extended to a larger class of nonconvex functions, along with several theoretical issues left for further investigation. In particular, the gradient Lipschitz continuity assumption precludes its use in many practical applications. In this study, we propose a generalized algorithmic framework to unify LBreI-type methods. Our main discovery is that the gradient Lipschitz continuity assumption can be replaced by a Lipschitz-like convexity condition in both convex and nonconvex cases. The proposed framework and theory are then applied to linear/quadratic inverse problems.

preprint2022arXiv

Series Photo Selection via Multi-view Graph Learning

Series photo selection (SPS) is an important branch of image aesthetics quality assessment, which focuses on finding the best one from a series of nearly identical photos. While great progress has been made, most of the existing SPS approaches concentrate solely on extracting features from the original image, neglecting that multiple views, e.g., saturation level, color histogram and depth of field of the image, help reflect subtle aesthetic changes. Taking multiple views into consideration, we leverage a graph neural network to construct the relationships between multi-view features. Besides, multiple views are aggregated with an adaptive-weight self-attention module to verify the significance of each view. Finally, a siamese network is proposed to select the best one from a series of nearly identical photos. Experimental results demonstrate that our model achieves the highest success rates compared with competitive methods.

preprint2022arXiv

Stability analysis of the Tsallis holographic dark energy model

Using the generalized Tsallis entropy, the Tsallis holographic dark energy (THDE) was proposed recently. In this paper we analyze the cosmological consequences of the THDE model with an interaction between dark energy and dark matter $Q=H(αρ_{m}+βρ_{D})$. We find that the THDE model can explain the current accelerated cosmic expansion, and it is stable under certain conditions. Furthermore, through a dynamical analysis, we find that there exists an attractor which represents an accelerated expansion phase of the universe. When $β=0$, this attractor corresponds to a dark energy dominated de Sitter solution and the universe can evolve into an era which is depicted by the $Λ$CDM model. The age of the universe in this model is also explored.

preprint2022arXiv

Taming Hybrid-Cloud Fast and Scalable Graph Analytics at Twitter

We have witnessed a boosted demand for graph analytics at Twitter in recent years, and graph analytics has become one of the key parts of Twitter's large-scale data analytics and machine learning for driving engagement, serving the most relevant content, and promoting healthier conversations. However, infrastructure for graph analytics has historically not been an area of investment at Twitter, resulting in a long timeline and huge engineering effort for each project to deal with graphs at the Twitter scale. How do we build a unified graph analytics user experience to fulfill modern data analytics on various graph scales spanning from thousands to hundreds of billions of vertices and edges? To bring fast and scalable graph analytics capability into production, we investigate the challenges we are facing in large-scale graph analytics at Twitter and propose a unified graph analytics platform for efficient, scalable, and reliable graph analytics across on-premises and cloud, to fulfill the requirements of diverse graph use cases and challenging scales. We also conduct quantitative benchmarking on Twitter's production-level graph use cases between popular graph analytics frameworks to certify our solution.

preprint2022arXiv

Trajectory Prediction with Graph-based Dual-scale Context Fusion

Motion prediction for traffic participants is essential for a safe and robust automated driving system, especially in cluttered urban environments. However, it is highly challenging due to the complex road topology as well as the uncertain intentions of the other agents. In this paper, we present a graph-based trajectory prediction network named the Dual Scale Predictor (DSP), which encodes both the static and dynamical driving context in a hierarchical manner. Different from methods based on a rasterized map or sparse lane graph, we consider the driving context as a graph with two layers, focusing on both geometrical and topological features. Graph neural networks (GNNs) are applied to extract features with different levels of granularity, and features are subsequently aggregated with attention-based inter-layer networks, realizing better local-global feature fusion. Following the recent goal-driven trajectory prediction pipeline, goal candidates with high likelihood for the target agent are extracted, and predicted trajectories are generated conditioned on these goals. Thanks to the proposed dual-scale context fusion network, our DSP is able to generate accurate and human-like multi-modal trajectories. We evaluate the proposed method on the large-scale Argoverse motion forecasting benchmark, and it achieves promising results, outperforming the recent state-of-the-art methods.

preprint2022arXiv

Weakly Aligned Feature Fusion for Multimodal Object Detection

To achieve accurate and robust object detection in real-world scenarios, various forms of images are incorporated, such as color, thermal, and depth. However, multimodal data often suffer from the position shift problem, i.e., the image pair is not strictly aligned, so that one object has different positions in different modalities. For deep learning methods, this problem makes it difficult to fuse multimodal features and confuses the convolutional neural network (CNN) training. In this article, we propose a general multimodal detector named aligned region CNN (AR-CNN) to tackle the position shift problem. First, a region feature (RF) alignment module with adjacent similarity constraint is designed to consistently predict the position shift between two modalities and adaptively align the cross-modal RFs. Second, we propose a novel region of interest (RoI) jitter strategy to improve the robustness to unexpected shift patterns. Third, we present a new multimodal feature fusion method that selects the more reliable feature and suppresses the less useful one via feature reweighting. In addition, by locating bounding boxes in both modalities and building their relationships, we provide novel multimodal labeling named KAIST-Paired. Extensive experiments on 2-D and 3-D object detection, RGB-T, and RGB-D datasets demonstrate the effectiveness and robustness of our method.

preprint2021arXiv

Adaptively Sketched Bregman Projection Methods for Linear Systems

The sketch-and-project method, as a general archetypal algorithm for solving linear systems, unifies a variety of randomized iterative methods such as the randomized Kaczmarz and randomized coordinate descent. However, since it aims to find a least-norm solution of a linear system, the randomized sparse Kaczmarz cannot be included. This motivates us to propose a more general framework, called the sketched Bregman projection (SBP) method, in which we are able to find solutions with certain structures from linear systems. To generalize the concept of adaptive sampling to the SBP method, we show how the progress of a single step, measured by Bregman distance, depends directly on a sketched loss function. Theoretically, we provide detailed global convergence results for the SBP method with different adaptive sampling rules. Finally, for the (sparse) Kaczmarz methods, a group of numerical simulations is performed, with which we verify that the methods utilizing the sampling Kaczmarz-Motzkin rule demand the lowest computational cost to achieve a given error bound compared to the corresponding methods with other sampling rules.

preprint2021arXiv

Extracting Concise Bug-Fixing Patches from Human-Written Patches in Version Control Systems

High-quality and large-scale repositories of real bugs and their concise patches collected from real-world applications are critical for research in the software engineering community. In such a repository, each real bug is explicitly associated with its fix. Therefore, on one side, the real bugs and their fixes may inspire novel approaches for finding, locating, and repairing software bugs; on the other side, the real bugs and their fixes are indispensable for rigorous and meaningful evaluation of approaches for software testing, fault localization, and program repair. To this end, a number of such repositories, e.g., Defects4J, have been proposed. However, such repositories are rather small because their construction involves expensive human intervention. Although bug-fixing code commits as well as associated test cases could be retrieved from version control systems automatically, existing approaches could not yet automatically extract concise bug-fixing patches from bug-fixing commits because such commits often involve bug-irrelevant changes. In this paper, we propose an automatic approach, called BugBuilder, to extract complete and concise bug-fixing patches from human-written patches in version control systems. It excludes refactorings by detecting refactorings involved in bug-fixing commits, and reapplying detected refactorings on the faulty version. It enumerates all subsets of the remaining part and validates them on test cases. If none of the subsets has the potential to be a complete bug-fixing patch, the remaining part as a whole is taken as a complete and concise bug-fixing patch. Evaluation results on 809 real bug-fixing commits in Defects4J suggest that BugBuilder successfully generated complete and concise bug-fixing patches for forty percent of the bug-fixing commits, and its precision (99%) was even higher than that of human experts.

preprint2020arXiv

A Fixation-based 360° Benchmark Dataset for Salient Object Detection

Fixation prediction (FP) in panoramic contents has been widely investigated along with the booming trend of virtual reality (VR) applications. However, another issue within the field of visual saliency, salient object detection (SOD), has been seldom explored in 360° (or omnidirectional) images due to the lack of datasets representative of real scenes with pixel-level annotations. Toward this end, we collect 107 equirectangular panoramas with challenging scenes and multiple object classes. Based on the consistency between FP and explicit saliency judgements, we further manually annotate 1,165 salient objects over the collected images with precise masks under the guidance of real human eye fixation maps. Six state-of-the-art SOD models are then benchmarked on the proposed fixation-based 360° image dataset (F-360iSOD), by applying a multiple cubic projection-based fine-tuning method. Experimental results show a limitation of the current methods when used for SOD in panoramic images, which indicates the proposed dataset is challenging. Key issues for 360° SOD are also discussed. The proposed dataset is available at https://github.com/PanoAsh/F-360iSOD.

preprint2020arXiv

A Study of Bug Resolution Characteristics in Popular Programming Languages

This paper presents a large-scale study that investigates the bug resolution characteristics among popular GitHub projects written in different programming languages. We explore correlations but, of course, we cannot infer causation. Specifically, we analyse bug resolution data from approximately 70 million source lines of code, drawn from 3 million commits to 600 GitHub projects, primarily written in 10 programming languages. We find notable variations in apparent bug resolution time and patch (fix) size. While interpretation of results from such large-scale empirical studies is inherently difficult, we believe that the differences in medians are sufficiently large to warrant further investigation, replication, re-analysis and follow-up research. For example, in our corpus, the median apparent bug resolution time (elapsed time from raise to resolve) for Ruby was 4X that for Go and 2.5X for Java. We also found that patches tend to touch more files for the corpus of strongly typed and for statically typed programs. However, we also found evidence for a lower elapsed resolution time for bug resolution committed to projects constructed from statically typed languages. These findings, if replicated in subsequent follow-on studies, may shed further empirical light on the debate about the importance of static typing.

preprint2020arXiv

An accurate hyper-singular boundary integral equation method for dynamic poroelasticity in two dimensions

This paper is concerned with the boundary integral equation method for solving the exterior Neumann boundary value problem of dynamic poroelasticity in two dimensions. The main contribution of this work consists of two aspects: the proposal of a novel regularized boundary integral equation, and the presentation of new regularized formulations of the strongly-singular and hyper-singular boundary integral operators. Firstly, turning to the spectral properties of the double-layer operator and the corresponding Calderón relation of the poroelasticity, we propose the novel low-GMRES-iteration integral equation whose eigenvalues are bounded away from zero and infinity. Secondly, with the help of the Günter derivatives, we reformulate the strongly-singular and hyper-singular integral operators into combinations of the weakly-singular operators and the tangential derivatives. The accuracy and efficiency of the proposed methodology are demonstrated through several numerical examples.

preprint2020arXiv

An energy-based discontinuous Galerkin method for semilinear wave equations

We generalize the energy-based discontinuous Galerkin method proposed in [SIAM J. Num. Anal., 53(6):2705-2726, 2015.] to second-order semilinear wave equations. A stability and convergence analysis is presented along with numerical experiments demonstrating optimal convergence for certain choices of the interelement fluxes. Applications to the sine-Gordon equation include simulations of breathers, kink, and anti-kink solitons.

preprint2020arXiv

Bi-parameter trilinear Fourier multipliers and pseudo-differential operators with flag symbols

The main purpose of this paper is to study $L^r$ Hölder type estimates for a bi-parameter trilinear Fourier multiplier with flag singularity, and the analogous pseudo-differential operator, when the symbols are in a certain product form. More precisely, for $f,g,h\in \mathcal{S}(\mathbb{R}^{2})$, the bi-parameter trilinear flag Fourier multiplier operators we consider are defined by $$ T_{m_1,m_2}(f,g,h)(x):=\int_{\mathbb{R}^{6}}m_1(ξ,η,ζ)m_2(η,ζ)\hat f(ξ) \hat g(η)\hat h(ζ)e^{2πi(ξ+η+ζ)\cdot x}dξdηdζ, $$ where $m_1,m_2$ are two bi-parameter symbols. We will show that our problem can be reduced to establishing the $L^r$ estimate for the special multiplier $m_1(ξ_1, η_1, ζ_1) m_2(η_2, ζ_2)$ (see Theorem 1.7). We also study these $L^r$ estimates for the corresponding bi-parameter trilinear pseudo-differential operators defined by $$ T_{ab}(f,g,h)(x):=\int_{\mathbb{R}^6}a(x,ξ,η,ζ)b(x,η,ζ)\hat f(ξ)\hat g(η)\hat h(ζ)e^{2πi x(ξ+η+ζ)}dξdηdζ, $$ where the smooth symbols $a,b$ satisfy certain bi-parameter Hörmander conditions. We will also show that the $L^r$ estimate holds for $T_{ab}$ as long as the $L^r$ estimate for the flag multiplier operator holds when the multiplier has the special form $m_1(ξ_1, η_1, ζ_1) m_2(η_2, ζ_2)$ (see Theorem 1.10). The bi-parameter and trilinear flag Fourier multipliers considered in this paper do not satisfy the conditions of the classical bi-parameter trilinear Fourier multipliers considered by Muscalu, Tao, Thiele and the second author [21, 22]. They may also be viewed as the bi-parameter trilinear variants of estimates obtained for the one-parameter flag paraproducts by Muscalu [18].

preprint2020arXiv

Binary Probability Model for Learning Based Image Compression

In this paper, we propose to enhance learned image compression systems with a richer probability model for the latent variables. Previous works model the latents with a Gaussian or a Laplace distribution. Inspired by binary arithmetic coding, we propose to signal the latents with three binary values and one integer, with different probability models. A relaxation method is designed to perform gradient-based training. The richer probability model results in better entropy coding, leading to a lower rate. Experiments under the Challenge on Learned Image Compression (CLIC) test conditions demonstrate that this method achieves an 18% rate saving compared to Gaussian or Laplace models.

preprint2020arXiv

Efficient Uncertainty-aware Decision-making for Automated Driving Using Guided Branching

Decision-making in dense traffic scenarios is challenging for automated vehicles (AVs) due to potentially stochastic behaviors of other traffic participants and perception uncertainties (e.g., tracking noise and prediction errors). Although the partially observable Markov decision process (POMDP) provides a systematic way to incorporate these uncertainties, it quickly becomes computationally intractable when scaled to real-world large-size problems. In this paper, we present an efficient uncertainty-aware decision-making (EUDM) framework, which generates long-term lateral and longitudinal behaviors in complex driving environments in real-time. The computational complexity is controlled to an appropriate level by two novel techniques, namely, the domain-specific closed-loop policy tree (DCP-Tree) structure and the conditional focused branching (CFB) mechanism. The key idea is utilizing domain-specific expert knowledge to guide the branching in both action and intention space. The proposed framework is validated using both onboard sensing data captured by a real vehicle and an interactive multi-agent simulation platform. We also release the code of our framework to accommodate benchmarking.

preprint2020arXiv

Medusa: Blockchain Powered Log Storage System

Blockchain is one of the most heavily invested technologies in recent years. Due to its tamper-proof and decentralization properties, blockchain has become an ideal utility for data storage that is applicable in many real-world industrial scenarios. One important scenario is web logs, which are treated as sources of technical significance and commercial revenue in major internet companies. In this paper, we illustrate our design of a web log storage system based on HyperLedger. HyperLedger yields higher throughput and lower latency compared with other blockchain systems. Alongside its efficiency advantages, HyperLedger is a permissioned blockchain, which is an ideal fit for enterprise software design scenarios.

preprint2020arXiv

Modeling Programs Hierarchically with Stack-Augmented LSTM

Programming language modeling has attracted extensive attention in recent years, and it plays an essential role in program processing fields. Statistical language models, which were initially designed for natural languages, have been generally used for modeling programming languages. However, different from natural languages, programming languages contain explicit and hierarchical structure that is hard to learn by traditional statistical language models. To address this challenge, we propose a novel Stack-Augmented LSTM neural network for programming language modeling. Adding a stack memory component into the LSTM network enables our model to capture the hierarchical information of programs through the PUSH and POP operations, which further allows our model to capture the long-term dependency in the programs. We evaluate the proposed model on three program analysis tasks, i.e., code completion, program classification, and code summarization. Evaluation results show that our proposed model outperforms baseline models in all three tasks, indicating that by capturing the structural information of programs with a stack, our proposed model can represent programs more precisely.

preprint2020arXiv

ModeNet: Mode Selection Network For Learned Video Coding

In this paper, a mode selection network (ModeNet) is proposed to enhance deep learning-based video compression. Inspired by traditional video coding, ModeNet's purpose is to enable competition among several coding modes. The proposed ModeNet learns and conveys a pixel-wise partitioning of the frame, used to assign each pixel to the most suited coding mode. ModeNet is trained alongside the different coding modes to minimize a rate-distortion cost. It is a flexible component which can be generalized to other systems to allow competition between different coding tools. ModeNet's interest is studied on a P-frame coding task, where it is used to design a method for coding a frame given its prediction. ModeNet-based systems achieve compelling performance when evaluated under the Challenge on Learned Image Compression 2020 (CLIC20) P-frame coding track conditions.

preprint2020arXiv

OCoR: An Overlapping-Aware Code Retriever

Code retrieval helps developers reuse code snippets from open-source projects. Given a natural language description, code retrieval aims to search for the most relevant code among a set of code snippets. Existing state-of-the-art approaches apply neural networks to code retrieval. However, these approaches still fail to capture an important feature: overlaps. The overlaps between different names used by different people indicate that two different names may be potentially related (e.g., "message" and "msg"), and the overlaps between identifiers in code and words in natural language descriptions indicate that the code snippet and the description may potentially be related. To address these problems, we propose a novel neural architecture named OCoR, where we introduce two specifically-designed components to capture overlaps: the first embeds identifiers by character to capture the overlaps between identifiers, and the second introduces a novel overlap matrix to represent the degrees of overlap between each natural language word and each identifier. The evaluation was conducted on two established datasets. The experimental results show that OCoR significantly outperforms the existing state-of-the-art approaches and achieves 13.1% to 22.3% improvements. Moreover, we also conducted several in-depth experiments to help understand the performance of different components in OCoR.
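As a rough illustration of the overlap-matrix idea in the abstract above, the sketch below scores every natural language word against every identifier. The scoring rule (longest common subsequence length divided by the longer string's length) and the names `lcs_length` and `overlap_matrix` are assumptions made for this example; the abstract does not state how OCoR computes its overlap degrees.

```python
def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence of a and b (classic DP)."""
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0]
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[len(b)]


def overlap_matrix(words, identifiers):
    """Rows are description words, columns are identifiers, values lie in [0, 1].

    The normalisation by the longer string is an illustrative choice,
    not the definition used in the paper.
    """
    matrix = []
    for word in words:
        row = []
        for ident in identifiers:
            longer = max(len(word), len(ident))
            score = lcs_length(word.lower(), ident.lower()) / longer if longer else 0.0
            row.append(score)
        matrix.append(row)
    return matrix
```

With this rule, `overlap_matrix(["message"], ["msg"])` gives a non-zero score because "m", "s" and "g" appear in order inside "message", which is the kind of relatedness the abstract's example points at.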

preprint2020arXiv

Optical Flow and Mode Selection for Learning-based Video Coding

This paper introduces a new method for inter-frame coding based on two complementary autoencoders: MOFNet and CodecNet. MOFNet aims at computing and conveying the Optical Flow and a pixel-wise coding Mode selection. The optical flow is used to perform a prediction of the frame to code. The coding mode selection enables competition between direct copy of the prediction and transmission through CodecNet. The proposed coding scheme is assessed under the Challenge on Learned Image Compression 2020 (CLIC20) P-frame coding conditions, where it is shown to perform on par with the state-of-the-art video codec ITU/MPEG HEVC. Moreover, the possibility of copying the prediction makes it possible to learn the optical flow in an end-to-end fashion, i.e., without relying on pre-training and/or a dedicated loss term.
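The pixel-wise competition between copying the prediction and using the codec output can be pictured with a small sketch. The linear blend below, the mask convention (1 means copy the prediction, 0 means use the codec output) and the function name `select_modes` are assumptions for illustration; the abstract does not give the exact formula.

```python
def select_modes(prediction, codec_output, copy_mask):
    """Combine two candidate reconstructions pixel by pixel.

    prediction, codec_output and copy_mask are 2-D lists of equal shape.
    copy_mask values lie in [0, 1]: 1 keeps the predicted pixel,
    0 keeps the codec pixel, and values in between blend the two.
    """
    return [
        [m * p + (1.0 - m) * c for p, c, m in zip(p_row, c_row, m_row)]
        for p_row, c_row, m_row in zip(prediction, codec_output, copy_mask)
    ]
```

A soft mask of this kind keeps the selection differentiable, which is one plausible reason a learned system would prefer it over a hard per-pixel switch.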

preprint2020arXiv

Service Ecosystem: A Lens of Smart Society

Intelligence services are playing an increasingly important role in the operation of our society. Exploring the evolution mechanism, boundaries and challenges of the service ecosystem is essential to our ability to realize a smart society, reap its benefits and prevent potential risks. We argue that this necessitates a broad scientific research agenda to study the service ecosystem that incorporates and expands upon the disciplines of computer science and includes insights from across the sciences. We first outline a set of research issues that are fundamental to this emerging field, and then explore the technical, social, legal and institutional challenges in the study of the service ecosystem.

preprint2019arXiv

Epitaxial growth and antiferromagnetism of Sn-substituted perovskite iridate SrIr$_{0.8}$Sn$_{0.2}$O$_3$

5d iridates have shown vast emergent phenomena due to a strong interplay among their lattice, charge and spin degrees of freedom, because of which the potential of the thin-film form in spintronic applications is highly leveraged. Here we have epitaxially stabilized perovskite SrIr$_{0.8}$Sn$_{0.2}$O$_3$ on [001] SrTiO$_3$ substrates through pulsed laser deposition and systematically characterized the structural, electronic and magnetic properties. Physical property measurements reveal an insulating ground state with a weak ferromagnetism in the compressively strained epitaxial film. The octahedral rotation pattern is identified by synchrotron x-ray diffraction, resolving a mix of $a^+b^-c^-$ and $a^-b^+c^-$ domains. X-ray magnetic resonant scattering directly demonstrates a G-type antiferromagnetic structure of the magnetic order and the spin canting nature of the weak ferromagnetism.

preprint2016arXiv

A causal framework for discovering and removing direct and indirect discrimination

Anti-discrimination is an increasingly important task in data science. In this paper, we investigate the problem of discovering both direct and indirect discrimination from the historical data, and removing the discriminatory effects before the data is used for predictive analysis (e.g., building classifiers). We make use of the causal network to capture the causal structure of the data. Then we model direct and indirect discrimination as the path-specific effects, which explicitly distinguish the two types of discrimination as the causal effects transmitted along different paths in the network. Based on that, we propose an effective algorithm for discovering direct and indirect discrimination, as well as an algorithm for precisely removing both types of discrimination while retaining good data utility. Different from previous works, our approaches can ensure that the predictive models built from the modified data will not incur discrimination in decision making. Experiments using real datasets show the effectiveness of our approaches.

preprint2016arXiv

A New Method for Computing $φ$-functions and Their Condition Numbers of Large Sparse Matrices

We propose a new method for computing the $φ$-functions of large sparse matrices with low rank or fast decaying singular values. The key is to reduce the computation of $φ_{\ell}$-functions of a large matrix to $φ_{\ell+1}$-functions of some $r$-by-$r$ matrices, where $r$ is the numerical rank of the large matrix in question. Some error analysis on the new method is given. Furthermore, we propose two novel strategies for estimating 2-norm condition numbers of the $φ$-functions. Numerical experiments illustrate the numerical behavior of the new algorithms and show the effectiveness of our theoretical results.
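For readers unfamiliar with the notation, the block below recalls the standard definition of the $φ$-functions from the exponential-integrator literature. It is background that the abstract presumably relies on, not a statement of the paper's new method.

```latex
\varphi_0(z) = e^{z}, \qquad
\varphi_\ell(z) = \int_0^1 e^{(1-\theta)z}\,\frac{\theta^{\ell-1}}{(\ell-1)!}\,d\theta
               = \sum_{k=0}^{\infty} \frac{z^{k}}{(k+\ell)!}, \quad \ell \ge 1,
\qquad
\varphi_{\ell+1}(z) = \frac{\varphi_\ell(z) - \frac{1}{\ell!}}{z}, \qquad
\varphi_\ell(0) = \frac{1}{\ell!}.
```

The recurrence ties $φ_{\ell+1}$ to $φ_{\ell}$, which gives some intuition for why a reduction of the kind the abstract describes moves between neighbouring indices.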

preprint2016arXiv

A novel three-axis cylindrical hohlraum designed for inertial confinement fusion ignition

A novel ignition hohlraum for indirect-drive inertial confinement fusion, named the three-axis cylindrical hohlraum (TACH), is proposed. TACH is a hohlraum with 6 laser entrance holes (LEHs), made of three cylindrical hohlraums joined orthogonally. Laser beams are injected through every entrance hole with the same incident angle of 55°. The view-factor simulation result shows that the time-varying drive asymmetry of TACH is no more than 1.0% over the whole drive pulse period without any supplementary technology such as beam phasing. The coupling efficiency of TACH is close to that of a 6-LEH spherical hohlraum of corresponding size. Its plasma-filling time is close to that of a typical cylindrical ignition hohlraum. Its laser plasma interaction has backscattering as low as that of the outer cone of the cylindrical ignition hohlraum. Therefore, the proposed hohlraum is a competitive candidate for an ignition hohlraum.

preprint2016arXiv

Achieving non-discrimination in data release

Discrimination discovery and prevention/removal are increasingly important tasks in data mining. Discrimination discovery aims to unveil discriminatory practices on the protected attribute (e.g., gender) by analyzing the dataset of historical decision records, and discrimination prevention aims to remove discrimination by modifying the biased data before conducting predictive analysis. In this paper, we show that the key to discrimination discovery and prevention is to find the meaningful partitions that can be used to provide quantitative evidences for the judgment of discrimination. With the support of the causal graph, we present a graphical condition for identifying a meaningful partition. Based on that, we develop a simple criterion for the claim of non-discrimination, and propose discrimination removal algorithms which accurately remove discrimination while retaining good data utility. Experiments using real datasets show the effectiveness of our approaches.

preprint2016arXiv

Application of Origen2.1 in the decay photon spectrum calculation of spallation products

Origen2.1 is a widely used computer code for calculating the burnup, decay, and processing of radioactive materials. However, the nuclide library of Origen2.1 is intended for existing reactors such as pressurized water reactors. To calculate the photon spectrum released by the decay of spallation products, we have made specific libraries for the ADS tungsten spallation target, based on the results given by the Monte Carlo code FLUKA. All the data used to make the Origen2.1 libraries are obtained from Nuclear Structure & Decay Data (NuDat2.6). The accumulated activity of spallation products and the contribution of nuclides to photon emission are given in this paper.

preprint2016arXiv

Backward and Forward Language Modeling for Constrained Sentence Generation

Recent language models, especially those based on recurrent neural networks (RNNs), make it possible to generate natural language from a learned probability. Language generation has wide applications including machine translation, summarization, question answering, conversation systems, etc. Existing methods typically learn a joint probability of words conditioned on additional information, which is (either statically or dynamically) fed to RNN's hidden layer. In many applications, we are likely to impose hard constraints on the generated texts, i.e., a particular word must appear in the sentence. Unfortunately, existing approaches could not solve this problem. In this paper, we propose a novel backward and forward language model. Provided a specific word, we use RNNs to generate previous words and future words, either simultaneously or asynchronously, resulting in two model variants. In this way, the given word could appear at any position in the sentence. Experimental results show that the generated texts are comparable to sequential LMs in quality.

preprint2016arXiv

Distilling Word Embeddings: An Encoding Approach

Distilling knowledge from a well-trained cumbersome network to a small one has recently become a new research topic, as lightweight neural networks with high performance are particularly in need in various resource-restricted systems. This paper addresses the problem of distilling word embeddings for NLP tasks. We propose an encoding approach to distill task-specific knowledge from a set of high-dimensional embeddings, which can reduce model complexity by a large margin as well as retain high accuracy, showing a good compromise between efficiency and performance. Experiments in two tasks reveal the phenomenon that distilling knowledge from cumbersome embeddings is better than directly training neural networks with small embeddings.

preprint2016arXiv

How Transferable are Neural Networks in NLP Applications?

Transfer learning aims to make use of valuable knowledge in a source domain to help model performance in a target domain. It is particularly important to neural networks, which are very likely to overfit. In some fields like image processing, many studies have shown the effectiveness of neural network-based transfer learning. For neural NLP, however, existing studies have only casually applied transfer learning, and conclusions are inconsistent. In this paper, we conduct systematic case studies and provide an illuminating picture on the transferability of neural networks in NLP.

preprint2016arXiv

Natural Language Inference by Tree-Based Convolution and Heuristic Matching

In this paper, we propose the TBCNN-pair model to recognize entailment and contradiction between two sentences. In our model, a tree-based convolutional neural network (TBCNN) captures sentence-level semantics; then heuristic matching layers like concatenation, element-wise product/difference combine the information in individual sentences. Experimental results show that our model outperforms existing sentence encoding-based approaches by a large margin.
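The heuristic matching step named in the abstract (concatenation, element-wise product, element-wise difference) can be sketched on plain vectors. The function name `heuristic_match` and the ordering of the output features are assumptions for this example; the sentence vectors would come from the TBCNN encoders, which are not modelled here.

```python
def heuristic_match(h1, h2):
    """Combine two sentence vectors into one matching feature vector.

    Output layout (an illustrative choice): [h1 ; h2 ; h1 * h2 ; h1 - h2],
    where * and - are element-wise.
    """
    if len(h1) != len(h2):
        raise ValueError("sentence vectors must have the same dimension")
    product = [a * b for a, b in zip(h1, h2)]
    difference = [a - b for a, b in zip(h1, h2)]
    return list(h1) + list(h2) + product + difference
```

The combined vector has four times the dimension of each input and would typically be fed to a classifier over the entailment labels.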

preprint2016arXiv

Power Side Channels in Security ICs: Hardware Countermeasures

Power side-channel attacks are a very effective cryptanalysis technique that can infer secret keys of security ICs by monitoring the power consumption. Since the emergence of practical attacks in the late 90s, they have been a major threat to many cryptographic-equipped devices including smart cards, encrypted FPGA designs, and mobile phones. Designers and manufacturers of cryptographic devices have in response developed various countermeasures for protection. Attacking methods have also evolved to counteract resistant implementations. This paper reviews foundational power analysis attack techniques and examines a variety of hardware design mitigations. The aim is to highlight exposed vulnerabilities in hardware-based countermeasures for future more secure implementations.

preprint2016arXiv

Resonance of Gaussian electromagnetic field to the high frequency gravitational waves

We consider a Gaussian Beam (GB) resonant system for high frequency gravitational waves (HFGWs) detection. At present, we find the optimal signal strength in theory through setting the magnetic component of GB in a standard gaussian form. Under the synchro-resonance condition, we study the signal strength (i.e., transverse perturbative photon fluxes) from the relic HFGWs (predicted by ordinary inflationary model) and the braneworld HFGWs (from braneworld scenarios). Both of them would generate potentially detectable transverse perturbative photon fluxes (PPFs). Furthermore we find optimal system parameters and the relationship between frequency and effective width of energy fluxes accumulation.

preprint2016arXiv

Sequence to Backward and Forward Sequences: A Content-Introducing Approach to Generative Short-Text Conversation

Using neural networks to generate replies in human-computer dialogue systems is attracting increasing attention over the past few years. However, the performance is not satisfactory: the neural network tends to generate safe, universally relevant replies which carry little meaning. In this paper, we propose a content-introducing approach to neural network-based generative dialogue systems. We first use pointwise mutual information (PMI) to predict a noun as a keyword, reflecting the main gist of the reply. We then propose seq2BF, a "sequence to backward and forward sequences" model, which generates a reply containing the given keyword. Experimental results show that our approach significantly outperforms traditional sequence-to-sequence models in terms of human evaluation and the entropy measure, and that the predicted keyword can appear at an appropriate position in the reply.
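The keyword prediction step can be sketched with a count-based PMI estimate over query-reply pairs. The estimator below (presence counts per pair), the scoring rule in `predict_keyword` (sum of PMI over the query words) and both function names are assumptions for illustration; the abstract only says that PMI is used to predict a noun as the keyword.

```python
import math
from collections import Counter


def train_pmi(pairs):
    """Build a PMI function from (query_tokens, reply_tokens) training pairs.

    Probabilities are estimated as the fraction of pairs in which a word
    (or a query-word/reply-word combination) occurs.
    """
    q_count, r_count, joint = Counter(), Counter(), Counter()
    n = len(pairs)
    for q_tokens, r_tokens in pairs:
        q_set, r_set = set(q_tokens), set(r_tokens)
        q_count.update(q_set)
        r_count.update(r_set)
        joint.update((q, r) for q in q_set for r in r_set)

    def pmi(q_word, r_word):
        co = joint[(q_word, r_word)]
        if co == 0:
            return float("-inf")  # never seen together
        # log[ p(q, r) / (p(q) p(r)) ] with p estimated as count / n
        return math.log(co * n / (q_count[q_word] * r_count[r_word]))

    return pmi


def predict_keyword(pmi, query_tokens, candidate_nouns):
    """Pick the candidate noun with the highest summed PMI against the query."""
    return max(candidate_nouns, key=lambda noun: sum(pmi(q, noun) for q in query_tokens))
```

The caller is assumed to supply the candidate nouns, for example from a part-of-speech-filtered vocabulary; the predicted keyword would then be handed to the backward and forward generators.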

preprint2016arXiv

Superconducting nanowire single photon detector at 532 nm and demonstration in satellite laser ranging

Superconducting nanowire single-photon detectors (SNSPDs) at a wavelength of 532 nm were designed and fabricated for satellite laser ranging (SLR) applications. The NbN SNSPDs were fabricated on one-dimensional photonic crystals with a sensitive-area diameter of 42 μm. The devices were coupled with multimode fiber (φ = 50 μm) and exhibited a maximum system detection efficiency of 75% at an extremely low dark count rate of <0.1 Hz. An SLR experiment using an SNSPD at a wavelength of 532 nm was successfully demonstrated. The results showed a depth ranging with a precision of ~8.0 mm for the target satellite LARES, which is ~3,000 km away from the ground ranging station at the Sheshan Observatory.

preprint2016arXiv

Universal Optimal Estimation of the Polarization of Light with Arbitrary Photon Statistics

A universal and optimal method for the polarimetry of light with arbitrary photon statistics is presented. The method is based on the continuous maximum-likelihood positive operator-valued measure (ML-POVM) for pure polarization states over the surface of the Bloch sphere. The success probability and the mean fidelity are used as the figures of merit to show its performance. The POVM is found to attain the collective bound of polarization estimation with respect to the mean fidelity. As demonstrations, explicit results for the N photon Fock state, the phase-randomized coherent state (Poisson distribution), and the thermal light are obtained. It is found that the estimation performances for the Fock state and the Poisson distribution are almost identical, while that for the thermal light is much worse. This suggests that thermal light leaks less information to an eavesdropper and hence could potentially provide more security in polarization-encoded quantum communication protocols than a single-mode laser beam as customarily considered. Finally, comparisons against an optimal adaptive measurement with classical communications are made to show the better and more stable performance of the continuous ML-POVM.

preprint2015arXiv

$L^p$ Boundedness of rough Bi-parameter Fourier Integral Operators

In this paper, we will investigate the boundedness of the bi-parameter Fourier integral operators (or FIOs for short) of the following form: $$T(f)(x)=\frac{1}{(2π)^{2n}}\int_{\mathbb{R}^{2n}}e^{iφ(x,ξ,η)}\cdot a(x,ξ,η)\cdot\widehat{f}(ξ,η)dξdη,$$ where for $x=(x_1,x_2)\in \mathbb{R}^{n}\times \mathbb{R}^{n}$ and $ξ,η\in \mathbb{R}^{n}\setminus\{0\}$, the amplitude $a(x,ξ,η)\in L^\infty BS^m_ρ$ and the phase function is of the form $φ(x,ξ,η)=φ_1(x_1,ξ)+φ_2(x_2,η)$ with $φ_1,φ_2 \in L^\infty Φ^2 (\mathbb{R}^{n}\times\mathbb{R}^{n}\setminus\{0\})$ and $φ(x, ξ, η)$ satisfies a certain rough non-degeneracy condition. The study of these operators is motivated by the $L^p$ estimates for one-parameter FIOs and bi-parameter Fourier multipliers and pseudo-differential operators. We will first define the bi-parameter FIOs and then study the $L^p$ boundedness of such operators when their phase functions have compact support in frequency variables with certain necessary non-degeneracy conditions. We will then establish the $L^p$ boundedness of the more general FIOs with amplitude $a(x,ξ,η)\in L^\infty BS^m_ρ$ and non-smooth phase function $φ(x,ξ,η)$ on $x$ satisfying a rough non-degeneracy condition.

preprint2015arXiv

Chirped Multi-photon adiabatic passage for a four-level ladder-type Rydberg excitation

We develop a multi-photon adiabatic passage to realize a highly efficient Rydberg excitation in a four-level ladder-type atomic system. The adiabatic passage is based on the existence of a novel quasi-dark state in the cascade excitation system where the frequencies of the lasers are appropriately chirped with time. We also investigate the influence of the interatomic Rydberg interaction on the passage and extend its application to the preparation of anti-blockade Rydberg atom pairs in the Rydberg blockade regime.

preprint2015arXiv

Convolutional Neural Networks over Tree Structures for Programming Language Processing

Programming language processing (similar to natural language processing) is a hot research topic in the field of software engineering; it has also aroused growing interest in the artificial intelligence community. However, different from a natural language sentence, a program contains rich, explicit, and complicated structural information. Hence, traditional NLP models may be inappropriate for programs. In this paper, we propose a novel tree-based convolutional neural network (TBCNN) for programming language processing, in which a convolution kernel is designed over programs' abstract syntax trees to capture structural information. TBCNN is a generic architecture for programming language processing; our experiments show its effectiveness in two different program analysis tasks: classifying programs according to functionality, and detecting code snippets of certain patterns. TBCNN outperforms baseline methods, including several neural models for NLP.

preprint2015arXiv

Dark counts of superconducting nanowire single-photon detector under illumination

An abnormal increase in the system detection efficiency (SDE) was observed for superconducting nanowire single-photon detectors (SNSPDs) when the bias current (Ib) was close to the switching current (Isw). By introducing the time-correlated single-photon counting technique, we investigated the temporal histogram of the detection counts of an SNSPD under illumination. The temporal information helps us to distinguish photon counts from dark counts in the time domain. In this manner, the dark count rate (DCR) under illumination and the accurate SDE can be determined. The DCR under moderate illumination may be significantly larger than the conventional DCR measured without illumination under a high Ib, which causes the abnormal increase in the SDE. The increased DCR may be explained by the suppression of Isw under illumination.

preprint2015arXiv

Discriminative Neural Sentence Modeling by Tree-Based Convolution

This paper proposes a tree-based convolutional neural network (TBCNN) for discriminative sentence modeling. Our models leverage either constituency trees or dependency trees of sentences. The tree-based convolution process extracts sentences' structural features, and these features are aggregated by max pooling. Such architecture allows short propagation paths between the output layer and underlying feature detectors, which enables effective structural feature learning and extraction. We evaluate our models on two tasks: sentiment analysis and question classification. In both experiments, TBCNN outperforms previous state-of-the-art results, including existing neural networks and dedicated feature/rule engineering. We also make efforts to visualize the tree-based convolution process, shedding light on how our models work.

preprint2015arXiv

Dynamical phases in a one-dimensional chain of Heterospecies Rydberg atoms with next-nearest neighbor interactions

We theoretically investigate the dynamical phase diagram of a one-dimensional chain of laser-excited two-species Rydberg atoms. The existence of a variety of unique dynamical phases in the experimentally-achievable parameter region is predicted under the mean-field approximation, and the change of those phases when the effect of the next-nearest neighbor interaction is included is further discussed. In particular we find the competition of the strong Rydberg-Rydberg interactions and the optical excitation imbalance can lead to the presence of complex multiple chaotic phases, which are highly sensitive to the initial Rydberg-state population and the strength of the next-nearest neighbor interactions.

preprint2015arXiv

Efficiency limitation for realizing an atom-molecule adiabatic transfer based on a chainwise system

In a recent work we have developed a robust chainwise atom-molecule adiabatic passage scheme to produce ultracold ground-state molecules via photo-associating free atoms [J. Qian et al., Phys. Rev. A 81, 013632 (2010)]. With the help of intermediate auxiliary levels, the pump laser intensity required in the atomic photo-association process can be greatly reduced. In the present work, we extend the scheme to a more generalized (2$n$+1)-level system and investigate the efficiency limitation for it. As the number of intermediate levels and auxiliary lasers increases, the atom-molecule adiabatic passage would be gradually closed, leading to a poor transfer efficiency. For the purpose of enhancing the efficiency, we present various optimization approaches to the laser parameters, involving order number $n$, relative strength ratio and absolute strength. We show there can remain a limit on the population transfer efficiency given by a three-level $Λ$ system. In addition, we illustrate the importance of selecting an appropriate number of intermediate levels for maintaining a highly efficient transfer under mild experimental conditions.

preprint2015arXiv

Equivalence of critical and subcritical sharp Trudinger-Moser-Adams inequalities

Sharp Trudinger-Moser inequalities on the first order Sobolev spaces and their analogous Adams inequalities on high order Sobolev spaces play an important role in geometric analysis, partial differential equations and other branches of modern mathematics. Such geometric inequalities have been studied extensively by many authors in recent years and there is a vast literature. There are two types of such optimal inequalities: critical and subcritical sharp inequalities, both with best constants. Critical sharp inequalities are under the restriction of the full Sobolev norms for the functions under consideration, while the subcritical inequalities are under the restriction of the partial Sobolev norms for the functions under consideration. There are subtle differences between these two types of inequalities. Surprisingly, we prove in this paper that these critical and subcritical Trudinger-Moser and Adams inequalities are actually equivalent. Moreover, we also establish the asymptotic behavior of the supremum for the subcritical Trudinger-Moser and Adams inequalities on the entire Euclidean spaces (Theorem 1.1 and Theorem 1.3) and provide a precise relationship between the supremums for the critical and subcritical Trudinger-Moser and Adams inequalities (Theorem 1.2 and Theorem 1.4).

preprint2015arXiv

Global existence and steady states of a two competing species Keller-Segel chemotaxis model

We study a one-dimensional quasilinear system proposed by J. Tello and M. Winkler [19] which models the population dynamics of two competing species attracted by the same chemical. The kinetics terms of the interacting species are chosen to be of the Lotka-Volterra type. We prove the existence of global bounded and classical solutions for all chemoattraction rates. Under homogeneous Neumann boundary conditions, we establish the existence of nonconstant steady states by local bifurcation theory. The stability of the bifurcating solutions is also obtained when the diffusivity of both species is large. Finally, we perform extensive numerical studies to demonstrate the formation of stable positive steady states with various interesting spatial structures.

preprint2015arXiv

Holographic p-wave superconductor models with Weyl corrections

We study the effect of the Weyl corrections on the holographic p-wave dual models in the backgrounds of AdS soliton and AdS black hole via a Maxwell complex vector field model by using the numerical and analytical methods. We find that, in the soliton background, the Weyl corrections do not influence the properties of the holographic p-wave insulator/superconductor phase transition, which is different from that of the Yang-Mills theory. However, in the black hole background, we observe that similar to the Weyl correction effects in the Yang-Mills theory, the higher Weyl corrections make it easier for the p-wave metal/superconductor phase transition to be triggered, which shows that these two p-wave models with Weyl corrections share some similar features for the condensation of the vector operator.

preprint2015arXiv

Measurement-device-independent quantum key distribution over untrustful metropolitan network

Quantum cryptography holds the promise to establish an information-theoretically secure global network. All field tests of metropolitan-scale quantum networks to date are based on trusted relays. The security critically relies on the accountability of the trusted relays, which will break down if the relay is dishonest or compromised. Here, we construct a measurement-device-independent quantum key distribution (MDIQKD) network in a star topology over a 200-square-kilometer metropolitan area, which is secure against untrustful relays and against all detection attacks. In the field test, our system ran continuously for one week with a secure key rate ten times larger than the previous result. Our results demonstrate that the MDIQKD network, combining the best of both worlds, security and practicality, constitutes an appealing solution to secure metropolitan communications.

preprint2015arXiv

On End-to-End Program Generation from User Intention by Deep Neural Networks

This paper envisions an end-to-end program generation scenario using recurrent neural networks (RNNs): Users can express their intention in natural language; an RNN then automatically generates corresponding code in a character-by-character fashion. We demonstrate its feasibility through a case study and empirical analysis. To make such a technique fully useful in practice, we also point out several cross-disciplinary challenges, including modeling user intention, providing datasets, improving model architectures, etc. Although much long-term research remains to be done in this new field, we believe end-to-end program generation would become a reality in future decades, and we are looking forward to its practice.

preprint2014arXiv

A framework of the harmonic Arnoldi method for evaluating $φ$-functions with applications to exponential integrators

In recent years, a great deal of attention has been focused on numerically solving exponential integrators. The important ingredient to the implementation of exponential integrators is the efficient and accurate evaluation of the so-called $φ$-functions on a given vector. The Krylov subspace method is an important technique for this problem. For this type of method, however, restarts become essential for the sake of storage requirements or due to the growing computational complexity of evaluating the matrix function on a Hessenberg matrix of growing size. Another problem in computing $φ$-functions is the lack of a clear residual notion. The contribution of this work is threefold. First, we introduce a framework of the harmonic Arnoldi method for $φ$-functions, which is based on the residual and the oblique projection technique. Second, we establish the relationship between the harmonic Arnoldi approximation and the classical Arnoldi approximation, and compare the harmonic Arnoldi method with the Arnoldi method from a theoretical point of view. Third, we apply the thick-restarting strategy to the harmonic Arnoldi method, and propose a thick-restarted harmonic Arnoldi algorithm for evaluating $φ$-functions. An advantage of the new algorithm is that we can compute several $φ$-functions simultaneously in the same search subspace. We show the merit of augmenting approximate eigenvectors in the search subspace, and give insight into the relationship between the error and the residual of $φ$-functions. Numerical experiments show the superiority of our new algorithm over many state-of-the-art algorithms for the computation of $φ$-functions.

preprint2014arXiv

Building Program Vector Representations for Deep Learning

Deep learning has made significant breakthroughs in various fields of artificial intelligence. Advantages of deep learning include the ability to capture highly complicated features, weak involvement of human engineering, etc. However, it is still virtually impossible to use deep learning to analyze programs since deep architectures cannot be trained effectively with pure back propagation. In this pioneering paper, we propose the "coding criterion" to build program vector representations, which are the premise of deep learning for program analysis. Our representation learning approach directly makes deep learning a reality in this new field. We evaluate the learned vector representations both qualitatively and quantitatively. We conclude, based on the experiments, the coding criterion is successful in building program representations. To evaluate whether deep learning is beneficial for program analysis, we feed the representations to deep neural networks, and achieve higher accuracy in the program classification task than "shallow" methods, such as logistic regression and the support vector machine. This result confirms the feasibility of deep learning to analyze programs. It also gives primary evidence of its success in this new field. We believe deep learning will become an outstanding technique for program analysis in the near future.

preprint2014arXiv

Characterization of superconducting nanowire single-photon detector with artificial constrictions

Statistical studies on the performance of different superconducting nanowire single-photon detectors (SNSPDs) on one chip suggested that random constrictions existed in the nanowire that were barely registered by scanning electron microscopy. With the aid of advanced e-beam lithography, artificial geometric constrictions were fabricated on SNSPDs as well as single nanowires. In this way, we studied the influence of artificial constrictions on SNSPDs in a straightforward manner. By introducing artificial constrictions with different wire widths in single nanowires, we concluded that the dark counts of SNSPDs originate from a single constriction. Further introducing artificial constrictions in SNSPDs, we studied the relationship between detection efficiency and kinetic inductance and the bias current, confirming the hypothesis that constrictions exist in SNSPDs.

preprint2014arXiv

Field Test of Measurement-Device-Independent Quantum Key Distribution

A major obstacle to practical applications of quantum key distribution (QKD) networks is the variety of attacks on detection. The measurement-device-independent QKD (MDIQKD) protocol is immune to all these attacks and thus a strong candidate for network security. Recently, several proof-of-principle demonstrations of MDIQKD have been performed. Although novel, those experiments are implemented in the laboratory with secure key rates less than 0.1 bps. Besides, they need frequent manual calibration to maintain the system performance. These aspects render these demonstrations far from practicability. Thus, justification is extremely crucial for practical deployment into the field environment. Here, by developing an automatic feedback MDIQKD system operated at a high clock rate, we perform a field test via a deployed fiber network of 30 km total length, achieving a 16.9 bps secure key rate. The result lays the foundation for a global quantum network which can shield from all the detection-side attacks.

preprint2014arXiv

Measurement-device-independent quantum key distribution over 200 km

The measurement-device-independent quantum key distribution (MDIQKD) protocol is immune to all attacks on detection and guarantees information-theoretical security even with imperfect single photon detectors. Recently, several proof-of-principle demonstrations of MDIQKD have been achieved. Those experiments, although novel, are implemented over limited distances with a key rate less than 0.1 bps. Here, by developing a 75 MHz clock rate fully-automatic and highly-stable system, and superconducting nanowire single photon detectors with detection efficiencies more than 40%, we extend the secure transmission distance of MDIQKD to 200 km and achieve a secure key rate three orders of magnitude higher. These results pave the way towards a quantum network with measurement-device-independent security.

preprint2014arXiv

Nonideal optical cavity structure of superconducting nanowire single photon detector

Optical cavity structure has been proven to be a crucial factor for obtaining high detection efficiency in superconducting nanowire single photon detectors (SNSPDs). Practically, complicated fabrication processes may result in a non-ideal optical cavity structure. The cross-sectional transmission electron microscope (TEM) image of the SNSPD fabricated in this study shows unexpected arc-shaped optical cavities which could have originated from the over-etching of the SiO2 layer while defining the NbN nanowire. The effects of the arc-shaped optical cavity structure, such as the wavelength dependence of the optical absorption efficiency for different polarizations, were analyzed by performing optical simulations using the finite-difference time-domain method. The central wavelength of the device is found to exhibit a blue shift owing to the arced cavity structure. This effect is equivalent to that of a flat cavity with a reduced height. The results may give an interesting reference for SNSPD design and fabrication.

preprint2014arXiv

Superconducting nanowire single photon detector with on-chip bandpass filter

Dark count rate (DCR) is one of the key parameters limiting the performance of the superconducting nanowire single photon detector (SNSPD). We have designed a multi-layer film bandpass filter that can be integrated onto the SNSPD to suppress the dark counts contributed by the stray light and blackbody radiation of the fiber. The bandpass filter is composed of 16 SiO2/Si bilayers deposited onto the backside of a thermally oxidized Si substrate. The substrate shows an excellent bandpass filter effect and provides a high transmittance of 88% at the central wavelength of the pass band, which is the same as that of the bare substrate. The SNSPDs fabricated on the substrate integrated with the bandpass filter show conspicuous wavelength-sensitive detection efficiency. The background dark count rate is reduced by two orders of magnitude to sub-Hz compared with the conventional SNSPD (a few tens of Hz). The detector exhibits a system detection efficiency of 56% at a DCR of 1 Hz, with the measured minimal noise equivalent power reaching $2\times10^{-19}$ W/Hz$^{1/2}$.

preprint2013arXiv

Jitter analysis of a superconducting nanowire single photon detector

Jitter is one of the key parameters for a superconducting nanowire single photon detector (SNSPD). Using an optimized time-correlated single photon counting system for jitter measurement, we extensively studied the dependence of system jitter on the bias current and working temperature. The signal-to-noise ratio of the single-photon-response pulse was proven to be an important factor in system jitter. The final system jitter was reduced to 18 ps by using a high-critical-current SNSPD, which showed an intrinsic SNSPD jitter of 15 ps. A laser ranging experiment using a 15-ps SNSPD achieved a record depth resolution of 3 mm at a wavelength of 1550 nm.

preprint2010arXiv

Low energy cluster beam deposited BN films as the cascade for Field Emission

The atomic deposited BN films with the thickness of nanometers (ABN) were prepared by the radio frequency magnetron sputtering method and the nanostructured BN films (CBN) were prepared by Low Energy Cluster Beam Deposition. UV-Vis absorption measurement proves the band gap of 4.27 eV, and field emission measurements of the BN films were carried out. F-N plots of all the samples give a good fitting and demonstrate the F-N tunneling of the emission process. The emission of ABN begins at the electric field of 14.6 V/μm while that of CBN starts at 5.10 V/μm. Emission current density of 1 mA/cm$^2$ for ABN needs the field of 20 V/μm while that of CBN needs only 12.1 V/μm. The cluster-deposited BN on n-type silicon substrate proves a good performance in terms of the lower gauge voltage, more emission sites and higher electron intensity and seems a promising substitute for the cascade of Field Emission.

88 published item(s)

Retrieving Any Relevant Moments: Benchmark and Models for Generalized Moment Retrieval

Daily Land Surface Temperature Reconstruction in Landsat Cross-Track Areas Using Deep Ensemble Learning With Uncertainty Quantification

Fixed-Domain Asymptotics Under Vecchia's Approximation of Spatial Process Likelihoods

A discontinuous Galerkin method for nonlinear biharmonic Schrödinger equations

A high order finite difference method for the elastic wave equation in bounded domains with nonconforming interfaces

A local energy-based discontinuous Galerkin method for fourth order semilinear wave equations

A Syntax-Guided Edit Decoder for Neural Program Repair

A Unified and Biologically-Plausible Relational Graph Representation of Vision Transformers

Achieving Long-Term Fairness in Sequential Decision Making

AGA: An Accelerated Greedy Additional Algorithm for Test Case Prioritization

An Energy-Based Discontinuous Galerkin Method with Tame CFL Numbers for the Wave Equation

Analysis on the composite nature of the light scalar mesons $f_{0}(980)$ and $a_0(980)$

Automatically Discovering Novel Visual Categories with Self-supervised Prototype Learning

Composite nature of $Z_b$ states from data analysis

Coupling Visual Semantics of Artificial Neural Networks and Human Brain Function via Synchronized Activations

Disentangling Spatial-Temporal Functional Brain Networks via Twin-Transformers

DrugOOD: Out-of-Distribution (OOD) Dataset Curator and Benchmark for AI-aided Drug Discovery -- A Focus on Affinity Prediction Problems with Noise Annotations

EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification

Eye-gaze-guided Vision Transformer for Rectifying Shortcut Learning

FB-MSTCN: A Full-Band Single-Channel Speech Enhancement Method Based on Multi-Scale Temporal Convolutional Network

Floodgate: inference for model-free variable importance

GDsmith: Detecting Bugs in Graph Database Engines

Generalized Equivariance and Preferential Labeling for GNN Node Classification

Hyperspectral Imaging for cherry tomato

Intra-Modal Constraint Loss For Image-Text Retrieval

Mask-guided Vision Transformer (MG-ViT) for Few-Shot Learning

MetaNOR: A Meta-Learnt Nonlocal Operator Regression Approach for Metamaterial Modeling

On the hyper-singular boundary integral equation methods for dynamic poroelasticity: three dimensional case

Pathfinder: Parallel quasi-Newton variational inference

Phase transition of eigenvalues in deformed Ginibre ensembles

Representing Brain Anatomical Regularity and Variability by Few-Shot Embedding

Revisiting Linearized Bregman Iterations under Lipschitz-like Convexity Condition

Series Photo Selection via Multi-view Graph Learning

Stability analysis of the Tsallis holographic dark energy model

Taming Hybrid-Cloud Fast and Scalable Graph Analytics at Twitter

Trajectory Prediction with Graph-based Dual-scale Context Fusion

Weakly Aligned Feature Fusion for Multimodal Object Detection

Adaptively Sketched Bregman Projection Methods for Linear Systems

Extracting Concise Bug-Fixing Patches from Human-Written Patches in Version Control Systems

A Fixation-based 360° Benchmark Dataset for Salient Object Detection

A Study of Bug Resolution Characteristics in Popular Programming Languages

An accurate hyper-singular boundary integral equation method for dynamic poroelasticity in two dimensions

An energy-based discontinuous Galerkin method for semilinear wave equations

Bi-parameter trilinear Fourier multipliers and pseudo-differential operators with flag symbols

Binary Probability Model for Learning Based Image Compression

Efficient Uncertainty-aware Decision-making for Automated Driving Using Guided Branching

Medusa: Blockchain Powered Log Storage System

Modeling Programs Hierarchically with Stack-Augmented LSTM

ModeNet: Mode Selection Network For Learned Video Coding

OCoR: An Overlapping-Aware Code Retriever

Optical Flow and Mode Selection for Learning-based Video Coding

Service Ecosystem: A Lens of Smart Society

Epitaxial growth and antiferromagnetism of Sn-substituted perovskite iridate SrIr$_{0.8}$Sn$_{0.2}$O$_3$

A causal framework for discovering and removing direct and indirect discrimination

A New Method for Computing $φ$-functions and Their Condition Numbers of Large Sparse Matrices

A novel three-axis cylindrical hohlraum designed for inertial confinement fusion ignition

Achieving non-discrimination in data release

Application of Origen2.1 in the decay photon spectrum calculation of spallation products

Backward and Forward Language Modeling for Constrained Sentence Generation

Distilling Word Embeddings: An Encoding Approach

How Transferable are Neural Networks in NLP Applications?

Natural Language Inference by Tree-Based Convolution and Heuristic Matching

Power Side Channels in Security ICs: Hardware Countermeasures

Resonance of Gaussian electromagnetic field to the high frequency gravitational waves

Sequence to Backward and Forward Sequences: A Content-Introducing Approach to Generative Short-Text Conversation

Superconducting nanowire single photon detector at 532 nm and demonstration in satellite laser ranging

Universal Optimal Estimation of the Polarization of Light with Arbitrary Photon Statistics

$L^p$ Boundedness of rough Bi-parameter Fourier Integral Operators

Chirped Multi-photon adiabatic passage for a four-level ladder-type Rydberg excitation

Convolutional Neural Networks over Tree Structures for Programming Language Processing

Dark counts of superconducting nanowire single-photon detector under illumination

Discriminative Neural Sentence Modeling by Tree-Based Convolution

Dynamical phases in a one-dimensional chain of Heterospecies Rydberg atoms with next-nearest neighbor interactions

Efficiency limitation for realizing an atom-molecule adiabatic transfer based on a chainwise system