Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
114works
0followers
43topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

114 published item(s)

preprint2026arXiv

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Recent visual generation models have made major progress in photorealism, typography, instruction following, and interactive editing, yet they still struggle with spatial reasoning, persistent state, long-horizon consistency, and causal understanding. We argue that the field should move beyond appearance synthesis toward intelligent visual generation: plausible visuals grounded in structure, dynamics, domain knowledge, and causal relations. To frame this shift, we introduce a five-level taxonomy: Atomic Generation, Conditional Generation, In-Context Generation, Agentic Generation, and World-Modeling Generation, progressing from passive renderers to interactive, agentic, world-aware generators. We analyze key technical drivers, including flow matching, unified understanding-and-generation models, improved visual representations, post-training, reward modeling, data curation, synthetic data distillation, and sampling acceleration. We further show that current evaluations often overestimate progress by emphasizing perceptual quality while missing structural, temporal, and causal failures. By combining benchmark review, in-the-wild stress tests, and expert-constrained case studies, this roadmap offers a capability-centered lens for understanding, evaluating, and advancing the next generation of intelligent visual generation systems.

preprint2024arXiv

Black holes with scalar hair: Extending from and beyond the Schwarzschild solution

We construct novel scalarized black hole (BH) solutions beyond the general relativity (GR) framework. These scalarized BH solutions are extended from the Schwarzschild one and the non-Schwarzschild one in the pure Einstein-Weyl gravity. By studying the BH entropy and free energy, we demonstrate that the scalarized BH extending from the Schwarzschild one exhibits thermodynamically preferred. We obtain these novel solutions by directly solving the full fourth-order equations of motion. This narrows the problematic solution space obtained by commonly adopted second-order reduction to physically valid spaces. Our findings also unveil the evasion of the no-hair theorem within the realm of higher-derivative gravity.

preprint2024arXiv

eCIL-MU: Embedding based Class Incremental Learning and Machine Unlearning

New categories may be introduced over time, or existing categories may need to be reclassified. Class incremental learning (CIL) is employed for the gradual acquisition of knowledge about new categories while preserving information about previously learned ones in such dynamic environments. It might also be necessary to also eliminate the influence of related categories on the model to adapt to reclassification. We thus introduce class-level machine unlearning (MU) within CIL. Typically, MU methods tend to be time-consuming and can potentially harm the model's performance. A continuous stream of unlearning requests could lead to catastrophic forgetting. To address these issues, we propose a non-destructive eCIL-MU framework based on embedding techniques to map data into vectors and then be stored in vector databases. Our approach exploits the overlap between CIL and MU tasks for acceleration. Experiments demonstrate the capability of achieving unlearning effectiveness and orders of magnitude (upto $\sim 278\times$) of acceleration.

preprint2023arXiv

BERT-ERC: Fine-tuning BERT is Enough for Emotion Recognition in Conversation

Previous works on emotion recognition in conversation (ERC) follow a two-step paradigm, which can be summarized as first producing context-independent features via fine-tuning pretrained language models (PLMs) and then analyzing contextual information and dialogue structure information among the extracted features. However, we discover that this paradigm has several limitations. Accordingly, we propose a novel paradigm, i.e., exploring contextual information and dialogue structure information in the fine-tuning step, and adapting the PLM to the ERC task in terms of input text, classification structure, and training strategy. Furthermore, we develop our model BERT-ERC according to the proposed paradigm, which improves ERC performance in three aspects, namely suggestive text, fine-grained classification module, and two-stage training. Compared to existing methods, BERT-ERC achieves substantial improvement on four datasets, indicating its effectiveness and generalization capability. Besides, we also set up the limited resources scenario and the online prediction scenario to approximate real-world scenarios. Extensive experiments demonstrate that the proposed paradigm significantly outperforms the previous one and can be adapted to various scenes.

preprint2023arXiv

High-fidelity ptychographic imaging of highly periodic structures enabled by vortex high harmonic beams

Ptychographic Coherent Diffractive Imaging enables diffraction-limited imaging of nanoscale structures at extreme ultraviolet and x-ray wavelengths, where high-quality image-forming optics are not available. However, its reliance on a set of diverse diffraction patterns makes it challenging to use ptychography to image highly periodic samples, limiting its application to defect inspection for electronic and photonic devices. Here, we use a vortex high harmonic light beam driven by a laser carrying orbital angular momentum to implement extreme ultraviolet ptychographic imaging of highly periodic samples with high fidelity and reliability. We also demonstrate, for the first time to our knowledge, ptychographic imaging of an isolated, near-diffraction-limited defect in an otherwise periodic sample using vortex high harmonic beams. This enhanced metrology technique can enable high-fidelity imaging and inspection of highly periodic structures for next-generation nano, energy, photonic and quantum devices.

preprint2022arXiv

$(2+1)$-Dimensional Black Holes in $f(R,ϕ)$ Gravity

We consider a $f(R)$ gravity theory in $(2+1)$-dimensions with a self-interacting scalar field non-minimally coupled to gravity. Without specifying the form of the $f(R)$ function, solving the field equations we find that the Ricci scalar receives a non-linear correction term which breaks the conformal invariance and leads to a massless black hole solution. When the non-linear term decouples, we get a well known hairy black hole solution with the scalar field conformally coupled to gravity. We also find that the entropy of our black hole may be higher than the corresponding conformal black hole which indicates that our solution may be thermodynamically preferred.

preprint2022arXiv

A Numerical Reasoning Question Answering System with Fine-grained Retriever and the Ensemble of Multiple Generators for FinQA

The numerical reasoning in the financial domain -- performing quantitative analysis and summarizing the information from financial reports -- can greatly increase business efficiency and reduce costs of billions of dollars. Here, we propose a numerical reasoning question answering system to answer numerical reasoning questions among financial text and table data sources, consisting of a retriever module, a generator module, and an ensemble module. Specifically, in the retriever module, in addition to retrieving the whole row data, we innovatively design a cell retriever that retrieves the gold cells to avoid bringing unrelated and similar cells in the same row to the inputs of the generator module. In the generator module, we utilize multiple generators to produce programs, which are operation steps to answer the question. Finally, in the ensemble module, we integrate multiple programs to choose the best program as the output of our system. In the final private test set in FinQA Competition, our system obtains 69.79 execution accuracy.

preprint2022arXiv

BCOT: A Markerless High-Precision 3D Object Tracking Benchmark

Template-based 3D object tracking still lacks a high-precision benchmark of real scenes due to the difficulty of annotating the accurate 3D poses of real moving video objects without using markers. In this paper, we present a multi-view approach to estimate the accurate 3D poses of real moving objects, and then use binocular data to construct a new benchmark for monocular textureless 3D object tracking. The proposed method requires no markers, and the cameras only need to be synchronous, relatively fixed as cross-view and calibrated. Based on our object-centered model, we jointly optimize the object pose by minimizing shape re-projection constraints in all views, which greatly improves the accuracy compared with the single-view approach, and is even more accurate than the depth-based method. Our new benchmark dataset contains 20 textureless objects, 22 scenes, 404 video sequences and 126K images captured in real scenes. The annotation error is guaranteed to be less than 2mm, according to both theoretical analysis and validation experiments. We re-evaluate the state-of-the-art 3D object tracking methods with our dataset, reporting their performance ranking in real scenes. Our BCOT benchmark and code can be found at https://ar3dv.github.io/BCOT-Benchmark/.

preprint2022arXiv

C3KG: A Chinese Commonsense Conversation Knowledge Graph

Existing commonsense knowledge bases often organize tuples in an isolated manner, which is deficient for commonsense conversational models to plan the next steps. To fill the gap, we curate a large-scale multi-turn human-written conversation corpus, and create the first Chinese commonsense conversation knowledge graph which incorporates both social commonsense knowledge and dialog flow information. To show the potential of our graph, we develop a graph-conversation matching approach, and benchmark two graph-grounded conversational tasks.

preprint2022arXiv

CompoundE: Knowledge Graph Embedding with Translation, Rotation and Scaling Compound Operations

Translation, rotation, and scaling are three commonly used geometric manipulation operations in image processing. Besides, some of them are successfully used in developing effective knowledge graph embedding (KGE) models such as TransE and RotatE. Inspired by the synergy, we propose a new KGE model by leveraging all three operations in this work. Since translation, rotation, and scaling operations are cascaded to form a compound one, the new model is named CompoundE. By casting CompoundE in the framework of group theory, we show that quite a few scoring-function-based KGE models are special cases of CompoundE. CompoundE extends the simple distance-based relation to relation-dependent compound operations on head and/or tail entities. To demonstrate the effectiveness of CompoundE, we conduct experiments on three popular KG completion datasets. Experimental results show that CompoundE consistently achieves the state of-the-art performance.

preprint2022arXiv

Continuous-stage symplectic adapted exponential methods for charged-particle dynamics with arbitrary electromagnetic fields

This paper is devoted to the numerical symplectic approximation of the charged-particle dynamics (CPD) with arbitrary electromagnetic fields. By utilizing continuous-stage methods and exponential integrators, a general class of symplectic methods is formulated for CPD under a homogeneous magnetic field. Based on the derived symplectic conditions, two practical symplectic methods up to order four are constructed where the error estimates show that the proposed second order scheme has a uniform accuracy in the position w.r.t. the strength of the magnetic field. Moreover, the symplectic methods are extended to CPD under non-homogeneous magnetic fields and three algorithms are formulated. Rigorous error estimates are investigated for the proposed methods and one method is proved to have a uniform accuracy in the position w.r.t. the strength of the magnetic field. Numerical experiments are provided for CPD under homogeneous and non-homogeneous magnetic fields, and the numerical results support the theoretical analysis and demonstrate the remarkable numerical behavior of our methods.

preprint2022arXiv

Critical phenomena in dynamical scalarization of charged black hole

We report a new black hole scalarization mechanism and disclose novel dynamical critical phenomena in the process of the nonlinear accretion of the scalar field into black holes. The accretion process can transform a seed black hole into a final scalarized or bald black hole, depending on the initial parameter of the scalar field $p$. There is a critical parameter $p_{\ast}$ and near it all intermediate solutions are attracted to a critical solution and stay there for a time scaling as $T\propto-γ\ln|p-p_{\ast}|$. At late times, the solutions evolve into scalarized black holes if $p>p_{\ast}$, or bald black holes if $p<p_{\ast}$. The final masses of the resulting scalarized/bald black holes satisfy power-laws $M_{p}-M_{\pm}\propto|p-p_{\ast}|^{γ_{\pm}}$ where $M_{\pm}$ are the masses of the scalarized/bald black holes when $p\to p_\ast$ from above/below, and $γ_{\pm}$ the corresponding exponents.

preprint2022arXiv

Cross-Domain Recommendation to Cold-Start Users via Variational Information Bottleneck

Recommender systems have been widely deployed in many real-world applications, but usually suffer from the long-standing user cold-start problem. As a promising way, Cross-Domain Recommendation (CDR) has attracted a surge of interest, which aims to transfer the user preferences observed in the source domain to make recommendations in the target domain. Previous CDR approaches mostly achieve the goal by following the Embedding and Mapping (EMCDR) idea which attempts to learn a mapping function to transfer the pre-trained user representations (embeddings) from the source domain into the target domain. However, they pre-train the user/item representations independently for each domain, ignoring to consider both domain interactions simultaneously. Therefore, the biased pre-trained representations inevitably involve the domain-specific information which may lead to negative impact to transfer information across domains. In this work, we consider a key point of the CDR task: what information needs to be shared across domains? To achieve the above idea, this paper utilizes the information bottleneck (IB) principle, and proposes a novel approach termed as CDRIB to enforce the representations encoding the domain-shared information. To derive the unbiased representations, we devise two IB regularizers to model the cross-domain/in-domain user-item interactions simultaneously and thereby CDRIB could consider both domain interactions jointly for de-biasing.

preprint2022arXiv

Cuspy and fractured black hole shadows in a toy model with axisymmetry

Cuspy shadow was first reported for hairy rotating black holes, whose metrics deviate significantly from the Kerr one. The non-smooth edge of the shadow is attributed to a transition between different branches of unstable but bounded orbits, known as the fundamental photon orbits, which end up at the light rings. In searching for a minimal theoretical setup to reproduce such a salient feature, in this work, we devise a toy model with axisymmetry, a slowly rotating Kerr black hole enveloped by a thin slowly rotating dark matter shell. Despite its simplicity, we show rich structures regarding fundamental photon orbits explicitly in such a system. We observe two disconnected branches of unstable spherical photon orbits, and the jump between them gives rise to a pair of cusps in the resultant black hole shadow. Besides the cuspy shadow, we explore other intriguing phenomena when the Maxwell construction cannot be established. We find that it is possible to have an incomplete arc of Einstein rings and a &#34;fractured&#34; shadow. The potential astrophysical significance of the corresponding findings is addressed.

preprint2022arXiv

Dynamical spontaneous scalarization in Einstein-Maxwell-scalar theory

We study the linear instability and the nonlinear dynamical evolution of the Reissner-Nordström (RN) black hole in the Einstein-Maxwell-scalar theory in asymptotic flat spacetime. We focus on the coupling function $f(ϕ)=e^{-bϕ^2}$ which allows both the scalar-free RN solution and scalarized black hole solution. We first present the evolution of system parameters during dynamic scalarization. For parameter regions where spontaneous scalarization occurs, we find that the evolution of the scalar field at the horizon is dominated by the fundamental unstable mode from linear analysis at early times. At late times, the nonlinear evolution can be viewed as the perturbation of scalarized black holes.

preprint2022arXiv

Electrified Autonomous Freight Benefit analysis on Fleet, Infrastructure and Grid Leveraging Grid-Electrified Mobility (GEM) Model

This paper analyzes the potential benefit of heavy-duty vehicle (HDV) electrification and automation on fleet cost, infrastructure cost, grid, and environmental impact. In this work, we extended the vehicle electrification benefit analysis tool: Grid-Electrified Mobility (GEM) model, which had primarily been used to study light-duty passenger vehicles (LDVs), to analyze the heavy-duty vehicle electrification. The extended model is derived for freight transportation and key results and findings on the impact of freight electrification and automation are presented and discussed.

preprint2022arXiv

Forecasts on Interacting Dark Energy from 21-cm Angular Power Spectrum with BINGO and SKA observations

Neutral hydrogen (HI) intensity mapping is a promising technique to probe the large-scale structure of the Universe, improving our understanding on the late-time accelerated expansion. In this work, we first scrutinize how an alternative cosmology, interacting dark energy (IDE), can affect the 21-cm angular power spectrum relative to the concordance $Λ$CDM model. We re-derive the 21-cm brightness temperature fluctuation in the context of such interaction and uncover an extra new contribution. Then we estimate the noise level of three upcoming HI intensity mapping surveys, BINGO, SKA1-MID Band$\,$1 and Band$\,$2, respectively, and employ a Fisher matrix approach to forecast their constraints on the IDE model. We find that while $\textit{Planck}\,$ 2018 maintains its dominion over early-Universe parameter constraints, BINGO and SKA1-MID Band$\,$2 put complementary bounding to the latest CMB measurements on dark energy equation of state $w$, the interacting strength $λ_i$ and the reduced Hubble constant $h$, and SKA1-MID Band$\,$1 even outperforms $\textit{Planck}\,$ 2018 in these late-Universe parameter constraints. The expected minimum uncertainties are given by SKA1-MID Band$\,$1+$\textit{Planck}\,$: $\sim 0.34\%$ on $w$, $\sim 0.22\%$ on $h$, $\sim 0.64\%$ on HI bias $b_{\rm HI}$, and an absolute uncertainty of about $3\times10^{-4}$ ($7\times10^{-4}$) on $λ_{1}$ ($λ_{2}$). Moreover, we quantify the effects from systematics of the redshift bin number, redshift-space distortions, foreground residuals and uncertainties on the measured HI fraction, $Ω_{\mathrm{HI}}(z)$. Our results indicate a bright prospect for HI intensity mapping surveys in constraining IDE, whether on their own or further by synergies with other measurements.

preprint2022arXiv

Geometry Contrastive Learning on Heterogeneous Graphs

Self-supervised learning (especially contrastive learning) methods on heterogeneous graphs can effectively get rid of the dependence on supervisory data. Meanwhile, most existing representation learning methods embed the heterogeneous graphs into a single geometric space, either Euclidean or hyperbolic. This kind of single geometric view is usually not enough to observe the complete picture of heterogeneous graphs due to their rich semantics and complex structures. Under these observations, this paper proposes a novel self-supervised learning method, termed as Geometry Contrastive Learning (GCL), to better represent the heterogeneous graphs when supervisory data is unavailable. GCL views a heterogeneous graph from Euclidean and hyperbolic perspective simultaneously, aiming to make a strong merger of the ability of modeling rich semantics and complex structures, which is expected to bring in more benefits for downstream tasks. GCL maximizes the mutual information between two geometric views by contrasting representations at both local-local and local-global semantic levels. Extensive experiments on four benchmarks data sets show that the proposed approach outperforms the strong baselines, including both unsupervised methods and supervised methods, on three tasks, including node classification, node clustering and similarity search.

preprint2022arXiv

Hessian-Free Second-Order Adversarial Examples for Adversarial Learning

Recent studies show deep neural networks (DNNs) are extremely vulnerable to the elaborately designed adversarial examples. Adversarial learning with those adversarial examples has been proved as one of the most effective methods to defend against such an attack. At present, most existing adversarial examples generation methods are based on first-order gradients, which can hardly further improve models&#39; robustness, especially when facing second-order adversarial attacks. Compared with first-order gradients, second-order gradients provide a more accurate approximation of the loss landscape with respect to natural examples. Inspired by this, our work crafts second-order adversarial examples and uses them to train DNNs. Nevertheless, second-order optimization involves time-consuming calculation for Hessian-inverse. We propose an approximation method through transforming the problem into an optimization in the Krylov subspace, which remarkably reduce the computational complexity to speed up the training procedure. Extensive experiments conducted on the MINIST and CIFAR-10 datasets show that our adversarial learning with second-order adversarial examples outperforms other fisrt-order methods, which can improve the model robustness against a wide range of attacks.

preprint2022arXiv

Joint Distribution Alignment via Adversarial Learning for Domain Adaptive Object Detection

Unsupervised domain adaptive object detection aims to adapt a well-trained detector from its original source domain with rich labeled data to a new target domain with unlabeled data. Recently, mainstream approaches perform this task through adversarial learning, yet still suffer from two limitations. First, they mainly align marginal distribution by unsupervised cross-domain feature matching, and ignore each feature&#39;s categorical and positional information that can be exploited for conditional alignment; Second, they treat all classes as equally important for transferring cross-domain knowledge and ignore that different classes usually have different transferability. In this paper, we propose a joint adaptive detection framework (JADF) to address the above challenges. First, an end-to-end joint adversarial adaptation framework for object detection is proposed, which aligns both marginal and conditional distributions between domains without introducing any extra hyperparameter. Next, to consider the transferability of each object class, a metric for class-wise transferability assessment is proposed, which is incorporated into the JADF objective for domain adaptation. Further, an extended study from unsupervised domain adaptation (UDA) to unsupervised few-shot domain adaptation (UFDA) is conducted, where only a few unlabeled training images are available in unlabeled target domain. Extensive experiments validate that JADF is effective in both the UDA and UFDA settings, achieving significant performance gains over existing state-of-the-art cross-domain detection methods.

preprint2022arXiv

Joint Planning of Distributed Generations and Energy Storage in Active Distribution Networks: A Bi-Level Programming Approach

In order to improve the penetration of renewable energy resources for distribution networks, a joint planning model of distributed generations (DGs) and energy storage is proposed for an active distribution network by using a bi-level programming approach in this paper. In this model, the upper-level aims to seek the optimal location and capacity of DGs and energy storage, while the lower-level optimizes the operation of energy storage devices. To solve this model, an improved binary particle swarm optimization (IBPSO) algorithm based on chaos optimization is developed, and the optimal joint planning is achieved through alternating iterations between the two levels. The simulation results on the PG & E 69-bus distribution system demonstrate that the presented approach manages to reduce the planning deviation caused by the uncertainties of DG outputs and remarkably improve the voltage profile and operational economy of distribution systems.

preprint2022arXiv

Just Rank: Rethinking Evaluation with Word and Sentence Similarities

Word and sentence embeddings are useful feature representations in natural language processing. However, intrinsic evaluation for embeddings lags far behind, and there has been no significant update since the past decade. Word and sentence similarity tasks have become the de facto evaluation method. It leads models to overfit to such evaluations, negatively impacting embedding models&#39; development. This paper first points out the problems using semantic similarity as the gold standard for word and sentence embedding evaluations. Further, we propose a new intrinsic evaluation method called EvalRank, which shows a much stronger correlation with downstream tasks. Extensive experiments are conducted based on 60+ models and popular datasets to certify our judgments. Finally, the practical evaluation toolkit is released for future benchmarking purposes.

preprint2022arXiv

Long term analysis of splitting methods for charged-particle dynamics

In this paper, we rigorously analyze the energy, momentum and magnetic moment behaviours of two splitting methods for solving charged-particle dynamics. The near-conservations of these invariants are given for the system under constant magnetic field or quadratic electric potential. By the approach named as backward error analysis, we derive the modified equations and modified invariants of the splitting methods and based on which, the near-conservations over long times are proved. Some numerical experiments are presented to demonstrate these long time behaviours.

preprint2022arXiv

MISC: A MIxed Strategy-Aware Model Integrating COMET for Emotional Support Conversation

Applying existing methods to emotional support conversation -- which provides valuable assistance to people who are in need -- has two major limitations: (a) they generally employ a conversation-level emotion label, which is too coarse-grained to capture user&#39;s instant mental state; (b) most of them focus on expressing empathy in the response(s) rather than gradually reducing user&#39;s distress. To address the problems, we propose a novel model \textbf{MISC}, which firstly infers the user&#39;s fine-grained emotional status, and then responds skillfully using a mixture of strategy. Experimental results on the benchmark dataset demonstrate the effectiveness of our method and reveal the benefits of fine-grained emotion understanding as well as mixed-up strategy modeling. Our code and data could be found in \url{https://github.com/morecry/MISC}.

preprint2022arXiv

MOORe: Model-based Offline-to-Online Reinforcement Learning

With the success of offline reinforcement learning (RL), offline trained RL policies have the potential to be further improved when deployed online. A smooth transfer of the policy matters in safe real-world deployment. Besides, fast adaptation of the policy plays a vital role in practical online performance improvement. To tackle these challenges, we propose a simple yet efficient algorithm, Model-based Offline-to-Online Reinforcement learning (MOORe), which employs a prioritized sampling scheme that can dynamically adjust the offline and online data for smooth and efficient online adaptation of the policy. We provide a theoretical foundation for our algorithms design. Experiment results on the D4RL benchmark show that our algorithm smoothly transfers from offline to online stages while enabling sample-efficient online adaption, and also significantly outperforms existing methods.

preprint2022arXiv

Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters

The ever-growing model size and scale of compute have attracted increasing interests in training deep learning models over multiple nodes. However, when it comes to training on cloud clusters, especially across remote clusters, huge challenges are faced. In this work, we introduce a general framework, Nebula-I, for collaboratively training deep learning models over remote heterogeneous clusters, the connections between which are low-bandwidth wide area networks (WANs). We took natural language processing (NLP) as an example to show how Nebula-I works in different training phases that include: a) pre-training a multilingual language model using two remote clusters; and b) fine-tuning a machine translation model using knowledge distilled from pre-trained models, which run through the most popular paradigm of recent deep learning. To balance the accuracy and communication efficiency, in Nebula-I, parameter-efficient training strategies, hybrid parallel computing methods and adaptive communication acceleration techniques are jointly applied. Meanwhile, security strategies are employed to guarantee the safety, reliability and privacy in intra-cluster computation and inter-cluster communication. Nebula-I is implemented with the PaddlePaddle deep learning framework, which can support collaborative training over heterogeneous hardware, e.g. GPU and NPU. Experiments demonstrate that the proposed framework could substantially maximize the training efficiency while preserving satisfactory NLP performance. By using Nebula-I, users can run large-scale training tasks over cloud clusters with minimum developments, and the utility of existed large pre-trained models could be further promoted. We also introduced new state-of-the-art results on cross-lingual natural language inference tasks, which are generated based upon a novel learning framework and Nebula-I.

preprint2022arXiv

Nonlinear Hall effect induced by internal Coulomb interaction and phase relaxation process in a four-terminal system with time-reversal symmetry

We numerically investigate the second-order nonlinear Hall transport properties of a four-terminal system with time-reversal symmetry and broken inversion symmetry. Within the nonequilibrium Green&#39;s function formalism, the second-order nonlinear conductances are derived, where the internal Coulomb potential in response to external voltages is explicitly included to guarantee the gauge invariance. For the system with single mirror symmetry Mx, nonlinear Hall properties are only observable in the y direction and contributed solely from the second-order nonlinear effect. From the symmetry point of view, the observed nonlinear Hall transport phenomena have one-to-one correspondence with the Berry curvature dipole induced nonlinear Hall effect semiclassically obtained for the same Hamiltonian. In addition to the nonlinear Hall effect originated from symmetries of the system, it is found that the internal Coulomb potential has the same symmetry of the four-terminal system, which gives rise to an extra nonlinear Hall response. Moreover, the phase relaxation mechanism modeled by virtual probes leads to the dephasing-induced nonlinear Hall effect.

preprint2022arXiv

Quasinormal modes in two-photon autocorrelation and the geometric-optics approximation

In this work, we study the black hole light echoes in terms of the two-photon autocorrelation and explore their connection with the quasinormal modes. It is shown that the above time-domain phenomenon can be analyzed by utilizing the well-known frequency-domain relations between the quasinormal modes and characteristic parameters of null geodesics. We found that the time-domain correlator, obtained by the inverse Fourier transform, naturally acquires the echo feature, which can be attributed to a collective effect of the asymptotic poles through a weighted summation of the squared modulus of the relevant Green&#39;s functions. Specifically, the contour integral leads to a summation taking over both the overtone index and angular momentum. Moreover, the dominant contributions to the light echoes are from those in the eikonal limit, consistent with the existing findings using the geometric-optics arguments. For the Schwarzschild black holes, we demonstrate the results numerically by considering a transient spherical light source. Also, for the Kerr spacetimes, we point out a potential difference between the resulting light echoes using the geometric-optics approach and those obtained by the black hole perturbation theory. Possible astrophysical implications of the present study are addressed.

preprint2022arXiv

Rethinking Reinforcement Learning based Logic Synthesis

Recently, reinforcement learning has been used to address logic synthesis by formulating the operator sequence optimization problem as a Markov decision process. However, through extensive experiments, we find out that the learned policy makes decisions independent from the circuit features (i.e., states) and yields an operator sequence that is permutation invariant to some extent in terms of operators. Based on these findings, we develop a new RL-based method that can automatically recognize critical operators and generate common operator sequences generalizable to unseen circuits. Our algorithm is verified on both the EPFL benchmark, a private dataset and a circuit at industrial scale. Experimental results demonstrate that it achieves a good balance among delay, area and runtime, and is practical for industrial usage.

preprint2022arXiv

Risk Concentration and the Mean-Expected Shortfall Criterion

Expected Shortfall (ES, also known as CVaR) is the most important coherent risk measure in finance, insurance, risk management, and engineering. Recently, Wang and Zitikis (2021) put forward four economic axioms for portfolio risk assessment and provide the first economic axiomatic foundation for the family of ES. In particular, the axiom of no reward for concentration (NRC) is arguably quite strong, which imposes an additive form of the risk measure on portfolios with a certain dependence structure. We move away from the axiom of NRC by introducing the notion of concentration aversion, which does not impose any specific form of the risk measure. It turns out that risk measures with concentration aversion are functions of ES and the expectation. Together with the other three standard axioms of monotonicity, translation invariance and lower semicontinuity, concentration aversion uniquely characterizes the family of ES. In addition, we establish an axiomatic foundation for the problem of mean-ES portfolio selection and new explicit formulas for convex and consistent risk measures. Finally, we provide an economic justification for concentration aversion via a few axioms on the attitude of a regulator towards dependence structures.

preprint2022arXiv

Semi-discretization and full-discretization with optimal accuracy for charged-particle dynamics in a strong nonuniform magnetic field

The aim of this paper is to formulate and analyze numerical discretizations of charged-particle dynamics (CPD) in a strong nonuniform magnetic field. A strategy is firstly performed for the two dimensional CPD to construct the semi-discretization and full-discretization which have optimal accuracy. This accuracy is improved in the position and in the velocity when the strength of the magnetic field becomes stronger. This is a better feature than the usual so called &#34;uniformly accurate methods&#34;. To obtain this refined accuracy, some reformulations of the problem and two-scale exponential integrators are incorporated, and the optimal accuracy is derived from this new procedure. Then based on the strategy given for the two dimensional case, a new class of uniformly accurate methods with simple scheme is formulated for the three dimensional CPD in maximal ordering case. All the theoretical results of the accuracy are numerically illustrated by some numerical tests.

preprint2022arXiv

Single-frame characterization of ultrafast pulses with spatiotemporal orbital angular momentum

Light carrying spatiotemporal orbital angular momentum (ST-OAM) makes possible new types of optical vortices arising from transverse OAM. ST-OAM pulses exhibit novel properties during propagation, transmission, refraction, diffraction, and nonlinear conversion, attracting growing experimental and theoretical interest and studies. However, one major challenge is the lack of a simple and straightforward method for characterizing ultrafast ST-OAM pulses. Using spatially resolved spectral interferometry, we demonstrate a simple, stationary, single-frame method to quantitatively characterize ultrashort light pulses carrying ST-OAM. Using our method, the presence of an ST-OAM pulse, including its main characteristics such as topological charge numbers and OAM helicity, can be identified easily from the unique and unambiguous features directly seen on the raw data--without any need for a full analysis of the data. After processing and reconstructions, other exquisite features, including pulse dispersion and beam divergence, can also be fully characterized. Our fast characterization method allows high-throughput and quick feedback during the generation and optical alignment processes of ST-OAM pulses. It is straightforward to extend our method to single-shot measurement by using a high-speed camera that matches the pulse repetition rate. This new method can help advance the field of spatially and temporally structured light and its applications in advanced metrologies.

preprint2022arXiv

Strong Effects of Interlayer Interaction on Valence-Band Splitting in Transition Metal Dichalcogenides

Understanding the origin of valence band maxima (VBM) splitting in transition metal dichalcogenides (TMDs) is important because it governs the unique spin and valley physics in monolayer and multilayer TMDs. In this work, we present our systematic study of VBM splitting ($Δ$) in atomically thin MoS$_2$ and WS$_2$ by employing photocurrent spectroscopy as we change the temperature and the layer numbers. We found that VBM splitting in monolayer MoS$_2$ and WS$_2$ depends strongly on temperature, which contradicts the theory that spin-orbit coupling solely determines the VBM splitting in monolayer TMDs. We also found that the rate of change of VBM splitting with respect to temperature ($m=\frac{\partialΔ}{\partial T}$) is the highest for monolayer (-0.14 meV/K for MoS$_2$) and the rate decreases as the layer number increases ($m ~ 0$ meV/K for 5 layers MoS$_2$). We performed density functional theory (DFT) and the GW with Bethe-Salpeter Equation (GW-BSE) calculations to determine the electronic band structure and optical absorption for a bilayer MoS$_2$ with different interlayer separations. Our simulations agree with the experimental observations and demonstrate that the temperature dependence of VBM splitting in atomically thin monolayer and multilayer TMDs originates from the changes in the interlayer coupling strength between the neighboring layers. By studying two different types of TMDs and many different layer thicknesses, we also demonstrate that VBM splitting also depends on the layer numbers and type of transition metals. Our study will help understand the role spin-orbit coupling and interlayer interaction play in determining the VBM splitting in quantum materials and develop next-generation devices based on spin-orbit interactions.

preprint2022arXiv

SynWMD: Syntax-aware Word Mover&#39;s Distance for Sentence Similarity Evaluation

Word Mover&#39;s Distance (WMD) computes the distance between words and models text similarity with the moving cost between words in two text sequences. Yet, it does not offer good performance in sentence similarity evaluation since it does not incorporate word importance and fails to take inherent contextual and structural information in a sentence into account. An improved WMD method using the syntactic parse tree, called Syntax-aware Word Mover&#39;s Distance (SynWMD), is proposed to address these two shortcomings in this work. First, a weighted graph is built upon the word co-occurrence statistics extracted from the syntactic parse trees of sentences. The importance of each word is inferred from graph connectivities. Second, the local syntactic parsing structure of words is considered in computing the distance between words. To demonstrate the effectiveness of the proposed SynWMD, we conduct experiments on 6 textual semantic similarity (STS) datasets and 4 sentence classification datasets. Experimental results show that SynWMD achieves state-of-the-art performance on STS tasks. It also outperforms other WMD-based methods on sentence classification tasks.

preprint2022arXiv

The BINGO Project III: Optical design and optimisation of the focal plane

The BINGO telescope was designed to measure the fluctuations of the 21-cm radiation arising from the hyperfine transition of neutral hydrogen and aims to measure the Baryon Acoustic Oscillations (BAO) from such fluctuations, therefore serving as a pathfinder to future deeper intensity mapping surveys. The requirements for the Phase 1 of the projects consider a large reflector system (two 40 m-class dishes in a crossed-Dragone configuration), illuminating a focal plane with 28 horns to measure the sky with two circular polarisations in a drift scan mode to produce measurements of the radiation in intensity as well as the circular polarisation. In this paper we present the optical design for the instrument. We describe the intensity and polarisation properties of the beams and the optical arrangement of the horns in the focal plane to produce a homogeneous and well-sampled map after the end of Phase 1. Our analysis provides an optimal model for the location of the horns in the focal plane, producing a homogeneous and Nyquist sampled map after the nominal survey time. We arrive at an optimal configuration for the optical system, including the focal plane positioning and the beam behavior of the instrument. We present an estimate of the expected side lobes both for intensity and polarisation, as well as the effect of band averaging on the final side lobes. The cross polarisation leakage values for the final configuration allow us to conclude that the optical arrangement meets the requirements of the project. We conclude that the chosen optical design meets the requirements for the project in terms of polarisation purity, area coverage as well as homogeneity of coverage so that BINGO can perform a successful BAO experiment. We further conclude that the requirements on the placement and r.m.s. error on the mirrors are also achievable so that a successful experiment can be conducted.(Abridged)

preprint2022arXiv

The BINGO Project V: Further steps in Component Separation and Bispectrum Analysis

Observing the neutral hydrogen distribution across the Universe via redshifted 21cm line intensity mapping constitutes a powerful probe for cosmology. However, the redshifted 21cm signal is obscured by the foreground emission from our Galaxy and other extragalactic foregrounds. This paper addresses the capabilities of the BINGO survey to separate such signals. Specifically, this paper looks in detail at the different residuals left over by foreground components, shows that a noise-corrected spectrum is unbiased, and shows that we understand the remaining systematic residuals by analyzing nonzero contributions to the three-point function. We use the generalized needlet internal linear combination, which we apply to sky simulations of the BINGO experiment for each redshift bin of the survey. We present our recovery of the redshifted 21cm signal from sky simulations of the BINGO experiment, including foreground components. We test the recovery of the 21cm signal through the angular power spectrum at different redshifts, as well as the recovery of its non-Gaussian distribution through a bispectrum analysis. We find that non-Gaussianities from the original foreground maps can be removed down to, at least, the noise limit of the BINGO survey with such techniques. Our component separation methodology allows us to subtract the foreground contamination in the BINGO channels down to levels below the cosmological signal and the noise, and to reconstruct the 21cm power spectrum for different redshift bins without significant loss at multipoles $20 \lesssim \ell \lesssim 500$. Our bispectrum analysis yields strong tests of the level of the residual foreground contamination in the recovered 21cm signal, thereby allowing us to both optimize and validate our component separation analysis. (Abridged)

preprint2022arXiv

The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge

This paper describes our submission to ICASSP 2022 Multi-channel Multi-party Meeting Transcription (M2MeT) Challenge. For Track 1, we propose several approaches to empower the clustering-based speaker diarization system to handle overlapped speech. Front-end dereverberation and the direction-of-arrival (DOA) estimation are used to improve the accuracy of speaker diarization. Multi-channel combination and overlap detection are applied to reduce the missed speaker error. A modified DOVER-Lap is also proposed to fuse the results of different systems. We achieve the final DER of 5.79% on the Eval set and 7.23% on the Test set. For Track 2, we develop our system using the Conformer model in a joint CTC-attention architecture. Serialized output training is adopted to multi-speaker overlapped speech recognition. We propose a neural front-end module to model multi-channel audio and train the model end-to-end. Various data augmentation methods are utilized to mitigate over-fitting in the multi-channel multi-speaker E2E system. Transformer language model fusion is developed to achieve better performance. The final CER is 19.2% on the Eval set and 20.8% on the Test set.

preprint2022arXiv

Topological Mirror Symmetry of Parabolic Hitchin Systems

In this paper, we first prove the parabolic Beauvile-Narasimhan-Ramanan correspondence over an arbitrary field which generalizes the corresponding results over algebraically closed fields in [SWW22]. We use the correspondence and the p-adic integration methods developed by Groechenig- Wyss-Ziegler [GWZ20b] to prove the topological mirror symmetry for parabolic Hitchin systems on curves with arbitrary parabolic structures.

preprint2022arXiv

Towards the Desirable Decision Boundary by Moderate-Margin Adversarial Training

Adversarial training, as one of the most effective defense methods against adversarial attacks, tends to learn an inclusive decision boundary to increase the robustness of deep learning models. However, due to the large and unnecessary increase in the margin along adversarial directions, adversarial training causes heavy cross-over between natural examples and adversarial examples, which is not conducive to balancing the trade-off between robustness and natural accuracy. In this paper, we propose a novel adversarial training scheme to achieve a better trade-off between robustness and natural accuracy. It aims to learn a moderate-inclusive decision boundary, which means that the margins of natural examples under the decision boundary are moderate. We call this scheme Moderate-Margin Adversarial Training (MMAT), which generates finer-grained adversarial examples to mitigate the cross-over problem. We also take advantage of logits from a teacher model that has been well-trained to guide the learning of our model. Finally, MMAT achieves high natural accuracy and robustness under both black-box and white-box attacks. On SVHN, for example, state-of-the-art robustness and natural accuracy are achieved.

preprint2022arXiv

Transformer based Generative Adversarial Network for Liver Segmentation

Automated liver segmentation from radiology scans (CT, MRI) can improve surgery and therapy planning and follow-up assessment in addition to conventional use for diagnosis and prognosis. Although convolutional neural networks (CNNs) have become the standard image segmentation tasks, more recently this has started to change towards Transformers based architectures because Transformers are taking advantage of capturing long range dependence modeling capability in signals, so called attention mechanism. In this study, we propose a new segmentation approach using a hybrid approach combining the Transformer(s) with the Generative Adversarial Network (GAN) approach. The premise behind this choice is that the self-attention mechanism of the Transformers allows the network to aggregate the high dimensional feature and provide global information modeling. This mechanism provides better segmentation performance compared with traditional methods. Furthermore, we encode this generator into the GAN based architecture so that the discriminator network in the GAN can classify the credibility of the generated segmentation masks compared with the real masks coming from human (expert) annotations. This allows us to extract the high dimensional topology information in the mask for biomedical image segmentation and provide more reliable segmentation results. Our model achieved a high dice coefficient of 0.9433, recall of 0.9515, and precision of 0.9376 and outperformed other Transformer based approaches.

preprint2022arXiv

Yes, DLGM! A novel hierarchical model for hazard classification

Hazards can be exposed by HAZOP as text information, and studying their classification is of great significance to the development of industrial informatics, which is conducive to safety early warning, decision support, policy evaluation, etc. However, there is no research on this important field at present. In this paper, we propose a novel model termed DLGM via deep learning for hazard classification. Specifically, first, we leverage BERT to vectorize the hazard and treat it as a type of time series (HTS). Secondly, we build a grey model FSGM(1, 1) to model it, and get the grey guidance in the sense of the structural parameters. Finally, we design a hierarchical-feature fusion neural network (HFFNN) to investigate the HTS with grey guidance (HTSGG) from three themes, where, HFFNN is a hierarchical structure with four types of modules: two feature encoders, a gating mechanism, and a deepening mechanism. We take 18 industrial processes as application cases and launch a series of experiments. The experimental results prove that DLGM has promising aptitudes for hazard classification and that FSGM(1, 1) and HFFNN are effective. We hope our research can contribute added value and support to the daily practice in industrial safety.

preprint2021arXiv

BLOCKEYE: Hunting For DeFi Attacks on Blockchain

Decentralized finance, i.e., DeFi, has become the most popular type of application on many public blockchains (e.g., Ethereum) in recent years. Compared to the traditional finance, DeFi allows customers to flexibly participate in diverse blockchain financial services (e.g., lending, borrowing, collateralizing, exchanging etc.) via smart contracts at a relatively low cost of trust. However, the open nature of DeFi inevitably introduces a large attack surface, which is a severe threat to the security of participants funds. In this paper, we proposed BLOCKEYE, a real-time attack detection system for DeFi projects on the Ethereum blockchain. Key capabilities provided by BLOCKEYE are twofold: (1) Potentially vulnerable DeFi projects are identified based on an automatic security analysis process, which performs symbolic reasoning on the data flow of important service states, e.g., asset price, and checks whether they can be externally manipulated. (2) Then, a transaction monitor is installed offchain for a vulnerable DeFi project. Transactions sent not only to that project but other associated projects as well are collected for further security analysis. A potential attack is flagged if a violation is detected on a critical invariant configured in BLOCKEYE, e.g., Benefit is achieved within a very short time and way much bigger than the cost. We applied BLOCKEYE in several popular DeFi projects and managed to discover potential security attacks that are unreported before. A video of BLOCKEYE is available at https://youtu.be/7DjsWBLdlQU.

preprint2021arXiv

Boosting performance for software defined networks from traffic engineering perspective

Paths selection algorithms and rate adaptation objective functions are usually studied separately. In contrast, this paper evaluates some traffic engineering (TE) systems for software defined networking obtained by combining path selection techniques with average delay and load balancing, the two most popular TE objective functions. Based on TE simulation results, the best TE system suitable for software defined networks is a system where the paths are calculated using an oblivious routing model and its adaptation rate calculated using an average delay objective function. Thus, we propose the RACKE+AD system combining path sets computed using Racke&#39;s oblivious routing and traffic splitting objective function using average delay. This model outperforms current state-of-the-art models, maximizes throughput, achieves better network resource utilization, and minimizes delay. The proposed system outperformed SMORE and SWAN by 4.2% and 9.6% respectively, achieving 27% better utilization and delivering 34% more traffic with 50% less latency compared with both systems on a GEANT network.

preprint2021arXiv

CORE: A Knowledge Graph Entity Type Prediction Method via Complex Space Regression and Embedding

Entity type prediction is an important problem in knowledge graph (KG) research. A new KG entity type prediction method, named CORE (COmplex space Regression and Embedding), is proposed in this work. The proposed CORE method leverages the expressive power of two complex space embedding models; namely, RotatE and ComplEx models. It embeds entities and types in two different complex spaces using either RotatE or ComplEx. Then, we derive a complex regression model to link these two spaces. Finally, a mechanism to optimize embedding and regression parameters jointly is introduced. Experiments show that CORE outperforms benchmarking methods on representative KG entity type inference datasets. Strengths and weaknesses of various entity type prediction methods are analyzed.

preprint2021arXiv

Dynamical scalarization in Einstein-Maxwell-dilaton theory

We study the process of fully nonlinear dynamical scalarization starting from a charged black hole or a naked singularity in asymptotically flat spacetime in the Einstein-Maxwell-dilaton theory. Initially the dilaton field is negligible compared to the gravitational and the Maxwell field. Then the dilaton field experiences an immediate growth, later it oscillates with damping amplitude and finally settles down to a finite value. For a hairy black hole develops from an original Reissner-Nordström black hole, since the dilaton oscillation and decay are almost independent of the coupling parameter, unlike the Anti-de Sitter spacetime it is not easy to distinguish the resulting hairy black hole from the original asymptotically flat charged hole. For a hairy black hole evolves from an original naked singularity, the resulting hairy black hole has rich structures. In the scalarization process, the naked singularity is soon enveloped by one outer horizon, then another horizon is developed and in the end a stable hairy black hole forms and two horizons degenerate into one to protect the singularity. The hairy black hole mass saturates exponentially in the scalarization.

preprint2021arXiv

Evolution of Anti-de Sitter black holes in Einstein-Maxwell-dilaton theory

We study the nonlinear evolution of the spherical symmetric black holes under a small neutral scalar field perturbation in Einstein-Maxwell-dilaton theory with coupling function $f(ϕ)=e^{-bϕ}$ in asymptotic anti-de Sitter spacetime. The non-minimal coupling between scalar and Maxwell fields allows the transmission of the energy from the Maxwell field to the scalar field, but also behaves as a repulsive force for the scalar. The scalar field oscillates with damping amplitude and converges to a final value by a power law. The irreducible mass of the black hole increases abruptly at initial times and then saturates to the final value exponentially. The saturating rate is twice the decaying rate of the dominant mode of the scalar. The effects of the black hole charge, the cosmological constant and the coupling parameter on the evolution are studied in detail. When the initial configuration is a naked singularity spacetime with a large charge to mass ratio, a horizon will form soon and hide the singularity.

preprint2021arXiv

GraphHop: An Enhanced Label Propagation Method for Node Classification

A scalable semi-supervised node classification method on graph-structured data, called GraphHop, is proposed in this work. The graph contains attributes of all nodes but labels of a few nodes. The classical label propagation (LP) method and the emerging graph convolutional network (GCN) are two popular semi-supervised solutions to this problem. The LP method is not effective in modeling node attributes and labels jointly or facing a slow convergence rate on large-scale graphs. GraphHop is proposed to its shortcoming. With proper initial label vector embeddings, each iteration of GraphHop contains two steps: 1) label aggregation and 2) label update. In Step 1, each node aggregates its neighbors&#39; label vectors obtained in the previous iteration. In Step 2, a new label vector is predicted for each node based on the label of the node itself and the aggregated label information obtained in Step 1. This iterative procedure exploits the neighborhood information and enables GraphHop to perform well in an extremely small label rate setting and scale well for very large graphs. Experimental results show that GraphHop outperforms state-of-the-art graph learning methods on a wide range of tasks (e.g., multi-label and multi-class classification on citation networks, social graphs, and commodity consumption graphs) in graphs of various sizes. Our codes are publicly available on GitHub (https://github.com/TianXieUSC/GraphHop).

preprint2021arXiv

Inductive Learning on Commonsense Knowledge Graph Completion

Commonsense knowledge graph (CKG) is a special type of knowledge graph (KG), where entities are composed of free-form text. However, most existing CKG completion methods focus on the setting where all the entities are presented at training time. Although this setting is standard for conventional KG completion, it has limitations for CKG completion. At test time, entities in CKGs can be unseen because they may have unseen text/names and entities may be disconnected from the training graph, since CKGs are generally very sparse. Here, we propose to study the inductive learning setting for CKG completion where unseen entities may present at test time. We develop a novel learning framework named InductivE. Different from previous approaches, InductiveE ensures the inductive learning capability by directly computing entity embeddings from raw entity attributes/text. InductiveE consists of a free-text encoder, a graph encoder, and a KG completion decoder. Specifically, the free-text encoder first extracts the textual representation of each entity based on the pre-trained language model and word embedding. The graph encoder is a gated relational graph convolutional neural network that learns from a densified graph for more informative entity representation learning. We develop a method that densifies CKGs by adding edges among semantic-related entities and provide more supportive information for unseen entities, leading to better generalization ability of entity embedding for unseen entities. Finally, inductiveE employs Conv-TransE as the CKG completion decoder. Experimental results show that InductiveE significantly outperforms state-of-the-art baselines in both standard and inductive settings on ATOMIC and ConceptNet benchmarks. InductivE performs especially well on inductive scenarios where it achieves above 48% improvement over present methods.

preprint2021arXiv

KGBoost: A Classification-based Knowledge Base Completion Method with Negative Sampling

Knowledge base completion is formulated as a binary classification problem in this work, where an XGBoost binary classifier is trained for each relation using relevant links in knowledge graphs (KGs). The new method, named KGBoost, adopts a modularized design and attempts to find hard negative samples so as to train a powerful classifier for missing link prediction. We conduct experiments on multiple benchmark datasets, and demonstrate that KGBoost outperforms state-of-the-art methods across most datasets. Furthermore, as compared with models trained by end-to-end optimization, KGBoost works well under the low-dimensional setting so as to allow a smaller model size.

preprint2021arXiv

Optimal convergence and long-time conservation of exponential integration for Schrödinger equations in a normal or highly oscillatory regime

In this paper, we formulate and analyse exponential integrations when applied to nonlinear Schrödinger equations in a normal or highly oscillatory regime. A kind of exponential integrators with energy preservation, optimal convergence and long time near conservations of actions, momentum and density will be formulated and analysed. To this end, we derive continuous-stage exponential integrators and show that the integrators can exactly preserve the energy of Hamiltonian systems. Three practical energy-preserving integrators are presented. It is shown that these integrators exhibit optimal convergence and have near conservations of actions, momentum and density over long times. A numerical experiment is carried out to support all the theoretical results presented in this paper. Some applications of the integrators to other kinds of ordinary/partial differential equations are also presented.

preprint2021arXiv

Optimal Scheduling of Integrated Demand Response-Enabled Community Integrated Energy Systems in Uncertain Environments

The community integrated energy system (CIES) is an essential energy internet carrier that has recently been the focus of much attention. A scheduling model based on chance-constrained programming is proposed for integrated demand response (IDR)-enabled CIES in uncertain environments to minimize the system operating costs, where an IDR program is used to explore the potential interaction ability of electricity-gas-heat flexible loads and electric vehicles. Moreover, power to gas (P2G) and micro-gas turbine (MT), as links of multi-energy carriers, are adopted to strengthen the coupling of different energy subsystems. Sequence operation theory (SOT) and linearization methods are employed to transform the original model into a solvable mixed-integer linear programming model. Simulation results on a practical CIES in North China demonstrate an improvement in the CIES operational economy via the coordination of IDR and renewable uncertainties, with P2G and MT enhancing the system operational flexibility and user comprehensive satisfaction. The CIES operation is able to achieve a trade-off between economy and system reliability by setting a suitable confidence level for the spinning reserve constraints. Besides, the proposed solution method outperforms the Hybrid Intelligent Algorithm in terms of both optimization results and calculation efficiency.

preprint2021arXiv

Passive underwater acoustic barcodes using Rayleigh wave resonance

A passive underwater acoustic marker is presented and its feasibility for underwater recognition, positioning, and navigation is proved by a numerical method and experimental results. These markers are composed of acrylic elastic objects designed by backscattering strong peaks associated with the subsonic Rayleigh wave resonance of a polymer target excited by a broadband pulse, and having a unique acoustic signature for a selected frequency band, akin to acoustic barcodes. Therefore, the backscattering response of markers can be regulated by changing the geometry of elastic objects. These acoustic barcodes naturally operate in a wider frequency band, and have a longer lifetime and lower cost, compared with active acoustic markers.

preprint2021arXiv

Progressive Depth Learning for Single Image Dehazing

The formulation of the hazy image is mainly dominated by the reflected lights and ambient airlight. Existing dehazing methods often ignore the depth cues and fail in distant areas where heavier haze disturbs the visibility. However, we note that the guidance of the depth information for transmission estimation could remedy the decreased visibility as distances increase. In turn, the good transmission estimation could facilitate the depth estimation for hazy images. In this paper, a deep end-to-end model that iteratively estimates image depths and transmission maps is proposed to perform an effective depth prediction for hazy images and improve the dehazing performance with the guidance of depth information. The image depth and transmission map are progressively refined to better restore the dehazed image. Our approach benefits from explicitly modeling the inner relationship of image depth and transmission map, which is especially effective for distant hazy areas. Extensive results on the benchmarks demonstrate that our proposed network performs favorably against the state-of-the-art dehazing methods in terms of depth estimation and haze removal.

preprint2021arXiv

Structure-preserving algorithms with uniform error bound and long-time energy conservation for highly oscillatory Hamiltonian systems

Structure-preserving algorithms and algorithms with uniform error bound have constituted two interesting classes of numerical methods. In this paper, we blend these two kinds of methods for solving nonlinear Hamiltonian systems with highly oscillatory solution, and the blended algorithms inherit and respect the advantage of each method. Two kinds of algorithms are presented to preserve the symplecticity and energy of the Hamiltonian systems, respectively. Moreover, the proposed algorithms are shown to have uniform error bound for the highly oscillatory structure. A numerical experiment is carried out to support the theoretical results established in this paper by showing the performance of the blended algorithms.

preprint2021arXiv

Study of form factors and branching ratios for $D\rightarrow S,Al\bar{ν_{l}}$ with light-cone sum rules

We systematically study the semileptonic decay process of $ D\rightarrow S,A l\bar{ν_{l}}(l=e,μ)$ by light-cone sum rules (LCSR) with chiral currents, calculate the form factors containing only the contribution of the leading twist light-cone distribution amplitudes (LCDAs). For scalar mesons $a_{0}(980)$ and $a_{0}(1450)$, we take them as $q\bar{q}$ states. For axial-vector meson, we study $a_{1}(1260)( 1^{3}p^{1})$ and $b_{1}(1235)( 1^{1}p^{1})$. Based on the results of these form factors, we further present the branching ratios of these semileptonic decay processes. The numerical results for $ D\rightarrow a_{0}(980), b_{1}(1235)l\bar{ν_{l}} $ are in good agreement with experiments and that for $ D\rightarrow a_{0}(1450)l\bar{ν_{l}}$ process are expected to be tested experimentally in the future.

preprint2021arXiv

The BINGO Project I: Baryon Acoustic Oscillations from Integrated Neutral Gas Observations

Observations of the redshifted 21-cm line of neutral hydrogen (HI) are a new and powerful window of observation that offers us the possibility to map the spatial distribution of cosmic HI and learn about cosmology. BINGO (Baryon Acoustic Oscillations [BAO] from Integrated Neutral Gas Observations) is a new unique radio telescope designed to be one of the first to probe BAO at radio frequencies. BINGO has two science goals: cosmology and astrophysics. Cosmology is the main science goal and the driver for BINGO&#39;s design and strategy. The key of BINGO is to detect the low redshift BAO to put strong constraints in the dark sector models. Given the versatility of the BINGO telescope, a secondary goal is astrophysics, where BINGO can help discover and study Fast Radio Bursts (FRB) and other transients, Galactic and extragalactic science. In this paper, we introduce the latest progress of the BINGO project, its science goals, describing the scientific potential of the project in each science and the new developments obtained by the collaboration. We introduce the BINGO project and its science goals and give a general summary of recent developments in construction, science potential and pipeline development obtained by the BINGO collaboration in the past few years. We show that BINGO will be able to obtain competitive constraints for the dark sector, and also that will allow for the discovery of several FRBs in the southern hemisphere. The capacity of BINGO in obtaining information from 21-cm is also tested in the pipeline introduced here. There is still no measurement of the BAO in radio, and studying cosmology in this new window of observations is one of the most promising advances in the field. The BINGO project is a radio telescope that has the goal to be one of the first to perform this measurement and it is currently being built in the northeast of Brazil. (Abridged)

preprint2021arXiv

The BINGO Project II: Instrument Description

The measurement of diffuse 21-cm radiation from the hyperfine transition of neutral hydrogen (HI signal) in different redshifts is an important tool for modern cosmology. However, detecting this faint signal with non-cryogenic receivers in single-dish telescopes is a challenging task. The BINGO (Baryon Acoustic Oscillations from Integrated Neutral Gas Observations) radio telescope is an instrument designed to detect baryonic acoustic oscillations (BAOs) in the cosmological HI signal, in the redshift interval $0.127 \le z \le 0.449$. This paper describes the BINGO radio telescope, including the current status of the optics, receiver, observational strategy, calibration, and the site. BINGO has been carefully designed to minimize systematics, being a transit instrument with no moving dishes and 28 horns operating in the frequency range $980 \le ν\le 1260$ MHz. Comprehensive laboratory tests were conducted for many of the BINGO subsystems and the prototypes of the receiver chain, horn, polarizer, magic tees, and transitions have been successfully tested between 2018 - 2020. The survey was designed to cover $\sim 13\%$ of the sky, with the primary mirror pointing at declination $δ=-15^{\circ}$. The telescope will see an instantaneous declination strip of $14.75^{\circ}$. The results of the prototype tests closely meet those obtained during the modeling process, suggesting BINGO will perform according to our expectations. After one year of observations with a $60\%$ duty cycle and 28 horns, BINGO should achieve an expected sensitivity of 102 $μK$ per 9.33 MHz frequency channel, one polarization, and be able to measure the HI power spectrum in a competitive time frame.

preprint2021arXiv

The BINGO Project IV: Simulations for mission performance assessment and preliminary component separation steps

The large-scale distribution of neutral hydrogen (HI) in the Universe is luminous through its 21 cm emission. The goal of the Baryon Acoustic Oscillations from Integrated Neutral Gas Observations -- BINGO -- radio telescope is to detect baryon acoustic oscillations (BAOs) at radio frequencies through 21 cm intensity mapping (IM). The telescope will span the redshift range 0.127 $< z <$ 0.449 with an instantaneous field-of-view of $14.75^{\circ} \times 6.0^{\circ}$. In this work we investigate different constructive and operational scenarios of the instrument by generating sky maps as they would be produced by the instrument. In doing this we use a set of end-to-end IM mission simulations. The maps will additionally be used to evaluate the efficiency of a component separation method (GNILC). We have simulated the kind of data that would be produced in a single-dish IM experiment such as BINGO. According to the results obtained, we have optimized the focal plane design of the telescope. In addition, the application of the GNILC method on simulated data shows that it is feasible to extract the cosmological signal across a wide range of multipoles and redshifts. The results are comparable with the standard principal component analysis method.

preprint2021arXiv

The BINGO Project VI: HI Halo Occupation Distribution and Mock Building

BINGO (Baryon Acoustic Oscillations from Integrated Neutral Gas Observations.) is a radio telescope designed to survey from 980 MHz to 1260 MHz, observe the neutral Hydrogen (HI) 21-cm line and detect BAO (Baryon Acoustic Oscillation) signal with Intensity Mapping technique. Here we present our method to generate mock maps of the 21-cm Intensity Mapping signal covering the BINGO frequency range and related test results. (Abridged)

preprint2021arXiv

The BINGO Project VII: Cosmological Forecasts from 21cm Intensity Mapping

The 21cm line of neutral hydrogen (HI) opens a new avenue in our exploration of the structure and evolution of the Universe. It provides complementary data to the current large-scale structure observations with different systematics, and thus it will be used to improve our understanding of the $Λ$CDM model. Among several radio cosmological surveys designed to measure this line, BINGO is a single-dish telescope mainly designed to detect baryon acoustic oscillations (BAOs) at low redshifts ($0.127< z<0.449$). Our goal is to assess the fiducial BINGO setup and its capabilities of constraining the cosmological parameters, and to analyze the effect of different instrument configurations. We used the Phase 1 fiducial configuration of the BINGO telescope to perform our cosmological forecasts. In addition, we investigated the impact of several instrumental setups, taking into account some instrumental systematics, and different cosmological models. Combining BINGO with Planck temperature and polarization data, the projected constraint improves from a $13\%$ and $25\%$ precision measurement at the $68\%$ confidence level with Planck only to $1\%$ and $3\%$ for the Hubble constant and the dark energy equation of state (EoS), respectively, within the wCDM model. Assuming a Chevallier-Polarski-Linder parameterization, the EoS parameters have standard deviations given by $σ_{w_0} = 0.30$ and $σ_{w_a} = 1.2$, which are improvements on the order of $30\%$ with respect to Planck alone. Also, we can access information about the HI density and bias, obtaining $\sim 8.5\%$ and $\sim 6\%$ precision, respectively, assuming they vary with redshift at three independent bins. The fiducial BINGO configuration will be able to extract significant cosmological information from the HI distribution and provide constraints competitive with current and future cosmological surveys. (Abridged)

preprint2021arXiv

The collider tests of a leptophilic scalar for the anomalous magnetic moments

We study the anomalous muon and electron magnetic moments by introducing a scalar with CP-violating Yukawa couplings to the lepton sector. By fitting these two magnetic moments with the recent experimental measurements, we find that such a leptophilic scalar in the mass range of $\mathcal{O}(10)- \mathcal{O}(1000 )\,\rm GeV$ can be a possible source for the current experimental deviations from the Standard Model (SM) predictions, with $\mathcal{O}(0.1) - \mathcal{O}(1)$ Yukawa couplings. The current electron and muon EDM constraints to the general CP-violating Yukawa couplings are discussed. We propose to search such a leptophilic scalar mediated at the future high-luminosity LHC (HL-LHC) runs, as well as the high-energy lepton colliders, including the CEPC and the muon collider. Our results show that the leptophilic scalar in the mass range of $\mathcal{O}(10)- \mathcal{O}(1000 )\,\rm GeV$ can be fully probed by the future experimental searches at the HL-LHC and the lepton colliders at their early stages.

preprint2021arXiv

Thermodynamics of 2+1 dimensional Coulomb-Like Black Holes from Non Linear Electrodynamics with a traceless energy momentum tensor

In this work we study thermodynamics of 2+1-dimensional static black holes with a nonlinear electric field. Besides employing the standard thermodynamic approach, we investigate the black hole thermodynamics by studying its thermodynamic geometry. We compute the Weinhold and Ruppeiner metrics and compare the thermodynamic geometry with the standard description on the black hole thermodynamics. We further consider the cosmological constant as an additional extensive thermodynamic variable. In the thermodynamic equilibrium three dimensional space, we compute the efficiency of the heat engine and show that it is possible to be built with this black hole.

preprint2021arXiv

Towards Speeding up Adversarial Training in Latent Spaces

Adversarial training is wildly considered as one of the most effective way to defend against adversarial examples. However, existing adversarial training methods consume unbearable time, due to the fact that they need to generate adversarial examples in the large input space. To speed up adversarial training, we propose a novel adversarial training method that does not need to generate real adversarial examples. By adding perturbations to logits to generate Endogenous Adversarial Examples (EAEs) -- the adversarial examples in the latent space, the time consuming gradient calculation can be avoided. Extensive experiments are conducted on CIFAR-10 and ImageNet, and the results show that comparing to state-of-the-art methods, our EAE adversarial training not only shortens the training time, but also enhances the robustness of the model and has less impact on the accuracy of clean examples than the existing methods.

preprint2021arXiv

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning

This paper presents a simple unsupervised visual representation learning method with a pretext task of discriminating all images in a dataset using a parametric, instance-level classifier. The overall framework is a replica of a supervised classification model, where semantic classes (e.g., dog, bird, and ship) are replaced by instance IDs. However, scaling up the classification task from thousands of semantic labels to millions of instance labels brings specific challenges including 1) the large-scale softmax computation; 2) the slow convergence due to the infrequent visiting of instance samples; and 3) the massive number of negative classes that can be noisy. This work presents several novel techniques to handle these difficulties. First, we introduce a hybrid parallel training framework to make large-scale training feasible. Second, we present a raw-feature initialization mechanism for classification weights, which we assume offers a contrastive prior for instance discrimination and can clearly speed up converge in our experiments. Finally, we propose to smooth the labels of a few hardest classes to avoid optimizing over very similar negative pairs. While being conceptually simple, our framework achieves competitive or superior performance compared to state-of-the-art unsupervised approaches, i.e., SimCLR, MoCoV2, and PIC under ImageNet linear evaluation protocol and on several downstream visual tasks, verifying that full instance classification is a strong pretraining technique for many semantic visual tasks.

preprint2020arXiv

A Modal-Space Method for Online Power System Steady-State Stability Monitoring

This paper proposes a novel approach to estimate the steady-state angle stability limit (SSASL) by using the nonlinear power system dynamic model in the modal space. Through two linear changes of coordinates and a simplification introduced by the steady-state condition, the nonlinear power system dynamic model is transformed into a number of single-machine-like power systems whose power-angle curves can be derived and used for estimating the SSASL. The proposed approach estimates the SSASL of angles at all machines and all buses without the need for manually specifying the scenario, i.e. setting sink and source areas, and also without the need for solving multiple nonlinear power flows. Case studies on 9-bus and 39-bus power systems demonstrate that the proposed approach is always able to capture the aperiodic instability in an online environment, showing promising performance in the online monitoring of the steady-state angle stability over the traditional power flow-based analysis.

preprint2020arXiv

A New Software Framework for Traffic Engineering: Path Cardinality and the Effect of Multipath on Residual Capacity

In this paper, we present a new traffic engineering (TE) software framework to analyze, configure, and optimize (with the aid of a linear programming solver) a network for service provisioning. The developed software tool is based on our new data-driven traffic engineering approach that analyzes a large volume of network configuration data generated given the user input. By analyzing the data, one can then make efficient decisions later when designing a traffic engineering solution. We focus on three well-known traffic engineering objective functions: minimum cost routing (MCR), load balancing (LB), and average delay (AD). With this new tool, one can answer numerous traffic engineering questions. For example, what are the differences among the three objective functions? What is the impact of an objective function on link utilization? How many candidate paths are enough to achieve optimality or near-optimality with respect to a specific objective. This new software tool allows us to conveniently perform various experiments and visualize the results for performance analysis. As case studies, this paper presents examples that answer the questions for two traffic engineering problems: (1) how many paths are required to obtain a solution that is within a few percent from the optimal solution and whether that number is fixed for any network size? (2) how the choice of single-path/multi-path routing affects the load in the network? For the first problem, it turns out that the number of paths needed to achieve optimality increases as the number of links in the network increases.

preprint2020arXiv

A novel combination of theoretical analysis and data-driven method for reconstruction of structural defects

Ultrasonic guided wave technology has played a significant role in the field of non-destructive testing as it employs acoustic waves that have advantages of high propagation efficiency and low energy consumption during the inspect process. However, theoretical solutions to guided wave scattering problems using assumptions such as Born approximation, have led to the poor quality of the reconstructed results. To address this issue, a novel approach to quantitative reconstruction of defects using the integration of data-driven method with the guided wave scattering analysis has been proposed in this paper. Based on the geometrical information of defects and initial results by the theoretical analysis of defect reconstructions, a deep learning neural network model is built to reveal the physical relationship between defects and the received signals. This data-driven model is then applied to quantitatively assess and characterize defect profiles in structures, reduce the inaccuracy of the theoretical modelling and eliminate the impact of noise pollution in the process of inspection. To demonstrate advantages of the developed approach to reconstructions of defects with complex profiles, numerical examples including basic defect profiles and a defect with the noisy fringe have been examined. Results show that this approach has greater accuracy for reconstruction of defects in structures as compared with the analytical method and provides a valuable insight into the development of artificial intelligence-assisted inspection systems with high accuracy and efficiency in the field of non-destructive testing.

preprint2020arXiv

Accelerating linear solvers for Stokes problems with C++ metaprogramming

The efficient solution of large sparse saddle point systems is very important in computational fluid mechanics. The discontinuous Galerkin finite element methods have become increasingly popular for incompressible flow problems but their application is limited due to high computational cost. We describe the C++ programming techniques that may help to accelerate linear solvers for such problems. The approach is based on the policy-based design pattern and partial template specialization, and is implemented in the open source AMGCL library. The efficiency is demonstrated with the example of accelerating an iterative solver of a discontinuous Galerkin finite element method for the Stokes problem. The implementation allows selecting algorithmic components of the solver by adjusting template parameters without any changes to the codebase. It is possible to switch the system matrix to use small statically sized blocks to store the nonzero values, or use a mixed precision solution, which results in up to 4 times speedup, and reduces the memory footprint of the algorithm by about 40\%. We evaluate both monolithic and composite preconditioning strategies for the 3 benchmark problems. The performance of the proposed solution is compared with a multithreaded direct Pardiso solver and a parallel iterative PETSc solver.

preprint2020arXiv

Acoustic black hole in Schwarzschild spacetime: quasi-normal modes, analogous Hawking radiation and shadows

Various properties of acoustic black holes constructed in Minkowski spacetime have been widely studied in the past decades. Recently the acoustic black holes in general spacetime were proposed . In this paper, we first investigate the basic characteristics of `curved&#39; acoustic black hole in Schwarzschild spacetime, including the quasi-normal modes, grey-body factor and analogous Hawking radiation. We find that the signal of quasi-normal mode is weaker than that of Schwarzschild black hole. Moreover, as the tuning parameter increases, both the positive real part and negative imaginal part of the quasi-normal frequency approach to the horizonal axis, but they will not change sign. This means that all the perturbations could die off and the system is stable under those perturbations. Since the larger tuning parameter suppresses the effective potential barrier, so it enhances the grey-body factor. The energy emission rate of Hawking radiation does not monotonically increase of the tuning parameter because of the non-monotonicity of the Hawking temperature. Finally, as a first attempt, we study the acoustic black hole shadow. The radius of acoustic shadow becomes larger as the tuning parameter increases, because both the related acoustic horizon and the acoustic sphere become larger. Our studies could help us to further understand the near horizon geometrical features of the black hole. We also expect that our observations could be detected experimentally in the near future.

preprint2020arXiv

Anapole mediated giant photothermal nonlinearity in nanostructured silicon

Featured with a plethora of electric and magnetic Mie resonances, high index dielectric nanostructures offer a versatile platform to concentrate light-matter interactions at the nanoscale. By integrating unique features of far-field scattering control and near-field concentration from radiationless anapole states, here, we demonstrate a giant photothermal nonlinearity in single subwavelength-sized silicon nanodisks. The nanoscale energy concentration and consequent near-field enhancements mediated by the anapole mode yield a reversible nonlinear scattering with a large modulation depth and a broad dynamic range, unveiling a record-high nonlinear index change up to 0.5 at mild incident light intensities on the order of MW/cm2. The observed photothermal nonlinearity showcases three orders of magnitude enhancement compared with that of unstructured bulk silicon, as well as nearly one order of magnitude higher than that through the radiative electric dipolar mode. Such nonlinear scattering can empower distinctive point spread functions in confocal reflectance imaging, offering the potential for far-field localization of nanostructured Si with an accuracy approaching 40 nm. Our findings shed new light on active silicon photonics based on optical anapoles.

preprint2020arXiv

Attention-based Transducer for Online Speech Recognition

Recent studies reveal the potential of recurrent neural network transducer (RNN-T) for end-to-end (E2E) speech recognition. Among some most popular E2E systems including RNN-T, Attention Encoder-Decoder (AED), and Connectionist Temporal Classification (CTC), RNN-T has some clear advantages given that it supports streaming recognition and does not have frame-independency assumption. Although significant progresses have been made for RNN-T research, it is still facing performance challenges in terms of training speed and accuracy. We propose attention-based transducer with modification over RNN-T in two aspects. First, we introduce chunk-wise attention in the joint network. Second, self-attention is introduced in the encoder. Our proposed model outperforms RNN-T for both training speed and accuracy. For training, we achieves over 1.7x speedup. With 500 hours LAIX non-native English training data, attention-based transducer yields ~10.6% WER reduction over baseline RNN-T. Trained with full set of over 10K hours data, our final system achieves ~5.5% WER reduction over that trained with the best Kaldi TDNN-f recipe. After 8-bit weight quantization without WER degradation, RTF and latency drop to 0.34~0.36 and 268~409 milliseconds respectively on a single CPU core of a production server.

preprint2020arXiv

Centrality dependence of multiplicity fluctuations from a hydrodynamical approach

As one of the possible signals for the whereabouts of the critical point on the QCD phase diagram, recently, the multiplicity fluctuations in heavy-ion collisions have aroused much attention. It is a crucial observable of the Beam Energy Scan program of the Relativistic Heavy Ion Collider. In this work, we investigate the centrality dependence of the multiplicity fluctuations regarding the recent measurements from STAR Collaboration. By employing a hydrodynamical approach, the present study is dedicated to the noncritical aspects of the phenomenon. To be specific, in addition to the thermal fluctuations, finite volume corrections, and resonance decay at the freeze-out surface, the model is focused on the properties of the hydrodynamic expansion of the system and the event-by-event initial fluctuations. It is understood that the real signal of the critical point can only be obtained after appropriately subtracting the background, the latter is investigated in the present work. Besides the experimental data, our results are also compared to those of the hadronic resonance gas, as well as transport models.

preprint2020arXiv

Creation of Synthetic Networked PMU Data: A Generative Adversarial Network Approach

This paper introduces a machine learning-based approach to synthetically creating realistic phasor measurement unit (PMU) data streams of multiple transient types. In contrast to the existing literature of transient simulation-based data generation methods, we propose a generative adversarial network (GAN) based approach to learning directly from the historical data and simultaneously reproduce multiple PMU data streams. The synthetic PMU data streams reflect meaningful dynamic characteristics which observe first principles such as Kirchhoff&#39;s laws. The efficacy of this approach is demonstrated by numerical studies on the IEEE 39-bus system. We validate the fidelity and flexibility of the synthetic data via statistical resemblance and modal analysis approaches. Finally we illustrate a practical application scenario for the usage of the synthetic PMU data, i.e. leverage the synthetic data to improve the performance of the event classification algorithms.

preprint2020arXiv

Design-controlled Synthesis of IrO$_2$ sub-monolayers on Au Nanodendrites: Marrying Plasmonic and Electrocatalytic Properties

We develop herein plasmonic-catalytic Au-IrO$_2$ nanostructures with a morphology optimized for efficient light harvesting and catalytic surface area; the nanoparticles have a dendritic morphology, with closely spaced Au branches all partially covered by an ultrathin (1 nm) IrO$_2$ shell. This nanoparticle architecture optimizes optical features due to the interactions of closely spaced plasmonic branches forming electromagnetic hot spots, and the ultra-thin IrO$_2$ layer maximizes efficient use of this expensive catalyst. This concept was evaluated towards the enhancement of the electrocatalytic performances towards the oxygen evolution reaction (OER) as a model transformation. The OER can play a central role in meeting future energy demands but the performance of conventional electrocatalysts in this reaction is limited by the sluggish OER kinetics. We demonstrate an improvement of the OER performance for one of the most active OER catalysts, IrO$_2$, by harvesting plasmonic effects from visible light illumination in multimetallic nanoparticles. We find that the OER activity for the Au-IrO$_2$ nanodendrites can be improved under LSPR excitation, matching best properties reported in the literature. Our simulations and electrocatalytic data demonstrate that the enhancement in OER activities can be attributed to an electronic interaction between Au and IrO$_2$ and to the activation of Ir-O bonds by LSPR excited hot holes, leading to a change in the reaction mechanism (rate-determinant step) under visible light illumination.

preprint2020arXiv

Echoes from phantom wormholes

We study the time evolution of the test scalar and electromagnetic fields perturbations in configurations of phantom wormholes surrounded by dark energy with state parameter $ω< -1$. We observe obvious signals of echoes reflecting wormholes properties and disclose the physical reasons behind such phenomena. In particular, we find that the dark energy equation of state has a clear imprint in echoes in wave perturbations. When $ω$ approaches the phantom divide $ω=-1$ from below, the delay time of echoes becomes longer. The echo of gravitational wave is likely to be detected in the near future, the signature of the dark energy equation of state in the echo spectrum can serve as a local measurement of the dark energy.

preprint2020arXiv

Efficient Sentence Embedding via Semantic Subspace Analysis

A novel sentence embedding method built upon semantic subspace analysis, called semantic subspace sentence embedding (S3E), is proposed in this work. Given the fact that word embeddings can capture semantic relationship while semantically similar words tend to form semantic groups in a high-dimensional embedding space, we develop a sentence representation scheme by analyzing semantic subspaces of its constituent words. Specifically, we construct a sentence model from two aspects. First, we represent words that lie in the same semantic group using the intra-group descriptor. Second, we characterize the interaction between multiple semantic groups with the inter-group descriptor. The proposed S3E method is evaluated on both textual similarity tasks and supervised tasks. Experimental results show that it offers comparable or better performance than the state-of-the-art. The complexity of our S3E method is also much lower than other parameterized models.

preprint2020arXiv

Error estimates of some splitting schemes for charged-particle dynamics under strong magnetic field

In this work, we consider the error estimates of some splitting schemes for the charged-particle dynamics under a strong magnetic field. We first propose a novel energy-preserving splitting scheme with computational cost per step independent from the strength of the magnetic field. Then under the maximal ordering scaling case, we establish for the scheme and in fact for a class of Lie-Trotter type splitting schemes, a uniform (in the strength of the magnetic field) and optimal error bound in the position and in the velocity parallel to the magnetic field. For the general strong magnetic field case, the modulated Fourier expansions of the exact and the numerical solutions are constructed to obtain a convergence result. Numerical experiments are presented to illustrate the error and energy behaviour of the splitting schemes.

preprint2020arXiv

Explicit symplectic adapted exponential integrators for charged-particle dynamics in a strong and constant magnetic field

This paper studies explicit symplectic adapted exponential integrators for solving charged-particle dynamics in a strong and constant magnetic field. We first formulate the scheme of adapted exponential integrators and then derive its symplecticity conditions. Based on the symplecticity conditions, we propose five practical explicit symplectic adapted exponential integrators. Two numerical experiments are carried out and the numerical results demonstrate the remarkable numerical behavior of the new methods.

preprint2020arXiv

Feature-Driven Super-Resolution for Object Detection

Although some convolutional neural networks (CNNs) based super-resolution (SR) algorithms yield good visual performances on single images recently. Most of them focus on perfect perceptual quality but ignore specific needs of subsequent detection task. This paper proposes a simple but powerful feature-driven super-resolution (FDSR) to improve the detection performance of low-resolution (LR) images. First, the proposed method uses feature-domain prior which extracts from an existing detector backbone to guide the HR image reconstruction. Then, with the aligned features, FDSR update SR parameters for better detection performance. Comparing with some state-of-the-art SR algorithms with 4$\times$ scale factor, FDSR outperforms the detection performance mAP on MS COCO validation, VOC2007 databases with good generalization to other detection networks.

preprint2020arXiv

Forecasting Interacting Vacuum-Energy Models using Gravitational Waves

The physics of the dark sector has remained one of the controversial areas of modern cosmology at present and hence it naturally attracts massive attention to the scientific community. With the developments of the astronomical data, the physics of the dark sector is becoming much more transparent than it was some twenty years back. The detection of gravitational waves (GWs) has now opened a cluster of possibilities in the cosmological regime. Being motivated by the detection of GWs and its possible impact on the physics of dark matter and dark energy, in this work we focus on the interacting dark energy models. Assuming the simplest possibility in which the vacuum energy with equation-of-state $w_x =-1$ is allowed to interact with the pressureless dark matter, we have extracted the constraints of the cosmological parameters. We find that the addition of the GWs data to the CMB measurements significantly improves up to a factor 4 of the parameters space, and up to a factor 2 for the full combination of current cosmological datasets, namely CMB+BAO+Pantheon+RSD+R16+CC+WL. The most affected parameters by the inclusion of the GWs are $Ω_ch^2$, $θ_{MC}$, $ξ$, and the derived parameters $Ω_{m0}$, $σ_8$ and $H_0$.

preprint2020arXiv

From Trend Analysis to Virtual World System Design Requirement Satisfaction Study

Virtual worlds have become global platforms connecting millions of people and containing various technologies. The development of technology, shift of market value, and change of user preference shape the features of virtual worlds. In this paper, we first study the new features of virtual worlds and emergent requirements of system development through trend analysis. Based on the trend analysis, we constructed the new design requirement space. We then discuss the requirement satisfaction of existing virtual world system architectures and highlight their limitations through a literature survey. The comparison of existing system architectures sheds some light on future virtual world system development to match the changing trends of the user market. At the end of this study, we briefly introduce our ongoing study, a new architecture, called Virtual Net, and discuss its possibility in requirement satisfaction and new research challenges.

preprint2020arXiv

Full analytical formulas for frequency response of space-based gravitational wave detectors

The discovery of gravitational waves, which are ripples of space-time itself, opened a new window to test general relativity, because it predicts that there are only plus and cross polarizations for gravitational waves. For alternative theories of gravity, there may be up to six polarizations. The measurement of the polarization is one of the major scientific goals for future gravitational wave detectors. To evaluate the capability of the detector, we need to use the frequency dependent response functions averaged over the source direction and polarization angle. We derive the full analytical formulas of the averaged response functions for all six possible polarizations and present their asymptotic behaviors based on these analytical formulas. Compared with the numerical simulation, the full analytical formulas are more efficient and valid for any equal-arm interferometric gravitational wave detector without optical cavities in the arms and for a time-delay-interferometry Michelson combination.

preprint2020arXiv

Generic Fibers of Parahoric Hitchin Systems

In this paper, we talk about parahoric Hitchin systems over smooth projective curves with structure group a semisimple simply connected group. We describe the geometry of generic fibers of parahoric Hitchin fibrations using root stacks. We work over an algebraically closed field with a mild assumption of the characteristic. All of these can be treated as a generalization of GLn case in [SWW19]

preprint2020arXiv

Graph Structured Network for Image-Text Matching

Image-text matching has received growing interest since it bridges vision and language. The key challenge lies in how to learn correspondence between image and text. Existing works learn coarse correspondence based on object co-occurrence statistics, while failing to learn fine-grained phrase correspondence. In this paper, we present a novel Graph Structured Matching Network (GSMN) to learn fine-grained correspondence. The GSMN explicitly models object, relation and attribute as a structured phrase, which not only allows to learn correspondence of object, relation and attribute separately, but also benefits to learn fine-grained correspondence of structured phrase. This is achieved by node-level matching and structure-level matching. The node-level matching associates each node with its relevant nodes from another modality, where the node can be object, relation or attribute. The associated nodes then jointly infer fine-grained correspondence by fusing neighborhood associations at structure-level matching. Comprehensive experiments show that GSMN outperforms state-of-the-art methods on benchmarks, with relative Recall@1 improvements of nearly 7% and 2% on Flickr30K and MSCOCO, respectively. Code will be released at: https://github.com/CrossmodalGroup/GSMN.

preprint2020arXiv

Great Chiral Fluorescence from Optical Duality Silver Nanostructures Enabled by 3D Laser Printing

Featured by prominent flexibility and fidelity in producing sophisticated stereoscopic structures transdimensionally, three-dimensional (3D) laser printing technique has vastly extended the toolkit for delivering diverse functional devices. Yet chiral nanoemitters heavily resorting to artificial structures that manifest efficient emission and tightly confined light-mater interactions simultaneously remains alluring but dauntingly challenging for this technique at this moment. In this work, we assert the chiral photoluminescence is implemented from silver nanostructures of optical duality in one go via a twofold three-dimensional laser printing scheme. Such laser printing protocol allows the highly desired duality by simultaneously producing uniformly distributed fluorescent silver nanoclusters and aggregated plasmonic silver nanoparticles to tightly confine chiral interactions at the nanoscale. A helical emitter of 550 nm-helix-diameter as fabricated has seen a record-high luminescence anisotropic factor with the absolute value up to 0.58, which is two orders of magnitude greater than fluorescent chiral silver clusters. This method holds great promise for future versatile applications in chiroptical nanodevices.

preprint2020arXiv

Heterogeneous Graph Attention Networks for Early Detection of Rumors on Twitter

With the rapid development of mobile Internet technology and the widespread use of mobile devices, it becomes much easier for people to express their opinions on social media. The openness and convenience of social media platforms provide a free expression for people but also cause new social problems. The widespread of false rumors on social media can bring about the panic of the public and damage personal reputation, which makes rumor automatic detection technology become particularly necessary. The majority of existing methods for rumor detection focus on mining effective features from text contents, user profiles, and patterns of propagation. Nevertheless, these methods do not take full advantage of global semantic relations of the text contents, which characterize the semantic commonality of rumors as a key factor for detecting rumors. In this paper, we construct a tweet-word-user heterogeneous graph based on the text contents and the source tweet propagations of rumors. A meta-path based heterogeneous graph attention network framework is proposed to capture the global semantic relations of text contents, together with the global structure information of source tweet propagations for rumor detection. Experiments on real-world Twitter data demonstrate the superiority of the proposed approach, which also has a comparable ability to detect rumors at a very early stage.

preprint2020arXiv

Holographic Principle and the Second Law in Stephani Cosmology Revisited

We show in a model-independent way that the inhomogeneous cosmological class II Stephani model fulfills both the the cosmological holographic principle, and that the entropy is increasing with time. By this we mean the result does not depend on any assumption on the time evolution of the scale factor, or on the matter content of the Universe, we also do not need to use the numerical values of the cosmological parameters, which are inferred in the framework of the usual homogeneous Friedmann model. Therefore our analysis is not affected by the tension of the current estimates of the Hubble parameter, and does not rely on any model-dependent assumption of the entropy amount at the present epoch. Our analysis allows us to set an upper limit for the inhomogeneity parameter of the model, an upper bound for the size that this type of universe can reach during the time evolution, a lower bound for the entropy abundance, and an estimate of the present day value of the deceleration parameter.

preprint2020arXiv

Hydrodynamic results on multiplicity fluctuations in heavy-ion collisions

Multiplicity fluctuations are one of the most crucial observables in the Beam Energy Scan program of the Relativistic Heavy Ion Collider. It is understood that they can be utilized to probe the whereabouts of the critical point on the phase diagram of the QCD matter. However, a significant portion of these fluctuations is, apart from that related to the QCD phase transition, attributed to the other origins, which we refer to as &#34;noncritical&#34; ones. The present study is dedicated to the noncritical aspects of the multiplicity fluctuations in heavy-ion collisions. In particular, we focus on those of dynamical origin, such as the hydrodynamic expansion of the system and the event-by-event initial fluctuations, in addition to the usual thermal fluctuations, finite volume corrections, and resonance decay at the freeze-out surface. The obtained results are compared to those of the hadronic resonance gas model as well as to the experimental data.

preprint2020arXiv

Inductive Unsupervised Domain Adaptation for Few-Shot Classification via Clustering

Few-shot classification tends to struggle when it needs to adapt to diverse domains. Due to the non-overlapping label space between domains, the performance of conventional domain adaptation is limited. Previous work tackles the problem in a transductive manner, by assuming access to the full set of test data, which is too restrictive for many real-world applications. In this paper, we set out to tackle this issue by introducing a inductive framework, DaFeC, to improve Domain adaptation performance for Few-shot classification via Clustering. We first build a representation extractor to derive features for unlabeled data from the target domain (no test data is necessary) and then group them with a cluster miner. The generated pseudo-labeled data and the labeled source-domain data are used as supervision to update the parameters of the few-shot classifier. In order to derive high-quality pseudo labels, we propose a Clustering Promotion Mechanism, to learn better features for the target domain via Similarity Entropy Minimization and Adversarial Distribution Alignment, which are combined with a Cosine Annealing Strategy. Experiments are performed on the FewRel 2.0 dataset. Our approach outperforms previous work with absolute gains (in classification accuracy) of 4.95%, 9.55%, 3.99% and 11.62%, respectively, under four few-shot settings.

preprint2020arXiv

InsteaDMatic: Towards cross-platform automated continuous rotation electron diffraction

A DigitalMicrograph script InsteaDMatic has been developed to facilitate rapid automated continuous rotation electron diffraction (cRED) data acquisition. The script coordinates microscope functions, such as stage rotation, camera functions relevant for data collection, and stores the experiment metadata. The script is compatible with any microscope that can be controlled by DigitalMicrograph and has been tested on both JEOL and Thermo Fisher Scientific microscopes. A proof-of-concept has been performed through employing InsteaDMatic for data collection and structure determination of a ZSM-5 zeolite. The influence of illumination settings and electron dose rate on the quality of diffraction data, unit cell determination and structure solution has been investigated in order to optimize the data acquisition procedure.

preprint2020arXiv

Joint Extraction of Entities and Relations Based on a Novel Decomposition Strategy

Joint extraction of entities and relations aims to detect entity pairs along with their relations using a single model. Prior work typically solves this task in the extract-then-classify or unified labeling manner. However, these methods either suffer from the redundant entity pairs, or ignore the important inner structure in the process of extracting entities and relations. To address these limitations, in this paper, we first decompose the joint extraction task into two interrelated subtasks, namely HE extraction and TER extraction. The former subtask is to distinguish all head-entities that may be involved with target relations, and the latter is to identify corresponding tail-entities and relations for each extracted head-entity. Next, these two subtasks are further deconstructed into several sequence labeling problems based on our proposed span-based tagging scheme, which are conveniently solved by a hierarchical boundary tagger and a multi-span decoding algorithm. Owing to the reasonable decomposition strategy, our model can fully capture the semantic interdependency between different steps, as well as reduce noise from irrelevant entity pairs. Experimental results show that our method outperforms previous work by 5.2%, 5.9% and 21.5% (F1 score), achieving a new state-of-the-art on three public datasets

preprint2020arXiv

MeshODE: A Robust and Scalable Framework for Mesh Deformation

We present MeshODE, a scalable and robust framework for pairwise CAD model deformation without prespecified correspondences. Given a pair of shapes, our framework provides a novel shape feature-preserving mapping function that continuously deforms one model to the other by minimizing fitting and rigidity losses based on the non-rigid iterative-closest-point (ICP) algorithm. We address two challenges in this problem, namely the design of a powerful deformation function and obtaining a feature-preserving CAD deformation. While traditional deformation directly optimizes for the coordinates of the mesh vertices or the vertices of a control cage, we introduce a deep bijective mapping that utilizes a flow model parameterized as a neural network. Our function has the capacity to handle complex deformations, produces deformations that are guaranteed free of self-intersections, and requires low rigidity constraining for geometry preservation, which leads to a better fitting quality compared with existing methods. It additionally enables continuous deformation between two arbitrary shapes without supervision for intermediate shapes. Furthermore, we propose a robust preprocessing pipeline for raw CAD meshes using feature-aware subdivision and a uniform graph template representation to address artifacts in raw CAD models including self-intersections, irregular triangles, topologically disconnected components, non-manifold edges, and nonuniformly distributed vertices. This facilitates a fast deformation optimization process that preserves global and local details. Our code is publicly available.

preprint2020arXiv

Modeling Discourse Structure for Document-level Neural Machine Translation

Recently, document-level neural machine translation (NMT) has become a hot topic in the community of machine translation. Despite its success, most of existing studies ignored the discourse structure information of the input document to be translated, which has shown effective in other tasks. In this paper, we propose to improve document-level NMT with the aid of discourse structure information. Our encoder is based on a hierarchical attention network (HAN). Specifically, we first parse the input document to obtain its discourse structure. Then, we introduce a Transformer-based path encoder to embed the discourse structure information of each word. Finally, we combine the discourse structure information with the word embedding before it is fed into the encoder. Experimental results on the English-to-German dataset show that our model can significantly outperform both Transformer and Transformer+HAN.

preprint2020arXiv

New probe of gravity: strongly lensed gravitational wave multi-messenger approach

Strong gravitational lensing by galaxies provides us with a unique opportunity to understand the nature of gravity on galactic and extra-galactic scales. In this paper, we propose a new multimessenger approach using data from both gravitational wave (GW) and the corresponding electromagnetic (EM) counterpart to infer the constraint of the modified gravity (MG) theory denoted by the scale dependent phenomenological parameter. To demonstrate the robustness of this approach, we calculate the time-delay predictions by choosing various values of the phenomenological parameters and then compare them with that from the general relativity (GR). For the third generation ground-based GW observatory, with one typical strongly lensed GW+EM event, and assuming that the dominated error from the stellar velocity dispersions is 5\%, the GW time-delay data can distinguish an 18\% MG effect on a scale of tens of kiloparsecs with a $68\%$ confidence level. Assuming GR and a Singular Isothermal Sphere mass model, there exists a simplified consistency relationship between time-delay and imaging data. This relationship does not require for the velocity dispersion measurement, and hence can avoid major uncertainties. By using this relationship, the multimessenger approach is able to distinguish an $8\%$ MG effect. Our results show that the GW multimessenger approach can play an important role in revealing the nature of gravity on the galactic and extra-galactic scales.

preprint2020arXiv

No static regular black holes in Einstein-complex-scalar-Gauss-Bonnet gravity

In this brief report, we investigate the existence of 4-dimensional static spherically symmetric black holes (BHs) in the Einstein-complex-scalar-Gauss-Bonnet (EcsGB) gravity with an arbitrary potential $V(ϕ)$ and a coupling $f(ϕ)$ between the scalar field $ϕ$ and the Gauss-Bonnet (GB) term. We find that static regular BH solutions with complex scalar hairs do not exist. This conclusion does not depend on the coupling between the GB term and the scalar field, nor on the scalar potential $V(ϕ)$ and the presence of a cosmological constant $Λ$ (which can be either positive or negative), as longer as the scalar field remains complex and is regular across the horizon.

preprint2020arXiv

Object picture of scalar field perturbation on Kerr black hole in scalar-Einstein-Gauss-Bonnet theory

Scalar perturbations around the Kerr black hole in scalar-Einstein-Gauss-Bonnet (sEGB) theory are studied in the time domain. To overcome the &#34;outer boundary problem&#34; that usually encountered in traditional numerical calculations, we apply the hyperboloidal compactification technique to perform a $(2+1)$-dimensional simulation aiming to obtain a precise object picture of the wave propagation under the scalar field perturbation. We find that the big enough coupling constant between the scalar field and the Gauss-Bonnet curvature is responsible to destroy the original Kerr black hole. The breakdown of the Kerr spacetime happens earlier and the instability becomes more violent when the coupling becomes stronger. We further present object confirmations on the special case for the negative coupling where there exists a minimum rotation and below which the instability can never happen no matter how strong the coupling is. We also illustrate the fine structure property in the quasinormal ringing frequency once there is the coupling, and present the characteristic imprint of the sEGB theory. We expect that such a fine structure can be detected in the future gravitational wave observation to test the sEGB theory.

preprint2020arXiv

On nonlinearity in hydrodynamic response to the initial geometry in relativistic heavy-ion collisions

In the context of event-by-event hydrodynamic description, we analyze the implications of two models characterized by distinct initial conditions. The initial energy density of the first model adopts a Gaussian-type distribution, while those of the second one are features by high energy peripheral tubes. We calibrate the initial conditions of both models so that their initial probability distribution of eccentricity are mostly identical. Subsequently, the resultant scaled probability distributions of collective flow and the correlations between flow harmonic and eccentricity coefficients are investigated. Besides, the calculations are carried out for particle correlations regarding the symmetric cumulant, mixed harmonics, and nonlinear response coefficients. Although the resultant two-particle correlations possess similar shapes, numerical calculations indicate a subtle difference between the two models. To be specific, the difference resides in more detailed observables such as the probability distributions of elliptic flow as well as Pearson correlation coefficient regarding higher-order harmonics. We discuss several essential aspects concerning the linearity and nonlinearity between initial eccentricities and final state anisotropies. Further implications are addressed.

preprint2020arXiv

Particle Swarm Optimisation for Evolving Deep Neural Networks for Image Classification by Evolving and Stacking Transferable Blocks

Deep Convolutional Neural Networks (CNNs) have been widely used in image classification tasks, but the process of designing CNN architectures is very complex, so Neural Architecture Search (NAS), automatically searching for optimal CNN architectures, has attracted more and more research interests. However, the computational cost of NAS is often too high to apply NAS on real-life applications. In this paper, an efficient particle swarm optimisation method named EPSOCNN is proposed to evolve CNN architectures inspired by the idea of transfer learning. EPSOCNN successfully reduces the computation cost by minimising the search space to a single block and utilising a small subset of the training set to evaluate CNNs during evolutionary process. Meanwhile, EPSOCNN also keeps very competitive classification accuracy by stacking the evolved block multiple times to fit the whole dataset. The proposed EPSOCNN algorithm is evaluated on CIFAR-10 dataset and compared with 13 peer competitors comprised of deep CNNs crafted by hand, learned by reinforcement learning methods and evolved by evolutionary computation approaches, which shows very promising results by outperforming all of the peer competitors with regard to the classification accuracy, number of parameters and the computational cost.

preprint2020arXiv

SBERT-WK: A Sentence Embedding Method by Dissecting BERT-based Word Models

Sentence embedding is an important research topic in natural language processing (NLP) since it can transfer knowledge to downstream tasks. Meanwhile, a contextualized word representation, called BERT, achieves the state-of-the-art performance in quite a few NLP tasks. Yet, it is an open problem to generate a high quality sentence representation from BERT-based word models. It was shown in previous study that different layers of BERT capture different linguistic properties. This allows us to fusion information across layers to find better sentence representation. In this work, we study the layer-wise pattern of the word representation of deep contextualized models. Then, we propose a new sentence embedding method by dissecting BERT-based word models through geometric analysis of the space spanned by the word representation. It is called the SBERT-WK method. No further training is required in SBERT-WK. We evaluate SBERT-WK on semantic textual similarity and downstream supervised tasks. Furthermore, ten sentence-level probing tasks are presented for detailed linguistic analysis. Experiments show that SBERT-WK achieves the state-of-the-art performance. Our codes are publicly available.

preprint2020arXiv

Searching for the Shortest Path to the Point of Voltage Collapse on the Algebraic Manifold

Voltage instability is one of the main causes of power system blackouts. Emerging technologies such as renewable energy integration, distributed energy resources and demand responses may introduce significant uncertainties in analyzing of system-wide voltage stability. This paper starts with summarizing different known voltage instability mechanisms, and then focuses on a class of voltage instability which is induced by the singular surface of the algebraic manifold. We argue and demonstrate that this class can include both dynamic and static voltage instabilities. To determine the minimum distance to the point of voltage collapse, a new formulation is proposed on the algebraic manifold. This formulation is further converted into an optimal control framework for identifying the path with minimum distance on the manifold. Comprehensive numerical studies are conducted on some manifolds of different power system test cases and demonstrate that the proposed method yields candidates for the local shortest paths to the singular surface on the manifold for both the dynamic model and the static model. Simulations show that the proposed method can identify shorter paths on the manifold than the paths associated with the minimum Euclidean distances. Furthermore, the proposed method always locates the right path ending at the correct singular surface which is responsible for the voltage instability; while the Euclidean distance formulation can mistakenly find solutions on the wrong singular surface. A broad range of potential applications using the proposed method are also discussed.

preprint2020arXiv

Surrogate-assisted Particle Swarm Optimisation for Evolving Variable-length Transferable Blocks for Image Classification

Deep convolutional neural networks have demonstrated promising performance on image classification tasks, but the manual design process becomes more and more complex due to the fast depth growth and the increasingly complex topologies of convolutional neural networks. As a result, neural architecture search has emerged to automatically design convolutional neural networks that outperform handcrafted counterparts. However, the computational cost is immense, e.g. 22,400 GPU-days and 2,000 GPU-days for two outstanding neural architecture search works named NAS and NASNet, respectively, which motivates this work. A new effective and efficient surrogate-assisted particle swarm optimisation algorithm is proposed to automatically evolve convolutional neural networks. This is achieved by proposing a novel surrogate model, a new method of creating a surrogate dataset and a new encoding strategy to encode variable-length blocks of convolutional neural networks, all of which are integrated into a particle swarm optimisation algorithm to form the proposed method. The proposed method shows its effectiveness by achieving competitive error rates of 3.49% on the CIFAR-10 dataset, 18.49% on the CIFAR-100 dataset, and 1.82% on the SVHN dataset. The convolutional neural network blocks are efficiently learned by the proposed method from CIFAR-10 within 3 GPU-days due to the acceleration achieved by the surrogate model and the surrogate dataset to avoid the training of 80.1% of convolutional neural network blocks represented by the particles. Without any further search, the evolved blocks from CIFAR-10 can be successfully transferred to CIFAR-100 and SVHN, which exhibits the transferability of the block learned by the proposed method.

preprint2020arXiv

TEAM: An Taylor Expansion-Based Method for Generating Adversarial Examples

Although Deep Neural Networks(DNNs) have achieved successful applications in many fields, they are vulnerable to adversarial examples.Adversarial training is one of the most effective methods to improve the robustness of DNNs, and it is generally considered as solving a saddle point problem that minimizes risk and maximizes perturbation.Therefore, powerful adversarial examples can effectively replicate the situation of perturbation maximization to solve the saddle point problem.The method proposed in this paper approximates the output of DNNs in the input neighborhood by using the Taylor expansion, and then optimizes it by using the Lagrange multiplier method to generate adversarial examples. If it is used for adversarial training, the DNNs can be effectively regularized and the defects of the model can be improved.

preprint2020arXiv

TEAM: We Need More Powerful Adversarial Examples for DNNs

Although deep neural networks (DNNs) have achieved success in many application fields, it is still vulnerable to imperceptible adversarial examples that can lead to misclassification of DNNs easily. To overcome this challenge, many defensive methods are proposed. Indeed, a powerful adversarial example is a key benchmark to measure these defensive mechanisms. In this paper, we propose a novel method (TEAM, Taylor Expansion-Based Adversarial Methods) to generate more powerful adversarial examples than previous methods. The main idea is to craft adversarial examples by minimizing the confidence of the ground-truth class under untargeted attacks or maximizing the confidence of the target class under targeted attacks. Specifically, we define the new objective functions that approximate DNNs by using the second-order Taylor expansion within a tiny neighborhood of the input. Then the Lagrangian multiplier method is used to obtain the optimize perturbations for these objective functions. To decrease the amount of computation, we further introduce the Gauss-Newton (GN) method to speed it up. Finally, the experimental result shows that our method can reliably produce adversarial examples with 100% attack success rate (ASR) while only by smaller perturbations. In addition, the adversarial example generated with our method can defeat defensive distillation based on gradient masking.

preprint2020arXiv

The Horizon of the McVittie Black Hole: On the Role of the Cosmic Fluid Modeling

In this paper, we investigate the existence and time evolution of the cosmological and event horizons in a McVittie universe whose expansion is driven by the Redlich-Kwong, (Modified) Berthelot, Dieterici, and Peng-Robinson fluids, respectively. The equations of state of these fluids are rich enough to account for both exotic and regular, as well as ideal and non-ideal matter contents of the universe. We show that the cosmological horizon is expanding, while the event horizon is shrinking along the cosmic time evolution. The former achieves larger size for regular types of matter, contrary to the latter. The strength of interactions within the cosmic fluid are shown to play a more important role in affecting the evolution of the event horizon, rather than of the cosmological horizon in the case of a singularity-free universe. While the cosmological horizon always exists during the time evolution, the event horizon can exist only when a certain relationship between the Hawking-Hayward quasi-local mass and the Hubble function is fulfilled. In this manner, we can study the role played by the large-scale physics (cosmic evolution) on the local scale physics (evolution of a black hole).

preprint2020arXiv

The parameter-free Finger-Of-God model and its application to 21cm intensity mapping

Using the galaxy catalog built from ELUCID N-body simulation and the semi-analytical galaxy formation model, we have built a mock HI intensity mapping map. We have implemented the Finger-of-God (FoG) effect in the map by considering the galaxy HI gas velocity dispersion. By comparing the HI power spectrum in the redshift space with the measurement from IllustrisTNG simulation, we have found that such FoG effect can explain the discrepancy between current mock map built from N-body simulation and Illustris TNG simulation. Then we built a parameter-free FoG model and a shot-noise model to calculate the HI power spectrum. We found that our model can accurately fit both the monopole and quadrupole moments of the HI matter power spectrum. Our method of building the mock HI intensity map and the parameter-free FoG model will be widely useful for the up-coming 21cm intensity mapping experiments, such as CHIME, Tianlai, BINGO, FAST and SKA. It is also crucial for us to study the non-linear effects in 21cm intensity mapping.

preprint2020arXiv

The TianQin project: current progress on science and technology

TianQin is a planned space-based gravitational wave (GW) observatory consisting of three earth orbiting satellites with an orbital radius of about $10^5~{\rm km}$. The satellites will form a equilateral triangle constellation the plane of which is nearly perpendicular to the ecliptic plane. TianQin aims to detect GWs between $10^{-4}~{\rm Hz}$ and $1~{\rm Hz}$ that can be generated by a wide variety of important astrophysical and cosmological sources, including the inspiral of Galactic ultra-compact binaries, the inspiral of stellar-mass black hole binaries, extreme mass ratio inspirals, the merger of massive black hole binaries, and possibly the energetic processes in the very early universe or exotic sources such as cosmic strings. In order to start science operations around 2035, a roadmap called the 0123 plan is being used to bring the key technologies of TianQin to maturity, supported by the construction of a series of research facilities on the ground. Two major projects of the 0123 plan are being carried out. In this process, the team has created a new generation $17~{\rm cm}$ single-body hollow corner-cube retro-reflector which has been launched with the QueQiao satellite on 21 May 2018; a new laser ranging station equipped with a $1.2~{\rm m}$ telescope has been constructed and the station has successfully ranged to all the five retro-reflectors on the Moon; and the TianQin-1 experimental satellite has been launched on 20 December 2019 and the first round result shows that the satellite has exceeded all of its mission requirements.

preprint2020arXiv

Thermodynamics of Shearing Massless Scalar Field Spacetimes is Inconsistent With the Weyl Curvature Hypothesis

Our Universe has an arrow of time. In accordance with the second law of thermodynamics, entropy has been increasing ever since the Big Bang. The fact that matter is in thermal equilibrium in the very early Universe, as indicated by the cosmic microwave background, has led to the idea that gravitational entropy must be very low in the beginning. Penrose proposed that gravitational entropy can be quantified by the Weyl curvature, which increases as structures formed. A concrete realization of such a measure is the Clifton-Ellis-Tavakol gravitational entropy, which has been shown to be increasing in quite a number of cosmological models. In this work, we show a counter-example involving a class of inhomogeneous universes that are supported by a chameleon massless scalar field and exhibit anisotropic spacetime shearing effects. In fact, in our model the Clifton-Ellis-Tavakol gravitational entropy is increasing although the magnitude of the Weyl curvature is decreasing; this is due to the growth of the spacetime shear. The topology and the values of the three free parameters of the model are constrained by imposing a positive energy density for the cosmic fluid, and the thermodynamical requirements which follow from the cosmological holographic principle and the second law. It is shown that a negative deceleration parameter and a time decreasing Weyl curvature automatically follow from those conditions. Thus, we argue that our model can account for the formation of some primordial structures, like the Large Quasar Groups, which has required a non-standard evolution of the spatial anisotropies.

preprint2020arXiv

Triple-GAIL: A Multi-Modal Imitation Learning Framework with Generative Adversarial Nets

Generative adversarial imitation learning (GAIL) has shown promising results by taking advantage of generative adversarial nets, especially in the field of robot learning. However, the requirement of isolated single modal demonstrations limits the scalability of the approach to real world scenarios such as autonomous vehicles&#39; demand for a proper understanding of human drivers&#39; behavior. In this paper, we propose a novel multi-modal GAIL framework, named Triple-GAIL, that is able to learn skill selection and imitation jointly from both expert demonstrations and continuously generated experiences with data augmentation purpose by introducing an auxiliary skill selector. We provide theoretical guarantees on the convergence to optima for both of the generator and the selector respectively. Experiments on real driver trajectories and real-time strategy game datasets demonstrate that Triple-GAIL can better fit multi-modal behaviors close to the demonstrators and outperforms state-of-the-art methods.

preprint2020arXiv

Unified Multi-Criteria Chinese Word Segmentation with BERT

Multi-Criteria Chinese Word Segmentation (MCCWS) aims at finding word boundaries in a Chinese sentence composed of continuous characters while multiple segmentation criteria exist. The unified framework has been widely used in MCCWS and shows its effectiveness. Besides, the pre-trained BERT language model has been also introduced into the MCCWS task in a multi-task learning framework. In this paper, we combine the superiority of the unified framework and pretrained language model, and propose a unified MCCWS model based on BERT. Moreover, we augment the unified BERT-based MCCWS model with the bigram features and an auxiliary criterion classification task. Experiments on eight datasets with diverse criteria demonstrate that our methods could achieve new state-of-the-art results for MCCWS.

preprint2019arXiv

Fractional Dark Matter decay: cosmological imprints and observational constraints

If a fraction $f_{\rm dcdm}$ of the Dark Matter decays into invisible and massless particles (so-called &#34;dark radiation&#34;) with the decay rate (or inverse lifetime) $Γ_{\rm dcdm}$, such decay will leave distinctive imprints on cosmological observables. With a full consideration of the Boltzmann hierarchy, we calculate the decay-induced impacts not only on the CMB but also on the redshift distortion and the kinetic Sunyaev-Zel&#39;dovich effect, while providing detailed physical interpretations based on evaluating the evolution of gravitational potential. By using the current cosmological data with a combination of Planck 2015, Baryon Acoustic Oscillation and redshift distortion measurements which can improve the constraints, we update the $1σ$ bound on the fraction of decaying DM from $f_{\rm dcdm}\lesssim5.26\%$ to $f_{\rm dcdm}\lesssim2.73\%$ for the short-lived DM (assuming $Γ_{\rm dcdm}/H_0\gtrsim10^4$). However, no constraints are improved from RSD data ($f_{\rm dcdm}\lesssim0.94\%$) for the long-lived DM (i.e., $Γ_{\rm dcdm}/H_0\lesssim10^4$). We also find the fractional DM decay can only slightly reduce the $H_0$ and $σ_8$ tensions, which is consistent with other previous works. Furthermore, our calculations show that the kSZ effect in future would provide a further constraining power on the decaying DM.

preprint2019arXiv

Graph Representation Learning: A Survey

Research on graph representation learning has received a lot of attention in recent years since many data in real-world applications come in form of graphs. High-dimensional graph data are often in irregular form, which makes them more difficult to analyze than image/video/audio data defined on regular lattices. Various graph embedding techniques have been developed to convert the raw graph data into a low-dimensional vector representation while preserving the intrinsic graph properties. In this review, we first explain the graph embedding task and its challenges. Next, we review a wide range of graph embedding techniques with insights. Then, we evaluate several state-of-the-art methods against small and large datasets and compare their performance. Finally, potential applications and future directions are presented.

preprint2019arXiv

Post-Processing of Word Representations via Variance Normalization and Dynamic Embedding

Although embedded vector representations of words offer impressive performance on many natural language processing (NLP) applications, the information of ordered input sequences is lost to some extent if only context-based samples are used in the training. For further performance improvement, two new post-processing techniques, called post-processing via variance normalization (PVN) and post-processing via dynamic embedding (PDE), are proposed in this work. The PVN method normalizes the variance of principal components of word vectors while the PDE method learns orthogonal latent variables from ordered input sequences. The PVN and the PDE methods can be integrated to achieve better performance. We apply these post-processing techniques to two popular word embedding methods (i.e., word2vec and GloVe) to yield their post-processed representations. Extensive experiments are conducted to demonstrate the effectiveness of the proposed post-processing techniques.

preprint2019arXiv

Single-photon computational 3D imaging at 45 km

Long-range active imaging has a variety of applications in remote sensing and target recognition. Single-photon LiDAR (light detection and ranging) offers single-photon sensitivity and picosecond timing resolution, which is desirable for high-precision three-dimensional (3D) imaging over long distances. Despite important progress, further extending the imaging range presents enormous challenges because only weak echo photons return and are mixed with strong noise. Herein, we tackled these challenges by constructing a high-efficiency, low-noise confocal single-photon LiDAR system, and developing a long-range-tailored computational algorithm that provides high photon efficiency and super-resolution in the transverse domain. Using this technique, we experimentally demonstrated active single-photon 3D-imaging at a distance of up to 45 km in an urban environment, with a low return-signal level of $\sim$1 photon per pixel. Our system is feasible for imaging at a few hundreds of kilometers by refining the setup, and thus represents a significant milestone towards rapid, low-power, and high-resolution LiDAR over extra-long ranges.

preprint2019arXiv

Strong Cosmic Censorship in Charged de Sitter spacetime with Scalar Field Non-minimally Coupled to Curvature

We examine the stability and the strong cosmic censorship in the Reissner-Nordstrom-de Sitter (RN-dS) black hole by investigating the evolution of a scalar field non-minimally coupled to the curvature. We find that when the coupling parameter is negative, the RN-dS black hole experiences instability. The instability disappears when the coupling parameter becomes non-negative. With the increase of the coupling parameter, the violation of the strong cosmic censorship occurs at a larger critical charge ratio. But such an increase of the critical charge is suppressed by the increase of the cosmological constant. Different from the minimal coupling situation, it is possible to accommodate $β\ge1$ in the near extremal black hole when the scalar field is non-minimally coupled to curvature. The increase of the cosmological constant can allow $β\ge1$ to be satisfied for even smaller value of the coupling parameter. The existence of $β\ge1$ implies that the resulting curvature can continuously cross the Cauchy horizon.