Source author record

Yang Xu

Yang Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Catalog footprint

What is connected

64 works

31 topics

4 close collaborators

preprint · 2026 · arXiv

Decoding Scientific Experimental Images: The SPUR Benchmark for Perception, Understanding, and Reasoning

We introduce SPUR, a comprehensive benchmark for scientific experimental image perception, understanding, and reasoning, comprising 4,264 question-answering (QA) pairs derived from 1,084 expert-curated images. SPUR features three key innovations: (1) Panel-Level Fine-Grained Perception: evaluating the visual perception of multimodal large language models (MLLMs) across three dimensions (numerical, morphological, and information localization) on six fine-grained panel types; (2) Cross-Panel Relation Understanding: utilizing complex images with an average of 14.3 panels per sample to evaluate MLLMs' ability to decipher intricate cross-panel relations; (3) Expert-Level Reasoning: assessment of qualitative and quantitative reasoning across five experimental paradigms to determine if models can infer conclusions from evidence as human experts do. Comprehensive evaluation of 20 MLLMs and four multimodal Chain-of-Thought (MCoT) methods reveals that current models fall significantly short of the expert-level requirements for scientific image interpretation, underscoring a critical bottleneck in AI for Science (AI4S) research.

preprint · 2025 · arXiv

Anomalous Hall effect and rich magnetic phase diagram of Mn$_{100-x}$Rh$_{x}$ epitaxial films

A series of Mn$_{100-x}$Rh$_x$ ($20 \le x \le 50$) thin films were epitaxially grown on the MgO substrate using the magnetron sputtering technique, and were systematically investigated by magnetization, longitudinal electrical resistivity, and transverse Hall resistivity. After optimizing the growth conditions, phase-pure Mn$_{100-x}$Rh$_x$ films with a cubic CsCl-type structure were obtained, and their magnetic phase diagram was built. The manipulation of Rh content leads to a rich magnetic phase diagram, where three different regimes can be identified: for $x < 40$, Mn$_{100-x}$Rh$_x$ films undergo a ferromagnetic (FM) transition below $T_\mathrm{C} \approx$ 330-350 K; for $40 \le x \le 45$, in addition to the FM transition at $T_\mathrm{C} \approx$ 200 K, Mn$_{100-x}$Rh$_x$ films undergo a FM-to-antiferromagnetic (AFM) transition at $T_\mathrm{N} \approx$ 120 K; finally for $x > 45$, only one AFM transition at $T_\mathrm{N} \approx$ 150 K can be tracked. All the Mn$_{100-x}$Rh$_x$ films exhibit a distinct anomalous Hall effect in their magnetically ordered state, which is most likely due to the intrinsic Berry-curvature mechanism. In addition, all the anomalous Hall transport properties, including the resistivity, conductivity, and angle, exhibit a strong correlation with the magnetic properties of Mn$_{100-x}$Rh$_x$ films, which becomes most evident for $x$ = 35. Our systematic investigations suggest a strong correlation between magnetic properties and electronic band topology in Mn$_{100-x}$Rh$_x$ films, highlighting their great potential for AFM spintronics.
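
The three composition regimes described in this abstract can be summarized as a small lookup. This is only a reading aid: the function name `magnetic_regime` and the returned labels are illustrative choices, and the thresholds and temperatures are the approximate values quoted in the abstract.

```python
def magnetic_regime(x: float) -> str:
    """Return the magnetic regime reported in the abstract for Rh content x.

    Hypothetical helper for illustration; thresholds and temperatures are the
    approximate values quoted in the abstract, which covers 20 <= x <= 50 only.
    """
    if not 20 <= x <= 50:
        raise ValueError("the films studied cover 20 <= x <= 50 only")
    if x < 40:
        return "FM below T_C ~ 330-350 K"
    if x <= 45:
        return "FM below T_C ~ 200 K, then FM-to-AFM at T_N ~ 120 K"
    return "AFM below T_N ~ 150 K"


for x in (35, 42, 48):
    print(x, "->", magnetic_regime(x))
```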

preprint · 2023 · arXiv

A Unified Single-loop Alternating Gradient Projection Algorithm for Nonconvex-Concave and Convex-Nonconcave Minimax Problems

Much recent research effort has been directed to the development of efficient algorithms for solving minimax problems with theoretical convergence guarantees due to the relevance of these problems to a few emergent applications. In this paper, we propose a unified single-loop alternating gradient projection (AGP) algorithm for solving smooth nonconvex-(strongly) concave and (strongly) convex-nonconcave minimax problems. AGP employs simple gradient projection steps for updating the primal and dual variables alternately at each iteration. We show that it can find an $\varepsilon$-stationary point of the objective function in $\mathcal{O}\left( \varepsilon ^{-2} \right)$ (resp. $\mathcal{O}\left( \varepsilon ^{-4} \right)$) iterations under the nonconvex-strongly concave (resp. nonconvex-concave) setting. Moreover, its gradient complexity to obtain an $\varepsilon$-stationary point of the objective function is bounded by $\mathcal{O}\left( \varepsilon ^{-2} \right)$ (resp. $\mathcal{O}\left( \varepsilon ^{-4} \right)$) under the strongly convex-nonconcave (resp. convex-nonconcave) setting. To the best of our knowledge, this is the first time that a simple and unified single-loop algorithm is developed for solving both nonconvex-(strongly) concave and (strongly) convex-nonconcave minimax problems. Moreover, the complexity results for solving the latter (strongly) convex-nonconcave minimax problems have never been obtained before in the literature. Numerical results show the efficiency of the proposed AGP algorithm. Furthermore, we extend the AGP algorithm by presenting a block alternating proximal gradient (BAPG) algorithm for solving more general multi-block nonsmooth nonconvex-(strongly) concave and (strongly) convex-nonconcave minimax problems. We can similarly establish the gradient complexity of the proposed algorithm under these four different settings.
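
To make the idea of alternating gradient projection steps concrete, here is a minimal sketch of a single-loop projected gradient descent-ascent scheme. It is not the paper's AGP algorithm: it uses fixed step sizes and omits any regularization or step-size schedule the authors may use. The toy objective, the box constraint, and all names are assumptions chosen for illustration; the toy problem is strongly convex-strongly concave, an easy special case.

```python
def project(v, lo=-1.0, hi=1.0):
    """Euclidean projection of a scalar onto the interval [lo, hi]."""
    return max(lo, min(hi, v))


def alternating_gradient_projection(grad_x, grad_y, x, y, alpha, beta, iters):
    """Simplified single-loop alternating projected gradient descent-ascent."""
    for _ in range(iters):
        x = project(x - alpha * grad_x(x, y))  # descent step on the min variable
        y = project(y + beta * grad_y(x, y))   # ascent step, using the updated x
    return x, y


# Toy objective (an assumption): f(x, y) = x^2/2 + x*y - y^2/2 on [-1, 1]^2,
# whose unique saddle point is (0, 0).
def grad_x(x, y):
    return x + y


def grad_y(x, y):
    return x - y


x_star, y_star = alternating_gradient_projection(
    grad_x, grad_y, x=1.0, y=-1.0, alpha=0.1, beta=0.1, iters=500
)
print(x_star, y_star)  # both values end up very close to 0
```

Each iteration touches each variable once, which is what "single-loop" refers to: no inner maximization over y is solved before x is updated.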

preprint · 2023 · arXiv

Bayesian Generalized Kernel Inference for Exploration of Autonomous Robots

This paper concerns realizing highly efficient information-theoretic robot exploration with desired performance in complex scenes. We build a continuous lightweight inference model to predict the mutual information (MI) and the associated prediction confidence of the robot's candidate actions which have not been evaluated explicitly. This allows the decision-making stage in robot exploration to run with approximately logarithmic complexity, which also benefits online exploration in large, unstructured, and cluttered places that need more spatial samples to assess and decide. We also develop an objective function to balance the local optimal action with the highest MI value and the global choice with high prediction variance. Extensive numerical and dataset simulations show the desired efficiency of our proposed method without losing exploration performance in different environments. We also provide an open-source implementation on GitHub for the robot community.

preprint · 2022 · arXiv

An asperity-based statistical model for the adhesive friction of elastic nominally flat rough contact interfaces

Contact mechanics-based models for the friction of nominally flat rough surfaces have not been able to adequately capture certain key experimentally observed phenomena, such as the transition from a static friction peak to a lower level of sliding friction and the shear-induced contact area reduction that has been observed in the pre-sliding regime, especially for soft materials. Here, we propose a statistical model based on physically-rooted contact mechanics laws describing the micromechanics of individual junctions. The model considers the quasi-static tangential loading, up to full sliding, of the contact between a smooth rigid flat surface and a nominally flat linear elastic rough surface comprising random independent spherical asperities, and accounts for the coupling between adhesion and friction at the micro-junction level. The model qualitatively reproduces both the macroscopic shear-induced contact area reduction and, remarkably, the static friction peak without the need to explicitly introduce two different friction levels. It also demonstrates how the static friction peak and contact area evolution depend on the normal load and certain key microscale interface properties such as surface energy, mode mixity and frictional shear strength. "Tougher" interfaces (i.e. with larger surface energy and smaller mode mixity parameter) are shown to result in a larger real contact area and a more pronounced static friction peak. Overall, this work provides important insights about how key microscale properties operating at the asperity level can combine with the surface statistics to reproduce important macroscopic responses observed in rough frictional soft contact experiments.

preprint · 2022 · arXiv

Axial and Vector Structure Functions for Lepton-Nucleon Scattering, NuFact 2021 Update

We report on an update (2021) of a phenomenological model for inelastic neutrino- and electron-nucleon scattering cross sections using effective leading order parton distribution functions with a new scaling variable $ξ_w$. Non-perturbative effects are well described using the $ξ_w$ scaling variable in combination with multiplicative $K$ factors at low $Q^2$. The model describes all inelastic charged-lepton-nucleon scattering data (HERA/NMC/BCDMS/SLAC/JLab) ranging from very high $Q^2$ to very low $Q^2$ and down to the $Q^2=0$ photo-production region. The model has been developed to be used in analyses of neutrino oscillation experiments in the few GeV region. The 2021 update accounts for the difference between axial and vector structure functions, which brings it into much better agreement with neutrino-nucleon total cross section measurements. The model has been developed primarily for hadronic final state masses $W$ above 1.8 GeV. However, with additional parameters the model also describes the average neutrino cross sections in the resonance region down to $W$=1.4 GeV.

preprint · 2022 · arXiv

Confidence-rich Localization and Mapping based on Particle Filter for Robotic Exploration

This paper mainly studies the localization and mapping of range sensing robots in the confidence-rich map (CRM) and then extends it to provide a full state estimate for information-theoretic exploration. Most previous works on active simultaneous localization and mapping and exploration assumed known robot poses or utilized inaccurate information metrics to approximate pose uncertainty, resulting in imbalanced exploration performance and efficiency in the unknown environment. This inspires us to extend the confidence-rich mutual information (CRMI) with measurable pose uncertainty. Specifically, we propose a Rao-Blackwellized particle filter-based localization and mapping scheme (RBPF-CLAM) for CRM, then we develop a new closed-form weighting method to improve the localization accuracy without scan matching. We further derive the uncertain CRMI (UCRMI) with the weighted particles by a more accurate approximation. Simulations and experimental evaluations show the localization accuracy and exploration performance of the proposed methods.

preprint · 2022 · arXiv

Crosslinguistic word order variation reflects evolutionary pressures of dependency and information locality

Languages vary considerably in syntactic structure. About 40% of the world's languages have subject-verb-object order, and about 40% have subject-object-verb order. Extensive work has sought to explain this word order variation across languages. However, the existing approaches are not able to explain coherently the frequency distribution and evolution of word order in individual languages. We propose that variation in word order reflects different ways of balancing competing pressures of dependency locality and information locality, whereby languages favor placing elements together when they are syntactically related or contextually informative about each other. Using data from 80 languages in 17 language families and phylogenetic modeling, we demonstrate that languages evolve to balance these pressures, such that word order change is accompanied by change in the frequency distribution of the syntactic structures which speakers communicate to maintain overall efficiency. Variability in word order thus reflects different ways in which languages resolve these evolutionary pressures. We identify relevant characteristics that result from this joint optimization, particularly the frequency with which subjects and objects are expressed together for the same verb. Our findings suggest that syntactic structure and usage across languages co-adapt to support efficient communication under limited cognitive resources.
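
Dependency locality is often quantified by the total distance between each word and its syntactic head. The sketch below computes that quantity for two orderings of the same English sentence. The head annotations are hand-made assumptions for illustration, not data or code from the paper, and the function name is hypothetical.

```python
def total_dependency_length(heads):
    """Sum of |position - head position| over all non-root words.

    heads[i] is the 1-based position of the head of word i+1, or 0 for the root.
    """
    return sum(
        abs(pos - head) for pos, head in enumerate(heads, start=1) if head != 0
    )


# Hand-annotated heads (assumed for illustration).
# "John threw out the trash": John->threw, threw=root, out->threw,
# the->trash, trash->threw
order_a = [2, 0, 2, 5, 2]
# "John threw the trash out": John->threw, threw=root, the->trash,
# trash->threw, out->threw
order_b = [2, 0, 4, 2, 2]

print(total_dependency_length(order_a))  # 6
print(total_dependency_length(order_b))  # 7
```

Under this measure the first ordering keeps related words slightly closer together, which is the kind of pressure the abstract refers to as dependency locality.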

preprint · 2022 · arXiv

HistBERT: A Pre-trained Language Model for Diachronic Lexical Semantic Analysis

Contextualized word embeddings have demonstrated state-of-the-art performance in various natural language processing tasks including those that concern historical semantic change. However, language models such as BERT were trained primarily on contemporary corpus data. To investigate whether training on historical corpus data improves diachronic semantic analysis, we present a pre-trained BERT-based language model, HistBERT, trained on the balanced Corpus of Historical American English. We examine the effectiveness of our approach by comparing the performance of the original BERT and that of HistBERT, and we report promising results in word similarity and semantic shift analysis. Our work suggests that the effectiveness of contextual embeddings in diachronic semantic analysis is dependent on the temporal profile of the input text and care should be taken in applying this methodology to study historical semantic change.

preprint · 2022 · arXiv

Image Captioning In the Transformer Age

Image Captioning (IC) has achieved astonishing developments by incorporating various techniques into the CNN-RNN encoder-decoder architecture. However, since CNN and RNN do not share the basic network component, such a heterogeneous pipeline is hard to train end-to-end, and the visual encoder learns nothing from the caption supervision. This drawback has inspired researchers to develop a homogeneous architecture that facilitates end-to-end training. The Transformer is well suited to this role: it has proven its huge potential in both vision and language domains and thus can be used as the basic component of the visual encoder and language decoder in an IC pipeline. Meanwhile, self-supervised learning releases the power of the Transformer architecture, so that a large-scale pre-trained model can be generalized to various tasks including IC. The success of these large-scale models seems to weaken the importance of the single IC task. However, we demonstrate that IC still has its specific significance in this age by analyzing the connections between IC and some popular self-supervised learning paradigms. Due to the page limitation, we only refer to highly important papers in this short survey; more related works can be found at https://github.com/SjokerLily/awesome-image-captioning.

preprint · 2022 · arXiv

Improving short-term bike sharing demand forecast through an irregular convolutional neural network

As an important task for the management of bike sharing systems, accurate forecast of travel demand could facilitate dispatch and relocation of bicycles to improve user satisfaction. In recent years, many deep learning algorithms have been introduced to improve bicycle usage forecast. A typical practice is to integrate convolutional (CNN) and recurrent neural network (RNN) to capture spatial-temporal dependency in historical travel demand. For typical CNN, the convolution operation is conducted through a kernel that moves across a "matrix-format" city to extract features over spatially adjacent urban areas. This practice assumes that areas close to each other could provide useful information that improves prediction accuracy. However, bicycle usage in neighboring areas might not always be similar, given spatial variations in built environment characteristics and travel behavior that affect cycling activities. Yet, areas that are far apart can be relatively more similar in temporal usage patterns. To utilize the hidden linkage among these distant urban areas, the study proposes an irregular convolutional Long-Short Term Memory model (IrConv+LSTM) to improve short-term bike sharing demand forecast. The model modifies traditional CNN with irregular convolutional architecture to extract dependency among "semantic neighbors". The proposed model is evaluated with a set of benchmark models in five study sites, which include one dockless bike sharing system in Singapore, and four station-based systems in Chicago, Washington, D.C., New York, and London. We find that IrConv+LSTM outperforms other benchmark models in the five cities. The model also achieves superior performance in areas with varying levels of bicycle usage and during peak periods. The findings suggest that "thinking beyond spatial neighbors" can further improve short-term travel demand prediction of urban bike sharing systems.
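
The notion of "semantic neighbors" can be illustrated by ranking areas by the similarity of their demand time series instead of by spatial adjacency. The sketch below uses Pearson correlation as the similarity measure, which is an assumption; the abstract does not say which measure the paper uses. The function name and the demand numbers are made up for illustration.

```python
import numpy as np


def semantic_neighbors(demand, k):
    """For each area, return the indices of the k other areas whose demand
    time series are most correlated with it (Pearson correlation).

    demand: array of shape (n_areas, n_timesteps).
    """
    corr = np.corrcoef(demand)        # (n_areas, n_areas) similarity matrix
    np.fill_diagonal(corr, -np.inf)   # an area is not its own neighbor
    return np.argsort(-corr, axis=1)[:, :k]


# Synthetic demand for four areas (made-up numbers, for illustration only).
demand = np.array([
    [1.0, 5.0, 2.0, 6.0, 1.0, 5.0],    # area 0: alternating peaks
    [9.0, 2.0, 8.0, 1.0, 9.0, 2.0],    # area 1: opposite pattern
    [2.0, 9.0, 3.0, 11.0, 2.0, 10.0],  # area 2: same shape as area 0, larger scale
    [4.0, 4.5, 5.0, 4.0, 4.5, 5.0],    # area 3: nearly flat
])

neighbors = semantic_neighbors(demand, k=1)
print(neighbors[0], neighbors[2])
```

Areas 0 and 2 select each other because their usage patterns have the same shape, regardless of where they sit on the map. An irregular convolution would then aggregate features over such index sets in place of a fixed spatial window.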

preprint · 2022 · arXiv

Isometries and MacWilliams Extension Property for Weighted Poset Metric

Let $\mathbf{H}$ be the cartesian product of a family of left modules over a ring $S$, indexed by a finite set $Ω$. We are concerned with the $(\mathbf{P},ω)$-weight on $\mathbf{H}$, where $\mathbf{P}=(Ω,\preccurlyeq_{\mathbf{P}})$ is a poset and $ω:Ω\longrightarrow\mathbb{R}^{+}$ is a weight function. We characterize the group of $(\mathbf{P},ω)$-weight isometries of $\mathbf{H}$, and give a canonical decomposition for semi-simple subcodes of $\mathbf{H}$ when $\mathbf{P}$ is hierarchical. We then study the MacWilliams extension property (MEP) for $(\mathbf{P},ω)$-weight. We show that the MEP implies the unique decomposition property (UDP) of $(\mathbf{P},ω)$, which further implies that $\mathbf{P}$ is hierarchical if $ω$ is identically $1$. For the case that either $\mathbf{P}$ is hierarchical or $ω$ is identically $1$, we show that the MEP for $(\mathbf{P},ω)$-weight can be characterized in terms of the MEP for Hamming weight, and give necessary and sufficient conditions for $\mathbf{H}$ to satisfy the MEP for $(\mathbf{P},ω)$-weight when $S$ is an Artinian simple ring (either finite or infinite). When $S$ is a finite field, in the context of $(\mathbf{P},ω)$-weight, we compare the MEP with other coding theoretic properties including the MacWilliams identity, Fourier-reflexivity of partitions and the UDP, and show that the MEP is strictly stronger than all the rest among them.
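
The abstract does not define the $(\mathbf{P},ω)$-weight. The sketch below assumes the definition commonly used for weighted poset metrics: the weight of a vector is the sum of $ω(i)$ over the ideal (down-closed set) generated by its support. The data layout and function names are illustrative, and vector entries are plain integers standing in for module elements.

```python
def ideal(support, below):
    """Smallest down-closed set containing `support`.

    below[i] must hold ALL elements strictly smaller than i in the poset
    (that is, the relation is already transitively closed).
    """
    closed = set(support)
    for i in support:
        closed |= below[i]
    return closed


def poset_weight(vector, below, omega):
    """(P, omega)-weight under the assumed definition: total omega over the
    ideal generated by the support of `vector`."""
    support = {i for i, entry in enumerate(vector) if entry != 0}
    return sum(omega[i] for i in ideal(support, below))


# Chain 0 < 1 < 2 with a non-constant weight function (example values).
chain_below = {0: set(), 1: {0}, 2: {0, 1}}
omega = {0: 1.0, 1: 2.0, 2: 0.5}
print(poset_weight((0, 0, 1), chain_below, omega))  # 3.5
print(poset_weight((1, 0, 0), chain_below, omega))  # 1.0

# Antichain with omega identically 1: the weight reduces to Hamming weight.
antichain_below = {0: set(), 1: set(), 2: set()}
ones = {0: 1, 1: 1, 2: 1}
print(poset_weight((1, 0, 1), antichain_below, ones))  # 2
```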

preprint · 2022 · arXiv

K-Detector: Identifying Duplicate Crash Failures in Large-Scale Software Delivery

After a developer submits code, the corresponding test cases are run to ensure the quality of software delivery. Test failures such as crashes, errors, and timeouts occur during this period. Since it takes time for developers to resolve them, many duplicate failures will happen in the meantime. In the delivery practice of SAP HANA, crash triage is considered the most time-consuming task. If duplicate crash failures can be automatically identified, the degree of automation will be significantly enhanced. To find such duplicates, we propose a training-based mathematical model that utilizes component information of SAP HANA to achieve better crash similarity comparison. We implement our approach in a tool named Knowledge-based Detector (K-Detector), which is verified on 11,208 samples and achieves an AUC of 0.986. Furthermore, we have deployed K-Detector to the production environment, where, according to our statistics, it saves 97% of the human effort in crash triage.

preprint · 2022 · arXiv

KE-QI: A Knowledge Enhanced Article Quality Identification Dataset

With so many articles of varying quality being produced every moment, it is an urgent task to screen outstanding articles and commit them to social media. To the best of our knowledge, there is a lack of datasets and mature research works in identifying high-quality articles. Consequently, we conduct some surveys and finalize 7 objective indicators to annotate the quality of 10k articles. During annotation, we find that many characteristics of high-quality articles (e.g., background) rely more on extensive external knowledge than inner semantic information of articles. In response, we link extracted article entities to Baidu Encyclopedia, then propose the Knowledge Enhanced article Quality Identification (KE-QI) dataset. To make better use of external knowledge, we propose a compound model which fuses the text and external knowledge information via a gate unit to classify the quality of an article. Our experimental results on KE-QI show that with initialization of our pre-trained Node2Vec model, our model achieves about 78% $F_1$, outperforming other baselines.

preprint · 2022 · arXiv

LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Pre-training of text and layout has proved effective in a variety of visually-rich document understanding tasks due to its effective model architecture and the advantage of large-scale unlabeled scanned/digital-born documents. We propose the LayoutLMv2 architecture with new pre-training tasks to model the interaction among text, layout, and image in a single multi-modal framework. Specifically, with a two-stream multi-modal Transformer encoder, LayoutLMv2 uses not only the existing masked visual-language modeling task but also the new text-image alignment and text-image matching tasks, which help it better capture the cross-modality interaction in the pre-training stage. Meanwhile, it also integrates a spatial-aware self-attention mechanism into the Transformer architecture so that the model can fully understand the relative positional relationship among different text blocks. Experiment results show that LayoutLMv2 outperforms LayoutLM by a large margin and achieves new state-of-the-art results on a wide variety of downstream visually-rich document understanding tasks, including FUNSD (0.7895 $\to$ 0.8420), CORD (0.9493 $\to$ 0.9601), SROIE (0.9524 $\to$ 0.9781), Kleister-NDA (0.8340 $\to$ 0.8520), RVL-CDIP (0.9443 $\to$ 0.9564), and DocVQA (0.7295 $\to$ 0.8672). We made our model and code publicly available at https://aka.ms/layoutlmv2.
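
The reported score pairs are easier to compare as absolute gains. The short computation below uses only the numbers quoted in the abstract; "before" and "after" refer to the two sides of each arrow there, and the variable names are illustrative.

```python
# (before, after) score pairs exactly as quoted in the abstract.
scores = {
    "FUNSD": (0.7895, 0.8420),
    "CORD": (0.9493, 0.9601),
    "SROIE": (0.9524, 0.9781),
    "Kleister-NDA": (0.8340, 0.8520),
    "RVL-CDIP": (0.9443, 0.9564),
    "DocVQA": (0.7295, 0.8672),
}

# Absolute improvement per benchmark, rounded to four decimals.
gains = {name: round(after - before, 4) for name, (before, after) in scores.items()}

for name, gain in sorted(gains.items(), key=lambda item: item[1], reverse=True):
    print(f"{name:<13} +{gain:.4f}")
```

By this measure the largest absolute gain is on DocVQA (+0.1377) and the smallest on CORD (+0.0108).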

preprint · 2022 · arXiv

ML4CO-KIDA: Knowledge Inheritance in Dataset Aggregation

The Machine Learning for Combinatorial Optimization (ML4CO) NeurIPS 2021 competition aims to improve state-of-the-art combinatorial optimization solvers by replacing key heuristic components with machine learning models. On the dual task, we design models to make branching decisions that make the dual bound increase faster. We propose a knowledge inheritance method to generalize knowledge of different models from the dataset aggregation process, named KIDA. Our improvement overcomes some defects of the baseline graph-neural-networks-based methods. Further, we won 1st place on the dual task. We hope this report can provide useful experience for developers and researchers. The code is available at https://github.com/megvii-research/NeurIPS2021-ML4CO-KIDA.

preprint · 2022 · arXiv

Neural reality of argument structure constructions

In lexicalist linguistic theories, argument structure is assumed to be predictable from the meaning of verbs. As a result, the verb is the primary determinant of the meaning of a clause. In contrast, construction grammarians propose that argument structure is encoded in constructions (or form-meaning pairs) that are distinct from verbs. Decades of psycholinguistic research have produced substantial empirical evidence in favor of the construction view. Here we adapt several psycholinguistic studies to probe for the existence of argument structure constructions (ASCs) in Transformer-based language models (LMs). First, using a sentence sorting experiment, we find that sentences sharing the same construction are closer in embedding space than sentences sharing the same verb. Furthermore, LMs increasingly prefer grouping by construction with more input data, mirroring the behaviour of non-native language learners. Second, in a "Jabberwocky" priming-based experiment, we find that LMs associate ASCs with meaning, even in semantically nonsensical sentences. Our work offers the first evidence for ASCs in LMs and highlights the potential to devise novel probing methods grounded in psycholinguistic research.

preprint · 2022 · arXiv

Noun2Verb: Probabilistic frame semantics for word class conversion

Humans can flexibly extend word usages across different grammatical classes, a phenomenon known as word class conversion. Noun-to-verb conversion, or denominal verb (e.g., to Google a cheap flight), is one of the most prevalent forms of word class conversion. However, existing natural language processing systems are impoverished in interpreting and generating novel denominal verb usages. Previous work has suggested that novel denominal verb usages are comprehensible if the listener can compute the intended meaning based on shared knowledge with the speaker. Here we explore a computational formalism for this proposal couched in frame semantics. We present a formal framework, Noun2Verb, that simulates the production and comprehension of novel denominal verb usages by modeling shared knowledge of speaker and listener in semantic frames. We evaluate an incremental set of probabilistic models that learn to interpret and generate novel denominal verb usages via paraphrasing. We show that a model where the speaker and listener cooperatively learn the joint distribution over semantic frame elements better explains the empirical denominal verb usages than state-of-the-art language models, evaluated against data from 1) contemporary English in both adult and child speech, 2) contemporary Mandarin Chinese, and 3) the historical development of English. Our work grounds word class conversion in probabilistic frame semantics and bridges the gap between natural language processing systems and humans in lexical creativity.

preprint · 2022 · arXiv

Optimization of rule-based energy management strategies for hybrid vehicles using dynamic programming

Reducing energy consumption is a key focus for hybrid electric vehicle (HEV) development. The popular vehicle dynamic model used in many energy management optimization studies does not capture the vehicle dynamics that the in-vehicle measurement system does. However, feedback from the measurement system is what the vehicle controller actually uses to manage energy consumption. Therefore, optimization based solely on the model does not represent what the vehicle controller sees in the vehicle. This paper reports the utility factor-weighted energy consumption using a rule-based strategy under a real-world representative drive cycle. In addition, vehicle test data were used to carry out the optimization. By comparing results from both rule-based and optimization-based strategies, the areas for further improving the rule-based strategy are discussed. Furthermore, the recent development of on-board diagnostics (OBD) raises a concern about increased energy consumption. This paper investigates the energy consumption increase with extensive OBD usage.

preprint · 2022 · arXiv

Quantile Off-Policy Evaluation via Deep Conditional Generative Learning

Off-Policy evaluation (OPE) is concerned with evaluating a new target policy using offline data generated by a potentially different behavior policy. It is critical in a number of sequential decision making problems ranging from healthcare to technology industries. Most of the work in existing literature is focused on evaluating the mean outcome of a given policy, and ignores the variability of the outcome. However, in a variety of applications, criteria other than the mean may be more sensible. For example, when the reward distribution is skewed and asymmetric, quantile-based metrics are often preferred for their robustness. In this paper, we propose a doubly-robust inference procedure for quantile OPE in sequential decision making and study its asymptotic properties. In particular, we propose utilizing state-of-the-art deep conditional generative learning methods to handle parameter-dependent nuisance function estimation. We demonstrate the advantages of this proposed estimator through both simulations and a real-world dataset from a short-video platform. In particular, we find that our proposed estimator outperforms classical OPE estimators for the mean in settings with heavy-tailed reward distributions.
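
The robustness argument for quantile-based metrics can be seen on a tiny example. This is not an off-policy estimator and has nothing to do with the paper's procedure; it only shows, on made-up reward values, how a single heavy-tailed observation moves the mean far more than the median.

```python
import statistics

# Made-up rewards for illustration; the last list adds one extreme outcome.
rewards = [1.0, 1.2, 0.9, 1.1, 1.0, 0.8, 1.3]
with_outlier = rewards + [50.0]

mean_shift = statistics.mean(with_outlier) - statistics.mean(rewards)
median_shift = statistics.median(with_outlier) - statistics.median(rewards)

print(f"mean shift:   {mean_shift:.3f}")
print(f"median shift: {median_shift:.3f}")
```

The mean moves by roughly 6 reward units while the median moves by about 0.05, which is why a skewed or heavy-tailed reward distribution can make the mean a misleading summary of a policy's typical outcome.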

preprint · 2022 · arXiv

Reflexivity of Partitions Induced by Weighted Poset Metric and Combinatorial Metric

Let $\mathbf{H}$ be the Cartesian product of a family of finite abelian groups. Via a polynomial approach, we give sufficient conditions for a partition of $\mathbf{H}$ induced by weighted poset metric to be reflexive, which also become necessary for some special cases. Moreover, by examining the roots of the Krawtchouk polynomials, we establish non-reflexive partitions of $\mathbf{H}$ induced by combinatorial metric. When $\mathbf{H}$ is a vector space over a finite field $\mathbb{F}$, we consider the property of admitting MacWilliams identity (PAMI) and the MacWilliams extension property (MEP) for partitions of $\mathbf{H}$. With some invariance assumptions, we show that two partitions of $\mathbf{H}$ admit MacWilliams identity if and only if they are mutually dual and reflexive, and any partition of $\mathbf{H}$ satisfying the MEP is in fact an orbit partition induced by some subgroup of $\mathrm{Aut}_{\mathbb{F}}(\mathbf{H})$, which is necessarily reflexive. As an application of the aforementioned results, we establish partitions of $\mathbf{H}$ induced by combinatorial metric that do not satisfy the MEP, which further enable us to provide counter-examples to a conjecture proposed by Pinheiro, Machado and Firer in [39].

preprint · 2022 · arXiv

Revisiting the Persson theory of elastoplastic contact: A simpler closed-form solution and a rigorous proof of boundary conditions

Persson's theory of contact is extensively used in the study of the purely normal interaction between a nominally flat rough surface and a rigid flat. In the literature, Persson's theory was successfully applied to the elastoplastic contact problem with a scale-independent hardness $H$. However, it yields a closed-form solution, $P(p, ξ)$, in terms of an infinite sum of sines. In this study, $P(p, ξ)$ is found to have a simpler form which is a superposition of three Gaussian functions. A rigorous proof of the boundary condition $P(p=0, ξ)=P(p=H, ξ) = 0$ is given based on the new solution.

preprint · 2022 · arXiv

Robust Inertial-aided Underwater Localization based on Imaging Sonar Keyframes

This article focuses on feature-based underwater localization and navigation for autonomous underwater vehicles (AUVs) using 2D imaging sonar measurements. The sparsity of underwater acoustic features and the loss of elevation angle in sonar images may introduce wrong feature matches or insufficient features for optimization-based underwater localization (i.e. under-constrained/degeneracy cases). This motivates us to propose a novel inertial-aided sliding window optimization framework to improve the estimation accuracy and the robustness to front-end outliers. Concretely, we first discriminate under-constrained and well-constrained sonar frames and define sonar keyframes (SKFs) based on the Jacobian matrix derived from odometry and sonar measurements. To make the most of past well-constrained SKFs, we design a size-adjustable windowed back-end optimization scheme based on singular values. We also prove that the landmark triangulation failure (navigation problem) caused by sonar motion can be solved in 2D scenes. Comparative simulation and evaluation on a public dataset show that the proposed method outperforms the existing ones in pose estimation and robustness even without loop closure, and also ensures real-time performance for online applications.

preprint · 2022 · arXiv

The Machine Learning for Combinatorial Optimization Competition (ML4CO): Results and Insights

Combinatorial optimization is a well-established area in operations research and computer science. Until recently, its methods have focused on solving problem instances in isolation, ignoring that they often stem from related data distributions in practice. However, recent years have seen a surge of interest in using machine learning as a new approach for solving combinatorial problems, either directly as solvers or by enhancing exact solvers. In this context, the ML4CO competition aims to improve state-of-the-art combinatorial optimization solvers by replacing key heuristic components. The competition featured three challenging tasks: finding the best feasible solution, producing the tightest optimality certificate, and giving an appropriate solver configuration. Three realistic datasets were considered: balanced item placement, workload apportionment, and maritime inventory routing. This last dataset was kept anonymous for the contestants.

preprint2022arXiv

Ultrafast disinfection of SARS-CoV-2 viruses

The wide use of surgical masks has been proven effective for mitigating the spread of respiratory diseases, such as COVID-19, alongside social distancing, vaccines, and other efforts. Newly reported variants, such as Delta and Omicron, have been found to spread faster than the initial strains, and people might get infected even by inhaling a lower viral load. More frequent sterilization of surgical masks is needed to protect the wearers. However, it is challenging to sterilize commodity surgical masks with a fast and effective method. Herein, we report the inactivation of SARS-CoV-2 viruses within an ultra-short time while retaining the mask performance. A silver thin film is coated on commercial polyimide film by physical vapor deposition and patterned by laser scribing to form a Joule heating electrode. A gold thin film is coated onto the opposite side of the device to promote the uniformity of the Joule heating through nano-heat transfer regulation. As a result, the surgical mask can be heated to the inactivation temperature within a short time and with high uniformity. Joule-heating the surgical mask at 90 °C for 3 minutes inactivated SARS-CoV-2 with an efficacy of 99.89%. Normal commodity surgical masks can thus be sterilized faster, more frequently, and more efficiently against SARS-CoV-2 viruses and the new variants.

preprint2022arXiv

Ultrasensitive refractive index sensor with rotatory biased weak measurement

A modified weak measurement scheme, rotatory biased weak measurement, is proposed to significantly improve the sensitivity and resolution of the refractive index sensor on a total reflection structure. This method introduces an additional phase in the post-selected procedure and generates an extinction point in the spectrum distribution. The biased post-selection makes smaller coupling strength available, which leads to an enhancement of phase sensitivity and refractive index sensitivity. In rotatory biased weak measurement, we achieve an enhanced refractive index sensitivity of 13605 nm/RIU compared to 1644 nm/RIU in standard weak measurement. The performance of sensors with different sensitivity is analyzed, and we find the optimal refractive index resolution of sensors increases with sensitivity. In this work, we demonstrate an optimal refractive index resolution of $4\times10^{-7}$ RIU on a total reflection structure. The rabbit anti-mouse IgG and mouse IgG binding reaction experiments demonstrate that our system has a high response to the concentration of IgG in a wide range and the limit of detection is 15 ng/mL. The improvements in this work are helpful to the optimizations of other optical sensors with weak measurement.

preprint2021arXiv

Magnetotransport of dirty-limit van Hove singularity quasiparticles

Tuning of electronic density-of-states singularities is a common route to unconventional metal physics. Conceptually, van Hove singularities are realized only in clean two-dimensional systems. Little attention has therefore been given to the disordered (dirty) limit. Here, we provide a magnetotransport study of the dirty metamagnetic system calcium-doped strontium ruthenate. Fermi liquid properties persist across the metamagnetic transition, but with an unusually strong variation of the Kadowaki-Woods ratio. This is revealed by a strong decoupling of inelastic electron scattering and electronic mass inferred from density-of-state probes. We discuss this Fermi liquid behavior in terms of a magnetic field tunable van Hove singularity in the presence of disorder. More generally, we show how dimensionality and disorder control the fate of transport properties across metamagnetic transitions.

preprint2020arXiv

A Computational Investigation on Denominalization

Language is a dynamic system, and word meanings have always changed over time. Every time a novel concept or sense is introduced, we need to assign a word to express it. Some changes also happen because their result is more desirable for humans, or cognitively easier for humans to use. Finding the patterns of these changes is interesting and can reveal facts about human cognitive evolution. Because enough resources are now available for studying this problem, computational modeling is a natural approach: it makes the work easier and allows it to be carried out at large scale. In this work, we study nouns that came to be used as verbs some years after their emergence as nouns, and look for commonalities among these nouns. In other words, we are interested in finding which potential requirements are essential for this change.

preprint2020arXiv

A Real-time Automatic Validation System for Optical Transients detected by GWAC

The ground-based wide-angle camera array (GWAC) generates millions of single-frame alerts per night. After elaborate filtering by multiple methods, a couple of dozen candidates still need to be confirmed by follow-up observations in real time. In order to free scientists from the complex and high-intensity follow-up tasks, we developed a Real-time Automatic transient Validation System (RAVS), and introduce here its system architecture, data processing flow, database schema, automatic follow-up control flow, and mobile message notification solution. This system is capable of automatically carrying out all operations in real time without human intervention, including the validation of transient candidates, the adaptive multi-band light-curve sampling for identified targets, and the pushing of observation results to the mobile client. The running of RAVS shows that an M-type stellar flare event can be well sampled by RAVS without a significant loss of detail, while the observing time is less than one-third of the time coverage. Because the control logic of RAVS is designed to be independent of the telescope hardware, RAVS can be conveniently transplanted to other telescopes, especially the follow-up system of SVOM. Some future improvements are presented for the adaptive light-curve sampling, taking into account both the brightness of sources and the evolution trends of the corresponding light curves.

preprint2020arXiv

An algorithm of selection of meteor candidates in GWAC system

With its large field of view, GWAC can record hundreds of meteors every day. These meteors are valuable to meteor research groups, so it is important to find them accurately. To address the challenge of precisely distinguishing meteors from other elongated objects in a GWAC-like sky survey system, we design and implement a meteor candidate recognition algorithm, which includes the recognition of meteor candidates and a morphology analysis of their light curves. Although the algorithm may filter out some real meteors, it provides a sample of meteors with high confidence. After processing two months of Mini-GWAC images, we detect 109,000 elongated objects, of which more than 90 percent are not meteors. Among the elongated objects, about 5.9% are identified as meteors with high confidence after applying filters that require the object to exist in a single frame, to show a single peak in its light curve, and to show a slow variation of its light curve.
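The three filters named in the abstract can be sketched as follows. This is only an illustration under stated assumptions: the function names, the representation of a light curve as a list of brightness samples along the trail, and the `max_step` threshold are hypothetical and are not taken from the GWAC pipeline.

```python
from typing import Sequence


def count_peaks(profile: Sequence[float]) -> int:
    """Count strict local maxima in the interior of a brightness profile."""
    return sum(
        1
        for i in range(1, len(profile) - 1)
        if profile[i - 1] < profile[i] > profile[i + 1]
    )


def max_relative_step(profile: Sequence[float]) -> float:
    """Largest jump between neighbouring samples, relative to the peak value."""
    peak = max(profile)
    return max(abs(b - a) for a, b in zip(profile, profile[1:])) / peak


def is_meteor_candidate(
    n_frames: int, profile: Sequence[float], max_step: float = 0.5
) -> bool:
    """Apply the three filters: single frame, single peak, slow variation.

    max_step is a hypothetical threshold chosen for this sketch only.
    """
    return (
        n_frames == 1
        and count_peaks(profile) == 1
        and max_relative_step(profile) <= max_step
    )


smooth_trail = [1, 2, 4, 6, 7, 6, 4, 2, 1]   # one broad peak, gentle slopes
double_peak = [1, 5, 2, 6, 1]                 # two peaks, e.g. a blinking object
sharp_flash = [1, 1, 9, 1, 1]                 # one peak, but an abrupt jump
```

Only `smooth_trail` seen in a single frame passes all three filters; the same profile seen in several frames, `double_peak` and `sharp_flash` are each rejected by one filter.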

preprint2020arXiv

Application of Pre-training Models in Named Entity Recognition

Named Entity Recognition (NER) is a fundamental Natural Language Processing (NLP) task to extract entities from unstructured data. Previous methods for NER were based on machine learning or deep learning. Recently, pre-training models have significantly improved performance on multiple NLP tasks. In this paper, we first introduce the architecture and pre-training tasks of four common pre-training models: BERT, ERNIE, ERNIE2.0-tiny, and RoBERTa. Then, we apply these pre-training models to a NER task by fine-tuning, and compare the effects of the different model architectures and pre-training tasks on the NER task. The experimental results show that RoBERTa achieved state-of-the-art results on the MSRA-2006 dataset.

preprint2020arXiv

Contextualized moral inference

Developing moral awareness in intelligent systems has shifted from a topic of philosophical inquiry to a critical and practical issue in artificial intelligence over the past decades. However, automated inference of everyday moral situations remains an under-explored problem. We present a text-based approach that predicts people's intuitive judgment of moral vignettes. Our methodology builds on recent work in contextualized language models and textual inference of moral sentiment. We show that a contextualized representation offers a substantial advantage over alternative representations based on word embeddings and emotion sentiment in inferring human moral judgment, evaluated and reflected in three independent datasets from moral psychology. We discuss the promise and limitations of our approach toward automated textual moral reasoning.

preprint2020arXiv

Corpus of Chinese Dynastic Histories: Gender Analysis over Two Millennia

Chinese dynastic histories form a large continuous linguistic space of approximately 2000 years, from the 3rd century BCE to the 18th century CE. The histories are documented in Classical (Literary) Chinese in a corpus of over 20 million characters, suitable for the computational analysis of historical lexicon and semantic change. However, there is no freely available open-source corpus of these histories, making Classical Chinese low-resource. This project introduces a new open-source corpus of twenty-four dynastic histories covered by a Creative Commons license. An original list of Classical Chinese gender-specific terms was developed as a case study for analyzing the historical linguistic use of male and female terms. The study demonstrates considerable stability in the usage of these terms, with dominance of male terms. Exploration of word meanings uses keyword analysis of focus corpora created for gender-specific terms. This method yields meaningful semantic representations that can be used for future studies of diachronic semantics.

preprint2020arXiv

Gate field effects on the topological insulator BiSbTeSe2 interface

Interfaces between two topological insulators are of fundamental interest in condensed matter physics. Inspired by experimental efforts, we study interfacial processes between two slabs of BiSbTeSe2 (BSTS) via first principles calculations. Topological surface states are absent for the BSTS interface at its equilibrium separation, but our calculations show that they appear if the inter-slab distance is greater than 6 Å. More importantly, we find that topological interface states can be preserved by inserting two or more layers of hexagonal boron nitride between the two BSTS slabs. In experiments, the electric current tunneling through the interface is insensitive to back gate voltage when the bias voltage is small. Using a first-principles based method that allows us to simulate gate field, we show that at low bias the extra charge induced by a gate voltage resides on the surface that is closest to the gate electrode, leaving the interface almost undoped. This provides clues to understand the origin of the observed insensitivity of transport properties to back gate voltage at low bias. Our study resolves a few questions raised in experiment, which does not yet offer a clear correlation between microscopic physics and transport data. We provide a road map for the design of vertical tunneling junctions involving the interface between two topological insulators.

preprint2020arXiv

Improving probability selecting based weights for Satisfiability Problem

The Boolean Satisfiability problem (SAT) is important to the artificial intelligence community, and solving it has an impact on many complex problems. Recently, great breakthroughs have been made on stochastic local search (SLS) algorithms for uniform random k-SAT, resulting in several state-of-the-art SLS algorithms (Score2SAT, YalSAT, ProbSAT, CScoreSAT), and on a hybrid algorithm for hard random SAT (HRS), resulting in one state-of-the-art hybrid algorithm, SparrowToRiss. However, there is no algorithm that can effectively solve both uniform random k-SAT and HRS. In this paper, we present a new SLS algorithm named SelectNTS for uniform random k-SAT and HRS. SelectNTS is an improved probability-selecting-based local search algorithm for the SAT problem. The core of SelectNTS relies on new clause and variable selection heuristics. The new clause selection heuristic uses a new clause weighting scheme and a biased random walk. The new variable selection heuristic uses a probability selecting strategy with a variation of the CC strategy based on a new variable weighting scheme. Extensive experimental results on the well-known random benchmark instances from the SAT Competitions in 2017 and 2018, and on randomly generated problems, show that our algorithm outperforms state-of-the-art random SAT algorithms, and that SelectNTS can effectively solve both uniform random k-SAT and HRS.

preprint2020arXiv

Multi-Feature Discrete Collaborative Filtering for Fast Cold-start Recommendation

Hashing is an effective technique to address the large-scale recommendation problem, due to its high computation and storage efficiency in calculating user preferences on items. However, existing hashing-based recommendation methods still suffer from two important problems: 1) Their recommendation process mainly relies on the user-item interactions and a single specific content feature. When the interaction history or the content feature is unavailable (the cold-start problem), their performance deteriorates seriously. 2) Existing methods learn the hash codes with relaxed optimization or adopt discrete coordinate descent to directly solve for binary hash codes, which results in significant quantization loss or consumes considerable computation time. In this paper, we propose a fast cold-start recommendation method, called Multi-Feature Discrete Collaborative Filtering (MFDCF), to solve these problems. Specifically, a low-rank self-weighted multi-feature fusion module is designed to adaptively project the multiple content features into binary yet informative hash codes by fully exploiting their complementarity. Additionally, we develop a fast discrete optimization algorithm to directly compute the binary hash codes with simple operations. Experiments on two public recommendation datasets demonstrate that MFDCF outperforms state-of-the-art methods in various aspects.
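The efficiency argument for hash codes can be illustrated with a minimal sketch. It assumes that user and item codes are stored as integers whose bits stand for ±1 entries; the function names and the sample codes are hypothetical and are not taken from MFDCF. For two such codes of length r, the inner product equals r minus twice the Hamming distance, so scoring needs only an XOR and a bit count.

```python
from typing import Dict, List


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which two codes differ."""
    return bin(a ^ b).count("1")


def preference_score(user_code: int, item_code: int, n_bits: int) -> int:
    """Inner product of the two +/-1 codes, computed from the Hamming distance."""
    return n_bits - 2 * hamming(user_code, item_code)


def rank_items(user_code: int, item_codes: Dict[str, int], n_bits: int) -> List[str]:
    """Return item names sorted from highest to lowest preference score."""
    return sorted(
        item_codes,
        key=lambda name: preference_score(user_code, item_codes[name], n_bits),
        reverse=True,
    )


# Hypothetical 4-bit codes, for illustration only.
user = 0b1011
items = {"a": 0b1011, "b": 0b0100, "c": 0b1010}
ranking = rank_items(user, items, n_bits=4)
```

Item `a` matches the user code exactly, `c` differs in one bit and `b` differs in all four, so the ranking follows that order.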

preprint2020arXiv

Text-based inference of moral sentiment change

We present a text-based framework for investigating moral sentiment change of the public via longitudinal corpora. Our framework is based on the premise that language use can inform people's moral perception toward right or wrong, and we build our methodology by exploring moral biases learned from diachronic word embeddings. We demonstrate how a parameter-free model supports inference of historical shifts in moral sentiment toward concepts such as slavery and democracy over centuries at three incremental levels: moral relevance, moral polarity, and fine-grained moral dimensions. We apply this methodology to visualizing moral time courses of individual concepts and analyzing the relations between psycholinguistic variables and rates of moral sentiment change at scale. Our work offers opportunities for applying natural language processing toward characterizing moral sentiment change in society.

preprint2020arXiv

The darkweb: a social network anomaly

We analyse the darkweb and find its structure is unusual. For example, $\sim 87\%$ of darkweb sites never link to another site. To call the darkweb a "web" is thus a misnomer; it is better described as a set of largely isolated dark silos. As we show through a detailed comparison to the World Wide Web (www), this siloed structure is highly dissimilar to other social networks and indicates the social behavior of darkweb users is much different to that of www users. We show a generalized preferential attachment model can partially explain the strange topology of the darkweb, but an understanding of the anomalous behavior of its users remains out of reach. Our results are relevant to network scientists, social scientists, and other researchers interested in the social interactions of large numbers of agents.

preprint2020arXiv

The extended Gaia-PS1-SDSS (GPS1+) proper motion catalog

The GPS1 catalog was released in 2017. It delivered precise proper motions for around 350 million sources across three-fourths of the sky down to a magnitude of $r\sim20$ mag. In this study, we present GPS1+, the extension of the GPS1 catalog down to $r\sim22.5$ mag, based on Gaia DR2, PS1, SDSS and 2MASS astrometry. In total, GPS1+ provides proper motions for $\sim$400 million sources with a characteristic systematic error of less than 0.1 mas yr$^{-1}$. The catalog is divided into two sub-samples, the primary and secondary parts. The primary $\sim$264 million sources have Gaia astrometry, SDSS astrometry, or both, with a typical precision of 2.0-5.0 mas yr$^{-1}$. In this part, $\sim$160 million sources have Gaia proper motions, and we provide a new proper motion for each of them by building a Bayesian model; relative to Gaia's values, the precision is improved by $\sim$0.1 dex on average at the faint end. A further $\sim$50 million sources are objects whose proper motions are missing in Gaia DR2, and we provide their proper motions with a precision of $\sim$4.5 mas yr$^{-1}$. The remaining $\sim$54 million faint sources are beyond Gaia's detection capability, and we provide their proper motions for the first time with a precision of 7.0 mas yr$^{-1}$. The secondary $\sim$136 million sources, however, have only PS1 astrometry, and their average precision is worse than 15.0 mas yr$^{-1}$. All the proper motions have been validated using QSOs and the existing Gaia proper motions. The catalog will be released online and will be available via the VO-TAP Service, or via the National Astronomical Data Center serviced by China-VO: https://nadc.china-vo.org/data/data/gps1p/f.

preprint2020arXiv

The Typology of Polysemy: A Multilingual Distributional Framework

Lexical semantic typology has identified important cross-linguistic generalizations about the variation and commonalities in polysemy patterns, that is, how languages package up meanings into words. Recent computational research has enabled investigation of lexical semantics at a much larger scale, but little work has explored lexical typology across semantic domains, nor the factors that influence cross-linguistic similarities. We present a novel computational framework that quantifies semantic affinity, the cross-linguistic similarity of lexical semantics for a concept. Our approach defines a common multilingual semantic space that enables a direct comparison of the lexical expression of concepts across languages. We validate our framework against empirical findings on lexical semantic typology at both the concept and domain levels. Our results reveal an intricate interaction between semantic domains and extra-linguistic factors, beyond language phylogeny, that co-shape the typology of polysemy across languages.

preprint2020arXiv

To schedule or not to schedule: when no-scheduling can beat the best-known flow scheduling algorithm in datacenter networks

Conventional wisdom for minimizing the average flow completion time (AFCT) in the datacenter network (DCN), where flow sizes are highly variable, would suggest scheduling every individual flow. However, we show that, once scheduling delay (including the scheduler's computational and communication delays) is considered, serving most of the flows without any scheduling, in a simple first-come-first-served (FCFS) manner, significantly improves their performance even compared to shortest remaining processing time (SRPT), which is known to be the optimal algorithm when scheduling delay is zero. To do so, we only require two coarse classes of flows categorized by flow size (1st-class flows are smaller than a threshold, H, and 2nd-class flows are all others), with 1st-class flows always served before 2nd-class ones. To show this, we take the SRPT scheduling algorithm accompanied by global knowledge of flows, formulate the impact of scheduling delay on its performance, and prove that for any flow size distribution and network load (<1), there is always a threshold, H, which guarantees that 1st-class flows achieve lower AFCT under FCFS than under SRPT. Our numerically calculated results and extensive flow-level simulations show that, on average, more than 90% of flows could be in the 1st class and consequently do not require any scheduling.

preprint2020arXiv

Word class flexibility: A deep contextualized approach

Word class flexibility refers to the phenomenon whereby a single word form is used across different grammatical categories. Extensive work in linguistic typology has sought to characterize word class flexibility across languages, but quantifying this phenomenon accurately and at scale has been fraught with difficulties. We propose a principled methodology to explore regularity in word class flexibility. Our method builds on recent work in contextualized word embeddings to quantify semantic shift between word classes (e.g., noun-to-verb, verb-to-noun), and we apply this method to 37 languages. We find that contextualized embeddings not only capture human judgment of class variation within words in English, but also uncover shared tendencies in class flexibility across languages. Specifically, we find greater semantic variation when flexible lemmas are used in their dominant word class, supporting the view that word class flexibility is a directional process. Our work highlights the utility of deep contextualized models in linguistic typology.

preprint2019arXiv

Fourier-based Rotation-invariant Feature Boosting: An Efficient Framework for Geospatial Object Detection

Geospatial object detection in remote sensing imagery has been attracting increasing interest in recent years, due to the rapid development of spaceborne imaging. Most previously proposed object detectors are very sensitive to object deformations, such as scaling and rotation. To address this, we propose in this letter a novel and efficient framework for geospatial object detection, called Fourier-based rotation-invariant feature boosting (FRIFB). A Fourier-based rotation-invariant feature is first generated in polar coordinates. Then, the extracted features are further structurally refined using aggregate channel features. This leads to faster feature computation and a more robust feature representation, which is well suited to the subsequent boosting learning. Finally, in the test phase, we achieve fast pyramid feature extraction by estimating a scale factor instead of directly collecting all features from the image pyramid. Extensive experiments are conducted on two subsets of the NWPU VHR-10 dataset, demonstrating the superiority and effectiveness of FRIFB compared to previous state-of-the-art methods.

preprint2019arXiv

In-network Congestion-aware Load Balancing at Transport Layer

Load balancing at the transport layer is an important function in data centers, content delivery networks, and mobile networks, where per-connection consistency (PCC) has to be met for optimal performance. Cloud-native L4 load balancers are commonly deployed as virtual network functions (VNFs) and are a critical forwarding element in modern cloud infrastructure. We identify load imbalance among service instances as the main cause of additional processing delay caused by transport-layer load balancers. Existing transport-layer load balancers rely on one of two methods: host-level traffic redirection, which may add as much as 12.48% additional traffic to underlying networks, or connection tracking, which consumes a considerable amount of memory in load balancers. Both of these methods result in inefficient usage of network resources. We propose the in-network congestion-aware load balancer (INCAB) to achieve even load distribution among service instances and optimal network resource usage in addition to meeting the PCC requirement. We show that INCAB is capable of identifying and monitoring each instance's most-utilized resource and can improve the load distribution among all service instances. INCAB utilizes a Bloom filter and an ultra-compact connection table for in-network flow distribution. Furthermore, it does not rely on end hosts for traffic redirection. Our flow-level simulations show that INCAB improves flows' average completion time by 31.97% compared to stateless solutions.
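For readers unfamiliar with the Bloom filter mentioned above, the following is a generic textbook sketch and not INCAB's data structure. The class name, the bit-array size, the number of hash functions and the flow-key string format are all assumptions made for illustration.

```python
import hashlib
from typing import Iterator


class BloomFilter:
    """Generic Bloom filter: no false negatives, tunable false-positive rate."""

    def __init__(self, n_bits: int = 1024, n_hashes: int = 3) -> None:
        self.n_bits = n_bits
        self.n_hashes = n_hashes
        self.bits = bytearray((n_bits + 7) // 8)

    def _positions(self, key: str) -> Iterator[int]:
        # Derive n_hashes bit positions by salting SHA-256 with the hash index.
        for i in range(self.n_hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.n_bits

    def add(self, key: str) -> None:
        for p in self._positions(key):
            self.bits[p // 8] |= 1 << (p % 8)

    def __contains__(self, key: str) -> bool:
        return all(self.bits[p // 8] & (1 << (p % 8)) for p in self._positions(key))


# Hypothetical flow key (source, destination, protocol) used only as an example.
seen_flows = BloomFilter()
flow = "10.0.0.1:5000->10.0.0.2:80/tcp"
seen_flows.add(flow)
```

After `add`, a membership test for the same key always succeeds. A test for a key that was never added can occasionally succeed as well, which is the false-positive cost paid for the compact representation.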

preprint2016arXiv

Quantum transport of two-species Dirac fermions in dual-gated three-dimensional topological insulators

Topological insulators are a novel class of quantum matter with a gapped insulating bulk yet gapless spin helical Dirac fermion conducting surface states. Here, we report local and non-local electrical and magneto transport measurements in dual-gated BiSbTeSe2 thin film topological insulator devices, with conduction dominated by the spatially separated top and bottom surfaces, each hosting a single species of Dirac fermions with independent gate control over the carrier type and density. We observe many intriguing quantum transport phenomena in such a fully-tunable two-species topological Dirac gas, including a zero-magnetic-field minimum conductivity close to twice the conductance quantum at the double Dirac point, a series of ambipolar two-component half-integer Dirac quantum Hall states and an electron-hole total filling factor zero state (with a zero-Hall plateau), exhibiting dissipationless (chiral) and dissipative (non-chiral) edge conduction respectively. Such a system paves the way to explore rich physics ranging from topological magnetoelectric effects to exciton condensation.

preprint2015arXiv

End-to-end delay in two hop relay MANETs with limited buffer

Although much literature has been dedicated to the delay performance of two-hop relay (2HR) mobile ad hoc networks (MANETs), existing studies usually assume that the buffer size of each node is infinite, so they may not reflect the real delay performance of a practical MANET with limited buffer. To address this issue, in this paper we explore the packet end-to-end delay in a 2HR MANET, where each node is equipped with a bounded and shared relay-buffer for storing and forwarding packets of all other flows. The transmission range of each node can be adjusted and a group-based scheduling scheme is adopted to avoid interference between simultaneous transmissions, while a handshake mechanism is added to the 2HR routing algorithm to avoid packet loss. With the help of Markov Chain Theory and Queuing Theory, we develop a new framework to fully characterize the packet delivery processes, and obtain the relay-buffer blocking probability (RBP) under any given exogenous packet input rate. Based on the RBP, we can compute the packet queuing delay in its source node and the delivery delay, and further derive the end-to-end delay in such a MANET with limited buffer.

preprint2015arXiv

End-to-end delay modeling in buffer-limited MANETs: a general theoretical framework

This paper focuses on a class of important two-hop relay mobile ad hoc networks (MANETs) with limited-buffer constraint and any mobility model that leads to the uniform distribution of the locations of nodes in steady state, and develops a general theoretical framework for the end-to-end (E2E) delay modeling there. We first combine the theories of Fixed-Point, Quasi-Birth-and-Death process and embedded Markov chain to model the limiting distribution of the occupancy states of a relay buffer, and then apply the absorbing Markov chain theory to characterize the packet delivery process, such that a complete theoretical framework is developed for the E2E delay analysis. With the help of this framework, we derive a general and exact expression for the E2E delay based on the modeling of both packet queuing delay and delivery delay. To demonstrate the application of our framework, case studies are further provided under two network scenarios with different MAC protocols to show how the E2E delay can be analytically determined for a given network scenario. Finally, we present extensive simulation and numerical results to illustrate the efficiency of our delay analysis as well as the impacts of network parameters on delay performance.

preprint2015arXiv

From Silicene to Half-Silicane by Hydrogenation

Graphane is graphene fully hydrogenated from both sides, forming a 1x1 structure, where all C atoms are in sp3 configuration. In silicene, the Si atoms are in a mixed sp2/sp3 configuration, so it is natural to imagine silicane in analogy to graphane. However, a monoatomic silicene sheet grown on substrates generally reconstructs into different phases, and only partially hydrogenated silicene with reconstructions had been reported before. In this report we produce half-silicane, where one Si sublattice is fully H-saturated and the other sublattice is intact, forming a perfect 1x1 structure. By hydrogenating various silicene phases on the Ag(111) substrate, we found that only the (2√3×2√3)R30° phase can produce half-silicane. Interestingly, this phase was previously considered to be a highly defective or incomplete silicene structure. Our results indicate that the structure of the (2√3×2√3)R30° phase involves a complete silicene-1x1 lattice instead of defective fragments, and the formation mechanism of half-silicane is discussed with the help of first principles calculations.

preprint2015arXiv

On throughput capacity for a class of buffer-limited MANETs

Available throughput performance studies for mobile ad hoc networks (MANETs) suffer from two major limitations: they mainly focus on the scaling law study of throughput, while the exact throughput of such networks remains largely unknown; they usually consider the infinite buffer scenarios, which are not applicable to the practical networks with limited buffer. As a step to address these limitations, this paper develops a general framework for the exact throughput capacity study of a class of buffer-limited MANETs with the two-hop relay. We first provide analysis to reveal how the throughput capacity of such a MANET is determined by its relay-buffer blocking probability (RBP). Based on the Embedded Markov Chain Theory and Queuing Theory, a novel theoretical framework is then developed to enable the RBP and closed-form expression for exact throughput capacity to be derived. We further conduct case studies under two typical transmission scheduling schemes to illustrate the applicability of our framework and to explore the corresponding capacity optimization as well as capacity scaling law. Finally, extensive simulation and numerical results are provided to validate the efficiency of our framework and to show the impacts brought by the buffer constraint.

preprint2015arXiv

Proximity effect between a topological insulator and a magnetic insulator with large perpendicular anisotropy

We report that thin films of a prototype topological insulator, Bi$_{2}$Se$_{3}$, can be epitaxially grown onto the (0001) surface of BaFe$_{12}$O$_{19}$(BaM), a magnetic insulator with high Curie temperature and large perpendicular anisotropy. In the Bi$_2$Se$_3$ thin films grown on non-magnetic substrates, classic weak antilocalization (WAL) is manifested as cusp-shaped positive magnetoresistance (MR) in perpendicular magnetic fields and parabola-shaped positive MR in parallel fields, whereas in Bi$_{2}$Se$_{3}$/BaM heterostructures the low field MR is parabola-shaped, which is positive in perpendicular fields and negative in parallel fields. The magnetic field and temperature dependence of the MR is explained as a consequence of the suppression of WAL due to strong magnetic interactions at the Bi$_{2}$Se$_{3}$/BaM interface.

preprint2015arXiv

Throughput capacity of two-hop relay MANETs under finite buffers

Since the seminal work of Grossglauser and Tse [1], the two-hop relay algorithm and its variants have been attractive for mobile ad hoc networks (MANETs) due to their simplicity and efficiency. However, most literature assumed an infinite buffer size for each node, which is obviously not applicable to a realistic MANET. In this paper, we focus on the exact throughput capacity study of two-hop relay MANETs under the practical finite relay buffer scenario. The arrival process and departure process of the relay queue are fully characterized, and an ergodic Markov chain-based framework is also provided. With this framework, we obtain the limiting distribution of the relay queue and derive the throughput capacity under any relay buffer size. Extensive simulation results are provided to validate our theoretical framework and explore the relationship among the throughput capacity, the relay buffer size and the number of nodes.

preprint2014arXiv

A Bayesian Method for the Extinction

We propose a Bayesian method to measure the total Galactic extinction parameters, $R_V$ and $A_V$. Validation tests based on simulated data indicate that the method can achieve an accuracy of around 0.01 mag. We apply this method to the SDSS BHB stars in the northern Galactic cap and find that the derived extinctions are highly consistent with those from Schlegel, Finkbeiner & Davis (1998). This suggests that the Bayesian method is promising for extinction estimation, even when the reddening values are close to the observational errors.

preprint2014arXiv

Controlling and distinguishing electronic transport of topological and trivial surface states in a topological insulator

Topological insulators (TI), with characteristic Dirac-fermion topological surface states (TSS), have emerged as a new class of electronic materials with rich potentials for both novel physics and device applications. However, a major challenge with realistic TI materials is to access, distinguish and manipulate the electronic transport of TSS often obscured by other possible parallel conduction channels that include the bulk as well as a two-dimensional electron gas (2DEG) formed near the surface due to bending of the bulk bands. Such a (Schrödinger-fermion) 2DEG represents topologically-trivial surface states, whose coexistence with the TSS has been revealed by angle-resolved photoemission spectroscopy. Here we show that simple manipulations of surface conditions can be used to access and control both types of surface states and their coexistence in bulk-insulating Bi2Te2Se, whose surface conduction is prominently manifested in temperature dependent resistance and nonlocal transport. The trivial 2DEG and TSS can both exhibit clear Shubnikov-de Haas oscillations in magnetoresistance, with different Berry phases ~0 and ~π that distinguish their different topological characters. We also report a deviation from the typical weak antilocalization behavior, possibly due to high mobility TSS. Our study enables distinguishing, controlling and harnessing electronic transport of TI surface carriers with different topological natures.

preprint2014arXiv

Document Clustering Based On Max-Correntropy Non-Negative Matrix Factorization

Nonnegative matrix factorization (NMF) has been successfully applied to many areas for classification and clustering. Commonly used NMF algorithms mainly target minimizing the $l_2$ distance or Kullback-Leibler (KL) divergence, which may not be suitable for the nonlinear case. In this paper, we propose a new decomposition method for document clustering that maximizes the correntropy between the original matrix and the product of two low-rank matrices. This method also allows us to learn the new basis vectors of the semantic feature space from the data. To our knowledge, no previous work has maximized correntropy in NMF to cluster high-dimensional document data. Our experimental results show the superiority of our proposed method over other variants of the NMF algorithm on the Reuters21578 and TDT2 datasets.

preprint2014arXiv

Exploring the total Galactic extinction with SDSS BHB stars

Aims: We used 12,530 photometrically selected blue horizontal branch (BHB) stars from the Sloan Digital Sky Survey (SDSS) to estimate the total extinction of the Milky Way at high Galactic latitudes, $R_V$ and $A_V$ in each line of sight. Methods: A Bayesian method was developed to estimate the reddening values in the given lines of sight. Based on the most likely values of reddening in multiple colors, we were able to derive the values of $R_V$ and $A_V$. Results: We selected 94 zero-reddened BHB stars from seven globular clusters as the template. The reddening in the four SDSS colors for the northern Galactic cap was estimated by comparing the field BHB stars with the template stars. The accuracy of this estimation is around 0.01 mag for most lines of sight. We also obtained $\langle R_V \rangle$ of around $2.40 \pm 1.05$ and an $A_V$ map within an uncertainty of 0.1 mag. The results, including reddening values in the four SDSS colors, $A_V$, and $R_V$ in each line of sight, are released online. In this work, we employ an up-to-date parallel technique on a GPU card to overcome time-consuming computations. We plan to release online the C++ CUDA code used for this analysis. Conclusions: The extinction map derived from BHB stars is highly consistent with that from Schlegel, Finkbeiner & Davis (1998). The derived $R_V$ is around $2.40 \pm 1.05$. Contamination probably makes the derived $R_V$ larger.

preprint2014arXiv

Graph Regularized Non-negative Matrix Factorization By Maximizing Correntropy

Non-negative matrix factorization (NMF) has proved effective in many clustering and classification tasks. The classic ways to measure the error between the original and the reconstructed matrix are the $l_2$ distance and the Kullback-Leibler (KL) divergence. However, nonlinear cases are not properly handled by these error measures. As a consequence, alternative measures based on nonlinear kernels, such as correntropy, have been proposed. However, the current correntropy-based NMF only targets low-level features without considering the intrinsic geometrical distribution of the data. In this paper, we propose a new NMF algorithm that preserves local invariance by adding graph regularization to max-correntropy-based matrix factorization. Meanwhile, each feature can learn a corresponding kernel from the data. Experimental results on Caltech101 and Caltech256 show the benefits of this combination over other NMF algorithms for unsupervised image clustering.

preprint2014arXiv

Observation of topological surface state quantum Hall effect in an intrinsic three-dimensional topological insulator

A three-dimensional (3D) topological insulator (TI) is a quantum state of matter with a gapped insulating bulk yet a conducting surface hosting topologically-protected gapless surface states. One of the most distinct electronic transport signatures predicted for such topological surface states (TSS) is a well-defined half-integer quantum Hall effect (QHE) in a magnetic field, where the surface Hall conductivities become quantized in units of $(1/2)e^2/h$ ($e$ being the electron charge, $h$ the Planck constant) concomitant with vanishing resistance. Here, we observe well-developed QHE arising from TSS in an intrinsic TI of BiSbTeSe2. Our samples exhibit surface dominated conduction even close to room temperature, while the bulk conduction is negligible. At low temperatures and high magnetic fields perpendicular to the top and bottom surfaces, we observe well-developed integer quantized Hall plateaus, with the two parallel surfaces each contributing a half-integer $e^2/h$ quantized Hall (QH) conductance, accompanied by vanishing longitudinal resistance. When the bottom surface is gated to match the top surface in carrier density, only odd integer QH plateaus are observed, representing a half-integer QHE of two degenerate Dirac gases. This system provides an excellent platform to pursue a plethora of exotic physics and novel device applications predicted for TIs, ranging from magnetic monopoles and Majorana particles to dissipationless electronics and fault-tolerant quantum computers.

preprint2012arXiv

Experimental demonstration of a free space cylindrical cloak without superluminal propagation

We experimentally demonstrated an alternative approach to invisibility cloaking that combines the technical advantages of all current major cloaking strategies in a unified manner and can thus solve the bottlenecks of the individual strategies. A broadband cylindrical invisibility cloak in free space is designed based on scattering cancellation (the approach of previous plasmonic cloaking) and implemented with anisotropic metamaterials (a fundamental property of singular-transformation cloaks). In particular, non-superluminal propagation of electromagnetic waves, a major advantage of non-Euclidean-transformation cloaks constructed with complex branch cuts, is inherited in this design and is the reason for its relatively broad bandwidth. This demonstration opens the possibility of future practical implementation of cloaking devices at large scales in free space.

preprint2012arXiv

Gluon Spin, Canonical Momentum, and Gauge Symmetry

It is well known that in gauge theories, the spin (and orbital angular momentum) of gauge particles is not gauge invariant, although the helicity is; neither are the canonical momentum and canonical angular momentum of charged particles. However, the simple appeal of these concepts has motivated repeated attempts to resurrect them as physical descriptions of gauge systems. In particular, measurability of the gluon-spin-contribution to the proton helicity in polarized proton scattering has generated many theoretical efforts in generalizing it and others as gauge-invariant quantities. In this work, we analyze the constraints of gauge symmetry, the significance of gluon spin in the light-cone gauge, and what is possible and natural in QCD parton physics, emphasizing experimental observability and physical interpretation in the structure of bound states. We also comment on the measurability of the orbital angular momentum of the Laguerre-Gaussian laser modes in optics.

preprint2012arXiv

Optical and electronic properties of two dimensional graphitic silicon carbide

The optical and electronic properties of two-dimensional few-layer graphitic silicon carbide (GSiC), in particular the monolayer and bilayer, are investigated by density functional theory and found to differ from those of graphene and silicene. Monolayer GSiC has a direct bandgap, while few-layer GSiC exhibits an indirect bandgap. The bandgap of monolayer GSiC can be tuned by in-plane strain. The properties of bilayer GSiC are extremely sensitive to the interlayer distance. These predictions suggest that monolayer GSiC could be a remarkable candidate for a novel type of light-emitting diode that exploits optical properties distinct from those of graphene, silicene and few-layer GSiC.

preprint2011arXiv

Atomic structure, energetics, and dynamics of topological solitons in Indium chains on Si(111) surfaces

Based on scanning tunneling microscopy and first-principles theoretical studies, we characterize the precise atomic structure of a topological soliton in In chains grown on Si(111) surfaces. Variable-temperature measurements of the soliton population allow us to determine the soliton formation energy to be ~60 meV, smaller than one half of the band gap of ~200 meV. Once created, these solitons have very low mobility, even though the activation energy is only about 20 meV; the sluggish nature is attributed to the exceptionally low attempt frequency for soliton migration. We further demonstrate local electric field-enhanced soliton dynamics.

preprint2011arXiv

Electronic Transport in Monolayer Graphene with Extreme Physical Deformation: ab Initio Density Functional Calculation

Electronic transport properties of monolayer graphene with extreme physical bending up to a 90° angle are studied using ab initio first-principles calculations. We discuss how key structural parameters, including step height, curvature radius and bending angle, modify the transport properties of the deformed graphene sheet compared to the corresponding flat one. The local density of states reveals that the energy state modification caused by the physical bending is highly localized. It is observed that the transport properties of bent graphene with a wide range of geometrical configurations are insensitive to the structural deformation in the low-energy transmission spectra, even in the extreme case of bending. The results support that graphene, with its superb electromechanical robustness, could serve as a viable material platform in a spectrum of applications such as photovoltaics, flexible electronics, OLED, and 3D electronic chips.

preprint2011arXiv

Theoretical comparison of quantum and thermal noise squeezing in silicon and graphene nanoresonators

We theoretically compared quantum noise squeezing in silicon and graphene nanoresonators based on experimental structure parameters. The conditions for achieving squeezed states in silicon and graphene are discussed. According to our theoretical analysis, graphene nanoresonators can obtain a much smaller squeezing factor than silicon, taking advantage of their small thickness. Both the quantum noise and thermal noise (Brownian motion) of a typical monolayer graphene nanoresonator can be reduced by 12.58 dB at T = 5 K with a pump voltage of 5 V.

preprint2009arXiv

Angular Momentum in Non-Relativistic QED and Photon Contribution to Spin of Hydrogen Atom

We study angular momentum in non-relativistic quantum electrodynamics (NRQED). We construct the effective total angular momentum operator by applying Noether's theorem to the NRQED Lagrangian. We calculate the NRQED matching for the individual components of the QED angular momentum up to one loop. We illustrate an application of our results by the first calculation of the angular momentum of the ground state hydrogen atom carried in radiative photons, $\alpha_{\rm em}^3/18\pi$, which might be measurable in future atomic experiments.

64 published item(s)

Decoding Scientific Experimental Images: The SPUR Benchmark for Perception, Understanding, and Reasoning

Anomalous Hall effect and rich magnetic phase diagram of Mn$_{100-x}$Rh$_{x}$ epitaxial films

A Unified Single-loop Alternating Gradient Projection Algorithm for Nonconvex-Concave and Convex-Nonconcave Minimax Problems

Bayesian Generalized Kernel Inference for Exploration of Autonomous Robots

An asperity-based statistical model for the adhesive friction of elastic nominally flat rough contact interfaces

Axial and Vector Structure Functions for Lepton-Nucleon Scattering, NuFact 2021 Update

Confidence-rich Localization and Mapping based on Particle Filter for Robotic Exploration

Crosslinguistic word order variation reflects evolutionary pressures of dependency and information locality

HistBERT: A Pre-trained Language Model for Diachronic Lexical Semantic Analysis

Image Captioning In the Transformer Age

Improving short-term bike sharing demand forecast through an irregular convolutional neural network

Isometries and MacWilliams Extension Property for Weighted Poset Metric

K-Detector: Identifying Duplicate Crash Failures in Large-Scale Software Delivery

KE-QI: A Knowledge Enhanced Article Quality Identification Dataset

LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

ML4CO-KIDA: Knowledge Inheritance in Dataset Aggregation

Neural reality of argument structure constructions

Noun2Verb: Probabilistic frame semantics for word class conversion

Optimization of rule-based energy management strategies for hybrid vehicles using dynamic programming

Quantile Off-Policy Evaluation via Deep Conditional Generative Learning

Reflexivity of Partitions Induced by Weighted Poset Metric and Combinatorial Metric

Revisiting the Persson theory of elastoplastic contact: A simpler closed-form solution and a rigorous proof of boundary conditions

Robust Inertial-aided Underwater Localization based on Imaging Sonar Keyframes

The Machine Learning for Combinatorial Optimization Competition (ML4CO): Results and Insights

Ultrafast disinfection of SARS-CoV-2 viruses

Ultrasensitive refractive index sensor with rotatory biased weak measurement

Magnetotransport of dirty-limit van Hove singularity quasiparticles

A Computational Investigation on Denominalization

A Real-time Automatic Validation System for Optical Transients detected by GWAC

An algorithm of selection of meteor candidates in GWAC system

Application of Pre-training Models in Named Entity Recognition

Contextualized moral inference

Corpus of Chinese Dynastic Histories: Gender Analysis over Two Millennia

Gate field effects on the topological insulator BiSbTeSe2 interface

Improving probability selecting based weights for Satisfiability Problem

Multi-Feature Discrete Collaborative Filtering for Fast Cold-start Recommendation

Text-based inference of moral sentiment change

The darkweb: a social network anomaly

The extended Gaia-PS1-SDSS (GPS1+) proper motion catalog

The Typology of Polysemy: A Multilingual Distributional Framework

To schedule or not to schedule: when no-scheduling can beat the best-known flow scheduling algorithm in datacenter networks

Word class flexibility: A deep contextualized approach

Fourier-based Rotation-invariant Feature Boosting: An Efficient Framework for Geospatial Object Detection

In-network Congestion-aware Load Balancing at Transport Layer

Quantum transport of two-species Dirac fermions in dual-gated three-dimensional topological insulators

End-to-end delay in two hop relay MANETs with limited buffer

End-to-end delay modeling in buffer-limited MANETs: a general theoretical framework

From Silicene to Half-Silicane by Hydrogenation

On throughput capacity for a class of buffer-limited MANETs

Proximity effect between a topological insulator and a magnetic insulator with large perpendicular anisotropy

Throughput capacity of two-hop relay MANETs under finite buffers

A Bayesian Method for the Extinction

Controlling and distinguishing electronic transport of topological and trivial surface states in a topological insulator

Document Clustering Based On Max-Correntropy Non-Negative Matrix Factorization

Exploring the total Galactic extinction with SDSS BHB stars

Graph Regularized Non-negative Matrix Factorization By Maximizing Correntropy

Observation of topological surface state quantum Hall effect in an intrinsic three-dimensional topological insulator

Experimental demonstration of a free space cylindrical cloak without superluminal propagation

Gluon Spin, Canonical Momentum, and Gauge Symmetry

Optical and electronic properties of two dimensional graphitic silicon carbide

Atomic structure, energetics, and dynamics of topological solitons in Indium chains on Si(111) surfaces

Electronic Transport in Monolayer Graphene with Extreme Physical Deformation: ab Initio Density Functional Calculation

Theoretical comparison of quantum and thermal noise squeezing in silicon and graphene nanoresonators

Angular Momentum in Non-Relativistic QED and Photon Contribution to Spin of Hydrogen Atom