Source author record

Xu Yang

Xu Yang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher | Unclaimed source record

Catalog footprint

What is connected

63 works

31 topics

4 close collaborators

preprint, 2026, arXiv

Rethinking LLM Ensembling from the Perspective of Mixture Models

Model ensembling is a well-established technique for improving the performance of machine learning models. Conventionally, this involves averaging the output distributions of multiple models and selecting the most probable label. This idea has been naturally extended to large language models (LLMs), yielding improved performance but incurring substantial computational cost. This inefficiency stems from directly applying the conventional ensemble implementation to LLMs, which requires a separate forward pass for each model to explicitly compute the ensemble distribution. In this paper, we propose the Mixture-model-like Ensemble (ME). By reinterpreting the ensemble as a mixture model, ME stochastically selects a single model at each step to generate the next token, thereby avoiding the need to explicitly compute the full ensemble distribution. ME is mathematically equivalent to sampling from the ensemble distribution, but requires invoking only one model, making it 1.78x-2.68x faster than conventional ensembling. Furthermore, this perspective connects LLM ensembling and token-level routing methods, suggesting that LLM ensembling is a special case of routing methods. Our findings open new avenues for efficient LLM ensembling and motivate further exploration of token-level routing strategies for LLMs. Our code is available at https://github.com/jialefu/Mixture-model-like-Ensemble/.
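The equivalence claimed in this abstract can be illustrated with a toy sketch. It is not the authors' implementation: the two fixed next-token distributions, the equal weights, and the function names below are all hypothetical stand-ins. In a real system each distribution would come from a forward pass of one LLM, and the mixture-style sampler would run only the selected model.

```python
import random

def ensemble_distribution(dists, weights):
    """Explicit ensemble: weighted average of every model's next-token distribution."""
    vocab = len(dists[0])
    return [sum(w * d[t] for w, d in zip(weights, dists)) for t in range(vocab)]

def mixture_sample(dists, weights, rng):
    """Mixture-style sampling: pick ONE model by weight, then sample a token from it alone."""
    k = rng.choices(range(len(dists)), weights=weights)[0]
    return rng.choices(range(len(dists[k])), weights=dists[k])[0]

def empirical_distribution(dists, weights, n_samples, seed=0):
    """Token frequencies observed when sampling with mixture_sample."""
    rng = random.Random(seed)
    counts = [0] * len(dists[0])
    for _ in range(n_samples):
        counts[mixture_sample(dists, weights, rng)] += 1
    return [c / n_samples for c in counts]

# Hypothetical toy data: two "models" over a three-token vocabulary.
TOY_DISTS = [[0.7, 0.2, 0.1], [0.1, 0.3, 0.6]]
TOY_WEIGHTS = [0.5, 0.5]

if __name__ == "__main__":
    print(ensemble_distribution(TOY_DISTS, TOY_WEIGHTS))
    print(empirical_distribution(TOY_DISTS, TOY_WEIGHTS, n_samples=50_000))
```

The token frequencies produced by the one-model-per-step sampler approach the explicitly averaged distribution as the sample count grows, which is the marginalization identity behind the mixture-model view.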

preprint, 2024, arXiv

Capacity Results for Multiple-Input Multiple-Output Optical Wireless Communication With Per-Antenna Intensity Constraints

In this paper, we investigate the capacity of a multiple-input multiple-output (MIMO) optical intensity channel (OIC) under per-antenna peak- and average-intensity constraints. We first consider the case where the average intensities of input are required to be equal to preassigned constants due to the requirement of illumination quality and color temperature. When the channel graph of the MIMO OIC is strongly connected, we prove that the strongest eigen-subchannel must have positive channel gains, which simplifies the capacity analysis. Then we derive various capacity bounds by utilizing linear precoding, generalized entropy power inequality, and QR decomposition, etc. These bounds are numerically verified to approach the capacity in the low or high signal-to-noise ratio regime. Specifically, when the channel rank is one less than the number of transmit antennas, we derive an equivalent capacity expression from the perspective of convex geometry, and new lower bounds are derived based on this equivalent expression. Finally, the developed results are extended to the more general case where the average intensities of input are required to be no larger than preassigned constants.

preprint, 2023, arXiv

Pairing Symmetry and Fermion Projective Symmetry Groups

The Ginzburg-Landau (GL) theory is very successful in describing the pairing symmetry, a fundamental characterization of the broken symmetries in a paired superfluid or superconductor. However, GL theory does not describe fermionic excitations such as Bogoliubov quasiparticles or Andreev bound states that are directly related to topological properties of the superconductor. In this work, we show that the symmetries of the fermionic excitations are captured by a Projective Symmetry Group (PSG), which is a group extension of the bosonic symmetry group in the superconducting state. We further establish a correspondence between the pairing symmetry and the fermion PSG. When the normal and superconducting states share the same spin rotational symmetry, there is a simpler correspondence between the pairing symmetry and the fermion PSG, which we enumerate for all 32 crystalline point groups. We also discuss the general framework for computing PSGs when the spin rotational symmetry is spontaneously broken in the superconducting state. This PSG formalism leads to experimental consequences, and as an example, we show how a given pairing symmetry dictates the classification of topological superconductivity.

preprint, 2022, arXiv

A Distributed Implementation of Steady-State Kalman Filter

This paper studies distributed state estimation in a sensor network, where $m$ sensors are deployed to infer the $n$-dimensional state of a linear time-invariant (LTI) Gaussian system. By a lossless decomposition of the optimal steady-state Kalman filter, we show that the problem of distributed estimation can be reformulated as the synchronization of homogeneous linear systems. Based on this decomposition, a distributed estimator is proposed, where each sensor node runs a local filter using only its own measurement and fuses the local estimate of each node with a consensus algorithm. We show that the average of the estimates from all sensors coincides with the optimal Kalman estimate. Numerical examples are provided at the end to illustrate the performance of the proposed scheme.
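The fusion step mentioned in this abstract relies on a consensus algorithm. The sketch below shows only generic average consensus on a ring of nodes, not the paper's estimator: the ring topology, the equal 1/3 weights, the scalar values, and the function names are assumptions made for illustration.

```python
def ring_consensus_step(values):
    """One synchronous step: each node averages itself with its two ring neighbours."""
    n = len(values)
    return [
        (values[(i - 1) % n] + values[i] + values[(i + 1) % n]) / 3.0
        for i in range(n)
    ]

def run_consensus(values, steps):
    """Repeat the averaging step; the sum (and hence the mean) is preserved each time."""
    for _ in range(steps):
        values = ring_consensus_step(values)
    return values

# Hypothetical local estimates held by four sensor nodes (scalars for simplicity).
LOCAL_ESTIMATES = [1.0, 4.0, 2.0, 9.0]

if __name__ == "__main__":
    print(run_consensus(LOCAL_ESTIMATES, steps=60))
```

Because the averaging weights form a doubly stochastic matrix, every node's value converges to the network-wide mean of the initial values using only neighbour-to-neighbour exchanges.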

preprint, 2022, arXiv

Auto-Encoding Score Distribution Regression for Action Quality Assessment

The action quality assessment (AQA) of videos is a challenging vision task since the relation between videos and action scores is difficult to model. Thus, AQA has been widely studied in the literature. Traditionally, AQA is treated as a regression problem to learn the underlying mappings between videos and action scores. But previous methods ignored data uncertainty in AQA datasets. To address aleatoric uncertainty, we develop a plug-and-play module, the Distribution Auto-Encoder (DAE). Specifically, it encodes videos into distributions and uses the reparameterization trick in variational auto-encoders (VAE) to sample scores, which establishes a more accurate mapping between videos and scores. Meanwhile, a likelihood loss is used to learn the uncertainty parameters. We plug our DAE approach into MUSDL and CoRe. Experimental results on public datasets demonstrate that our method achieves state-of-the-art results on the AQA-7, MTL-AQA, and JIGSAWS datasets. Our code is available at https://github.com/InfoX-SEU/DAE-AQA.
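The reparameterization trick and likelihood loss named in this abstract can be sketched in a few lines. This is a generic illustration under stated assumptions, not the DAE code: a Gaussian score distribution parameterized by a mean and a log-variance is assumed, and the function names are hypothetical. In the actual method those parameters would be predicted from video features by a network.

```python
import math
import random

def sample_score(mu, log_var, rng):
    """Reparameterized sample: score = mu + sigma * eps, with eps drawn from N(0, 1)."""
    eps = rng.gauss(0.0, 1.0)
    sigma = math.exp(0.5 * log_var)
    return mu + sigma * eps

def gaussian_nll(target, mu, log_var):
    """Negative log-likelihood of `target` under N(mu, exp(log_var))."""
    return 0.5 * (math.log(2.0 * math.pi) + log_var
                  + (target - mu) ** 2 / math.exp(log_var))

if __name__ == "__main__":
    rng = random.Random(0)
    print(sample_score(mu=85.0, log_var=math.log(4.0), rng=rng))
    print(gaussian_nll(target=83.0, mu=85.0, log_var=math.log(4.0)))
```

Writing the sample as a deterministic function of the mean, the log-variance, and external noise is what lets gradients reach the distribution parameters during training.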

preprint, 2022, arXiv

Automatically Discovering Novel Visual Categories with Self-supervised Prototype Learning

This paper tackles the problem of novel category discovery (NCD), which aims to discriminate unknown categories in large-scale image collections. The NCD task is challenging due to the closeness to the real-world scenarios, where we have only encountered some partial classes and images. Unlike other works on the NCD, we leverage the prototypes to emphasize the importance of category discrimination and alleviate the issue of missing annotations of novel classes. Concretely, we propose a novel adaptive prototype learning method consisting of two main stages: prototypical representation learning and prototypical self-training. In the first stage, we obtain a robust feature extractor, which could serve for all images with base and novel categories. This ability of instance and category discrimination of the feature extractor is boosted by self-supervised learning and adaptive prototypes. In the second stage, we utilize the prototypes again to rectify offline pseudo labels and train a final parametric classifier for category clustering. We conduct extensive experiments on four benchmark datasets and demonstrate the effectiveness and robustness of the proposed method with state-of-the-art performance.

preprint, 2022, arXiv

Chromospheric recurrent jets in a sunspot group and their inter-granular origin

We report on high resolution observations of recurrent fan-like jets by the Goode Solar Telescope (GST) in multi-wavelengths inside a sunspot group. The dynamical behaviour of the jets is derived from the Hα line profiles. Quantitative values for one well-identified event have been obtained showing a maximum projected velocity of 42 km s$^{-1}$ and a Doppler shift of the order of 20 km s$^{-1}$. The footpoints/roots of the jets have a lifted center on the Hα line profile compared to the quiet sun, suggesting a long lasting heating at these locations. The magnetic field between the small sunspots in the group shows a very high resolution pattern with parasitic polarities along the inter-granular lanes accompanied by high velocity converging flows (4 km s$^{-1}$) in the photosphere. Magnetic cancellations between the opposite polarities are observed in the vicinity of the footpoints of the jets. Along the inter-granular lanes, a horizontal magnetic field of around 1000 Gauss is generated impulsively. Overall, all the kinetic features at the different layers through photosphere and chromosphere favor a convection-driven reconnection scenario for the recurrent fan-like jets, and evidence a site of reconnection between the photosphere and chromosphere corresponding to the inter-granular lanes.

preprint, 2022, arXiv

Computation of the Time-Dependent Dirac Equation with Physics-Informed Neural Networks

We propose to compute the time-dependent Dirac equation using physics-informed neural networks (PINNs), a new powerful tool in scientific machine learning avoiding the use of approximate derivatives of differential operators. PINNs search solutions in the form of parameterized (deep) neural networks, whose derivatives (in time and space) are performed by automatic differentiation. The computational cost comes from the need to solve high-dimensional optimization problems using stochastic gradient methods and train the network with a large number of points. Specifically, we derive PINNs-based algorithms and present some key fundamental properties of these algorithms when applied to the Dirac equations in different physical frameworks.

preprint, 2022, arXiv

Deep Neural Networks for Creating Reliable PmP Database with a Case Study in Southern California

Recent progress in artificial intelligence and machine learning makes it possible to automatically identify seismic phases from exponentially growing seismic data. Despite some exciting successes in automatic picking of the first P- and S-wave arrivals, auto-identification of later seismic phases such as the Moho-reflected PmP waves remains a significant challenge in matching the performance of experienced analysts. The main difficulty of machine-identifying PmP waves is that the identifiable PmP waves are rare, making the problem of identifying the PmP waves from a massive seismic database inherently unbalanced. In this work, by utilizing a high-quality PmP dataset (10,192 manual picks) in southern California, we develop PmPNet, a deep-neural-network-based algorithm to automatically identify PmP waves efficiently; by doing so, we accelerate the process of identifying the PmP waves. PmPNet applies similar techniques in the machine learning community to address the imbalance of PmP datasets. The architecture of PmPNet is a residual neural network (ResNet)-autoencoder with an additional predictor block, where the encoder, decoder, and predictor are equipped with ResNet connections. We conduct systematic research with field data, concluding that PmPNet can efficiently achieve high precision and high recall simultaneously to automatically identify PmP waves from a massive seismic database. Applying the pre-trained PmPNet to the seismic database from January 1990 to December 1999 in southern California, we obtain nearly twice more PmP picks than the original PmP dataset, providing valuable data for other studies such as mapping the topography of the Moho discontinuity and imaging the lower crust structures of southern California.

preprint, 2022, arXiv

Electromagnetically induced transparency in inhomogeneously broadened divacancy defect ensembles in SiC

Electromagnetically induced transparency (EIT) is a phenomenon that can provide strong and robust interfacing between optical signals and quantum coherence of electronic spins. In its archetypical form, mainly explored with atomic media, it uses a (near-)homogeneous ensemble of three-level systems, in which two low-energy spin-1/2 levels are coupled to a common optically excited state. We investigate the implementation of EIT with c-axis divacancy color centers in silicon carbide. While this material has attractive properties for quantum device technologies with near-IR optics, implementing EIT is complicated by the inhomogeneous broadening of the optical transitions throughout the ensemble and the presence of multiple ground-state levels. These may lead to darkening of the ensemble upon resonant optical excitation. Here, we show that EIT can be established with high visibility also in this material platform upon careful design of the measurement geometry. Comparison of our experimental results with a model based on the Lindblad equations indicates that we can create coherences between different sets of two levels all-optically in these systems, with potential impact for RF-free quantum sensing applications. Our work provides an understanding of EIT in multi-level systems with significant inhomogeneities, and our considerations are valid for a wide array of defects in semiconductors.

preprint, 2022, arXiv

EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching

Current metrics for video captioning are mostly based on the text-level comparison between reference and candidate captions. However, they have some insuperable drawbacks, e.g., they cannot handle videos without references, and they may result in biased evaluation due to the one-to-many nature of video-to-text and the neglect of visual relevance. From the human evaluator's viewpoint, a high-quality caption should be consistent with the provided video, but not necessarily be similar to the reference literally or semantically. Inspired by human evaluation, we propose EMScore (Embedding Matching-based score), a novel reference-free metric for video captioning, which directly measures similarity between video and candidate captions. Benefiting from the recent development of large-scale pre-training models, we exploit a well pre-trained vision-language model to extract visual and linguistic embeddings for computing EMScore. Specifically, EMScore combines matching scores of both coarse-grained (video and caption) and fine-grained (frames and words) levels, which takes the overall understanding and detailed characteristics of the video into account. Furthermore, considering the potential information gain, EMScore can be flexibly extended to the conditions where human-labeled references are available. Last but not least, we collect the VATEX-EVAL and ActivityNet-FOIL datasets to systematically evaluate the existing metrics. VATEX-EVAL experiments demonstrate that EMScore has higher human correlation and lower reference dependency. The ActivityNet-FOIL experiment verifies that EMScore can effectively identify "hallucinating" captions. The datasets will be released to facilitate the development of video captioning metrics. The code is available at: https://github.com/ShiYaya/emscore.

preprint, 2022, arXiv

MemoNav: Selecting Informative Memories for Visual Navigation

Image-goal navigation is a challenging task, as it requires the agent to navigate to a target indicated by an image in a previously unseen scene. Current methods introduce diverse memory mechanisms which save navigation history to solve this task. However, these methods use all observations in the memory for generating navigation actions without considering which fraction of this memory is informative. To address this limitation, we present the MemoNav, a novel memory mechanism for image-goal navigation, which retains the agent's informative short-term memory and long-term memory to improve the navigation performance on a multi-goal task. The node features on the agent's topological map are stored in the short-term memory, as these features are dynamically updated. To aid the short-term memory, we also generate long-term memory by continuously aggregating the short-term memory via a graph attention module. The MemoNav retains the informative fraction of the short-term memory via a forgetting module based on a Transformer decoder and then incorporates this retained short-term memory and the long-term memory into working memory. Lastly, the agent uses the working memory for action generation. We evaluate our model on a new multi-goal navigation dataset. The experimental results show that the MemoNav outperforms the SoTA methods by a large margin with a smaller fraction of navigation history. The results also empirically show that our model is less likely to be trapped in a deadlock, which further validates that the MemoNav improves the agent's navigation efficiency by reducing redundant steps.

preprint, 2022, arXiv

Nontrivial Solutions of Dirac-Laplace Equation on Compact Spin Manifolds

We apply the Fountain theorem to a class of nonlinear Dirac-Laplace equations on compact spin manifolds. We show that the standard Ambrosetti-Rabinowitz condition can be replaced by a more natural super-quadratic condition that is enough to obtain the Cerami condition under certain conditions. Multiple solutions of nonlinear Dirac-Laplace equations are obtained in this note.

preprint, 2022, arXiv

Observations of pores and surrounding regions with CO 4.66 μm lines by BBSO/CYRA

Solar observations of carbon monoxide (CO) indicate the existence of lower-temperature gas in the lower solar chromosphere. We present an observation of pores, quiet-Sun, and network magnetic field regions with CO 4.66 μm lines by the Cryogenic Infrared Spectrograph (CYRA) at Big Bear Solar Observatory. We used the strong CO lines at around 4.66 μm to understand the properties of the thermal structures of the lower solar atmosphere in different solar features with various magnetic field strengths. AIA 1700 Å images, HMI continuum images and magnetograms are also included in the observation. The data from a 3D radiation magnetohydrodynamic (MHD) simulation with the Bifrost code are also employed for the first time to be compared with the observation. We used the RH code to synthesize the CO line profiles in the network regions. The CO 3-2 R14 line center intensity is either enhanced or diminished with increasing magnetic field strength, which should be caused by different heating effects in magnetic flux tubes with different sizes. We find several "cold bubbles" in the CO 3-2 R14 line center intensity images, which can be classified into two types. One type is located in the quiet-Sun regions without magnetic fields. The other type, which has rarely been reported in the past, is near or surrounded by magnetic fields. Notably, some are located at the edge of the magnetic network. The two kinds of cold bubbles and the relationship between cold bubble intensities and network magnetic field strength are both reproduced by the 3D MHD simulation with the Bifrost and RH codes. The simulation also shows that there is a cold plasma blob near the network magnetic fields, causing the observed cold bubbles seen in the CO 3-2 R14 line center image. Our observation and simulation illustrate that the magnetic field plays a vital role in the generation of some CO cold bubbles.

preprint, 2022, arXiv

On Color Isomorphic Pairs in Proper Edge Colourings of Complete Graphs

Following the recent paper which initiated the study of colour isomorphism problems for complete graphs, we obtain upper bounds for $f_2(n,H)$ for a family of graphs $H$ obtained as the $K_0$-th rooted power of a balanced rooted tree for some sufficiently large $K_0$. The proof uses the random polynomial method of Bukh. We also obtain matching lower bounds for $1$-subdivisions of the complete bipartite graph.

preprint, 2022, arXiv

On Coupled Dirac Systems under Boundary Condition

In this article we study the existence of solutions for the Dirac systems
\begin{equation}
\left\{
\begin{array}{l}
Pu=\frac{\partial H}{\partial v}(x,u,v) \quad\text{on } M,\\
Pv=\frac{\partial H}{\partial u}(x,u,v) \quad\text{on } M,\\
B_{\text{CHI}}u= B_{\text{CHI}}v=0 \quad\text{on } \partial M,
\end{array}
\right.
\end{equation}
where $M$ is an $m$-dimensional compact oriented Riemannian spin manifold with smooth boundary $\partial M$, $P$ is the Dirac operator under the boundary condition $B_{\text{CHI}}u= B_{\text{CHI}}v=0$ on $\partial M$, and $u,v\in C^{\infty}(M,\Sigma M)$ are spinors. Using an analytic framework of proper products of fractional Sobolev spaces, existence results for solutions of the coupled Dirac systems are obtained for nonlinearities with superquadratic growth rates.

preprint, 2022, arXiv

On the extinction-extinguishing dichotomy for a stochastic Lotka-Volterra type population dynamical system

We study a two-dimensional process $(X, Y)$ arising as the unique nonnegative solution to a pair of stochastic differential equations driven by independent Brownian motions and compensated spectrally positive Lévy random measures. Both processes $X$ and $Y$ can be identified as continuous-state nonlinear branching processes where the evolution of $Y$ is negatively affected by $X$. Assuming that process $X$ extinguishes, i.e. it converges to $0$ but never reaches $0$ in finite time, and process $Y$ converges to $0$, we identify rather sharp conditions under which the process $Y$ exhibits, respectively, one of the following behaviors: extinction with probability one, extinguishing with probability one or both extinction and extinguishing occurring with strictly positive probabilities.

preprint, 2022, arXiv

Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning

Compositional Zero-Shot Learning (CZSL) aims to recognize unseen compositions formed from states and objects seen during training. Since the same state may vary in visual appearance when entangled with different objects, CZSL is still a challenging task. Some methods recognize state and object with two trained classifiers, ignoring the impact of the interaction between object and state; other methods try to learn the joint representation of the state-object compositions, leading to a domain gap between seen and unseen composition sets. In this paper, we propose a novel Siamese Contrastive Embedding Network (SCEN) (Code: https://github.com/XDUxyLi/SCEN-master) for unseen composition recognition. Considering the entanglement between state and object, we embed the visual feature into a Siamese Contrastive Space to capture prototypes of them separately, alleviating the interaction between state and object. In addition, we design a State Transition Module (STM) to increase the diversity of training compositions, improving the robustness of the recognition model. Extensive experiments indicate that our method significantly outperforms the state-of-the-art approaches on three challenging benchmark datasets, including the recently proposed C-GQA dataset.

preprint, 2022, arXiv

The Co-alignment of Winged Hα Data Observed by the New Vacuum Solar Telescope

The New Vacuum Solar Telescope (NVST) has been releasing its novel winged Hα data (WHD) since April 2021, namely the Hα imaging spectroscopic data. Compared with the previously released version, the new data are further co-aligned among the off-band images and packaged into a standard solar physics community format. In this study, we illustrate the alignment algorithm used by the novel WHD, which is mainly based on the optical flow method to obtain the translation offset between the winged images. To quantitatively evaluate the alignment results of two images with different similarities, we calculate the alignment accuracies between the images of different off-band and line center, respectively. The result shows that our alignment algorithm can reach an accuracy of about 0.1" when the off-band of the winged image is lower than 0.6 Å. In addition, we introduce the final product of the WHD in detail, which makes it convenient for solar physicists to use the high-resolution Hα imaging spectroscopic data of NVST.

preprint, 2022, arXiv

Towards Unbiased Visual Emotion Recognition via Causal Intervention

Although much progress has been made in visual emotion recognition, researchers have realized that modern deep networks tend to exploit dataset characteristics to learn spurious statistical associations between the input and the target. Such dataset characteristics are usually treated as dataset bias, which damages the robustness and generalization performance of these recognition systems. In this work, we scrutinize this problem from the perspective of causal inference, where such dataset characteristic is termed as a confounder which misleads the system to learn the spurious correlation. To alleviate the negative effects brought by the dataset bias, we propose a novel Interventional Emotion Recognition Network (IERN) to achieve the backdoor adjustment, which is one fundamental deconfounding technique in causal inference. Specifically, IERN starts by disentangling the dataset-related context feature from the actual emotion feature, where the former forms the confounder. The emotion feature will then be forced to see each confounder stratum equally before being fed into the classifier. A series of designed tests validate the efficacy of IERN, and experiments on three emotion benchmarks demonstrate that IERN outperforms state-of-the-art approaches for unbiased visual emotion recognition. Code is available at https://github.com/donydchen/causal_emotion

preprint, 2022, arXiv

Weakly Aligned Feature Fusion for Multimodal Object Detection

To achieve accurate and robust object detection in real-world scenarios, various forms of images are incorporated, such as color, thermal, and depth. However, multimodal data often suffer from the position shift problem, i.e., the image pair is not strictly aligned, so that one object has different positions in different modalities. For deep learning methods, this problem makes it difficult to fuse multimodal features and complicates the convolutional neural network (CNN) training. In this article, we propose a general multimodal detector named aligned region CNN (AR-CNN) to tackle the position shift problem. First, a region feature (RF) alignment module with adjacent similarity constraint is designed to consistently predict the position shift between two modalities and adaptively align the cross-modal RFs. Second, we propose a novel region of interest (RoI) jitter strategy to improve the robustness to unexpected shift patterns. Third, we present a new multimodal feature fusion method that selects the more reliable feature and suppresses the less useful one via feature reweighting. In addition, by locating bounding boxes in both modalities and building their relationships, we provide novel multimodal labeling named KAIST-Paired. Extensive experiments on 2-D and 3-D object detection, RGB-T, and RGB-D datasets demonstrate the effectiveness and robustness of our method.

preprint, 2021, arXiv

Causal Attention for Vision-Language Tasks

We present a novel attention mechanism: Causal Attention (CATT), to remove the ever-elusive confounding effect in existing attention-based vision-language models. This effect causes harmful bias that misleads the attention module to focus on the spurious correlations in training data, damaging the model generalization. As the confounder is unobserved in general, we use the front-door adjustment to realize the causal intervention, which does not require any knowledge on the confounder. Specifically, CATT is implemented as a combination of 1) In-Sample Attention (IS-ATT) and 2) Cross-Sample Attention (CS-ATT), where the latter forcibly brings other samples into every IS-ATT, mimicking the causal intervention. CATT abides by the Q-K-V convention and hence can replace any attention module such as top-down attention and self-attention in Transformers. CATT improves various popular attention-based vision-language models by considerable margins. In particular, we show that CATT has great potential in large-scale pre-training, e.g., it can promote the lighter LXMERT, which uses fewer data and less computational power, comparable to the heavier UNITER. Code is published at https://github.com/yangxuntu/catt.
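For reference, the front-door adjustment invoked in this abstract has a standard textbook form. The symbols below are generic assumptions rather than the paper's notation: $X$ is the input, $Y$ the output, and $Z$ a mediator lying on the path from $X$ to $Y$.

```latex
\begin{equation}
P\bigl(Y \mid do(X = x)\bigr)
  = \sum_{z} P(Z = z \mid X = x) \sum_{x'} P(Y \mid Z = z, X = x')\, P(X = x')
\end{equation}
```

The outer sum ranges over mediator values for the given input, while the inner sum averages over other inputs $x'$. This matches the abstract's description of in-sample and cross-sample attention only loosely, so the correspondence should be read as an interpretation.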

preprint, 2021, arXiv

Classical limit for the varying-mass Schrödinger equation with random inhomogeneities

The varying-mass Schrödinger equation (VMSE) has been successfully applied to model electronic properties of semiconductor hetero-structures, for example, quantum dots and quantum wells. In this paper, we consider VMSE with small random heterogeneities, and derive a radiative transfer equation as its asymptotic limit. The main tool is to systematically apply the Wigner transform in the classical regime when the rescaled Planck constant $\varepsilon \ll 1$, and expand the Wigner equation to proper orders of $\varepsilon$. As a proof of concept, we numerically compute both VMSE and its limiting radiative transfer equation, and show that their solutions agree well in the classical regime.

preprint, 2021, arXiv

Cloud Cover and Aurora Contamination at Dome A in 2017 from KLCAM

Dome A in Antarctica has many characteristics that make it an excellent site for astronomical observations, from the optical to the terahertz. Quantitative site testing is still needed to confirm the site's properties. In this paper, we present a statistical analysis of cloud cover and aurora contamination from the Kunlun Cloud and Aurora Monitor (KLCAM). KLCAM is an automatic, unattended all-sky camera aiming for long-term monitoring of the usable observing time and optical sky background at Dome A. It was installed at Dome A in January 2017, worked through the austral winter, and collected over 47,000 images over 490 days. A semi-quantitative visual data analysis of cloud cover and auroral contamination was carried out by five individuals. The analysis shows that the night sky was free of cloud for 83 per cent of the time, which ranks Dome A highly in a comparison with other observatory sites. Although aurorae were detected somewhere on an image for nearly 45 per cent of the time, the strongest auroral emission lines can be filtered out with customized filters.

preprint, 2021, arXiv

Deep unfitted Nitsche method for elliptic interface problems

This paper proposes a deep unfitted Nitsche method for computing elliptic interface problems with high contrasts in high dimensions. To capture discontinuities of the solution caused by interfaces, we reformulate the problem as an energy minimization involving two weakly coupled components. This enables us to train two deep neural networks to represent two components of the solution in high dimensions. The curse of dimensionality is alleviated by using the Monte-Carlo method to discretize the unfitted Nitsche energy function. We present several numerical examples to show the performance of the proposed method.

preprint, 2021, arXiv

Doubly Contrastive Deep Clustering

Deep clustering successfully provides more effective features than conventional ones and thus becomes an important technique in current unsupervised learning. However, most deep clustering methods ignore the vital positive and negative pairs introduced by data augmentation and further the significance of contrastive learning, which leads to suboptimal performance. In this paper, we present a novel Doubly Contrastive Deep Clustering (DCDC) framework, which constructs contrastive loss over both sample and class views to obtain more discriminative features and competitive results. Specifically, for the sample view, we set the class distribution of the original sample and its augmented version as positive sample pairs and set one of the other augmented samples as negative sample pairs. After that, we can adopt the sample-wise contrastive loss to pull positive sample pairs together and push negative sample pairs apart. Similarly, for the class view, we build the positive and negative pairs from the sample distribution of the class. In this way, two contrastive losses successfully constrain the clustering results of mini-batch samples at both the sample and class level. Extensive experimental results on six benchmark datasets demonstrate the superiority of our proposed model against state-of-the-art methods. Particularly on the challenging Tiny-ImageNet dataset, our method leads the latest comparison method by 5.6%. Our code will be available at https://github.com/ZhiyuanDang/DCDC.

preprint2021arXiv

Multi-Passband Observations of A Solar Flare over the He I 10830 Å line

This study presents a C3.0 flare observed by the BBSO/GST and IRIS on 2018 May 28 around 17:10 UT. The Near Infrared Imaging Spectropolarimeter (NIRIS) of GST was set to spectral imaging mode to scan five spectral positions at $\pm$ 0.8 Å, $\pm$ 0.4 Å and line center of He I 10830. At the flare ribbon's leading edge the line is observed to undergo enhanced absorption, while the rest of the ribbon is observed to be in emission. When in emission, the contrast compared to the pre-flare ranges from about $30~\%$ to nearly $100~\%$ at different spectral positions. Two types of spectra, "convex" shape with higher intensity at line core and "concave" shape with higher emission in the line wings, are found at the trailing and peak flaring areas, respectively. On the ribbon front, negative contrasts, or enhanced absorption, of about $10\% - 20\%$ appear in all five wavelengths. This observation strongly suggests that the negative flares observed in He I 10830 with mono-filtergram previously were not caused by pure Doppler shifts of this spectral line. Instead, the enhanced absorption appears to be a consequence of flare energy injection, namely non-thermal collisional ionization of helium caused by the precipitation of high energy electrons, as found in our recent numerical modeling results. In addition, though not strictly simultaneous, observations of Mg II from the IRIS spacecraft show an obvious central reversal pattern at the locations where enhanced absorption of He I 10830 is seen, which is consistent with previous observations.

preprint2020arXiv

Automated Pavement Crack Segmentation Using U-Net-based Convolutional Neural Network

Automated pavement crack image segmentation is challenging because of inherent irregular patterns, lighting conditions, and noise in images. Conventional approaches require a substantial amount of feature engineering to differentiate crack regions from non-affected regions. In this paper, we propose a deep learning technique based on a convolutional neural network to perform segmentation tasks on pavement crack images. Our approach requires minimal feature engineering compared to other machine learning techniques. We propose a U-Net-based network architecture in which we replace the encoder with a pretrained ResNet-34 neural network. We use a "one-cycle" training schedule based on cyclical learning rates to speed up the convergence. Our method achieves an F1 score of 96% on the CFD dataset and 73% on the Crack500 dataset, outperforming other algorithms tested on these datasets. We perform ablation studies on various techniques that helped us get marginal performance boosts, i.e., the addition of spatial and channel squeeze and excitation (SCSE) modules, training with gradually increasing image sizes, and training various neural network layers with different learning rates.
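As a rough illustration of a "one-cycle" schedule, the sketch below ramps the learning rate linearly up to a peak and then anneals it with a cosine curve. The abstract does not give the schedule's shape or hyperparameters, so the function name, the 30% warm-up fraction, the divisors, and the cosine annealing are all assumptions modeled on common one-cycle implementations.

```python
import math


def one_cycle_lr(step, total_steps, max_lr=1e-3, pct_start=0.3,
                 div_factor=25.0, final_div_factor=1e4):
    """Hypothetical one-cycle schedule.

    Linear warm-up from max_lr / div_factor to max_lr over the first
    pct_start of training, then cosine annealing down to
    (max_lr / div_factor) / final_div_factor at the last step.
    """
    if not 0 <= step < total_steps:
        raise ValueError("step must satisfy 0 <= step < total_steps")
    initial_lr = max_lr / div_factor
    min_lr = initial_lr / final_div_factor
    warmup_steps = max(1, round(pct_start * total_steps))
    if step < warmup_steps:
        fraction = step / warmup_steps
        return initial_lr + fraction * (max_lr - initial_lr)
    # Fraction of the annealing phase completed, reaching 1.0 at the last step.
    fraction = (step - warmup_steps) / max(1, total_steps - warmup_steps - 1)
    return min_lr + 0.5 * (max_lr - min_lr) * (1.0 + math.cos(math.pi * fraction))


if __name__ == "__main__":
    schedule = [one_cycle_lr(s, 100) for s in range(100)]
    print(f"start={schedule[0]:.2e} peak={max(schedule):.2e} end={schedule[-1]:.2e}")
```

The single rise and fall lets training use a large learning rate for a short period in the middle of the run, which is the usual motivation given for faster convergence with this kind of schedule.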

preprint2020arXiv

Automation of the AST3 optical sky survey from Dome A, Antarctica

The 0.5 m Antarctic Survey Telescopes (AST3) were designed for time-domain optical/infrared astronomy. They are located in Dome A, Antarctica, where they can take advantage of the continuous dark time during winter. Since the site is unattended in winter, everything for the operation, from observing to data reduction, had to be fully automated. Here, we present a brief overview of the AST3 project and some of its unique characteristics due to its location in Antarctica. We summarise the various components of the survey, including the customized hardware and software, that make complete automation possible.

preprint2020arXiv

CYRA: the cryogenic infrared spectrograph for the Goode Solar Telescope in Big Bear

CYRA (CrYogenic solar spectrogRAph) is a facility instrument of the 1.6-meter Goode Solar Telescope (GST) at the Big Bear Solar Observatory (BBSO). CYRA focuses on the study of the near-infrared solar spectrum between 1 and 5 microns, an underexplored region which is not only a fertile ground for photospheric magnetic diagnostics, but also allows a unique window into the chromosphere lying atop the photosphere. CYRA is the first ever fully cryogenic spectrograph in any solar observatory, with its two predecessors, on the McMath-Pierce and Mees Telescopes, being based on warm optics except for the detectors and order sorting filters. CYRA is used to probe magnetic fields in various solar features and the quiet photosphere. CYRA measurements will allow new and better 3D extrapolations of the solar magnetic field and will provide more accurate boundary conditions for solar activity models. Superior spectral resolution of 150,000 and better allows enhanced observations of the chromosphere in the carbon monoxide (CO) spectral bands and will yield a better understanding of energy transport in the solar atmosphere. CYRA is divided into two optical sub-systems: the Fore-Optics Module and the Spectrograph. The Spectrograph is the heart of the instrument and contains the IR detector, grating, slits, filters, and imaging optics all in a cryogenically cooled Dewar (cryostat). The detector is a 2048 by 2048 pixel HAWAII 2 array produced by Teledyne Scientific & Imaging, LLC. The interior of the cryostat and the readout electronics are maintained at 90 Kelvin by helium refrigerant based cryo-coolers, while the IR array is cooled to 30 Kelvin. The Fore-Optics Module de-rotates and stabilizes the solar image, provides scanning capabilities, and transfers the GST image to the Spectrograph. CYRA has been installed and is undergoing its commissioning phase.

preprint2020arXiv

Detecting chirality in two-terminal electronic devices

Central to spintronics is the interconversion between electronic charge and spin currents, and this can arise from the chirality-induced spin selectivity (CISS) effect. CISS is often studied as magnetoresistance (MR) in two-terminal (2T) electronic devices containing a chiral (molecular) component and a ferromagnet. However, fundamental understanding of when and how this MR can occur is lacking. Here, we uncover an elementary mechanism that generates such a MR for nonlinear response. It requires energy-dependent transport and energy relaxation within the device. The sign of the MR depends on chirality, charge carrier type, and bias direction. Additionally, we reveal how CISS can be detected in the linear response regime in magnet-free 2T devices, either by forming a chirality-based spin-valve using two or more chiral components, or by Hanle spin precession in devices with a single chiral component. Our results provide operation principles and design guidelines for chirality-based spintronic devices and technologies.

preprint2020arXiv

Discovery of segmented Fermi surface induced by Cooper pair momentum

Since the early days of Bardeen-Cooper-Schrieffer theory, it has been predicted that a sufficiently large supercurrent can close the energy gap in a superconductor and create gapless Bogoliubov quasiparticles through the Doppler shift of quasiparticle energy due to the Cooper pair momentum. In this gapless superconducting state, zero-energy quasiparticles reside on a segment of the normal state Fermi surface, while its remaining part is still gapped. The finite density of states of field-induced quasiparticles, known as the Volovik effect, has been observed in tunneling and specific heat measurements on d- and s-wave superconductors. However, the segmented Fermi surface of a finite-momentum state carrying a supercurrent has never been detected directly. Here we use the quasiparticle interference (QPI) technique to image the field-controlled Fermi surface of Bi$_2$Te$_3$ thin films proximitized by the superconductor NbSe$_2$. By applying a small in-plane magnetic field, a screening supercurrent is induced which leads to finite-momentum pairing on topological surface states of Bi$_2$Te$_3$. Our measurements and analysis reveal the strong impact of finite Cooper pair momentum on the quasiparticle spectrum, and thus pave the way for STM study of pair density wave and FFLO states in unconventional superconductors.

preprint2020arXiv

Incremental Embedding Learning via Zero-Shot Translation

Modern deep learning methods have achieved great success in machine learning and computer vision by learning from a set of pre-defined datasets. However, these methods perform unsatisfactorily when applied to real-world situations. The reason is that learning new tasks leads the trained model to quickly forget the knowledge of old tasks, which is referred to as catastrophic forgetting. Current state-of-the-art incremental learning methods tackle the catastrophic forgetting problem in traditional classification networks and ignore the problem in embedding networks, which are the basic networks for image retrieval, face recognition, zero-shot learning, etc. Unlike in traditional incremental classification networks, the semantic gap between the embedding spaces of two adjacent tasks is the main challenge for embedding networks under the incremental learning setting. Thus, we propose a novel class-incremental method for embedding networks, named the zero-shot translation class-incremental method (ZSTCI), which leverages zero-shot translation to estimate and compensate for the semantic gap without any exemplars. Then, we try to learn a unified representation for two adjacent tasks in the sequential learning process, which captures the relationships between previous classes and current classes precisely. In addition, ZSTCI can easily be combined with existing regularization-based incremental learning methods to further improve the performance of embedding networks. We conduct extensive experiments on CUB-200-2011 and CIFAR100, and the experimental results demonstrate the effectiveness of our method. The code of our method has been released.

preprint2020arXiv

Large spin to charge conversion in topological superconductor β-PdBi2 at room temperature

β-PdBi2 has attracted much attention for its prospective ability to possess simultaneously topological surface and superconducting states due to its unprecedented spin-orbit interaction (SOC). Whereas most works have focused solely on investigating its topological surface states, the coupling between spin and charge degrees of freedom in this class of quantum material remains unexplored. Here we first report a study of spin-to-charge conversion in a β-PdBi2 ultrathin film grown by molecular beam epitaxy, utilizing a spin pumping technique to perform inverse spin Hall effect measurements. We find the room-temperature spin Hall angle of Fe/β-PdBi2 to be θ_SH = 0.037. This value is one order of magnitude larger than that of reported conventional superconductors, and is comparable to that of the best SOC metals and topological insulators. Our results provide an avenue for developing superconductor-based spintronic applications.

preprint2020arXiv

Night-time measurements of astronomical seeing at Dome A in Antarctica

Seeing, the angular size of stellar images blurred by atmospheric turbulence, is a critical parameter used to assess the quality of astronomical sites. Median values at the best mid-latitude sites are generally in the range of 0.6-0.8 arcsec. Sites on the Antarctic plateau are characterized by comparatively weak turbulence in the free-atmosphere above a strong but thin boundary layer. The median seeing at Dome C is estimated to be 0.23-0.36 arcsec above a boundary layer that has a typical height of 30 m. At Dome A and F, the only previous seeing measurements were made during daytime. Here we report the first direct measurements of night-time seeing at Dome A, using a Differential Image Motion Monitor. Located at a height of just 8 m, it recorded seeing as low as 0.13 arcsec, and provided seeing statistics that are comparable to those for a 20 m height at Dome C. It indicates that the boundary layer was below 8 m 31% of the time. At such times the median seeing was 0.31 arcsec, consistent with free-atmosphere seeing. The seeing and boundary layer thickness are found to be strongly correlated with the near-surface temperature gradient. The correlation confirms a median thickness of approximately 14 m for the boundary layer at Dome A, as found from a sonic radar. The thinner boundary layer makes it less challenging to locate a telescope above it, thereby giving greater access to the free-atmosphere.

preprint2020arXiv

Nonreciprocal directional dichroism induced by a temperature gradient as a probe for mobile spin dynamics in quantum magnets

Novel states of matter in quantum magnets, such as quantum spin liquids, have attracted considerable interest recently. Despite the existence of plenty of candidate materials, there is no confirmed quantum spin liquid, largely due to the lack of proper experimental probes. For instance, spectroscopy experiments like neutron scattering receive contributions from disorder-induced local modes, while thermal transport experiments receive contributions from phonons. Here we propose a thermo-optic experiment which directly probes the mobile magnetic excitations in spatial-inversion symmetric and/or time-reversal symmetric Mott insulators: the temperature-gradient-induced nonreciprocal directional dichroism (TNDD) spectroscopy. Unlike traditional probes, TNDD directly detects mobile magnetic excitations and decouples from phonons and local magnetic modes.

preprint2020arXiv

Rapid Evolution of Type II Spicules Observed in Goode Solar Telescope On-Disk H-alpha Images

We analyze ground-based chromospheric data acquired at a high temporal cadence of 2 s in the wings of the H$α$ spectral line using the Goode Solar Telescope (GST) operating at the Big Bear Solar Observatory. We inspected a 30 minute long H$α$-0.08 nm data set to find that rapid blue-shifted H$α$ excursions (RBEs), which are a cool component of type II spicules, experience very rapid morphological changes on time scales of the order of 1 second. Unlike typical reconnection jets, RBEs very frequently appear in situ without any clear evidence of H$α$ material being injected from below. Their evolution includes inverted "Y", "V", "N", and parallel splitting (doubling) patterns as well as sudden formation of a diffuse region followed by branching. We also find that the same feature may undergo several splitting episodes within a time interval of about 1 min.

preprint2020arXiv

Reply to "Comment on 'Spin-dependent electron transmission model for chiral molecules in mesoscopic devices'"

Here we emphasize once more the distinction between generating CISS (spin-charge current conversion) in a chiral system and detecting it as magnetoresistance in two-terminal electronic devices. We also highlight important differences between electrical measurement results obtained in the linear response regime and those obtained in the nonlinear regime.

preprint2020arXiv

SPDEs with non-Lipschitz coefficients and nonhomogeneous boundary conditions

In this paper we establish strong existence, pathwise uniqueness and a comparison theorem for a stochastic partial differential equation driven by Gaussian colored noise, with non-Lipschitz drift and Hölder continuous diffusion coefficients, on the finite spatial interval $[0,1]$, and with Dirichlet, Neumann or mixed nonhomogeneous random conditions imposed on the endpoints. The Hölder continuity of the solution in both the time and space variables is also studied.

preprint2020arXiv

Unfitted Nitsche's method for computing wave modes in topological materials

In this paper, we propose an unfitted Nitsche's method for computing wave modes in topological materials. The proposed method is based on Nitsche's technique to study the performance-enhanced topological materials which have strongly heterogeneous structures (e.g., the refractive index is piecewise constant with high contrasts). For periodic bulk materials, we use Floquet-Bloch theory and solve an eigenvalue problem on a torus with unfitted meshes. For the materials with a line defect, a sufficiently large domain with zero boundary conditions is used to compute the localized eigenfunctions corresponding to the edge modes. The interfaces are handled by Nitsche's method on an unfitted uniform mesh. We prove the proposed methods converge optimally, and present numerical examples to validate the theoretical results and demonstrate the capability of simulating topological materials.

preprint2019arXiv

TBC-Net: A real-time detector for infrared small target detection using semantic constraint

Infrared small target detection is a key technique in infrared search and tracking (IRST) systems. Although deep learning has recently been widely used in vision tasks on visible light images, it is rarely used in infrared small target detection due to the difficulty of learning small target features. In this paper, we propose a novel lightweight convolutional neural network, TBC-Net, for infrared small target detection. TBC-Net consists of a target extraction module (TEM) and a semantic constraint module (SCM), which are used to extract small targets from infrared images and to classify the extracted target images during training, respectively. We also propose a joint loss function and a training method. The SCM imposes a semantic constraint on the TEM by incorporating the high-level classification task, which addresses the difficulty of learning features caused by class imbalance. During training, the targets are extracted from the input image and are then classified by the SCM. During inference, only the TEM is used to detect the small targets. We also propose a data synthesis method to generate training data. The experimental results show that, compared with traditional methods, TBC-Net better reduces the false alarms caused by complicated backgrounds, and that the proposed network structure and joint loss significantly improve small target feature learning. In addition, TBC-Net can achieve real-time detection on the NVIDIA Jetson AGX Xavier development board, which is suitable for applications such as field research with drones equipped with infrared sensors.

preprint2016arXiv

A distribution-function-valued SPDE and its applications

In this paper we further study the stochastic partial differential equation first proposed by Xiong (2013). Under localized conditions on the coefficients we show that the solution is in fact distribution-function-valued and we establish the pathwise uniqueness of the solution. As applications we obtain the well-posedness of the martingale problems for two classes of measure-valued diffusions: interacting super-Brownian motions and interacting Fleming-Viot processes. Properties of the two superprocesses such as the existence of density fields and the extinction behaviors are also studied.

preprint2016arXiv

ELM control with RMP: plasma response models and the role of edge peeling response

Resonant magnetic perturbations (RMP) have been extensively demonstrated as a plausible technique for mitigating or suppressing large edge localized modes (ELMs). Associated with this is a substantial amount of theory and modelling effort in recent years. Various models describing the plasma response to the RMP fields have been proposed in the literature, and are briefly reviewed in this work. Despite their simplicity, linear response models can provide criteria, alternative to the vacuum-field-based criteria, for guiding the choice of coil configurations to achieve the best control of ELMs. The role of the edge peeling response to the RMP fields is illustrated as a key indicator for ELM mitigation in low collisionality plasmas, in various tokamak devices.

preprint2016arXiv

Extended relativistic configuration interaction and many-body perturbation calculations of spectroscopic data for the $n \leq 6$ configurations in Ne-like ions between Cr XV and Kr XXVII

Level energies, wavelengths, electric dipole, magnetic dipole, electric quadrupole, and magnetic quadrupole transition rates, oscillator strengths, and line strengths from combined relativistic configuration interaction and many-body perturbation calculations are reported for the 201 fine-structure states of the $2s^2 2p^6$, $2s^2 2p^5 3l$, $2s 2p^6 3l$, $2s^2 2p^5 4l$, $2s 2p^6 4l$, $2s^2 2p^5 5l$, and $2s^2 2p^5 6l$ configurations in all Ne-like ions between Cr XV and Kr XXVII. Calculated level energies and transition data are compared with experiments from the NIST and CHIANTI databases, and other recent benchmark calculations. The mean energy difference with the NIST experiments is only 0.05%. The present calculations significantly increase the amount of accurate spectroscopic data for the $n >3$ states in a number of Ne-like ions of astrophysical interest. A complete dataset should be helpful in analyzing new observations from the solar and other astrophysical sources, and is also likely to be useful for modeling and diagnosing a variety of plasmas including astronomical and fusion plasmas.

preprint2016arXiv

Frozen Gaussian approximation for high frequency wave propagation in periodic media

Propagation of high-frequency waves in periodic media is a challenging problem due to the existence of multiple scales characterized by short wavelength, small lattice constant and large physical domain size. Conventional computational methods are extremely expensive, especially in high dimensions. In this paper, based on Bloch decomposition and asymptotic analysis in the phase space, we derive the frozen Gaussian approximation for high-frequency wave propagation in periodic media and establish its convergence to the true solution. The formulation leads to efficient numerical algorithms, which are presented in a companion paper [Delgadillo, Lu and Yang, arXiv:1509.05552].

preprint2016arXiv

Gauge-invariant frozen Gaussian approximation method for the Schrödinger equation with periodic potentials

We develop a gauge-invariant frozen Gaussian approximation (GIFGA) method for the linear Schrödinger equation (LSE) with periodic potentials in the semiclassical regime. The method generalizes the Herman-Kluk propagator for LSE to the case with periodic media. It provides an efficient computational tool based on asymptotic analysis on phase space and Bloch waves to capture the high-frequency oscillations of the solution. Compared to geometric optics and Gaussian beam methods, GIFGA works in both scenarios of caustics and beam spreading. Moreover, it is invariant with respect to the gauge choice of the Bloch eigenfunctions, and thus avoids the numerical difficulty of computing gauge-dependent Berry phase. We numerically test the method by several one-dimensional examples, in particular, the first order convergence is validated, which agrees with our companion analysis paper [Delgadillo, Lu and Yang, arXiv:1504.08051].

preprint2016arXiv

Gradient recovery for elliptic interface problem: I. body-fitted mesh

In this paper, we propose a novel gradient recovery method for elliptic interface problems using body-fitted meshes in two dimensions. Due to the lack of regularity of the solution at the interface, standard gradient recovery methods fail to give superconvergent results, and thus lead to overrefinement when used as a posteriori error estimators. This drawback is overcome by designing an immersed gradient recovery operator in our method. We prove the superconvergence of this method for both mildly unstructured meshes and adaptive meshes, and present several numerical examples to verify the superconvergence and its robustness as an a posteriori error estimator.

preprint2016arXiv

Maximum likelihood type estimation for discretely observed CIR model with small $α$-stable noises

A maximum likelihood type estimation of the drift and volatility coefficient parameters in the CIR type model driven by $α$-stable noises is studied when the dispersion parameter $\varepsilon\to0$ and the discrete observations frequency $n\to\infty$ simultaneously.

preprint2016arXiv

Pathwise uniqueness for a SPDE with Hölder continuous coefficient driven by α-stable noise

In this paper we study the pathwise uniqueness of solution to the following stochastic partial differential equation (SPDE) with Hölder continuous coefficient: \begin{eqnarray*} \frac{\partial X_t(x)}{\partial t}=\frac{1}{2} ΔX_t(x) +G(X_t(x))+H(X_{t-}(x)) \dot{L}_t(x),~~~ t>0, ~x\in\mathbb{R}, \end{eqnarray*} where $\dot{L}$ denotes an $α$-stable white noise on $\mathbb{R}_+\times \mathbb{R}$ without negative jumps, $G$ satisfies the Lipschitz condition and $H$ is nondecreasing and $β$-Hölder continuous for $1<α<2$ and $0<β<1$. For $G\equiv0$ and $H(x)=x^β$, in Mytnik (2002) a weak solution to the above SPDE was constructed and the pathwise uniqueness of the solution was left as an open problem. In this paper we give an affirmative answer to this problem for certain values of $α$ and $β$. In particular, for $αβ=1$, where the solution to the above equation is the density of a super-Brownian motion with $α$-stable branching (see also Mytnik (2002)), our result leads to its pathwise uniqueness for $1<α<\sqrt{5}-1$. The local Hölder continuity of the solution is also obtained in this paper for fixed time $t>0$.

preprint2016arXiv

Schwinger boson spin liquid states on square lattice

We study possible spin liquids on square lattice that respect all lattice symmetries and time-reversal symmetry within the framework of Schwinger boson (mean-field) theory. Such spin liquids have spin gap and emergent Z_2 gauge field excitations. We classify them by the projective symmetry group method, and find six spin liquid states that are potentially relevant to the J_1-J_2 Heisenberg model. The properties of these states are studied under mean-field approximation. Interestingly we find a spin liquid state that can go through continuous phase transitions to either the Néel magnetic order or magnetic orders of the wavevector at Brillouin zone edge center. We also discuss the connection between our results and the Abrikosov fermion spin liquids.

preprint2015arXiv

Electronic structure of Li$_{1+x}$[Mn$_{0.5}$Ni$_{0.5}$]$_{1-x}$O$_2$ studied by photoemission and x-ray absorption spectroscopy

We have studied the electronic structure of Li$_{1+x}$[Mn$_{0.5}$Ni$_{0.5}$]$_{1-x}$O$_2$ ($x$ = 0.00 and 0.05), one of the promising cathode materials for Li ion battery, by means of x-ray photoemission and absorption spectroscopy. The results show that the valences of Mn and Ni are basically 4+ and 2+, respectively. However, the Mn$^{3+}$ component in the $x$ = 0.00 sample gradually increases with the bulk sensitivity of the experiment, indicating that the Jahn-Teller active Mn$^{3+}$ ions are introduced in the bulk due to the site exchange between Li and Ni. The Mn$^{3+}$ component gets negligibly small in the $x$ = 0.05 sample, which indicates that the excess Li suppresses the site exchange and removes the Jahn-Teller active Mn$^{3+}$.

preprint2015arXiv

Face Photo Sketch Synthesis via Larger Patch and Multiresolution Spline

Face photo sketch synthesis has attracted researchers' attention in recent years because of its potential applications in digital entertainment and law enforcement. Several patch-based methods have been proposed to solve this problem. These methods usually focus more on how to get a sketch patch for a given photo patch than on how to blend the generated patches. However, without an appropriate blending method, jagged parts and mottled points appear in the resulting face sketch. To obtain a smoother sketch, we propose a new method that reduces such jagged parts and mottled points. In our system, we first use an existing method, Markov Random Fields (MRF), to produce a crude face sketch. This crude face sketch is then divided into larger patches and retrained by Non-Negative Matrix Factorization (NMF). Finally, we use Multiresolution Spline and a blending trick named the full-coverage trick to blend these retrained patches. The experimental results show that, compared with some previous methods, our method produces a smoother face sketch.

preprint2014arXiv

A Weighted Common Subgraph Matching Algorithm

We propose a weighted common subgraph (WCS) matching algorithm to find the most similar subgraphs in two labeled weighted graphs. WCS matching, as a natural generalization of the equal-sized graph matching or subgraph matching, finds wide applications in many computer vision and machine learning tasks. In this paper, the WCS matching is first formulated as a combinatorial optimization problem over the set of partial permutation matrices. Then it is approximately solved by a recently proposed combinatorial optimization framework - Graduated NonConvexity and Concavity Procedure (GNCCP). Experimental comparisons on both synthetic graphs and real world images validate its robustness against noise level, problem size, outlier number, and edge density.
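For readers unfamiliar with the feasible set mentioned above, the sketch below checks whether a 0/1 matrix is a partial permutation matrix under the standard definition (every row and every column contains at most one 1) and reads off the node correspondences it encodes. The function names are hypothetical, and the sketch covers only the constraint set, not the WCS objective or the GNCCP solver.

```python
def is_partial_permutation(matrix):
    """Return True if matrix is a rectangular 0/1 matrix whose rows and
    columns each contain at most one 1."""
    rows = [list(r) for r in matrix]
    if not rows or any(len(r) != len(rows[0]) for r in rows):
        return False
    if any(value not in (0, 1) for r in rows for value in r):
        return False
    rows_ok = all(sum(r) <= 1 for r in rows)
    cols_ok = all(sum(c) <= 1 for c in zip(*rows))
    return rows_ok and cols_ok


def matched_pairs(matrix):
    """List (i, j) pairs where node i of the first graph is matched to
    node j of the second graph."""
    return [(i, j)
            for i, row in enumerate(matrix)
            for j, value in enumerate(row)
            if value == 1]


if __name__ == "__main__":
    assignment = [[1, 0, 0],
                  [0, 0, 1]]
    print(is_partial_permutation(assignment), matched_pairs(assignment))
```

Because rows and columns may be left entirely zero, such a matrix can leave nodes in either graph unmatched, which is what allows the two graphs to differ in size and contain outliers.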

preprint2014arXiv

Raman-gain induced loss-compensation in whispering-gallery-microresonators and single-nanoparticle detection with whispering-gallery Raman-microlasers

Recently, optical whispering-gallery-mode resonators (WGMRs) have emerged as promising platforms to achieve label-free detection of nanoscale objects and to reach single molecule sensitivity. The ultimate detection performance of WGMRs is limited by energy dissipation in the material they are fabricated from. To date, to improve the detection limit, either rare-earth ions are doped into the WGMR to compensate losses or plasmonic resonances are exploited for their superior field confinement. Here, we demonstrate, for the first time, enhanced detection of single-nanoparticle induced mode-splitting in a silica WGMR via Raman-gain assisted loss-compensation and WGM Raman lasing. Notably, we detected and counted individual dielectric nanoparticles down to a record low radius of 10 nm by monitoring a beatnote signal generated when split Raman lasing lines are heterodyne-mixed at a photodetector. This dopant-free scheme retains the inherited biocompatibility of silica, and could find widespread use for sensing in biological media. It also opens the possibility of using intrinsic Raman or parametric gain in other systems, where dissipation hinders the progress of the field and limits applications.

preprint2014arXiv

Reversible self-Kerr-nonlinearity in an N-type atomic system through a switching field

We investigate the self-Kerr nonlinearity of a four-level N-type atomic system in 87Rb and observe its reversible property with the unidirectional increase of the switching field. For the laser arrangement that the probe field interacts with the middle two states, the slope and the sign of the self-Kerr nonlinearity around the atomic resonance can not only be changed from negative to positive, but also can be changed to negative again with the unidirectional increasing of the switching field. Numerical simulation agrees very well with the experimental results and dressed state analysis is presented to explain the experimental results.

preprint2013arXiv

A pathway-based mean-field model for E. coli chemotaxis: Mathematical derivation and Keller-Segel limit

A pathway-based mean-field theory (PBMFT) was recently proposed for E. coli chemotaxis in [G. Si, T. Wu, Q. Quyang and Y. Tu, Phys. Rev. Lett., 109 (2012), 048101]. In this paper, we derive a new moment system of PBMFT by using the moment closure technique in kinetic theory under the assumption that the methylation level is locally concentrated. The new system is hyperbolic with linear convection terms. Under certain assumptions, the new system can recover the original model. In particular, the assumption on the methylation difference made there can be understood explicitly in this new moment system. We obtain the Keller-Segel limit by taking into account the different physical time scales of tumbling, adaptation and the experimental observations. We also present numerical evidence to show the quantitative agreement of the moment system with the individual based E. coli chemotaxis simulator.

preprint2013arXiv

Post-selection free, integrated optical source of non-degenerate, polarization entangled photon pairs

We present an integrated source of polarization-entangled photon pairs in the telecom regime, based on type-II phase-matched parametric down-conversion (PDC) in a Ti-indiffused waveguide in periodically poled lithium niobate. The domain grating, consisting of an interlaced bi-periodic structure, is engineered to provide simultaneous phase-matching of two PDC processes and enables the direct generation of non-degenerate, polarization-entangled photon pairs with a brightness of $B=7\times10^3$ pairs/(s mW GHz). The spatial separation of the photon pairs is accomplished by a fiber-optical multiplexer, which keeps the overall source highly compact. Visibilities exceeding 95% and a violation of the Bell inequality with $S=2.57\pm0.06$ were demonstrated.

preprint2013arXiv

Seismic modeling using the frozen Gaussian approximation

We adopt the frozen Gaussian approximation (FGA) for modeling seismic waves. The method belongs to the category of ray-based beam methods. It decomposes the seismic wavefield into a set of Gaussian functions and propagates these Gaussian functions along appropriate ray paths. As opposed to the classic Gaussian-beam method, FGA keeps the Gaussians frozen (at a fixed width) during the propagation process and adjusts their amplitudes to produce an accurate approximation after summation. We perform the initial decomposition of seismic data using a fast version of the Fourier-Bros-Iagolnitzer (FBI) transform and propagate the frozen Gaussian beams numerically using ray tracing. A test using a smoothed Marmousi model confirms the validity of FGA for accurate modeling of seismic wavefields.
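The ray-tracing step mentioned in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes a 2-D acoustic medium with the Hamiltonian H(q, p) = c(q)|p|, integrates the ray equations with a classical fourth-order Runge-Kutta scheme, and approximates the velocity gradient by central differences. The function name `trace_ray` and all parameters are hypothetical, and the amplitude evolution that FGA also requires is omitted.

```python
import math


def trace_ray(q0, p0, velocity, t_end, n_steps=1000, h=1e-6):
    """Trace one ray for H(q, p) = c(q) * |p| in two dimensions.

    Integrates dq/dt = c * p / |p| and dp/dt = -|p| * grad c with RK4.
    `velocity` is a callable c(x, z); `h` is the finite-difference step
    used to approximate its gradient.
    """
    def rhs(state):
        x, z, px, pz = state
        norm_p = math.hypot(px, pz)
        c = velocity(x, z)
        # Central-difference approximation of the velocity gradient.
        dc_dx = (velocity(x + h, z) - velocity(x - h, z)) / (2 * h)
        dc_dz = (velocity(x, z + h) - velocity(x, z - h)) / (2 * h)
        return (c * px / norm_p, c * pz / norm_p,
                -norm_p * dc_dx, -norm_p * dc_dz)

    def rk4_step(state, dt):
        k1 = rhs(state)
        k2 = rhs(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)))
        k3 = rhs(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)))
        k4 = rhs(tuple(s + dt * k for s, k in zip(state, k3)))
        return tuple(
            s + dt * (s1 + 2 * s2 + 2 * s3 + s4) / 6
            for s, s1, s2, s3, s4 in zip(state, k1, k2, k3, k4)
        )

    state = (q0[0], q0[1], p0[0], p0[1])
    dt = t_end / n_steps
    for _ in range(n_steps):
        state = rk4_step(state, dt)
    return state[:2], state[2:]


# Usage: in a medium whose velocity grows with depth z, the ray bends
# toward lower velocity while c(q) * |p| stays (nearly) constant.
position, momentum = trace_ray(
    (0.0, 1.0), (0.6, 0.8), lambda x, z: 1.0 + 0.5 * z, t_end=1.0
)
```

In an FGA-style method, each Gaussian's center and momentum would follow such a ray while its width stays fixed; the sketch covers only that center trajectory.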

preprint2011arXiv

Frozen Gaussian approximation for general linear strictly hyperbolic system: formulation and Eulerian methods

The frozen Gaussian approximation, proposed in [Lu and Yang, [15]], is an efficient computational tool for high frequency wave propagation. In this paper we continue its development. The frozen Gaussian approximation is extended to general linear strictly hyperbolic systems. Eulerian methods based on the frozen Gaussian approximation are developed to overcome the divergence problem of Lagrangian methods. The proposed Eulerian methods can also be used for the Herman-Kluk propagator in quantum mechanics. Numerical examples verify the performance of the proposed methods.

preprint2011arXiv

Frozen Gaussian approximation for high frequency wave propagation

We propose the frozen Gaussian approximation for computation of high frequency wave propagation. This method approximates the solution to the wave equation by an integral representation. It provides a highly efficient computational tool based on the asymptotic analysis on the phase plane. Compared to geometric optics, it provides a valid solution around caustics. Compared to the Gaussian beam method, it not only overcomes the drawback of beam spreading but also improves the asymptotic accuracy. We give several numerical examples to verify that the frozen Gaussian approximation performs well in the presence of caustics and when the Gaussian beam spreads.
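As a rough guide to what "an integral representation" means here, the following sketches the general shape such a frozen Gaussian ansatz takes for one Hamiltonian branch in $d$ dimensions. The symbols ($\varepsilon$ for the small wavelength parameter, $a$ for the amplitude, $(Q,P)$ for the ray position and momentum started from $(q,p)$, $u_0$ for the initial data) are notational assumptions for this sketch and are not taken from the abstract.

$$u^{\varepsilon}_{\mathrm{FGA}}(t,x) = \frac{1}{(2\pi\varepsilon)^{3d/2}} \int_{\mathbb{R}^{3d}} a(t,q,p)\, e^{\frac{i}{\varepsilon}\Phi(t,x,y,q,p)}\, u_0(y)\, dy\, dp\, dq$$

$$\Phi(t,x,y,q,p) = P\cdot(x-Q) - p\cdot(y-q) + \frac{i}{2}\,|x-Q|^2 + \frac{i}{2}\,|y-q|^2$$

$$\frac{dQ}{dt} = \partial_P H(Q,P), \qquad \frac{dP}{dt} = -\partial_Q H(Q,P), \qquad Q(0)=q,\quad P(0)=p$$

For the wave equation with speed $c$, one would sum two such branches with $H = \pm c(Q)\,|P|$. The imaginary quadratic terms in $\Phi$ give each Gaussian a fixed width, which is the "frozen" feature. The amplitude $a$ satisfies its own evolution equation, which is not reproduced here.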

preprint2011arXiv

Mössbauer study of the field induced uniaxial anisotropy in electro-deposited FeCo alloy films

Thin ferromagnetic films with in-plane magnetic anisotropy are promising materials for obtaining high microwave permeability. The paper reports on a Mössbauer study of the field-induced in-plane uniaxial anisotropy in electro-deposited FeCo alloy films. The FeCo alloy films were prepared by electro-deposition with and without an external magnetic field applied parallel to the film plane during deposition. Vibrating sample magnetometry and Mössbauer spectroscopy measurements at room temperature indicate that the film deposited in an external field shows an in-plane uniaxial anisotropy, with an easy direction that coincides with the external field direction and a hard direction perpendicular to it, whereas the film deposited without an external field does not show any in-plane anisotropy. Mössbauer spectra taken in three geometric arrangements show that the magnetic moments are almost constrained to the film plane for the film deposited with an applied magnetic field. The magnetic moments also tend to align along the direction of the external magnetic field applied during deposition, indicating that the observed anisotropy should be attributed to directional ordering of atomic pairs.

preprint2010arXiv

Convergence of frozen Gaussian approximation for high frequency wave propagation

The frozen Gaussian approximation provides a highly efficient computational method for high frequency wave propagation. The derivation of the method is based on asymptotic analysis. In this paper, for general linear strictly hyperbolic systems, we establish a rigorous convergence result for the frozen Gaussian approximation. As a byproduct, a higher order frozen Gaussian approximation is developed.

preprint2010arXiv

Effective Maxwell equations from time-dependent density functional theory

The behavior of interacting electrons in a perfect crystal under macroscopic external electric and magnetic fields is studied. Effective Maxwell equations for the macroscopic electric and magnetic fields are derived starting from time-dependent density functional theory. Effective permittivity and permeability coefficients are obtained.

63 published item(s)

Rethinking LLM Ensembling from the Perspective of Mixture Models

Capacity Results for Multiple-Input Multiple-Output Optical Wireless Communication With Per-Antenna Intensity Constraints

Pairing Symmetry and Fermion Projective Symmetry Groups

A Distributed Implementation of Steady-State Kalman Filter

Auto-Encoding Score Distribution Regression for Action Quality Assessment

Automatically Discovering Novel Visual Categories with Self-supervised Prototype Learning

Chromospheric recurrent jets in a sunspot group and their inter-granular origin

Computation of the Time-Dependent Dirac Equation with Physics-Informed Neural Networks

Deep Neural Networks for Creating Reliable PmP Database with a Case Study in Southern California

Electromagnetically induced transparency in inhomogeneously broadened divacancy defect ensembles in SiC

EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching

MemoNav: Selecting Informative Memories for Visual Navigation

Nontrivial Solutions of Dirac-Laplace Equation on Compact Spin Manifolds

Observations of pores and surrounding regions with CO 4.66 μm lines by BBSO/CYRA

On Color Isomorphic Pairs in Proper Edge Colourings of Complete Graphs

On Coupled Dirac Systems under Boundary Condition

On the extinction-extinguishing dichotomy for a stochastic Lotka-Volterra type population dynamical system

Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning

The Co-alignment of Winged Hα Data Observed by the New Vacuum Solar Telescope

Towards Unbiased Visual Emotion Recognition via Causal Intervention

Weakly Aligned Feature Fusion for Multimodal Object Detection

Causal Attention for Vision-Language Tasks

Classical limit for the varying-mass Schrödinger equation with random inhomogeneities

Cloud Cover and Aurora Contamination at Dome A in 2017 from KLCAM

Deep unfitted Nitsche method for elliptic interface problems

Doubly Contrastive Deep Clustering

Multi-Passband Observations of A Solar Flare over the He I 10830 Å line

Automated Pavement Crack Segmentation Using U-Net-based Convolutional Neural Network

Automation of the AST3 optical sky survey from Dome A, Antarctica

CYRA: the cryogenic infrared spectrograph for the Goode Solar Telescope in Big Bear

Detecting chirality in two-terminal electronic devices

Discovery of segmented Fermi surface induced by Cooper pair momentum

Incremental Embedding Learning via Zero-Shot Translation

Large spin to charge conversion in topological superconductor β-PdBi2 at room temperature

Night-time measurements of astronomical seeing at Dome A in Antarctica

Nonreciprocal directional dichroism induced by a temperature gradient as a probe for mobile spin dynamics in quantum magnets

Rapid Evolution of Type II Spicules Observed in Goode Solar Telescope On-Disk H-alpha Images

Reply to "Comment on 'Spin-dependent electron transmission model for chiral molecules in mesoscopic devices'"

SPDEs with non-Lipschitz coefficients and nonhomogeneous boundary conditions

Unfitted Nitsche's method for computing wave modes in topological materials

TBC-Net: A real-time detector for infrared small target detection using semantic constraint

A distribution-function-valued SPDE and its applications

ELM control with RMP: plasma response models and the role of edge peeling response

Extended relativistic configuration interaction and many-body perturbation calculations of spectroscopic data for the $n \leq 6$ configurations in Ne-like ions between Cr XV and Kr XXVII

Frozen Gaussian approximation for high frequency wave propagation in periodic media

Gauge-invariant frozen Gaussian approximation method for the Schrödinger equation with periodic potentials

Gradient recovery for elliptic interface problem: I. body-fitted mesh

Maximum likelihood type estimation for discretely observed CIR model with small α-stable noises

Pathwise uniqueness for a SPDE with Hölder continuous coefficient driven by α-stable noise

Schwinger boson spin liquid states on square lattice

Electronic structure of Li$_{1+x}$[Mn$_{0.5}$Ni$_{0.5}$]$_{1-x}$O$_2$ studied by photoemission and x-ray absorption spectroscopy

Face Photo Sketch Synthesis via Larger Patch and Multiresolution Spline

A Weighted Common Subgraph Matching Algorithm

Raman-gain induced loss-compensation in whispering-gallery-microresonators and single-nanoparticle detection with whispering-gallery Raman-microlasers

Reversible self-Kerr-nonlinearity in an N-type atomic system through a switching field

A pathway-based mean-field model for E. coli chemotaxis: Mathematical derivation and Keller-Segel limit

Post-selection free, integrated optical source of non-degenerate, polarization entangled photon pairs

Seismic modeling using the frozen Gaussian approximation

Frozen Gaussian approximation for general linear strictly hyperbolic system: formulation and Eulerian methods

Frozen Gaussian approximation for high frequency wave propagation

Mössbauer study of the field induced uniaxial anisotropy in electro-deposited FeCo alloy films

Convergence of frozen Gaussian approximation for high frequency wave propagation

Effective Maxwell equations from time-dependent density functional theory