Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
41works
0followers
22topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

41 published item(s)

preprint2026arXiv

Rethinking LLM Ensembling from the Perspective of Mixture Models

Model ensembling is a well-established technique for improving the performance of machine learning models. Conventionally, this involves averaging the output distributions of multiple models and selecting the most probable label. This idea has been naturally extended to large language models (LLMs), yielding improved performance but incurring substantial computational cost. This inefficiency stems from directly applying conventional ensemble implementation to LLMs, which require a separate forward pass for each model to explicitly compute the ensemble distribution. In this paper, we propose the Mixture-model-like Ensemble (ME). By reinterpreting the ensemble as a mixture model, ME stochastically selects a single model at each step to generate the next token, thereby avoiding the need to explicitly compute the full ensemble distribution. ME is mathematically equivalent to sampling from the ensemble distribution, but requires invoking only one model, making it 1.78x-2.68x faster than conventional ensemble. Furthermore, this perspective connects LLM ensembling and token-level routing methods, suggesting that LLM ensembling is a special case of routing methods. Our findings open new avenues for efficient LLM ensembling and motivate further exploration of token-level routing strategies for LLMs. Our code is available at https://github.com/jialefu/Mixture-model-like-Ensemble/.

preprint2024arXiv

Capacity Results for Multiple-Input Multiple-Output Optical Wireless Communication With Per-Antenna Intensity Constraints

In this paper, we investigate the capacity of a multiple-input multiple-output (MIMO) optical intensity channel (OIC) under per-antenna peak- and average-intensity constraints. We first consider the case where the average intensities of input are required to be equal to preassigned constants due to the requirement of illumination quality and color temperature. When the channel graph of the MIMO OIC is strongly connected, we prove that the strongest eigen-subchannel must have positive channel gains, which simplifies the capacity analysis. Then we derive various capacity bounds by utilizing linear precoding, generalized entropy power inequality, and QR decomposition, etc. These bounds are numerically verified to approach the capacity in the low or high signal-to-noise ratio regime. Specifically, when the channel rank is one less than the number of transmit antennas, we derive an equivalent capacity expression from the perspective of convex geometry, and new lower bounds are derived based on this equivalent expression. Finally, the developed results are extended to the more general case where the average intensities of input are required to be no larger than preassigned constants.

preprint2023arXiv

Pairing Symmetry and Fermion Projective Symmetry Groups

The Ginzburg-Landau (GL) theory is very successful in describing the pairing symmetry, a fundamental characterization of the broken symmetries in a paired superfluid or superconductor. However, GL theory does not describe fermionic excitations such as Bogoliubov quasiparticles or Andreev bound states that are directly related to topological properties of the superconductor. In this work, we show that the symmetries of the fermionic excitations are captured by a Projective Symmetry Group (PSG), which is a group extension of the bosonic symmetry group in the superconducting state. We further establish a correspondence between the pairing symmetry and the fermion PSG. When the normal and superconducting states share the same spin rotational symmetry, there is a simpler correspondence between the pairing symmetry and the fermion PSG, which we enumerate for all 32 crystalline point groups. We also discuss the general framework for computing PSGs when the spin rotational symmetry is spontaneously broken in the superconducting state. This PSG formalism leads to experimental consequences, and as an example, we show how a given pairing symmetry dictates the classification of topological superconductivity.

preprint2022arXiv

A Distributed Implementation of Steady-State Kalman Filter

This paper studies the distributed state estimation in sensor network, where $m$ sensors are deployed to infer the $n$-dimensional state of a linear time-invariant (LTI) Gaussian system. By a lossless decomposition of optimal steady-state Kalman filter, we show that the problem of distributed estimation can be reformulated as synchronization of homogeneous linear systems. Based on such decomposition, a distributed estimator is proposed, where each sensor node runs a local filter using only its own measurement and fuses the local estimate of each node with a consensus algorithm. We show that the average of the estimate from all sensors coincides with the optimal Kalman estimate. Numerical examples are provided in the end to illustrate the performance of the proposed scheme.

preprint2022arXiv

Auto-Encoding Score Distribution Regression for Action Quality Assessment

The action quality assessment (AQA) of videos is a challenging vision task since the relation between videos and action scores is difficult to model. Thus, AQA has been widely studied in the literature. Traditionally, AQA is treated as a regression problem to learn the underlying mappings between videos and action scores. But previous methods ignored data uncertainty in AQA dataset. To address aleatoric uncertainty, we further develop a plug-and-play module Distribution Auto-Encoder (DAE). Specifically, it encodes videos into distributions and uses the reparameterization trick in variational auto-encoders (VAE) to sample scores, which establishes a more accurate mapping between videos and scores. Meanwhile, a likelihood loss is used to learn the uncertainty parameters. We plug our DAE approach into MUSDL and CoRe. Experimental results on public datasets demonstrate that our method achieves state-of-the-art on AQA-7, MTL-AQA, and JIGSAWS datasets. Our code is available at https://github.com/InfoX-SEU/DAE-AQA.

preprint2022arXiv

Automatically Discovering Novel Visual Categories with Self-supervised Prototype Learning

This paper tackles the problem of novel category discovery (NCD), which aims to discriminate unknown categories in large-scale image collections. The NCD task is challenging due to the closeness to the real-world scenarios, where we have only encountered some partial classes and images. Unlike other works on the NCD, we leverage the prototypes to emphasize the importance of category discrimination and alleviate the issue of missing annotations of novel classes. Concretely, we propose a novel adaptive prototype learning method consisting of two main stages: prototypical representation learning and prototypical self-training. In the first stage, we obtain a robust feature extractor, which could serve for all images with base and novel categories. This ability of instance and category discrimination of the feature extractor is boosted by self-supervised learning and adaptive prototypes. In the second stage, we utilize the prototypes again to rectify offline pseudo labels and train a final parametric classifier for category clustering. We conduct extensive experiments on four benchmark datasets and demonstrate the effectiveness and robustness of the proposed method with state-of-the-art performance.

preprint2022arXiv

Chromospheric recurrent jets in a sunspot group and their inter-granular origin

We report on high resolution observations of recurrent fan-like jets by the Goode Solar telescope (GST) in multi-wavelengths inside a sunspot group. The dynamics behaviour of the jets is derived from the Ha line profiles. Quantitative values for one well-identified event have been obtained showing a maximum projected velocity of 42 km s^-1 and a Doppler shift of the order of 20 km s^-1. The footpoints/roots of the jets have a lifted center on the Ha line profile compared to the quiet sun suggesting a long lasting heating at these locations. The magnetic field between the small sunspots in the group shows a very high resolution pattern with parasitic polarities along the inter-granular lanes accompanied by high velocity converging flows (4 km s^-1) in the photosphere. Magnetic cancellations between the opposite polarities are observed in the vicinity of the footpoints of the jets. Along the inter-granular lanes horizontal magnetic field around 1000 Gauss is generated impulsively. Overall, all the kinetic features at the different layers through photosphere and chromosphere favor a convection-driven reconnection scenario for the recurrent fan-like jets, and evidence a site of reconnection between the photosphere and chromosphere corresponding to the inter-granular lanes.

preprint2022arXiv

Computation of the Time-Dependent Dirac Equation with Physics-Informed Neural Networks

We propose to compute the time-dependent Dirac equation using physics-informed neural networks (PINNs), a new powerful tool in scientific machine learning avoiding the use of approximate derivatives of differential operators. PINNs search solutions in the form of parameterized (deep) neural networks, whose derivatives (in time and space) are performed by automatic differentiation. The computational cost comes from the need to solve high-dimensional optimization problems using stochastic gradient methods and train the network with a large number of points. Specifically, we derive PINNs-based algorithms and present some key fundamental properties of these algorithms when applied to the Dirac equations in different physical frameworks.

preprint2022arXiv

Deep Neural Networks for Creating Reliable PmP Database with a Case Study in Southern California

Recent progresses in artificial intelligence and machine learning make it possible to automatically identify seismic phases from exponentially growing seismic data. Despite some exciting successes in automatic picking of the first P- and S-wave arrivals, auto-identification of later seismic phases such as the Moho-reflected PmP waves remains a significant challenge in matching the performance of experienced analysts. The main difficulty of machine-identifying PmP waves is that the identifiable PmP waves are rare, making the problem of identifying the PmP waves from a massive seismic database inherently unbalanced. In this work, by utilizing a high-quality PmP dataset (10,192 manual picks) in southern California, we develop PmPNet, a deep-neural-network-based algorithm to automatically identify PmP waves efficiently; by doing so, we accelerate the process of identifying the PmP waves. PmPNet applies similar techniques in the machine learning community to address the unbalancement of PmP datasets. The architecture of PmPNet is a residual neural network (ResNet)-autoencoder with additional predictor block, where encoder, decoder, and predictor are equipped with ResNet connection. We conduct systematic research with field data, concluding that PmPNet can efficiently achieve high precision and high recall simultaneously to automatically identify PmP waves from a massive seismic database. Applying the pre-trained PmPNet to the seismic database from January 1990 to December 1999 in southern California, we obtain nearly twice more PmP picks than the original PmP dataset, providing valuable data for other studies such as mapping the topography of the Moho discontinuity and imaging the lower crust structures of southern California.

preprint2022arXiv

Electromagnetically induced transparency in inhomogeneously broadened divacancy defect ensembles in SiC

Electromagnetically induced transparency (EIT) is a phenomenon that can provide strong and robust interfacing between optical signals and quantum coherence of electronic spins. In its archetypical form, mainly explored with atomic media, it uses a (near-)homogeneous ensemble of three-level systems, in which two low-energy spin-1/2 levels are coupled to a common optically excited state. We investigate the implementation of EIT with c-axis divacancy color centers in silicon carbide. While this material has attractive properties for quantum device technologies with near-IR optics, implementing EIT is complicated by the inhomogeneous broadening of the optical transitions throughout the ensemble and the presence of multiple ground-state levels. These may lead to darkening of the ensemble upon resonant optical excitation. Here, we show that EIT can be established with high visibility also in this material platform upon careful design of the measurement geometry. Comparison of our experimental results with a model based on the Lindblad equations indicates that we can create coherences between different sets of two levels all-optically in these systems, with potential impact for RF-free quantum sensing applications. Our work provides an understanding of EIT in multi-level systems with significant inhomogeneities, and our considerations are valid for a wide array of defects in semiconductors.

preprint2022arXiv

EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching

Current metrics for video captioning are mostly based on the text-level comparison between reference and candidate captions. However, they have some insuperable drawbacks, e.g., they cannot handle videos without references, and they may result in biased evaluation due to the one-to-many nature of video-to-text and the neglect of visual relevance. From the human evaluator's viewpoint, a high-quality caption should be consistent with the provided video, but not necessarily be similar to the reference in literal or semantics. Inspired by human evaluation, we propose EMScore (Embedding Matching-based score), a novel reference-free metric for video captioning, which directly measures similarity between video and candidate captions. Benefit from the recent development of large-scale pre-training models, we exploit a well pre-trained vision-language model to extract visual and linguistic embeddings for computing EMScore. Specifically, EMScore combines matching scores of both coarse-grained (video and caption) and fine-grained (frames and words) levels, which takes the overall understanding and detailed characteristics of the video into account. Furthermore, considering the potential information gain, EMScore can be flexibly extended to the conditions where human-labeled references are available. Last but not least, we collect VATEX-EVAL and ActivityNet-FOIl datasets to systematically evaluate the existing metrics. VATEX-EVAL experiments demonstrate that EMScore has higher human correlation and lower reference dependency. ActivityNet-FOIL experiment verifies that EMScore can effectively identify "hallucinating" captions. The datasets will be released to facilitate the development of video captioning metrics. The code is available at: https://github.com/ShiYaya/emscore.

preprint2022arXiv

MemoNav: Selecting Informative Memories for Visual Navigation

Image-goal navigation is a challenging task, as it requires the agent to navigate to a target indicated by an image in a previously unseen scene. Current methods introduce diverse memory mechanisms which save navigation history to solve this task. However, these methods use all observations in the memory for generating navigation actions without considering which fraction of this memory is informative. To address this limitation, we present the MemoNav, a novel memory mechanism for image-goal navigation, which retains the agent's informative short-term memory and long-term memory to improve the navigation performance on a multi-goal task. The node features on the agent's topological map are stored in the short-term memory, as these features are dynamically updated. To aid the short-term memory, we also generate long-term memory by continuously aggregating the short-term memory via a graph attention module. The MemoNav retains the informative fraction of the short-term memory via a forgetting module based on a Transformer decoder and then incorporates this retained short-term memory and the long-term memory into working memory. Lastly, the agent uses the working memory for action generation. We evaluate our model on a new multi-goal navigation dataset. The experimental results show that the MemoNav outperforms the SoTA methods by a large margin with a smaller fraction of navigation history. The results also empirically show that our model is less likely to be trapped in a deadlock, which further validates that the MemoNav improves the agent's navigation efficiency by reducing redundant steps.

preprint2022arXiv

Nontrivial Solutions of Dirac-Laplace Equation on Compact Spin Manifolds

We apply the Fountain theorem to a class of nonlinear Dirac-Laplace equation on compact spin manifold. We show the standard Ambrosetti-Rabinowitz condition can be replaced by a more natural super-quadratic condition that is enough to obtain the Cerami condition under certain conditions. Multiple solutions of nonlinear Dirac-Laplace equation are obtained in this note.

preprint2022arXiv

Observations of pores and surrounding regions with CO 4.66 μm lines by BBSO/CYRA

Solar observations of carbon monoxide (CO) indicate the existence of lower-temperature gas in the lower solar chromosphere. We present an observation of pores, and quiet-Sun, and network magnetic field regions with CO 4.66 μm lines by the Cryogenic Infrared Spectrograph (CYRA) at Big Bear Solar Observatory. We used the strong CO lines at around 4.66 μm to understand the properties of the thermal structures of lower solar atmosphere in different solar features with various magnetic field strengths. AIA 1700 Å images, HMI continuum images and magnetograms are also included in the observation. The data from 3D radiation magnetohydrodynamic (MHD) simulation with the Bifrost code are also employed for the first time to be compared with the observation. We used the RH code to synthesize the CO line profiles in the network regions. The CO 3-2 R14 line center intensity changes to be either enhanced or diminished with increasing magnetic field strength, which should be caused by different heating effects in magnetic flux tubes with different sizes. We find several "cold bubbles" in the CO 3-2 R14 line center intensity images, which can be classified into two types. One type is located in the quiet-Sun regions without magnetic fields. The other type, which has rarely been reported in the past, is near or surrounded by magnetic fields. Notably, some are located at the edge of the magnetic network. The two kinds of cold bubbles and the relationship between cold bubble intensities and network magnetic field strength are both reproduced by the 3D MHD simulation with the Bifrost and RH codes. The simulation also shows that there is a cold plasma blob near the network magnetic fields, causing the observed cold bubbles seen in the CO 3-2 R14 line center image. Our observation and simulation illustrate that the magnetic field plays a vital role in the generation of some CO cold bubbles.

preprint2022arXiv

On Color Isomorphic Pairs in Proper Edge Colourings of Complete Graphs

Following the recent paper which initiated the study of colour isomorphism problems for complete graphs, we obtain upper bounds for $f_2(n,H)$ for a family of graphs $H$ obtained as the $K_0$-th rooted power of a balanced rooted tree for some sufficiently large $K_0$. The proof uses the random polynomial method of Bukh. We also obtain matching lower bounds for $1$-subdivisions of the complete bipartite graph.

preprint2022arXiv

On Coupled Dirac Systems under Boundary Condition

In this article we study the existence of solutions for the Dirac systems \begin{equation}\label{e:0.1} \left\{ \begin{array}{c} Pu=\frac{\partial H}{\partial v}(x,u,v) \quad\hbox{on} \ M, Pv=\frac{\partial H}{\partial u}(x,u,v) \quad\hbox{on} \ M, B_{\text{CHI}}u= B_{\text{CHI}}v=0\quad\hbox{on} \ \partial M \end{array} \right. \end{equation} where $M$ is an $m$-dimensional compact oriented Riemannian spin manifold with smooth boundary $\partial M$, $P$ is the Dirac operator under the boundary condition $B_{\text{CHI}}u= B_{\text{CHI}}v=0$ on $\partial M$, $ u,v\in C^{\infty}(M,ΣM)$ are spinors. Using an analytic framework of proper products of fractional Sobolev spaces, the solutions existence results of the coupled Dirac systems are obtained for nonlinearity with superquadratic growth rates.

preprint2022arXiv

On the extinction-extinguishing dichotomy for a stochastic Lotka-Volterra type population dynamical system

We study a two-dimensional process $(X, Y)$ arising as the unique nonnegative solution to a pair of stochastic differential equations driven by independent Brownian motions and compensated spectrally positive Lévy random measures. Both processes $X$ and $Y$ can be identified as continuous-state nonlinear branching processes where the evolution of $Y$ is negatively affected by $X$. Assuming that process $X$ extinguishes, i.e. it converges to $0$ but never reaches $0$ in finite time, and process $Y$ converges to $0$, we identify rather sharp conditions under which the process $Y$ exhibits, respectively, one of the following behaviors: extinction with probability one, extinguishing with probability one or both extinction and extinguishing occurring with strictly positive probabilities.

preprint2022arXiv

Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning

Compositional Zero-Shot Learning (CZSL) aims to recognize unseen compositions formed from seen state and object during training. Since the same state may be various in the visual appearance while entangled with different objects, CZSL is still a challenging task. Some methods recognize state and object with two trained classifiers, ignoring the impact of the interaction between object and state; the other methods try to learn the joint representation of the state-object compositions, leading to the domain gap between seen and unseen composition sets. In this paper, we propose a novel Siamese Contrastive Embedding Network (SCEN) (Code: https://github.com/XDUxyLi/SCEN-master) for unseen composition recognition. Considering the entanglement between state and object, we embed the visual feature into a Siamese Contrastive Space to capture prototypes of them separately, alleviating the interaction between state and object. In addition, we design a State Transition Module (STM) to increase the diversity of training compositions, improving the robustness of the recognition model. Extensive experiments indicate that our method significantly outperforms the state-of-the-art approaches on three challenging benchmark datasets, including the recent proposed C-QGA dataset.

preprint2022arXiv

The Co-alignment of Winged Hα Data Observed by the New Vacuum Solar Telescop

The New Vacuum Solar Telescope (NVST) has been releasing its novel winged Ha data (WHD) since April 2021, namely the Ha imaging spectroscopic data. Compared with the prior released version, the new data are further co-aligned among the off-band images and packaged into a standard solar physics community format. In this study, we illustrate the alignment algorithm used by the novel WHD, which is mainly based on the optical flow method to obtain the translation offset between the winged images. To quantitatively evaluate the alignment results of two images with different similarities, we calculate the alignment accuracies between the images of different off-band and line center, respectively. The result shows that our alignment algorithm could reach up to the accuracy of about 0.1 "when the off-band of winged image is lower than 0.6 Ȧ. In addition, we introduce the final product of the WHD in detail, which can provide convenience for the solar physicists to use high-resolution Hα imaging spectroscopic data of NVST.

preprint2022arXiv

Towards Unbiased Visual Emotion Recognition via Causal Intervention

Although much progress has been made in visual emotion recognition, researchers have realized that modern deep networks tend to exploit dataset characteristics to learn spurious statistical associations between the input and the target. Such dataset characteristics are usually treated as dataset bias, which damages the robustness and generalization performance of these recognition systems. In this work, we scrutinize this problem from the perspective of causal inference, where such dataset characteristic is termed as a confounder which misleads the system to learn the spurious correlation. To alleviate the negative effects brought by the dataset bias, we propose a novel Interventional Emotion Recognition Network (IERN) to achieve the backdoor adjustment, which is one fundamental deconfounding technique in causal inference. Specifically, IERN starts by disentangling the dataset-related context feature from the actual emotion feature, where the former forms the confounder. The emotion feature will then be forced to see each confounder stratum equally before being fed into the classifier. A series of designed tests validate the efficacy of IERN, and experiments on three emotion benchmarks demonstrate that IERN outperforms state-of-the-art approaches for unbiased visual emotion recognition. Code is available at https://github.com/donydchen/causal_emotion

preprint2022arXiv

Weakly Aligned Feature Fusion for Multimodal Object Detection

To achieve accurate and robust object detection in the real-world scenario, various forms of images are incorporated, such as color, thermal, and depth. However, multimodal data often suffer from the position shift problem, i.e., the image pair is not strictly aligned, making one object has different positions in different modalities. For the deep learning method, this problem makes it difficult to fuse multimodal features and puzzles the convolutional neural network (CNN) training. In this article, we propose a general multimodal detector named aligned region CNN (AR-CNN) to tackle the position shift problem. First, a region feature (RF) alignment module with adjacent similarity constraint is designed to consistently predict the position shift between two modalities and adaptively align the cross-modal RFs. Second, we propose a novel region of interest (RoI) jitter strategy to improve the robustness to unexpected shift patterns. Third, we present a new multimodal feature fusion method that selects the more reliable feature and suppresses the less useful one via feature reweighting. In addition, by locating bounding boxes in both modalities and building their relationships, we provide novel multimodal labeling named KAIST-Paired. Extensive experiments on 2-D and 3-D object detection, RGB-T, and RGB-D datasets demonstrate the effectiveness and robustness of our method.

preprint2021arXiv

Causal Attention for Vision-Language Tasks

We present a novel attention mechanism: Causal Attention (CATT), to remove the ever-elusive confounding effect in existing attention-based vision-language models. This effect causes harmful bias that misleads the attention module to focus on the spurious correlations in training data, damaging the model generalization. As the confounder is unobserved in general, we use the front-door adjustment to realize the causal intervention, which does not require any knowledge on the confounder. Specifically, CATT is implemented as a combination of 1) In-Sample Attention (IS-ATT) and 2) Cross-Sample Attention (CS-ATT), where the latter forcibly brings other samples into every IS-ATT, mimicking the causal intervention. CATT abides by the Q-K-V convention and hence can replace any attention module such as top-down attention and self-attention in Transformers. CATT improves various popular attention-based vision-language models by considerable margins. In particular, we show that CATT has great potential in large-scale pre-training, e.g., it can promote the lighter LXMERT~\cite{tan2019lxmert}, which uses fewer data and less computational power, comparable to the heavier UNITER~\cite{chen2020uniter}. Code is published in \url{https://github.com/yangxuntu/catt}.

preprint2021arXiv

Classical limit for the varying-mass Schrödinger equation with random inhomogeneities

The varying-mass Schrödinger equation (VMSE) has been successfully applied to model electronic properties of semiconductor hetero-structures, for example, quantum dots and quantum wells. In this paper, we consider VMSE with small random heterogeneities, and derive a radiative transfer equation as its asymptotic limit. The main tool is to systematically apply the Wigner transform in the classical regime when the rescaled Planck constant $ε\ll 1$, and expand the Wigner equation to proper orders of $ε$. As a proof of concept, we numerically compute both VMSE and its limiting radiative transfer equation, and show that their solutions agree well in the classical regime.

preprint2021arXiv

Cloud Cover and Aurora Contamination at Dome A in 2017 from KLCAM

Dome A in Antarctica has many characteristics that make it an excellent site for astronomical observations, from the optical to the terahertz. Quantitative site testing is still needed to confirm the site's properties. In this paper, we present a statistical analysis of cloud cover and aurora contamination from the Kunlun Cloud and Aurora Monitor (KLCAM). KLCAM is an automatic, unattended all-sky camera aiming for long-term monitoring of the usable observing time and optical sky background at Dome~A. It was installed at Dome~A in January 2017, worked through the austral winter, and collected over 47,000 images over 490 days. A semi-quantitative visual data analysis of cloud cover and auroral contamination was carried out by five individuals. The analysis shows that the night sky was free of cloud for 83 per cent of the time, which ranks Dome~A highly in a comparison with other observatory sites. Although aurorae were detected somewhere on an image for nearly 45 per cent of the time, the strongest auroral emission lines can be filtered out with customized filters.

preprint2021arXiv

Deep unfitted Nitsche method for elliptic interface problems

This paper proposes a deep unfitted Nitsche method for computing elliptic interface problems with high contrasts in high dimensions. To capture discontinuities of the solution caused by interfaces, we reformulate the problem as an energy minimization involving two weakly coupled components. This enables us to train two deep neural networks to represent two components of the solution in high-dimensional. The curse of dimensionality is alleviated by using the Monte-Carlo method to discretize the unfitted Nitsche energy function. We present several numerical examples to show the performance of the proposed method.

preprint2021arXiv

Doubly Contrastive Deep Clustering

Deep clustering successfully provides more effective features than conventional ones and thus becomes an important technique in current unsupervised learning. However, most deep clustering methods ignore the vital positive and negative pairs introduced by data augmentation and further the significance of contrastive learning, which leads to suboptimal performance. In this paper, we present a novel Doubly Contrastive Deep Clustering (DCDC) framework, which constructs contrastive loss over both sample and class views to obtain more discriminative features and competitive results. Specifically, for the sample view, we set the class distribution of the original sample and its augmented version as positive sample pairs and set one of the other augmented samples as negative sample pairs. After that, we can adopt the sample-wise contrastive loss to pull positive sample pairs together and push negative sample pairs apart. Similarly, for the class view, we build the positive and negative pairs from the sample distribution of the class. In this way, two contrastive losses successfully constrain the clustering results of mini-batch samples in both sample and class level. Extensive experimental results on six benchmark datasets demonstrate the superiority of our proposed model against state-of-the-art methods. Particularly in the challenging dataset Tiny-ImageNet, our method leads 5.6\% against the latest comparison method. Our code will be available at \url{https://github.com/ZhiyuanDang/DCDC}.

preprint2021arXiv

Multi-Passband Observations of A Solar Flare over the He I 10830 Å line

This study presents a C3.0 flare observed by the BBSO/GST and IRIS, on 2018-May-28 around 17:10 UT. The Near Infrared Imaging Spectropolarimeter (NIRIS) of GST was set to spectral imaging mode to scan five spectral positions at $\pm$ 0.8 Å, $\pm$ 0.4 Åand line center of He I 10830. At the flare ribbon's leading edge the line is observed to undergo enhanced absorption, while the rest of the ribbon is observed to be in emission. When in emission, the contrast compared to the pre-flare ranges from about $30~\%$ to nearly $100~\%$ at different spectral positions. Two types of spectra, "convex" shape with higher intensity at line core and "concave" shape with higher emission in the line wings, are found at the trailing and peak flaring areas, respectively. On the ribbon front, negative contrasts, or enhanced absorption, of about $\sim 10\% - 20\%$ appear in all five wavelengths. This observation strongly suggests that the negative flares observed in He I 10830 with mono-filtergram previously were not caused by pure Doppler shifts of this spectral line. Instead, the enhanced absorption appears to be a consequence of flare energy injection, namely non-thermal collisional ionization of helium caused by the precipitation of high energy electrons, as found in our recent numerical modeling results. In addition, though not strictly simultaneous, observations of Mg II from the IRIS spacecraft, show an obvious central reversal pattern at the locations where enhanced absorption of He I 10830 is seen, which is in consistent with previous observations.

preprint2020arXiv

Automated Pavement Crack Segmentation Using U-Net-based Convolutional Neural Network

Automated pavement crack image segmentation is challenging because of inherent irregular patterns, lighting conditions, and noise in images. Conventional approaches require a substantial amount of feature engineering to differentiate crack regions from non-affected regions. In this paper, we propose a deep learning technique based on a convolutional neural network to perform segmentation tasks on pavement crack images. Our approach requires minimal feature engineering compared to other machine learning techniques. We propose a U-Net-based network architecture in which we replace the encoder with a pretrained ResNet-34 neural network. We use a "one-cycle" training schedule based on cyclical learning rates to speed up the convergence. Our method achieves an F1 score of 96% on the CFD dataset and 73% on the Crack500 dataset, outperforming other algorithms tested on these datasets. We perform ablation studies on various techniques that helped us get marginal performance boosts, i.e., the addition of spatial and channel squeeze and excitation (SCSE) modules, training with gradually increasing image sizes, and training various neural network layers with different learning rates.

preprint2020arXiv

Automation of the AST3 optical sky survey from Dome~A, Antarctica

The 0.5\,m Antarctic Survey Telescopes (AST3) were designed for time-domain optical/infrared astronomy. They are located in Dome~A, Antarctica, where they can take advantage of the continuous dark time during winter. Since the site is unattended in winter, everything for the operation, from observing to data reduction, had to be fully automated. Here, we present a brief overview of the AST3 project and some of its unique characteristics due to its location in Antarctica. We summarise the various components of the survey, including the customized hardware and software, that make complete automation possible.

preprint2020arXiv

CYRA: the cryogenic infrared spectrograph for the Goode Solar Telescope in Big Bear

CYRA (CrYogenic solar spectrogRAph) is a facility instrument of the 1.6-meter Goode Solar Telescope (GST) at the Big Bear Solar Observatory (BBSO). CYRA focuses on the study of the near-infrared solar spectrum between 1 and 5 microns, a under explored region which is not only a fertile ground for photospheric magnetic diagnostics, but also allows a unique window into the chromosphere lying atop the photosphere. CYRA is the first ever fully cryogenic spectrograph in any solar observatory with its two predecessors, on the McMath-Pierce and Mees Telescopes, being based on warm optics except for the detectors and order sorting filters. CYRA is used to probe magnetic fields in various solar features and the quiet photosphere. CYRA measurements will allow new and better 3D extrapolations of the solar magnetic field and will provide more accurate boundary conditions for solar activity models. Superior spectral resolution of 150,000 and better allows enhanced observations of the chromosphere in the carbon monoxide (CO) spectral bands and will yield a better understanding of energy transport in the solar atmosphere. CYRA is divided into two optical sub-systems: The Fore-Optics Module and the Spectrograph. The Spectrograph is the heart of the instrument and contains the IR detector, grating, slits, filters, and imaging optics all in a cryogenically cooled Dewar (cryostat). The detector a 2048 by 2048 pixel HAWAII 2 array produced by Teledyne Scientific & Imaging, LLC. The interior of the cryostat and the readout electronics are maintained at 90 Kelvin by helium refrigerant based cryo-coolers, while the IR array is cooled to 30 Kelvin. The Fore-Optics Module de-rotates and stabilizes the solar image, provides scanning capabilities, and transfers the GST image to the Spectrograph. CYRA has been installed and is undergoing its commissioning phase.

preprint2020arXiv

Detecting chirality in two-terminal electronic devices

Central to spintronics is the interconversion between electronic charge and spin currents, and this can arise from the chirality-induced spin selectivity (CISS) effect. CISS is often studied as magnetoresistance (MR) in two-terminal (2T) electronic devices containing a chiral (molecular) component and a ferromagnet. However, fundamental understanding of when and how this MR can occur is lacking. Here, we uncover an elementary mechanism that generates such a MR for nonlinear response. It requires energy-dependent transport and energy relaxation within the device. The sign of the MR depends on chirality, charge carrier type, and bias direction. Additionally, we reveal how CISS can be detected in the linear response regime in magnet-free 2T devices, either by forming a chirality-based spin-valve using two or more chiral components, or by Hanle spin precession in devices with a single chiral component. Our results provide operation principles and design guidelines for chirality-based spintronic devices and technologies.

preprint2020arXiv

Discovery of segmented Fermi surface induced by Cooper pair momentum

Since the early days of Bardeen-Cooper-Schrieffer theory, it has been predicted that a sufficiently large supercurrent can close the energy gap in a superconductor and creates gapless Bogoliubov quasiparticles through the Doppler shift of quasiparticle energy due to the Cooper pair momentum. In this gapless superconducting state, zero-energy quasiparticles reside on a segment of the normal state Fermi surface, while its remaining part is still gapped. The finite density of states of field-induced quasiparticles, known as the Volovik effect, has been observed in tunneling and specific heat measurements on d- and s-wave superconductors. However, the segmented Fermi surface of a finite-momentum state carrying a supercurrent has never been detected directly. Here we use quasiparticle interference (QPI) technique to image field-controlled Fermi surface of Bi$_2$Te$_3$ thin films proximitized by the superconductor NbSe$_2$. By applying a small in-plane magnetic field, a screening supercurrent is induced which leads to finite-momentum pairing on topological surface states of Bi$_2$Te$_3$. Our measurements and analysis reveal the strong impact of finite Cooper pair momentum on the quasiparticle spectrum, and thus pave the way for STM study of pair density wave and FFLO states in unconventional superconductors.

preprint2020arXiv

Incremental Embedding Learning via Zero-Shot Translation

Modern deep learning methods have achieved great success in machine learning and computer vision fields by learning a set of pre-defined datasets. Howerver, these methods perform unsatisfactorily when applied into real-world situations. The reason of this phenomenon is that learning new tasks leads the trained model quickly forget the knowledge of old tasks, which is referred to as catastrophic forgetting. Current state-of-the-art incremental learning methods tackle catastrophic forgetting problem in traditional classification networks and ignore the problem existing in embedding networks, which are the basic networks for image retrieval, face recognition, zero-shot learning, etc. Different from traditional incremental classification networks, the semantic gap between the embedding spaces of two adjacent tasks is the main challenge for embedding networks under incremental learning setting. Thus, we propose a novel class-incremental method for embedding network, named as zero-shot translation class-incremental method (ZSTCI), which leverages zero-shot translation to estimate and compensate the semantic gap without any exemplars. Then, we try to learn a unified representation for two adjacent tasks in sequential learning process, which captures the relationships of previous classes and current classes precisely. In addition, ZSTCI can easily be combined with existing regularization-based incremental learning methods to further improve performance of embedding networks. We conduct extensive experiments on CUB-200-2011 and CIFAR100, and the experiment results prove the effectiveness of our method. The code of our method has been released.

preprint2020arXiv

Large spin to charge conversion in topological superconductor \b{eta}-PdBi2 at room temperature

\b{eta}-PdBi2 has attracted much attention for its prospective ability to possess simultaneously topological surface and superconducting states due to its unprecedented spin-orbit interaction (SOC). Whereas most works have focused solely on investigating its topological surface states, the coupling between spin and charge degrees of freedom in this class of quantum material remains unexplored. Here we first report a study of spin-to-charge conversion in a \b{eta}-PdBi2 ultrathin film grown by molecular beam epitaxy, utilizing a spin pumping technique to perform inverse spin Hall effect measurements. We find that the room temperature spin Hall angle of Fe/\b{eta}-PdBi2, θ_SH=0.037. This value is one order of magnitude larger than that of reported conventional superconductors, and is comparable to that of the best SOC metals and topological insulators. Our results provide an avenue for developing superconductor-based spintronic applications.

preprint2020arXiv

Night-time measurements of astronomical seeing at Dome A in Antarctica

Seeing, the angular size of stellar images blurred by atmospheric turbulence, is a critical parameter used to assess the quality of astronomical sites. Median values at the best mid-latitude sites are generally in the range of 0.6--0.8\,arcsec. Sites on the Antarctic plateau are characterized by comparatively-weak turbulence in the free-atmosphere above a strong but thin boundary layer. The median seeing at Dome C is estimated to be 0.23--0.36 arcsec above a boundary layer that has a typical height of 30\,m. At Dome A and F, the only previous seeing measurements were made during daytime. Here we report the first direct measurements of night-time seeing at Dome A, using a Differential Image Motion Monitor. Located at a height of just 8\,m, it recorded seeing as low as 0.13\,arcsec, and provided seeing statistics that are comparable to those for a 20\,m height at Dome C. It indicates that the boundary layer was below 8\,m 31\% of the time. At such times the median seeing was 0.31\,arcsec, consistent with free-atmosphere seeing. The seeing and boundary layer thickness are found to be strongly correlated with the near-surface temperature gradient. The correlation confirms a median thickness of approximately 14\,m for the boundary layer at Dome A, as found from a sonic radar. The thinner boundary layer makes it less challenging to locate a telescope above it, thereby giving greater access to the free-atmosphere.

preprint2020arXiv

Nonreciprocal directional dichroism induced by a temperature gradient as a probe for mobile spin dynamics in quantum magnets

Novel states of matter in quantum magnets like quantum spin liquids attract considerable interest recently. Despite the existence of a plenty of candidate materials, there is no confirmed quantum spin liquid, largely due to the lack of proper experimental probes. For instance, spectrosocopy experiments like neutron scattering receive contributions from disorder-induced local modes, while thermal transport experiments receive contributions from phonons. Here we propose a thermo-optic experiment which directly probes the mobile magnetic excitations in spatial-inversion symmetric and/or time-reversal symmetric Mott insulators: the temperature-gradient-induced nonreciprocal directional dichroism (TNDD) spectroscopy. Unlike traditional probes, TNDD directly detects mobile magnetic excitations and decouples from phonons and local magnetic modes.

preprint2020arXiv

Rapid Evolution of Type II Spicules Observed in Goode Solar Telescope On-Disk H-alpha Images

We analyze ground-based chromospheric data acquired at a high temporal cadence of 2 s in wings of the H$α$ spectral line using Goode Solar Telescope (GST) operating at the Big Bear Solar Observatory. We inspected a 30 minute long H$α$-0.08~nm data set to find that rapid blue-shifted H$α$ excursions (RBEs), which are a cool component of type II spicules, experience very rapid morphological changes on the time scales of the order of 1 second. Unlike typical reconnection jets, RBEs very frequently appear \textit{in situ} without any clear evidence of H$α$ material being injected from below. Their evolution includes inverted "Y", "V", "N", and parallel splitting (doubling) patterns as well as sudden formation of a diffuse region followed by branching. We also find that the same feature may undergo several splitting episodes within about 1 min time interval.

preprint2020arXiv

Reply to "Comment on 'Spin-dependent electron transmission model for chiral molecules in mesoscopic devices'"

Here we emphasize once more the distinction between generating CISS (spin-charge current conversion) in a chiral system and detecting it as magnetoresistance in two-terminal electronic devices. We also highlight important differences between electrical measurement results obtained in the linear response regime and those obtained in the nonlinear regime.

preprint2020arXiv

SPDEs with non-Lipschitz coefficients and nonhomogeous boundary conditions

In this paper we establish the strong existence, pathwise uniqueness and a comparison theorem to a stochastic partial differential equation driven by Gaussian colored noise with non-Lipschitz drift, Hölder continuous diffusion coefficients and the spatial domain in finite interval, $[0,1]$, and with Dirichlet, Neumann or mixed nonhomogeneous random conditions imposed on the endpoints. The Hölder continuity of the solution both in time and in space variables is also studied.

preprint2020arXiv

Unfitted Nitsche's method for computing wave modes in topological materials

In this paper, we propose an unfitted Nitsche's method for computing wave modes in topological materials. The proposed method is based on Nitsche's technique to study the performance-enhanced topological materials which have strongly heterogeneous structures (e.g., the refractive index is piecewise constant with high contrasts). For periodic bulk materials, we use Floquet-Bloch theory and solve an eigenvalue problem on a torus with unfitted meshes. For the materials with a line defect, a sufficiently large domain with zero boundary conditions is used to compute the localized eigenfunctions corresponding to the edge modes. The interfaces are handled by Nitsche's method on an unfitted uniform mesh. We prove the proposed methods converge optimally, and present numerical examples to validate the theoretical results and demonstrate the capability of simulating topological materials.

preprint2019arXiv

TBC-Net: A real-time detector for infrared small target detection using semantic constraint

Infrared small target detection is a key technique in infrared search and tracking (IRST) systems. Although deep learning has been widely used in the vision tasks of visible light images recently, it is rarely used in infrared small target detection due to the difficulty in learning small target features. In this paper, we propose a novel lightweight convolutional neural network TBC-Net for infrared small target detection. The TBCNet consists of a target extraction module (TEM) and a semantic constraint module (SCM), which are used to extract small targets from infrared images and to classify the extracted target images during the training, respectively. Meanwhile, we propose a joint loss function and a training method. The SCM imposes a semantic constraint on TEM by combining the high-level classification task and solve the problem of the difficulty to learn features caused by class imbalance problem. During the training, the targets are extracted from the input image and then be classified by SCM. During the inference, only the TEM is used to detect the small targets. We also propose a data synthesis method to generate training data. The experimental results show that compared with the traditional methods, TBC-Net can better reduce the false alarm caused by complicated background, the proposed network structure and joint loss have a significant improvement on small target feature learning. Besides, TBC-Net can achieve real-time detection on the NVIDIA Jetson AGX Xavier development board, which is suitable for applications such as field research with drones equipped with infrared sensors.