Source author record

Jian Kang

Jian Kang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

53works

22topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A Unified Spoken Language Model with Injected Emotional-Attribution Thinking for Human-like Interaction

This paper presents a unified spoken language model for emotional intelligence, enhanced by a novel data construction strategy termed Injected Emotional-Attribution Thinking (IEAT). IEAT incorporates user emotional states and their underlying causes into the model's internal reasoning process, enabling emotion-aware reasoning to be internalized rather than treated as explicit supervision. The model is trained with a two-stage progressive strategy. The first stage performs speech-text alignment and emotional attribute modeling via self-distillation, while the second stage conducts end-to-end cross-modal joint optimization to ensure consistency between textual and spoken emotional expressions. Experiments on the Human-like Spoken Dialogue Systems Challenge (HumDial) Emotional Intelligence benchmark demonstrate that the proposed approach achieves top-ranked performance across emotional trajectory modeling, emotional reasoning, and empathetic response generation under both LLM-based and human evaluations.

preprint2026arXiv

TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios

Spoken language models (SLMs) have advanced rapidly in recent years, accompanied by a growing number of evaluation benchmarks. However, most existing benchmarks emphasize task completion and capability scaling, while remaining poorly aligned with how users interact with SLMs in real-world spoken conversations. Effective spoken interaction requires not only accurate understanding of user intent and content, but also the ability to respond with appropriate interactional strategies. In this paper, we present TELEVAL, a dynamic, user-centered benchmark for evaluating SLMs in realistic Chinese spoken interaction scenarios. TELEVAL consolidates evaluation into two core aspects. Reliable Content Fulfillment assesses whether models can comprehend spoken inputs and produce semantically correct responses. Interactional Appropriateness evaluates whether models act as socially capable interlocutors, requiring them not only to generate human-like, colloquial responses, but also to implicitly incorporate paralinguistic cues for natural interaction. Experiments reveal that, despite strong performance on semantic and knowledge-oriented tasks, current SLMs still struggle to produce natural and interactionally appropriate responses, highlighting the need for more interaction-faithful evaluation.

preprint2024arXiv

Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation

Vision-Language Pre-training has demonstrated its remarkable zero-shot recognition ability and potential to learn generalizable visual representations from language supervision. Taking a step ahead, language-supervised semantic segmentation enables spatial localization of textual inputs by learning pixel grouping solely from image-text pairs. Nevertheless, the state-of-the-art suffers from clear semantic gaps between visual and textual modality: plenty of visual concepts appeared in images are missing in their paired captions. Such semantic misalignment circulates in pre-training, leading to inferior zero-shot performance in dense predictions due to insufficient visual concepts captured in textual representations. To close such semantic gap, we propose Concept Curation (CoCu), a pipeline that leverages CLIP to compensate for the missing semantics. For each image-text pair, we establish a concept archive that maintains potential visually-matched concepts with our proposed vision-driven expansion and text-to-vision-guided ranking. Relevant concepts can thus be identified via cluster-guided sampling and fed into pre-training, thereby bridging the gap between visual and textual semantics. Extensive experiments over a broad suite of 8 segmentation benchmarks show that CoCu achieves superb zero-shot transfer performance and greatly boosts language-supervised segmentation baseline by a large margin, suggesting the value of bridging semantic gap in pre-training data.

preprint2023arXiv

InfoFair: Information-Theoretic Intersectional Fairness

Algorithmic fairness is becoming increasingly important in data mining and machine learning. Among others, a foundational notation is group fairness. The vast majority of the existing works on group fairness, with a few exceptions, primarily focus on debiasing with respect to a single sensitive attribute, despite the fact that the co-existence of multiple sensitive attributes (e.g., gender, race, marital status, etc.) in the real-world is commonplace. As such, methods that can ensure a fair learning outcome with respect to all sensitive attributes of concern simultaneously need to be developed. In this paper, we study the problem of information-theoretic intersectional fairness (InfoFair), where statistical parity, a representative group fairness measure, is guaranteed among demographic groups formed by multiple sensitive attributes of interest. We formulate it as a mutual information minimization problem and propose a generic end-to-end algorithmic framework to solve it. The key idea is to leverage a variational representation of mutual information, which considers the variational distribution between learning outcomes and sensitive attributes, as well as the density ratio between the variational and the original distributions. Our proposed framework is generalizable to many different settings, including other statistical notions of fairness, and could handle any type of learning task equipped with a gradient-based optimizer. Empirical evaluations in the fair classification task on three real-world datasets demonstrate that our proposed framework can effectively debias the classification results with minimal impact to the classification accuracy.

preprint2022arXiv

A Soft-Thresholding Operator for Sparse Time-Varying Effects in Survival Models

We consider a class of Cox models with time-dependent effects that may be zero over certain unknown time regions or, in short, sparse time-varying effects. The model is particularly useful for biomedical studies as it conveniently depicts the gradual evolution of effects of risk factors on survival. Statistically, estimating and drawing inference on infinite dimensional functional parameters with sparsity (e.g., time-varying effects with zero-effect time intervals) present enormous challenges. To address them, we propose a new soft-thresholding operator for modeling sparse, piecewise smooth and continuous time-varying coefficients in a Cox time-varying effects model. Unlike the common regularized methods, our approach enables one to estimate non-zero time-varying effects and detect zero regions simultaneously, and construct a new type of sparse confidence intervals that accommodate zero regions. This leads to a more interpretable model with a straightforward inference procedure. We develop an efficient algorithm for inference in the target functional space, show that the proposed method enjoys desired theoretical properties, and present its finite sample performance by way of simulations. We apply the proposed method to analyze the data of the Boston Lung Cancer Survivor Cohort, an epidemiological cohort study investigating the impacts of risk factors on lung cancer survival, and obtain clinically useful results.

preprint2022arXiv

Adversarial Sample Detection for Speaker Verification by Neural Vocoders

Automatic speaker verification (ASV), one of the most important technology for biometric identification, has been widely adopted in security-critical applications. However, ASV is seriously vulnerable to recently emerged adversarial attacks, yet effective countermeasures against them are limited. In this paper, we adopt neural vocoders to spot adversarial samples for ASV. We use the neural vocoder to re-synthesize audio and find that the difference between the ASV scores for the original and re-synthesized audio is a good indicator for discrimination between genuine and adversarial samples. This effort is, to the best of our knowledge, among the first to pursue such a technical direction for detecting time-domain adversarial samples for ASV, and hence there is a lack of established baselines for comparison. Consequently, we implement the Griffin-Lim algorithm as the detection baseline. The proposed approach achieves effective detection performance that outperforms the baselines in all the settings. We also show that the neural vocoder adopted in the detection framework is dataset-independent. Our codes will be made open-source for future works to do fair comparison.

preprint2022arXiv

Bayesian learning of COVID-19 Vaccine safety while incorporating Adverse Events ontology

While vaccines are crucial to end the COVID-19 pandemic, public confidence in vaccine safety has always been vulnerable. Many statistical methods have been applied to VAERS (Vaccine Adverse Event Reporting System) database to study the safety of COVID-19 vaccines. However, all these methods ignored the adverse event (AE) ontology. AEs are naturally related; for example, events of retching, dysphagia, and reflux are all related to an abnormal digestive system. Explicitly bringing AE relationships into the model can aid in the detection of true AE signals amid the noise while reducing false positives. We propose a Bayesian graphical model to estimate all AEs while incorporating the AE ontology simultaneously. We proposed strategies to construct conjugate forms leading to an efficient Gibbs sampler. Built upon the posterior distributions, we proposed a negative control approach to mitigate reporting bias and an enrichment approach to detect AE groups of concern. The proposed methods were evaluated using simulation studies and were further illustrated on studying the safety of COVID-19 vaccines. The proposed methods were implemented in R package \textit{BGrass} and source code are available at https://github.com/BangyaoZhao/BGrass.

preprint2022arXiv

Deep Historical Borrowing Framework to Prospectively and Simultaneously Synthesize Control Information in Confirmatory Clinical Trials with Multiple Endpoints

In current clinical trial development, historical information is receiving more attention as it provides utility beyond sample size calculation. Meta-analytic-predictive (MAP) priors and robust MAP priors have been proposed for prospectively borrowing historical data on a single endpoint. To simultaneously synthesize control information from multiple endpoints in confirmatory clinical trials, we propose to approximate posterior probabilities from a Bayesian hierarchical model and estimate critical values by deep learning to construct pre-specified strategies for hypothesis testing. This feature is important to ensure study integrity by establishing prospective decision functions before the trial conduct. Simulations are performed to show that our method properly controls family-wise error rate (FWER) and preserves power as compared with a typical practice of choosing constant critical values given a subset of null space. Satisfactory performance under prior-data conflict is also demonstrated. We further illustrate our method using a case study in Immunology.

preprint2022arXiv

Finite-Sample Two-Group Composite Hypothesis Testing via Machine Learning

In the problem of composite hypothesis testing, identifying the potential uniformly most powerful (UMP) unbiased test is of great interest. Beyond typical hypothesis settings with exponential family, it is usually challenging to prove the existence and further construct such UMP unbiased tests with finite sample size. For example in the COVID-19 pandemic with limited previous assumptions on the treatment for investigation and the standard of care, adaptive clinical trials are appealing due to ethical considerations, and the ability to accommodate uncertainty while conducting the trial. Although several methods have been proposed to control type I error rates, how to find a more powerful hypothesis testing strategy is still an open question. Motivated by this problem, we propose an automatic framework of constructing test statistics and corresponding critical values via machine learning methods to enhance power in a finite sample. In this article, we particularly illustrate the performance using Deep Neural Networks (DNN) and discuss its advantages. Simulations and two case studies of adaptive designs demonstrate that our method is automatic, general and pre-specified to construct statistics with satisfactory power in finite-sample. Supplemental materials are available online including R code and an R shiny app.

preprint2022arXiv

Image Response Regression via Deep Neural Networks

Delineating the associations between images and a vector of covariates is of central interest in medical imaging studies. To tackle this problem of image response regression, we propose a novel nonparametric approach in the framework of spatially varying coefficient models, where the spatially varying functions are estimated through deep neural networks. Compared to existing solutions, the proposed method explicitly accounts for spatial smoothness and subject heterogeneity, has straightforward interpretations, and is highly flexible and accurate in capturing complex association patterns. A key idea in our approach is to treat the image voxels as the effective samples, which not only alleviates the limited sample size issue that haunts the majority of medical imaging studies, but also leads to more robust and reproducible results. Focusing on a broad family of piecewise smooth functions, we establish the estimation and selection consistency, and derive the asymptotic error bounds. We demonstrate the efficacy of the method through intensive simulations, and further illustrate its advantages with analyses of two functional magnetic resonance imaging datasets.

preprint2022arXiv

Individualized Risk Assessment of Preoperative Opioid Use by Interpretable Neural Network Regression

Preoperative opioid use has been reported to be associated with higher preoperative opioid demand, worse postoperative outcomes, and increased postoperative healthcare utilization and expenditures. Understanding the risk of preoperative opioid use helps establish patient-centered pain management. In the field of machine learning, deep neural network (DNN) has emerged as a powerful means for risk assessment because of its superb prediction power; however, the blackbox algorithms may make the results less interpretable than statistical models. Bridging the gap between the statistical and machine learning fields, we propose a novel Interpretable Neural Network Regression (INNER), which combines the strengths of statistical and DNN models. We use the proposed INNER to conduct individualized risk assessment of preoperative opioid use. Intensive simulations and an analysis of 34,186 patients expecting surgery in the Analgesic Outcomes Study (AOS) show that the proposed INNER not only can accurately predict the preoperative opioid use using preoperative characteristics as DNN, but also can estimate the patient specific odds of opioid use without pain and the odds ratio of opioid use for a unit increase in the reported overall body pain, leading to more straightforward interpretations of the tendency to use opioids than DNN. Our results identify the patient characteristics that are strongly associated with opioid use and is largely consistent with the previous findings, providing evidence that INNER is a useful tool for individualized risk assessment of preoperative opioid use.

preprint2022arXiv

Leveraging Phone Mask Training for Phonetic-Reduction-Robust E2E Uyghur Speech Recognition

In Uyghur speech, consonant and vowel reduction are often encountered, especially in spontaneous speech with high speech rate, which will cause a degradation of speech recognition performance. To solve this problem, we propose an effective phone mask training method for Conformer-based Uyghur end-to-end (E2E) speech recognition. The idea is to randomly mask off a certain percentage features of phones during model training, which simulates the above verbal phenomena and facilitates E2E model to learn more contextual information. According to experiments, the above issues can be greatly alleviated. In addition, deep investigations are carried out into different units in masking, which shows the effectiveness of our proposed masking unit. We also further study the masking method and optimize filling strategy of phone mask. Finally, compared with Conformer-based E2E baseline without mask training, our model demonstrates about 5.51% relative Word Error Rate (WER) reduction on reading speech and 12.92% on spontaneous speech, respectively. The above approach has also been verified on test-set of open-source data THUYG-20, which shows 20% relative improvements.

preprint2022arXiv

Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition

General accent recognition (AR) models tend to directly extract low-level information from spectrums, which always significantly overfit on speakers or channels. Considering accent can be regarded as a series of shifts relative to native pronunciation, distinguishing accents will be an easier task with accent shift as input. But due to the lack of native utterance as an anchor, estimating the accent shift is difficult. In this paper, we propose linguistic-acoustic similarity based accent shift (LASAS) for AR tasks. For an accent speech utterance, after mapping the corresponding text vector to multiple accent-associated spaces as anchors, its accent shift could be estimated by the similarities between the acoustic embedding and those anchors. Then, we concatenate the accent shift with a dimension-reduced text vector to obtain a linguistic-acoustic bimodal representation. Compared with pure acoustic embedding, the bimodal representation is richer and more clear by taking full advantage of both linguistic and acoustic information, which can effectively improve AR performance. Experiments on Accented English Speech Recognition Challenge (AESRC) dataset show that our method achieves 77.42% accuracy on Test set, obtaining a 6.94% relative improvement over a competitive system in the challenge.

preprint2022arXiv

M2HF: Multi-level Multi-modal Hybrid Fusion for Text-Video Retrieval

Videos contain multi-modal content, and exploring multi-level cross-modal interactions with natural language queries can provide great prominence to text-video retrieval task (TVR). However, new trending methods applying large-scale pre-trained model CLIP for TVR do not focus on multi-modal cues in videos. Furthermore, the traditional methods simply concatenating multi-modal features do not exploit fine-grained cross-modal information in videos. In this paper, we propose a multi-level multi-modal hybrid fusion (M2HF) network to explore comprehensive interactions between text queries and each modality content in videos. Specifically, M2HF first utilizes visual features extracted by CLIP to early fuse with audio and motion features extracted from videos, obtaining audio-visual fusion features and motion-visual fusion features respectively. Multi-modal alignment problem is also considered in this process. Then, visual features, audio-visual fusion features, motion-visual fusion features, and texts extracted from videos establish cross-modal relationships with caption queries in a multi-level way. Finally, the retrieval outputs from all levels are late fused to obtain final text-video retrieval results. Our framework provides two kinds of training strategies, including an ensemble manner and an end-to-end manner. Moreover, a novel multi-modal balance loss function is proposed to balance the contributions of each modality for efficient end-to-end training. M2HF allows us to obtain state-of-the-art results on various benchmarks, eg, Rank@1 of 64.9\%, 68.2\%, 33.2\%, 57.1\%, 57.8\% on MSR-VTT, MSVD, LSMDC, DiDeMo, and ActivityNet, respectively.

preprint2022arXiv

Optimizing Graphical Procedures for Multiplicity Control in a Confirmatory Clinical Trial via Deep Learning

In confirmatory clinical trials, it has been proposed to use a simple iterative graphical approach to construct and perform intersection hypotheses tests with a weighted Bonferroni-type procedure to control type I errors in the strong sense. Given Phase II study results or other prior knowledge, it is usually of main interest to find the optimal graph that maximizes a certain objective function in a future Phase III study. In this article, we evaluate the performance of two existing derivative-free constrained methods, and further propose a deep learning enhanced optimization framework. Our method numerically approximates the objective function via feedforward neural networks (FNNs) and then performs optimization with available gradient information. It can be constrained so that some features of the testing procedure are held fixed while optimizing over other features. Simulation studies show that our FNN-based approach has a better balance between robustness and time efficiency than some existing derivative-free constrained optimization algorithms. Compared to the traditional stochastic search method, our optimizer has moderate multiplicity adjusted power gain when the number of hypotheses is relatively large. We further apply it to a case study to illustrate how to optimize a multiple testing procedure with respect to a specific study objective.

preprint2022arXiv

RawlsGCN: Towards Rawlsian Difference Principle on Graph Convolutional Network

Graph Convolutional Network (GCN) plays pivotal roles in many real-world applications. Despite the successes of GCN deployment, GCN often exhibits performance disparity with respect to node degrees, resulting in worse predictive accuracy for low-degree nodes. We formulate the problem of mitigating the degree-related performance disparity in GCN from the perspective of the Rawlsian difference principle, which is originated from the theory of distributive justice. Mathematically, we aim to balance the utility between low-degree nodes and high-degree nodes while minimizing the task-specific loss. Specifically, we reveal the root cause of this degree-related unfairness by analyzing the gradients of weight matrices in GCN. Guided by the gradients of weight matrices, we further propose a pre-processing method RawlsGCN-Graph and an in-processing method RawlsGCN-Grad that achieves fair predictive accuracy in low-degree nodes without modification on the GCN architecture or introduction of additional parameters. Extensive experiments on real-world graphs demonstrate the effectiveness of our proposed RawlsGCN methods in significantly reducing degree-related bias while retaining comparable overall performance.

preprint2022arXiv

Understanding the dynamic impact of COVID-19 through competing risk modeling with bivariate varying coefficients

The coronavirus disease 2019 (COVID-19) pandemic has exerted a profound impact on patients with end-stage renal disease relying on kidney dialysis to sustain their lives. Motivated by a request by the U.S. Centers for Medicare & Medicaid Services, our analysis of their postdischarge hospital readmissions and deaths in 2020 revealed that the COVID-19 effect has varied significantly with postdischarge time and time since the onset of the pandemic. However, the complex dynamics of the COVID-19 effect trajectories cannot be characterized by existing varying coefficient models. To address this issue, we propose a bivariate varying coefficient model for competing risks within a cause-specific hazard framework, where tensor-product B-splines are used to estimate the surface of the COVID-19 effect. An efficient proximal Newton algorithm is developed to facilitate the fitting of the new model to the massive Medicare data for dialysis patients. Difference-based anisotropic penalization is introduced to mitigate model overfitting and the wiggliness of the estimated trajectories; various cross-validation methods are considered in the determination of optimal tuning parameters. Hypothesis testing procedures are designed to examine whether the COVID-19 effect varies significantly with postdischarge time and the time since pandemic onset, either jointly or separately. Simulation experiments are conducted to evaluate the estimation accuracy, type I error rate, statistical power, and model selection procedures. Applications to Medicare dialysis patients demonstrate the real-world performance of the proposed methods.

preprint2021arXiv

Cascades between light and heavy fermions in the normal state of magic angle twisted bilayer graphene

We present a framework for understanding the recently observed cascade transitions and the Landau level degeneracies at every integer filling of twisted bilayer graphene. The Coulomb interaction projected onto narrow bands causes the charged excitations at an integer filling to disperse, forming new bands. If the excitation moves the filling away from the charge neutrality point, then it has a band minimum at the moire Brillouin zone center with a small mass that compares well with the experiment; if towards the charge neutrality point, then it has a much larger mass and a higher degeneracy. At a non-zero density away from an integer filling the excitations interact. The system on the small mass side has a large bandwidth and forms a Fermi liquid. On the large mass side the bandwidth is narrow, the compressibility is negative and the Fermi liquid is likely unstable. This explains the observed sawtooth features in compressibility, the Landau fans pointing away from charge neutrality as well as their degeneracies. By providing a description of the charge itineracy in the normal state this framework sets the stage for superconductivity at lower temperatures.

preprint2021arXiv

Correlated Insulating Phases in the Twisted Bilayer Graphene

We review analytical and numerical studies of correlated insulating states in twisted bilayer graphene, focusing on real-space lattice models constructions and their unbiased quantum many-body solutions. We show that by constructing localized Wannier states for the narrow bands, the projected Coulomb interactions can be approximated by interactions of cluster charges with assisted nearest neighbor hopping terms. With the interaction part only, the Hamiltonian is $SU(4)$ symmetric considering both spin and valley degrees of freedom. In the strong coupling limit where the kinetic terms are neglected, the ground states are found to be in the $SU(4)$ manifold with degeneracy. The kinetic terms, treated as perturbation, break this large $SU(4)$ symmetry and propel the appearance of intervalley coherent state, quantum topological insulators and other symmetry-breaking insulating states. We first present the theoretical analysis of moiré lattice model construction and then show how to solve the model with large-scale quantum Monte Carlo simulations in an unbiased manner. We further provide potential directions such that from the real-space model construction and its quantum many-body solutions how the perplexing yet exciting experimental discoveries in the correlation physics of twisted bilayer graphene can be gradually understood. This review will be helpful for the readers to grasp the fast growing field of the model study of twisted bilayer graphene.

preprint2021arXiv

Correlation-induced insulating topological phases at charge neutrality in twisted bilayer graphene

Twisted bilayer graphene (TBG) provides a unique framework to elucidate the interplay between strong correlations and topological phenomena in two-dimensional systems. The existence of multiple electronic degrees of freedom -- charge, spin, and valley -- gives rise to a plethora of possible ordered states and instabilities. Identifying which of them are realized in the regime of strong correlations is fundamental to shed light on the nature of the superconducting and correlated insulating states observed in the TBG experiments. Here, we use unbiased, sign-problem-free quantum Monte Carlo simulations to solve an effective interacting lattice model for TBG at charge neutrality. Besides the usual cluster Hubbard-like repulsion, this model also contains an assisted hopping interaction that emerges due to the non-trivial topological properties of TBG. Such a non-local interaction fundamentally alters the phase diagram at charge neutrality, gapping the Dirac cones even for infinitesimally small interaction. As the interaction strength increases, a sequence of different correlated insulating phases emerge, including a quantum valley Hall state with topological edge states, an intervalley-coherent insulator, and a valence bond solid. The charge-neutrality correlated insulating phases discovered here provide the sought-after reference states needed for a comprehensive understanding of the insulating states at integer fillings and the proximate superconducting states of TBG.

preprint2021arXiv

Topological and nematic superconductivity mediated by ferro-SU(4) fluctuations in twisted bilayer graphene

We propose an SU(4) spin-valley-fermion model to investigate the superconducting instabilities of twisted bilayer graphene (TBG). In this approach, bosonic fluctuations associated with an emergent SU(4) symmetry, corresponding to combined rotations in valley and spin spaces, couple to the low-energy fermions that comprise the flat bands. These fluctuations are peaked at zero wave-vector, reflecting the "ferromagnetic-like" SU(4) ground state recently found in strong-coupling solutions of microscopic models for TBG. Focusing on electronic states related to symmetry-imposed points of the Fermi surface, dubbed here "valley hot-spots" and "van Hove hot-spots", we find that the coupling to the itinerant electrons partially lifts the huge degeneracy of the ferro-SU(4) ground state manifold, favoring inter-valley order, spin-valley coupled order, ferromagnetic order, spin-current order, and valley-polarized order, depending on details of the band structure. These fluctuations, in turn, promote attractive pairing interactions in a variety of closely competing channels, including a nodeless $f$-wave state, a nodal $i$-wave state, and topological $d+id$ and $p+ip$ states with unusual Chern numbers $2$ and $4$, respectively. Nematic superconductivity, although not realized as a primary instability of the system, still appears as a consequence of the near-degeneracy of superconducting order parameters that transform as one-dimensional and two-dimensional irreducible representations of the point group $D_{6}$.

preprint2020arXiv

Bayesian Sparse Mediation Analysis with Targeted Penalization of Natural Indirect Effects

Causal mediation analysis aims to characterize an exposure's effect on an outcome and quantify the indirect effect that acts through a given mediator or a group of mediators of interest. With the increasing availability of measurements on a large number of potential mediators, like the epigenome or the microbiome, new statistical methods are needed to simultaneously accommodate high-dimensional mediators while directly target penalization of the natural indirect effect (NIE) for active mediator identification. Here, we develop two novel prior models for identification of active mediators in high-dimensional mediation analysis through penalizing NIEs in a Bayesian paradigm. Both methods specify a joint prior distribution on the exposure-mediator effect and mediator-outcome effect with either (a) a four-component Gaussian mixture prior or (b) a product threshold Gaussian prior. By jointly modeling the two parameters that contribute to the NIE, the proposed methods enable penalization on their product in a targeted way. Resultant inference can take into account the four-component composite structure underlying the NIE. We show through simulations that the proposed methods improve both selection and estimation accuracy compared to other competing methods. We applied our methods for an in-depth analysis of two ongoing epidemiologic studies: the Multi-Ethnic Study of Atherosclerosis (MESA) and the LIFECODES birth cohort. The identified active mediators in both studies reveal important biological pathways for understanding disease mechanisms.

preprint2020arXiv

Bayesian Symbolic Regression

Interpretability is crucial for machine learning in many scenarios such as quantitative finance, banking, healthcare, etc. Symbolic regression (SR) is a classic interpretable machine learning method by bridging X and Y using mathematical expressions composed of some basic functions. However, the search space of all possible expressions grows exponentially with the length of the expression, making it infeasible for enumeration. Genetic programming (GP) has been traditionally and commonly used in SR to search for the optimal solution, but it suffers from several limitations, e.g. the difficulty in incorporating prior knowledge; overly-complicated output expression and reduced interpretability etc. To address these issues, we propose a new method to fit SR under a Bayesian framework. Firstly, Bayesian model can naturally incorporate prior knowledge (e.g., preference of basis functions, operators and raw features) to improve the efficiency of fitting SR. Secondly, to improve interpretability of expressions in SR, we aim to capture concise but informative signals. To this end, we assume the expected signal has an additive structure, i.e., a linear combination of several concise expressions, whose complexity is controlled by a well-designed prior distribution. In our setup, each expression is characterized by a symbolic tree, and the proposed SR model could be solved by sampling symbolic trees from the posterior distribution using an efficient Markov chain Monte Carlo (MCMC) algorithm. Finally, compared with GP, the proposed BSR(Bayesian Symbolic Regression) method saves computer memory with no need to keep an updated 'genome pool'. Numerical experiments show that, compared with GP, the solutions of BSR are closer to the ground truth and the expressions are more concise. Meanwhile we find the solution of BSR is robust to hyper-parameter specifications such as the number of trees.

preprint2020arXiv

Crystalline Nodal Topological Superconductivity and Bogolyubov Fermi Surfaces in Monolayer NbSe$_2$

We present a microscopic calculation of the phase diagram of the Ising superconductor NbSe$_{2}$ in presence of both in-plane magnetic field and Rashba spin-orbit coupling (SOC). Repulsive interactions lead to two distinct instabilities, in singlet- and triplet- interaction channels. While we recover the previously predicted nodal topological superconducting state in the absence of Rashba SOC at large magnetic field with six pairs of nodes along $Γ$-$M$ lines, a finite Rashba SOC breaks the symmetry that protects these nodes and therefore generally lifts them, resulting in a topologically trivial phase. There is an exception when the field is applied along one of the three $Γ$-$K$ lines, however. In that case, a single mirror symmetry remains that can protect two pairs of nodes out of the original six, resulting in a \emph{crystalline} topological superconducting phase. Depending on the Cooper pairs' center-of-mass momentum, this superconducting state displays either Bogolyubov Fermi surfaces or point nodes. Moreover, a chiral topological superconducting phase with Chern number of 6 is realized in the regime of large Rashba SOC and dominant triplet interactions, spontaneously breaking time-reversal symmetry.

preprint2020arXiv

Deep Learning for Image Search and Retrieval in Large Remote Sensing Archives

This chapter presents recent advances in content based image search and retrieval (CBIR) systems in remote sensing (RS) for fast and accurate information discovery from massive data archives. Initially, we analyze the limitations of the traditional CBIR systems that rely on the hand-crafted RS image descriptors. Then, we focus our attention on the advances in RS CBIR systems for which deep learning (DL) models are at the forefront. In particular, we present the theoretical properties of the most recent DL based CBIR systems for the characterization of the complex semantic content of RS images. After discussing their strengths and limitations, we present the deep hashing based CBIR systems that have high time-efficient search capability within huge data archives. Finally, the most promising research directions in RS CBIR are discussed.

preprint2020arXiv

Learning Convolutional Sparse Coding on Complex Domain for Interferometric Phase Restoration

Interferometric phase restoration has been investigated for decades and most of the state-of-the-art methods have achieved promising performances for InSAR phase restoration. These methods generally follow the nonlocal filtering processing chain aiming at circumventing the staircase effect and preserving the details of phase variations. In this paper, we propose an alternative approach for InSAR phase restoration, i.e. Complex Convolutional Sparse Coding (ComCSC) and its gradient regularized version. To our best knowledge, this is the first time that we solve the InSAR phase restoration problem in a deconvolutional fashion. The proposed methods can not only suppress interferometric phase noise, but also avoid the staircase effect and preserve the details. Furthermore, they provide an insight of the elementary phase components for the interferometric phases. The experimental results on synthetic and realistic high- and medium-resolution datasets from TerraSAR-X StripMap and Sentinel-1 interferometric wide swath mode, respectively, show that our method outperforms those previous state-of-the-art methods based on nonlocal InSAR filters, particularly the state-of-the-art method: InSAR-BM3D. The source code of this paper will be made publicly available for reproducible research inside the community.

preprint2020arXiv

Learning Shared Cross-modality Representation Using Multispectral-LiDAR and Hyperspectral Data

Due to the ever-growing diversity of the data source, multi-modality feature learning has attracted more and more attention. However, most of these methods are designed by jointly learning feature representation from multi-modalities that exist in both training and test sets, yet they are less investigated in absence of certain modality in the test phase. To this end, in this letter, we propose to learn a shared feature space across multi-modalities in the training process. By this way, the out-of-sample from any of multi-modalities can be directly projected onto the learned space for a more effective cross-modality representation. More significantly, the shared space is regarded as a latent subspace in our proposed method, which connects the original multi-modal samples with label information to further improve the feature discrimination. Experiments are conducted on the multispectral-Lidar and hyperspectral dataset provided by the 2018 IEEE GRSS Data Fusion Contest to demonstrate the effectiveness and superiority of the proposed method in comparison with several popular baselines.

preprint2020arXiv

Multipass SAR Interferometry Based on Total Variation Regularized Robust Low Rank Tensor Decomposition

Multipass SAR interferometry (InSAR) techniques based on meter-resolution spaceborne SAR satellites, such as TerraSAR-X or COSMO-Skymed, provide 3D reconstruction and the measurement of ground displacement over large urban areas. Conventional method such as Persistent Scatterer Interferometry (PSI) usually requires a fairly large SAR image stack (usually in the order of tens), in order to achieve reliable estimates of these parameters. Recently, low rank property in multipass InSAR data stack was explored and investigated in our previous work. By exploiting this low rank prior, more accurate estimation of the geophysical parameters can be achieved, which in turn can effectively reduce the number of interferograms required for a reliable estimation. Based on that, this paper proposes a novel tensor decomposition method in complex domain, which jointly exploits low rank and variational prior of the interferometric phase in InSAR data stacks. Specifically, a total variation (TV) regularized robust low rank tensor decomposition method is exploited for recovering outlier-free InSAR stacks. We demonstrate that the filtered InSAR data stacks can greatly improve the accuracy of geophysical parameters estimated from real data. Moreover, this paper demonstrates for the first time in the community that tensor-decomposition-based methods can be beneficial for large-scale urban mapping problems using multipass InSAR. Two TerraSAR-X data stacks with large spatial areas demonstrate the promising performance of the proposed method.

preprint2020arXiv

Non-Abelian Dirac node braiding and near-degeneracy of correlated phases at odd integer filling in magic angle twisted bilayer graphene

We use the DMRG to study the correlated electron states favored by the Coulomb interaction projected onto the narrow bands of twisted bilayer graphene within a spinless one-valley model. The Hilbert space of the narrow bands is constructed from a pair of hybrid Wannier states with opposite Chern numbers. Depending on the parameters in the BM model, the DMRG in this basis determines the ground state at one particle per unit cell to be either QAH state or a state with no Hall effect which is nearly a product state. Based on this form, we then apply the variational method to study their competition, thus identifying three states: the QAH, a gapless $C_2T$ symmetric nematic, and a gapped $C_2T$ symmetric stripe. All three states are nearly degenerate at the realistic parameters of the BM model. The single particle spectrum of the nematic contains either a quadratic node or two close Dirac nodes near $Γ$. Motivated by the Landau level degeneracy found in this state, we propose it to be the state observed at the charge neutrality point once spin and valley degeneracies are restored. The optimal period for the $C_2T$ stripe state is found to be $2$ unit cells. In addition, using the fact that the topological charge of the nodes in the $C_2T$ nematic phase is no longer described simply by their winding numbers once the translation symmetry is broken, but rather by certain elements of a non-Abelian group that was recently pointed out, we identify the mechanism of the gap opening within the $C_2T$ stripe state. Although the nodes at the Fermi energy are locally stable, they can be annihilated after braiding with other nodes connecting them to adjacent (folded) bands. Therefore, if the translation symmetry is broken, the gap at one particle per unit cell can open even if the system preserves the $C_2T$ and valley $U(1)$ symmetries, and the gap to remote bands remains open.

preprint2020arXiv

Statistical Inference for High-Dimensional Vector Autoregression with Measurement Error

High-dimensional vector autoregression with measurement error is frequently encountered in a large variety of scientific and business applications. In this article, we study statistical inference of the transition matrix under this model. While there has been a large body of literature studying sparse estimation of the transition matrix, there is a paucity of inference solutions, especially in the high-dimensional scenario. We develop inferential procedures for both the global and simultaneous testing of the transition matrix. We first develop a new sparse expectation-maximization algorithm to estimate the model parameters, and carefully characterize their estimation precisions. We then construct a Gaussian matrix, after proper bias and variance corrections, from which we derive the test statistics. Finally, we develop the testing procedures and establish their asymptotic guarantees. We study the finite-sample performance of our tests through intensive simulations, and illustrate with a brain connectivity analysis example.

preprint2019arXiv

Minorization-Maximization-based Steepest Ascent for Large-scale Survival Analysis with Time-Varying Effects: Application to the National Kidney Transplant Dataset

The time-varying effects model is a flexible and powerful tool for modeling the dynamic changes of covariate effects. However, in survival analysis, its computational burden increases quickly as the number of sample sizes or predictors grows. Traditional methods that perform well for moderate sample sizes and low-dimensional data do not scale to massive data. Analysis of national kidney transplant data with a massive sample size and large number of predictors defy any existing statistical methods and software. In view of these difficulties, we propose a Minorization-Maximization-based steepest ascent procedure for estimating the time-varying effects. Leveraging the block structure formed by the basis expansions, the proposed procedure iteratively updates the optimal block-wise direction along which the approximate increase in the log-partial likelihood is maximized. The resulting estimates ensure the ascent property and serve as refinements of the previous step. The performance of the proposed method is examined by simulations and applications to the analysis of national kidney transplant data.

preprint2016arXiv

Classification with Ultrahigh-Dimensional Features

Although much progress has been made in classification with high-dimensional features \citep{Fan_Fan:2008, JGuo:2010, CaiSun:2014, PRXu:2014}, classification with ultrahigh-dimensional features, wherein the features much outnumber the sample size, defies most existing work. This paper introduces a novel and computationally feasible multivariate screening and classification method for ultrahigh-dimensional data. Leveraging inter-feature correlations, the proposed method enables detection of marginally weak and sparse signals and recovery of the true informative feature set, and achieves asymptotic optimal misclassification rates. We also show that the proposed procedure provides more powerful discovery boundaries compared to those in \citet{CaiSun:2014} and \citet{JJin:2009}. The performance of the proposed procedure is evaluated using simulation studies and demonstrated via classification of patients with different post-transplantation renal functional types.

preprint2016arXiv

Conditional Screening for Ultra-high Dimensional Covariates with Survival Outcomes

Identifying important biomarkers that are predictive for cancer patients' prognosis is key in gaining better insights into the biological influences on the disease and has become a critical component of precision medicine. The emergence of large-scale biomedical survival studies, which typically involve excessive number of biomarkers, has brought high demand in designing efficient screening tools for selecting predictive biomarkers. The vast amount of biomarkers defies any existing variable selection methods via regularization. The recently developed variable screening methods, though powerful in many practical setting, fail to incorporate prior information on the importance of each biomarker and are less powerful in detecting marginally weak while jointly important signals. We propose a new conditional screening method for survival outcome data by computing the marginal contribution of each biomarker given priorly known biological information. This is based on the premise that some biomarkers are known to be associated with disease outcomes a priori. Our method possesses sure screening properties and a vanishing false selection rate. The utility of the proposal is further confirmed with extensive simulation studies and analysis of a Diffuse large B-cell lymphoma (DLBCL) dataset.

preprint2016arXiv

High Dimensional Tests for Functional Networks of Brain Anatomic Regions

There has been increasing interests in learning resting-state brain functional connectivity of autism disorders using functional magnetic resonance imaging (fMRI) data. The data in a standard brain template consist of over 200,000 voxel specific time series for each single subject. Such an ultra-high dimensionality of data makes the voxel-level functional connectivity analysis (involving four billion voxel pairs) lack of power and extremely inefficient. In this work, we introduce a new framework to identify functional brain network at brain anatomic region-level for each individual. We propose two pairwise tests to detect region dependence, and one multiple testing procedure to identify global structures of the network. The limiting null distributions of the test statistics are derived. It is also shown that the tests are rate optimal when the alternative networks are sparse. The numerical studies show the proposed tests are valid and powerful. We apply our method to a resting-state fMRI study on autism and identify patient-unique and control-unique hub regions. These findings are consistent with autism clinical symptoms.

preprint2016arXiv

Robustness of quantum critical pairing against disorder

The remarkable robustness of high-temperature superconductors against disorder remains a controversial obstacle towards the elucidation of their pairing state. Indeed, experiments report a weak suppression rate of the transition temperature $T_{c}$ with disorder, significantly smaller than the universal value predicted by extensions of the conventional theory of dirty superconductors. However, in many high-$T_{c}$ compounds, superconductivity appears near a putative magnetic quantum critical point, suggesting that quantum fluctuations, which suppress coherent electronic spectral weight, may also promote unconventional pairing. Here we investigate theoretically the impact of disorder on such a quantum critical pairing state, considering the coupling of impurities both to the low-energy electronic states and to the pairing interaction itself. We find a significant reduction in the suppression rate of $T_{c}$ with disorder near the magnetic quantum critical point, shedding new light on the nature of unconventional superconductivity in correlated materials.

preprint2016arXiv

Scalar-on-Image Regression via the Soft-Thresholded Gaussian Process

The focus of this work is on spatial variable selection for scalar-on-image regression. We propose a new class of Bayesian nonparametric models, soft-thresholded Gaussian processes and develop the efficient posterior computation algorithms. Theoretically, soft-thresholded Gaussian processes provide large prior support for the spatially varying coefficients that enjoy piecewise smoothness, sparsity and continuity, characterizing the important features of imaging data. Also, under some mild regularity conditions, the soft-thresholded Gaussian process leads to the posterior consistency for both parameter estimation and variable selection for scalar-on-image regression, even when the number of true predictors is larger than the sample size. The proposed method is illustrated via simulations, compared numerically with existing alternatives and applied to Electroencephalography (EEG) study of alcoholism.

preprint2016arXiv

Spin anisotropy due to spin-orbit coupling in optimally hole-doped Ba$_{0.67}$K$_{0.33}$Fe$_{2}$As$_{2}$

We use polarized inelastic neutron scattering to study the temperature and energy dependence of spin space anisotropies in the optimally hole-doped iron pnictide Ba$_{0.67}$K$_{0.33}$Fe$_{2}$As$_{2}$ ($T_{\rm c}=38$ K). In the superconducting state, while the high-energy part of the magnetic spectrum is nearly isotropic, the low-energy part displays a pronouced anisotropy, manifested by a $c$-axis polarized resonance. We also observe that the spin anisotropy in superconducting Ba$_{0.67}$K$_{0.33}$Fe$_{2}$As$_{2}$ extends to higher energies compared to electron-doped BaFe$_{2-x}TM_{x}$As$_{2}$ ($TM=$Co, Ni) and isovalent-doped BaFe$_{2}$As$_{1.4}$P$_{0.6}$, suggesting a connection between $T_{\rm c}$ and the energy scale of the spin anisotropy. In the normal state, the low-energy spin anisotropy for optimally hole- and electron-doped iron pnictides onset at temperatures similar to the temperatures at which the elastoresistance deviate from Curie-Weiss behavior, pointing to a possible connection between the two phenomena. Our results highlight the relevance of the spin-orbit coupling to the superconductivity of the iron pnictides.

preprint2016arXiv

Spin-Driven Nematic Instability of the Multi-Orbital Hubbard Model: Application to Iron-Based Superconductors

Nematic order resulting from the partial melting of density-waves has been proposed as the mechanism to explain nematicity in iron-based superconductors. An outstanding question, however, is whether the microscopic electronic model for these systems -- the multi-orbital Hubbard model -- displays such an ordered state as its leading instability. In contrast to usual electronic instabilities, such as magnetic and charge order, this fluctuation-driven phenomenon cannot be captured by the standard RPA method. Here, by including fluctuations beyond RPA in the multi-orbital Hubbard model, we derive its nematic susceptibility and contrast it with its ferro-orbital order susceptibility, showing that its leading instability is the spin-driven nematic phase. Our results also demonstrate the primary role played by the $d_{xy}$ orbital in driving the nematic transition, and reveal that high-energy magnetic fluctuations are essential to stabilize nematic order in the absence of magnetic order.

preprint2016arXiv

Superconductivity in FeSe thin films driven by the interplay between nematic fluctuations and spin-orbit coupling

The origin of the high-temperature superconducting state observed in FeSe thin films, whose phase diagram displays no sign of magnetic order, remains a hotly debated topic. Here we investigate whether fluctuations arising due to the proximity to a nematic phase, which is observed in the phase diagram of this material, can promote superconductivity. We find that nematic fluctuations alone promote a highly degenerate pairing state, in which both $s$-wave and $d$-wave symmetries are equally favored, and $T_{c}$ is consequently suppressed. However, the presence of a sizable spin-orbit coupling or inversion symmetry-breaking at the film interface lifts this harmful degeneracy and selects the $s$-wave state, in agreement with recent experimental proposals. The resulting gap function displays a weak anisotropy, which agrees with experiments in monolayer FeSe and intercalated Li$_{1-x}$(OH)$_{x}$FeSe.

preprint2015arXiv

Direct evidence for the emergence of a pressure induced nodal superconducting gap in the iron-based superconductor Ba_0.65Rb_0.35Fe_2As_2

Identifying the superconducting (SC) gap structure of the iron-based high-temperature superconductors (Fe-HTS's) remains a key issue for the understanding of superconductivity in these materials. In contrast to other unconventional superconductors, in the Fe-HTS's both $d$-wave and extended s-wave pairing symmetries are close in energy, with the latter believed to be generally favored over the former. Probing the proximity between these very different SC states and identifying experimental parameters that can tune them, are of central interest. Here we report high-pressure muon spin rotation experiments on the temperature-dependent magnetic penetration depth (lambda) in the optimally doped Fe-HTS Ba_0.65Rb_0.35Fe_2As_2. At ambient pressure this material is known to be a nodeless s-wave superconductor. Upon pressure a strong decrease of (lambda) is observed, while the SC transition temperature remains nearly constant. More importantly, the low-temperature behavior of (1/lambda^{2}) changes from exponential saturation at zero pressure to a power-law with increasing pressure, providing unambiguous evidence that hydrostatic pressure promotes nodal SC gaps. Comparison to microscopic models favors a d-wave over a nodal s^{+-}-wave pairing as the origin of the nodes. Our results provide a new route of understanding the complex topology of the SC gap in Fe-HTS's.

preprint2015arXiv

Magnetic order without tetragonal-symmetry-breaking in iron arsenides: microscopic mechanism and spin-wave spectrum

Most iron-based superconductors undergo a transition to a magnetically ordered state characterized by staggered stripes of parallel spins. With ordering vectors $(π,0)$ or $(0,π)$, this magnetic state breaks the high-temperature tetragonal symmetry of the system, which is manifested by a splitting of the lattice Bragg peaks. Remarkably, recent experiments in hole-doped iron arsenides reported an ordered state that displays magnetic Bragg peaks at $(π,0)$ and $(0,π)$ but remains tetragonal. Despite being inconsistent with a magnetic stripe configuration, this unusual magnetic phase can be described in terms of a double-$\mathbf{Q}$ magnetic structure consisting of an equal-weight superposition of the ordering vectors $(π,0)$ and $(0,π)$. Here we show that a non-collinear double-$\mathbf{Q}$ magnetic configuration, dubbed \emph{orthomagnetic}, arises naturally within an itinerant three-band microscopic model for the iron pnictides. In particular, we find that strong deviations from perfect nesting and residual interactions between the electron pockets favor the orthomagnetic over the stripe magnetic state. Using an effective low-energy model, we also calculate the spin-wave spectrum of the orthomagnetic state. In contrast to the stripe state, there are three Goldstone modes, manifested in all diagonal and one off-diagonal component of the spin-spin correlation function. The total magnetic structure factor displays two anisotropic spin-wave branches emerging from both $(π,0)$ and $(0,π)$ momenta, in contrast to the case of domains of stripe order, where only one spin-wave branch emerges from each momentum. We propose that these unique features of the orthomagnetic state can be used to unambiguously distinguish it from the stripe state via neutron scattering experiments, and discuss the implications of its existence to the nature of the magnetism of the iron arsenides.

preprint2015arXiv

Phenomenological theory of the superconducting state inside the hidden-order phase of URu$_{2}$Si$_{2}$

Recent experiments have unveiled important properties of the ground state of the elusive heavy fermion $\mathrm{URu_{2}Si_{2}}$. While tetragonal symmetry-breaking was reported below the hidden-order (HO) transition at $T_{HO}\approx17.5$ K, time-reversal symmetry-breaking was observed below the superconducting transition temperature $T_{c}<T_{HO}$. Although the latter results have been used to argue in favor of a chiral $d+id$ superconducting state, such an order parameter is incompatible with broken tetragonal symmetry. Here, we employ a phenomenological model to investigate the properties of a chiral superconducting state that develops inside the hidden-order phase. In this case, there are actually two superconducting transition temperatures: while $T_{c}$ marks a normal-state to superconducting transition, $T_{c}^{*}<T_{c}$ signals a superconducting-to-superconducting transition in which time-reversal symmetry is broken. In the phase $T_{c}^{*}<T<T_{c}$, the low-energy density of states $ρ\left(ω\right)$ is enhanced due to the crossing of two nodal lines, giving rise to an unusual $ω\log(ω)$ dependence of $ρ\left(ω\right)$, which is manifested in several thermodynamic properties. We also investigate the emergence of a soft amplitude gap mode near $T_{c}^{*}$. In contrast to the usual amplitude mode near a regular normal-state to superconducting transition, this mode becomes soft near a superconducting-to-superconducting transition, which in principle allows for its detection by Raman spectroscopy. Finally, we investigate the impact of twin domains on the anisotropic properties of the superconducting state, and propose experiments in mechanically strained samples to explore the interplay between hidden order and superconductivity in $\mathrm{URu_{2}Si_{2}}$.

preprint2015arXiv

Spin reorientation driven by the interplay between spin-orbit coupling and Hund's rule coupling in iron pnictides

In most magnetically-ordered iron pnictides, the magnetic moments lie in the FeAs planes, parallel to the modulation direction of the spin stripes. However, recent experiments in hole-doped iron pnictides have observed a reorientation of the magnetic moments from in-plane to out-of-plane. Interestingly, this reorientation is accompanied by a change in the magnetic ground state from a stripe antiferromagnet to a tetragonal non-uniform magnetic configuration. Motivated by these recent observations, here we investigate the origin of the spin anisotropy in iron pnictides using an itinerant microscopic electronic model that respects all the symmetry properties of a single FeAs plane. We find that the interplay between the spin-orbit coupling and the Hund's rule coupling can account for the observed spin anisotropies, including the spin reorientation in hole-doped pnictides, without the need to invoke orbital or nematic order. Our calculations also reveal an asymmetry between the magnetic ground states of electron- and hole-doped compounds, with only the latter displaying tetragonal magnetic states.

preprint2015arXiv

Thresholded Multiscale Gaussian Processes with Application to Bayesian Feature Selection for Massive Neuroimaging Data

Motivated by the needs of selecting important features for massive neuroimaging data, we propose a spatially varying coefficient model (SVCMs) with sparsity and piecewise smoothness imposed on the coefficient functions. A new class of nonparametric priors is developed based on thresholded multiresolution Gaussian processes (TMGP). We show that the TMGP has a large support on a space of sparse and piecewise smooth functions, leading to posterior consistency in coefficient function estimation and feature selection. Also, we develop a method for prior specifications of thresholding parameters in TMGPs and discuss their theoretical properties. Efficient posterior computation algorithms are developed by adopting a kernel convolution approach, where a modified square exponential kernel is chosen taking the advantage that the analytical form of the eigen decomposition is available. Based on simulation studies, we demonstrate that our methods can achieve better performance in estimating the spatially varying coefficient. Also, the proposed model has been applied to an analysis of resting state functional magnetic resonance imaging (Rs-fMRI) data from the Autism Brain Imaging Data Exchange (ABIDE) study, it provides biologically meaningful results.

preprint2014arXiv

A Bayesian hierarchical spatial point process model for multi-type neuroimaging meta-analysis

Neuroimaging meta-analysis is an important tool for finding consistent effects over studies that each usually have 20 or fewer subjects. Interest in meta-analysis in brain mapping is also driven by a recent focus on so-called "reverse inference": where as traditional "forward inference" identifies the regions of the brain involved in a task, a reverse inference identifies the cognitive processes that a task engages. Such reverse inferences, however, require a set of meta-analysis, one for each possible cognitive domain. However, existing methods for neuroimaging meta-analysis have significant limitations. Commonly used methods for neuroimaging meta-analysis are not model based, do not provide interpretable parameter estimates, and only produce null hypothesis inferences; further, they are generally designed for a single group of studies and cannot produce reverse inferences. In this work we address these limitations by adopting a nonparametric Bayesian approach for meta-analysis data from multiple classes or types of studies. In particular, foci from each type of study are modeled as a cluster process driven by a random intensity function that is modeled as a kernel convolution of a gamma random field. The type-specific gamma random fields are linked and modeled as a realization of a common gamma random field, shared by all types, that induces correlation between study types and mimics the behavior of a univariate mixed effects model. We illustrate our model on simulation studies and a meta-analysis of five emotions from 219 studies and check model fit by a posterior predictive assessment. In addition, we implement reverse inference by using the model to predict study type from a newly presented study. We evaluate this predictive performance via leave-one-out cross-validation that is efficiently implemented using importance sampling techniques.

preprint2014arXiv

A Bayesian nonparametric mixture model for selecting genes and gene subnetworks

It is very challenging to select informative features from tens of thousands of measured features in high-throughput data analysis. Recently, several parametric/regression models have been developed utilizing the gene network information to select genes or pathways strongly associated with a clinical/biological outcome. Alternatively, in this paper, we propose a nonparametric Bayesian model for gene selection incorporating network information. In addition to identifying genes that have a strong association with a clinical outcome, our model can select genes with particular expressional behavior, in which case the regression models are not directly applicable. We show that our proposed model is equivalent to an infinity mixture model for which we develop a posterior computation algorithm based on Markov chain Monte Carlo (MCMC) methods. We also propose two fast computing algorithms that approximate the posterior simulation with good accuracy but relatively low computational cost. We illustrate our methods on simulation studies and the analysis of Spellman yeast cell cycle microarray data.

preprint2014arXiv

Interplay between tetragonal magnetic order, stripe magnetism, and superconductivity in iron-based materials

Motivated by recent experiments in Ba$_{1-x}$K$_{x}$Fe$_{2}$As$_{2}$ {[}A. E. Böhmer \textit{et al}, to be published{]}, we analyze the type of spin-density wave (SDW) order in doped iron-pnictides and the discontinuities of the superconducting transition temperature $T_{c}$ in the coexistence phase with SDW magnetism. By tracking the magnetic transition line $T_{N}(x)$ towards optimal doping within an itinerant fermionic model, we find a sequence of transitions from the stripe-orthorhombic ($C_{2}$) SDW order to the tetragonal ($C_{4}$) order and then back to the $C_{2}$ order. We argue that the superconducting $T_{c}$ has two discontinuities -- it jumps to a smaller value upon entering the coexistence region with the $C_{4}$ magnetic phase, and then jumps to a larger value inside the SDW state when it crosses the boundary between the $C_{4}$ and $C_{2}$ SDW orders. The full agreement with the experimental phase diagram provides a strong indication that the itinerant approach is adequate to describe the physics of weakly/moderately doped iron-pnictides.

preprint2014arXiv

Magnetoresistance due to Broken C4 Symmetry in Cubic B20 Chiral Magnets

The B20 chiral magnets with broken inversion symmetry and C4 rotation symmetry have attracted much attention. The broken inversion symmetry leads to the Dzyaloshinskii-Moriya that gives rise to the helical and Skyrmion states. We report the unusual magnetoresistance (MR) of B20 chiral magnet Fe0.85Co0.15Si that directly reveals the broken C4 rotation symmetry. We present a microscopic theory, a minimal theory with two spin-orbit terms, that satisfies all the symmetry requirements and accounts for the transport experiments.

preprint2014arXiv

Manipulation of gap nodes by uniaxial strain in iron-based superconductors

In the iron pnictides and chalcogenides, multiple orbitals participate in the superconducting state, enabling different gap structures to be realized in distinct materials. Here we argue that the spectral weights of these orbitals can in principle be controlled by a tetragonal symmetry-breaking uniaxial strain, due to the enhanced nematic susceptibility of many iron-based superconductors. By investigating multi-orbital microscopic models in the presence of orbital order, we show that not only $T_{c}$ can be enhanced, but pairs of accidental gap nodes can be annihilated and created in the Fermi surface by an increasing external strain. We explain our results as a mixture of nearly-degenerate superconducting states promoted by strain, and show that the annihilation and creation of nodes can be detected experimentally via anisotropic penetration depth measurements. Our results provide a promising framework to externally control the superconducting properties of iron-based materials.

preprint2014arXiv

Transport Theory of Metallic B20 Helimagnets

B20 compounds are a class of cubic helimagnets harboring nontrivial spin textures such as spin helices and skyrmions. It has been well understood that the Dzyaloshinskii-Moriya (DM) interaction is the origin of these textures, and the physics behind the DM interaction is the spin-orbital coupling (SOC). However the SOC shows its effect not only on the spins, but also on the electrons. In this paper, we will discuss effects of the SOC on the electron and spin transports in B20 compounds. An effective Hamiltonian is presented from symmetry analysis, and the spin-orbital coupling therein shows anomalous behaviors in anisotropic magnetoresistance (AMR) and helical resistance. New effects such as inverse spin-galvanic effect is proposed, and the origin of the DM interaction is discussed.

preprint2014arXiv

Universal Ratio of Intrinsic Resistivities of Spin Helix in B20 (Fe-Co)Si Magnets

The B20 magnets with the Dzyaloshinskii-Moriya (D-M) interaction exhibit spin helix and Skyrmion spin textures unattainable in traditional Heisenberg ferromagnets. We have determined the intrinsic resistivity of the spin helix, which is a macroscopic Bloch domain wall, in B20 (Fe-Co)Si magnets. We found a universal resistance ratio of gamma = 1.35 with current parallel and perpendicular to the helix, independent of composition and temperature. This gamma value is much smaller than 3, the well-known minimum value for domain wall resistivity in traditional ferromagnets, due to the significant spin-orbit coupling in the B20 magnets.

preprint2012arXiv

Dimer Impurity Scattering, "Reconstructed" Nesting and Density-Wave Diagnostics in Iron Pnictides

While the impurity-induced nanoscale electronic disorder has been extensively reported in the underdoped iron pnictides, its microscopic origins remain elusive. Recent scanning tunneling microscopy (STM) measurements reveal a dimer-type resonant structure induced by cobalt doping. These dimers are randomly distributed but uniformly aligned with the antiferromagnetic a axis. A theory of the impurity-induced quasiparticle interference patterns is presented that shows the local density of states developing an oscillatory pattern characterized by both geometry and orbital content of the {\em reconstructed} Fermi pockets, occasioned by the pocket density-wave (PoDW) order along the b axis. This pattern breaks the $C_4$ symmetry and its size and orientation compare well with the dimer resonances found in the STM experiments, hinting at the presence of a "hidden" PoDW order. More broadly, our theory spotlights such nanoscale structures as a useful diagnostic tool for various forms of order in iron pnictides.

preprint2011arXiv

Theory of Valley-Density Wave and Hidden Order in Iron-Pnictides

In the limit of perfect nesting, the physics of iron-pnictides is governed by the density wave formation at the zone-edge vector M. At high energies, various spin- (SDW), charge- (CDW), orbital/pocket- (PDW) density waves, and their linear combinations, all appear equally likely, unified within the unitary order parameter of U(4)XU(4) symmetry. Nesting imperfections and low-energy interactions reduce this symmetry to that of real materials. Nevertheless, the generic ground state preserves a distinct signature of its highly symmetric origins: a SDW along one axis of the iron lattice is predicted to coexist with a perpendicular PDW, accompanied by weak charge currents. This "hidden" order induces the structural transition in our theory, naturally insures T_s >= T_N, and leads to orbital ferromagnetism and other observable consequences.

Jian Kang

What is connected

Connect this record

See the researcher in context

Building this map preview

53 published item(s)

A Unified Spoken Language Model with Injected Emotional-Attribution Thinking for Human-like Interaction

TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios

Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation

InfoFair: Information-Theoretic Intersectional Fairness

A Soft-Thresholding Operator for Sparse Time-Varying Effects in Survival Models

Adversarial Sample Detection for Speaker Verification by Neural Vocoders

Bayesian learning of COVID-19 Vaccine safety while incorporating Adverse Events ontology

Deep Historical Borrowing Framework to Prospectively and Simultaneously Synthesize Control Information in Confirmatory Clinical Trials with Multiple Endpoints

Finite-Sample Two-Group Composite Hypothesis Testing via Machine Learning

Image Response Regression via Deep Neural Networks

Individualized Risk Assessment of Preoperative Opioid Use by Interpretable Neural Network Regression

Leveraging Phone Mask Training for Phonetic-Reduction-Robust E2E Uyghur Speech Recognition

Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition

M2HF: Multi-level Multi-modal Hybrid Fusion for Text-Video Retrieval

Optimizing Graphical Procedures for Multiplicity Control in a Confirmatory Clinical Trial via Deep Learning

RawlsGCN: Towards Rawlsian Difference Principle on Graph Convolutional Network

Understanding the dynamic impact of COVID-19 through competing risk modeling with bivariate varying coefficients

Cascades between light and heavy fermions in the normal state of magic angle twisted bilayer graphene

Correlated Insulating Phases in the Twisted Bilayer Graphene

Correlation-induced insulating topological phases at charge neutrality in twisted bilayer graphene

Topological and nematic superconductivity mediated by ferro-SU(4) fluctuations in twisted bilayer graphene

Bayesian Sparse Mediation Analysis with Targeted Penalization of Natural Indirect Effects

Bayesian Symbolic Regression

Crystalline Nodal Topological Superconductivity and Bogolyubov Fermi Surfaces in Monolayer NbSe$_2$

Deep Learning for Image Search and Retrieval in Large Remote Sensing Archives

Learning Convolutional Sparse Coding on Complex Domain for Interferometric Phase Restoration

Learning Shared Cross-modality Representation Using Multispectral-LiDAR and Hyperspectral Data

Multipass SAR Interferometry Based on Total Variation Regularized Robust Low Rank Tensor Decomposition

Non-Abelian Dirac node braiding and near-degeneracy of correlated phases at odd integer filling in magic angle twisted bilayer graphene

Statistical Inference for High-Dimensional Vector Autoregression with Measurement Error

Minorization-Maximization-based Steepest Ascent for Large-scale Survival Analysis with Time-Varying Effects: Application to the National Kidney Transplant Dataset

Classification with Ultrahigh-Dimensional Features

Conditional Screening for Ultra-high Dimensional Covariates with Survival Outcomes

High Dimensional Tests for Functional Networks of Brain Anatomic Regions

Robustness of quantum critical pairing against disorder

Scalar-on-Image Regression via the Soft-Thresholded Gaussian Process

Spin anisotropy due to spin-orbit coupling in optimally hole-doped Ba$_{0.67}$K$_{0.33}$Fe$_{2}$As$_{2}$

Spin-Driven Nematic Instability of the Multi-Orbital Hubbard Model: Application to Iron-Based Superconductors

Superconductivity in FeSe thin films driven by the interplay between nematic fluctuations and spin-orbit coupling

Direct evidence for the emergence of a pressure induced nodal superconducting gap in the iron-based superconductor Ba_0.65Rb_0.35Fe_2As_2

Magnetic order without tetragonal-symmetry-breaking in iron arsenides: microscopic mechanism and spin-wave spectrum

Phenomenological theory of the superconducting state inside the hidden-order phase of URu$_{2}$Si$_{2}$

Spin reorientation driven by the interplay between spin-orbit coupling and Hund's rule coupling in iron pnictides

Thresholded Multiscale Gaussian Processes with Application to Bayesian Feature Selection for Massive Neuroimaging Data

A Bayesian hierarchical spatial point process model for multi-type neuroimaging meta-analysis

A Bayesian nonparametric mixture model for selecting genes and gene subnetworks

Interplay between tetragonal magnetic order, stripe magnetism, and superconductivity in iron-based materials

Magnetoresistance due to Broken C4 Symmetry in Cubic B20 Chiral Magnets

Manipulation of gap nodes by uniaxial strain in iron-based superconductors

Transport Theory of Metallic B20 Helimagnets

Universal Ratio of Intrinsic Resistivities of Spin Helix in B20 (Fe-Co)Si Magnets

Dimer Impurity Scattering, "Reconstructed" Nesting and Density-Wave Diagnostics in Iron Pnictides

Theory of Valley-Density Wave and Hidden Order in Iron-Pnictides