Researcher profile

Tao Xie

Tao Xie contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 (Emerging) | Verification L1 | Unclaimed author
16 works
0 followers
11 topics
4 close collaborators

Published work

16 published item(s)

preprint, 2026, arXiv

TFEC: Multivariate Time-Series Clustering via Temporal-Frequency Enhanced Contrastive Learning

Multivariate Time-Series (MTS) clustering is crucial for signal processing and data analysis. Although deep learning approaches, particularly those leveraging Contrastive Learning (CL), are prominent for MTS representation, existing CL-based models face two key limitations: 1) neglecting clustering information during positive/negative sample pair construction, and 2) introducing unreasonable inductive biases, e.g., destroying time dependence and periodicity through augmentation strategies, compromising representation quality. This paper, therefore, proposes a Temporal-Frequency Enhanced Contrastive (TFEC) learning framework. To preserve temporal structure while generating low-distortion representations, a temporal-frequency Co-EnHancement (CoEH) mechanism is introduced. Accordingly, a synergistic dual-path representation and cluster distribution learning framework is designed to jointly optimize cluster structure and representation fidelity. Experiments on six real-world benchmark datasets demonstrate TFEC's superiority, achieving 4.48% average NMI gains over SOTA methods, with ablation studies validating the design. The code of the paper is available at: https://github.com/yueliangy/TFEC.
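The NMI metric cited above can be illustrated with a minimal sketch. It assumes the arithmetic-mean normalization of the two label entropies; the abstract does not state which NMI variant the paper uses, and the function names here are illustrative only.

```python
from collections import Counter
from math import log


def entropy(labels):
    """Shannon entropy (in nats) of a label assignment."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def mutual_information(a, b):
    """Mutual information (in nats) between two label assignments."""
    n = len(a)
    count_a, count_b = Counter(a), Counter(b)
    joint = Counter(zip(a, b))
    return sum(
        (n_ij / n) * log(n_ij * n / (count_a[i] * count_b[j]))
        for (i, j), n_ij in joint.items()
    )


def nmi(true_labels, pred_labels):
    """NMI with arithmetic-mean normalization (an assumption, see lead-in)."""
    if len(true_labels) != len(pred_labels):
        raise ValueError("label sequences must have the same length")
    h_true, h_pred = entropy(true_labels), entropy(pred_labels)
    if h_true == 0.0 and h_pred == 0.0:
        return 1.0  # both assignments are a single cluster
    return mutual_information(true_labels, pred_labels) / (0.5 * (h_true + h_pred))
```

Because NMI depends only on how samples are grouped, relabeling the clusters (for example swapping cluster 0 and cluster 1) leaves the score unchanged.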

preprint, 2024, arXiv

Safety and Performance, Why Not Both? Bi-Objective Optimized Model Compression against Heterogeneous Attacks Toward AI Software Deployment

The size of deep learning models in artificial intelligence (AI) software is increasing rapidly, hindering the large-scale deployment on resource-restricted devices (e.g., smartphones). To mitigate this issue, AI software compression plays a crucial role, which aims to compress model size while keeping high performance. However, the intrinsic defects in a big model may be inherited by the compressed one. Such defects may be easily leveraged by adversaries, since a compressed model is usually deployed in a large number of devices without adequate protection. In this article, we aim to address the safe model compression problem from the perspective of safety-performance co-optimization. Specifically, inspired by the test-driven development (TDD) paradigm in software engineering, we propose a test-driven sparse training framework called SafeCompress. By simulating the attack mechanism as safety testing, SafeCompress can automatically compress a big model to a small one following the dynamic sparse training paradigm. Then, considering two kinds of representative and heterogeneous attack mechanisms, i.e., black-box membership inference attack and white-box membership inference attack, we develop two concrete instances called BMIA-SafeCompress and WMIA-SafeCompress. Further, we implement another instance called MMIA-SafeCompress by extending SafeCompress to defend against the occasion when adversaries conduct black-box and white-box membership inference attacks simultaneously. We conduct extensive experiments on five datasets for both computer vision and natural language processing tasks. The results show the effectiveness and generalizability of our framework. We also discuss how to adapt SafeCompress to other attacks besides membership inference attack, demonstrating the flexibility of SafeCompress.

preprint, 2024, arXiv

Using LLM to select the right SQL Query from candidates

Text-to-SQL models can generate a list of candidate SQL queries, and the best query is often in the candidate list, but not at the top of the list. An effective re-rank method can select the right SQL query from the candidate list and improve the model's performance. Previous studies on code generation automatically generate test cases and use them to re-rank candidate codes. However, automatic test case generation for text-to-SQL is an understudied field. We propose an automatic test case generation method that first generates a database and then uses LLMs to predict the ground truth, which is the expected execution result of the ground truth SQL query on this database. To make prediction easier for LLMs, we conduct experiments to search for ways to generate easy databases for LLMs and design easy-to-understand prompts. Based on our test case generation method, we propose a re-rank method to select the right SQL query from the candidate list. Given a candidate list, our method can generate test cases and re-rank the candidate list according to their pass numbers on these test cases and their generation probabilities. The experimental results on the validation dataset of Spider show that the performance of some state-of-the-art models can get a 3.6% improvement after applying our re-rank method.
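The re-ranking step described above can be sketched as follows. The abstract does not say how pass counts and generation probabilities are combined, so this sketch assumes a lexicographic order (pass count first, log-probability as tie-breaker). `Candidate`, `SqlTestCase`, and the `execute` callback are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Sequence


@dataclass
class Candidate:
    sql: str
    log_prob: float  # generation log-probability reported by the model


@dataclass
class SqlTestCase:
    database: Any  # handle to a generated database (hypothetical)
    expected: Any  # execution result predicted by the LLM


def rerank(
    candidates: Sequence[Candidate],
    test_cases: Sequence[SqlTestCase],
    execute: Callable[[str, Any], Any],
) -> List[Candidate]:
    """Order candidates by passed test cases, then by log-probability."""

    def pass_count(candidate: Candidate) -> int:
        passed = 0
        for case in test_cases:
            try:
                if execute(candidate.sql, case.database) == case.expected:
                    passed += 1
            except Exception:
                # A query that fails to execute simply does not pass.
                pass
        return passed

    return sorted(
        candidates,
        key=lambda c: (pass_count(c), c.log_prob),
        reverse=True,
    )
```

With no test cases, every candidate has a pass count of zero, so the order falls back to the model's own generation probabilities.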

preprint, 2023, arXiv

DeepMatcher: A Deep Transformer-based Network for Robust and Accurate Local Feature Matching

Local feature matching between images remains a challenging task, especially in the presence of significant appearance variations, e.g., extreme viewpoint changes. In this work, we propose DeepMatcher, a deep Transformer-based network built upon our investigation of local feature matching in detector-free methods. The key insight is that local feature matcher with deep layers can capture more human-intuitive and simpler-to-match features. Based on this, we propose a Slimming Transformer (SlimFormer) dedicated for DeepMatcher, which leverages vector-based attention to model relevance among all keypoints and achieves long-range context aggregation in an efficient and effective manner. A relative position encoding is applied to each SlimFormer so as to explicitly disclose relative distance information, further improving the representation of keypoints. A layer-scale strategy is also employed in each SlimFormer to enable the network to assimilate message exchange from the residual block adaptively, thus allowing it to simulate the human behaviour that humans can acquire different matching cues each time they scan an image pair. To facilitate a better adaption of the SlimFormer, we introduce a Feature Transition Module (FTM) to ensure a smooth transition in feature scopes with different receptive fields. By interleaving the self- and cross-SlimFormer multiple times, DeepMatcher can easily establish pixel-wise dense matches at coarse level. Finally, we perceive the match refinement as a combination of classification and regression problems and design Fine Matches Module to predict confidence and offset concurrently, thereby generating robust and accurate matches. Experimentally, we show that DeepMatcher significantly outperforms the state-of-the-art methods on several benchmarks, demonstrating the superior matching capability of DeepMatcher.

preprint, 2023, arXiv

Practitioners' Expectations on Code Completion

Code completion has become a common practice for programmers during their daily programming activities. It aims at automatically predicting the next tokens or lines that the programmers tend to use. A good code completion tool can substantially save keystrokes and improve the programming efficiency for programmers. Recently, various techniques for code completion have been proposed for usage in practice. However, it is still unclear what are practitioners' expectations on code completion and whether existing research has met their demands. To fill the gap, we perform an empirical study by first interviewing 15 practitioners and then surveying 599 practitioners from 18 IT companies about their expectations on code completion. We then compare the practitioners' demands with current research via conducting a literature review of papers on code completion published in premier publication venues from 2012 to 2022. Based on the comparison, we highlight the directions desirable for researchers to invest efforts towards developing code completion techniques for meeting practitioners' expectations.

preprint, 2022, arXiv

A Survey of Adversarial Learning on Graphs

Deep learning models on graphs have achieved remarkable performance in various graph analysis tasks, e.g., node classification, link prediction, and graph clustering. However, they exhibit uncertainty and unreliability against well-designed inputs, i.e., adversarial examples. Accordingly, a line of studies has emerged for both attack and defense in different graph analysis tasks, leading to an arms race in graph adversarial learning. Despite the booming body of work, a unified problem definition and a comprehensive review are still lacking. To bridge this gap, we systematically investigate and summarize the existing works on graph adversarial learning tasks. Specifically, we survey and unify the existing works w.r.t. attack and defense in graph analysis tasks, and give appropriate definitions and taxonomies at the same time. Besides, we emphasize the importance of related evaluation metrics, and investigate and summarize them comprehensively. Hopefully, our work can provide a comprehensive overview and offer insights for the relevant researchers. Latest advances in graph adversarial learning are summarized in our GitHub repository https://github.com/EdisonLeeeee/Graph-Adversarial-Learning.

preprint, 2022, arXiv

GDsmith: Detecting Bugs in Graph Database Engines

Graph database engines stand out in the era of big data for their efficiency in modeling and processing linked data. There is a strong need for testing graph database engines. However, random testing, the most practical way of automated test generation, faces the challenges of semantic validity, non-empty results, and behavior diversity when used to detect bugs in graph database engines. To address these challenges, in this paper, we propose GDsmith, the first black-box approach for testing graph database engines. It ensures that each randomly generated Cypher query satisfies the semantic requirements via skeleton generation and completion. GDsmith includes our technique to increase the probability of producing Cypher queries that return non-empty results by leveraging three types of structural mutation strategies. GDsmith also includes our technique to improve the behavior diversity of the generated Cypher queries by selecting property keys according to their previous frequencies when generating new queries. Our evaluation results demonstrate that GDsmith is effective and efficient for automated query generation and substantially outperforms the baseline. GDsmith successfully detects 27 previously unknown bugs on the released versions of three popular open-source graph database engines and has received positive feedback from their developers.

preprint, 2022, arXiv

Nematic fluctuations in the non-superconducting iron pnictide BaFe$_{1.9-x}$Ni$_{0.1}$Cr$_{x}$As$_{2}$

The main driving force of the electronic nematic phase in iron-based superconductors is still under debate. Here, we report a comprehensive study on the nematic fluctuations in a non-superconducting iron pnictide system BaFe$_{1.9-x}$Ni$_{0.1}$Cr$_{x}$As$_{2}$ by electronic transport, angle-resolved photoemission spectroscopy (ARPES) and inelastic neutron scattering (INS) measurements. Previous neutron diffraction and transport measurements suggested that the collinear antiferromagnetism persists to $x=0.8$, with similar Néel temperature $T_N$ and structural transition temperature $T_s$ around 32 K, but the charge carriers change from electron type to hole type around $x=0.5$. In this study, we have found that the in-plane resistivity anisotropy also depends strongly on the Cr doping and the type of charge carriers. While ARPES measurements suggest a possibly weak orbital anisotropy onset near $T_s$ for both $x=0.05$ and $x=0.5$ compounds, INS experiments reveal clearly different onset temperatures of low-energy spin excitation anisotropy, which is likely related to the energy scale of spin nematicity. These results suggest that the interplay between the local spins on Fe atoms and the itinerant electrons on Fermi surfaces is crucial to the nematic fluctuations of iron pnictides, where the orbital degree of freedom may behave differently from the spin degree of freedom, and the transport properties are intimately related to the spin dynamics.

preprint, 2022, arXiv

On the Certified Robustness for Ensemble Models and Beyond

Recent studies show that deep neural networks (DNNs) are vulnerable to adversarial examples, which aim to mislead DNNs by adding perturbations with small magnitude. To defend against such attacks, both empirical and theoretical defense approaches have been extensively studied for a single ML model. In this work, we aim to analyze and provide the certified robustness for ensemble ML models, together with the sufficient and necessary conditions of robustness for different ensemble protocols. Although ensemble models are empirically shown to be more robust than a single model, we find, surprisingly, that in terms of certified robustness the standard ensemble models achieve only marginal improvement compared to a single model. Thus, to explore the conditions that guarantee certifiably robust ensemble ML models, we first prove that diversified gradient and large confidence margin are sufficient and necessary conditions for certifiably robust ensemble models under the model-smoothness assumption. We then provide the bounded model-smoothness analysis based on the proposed Ensemble-before-Smoothing strategy. We also prove that an ensemble model can always achieve higher certified robustness than a single base model under mild conditions. Inspired by the theoretical findings, we propose the lightweight Diversity Regularized Training (DRT) to train certifiably robust ensemble ML models. Extensive experiments show that our DRT enhanced ensembles can consistently achieve higher certified robustness than existing single and ensemble ML models, demonstrating the state-of-the-art certified L2-robustness on MNIST, CIFAR-10, and ImageNet datasets.
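As background for the certified L2-robustness mentioned above, the widely used randomized-smoothing certificate under Gaussian noise can be sketched in a few lines. This is the standard single-model bound, shown as an assumption about the setting; the abstract does not give the paper's ensemble-specific conditions, and the function name and parameters are illustrative.

```python
from statistics import NormalDist


def certified_l2_radius(p_top: float, p_runner_up: float, sigma: float) -> float:
    """Standard randomized-smoothing L2 radius under Gaussian noise.

    p_top:        lower bound on the smoothed probability of the predicted class
    p_runner_up:  upper bound on the smoothed probability of any other class
    sigma:        standard deviation of the Gaussian smoothing noise
    """
    if not (0.0 < p_runner_up <= p_top < 1.0):
        raise ValueError("need 0 < p_runner_up <= p_top < 1")
    if sigma <= 0.0:
        raise ValueError("sigma must be positive")
    inv_cdf = NormalDist().inv_cdf  # inverse standard normal CDF
    return 0.5 * sigma * (inv_cdf(p_top) - inv_cdf(p_runner_up))
```

The radius grows with the gap between the two class probabilities, which is one way to read the abstract's emphasis on a large confidence margin.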

preprint, 2022, arXiv

Preferred Spin Excitations in the Bilayer Iron-Based Superconductor CaK(Fe$_{0.96}$Ni$_{0.04}$)$_4$As$_4$ with Spin-Vortex Crystal Order

The spin-orbit coupling (SOC) is key to understanding the magnetically driven superconductivity in iron-based superconductors, where both local and itinerant electrons are present and the orbital angular momentum is not completely quenched. Here, we report a neutron scattering study on the bilayer compound CaK(Fe$_{0.96}$Ni$_{0.04}$)$_4$As$_4$ with superconductivity coexisting with a non-collinear spin-vortex crystal magnetic order that preserves the tetragonal symmetry of the Fe-Fe plane. In the superconducting state, two spin resonance modes with odd and even $L$ symmetries due to the bilayer coupling are found, similar to the undoped compound CaKFe$_4$As$_4$ but at lower energies. Polarization analysis reveals that the odd mode is $c$-axis polarized, and the low-energy spin anisotropy can persist to the paramagnetic phase at high temperature, which closely resembles other systems with in-plane collinear and $c$-axis biaxial magnetic orders. These results provide the missing piece of the puzzle on the SOC effect in iron-pnictide superconductors, and also establish a common picture of $c$-axis preferred magnetic excitations below $T_c$ regardless of the details of magnetic pattern or lattice symmetry.

preprint, 2022, arXiv

Spin excitations in the kagome-lattice metallic antiferromagnet Fe$_{0.89}$Co$_{0.11}$Sn

Kagome-lattice materials have attracted tremendous interest due to the broad prospect for seeking superconductivity, quantum spin liquid states, and topological electronic structures. Among them, the transition-metal kagome lattices are high-profile objects for the combination of topological properties, rich magnetism, and multiple-orbital physics. Here we report an inelastic neutron scattering study on the spin dynamics of a kagome-lattice antiferromagnetic metal Fe$_{0.89}$Co$_{0.11}$Sn. Although the magnetic excitations can be observed up to $\sim$250 meV, well-defined spin waves are only identified below $\sim$90 meV and can be modeled using Heisenberg exchange with ferromagnetic in-plane nearest-neighbor coupling $J_1$, in-plane next-nearest-neighbor coupling $J_2$, and antiferromagnetic (AFM) interlayer coupling $J_c$ under linear spin-wave theory. Above $\sim$90 meV, the spin waves enter the itinerant Stoner continuum and become highly damped particle-hole excitations. At the K point of the Brillouin zone, we reveal a possible band crossing of the spin wave, which indicates a potential Dirac magnon. Our results uncover the evolution of the spin excitations from the planar AFM state to the axial AFM state in Fe$_{0.89}$Co$_{0.11}$Sn, solve the magnetic Hamiltonian for both states, and confirm the significant influence of the itinerant magnetism on the spin excitations.
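The Heisenberg model named above can be written out as a sketch. The sign convention (negative $J$ for ferromagnetic coupling) and the bond-sum notation are assumptions chosen for illustration; the abstract gives neither the convention nor the sign of $J_2$.

```latex
% Heisenberg exchange on stacked kagome layers; each bond is counted once.
% Assumed convention: J < 0 ferromagnetic, J > 0 antiferromagnetic.
H = J_1 \sum_{\langle i,j \rangle_{ab}} \mathbf{S}_i \cdot \mathbf{S}_j
  + J_2 \sum_{\langle\langle i,j \rangle\rangle_{ab}} \mathbf{S}_i \cdot \mathbf{S}_j
  + J_c \sum_{\langle i,j \rangle_{c}} \mathbf{S}_i \cdot \mathbf{S}_j ,
\qquad J_1 < 0 , \quad J_c > 0 .
```

Here the first two sums run over in-plane nearest and next-nearest neighbors and the third over nearest neighbors in adjacent layers.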

preprint, 2022, arXiv

Tracking the nematicity in cuprate superconductors: a resistivity study under uniaxial pressure

Overshadowing the superconducting dome in hole-doped cuprates, the pseudogap state remains a mystery on which no consensus has been reached. It has been suggested that the rotational symmetry is broken in this state and may result in a nematic phase transition, whose temperature seems to coincide with the onset temperature of the pseudogap state $T^*$ around the optimal doping level, raising the question of whether the pseudogap results from the establishment of the nematic order. Here we report results of resistivity measurements under uniaxial pressure on several hole-doped cuprates, where the normalized slope of the elastoresistivity $ζ$ can be obtained as illustrated in iron-based superconductors. The temperature dependence of $ζ$ along a particular lattice axis exhibits a kink feature at $T_{k}$ and shows Curie-Weiss-like behavior above it, which may suggest a spontaneous nematic transition. While $T_{k}$ seems to be the same as $T^*$ around the optimal doping and in the overdoped region, they become very different in underdoped La$_{2-x}$Sr$_{x}$CuO$_4$. Our results suggest that the nematic order, if it indeed exists, is an electronic phase within the pseudogap state.

preprint, 2020, arXiv

Experimental observation of magnetic dimers in diluted Yb:YAlO$_3$

We present a comprehensive experimental investigation of Yb magnetic dimers in Yb$_{0.04}$Y$_{0.96}$AlO$_3$, an Yb-doped Yttrium Aluminum Perovskite (YAP) YAlO$_3$, by means of specific heat, magnetization and high-resolution inelastic neutron scattering (INS) measurements. In our sample, the Yb ions are randomly distributed over the lattice and $\sim 7\%$ of Yb ions form quantum dimers due to nearest-neighbor antiferromagnetic coupling along the $c$-axis. At zero field, the dimer formation manifests itself in the appearance of an inelastic peak at $Δ\approx 0.2$ meV in the INS spectrum and a Schottky-like anomaly in the specific heat. The structure factor of the INS peak exhibits a cosine modulation along the $L$ direction, in agreement with the $c$-axis nearest-neighbor intra-dimer coupling. A careful fitting of the low-temperature specific heat shows that the excited state is a degenerate triplet, which indicates a surprisingly small anisotropy of the effective Yb-Yb exchange interaction despite the low crystal symmetry and anisotropic magnetic dipole contribution, in agreement with previous reports for the Yb parent compound, YbAlO$_3$ [arXiv:1904.11513, arXiv:1902.04112], and in contrast to Yb$_2$Pt$_2$Pb [arXiv:1606.01309, arXiv:1907.01067]. The obtained results are precisely reproduced by analytical calculations for the Yb dimers.
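The quantities mentioned above follow the textbook picture of an isolated antiferromagnetic dimer, sketched here for orientation. It assumes effective spin-1/2 moments with isotropic exchange; the symbols $J$, $N_d$ (number of dimers) and $\mathbf{d}$ (intra-dimer vector) are introduced for illustration and are not taken from the paper.

```latex
% Isolated spin-1/2 dimer with isotropic antiferromagnetic exchange J > 0
H = J \, \mathbf{S}_1 \cdot \mathbf{S}_2 , \qquad
E_{\mathrm{singlet}} = -\tfrac{3}{4} J , \quad
E_{\mathrm{triplet}} = +\tfrac{1}{4} J , \quad
\Delta = E_{\mathrm{triplet}} - E_{\mathrm{singlet}} = J .

% Schottky specific heat for a singlet ground state and a threefold-degenerate excited state
C(T) = N_d \, k_B \left( \frac{\Delta}{k_B T} \right)^{2}
       \frac{3 \, e^{-\Delta / k_B T}}{\left( 1 + 3 \, e^{-\Delta / k_B T} \right)^{2}} .

% Singlet-triplet structure factor for a dimer with intra-dimer vector d along c
S(\mathbf{Q}) \propto 1 - \cos( \mathbf{Q} \cdot \mathbf{d} )
             = 1 - \cos\!\left( 2 \pi L \, d / c \right) .
```

The factor of 3 in the specific heat is the triplet degeneracy, which is why a fit to the Schottky anomaly can distinguish a degenerate triplet from a split one.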

preprint, 2020, arXiv

Neutron spin resonance in a quasi-two-dimensional iron-based superconductor

Magnetically mediated Cooper pairing is generally regarded as a key to establishing a unified mechanism of unconventional superconductivity. One crucial piece of evidence is the neutron spin resonance arising in the superconducting state, which is commonly interpreted as a spin-exciton from collective particle-hole excitations confined below the superconducting pair-breaking gap ($2Δ$). Here, on the basis of inelastic neutron scattering measurements on a quasi-two-dimensional iron-based superconductor KCa$_2$Fe$_4$As$_4$F$_2$, we have discovered a two-dimensional spin resonant mode with downward dispersions, a behavior closely resembling the low branch of the hour-glass-type spin resonance in cuprates. The resonant intensity is dominated by two broad incommensurate peaks near $Q=$(0.5, 0.5) with a sharp energy peak at $E_R=16$ meV. The overall energy dispersion of the mode exceeds the measured maximum total gap $Δ_{\rm tot}=|Δ_k|+|Δ_{k+Q}|$. These experimental results deeply challenge the conventional understanding of the resonance modes as magnetic excitons regardless of the underlying pairing symmetry schemes, and they also indicate that when the iron-based superconductivity becomes very quasi-two-dimensional, the electronic behaviors are similar to those in cuprates.

preprint, 2020, arXiv

Spin Excitations and Spin Wave Gap in the Ferromagnetic Weyl Semimetal Co$_3$Sn$_2$S$_2$

We report a comprehensive neutron scattering study on the spin excitations in the magnetic Weyl semimetal Co$_3$Sn$_2$S$_2$ with quasi-two-dimensional structure. Both in-plane and out-of-plane dispersions of the spin waves are revealed in the ferromagnetic state, and similarly dispersive but damped spin excitations persist into the paramagnetic state. The effective exchange interactions have been estimated by a semi-classical Heisenberg model to consistently reproduce the experimental $T_C$ and spin stiffness. However, a full spin wave gap below $E_g=2.3$ meV is observed at $T=4$ K, much larger than the estimated magnetic anisotropy energy ($\sim0.6$ meV), while its temperature dependence indicates a significant contribution from the Weyl fermions. These results suggest that Co$_3$Sn$_2$S$_2$ is a three-dimensional correlated system with large spin stiffness, and the low-energy spin dynamics could interplay with the topological electron states.

preprint, 2020, arXiv

Spin-Excitations Anisotropy in the Bilayer Iron-Based Superconductor CaKFe$_4$As$_4$

We use polarized inelastic neutron scattering to study the spin-excitations anisotropy in the bilayer iron-based superconductor CaKFe$_4$As$_4$ ($T_c$ = 35 K). In the superconducting state, both odd and even $L$-modulations of spin resonance have been observed in our previous unpolarized neutron scattering experiments (T. Xie et al., Phys. Rev. Lett. 120, 267003 (2018)). Here we find that the high-energy even mode ($\sim 18$ meV) is isotropic in spin space, but the low-energy odd modes consist of a $c$-axis polarized mode around 9 meV along with another partially overlapped in-plane mode around 12 meV. We argue that such spin anisotropy is induced by the spin-orbit coupling in the spin-vortex-type fluctuations of this unique compound. The spin anisotropy is strongly affected by the superconductivity, where it is weak below 6 meV in the normal state and then transferred to higher energy and further enhanced in the odd mode of spin resonance below $T_c$.