Researcher profile

Xian Yang

Xian Yang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

Rethinking Generalization in Graph Neural Networks: A Structural Complexity Perspective

Graph neural networks (GNNs) have emerged as a fundamental tool for learning from graph-structured data, achieving strong performance across a wide range of applications. However, understanding their generalization capabilities remains challenging due to the complex structural dependencies inherent in such data. Existing generalization analyses largely follow the classical machine learning paradigm, focusing primarily on model complexity while overlooking the fundamental role of graph structure. Therefore, in this work, we systematically investigate this role by asking: does the graph structure actually influence generalization, and if so, by how much? To answer the first question and validate our intuition, we theoretically prove that incorporating more edges into the prediction process transforms the input representations to be overly accommodating to the output model, thereby inducing overfitting. To address the second question, we formulate a structural complexity measure based on the number of effective edges and derive a Rademacher complexity-based generalization bound. In doing so, we demonstrate that GNN generalization depends explicitly on structural complexity, alongside traditional parameter-dependent factors. Motivated by these theoretical findings, we propose a structural entropy regularization method. This approach controls structural complexity by regulating effective edges to balance underfitting and overfitting, ultimately improving the generalization performance of GNNs.

preprint2025arXiv

One-shot synthesis of rare gastrointestinal lesions improves diagnostic accuracy and clinical training

Rare gastrointestinal lesions are infrequently encountered in routine endoscopy, restricting the data available for developing reliable artificial intelligence (AI) models and training novice clinicians. Here we present EndoRare, a one-shot, retraining-free generative framework that synthesizes diverse, high-fidelity lesion exemplars from a single reference image. By leveraging language-guided concept disentanglement, EndoRare separates pathognomonic lesion features from non-diagnostic attributes, encoding the former into a learnable prototype embedding while varying the latter to ensure diversity. We validated the framework across four rare pathologies (calcifying fibrous tumor, juvenile polyposis syndrome, familial adenomatous polyposis, and Peutz-Jeghers syndrome). Synthetic images were judged clinically plausible by experts and, when used for data augmentation, significantly enhanced downstream AI classifiers, improving the true positive rate at low false-positive rates. Crucially, a blinded reader study demonstrated that novice endoscopists exposed to EndoRare-generated cases achieved a 0.400 increase in recall and a 0.267 increase in precision. These results establish a practical, data-efficient pathway to bridge the rare-disease gap in both computer-aided diagnostics and clinical education.

preprint2023arXiv

Intelligent Parsing: An Automated Parsing Framework for Extracting Design Semantics from E-commerce Creatives

In the industrial e-commerce landscape, creative designs such as banners and posters are ubiquitous. Extracting structured semantic information from creative e-commerce design materials (manuscripts crafted by designers) to obtain design semantics represents a core challenge in the realm of intelligent design. In this paper, we propose a comprehensive automated framework for intelligently parsing creative materials. This framework comprises material recognition, preprocess, smartname, and label layers. The material recognition layer consolidates various detection and recognition interfaces, covering business aspects including detection of auxiliary areas within creative materials and layer-level detection, alongside label identification. Algorithmically, it encompasses a variety of coarse-to-fine methods such as Cascade RCNN, GFL, and other models. The preprocess layer involves filtering creative layers and grading creative materials. The smartname layer achieves intelligent naming for creative materials, while the label layer covers multi-level tagging for creative materials, enabling tagging at different hierarchical levels. Intelligent parsing constitutes a complete parsing framework that significantly aids downstream processes such as intelligent creation, creative optimization, and material library construction. Within the practical business applications at Suning, it markedly enhances the exposure, circulation, and click-through rates of creative materials, expediting the closed-loop production of creative materials and yielding substantial benefits.

preprint2022arXiv

Label Dependent Attention Model for Disease Risk Prediction Using Multimodal Electronic Health Records

Disease risk prediction has attracted increasing attention in the field of modern healthcare, especially with the latest advances in artificial intelligence (AI). Electronic health records (EHRs), which contain heterogeneous patient information, are widely used in disease risk prediction tasks. One challenge of applying AI models for risk prediction lies in generating interpretable evidence to support the prediction results while retaining the prediction ability. In order to address this problem, we propose the method of jointly embedding words and labels whereby attention modules learn the weights of words from medical notes according to their relevance to the names of risk prediction labels. This approach boosts interpretability by employing an attention mechanism and including the names of prediction tasks in the model. However, its application is only limited to the handling of textual inputs such as medical notes. In this paper, we propose a label dependent attention model LDAM to 1) improve the interpretability by exploiting Clinical-BERT (a biomedical language model pre-trained on a large clinical corpus) to encode biomedically meaningful features and labels jointly; 2) extend the idea of joint embedding to the processing of time-series data, and develop a multi-modal learning framework for integrating heterogeneous information from medical notes and time-series health status indicators. To demonstrate our method, we apply LDAM to the MIMIC-III dataset to predict different disease risks. We evaluate our method both quantitatively and qualitatively. Specifically, the predictive power of LDAM will be shown, and case studies will be carried out to illustrate its interpretability.

preprint2022arXiv

Signatures of a topological Weyl loop in Co$_3$Sn$_2$S$_2$

The search for novel topological phases of matter in quantum magnets has emerged as a frontier of condensed matter physics. Here we use state-of-the-art angle-resolved photoemission spectroscopy (ARPES) to investigate single crystals of Co$_3$Sn$_2$S$_2$ in its ferromagnetic phase. We report for the first time signatures of a topological Weyl loop. From fundamental symmetry considerations, this magnetic Weyl loop is expected to be gapless if spin-orbit coupling (SOC) is strictly zero but gapped, with possible Weyl points, under finite SOC. We point out that high-resolution ARPES results to date cannot unambiguously resolve the SOC gap anywhere along the Weyl loop, leaving open the possibility that Co$_3$Sn$_2$S$_2$ hosts zero Weyl points or some non-zero number of Weyl points. On the surface of our samples, we further observe a possible Fermi arc, but we are unable to clearly verify its topological nature using the established counting criteria. As a result, we argue that from the point of view of photoemission spectroscopy the presence of Weyl points and Fermi arcs in Co$_3$Sn$_2$S$_2$ remains ambiguous. Our results have implications for ongoing investigations of Co$_3$Sn$_2$S$_2$ and other topological magnets.

preprint2021arXiv

Mitigating Backdoor Attacks in Federated Learning

Malicious clients can attack federated learning systems using malicious data, including backdoor samples, during the training phase. The compromised global model will perform well on the validation dataset designed for the task, but a small subset of data with backdoor patterns may trigger the model to make a wrong prediction. There has been an arms race between attackers who tried to conceal attacks and defenders who tried to detect attacks during the aggregation stage of training on the server-side. In this work, we propose a new and effective method to mitigate backdoor attacks after the training phase. Specifically, we design a federated pruning method to remove redundant neurons in the network and then adjust the model's extreme weight values. Our experiments conducted on distributed Fashion-MNIST show that our method can reduce the average attack success rate from 99.7% to 1.9% with a 5.5% loss of test accuracy on the validation dataset. To minimize the pruning influence on test accuracy, we can fine-tune after pruning, and the attack success rate drops to 6.4%, with only a 1.7% loss of test accuracy. Further experiments under Distributed Backdoor Attacks on CIFAR-10 also show promising results that the average attack success rate drops more than 70% with less than 2% loss of test accuracy on the validation dataset.

preprint2020arXiv

A Bayesian Updating Scheme for Pandemics: Estimating the Infection Dynamics of COVID-19

Epidemic models play a key role in understanding and responding to the emerging COVID-19 pandemic. Widely used compartmental models are static and are of limited use to evaluate intervention strategies with the emerging pandemic. Applying the technology of data assimilation, we propose a Bayesian updating approach for estimating epidemiological parameters using observable information for the purpose of assessing the impacts of different intervention strategies. We adopt a concise renewal model and propose new parameters by disentangling the reduction of instantaneous reproduction number Rt into mitigation and suppression factors for quantifying intervention impacts at a finer granularity. Then we developed a data assimilation framework for estimating these parameters including constructing an observation function and developing a Bayesian updating scheme. A statistical analysis framework is then built to quantify the impact of intervention strategies by monitoring the evolution of these estimated parameters. By Investigating the impacts of intervention measures of European countries, the United States and Wuhan with the framework, we reveal the effects of interventions in these countries and the resurgence risk in the USA.

preprint2020arXiv

Discovery of a quantum limit Chern magnet TbMn6Sn6

The quantum level interplay between geometry, topology, and correlation is at the forefront of fundamental physics. Owing to the unusual lattice geometry and breaking of time-reversal symmetry, kagome magnets are predicted to support intrinsic Chern quantum phases. However, quantum materials hosting ideal spin-orbit coupled kagome lattices with strong out-of-plane magnetization have been lacking. Here we use scanning tunneling microscopy to discover a new topological kagome magnet TbMn6Sn6, which is close to satisfying the above criteria. We visualize its effectively defect-free purely Mn-based ferromagnetic kagome lattice with atomic resolution. Remarkably, its electronic state exhibits distinct Landau quantization upon the application of a magnetic field, and the quantized Landau fan structure features spin-polarized Dirac dispersion with a large Chern gap. We further demonstrate the bulk-boundary correspondence between the Chern gap and topological edge state, as well as the Berry curvature field correspondence of Chern gapped Dirac fermions. Our results point to the realization of a quantum-limit Chern phase in TbMn6Sn6, opening up an avenue for discovering topological quantum phenomena in the RMn6Sn6 (R = rare earth element) family with a variety of magnetic structures. Our visualization of the magnetic bulk-boundary-Berry correspondence covering real and momentum space demonstrates a proof-of-principle method revealing topological magnets.

preprint2020arXiv

Spin-orbit quantum impurity in a topological kagome magnet

Quantum states induced by single-atomic-impurities are the current frontier of material and information science. Recently the spin-orbit coupled correlated kagome magnets are emerging as a new class of topological quantum materials, although the effect of single-atomic impurities remains unexplored. Here we use state-of-the-art scanning tunneling microscopy/spectroscopy (STM/S) to study the atomic indium impurity in a topological kagome magnet Co3Sn2S2, which is designed to support the spin-orbit quantum state. We find each impurity features a strongly localized bound state. Our systematic magnetization-polarized tunneling probe reveals its spin-down polarized nature with an unusual moment of -5uB, indicative of additional orbital magnetization. As the separation between two impurities progressively shrinks, their respective bound states interact and form quantized molecular orbital states. The molecular orbital of three neighboring impurities further exhibits an intriguing splitting owing to the combination of geometry, magnetism, and spin-orbit coupling, analogous to the splitting of the topological Weyl fermion line12,19. Our work demonstrates the quantum-level interplay between magnetism and spin-orbit coupling at an individual atomic impurity, which provides insights into the emergent impurity behavior in a topological kagome magnet and the potential of spin-orbit quantum impurities for information science.

preprint2019arXiv

Integrated Multi-omics Analysis Using Variational Autoencoders: Application to Pan-cancer Classification

Different aspects of a clinical sample can be revealed by multiple types of omics data. Integrated analysis of multi-omics data provides a comprehensive view of patients, which has the potential to facilitate more accurate clinical decision making. However, omics data are normally high dimensional with large number of molecular features and relatively small number of available samples with clinical labels. The "dimensionality curse" makes it challenging to train a machine learning model using high dimensional omics data like DNA methylation and gene expression profiles. Here we propose an end-to-end deep learning model called OmiVAE to extract low dimensional features and classify samples from multi-omics data. OmiVAE combines the basic structure of variational autoencoders with a classification network to achieve task-oriented feature extraction and multi-class classification. The training procedure of OmiVAE is comprised of an unsupervised phase without the classifier and a supervised phase with the classifier. During the unsupervised phase, a hierarchical cluster structure of samples can be automatically formed without the need for labels. And in the supervised phase, OmiVAE achieved an average classification accuracy of 97.49% after 10-fold cross-validation among 33 tumour types and normal samples, which shows better performance than other existing methods. The OmiVAE model learned from multi-omics data outperformed that using only one type of omics data, which indicates that the complementary information from different omics datatypes provides useful insights for biomedical tasks like cancer classification.