Source author record

He Zhao

He Zhao appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Computer Vision cond-mat.mtrl-sci astro-ph.GA cond-mat.str-el astro-ph.SR physics.acc-ph Computation and Language cond-mat.supr-con Information Retrieval cond-mat.mes-hall Cryptography and Security astro-ph.HE cond-mat.other eess.IV Methodology Neural and Evolutionary Computing physics.app-ph

Catalog footprint

What is connected

32works

18topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Arrow: A Foundation Model for Causal Discovery

We introduce Arrow, a foundation model for zero-shot causal discovery on observational tabular data. Arrow factorizes a directed acyclic graph into an undirected skeleton and a topological order, guaranteeing acyclicity by construction. Given a new dataset, it uses a transformer-based architecture to contextualize variables within and across observations, then predicts skeleton edge probabilities and node order scores that together define a graph. Arrow is trained in a supervised fashion on synthetic datasets with ground-truth graphs, using an end-to-end differentiable directed edge composite likelihood induced by the skeleton-order factorization. The training distribution spans diverse graph families, functional forms, noise models, and dataset shapes. Across in- and out-of-distribution synthetic, semi-synthetic, and real datasets, Arrow matches or outperforms existing causal discovery methods at substantially lower inference cost than competitive alternatives. Our results demonstrate that large-scale pretraining on diverse synthetic data can yield zero-shot causal discovery models that are fast, accurate, and reusable on new datasets.

preprint2022arXiv

A Unified Wasserstein Distributional Robustness Framework for Adversarial Training

It is well-known that deep neural networks (DNNs) are susceptible to adversarial attacks, exposing a severe fragility of deep learning systems. As the result, adversarial training (AT) method, by incorporating adversarial examples during training, represents a natural and effective approach to strengthen the robustness of a DNN-based classifier. However, most AT-based methods, notably PGD-AT and TRADES, typically seek a pointwise adversary that generates the worst-case adversarial example by independently perturbing each data sample, as a way to "probe" the vulnerability of the classifier. Arguably, there are unexplored benefits in considering such adversarial effects from an entire distribution. To this end, this paper presents a unified framework that connects Wasserstein distributional robustness with current state-of-the-art AT methods. We introduce a new Wasserstein cost function and a new series of risk functions, with which we show that standard AT methods are special cases of their counterparts in our framework. This connection leads to an intuitive relaxation and generalization of existing AT methods and facilitates the development of a new family of distributional robustness AT-based algorithms. Extensive experiments show that our distributional robustness AT algorithms robustify further their standard AT counterparts in various settings.

preprint2022arXiv

Dust Mass Associated with the Supernova Remnant IC 443 when Emission Meets Extinction

The dust mass of the well-known supernova remnant (SNR) IC 443 is estimated from both the infrared emission and the visual extinction. With photometry to the images taken by \emph{Spitzer}, \emph{WISE}, \emph{IRAS}, \emph{AKARI} and \emph{Planck}, the spectral energy distribution (SED) of the dust is obtained after subtracting the synchrotron radiation and considering the spectral line emission. The dust mass is derived from fitting the SED by a two-component model, which results in a warm component of the temperature of $\sim$ 53 K and the mass of 0.1 $M_\odot$, and a cold component of the temperature of $\sim 17$ K and the mass of 46 $M_\odot$. On the other hand, the dust mass is derived to be $\sim$ 66 $M_\odot$ from the visual extinction of IC 443 which is identified from the 3D Bayestar extinction map and its coincidence with the infrared emission morphology. Roughly the dust mass derived from the infrared emission and the extinction agree mutually. However, the dust mass derived from the infrared emission can be adjusted to be more consistent with that from the extinction by using different dust opacity property or considering optically thick radiation. In addition, the distribution of dust temperature and mass is analyzed by fitting the SED pixel by pixel.

preprint2022arXiv

Improving Ensemble Robustness by Collaboratively Promoting and Demoting Adversarial Robustness

Ensemble-based adversarial training is a principled approach to achieve robustness against adversarial attacks. An important technique of this approach is to control the transferability of adversarial examples among ensemble members. We propose in this work a simple yet effective strategy to collaborate among committee models of an ensemble model. This is achieved via the secure and insecure sets defined for each model member on a given sample, hence help us to quantify and regularize the transferability. Consequently, our proposed framework provides the flexibility to reduce the adversarial transferability as well as to promote the diversity of ensemble members, which are two crucial factors for better robustness in our ensemble approach. We conduct extensive and comprehensive experiments to demonstrate that our proposed method outperforms the state-of-the-art ensemble baselines, at the same time can detect a wide range of adversarial examples with a nearly perfect accuracy. Our code is available at: https://github.com/tuananhbui89/Crossing-Collaborative-Ensemble.

preprint2022arXiv

Learning to Attack with Fewer Pixels: A Probabilistic Post-hoc Framework for Refining Arbitrary Dense Adversarial Attacks

Deep neural network image classifiers are reported to be susceptible to adversarial evasion attacks, which use carefully crafted images created to mislead a classifier. Many adversarial attacks belong to the category of dense attacks, which generate adversarial examples by perturbing all the pixels of a natural image. To generate sparse perturbations, sparse attacks have been recently developed, which are usually independent attacks derived by modifying a dense attack's algorithm with sparsity regularisations, resulting in reduced attack efficiency. In this paper, we aim to tackle this task from a different perspective. We select the most effective perturbations from the ones generated from a dense attack, based on the fact we find that a considerable amount of the perturbations on an image generated by dense attacks may contribute little to attacking a classifier. Accordingly, we propose a probabilistic post-hoc framework that refines given dense attacks by significantly reducing the number of perturbed pixels but keeping their attack power, trained with mutual information maximisation. Given an arbitrary dense attack, the proposed model enjoys appealing compatibility for making its adversarial images more realistic and less detectable with fewer perturbations. Moreover, our framework performs adversarial attacks much faster than existing sparse attacks.

preprint2022arXiv

Learning to Re-weight Examples with Optimal Transport for Imbalanced Classification

Imbalanced data pose challenges for deep learning based classification models. One of the most widely-used approaches for tackling imbalanced data is re-weighting, where training samples are associated with different weights in the loss function. Most of existing re-weighting approaches treat the example weights as the learnable parameter and optimize the weights on the meta set, entailing expensive bilevel optimization. In this paper, we propose a novel re-weighting method based on optimal transport (OT) from a distributional point of view. Specifically, we view the training set as an imbalanced distribution over its samples, which is transported by OT to a balanced distribution obtained from the meta set. The weights of the training samples are the probability mass of the imbalanced distribution and learned by minimizing the OT distance between the two distributions. Compared with existing methods, our proposed one disengages the dependence of the weight learning on the concerned classifier at each iteration. Experiments on image, text and point cloud datasets demonstrate that our proposed re-weighting method has excellent performance, achieving state-of-the-art results in many cases and providing a promising tool for addressing the imbalanced classification issue.

preprint2022arXiv

Leveraging Cross Feedback of User and Item Embeddings with Attention for Variational Autoencoder based Collaborative Filtering

Matrix factorization (MF) has been widely applied to collaborative filtering in recommendation systems. Its Bayesian variants can derive posterior distributions of user and item embeddings, and are more robust to sparse ratings. However, the Bayesian methods are restricted by their update rules for the posterior parameters due to the conjugacy of the priors and the likelihood. Variational autoencoders (VAE) can address this issue by capturing complex mappings between the posterior parameters and the data. However, current research on VAEs for collaborative filtering only considers the mappings based on the explicit data information while the implicit embedding information is overlooked. In this paper, we first derive evidence lower bounds (ELBO) for Bayesian MF models from two viewpoints: user-oriented and item-oriented. Based on the ELBOs, we propose a VAE-based Bayesian MF framework. It leverages not only the data but also the embedding information to approximate the user-item joint distribution. As suggested by the ELBOs, the approximation is iterative with cross feedback of user and item embeddings into each other's encoders. More specifically, user embeddings sampled at the previous iteration are fed to the item-side encoders to estimate the posterior parameters for the item embeddings at the current iteration, and vice versa. The estimation also attends to the cross-fed embeddings to further exploit useful information. The decoder then reconstructs the data via the matrix factorization over the currently re-sampled user and item embeddings.

preprint2022arXiv

Manipulation of Dirac band curvature and momentum-dependent g-factor in a kagome magnet YMn6Sn6

The Zeeman effect describes the energy change of an atomic quantum state in magnetic field. The magnitude and the direction of this change depend on the dimensionless Lande g-factor. In quantum solids, the response of the Bloch electron states to the magnetic field also exhibits the Zeeman effect with an effective g-factor that was theoretically predicted to be dependent on the momentum. While typically negligible in many ordinary solids, the momentum-dependent variation of the g-factor is theorized to be substantially enhanced in many topological and magnetic systems. However, the momentum-dependence of the g-factor is notoriously difficult to extract and it is yet to be directly experimentally measured. In this work, we report the experimental discovery of a strongly momentum-dependent g-factor in a kagome magnet YMn6Sn6. Using spectroscopic-imaging scanning tunneling microscopy, we map the evolution of a massive Dirac band in the vicinity of the Fermi level as a function of magnetic field. We find that electronic states at different lattice momenta exhibit markedly different Zeeman energy shifts, giving rise to an anomalous g-factor that peaks around the Dirac point. Our work provides the first momentum-resolved visualization of Dirac band curvature manipulation by magnetic field, which should in principle be highly relevant to other topological kagome magnets.

preprint2022arXiv

MED-TEX: Transferring and Explaining Knowledge with Less Data from Pretrained Medical Imaging Models

Deep learning methods usually require a large amount of training data and lack interpretability. In this paper, we propose a novel knowledge distillation and model interpretation framework for medical image classification that jointly solves the above two issues. Specifically, to address the data-hungry issue, a small student model is learned with less data by distilling knowledge from a cumbersome pretrained teacher model. To interpret the teacher model and assist the learning of the student, an explainer module is introduced to highlight the regions of an input that are important for the predictions of the teacher model. Furthermore, the joint framework is trained by a principled way derived from the information-theoretic perspective. Our framework outperforms on the knowledge distillation and model interpretation tasks compared to state-of-the-art methods on a fundus dataset.

preprint2022arXiv

Neural Topic Model via Optimal Transport

Recently, Neural Topic Models (NTMs) inspired by variational autoencoders have obtained increasingly research interest due to their promising results on text analysis. However, it is usually hard for existing NTMs to achieve good document representation and coherent/diverse topics at the same time. Moreover, they often degrade their performance severely on short documents. The requirement of reparameterisation could also comprise their training quality and model flexibility. To address these shortcomings, we present a new neural topic model via the theory of optimal transport (OT). Specifically, we propose to learn the topic distribution of a document by directly minimising its OT distance to the document's word distributions. Importantly, the cost matrix of the OT distance models the weights between topics and words, which is constructed by the distances between topics and words in an embedding space. Our proposed model can be trained efficiently with a differentiable loss. Extensive experiments show that our framework significantly outperforms the state-of-the-art NTMs on discovering more coherent and diverse topics and deriving better document representations for both regular and short texts.

preprint2022arXiv

P3IV: Probabilistic Procedure Planning from Instructional Videos with Weak Supervision

In this paper, we study the problem of procedure planning in instructional videos. Here, an agent must produce a plausible sequence of actions that can transform the environment from a given start to a desired goal state. When learning procedure planning from instructional videos, most recent work leverages intermediate visual observations as supervision, which requires expensive annotation efforts to localize precisely all the instructional steps in training videos. In contrast, we remove the need for expensive temporal video annotations and propose a weakly supervised approach by learning from natural language instructions. Our model is based on a transformer equipped with a memory module, which maps the start and goal observations to a sequence of plausible actions. Furthermore, we augment our model with a probabilistic generative module to capture the uncertainty inherent to procedure planning, an aspect largely overlooked by previous work. We evaluate our model on three datasets and show our weaklysupervised approach outperforms previous fully supervised state-of-the-art models on multiple metrics.

preprint2022arXiv

Representing Mixtures of Word Embeddings with Mixtures of Topic Embeddings

A topic model is often formulated as a generative model that explains how each word of a document is generated given a set of topics and document-specific topic proportions. It is focused on capturing the word co-occurrences in a document and hence often suffers from poor performance in analyzing short documents. In addition, its parameter estimation often relies on approximate posterior inference that is either not scalable or suffers from large approximation error. This paper introduces a new topic-modeling framework where each document is viewed as a set of word embedding vectors and each topic is modeled as an embedding vector in the same embedding space. Embedding the words and topics in the same vector space, we define a method to measure the semantic difference between the embedding vectors of the words of a document and these of the topics, and optimize the topic embeddings to minimize the expected difference over all documents. Experiments on text analysis demonstrate that the proposed method, which is amenable to mini-batch stochastic gradient descent based optimization and hence scalable to big corpora, provides competitive performance in discovering more coherent and diverse topics and extracting better document representations.

preprint2022arXiv

Spin-polarized imaging of the antiferromagnetic structure and field-tunable bound states in kagome magnet FeSn

Kagome metals are as an exciting playground for the explorations of novel phenomena at the intersection of topology, electron correlations and magnetism. The family of FeSn-based kagome magnets in particular attracted a lot of attention for simplicity of the layered crystal structure and tunable topological electronic band structure. Despite a significant progress in understanding their bulk properties, surface electronic and magnetic structures are yet to be fully explored in many of these systems. In this work, we focus on a prototypical kagome metal FeSn. Using a combination of spin-averaged and spin-polarized scanning tunneling microscopy, we provide the first atomic-scale visualization of the layered antiferromagnetic structure at the surface of FeSn. In contrast to the field-tunable electronic structure of cousin material Fe3Sn2 that is a ferromagnet, we find that electronic density-of-states of FeSn is robust to the application of external magnetic field. Interestingly, despite the field-insensitive electronic band structure, FeSn exhibits bounds states tied to specific impurities with large effective moments that strongly couple to the magnetic field. Our experiments provide microscopic insights necessary for theoretical modeling of FeSn and serve as a spring board for spin-polarized measurements of topological magnets in general.

preprint2022arXiv

Sports Video Analysis on Large-Scale Data

This paper investigates the modeling of automated machine description on sports video, which has seen much progress recently. Nevertheless, state-of-the-art approaches fall quite short of capturing how human experts analyze sports scenes. There are several major reasons: (1) The used dataset is collected from non-official providers, which naturally creates a gap between models trained on those datasets and real-world applications; (2) previously proposed methods require extensive annotation efforts (i.e., player and ball segmentation at pixel level) on localizing useful visual features to yield acceptable results; (3) very few public datasets are available. In this paper, we propose a novel large-scale NBA dataset for Sports Video Analysis (NSVA) with a focus on captioning, to address the above challenges. We also design a unified approach to process raw videos into a stack of meaningful features with minimum labelling efforts, showing that cross modeling on such features using a transformer architecture leads to strong performance. In addition, we demonstrate the broad application of NSVA by addressing two additional tasks, namely fine-grained sports action recognition and salient player identification. Code and dataset are available at https://github.com/jackwu502/NSVA.

preprint2021arXiv

Explain2Attack: Text Adversarial Attacks via Cross-Domain Interpretability

Training robust deep learning models for down-stream tasks is a critical challenge. Research has shown that down-stream models can be easily fooled with adversarial inputs that look like the training data, but slightly perturbed, in a way imperceptible to humans. Understanding the behavior of natural language models under these attacks is crucial to better defend these models against such attacks. In the black-box attack setting, where no access to model parameters is available, the attacker can only query the output information from the targeted model to craft a successful attack. Current black-box state-of-the-art models are costly in both computational complexity and number of queries needed to craft successful adversarial examples. For real world scenarios, the number of queries is critical, where less queries are desired to avoid suspicion towards an attacking agent. In this paper, we propose Explain2Attack, a black-box adversarial attack on text classification task. Instead of searching for important words to be perturbed by querying the target model, Explain2Attack employs an interpretable substitute model from a similar domain to learn word importance scores. We show that our framework either achieves or out-performs attack rates of the state-of-the-art models, yet with lower queries cost and higher efficiency.

preprint2021arXiv

Nanoscale decoupling of electronic nematicity and structural anisotropy in FeSe thin films

In a material prone to a nematic instability, anisotropic strain in principle provides a preferred symmetry-breaking direction for the electronic nematic state to follow. This is consistent with experimental observations, where electronic nematicity and structural anisotropy typically appear hand-in-hand. In this work, we discover that electronic nematicity can be locally decoupled from the underlying structural anisotropy in strain-engineered iron-selenide (FeSe) thin films. We use heteroepitaxial molecular beam epitaxy to grow FeSe with a nanoscale network of modulations that give rise to spatially varying strain. We map local anisotropic strain by analyzing scanning tunneling microscopy topographs, and visualize electronic nematic domains from concomitant spectroscopic maps. While the domains form so that the energy of nemato-elastic coupling is minimized, we observe distinct regions where electronic nematic ordering fails to flip direction, even though the underlying structural anisotropy is locally reversed. The findings point towards a nanometer-scale stiffness of the nematic order parameter.

preprint2021arXiv

Rotation symmetry breaking in the normal state of a kagome superconductor KV3Sb5

Recently discovered kagome superconductors AV3Sb5 (A=K, Rb, Cs) provide a fresh opportunity to realize and study correlation-driven electronic phenomena on a kagome lattice. The observation of a 2a0 by 2a0 charge density wave (CDW) in the normal state of all members of AV3Sb5 kagome family has generated an enormous amount of interest, in an effort to uncover the nature of this CDW state, and identify any "hidden" broken symmetries. We use spectroscopic-imaging scanning tunneling microscopy to reveal a pronounced intensity anisotropy between different 2a0 CDW directions in KV3Sb5. In particular, by examining the strength of ordering wave vectors as a function of energy in Fourier transforms of differential conductance maps, we find that one of the CDW directions is distinctly different compared to the other two. This observation points towards an intrinsic rotation symmetry broken electronic ground state, where the symmetry is reduced from C6 to C2. Furthermore, in contrast to previous reports, we find that the CDW phase is insensitive to magnetic field direction, regardless of the presence or absence of atomic defects. Our experiments, combined with earlier observations of a stripe 4a0 charge ordering in CsV3Sb5, establish correlation-driven rotation symmetry breaking as a unifying feature of AV3Sb5 kagome superconductors.

preprint2021arXiv

The Dust Mass of Supernova Remnants in M31

The dust temperature and mass of the supernova remnants (SNRs) in M31 are estimated by fitting the infrared spectral energy distribution calculated from the images in the Spitzer/IRAC4 and MIPS24, Herschel/PACS70, 100, 160, and Herschel/SPIRE250, 350$μ$m band. Twenty SNRs with relatively reliable photometry exhibit an average dust temperature of $20.1^{+1.8}_{-1.5}$K, which is higher than the surrounding and indicating the heating effect of supernova explosion. The dust mass of these SNRs ranges from about 100 to 800$ M_{\odot}$, much bigger than the SNRs in the Milky Way. On the other hand, this yields the dust surface density of $0.10^{+0.07}_{-0.04}{ M_{\odot} \rm pc^{-2}}$, about half of the surrounding area, which implies that about half dust in the SNRs is destroyed by the supernova explosion. The dust temperature, the radius, and thus the dust mass all demonstrate that the studied SNRs are old and very likely in the snowplow or even fade away phase because of the limitation by the far distance and observation resolution of M31, and the results can serve as a reference to the final effect of supernova explosion on the surrounding dust.

preprint2021arXiv

Topic Modelling Meets Deep Neural Networks: A Survey

Topic modelling has been a successful technique for text analysis for almost twenty years. When topic modelling met deep neural networks, there emerged a new and increasingly popular research area, neural topic models, with over a hundred models developed and a wide range of applications in neural language understanding such as text generation, summarisation and language models. There is a need to summarise research developments and discuss open problems and future directions. In this paper, we provide a focused yet comprehensive overview of neural topic models for interested researchers in the AI community, so as to facilitate them to navigate and innovate in this fast-growing research area. To the best of our knowledge, ours is the first review focusing on this specific topic.

preprint2020arXiv

A Cleanroom in a Glovebox

The exploration of new materials, novel quantum phases, and devices requires ways to prepare cleaner samples with smaller feature sizes. Initially, this meant the use of a cleanroom that limits the amount and size of dust particles. However, many materials are highly sensitive to oxygen and water in the air. Furthermore, the ever-increasing demand for a quantum workforce, trained and able to use the equipment for creating and characterizing materials, calls for a dramatic reduction in the cost to create and operate such facilities. To this end, we present our cleanroom-in-a-glovebox, a system which allows for the fabrication and characterization of devices in an inert argon atmosphere. We demonstrate the ability to perform a wide range of characterization as well as fabrication steps, without the need for a dedicated room, all in an argon environment. Connection to a vacuum suitcase is also demonstrated to enable receiving from and transfer to various ultra-high vacuum (UHV) equipment including molecular-beam epitaxy (MBE) and scanning tunneling microscopy (STM).

preprint2020arXiv

A Systematic Study of the dust of Galactic Supernova Remnants I. The Distance and the Extinction

By combining the photometric, spectroscopic, and astrometric information of the stars in the sightline of SNRs, the distances to and the extinctions of 32 Galactic supernova remnants (SNRs) are investigated. The stellar atmospheric parameters are from the SDSS$-$DR14$/$APOGEE and LAMOST$-$DR5$/$LEGUE spectroscopic surveys. The multi-band photometry, from optical to infrared, are collected from the {\it Gaia}, APASS, Pan--STARRS1, 2MASS, and {\it WISE} surveys. With the calibrated {\it Gaia} distances of individual stars, the distances to 15 of 32 SNRs are well determined from their produced extinction and association with molecular clouds. The upper limits of distance are derived for 3 SNRs. The color excess ratios $E(g_{\rm P1}-λ) / E(g_{\rm P1}-r_{\rm P1})$ of 32 SNRs are calculated, and their variation with wavebands is fitted by a simple dust model. The inferred dust grain size distribution bifurcates: while the graphite grains have comparable size to the average ISM dust, the silicate grains are generally larger. Along the way, the average extinction law from optical to near-infrared of the Milky Way is derived from the 1.3 million star sample and found to agree with the CCM89 law with $R_{\rm V}=3.15$.

preprint2020arXiv

Distances to the Supernova Remnants in the Inner Disk

Distance measurements of supernova remnants (SNRs) are essential and important. Accurate estimates of physical size, dust masses, and some other properties of SNRs depend critically on accurate distance measurements. However, the determination of SNR distances is still a tough task. Red clump stars (RCs) have a long history been used as standard candles. In this work, we take RCs as tracers to determine the distances to a large group of SNRs in the inner disk. We first select RC stars based on the near-infrared (IR) color-magnitude diagram (CMD). Then, the distance to and extinction of RC stars are calculated. To extend the measurable range of distance, we combine near-IR photometric data from the 2MASS survey with the deeper UKIDSS and VVV surveys. With the help of the Gaia parallaxes, we also remove contaminants including dwarfs and giants. Because an SN explosion compresses the surrounding interstellar medium, the SNR region would become denser and exhibit higher extinction than the surroundings. The distance of a SNR is then recognized by the position where the extinction and its gradient is higher than that of the ambient medium. A total of 63 SNRs' distances in the Galactic inner disk are determined and divided into three Levels A, B, and C with decreasing reliability. The distances to 43 SNRs are well determined with reliability A or B. The diameters and dust masses of SNRs are estimated with the obtained distance and extinction.

preprint2020arXiv

Improving Adversarial Robustness by Enforcing Local and Global Compactness

The fact that deep neural networks are susceptible to crafted perturbations severely impacts the use of deep learning in certain domains of application. Among many developed defense models against such attacks, adversarial training emerges as the most successful method that consistently resists a wide range of attacks. In this work, based on an observation from a previous study that the representations of a clean data example and its adversarial examples become more divergent in higher layers of a deep neural net, we propose the Adversary Divergence Reduction Network which enforces local/global compactness and the clustering assumption over an intermediate layer of a deep neural network. We conduct comprehensive experiments to understand the isolating behavior of each component (i.e., local/global compactness and the clustering assumption) and compare our proposed model with state-of-the-art adversarial training methods. The experimental results demonstrate that augmenting adversarial training with our proposed components can further improve the robustness of the network, leading to higher unperturbed and adversarial predictive performances.

preprint2020arXiv

SummPip: Unsupervised Multi-Document Summarization with Sentence Graph Compression

Obtaining training data for multi-document summarization (MDS) is time consuming and resource-intensive, so recent neural models can only be trained for limited domains. In this paper, we propose SummPip: an unsupervised method for multi-document summarization, in which we convert the original documents to a sentence graph, taking both linguistic and deep representation into account, then apply spectral clustering to obtain multiple clusters of sentences, and finally compress each cluster to generate the final summary. Experiments on Multi-News and DUC-2004 datasets show that our method is competitive to previous unsupervised methods and is even comparable to the neural supervised approaches. In addition, human evaluation shows our system produces consistent and complete summaries compared to human written ones.

preprint2019arXiv

Atomic-scale fragmentation and collapse of antiferromagnetic order in a doped Mott insulator

Disentangling the relationship between the insulating state with a charge gap and the magnetic order in an antiferromagnetic (AF) Mott insulator remains difficult due to inherent phase separation as the Mott state is perturbed. Measuring magnetic and electronic properties at the atomic length scales would provide crucial insight, but this is yet to be experimentally achieved. Here we use spectroscopic-imaging spin-polarized scanning tunneling microscopy (SP-STM) to visualize periodic spin-resolved modulations originating from the AF order in a relativistic Mott insulator Sr2IrO4, and study these as a function of doping. We find that near insulator-to-metal transition (IMT), the long-range AF order melts into a fragmented state with short-range AF correlations. Crucially, we discover that the short-range AF order is locally uncorrelated with the observed spectral gap magnitude. This strongly suggests that short range AF correlations are unlikely to be the culprit behind inhomogeneous gap closing and the emergence of pseudogap regions near IMT. Our work establishes SP-STM as a powerful tool for revealing atomic-scale magnetic information in complex oxides.

preprint2019arXiv

Proximity-Induced Superconductivity in a Topological Crystalline Insulator

Superconducting topological crystalline insulators (TCI) are predicted to host new topological phases protected by crystalline symmetries, but available materials are insufficiently suitable for surface studies. To induce superconductivity at the surface of a prototypical TCI SnTe, we use molecular beam epitaxy to grow a heterostructure of SnTe and a high-Tc superconductor Fe(Te,Se), utilizing a 'buffer' layer to bridge the large lattice mismatch between SnTe and Fe(Te,Se). Using low-temperature scanning tunneling microscopy and spectroscopy, we measure a prominent spectral gap on the surface of SnTe, and demonstrate its superconducting origin by its dependence on temperature and magnetic field. Our work provides a new platform for atomic-scale investigations of emergent topological phenomena in superconducting TCIs.

preprint2016arXiv

Injection method of barrier bucket supported by off-aligned electron cooling for CRing of HIAF

A new accelerator complex, HIAF (the High Intensity Heavy Ion Accelerator Facility), has been approved in China. It is designed to provide intense primary and radioactive ion beams for research in high energy density physics, nuclear physics, atomic physics as well as other applications. In order to achieve a high intensity of up to 5e11 ppp 238U34+, the Compression Ring (CRing) needs to stack more than 5 bunches transferred from the Booster Ring (BRing). However, the normal bucket to bucket injection scheme can only achieve an intensity gain of 2, so an injection method, fixed barrier bucket (BB) supported by electron cooling, is proposed. To suppress the severe space charge effect during the stacking process, off-alignment is adopted in the cooler to control the transverse emittance. In this paper, simulation and optimization with the BETACOOL program are presented.

preprint2016arXiv

Revision of Stellar Intrinsic Colors in the Infrared by the Spectroscopic Surveys

Intrinsic colors of normal stars are derived in the popularly used infrared bands involving the 2MASS/JHKs, WISE, Spitzer/IRAC and AKARI/S9W filters. Based on three spectroscopic surveys -- LAMOST, RAVE and APOGEE, stars are classified into groups of giants and dwarfs, as well as metal-normal and metal-poor stars. An empirical analytical relation of the intrinsic color is obtained with stellar effective temperature (Teff) for each group of stars after the zero-reddening stars are selected from the blue edge in the $J-λ$ versus (Teff) diagram. It is found that metallicity has little effect on the infrared colors. In the near-infrared bands, our results agree with previous work. In addition, the color indexes H-W2 and Ks-W1 that are taken as constant to calculate interstellar extinction are discussed. The intrinsic color of M-type stars are derived separately due to lack of accurate measurement of their effective temperature.

preprint2015arXiv

Calculation of the coupled betatron motion introduced by cooler's solenoids of CSRm

Several solenoids are usually installed in electron cooler device to guide the motion of the electron beam in the cooler. However, the solenoids also have influence to the ion beam in the cooler storage ring. The transverse motion of the ion beam in storage ring will become coupled, if the solenoids installed in the electron cooler are not compensated perfectly. In this paper, the coupled transverse motion due to the uncompensated cooler's solenoids of CSRm (The main storage ring in the IMP, Lan Zhou, China) is studied, and the coupled beam envelopes are calculated by a new method.

preprint2015arXiv

Closed orbit correction of HIMM synchrotron

The correction of closed orbit has great influence on the operation of synchrotron. The design of correction system is one significant component of lattice design. It is suggested to set BPMs at the peaks of betatron oscillation. The correctors need to be located at the positions where \b{eta} function is large or the sources of large errors. In the simulation of the closed orbit correction of HIMM synchrotron, one important reason affecting the result of horizontal correction is the longitudinal alignment error of dipole magnet. It is advisable to decrease this kind of alignment error while the deflection angle of dipole magnet is large.

preprint2015arXiv

Sn-doped Bi1.1Sb0.9Te2S, a bulk topological insulator with ideal properties

A long-standing issue in topological insulator research has been to find a material that provides an ideal platform for characterizing topological surface states without interference from bulk electronic states and can reliably be fabricated as bulk crystals. This material would be a bulk insulator, have a surface state Dirac point energy well isolated from the bulk valence and conduction bands, have high surface state electronic mobility, and be growable as large, high quality bulk single crystals. Here we show that this major materials obstacle in the field is overcome by crystals of lightly Sn-doped Bi1.1Sb0.9Te2S (Sn-BSTS) grown by the Vertical Bridgeman method, which we characterize here via angle-resolved photoemission spectroscopy, scanning tunneling microscopy, transport studies of the bulk and surface states, and X-ray diffraction and Raman scattering. We present this new material as a bulk topological insulator that can be reliably grown and studied in many laboratories around the world.

preprint2015arXiv

The Side-Effects of the Space Charge Field Introduced by Hollow Electron Beam in the Electron Cooler of CSRm

Electron cooler is used to improve the quality of the beam in synchrotron, however it also introduces nonlinear electromagnetic field, which cause tuneshift, tunespread and may drive resonances leading to beam loss. In this paper the tuneshift and the tunespread caused by nonlinear electromagnetic field of the hollow electron beam was investigated, and the resonance driving terms of the nonlinear electromagnetic field was analysed. The differences were presented comparing with the solid electron beam. The calculations were performed for $^{238}U^{32+}$ ions of energy 1.272MeV stored in CSRm, using the parameters given in table1. The conclusion is that in this situation nonlinear field caused by the hollow electron beam do not lead to serious resonances.

He Zhao

What is connected

Connect this record

See the researcher in context

Building this map preview

32 published item(s)

Arrow: A Foundation Model for Causal Discovery

A Unified Wasserstein Distributional Robustness Framework for Adversarial Training

Dust Mass Associated with the Supernova Remnant IC 443 when Emission Meets Extinction

Improving Ensemble Robustness by Collaboratively Promoting and Demoting Adversarial Robustness

Learning to Attack with Fewer Pixels: A Probabilistic Post-hoc Framework for Refining Arbitrary Dense Adversarial Attacks

Learning to Re-weight Examples with Optimal Transport for Imbalanced Classification

Leveraging Cross Feedback of User and Item Embeddings with Attention for Variational Autoencoder based Collaborative Filtering

Manipulation of Dirac band curvature and momentum-dependent g-factor in a kagome magnet YMn6Sn6

MED-TEX: Transferring and Explaining Knowledge with Less Data from Pretrained Medical Imaging Models

Neural Topic Model via Optimal Transport

P3IV: Probabilistic Procedure Planning from Instructional Videos with Weak Supervision

Representing Mixtures of Word Embeddings with Mixtures of Topic Embeddings

Spin-polarized imaging of the antiferromagnetic structure and field-tunable bound states in kagome magnet FeSn

Sports Video Analysis on Large-Scale Data

Explain2Attack: Text Adversarial Attacks via Cross-Domain Interpretability

Nanoscale decoupling of electronic nematicity and structural anisotropy in FeSe thin films

Rotation symmetry breaking in the normal state of a kagome superconductor KV3Sb5

The Dust Mass of Supernova Remnants in M31

Topic Modelling Meets Deep Neural Networks: A Survey

A Cleanroom in a Glovebox

A Systematic Study of the dust of Galactic Supernova Remnants I. The Distance and the Extinction

Distances to the Supernova Remnants in the Inner Disk

Improving Adversarial Robustness by Enforcing Local and Global Compactness

SummPip: Unsupervised Multi-Document Summarization with Sentence Graph Compression

Atomic-scale fragmentation and collapse of antiferromagnetic order in a doped Mott insulator

Proximity-Induced Superconductivity in a Topological Crystalline Insulator

Injection method of barrier bucket supported by off-aligned electron cooling for CRing of HIAF

Revision of Stellar Intrinsic Colors in the Infrared by the Spectroscopic Surveys

Calculation of the coupled betatron motion introduced by cooler's solenoids of CSRm

Closed orbit correction of HIMM synchrotron

Sn-doped Bi1.1Sb0.9Te2S, a bulk topological insulator with ideal properties

The Side-Effects of the Space Charge Field Introduced by Hollow Electron Beam in the Electron Cooler of CSRm