Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
26works
0followers
17topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

26 published item(s)

preprint2026arXiv

Arrow: A Foundation Model for Causal Discovery

We introduce Arrow, a foundation model for zero-shot causal discovery on observational tabular data. Arrow factorizes a directed acyclic graph into an undirected skeleton and a topological order, guaranteeing acyclicity by construction. Given a new dataset, it uses a transformer-based architecture to contextualize variables within and across observations, then predicts skeleton edge probabilities and node order scores that together define a graph. Arrow is trained in a supervised fashion on synthetic datasets with ground-truth graphs, using an end-to-end differentiable directed edge composite likelihood induced by the skeleton-order factorization. The training distribution spans diverse graph families, functional forms, noise models, and dataset shapes. Across in- and out-of-distribution synthetic, semi-synthetic, and real datasets, Arrow matches or outperforms existing causal discovery methods at substantially lower inference cost than competitive alternatives. Our results demonstrate that large-scale pretraining on diverse synthetic data can yield zero-shot causal discovery models that are fast, accurate, and reusable on new datasets.

preprint2022arXiv

A Unified Wasserstein Distributional Robustness Framework for Adversarial Training

It is well-known that deep neural networks (DNNs) are susceptible to adversarial attacks, exposing a severe fragility of deep learning systems. As the result, adversarial training (AT) method, by incorporating adversarial examples during training, represents a natural and effective approach to strengthen the robustness of a DNN-based classifier. However, most AT-based methods, notably PGD-AT and TRADES, typically seek a pointwise adversary that generates the worst-case adversarial example by independently perturbing each data sample, as a way to "probe" the vulnerability of the classifier. Arguably, there are unexplored benefits in considering such adversarial effects from an entire distribution. To this end, this paper presents a unified framework that connects Wasserstein distributional robustness with current state-of-the-art AT methods. We introduce a new Wasserstein cost function and a new series of risk functions, with which we show that standard AT methods are special cases of their counterparts in our framework. This connection leads to an intuitive relaxation and generalization of existing AT methods and facilitates the development of a new family of distributional robustness AT-based algorithms. Extensive experiments show that our distributional robustness AT algorithms robustify further their standard AT counterparts in various settings.

preprint2022arXiv

Dust Mass Associated with the Supernova Remnant IC 443 when Emission Meets Extinction

The dust mass of the well-known supernova remnant (SNR) IC 443 is estimated from both the infrared emission and the visual extinction. With photometry to the images taken by \emph{Spitzer}, \emph{WISE}, \emph{IRAS}, \emph{AKARI} and \emph{Planck}, the spectral energy distribution (SED) of the dust is obtained after subtracting the synchrotron radiation and considering the spectral line emission. The dust mass is derived from fitting the SED by a two-component model, which results in a warm component of the temperature of $\sim$ 53 K and the mass of 0.1 $M_\odot$, and a cold component of the temperature of $\sim 17$ K and the mass of 46 $M_\odot$. On the other hand, the dust mass is derived to be $\sim$ 66 $M_\odot$ from the visual extinction of IC 443 which is identified from the 3D Bayestar extinction map and its coincidence with the infrared emission morphology. Roughly the dust mass derived from the infrared emission and the extinction agree mutually. However, the dust mass derived from the infrared emission can be adjusted to be more consistent with that from the extinction by using different dust opacity property or considering optically thick radiation. In addition, the distribution of dust temperature and mass is analyzed by fitting the SED pixel by pixel.

preprint2022arXiv

Improving Ensemble Robustness by Collaboratively Promoting and Demoting Adversarial Robustness

Ensemble-based adversarial training is a principled approach to achieve robustness against adversarial attacks. An important technique of this approach is to control the transferability of adversarial examples among ensemble members. We propose in this work a simple yet effective strategy to collaborate among committee models of an ensemble model. This is achieved via the secure and insecure sets defined for each model member on a given sample, hence help us to quantify and regularize the transferability. Consequently, our proposed framework provides the flexibility to reduce the adversarial transferability as well as to promote the diversity of ensemble members, which are two crucial factors for better robustness in our ensemble approach. We conduct extensive and comprehensive experiments to demonstrate that our proposed method outperforms the state-of-the-art ensemble baselines, at the same time can detect a wide range of adversarial examples with a nearly perfect accuracy. Our code is available at: https://github.com/tuananhbui89/Crossing-Collaborative-Ensemble.

preprint2022arXiv

Learning to Attack with Fewer Pixels: A Probabilistic Post-hoc Framework for Refining Arbitrary Dense Adversarial Attacks

Deep neural network image classifiers are reported to be susceptible to adversarial evasion attacks, which use carefully crafted images created to mislead a classifier. Many adversarial attacks belong to the category of dense attacks, which generate adversarial examples by perturbing all the pixels of a natural image. To generate sparse perturbations, sparse attacks have been recently developed, which are usually independent attacks derived by modifying a dense attack's algorithm with sparsity regularisations, resulting in reduced attack efficiency. In this paper, we aim to tackle this task from a different perspective. We select the most effective perturbations from the ones generated from a dense attack, based on the fact we find that a considerable amount of the perturbations on an image generated by dense attacks may contribute little to attacking a classifier. Accordingly, we propose a probabilistic post-hoc framework that refines given dense attacks by significantly reducing the number of perturbed pixels but keeping their attack power, trained with mutual information maximisation. Given an arbitrary dense attack, the proposed model enjoys appealing compatibility for making its adversarial images more realistic and less detectable with fewer perturbations. Moreover, our framework performs adversarial attacks much faster than existing sparse attacks.

preprint2022arXiv

Learning to Re-weight Examples with Optimal Transport for Imbalanced Classification

Imbalanced data pose challenges for deep learning based classification models. One of the most widely-used approaches for tackling imbalanced data is re-weighting, where training samples are associated with different weights in the loss function. Most of existing re-weighting approaches treat the example weights as the learnable parameter and optimize the weights on the meta set, entailing expensive bilevel optimization. In this paper, we propose a novel re-weighting method based on optimal transport (OT) from a distributional point of view. Specifically, we view the training set as an imbalanced distribution over its samples, which is transported by OT to a balanced distribution obtained from the meta set. The weights of the training samples are the probability mass of the imbalanced distribution and learned by minimizing the OT distance between the two distributions. Compared with existing methods, our proposed one disengages the dependence of the weight learning on the concerned classifier at each iteration. Experiments on image, text and point cloud datasets demonstrate that our proposed re-weighting method has excellent performance, achieving state-of-the-art results in many cases and providing a promising tool for addressing the imbalanced classification issue.

preprint2022arXiv

Leveraging Cross Feedback of User and Item Embeddings with Attention for Variational Autoencoder based Collaborative Filtering

Matrix factorization (MF) has been widely applied to collaborative filtering in recommendation systems. Its Bayesian variants can derive posterior distributions of user and item embeddings, and are more robust to sparse ratings. However, the Bayesian methods are restricted by their update rules for the posterior parameters due to the conjugacy of the priors and the likelihood. Variational autoencoders (VAE) can address this issue by capturing complex mappings between the posterior parameters and the data. However, current research on VAEs for collaborative filtering only considers the mappings based on the explicit data information while the implicit embedding information is overlooked. In this paper, we first derive evidence lower bounds (ELBO) for Bayesian MF models from two viewpoints: user-oriented and item-oriented. Based on the ELBOs, we propose a VAE-based Bayesian MF framework. It leverages not only the data but also the embedding information to approximate the user-item joint distribution. As suggested by the ELBOs, the approximation is iterative with cross feedback of user and item embeddings into each other's encoders. More specifically, user embeddings sampled at the previous iteration are fed to the item-side encoders to estimate the posterior parameters for the item embeddings at the current iteration, and vice versa. The estimation also attends to the cross-fed embeddings to further exploit useful information. The decoder then reconstructs the data via the matrix factorization over the currently re-sampled user and item embeddings.

preprint2022arXiv

Manipulation of Dirac band curvature and momentum-dependent g-factor in a kagome magnet YMn6Sn6

The Zeeman effect describes the energy change of an atomic quantum state in magnetic field. The magnitude and the direction of this change depend on the dimensionless Lande g-factor. In quantum solids, the response of the Bloch electron states to the magnetic field also exhibits the Zeeman effect with an effective g-factor that was theoretically predicted to be dependent on the momentum. While typically negligible in many ordinary solids, the momentum-dependent variation of the g-factor is theorized to be substantially enhanced in many topological and magnetic systems. However, the momentum-dependence of the g-factor is notoriously difficult to extract and it is yet to be directly experimentally measured. In this work, we report the experimental discovery of a strongly momentum-dependent g-factor in a kagome magnet YMn6Sn6. Using spectroscopic-imaging scanning tunneling microscopy, we map the evolution of a massive Dirac band in the vicinity of the Fermi level as a function of magnetic field. We find that electronic states at different lattice momenta exhibit markedly different Zeeman energy shifts, giving rise to an anomalous g-factor that peaks around the Dirac point. Our work provides the first momentum-resolved visualization of Dirac band curvature manipulation by magnetic field, which should in principle be highly relevant to other topological kagome magnets.

preprint2022arXiv

MED-TEX: Transferring and Explaining Knowledge with Less Data from Pretrained Medical Imaging Models

Deep learning methods usually require a large amount of training data and lack interpretability. In this paper, we propose a novel knowledge distillation and model interpretation framework for medical image classification that jointly solves the above two issues. Specifically, to address the data-hungry issue, a small student model is learned with less data by distilling knowledge from a cumbersome pretrained teacher model. To interpret the teacher model and assist the learning of the student, an explainer module is introduced to highlight the regions of an input that are important for the predictions of the teacher model. Furthermore, the joint framework is trained by a principled way derived from the information-theoretic perspective. Our framework outperforms on the knowledge distillation and model interpretation tasks compared to state-of-the-art methods on a fundus dataset.

preprint2022arXiv

Neural Topic Model via Optimal Transport

Recently, Neural Topic Models (NTMs) inspired by variational autoencoders have obtained increasingly research interest due to their promising results on text analysis. However, it is usually hard for existing NTMs to achieve good document representation and coherent/diverse topics at the same time. Moreover, they often degrade their performance severely on short documents. The requirement of reparameterisation could also comprise their training quality and model flexibility. To address these shortcomings, we present a new neural topic model via the theory of optimal transport (OT). Specifically, we propose to learn the topic distribution of a document by directly minimising its OT distance to the document's word distributions. Importantly, the cost matrix of the OT distance models the weights between topics and words, which is constructed by the distances between topics and words in an embedding space. Our proposed model can be trained efficiently with a differentiable loss. Extensive experiments show that our framework significantly outperforms the state-of-the-art NTMs on discovering more coherent and diverse topics and deriving better document representations for both regular and short texts.

preprint2022arXiv

P3IV: Probabilistic Procedure Planning from Instructional Videos with Weak Supervision

In this paper, we study the problem of procedure planning in instructional videos. Here, an agent must produce a plausible sequence of actions that can transform the environment from a given start to a desired goal state. When learning procedure planning from instructional videos, most recent work leverages intermediate visual observations as supervision, which requires expensive annotation efforts to localize precisely all the instructional steps in training videos. In contrast, we remove the need for expensive temporal video annotations and propose a weakly supervised approach by learning from natural language instructions. Our model is based on a transformer equipped with a memory module, which maps the start and goal observations to a sequence of plausible actions. Furthermore, we augment our model with a probabilistic generative module to capture the uncertainty inherent to procedure planning, an aspect largely overlooked by previous work. We evaluate our model on three datasets and show our weaklysupervised approach outperforms previous fully supervised state-of-the-art models on multiple metrics.

preprint2022arXiv

Representing Mixtures of Word Embeddings with Mixtures of Topic Embeddings

A topic model is often formulated as a generative model that explains how each word of a document is generated given a set of topics and document-specific topic proportions. It is focused on capturing the word co-occurrences in a document and hence often suffers from poor performance in analyzing short documents. In addition, its parameter estimation often relies on approximate posterior inference that is either not scalable or suffers from large approximation error. This paper introduces a new topic-modeling framework where each document is viewed as a set of word embedding vectors and each topic is modeled as an embedding vector in the same embedding space. Embedding the words and topics in the same vector space, we define a method to measure the semantic difference between the embedding vectors of the words of a document and these of the topics, and optimize the topic embeddings to minimize the expected difference over all documents. Experiments on text analysis demonstrate that the proposed method, which is amenable to mini-batch stochastic gradient descent based optimization and hence scalable to big corpora, provides competitive performance in discovering more coherent and diverse topics and extracting better document representations.

preprint2022arXiv

Spin-polarized imaging of the antiferromagnetic structure and field-tunable bound states in kagome magnet FeSn

Kagome metals are as an exciting playground for the explorations of novel phenomena at the intersection of topology, electron correlations and magnetism. The family of FeSn-based kagome magnets in particular attracted a lot of attention for simplicity of the layered crystal structure and tunable topological electronic band structure. Despite a significant progress in understanding their bulk properties, surface electronic and magnetic structures are yet to be fully explored in many of these systems. In this work, we focus on a prototypical kagome metal FeSn. Using a combination of spin-averaged and spin-polarized scanning tunneling microscopy, we provide the first atomic-scale visualization of the layered antiferromagnetic structure at the surface of FeSn. In contrast to the field-tunable electronic structure of cousin material Fe3Sn2 that is a ferromagnet, we find that electronic density-of-states of FeSn is robust to the application of external magnetic field. Interestingly, despite the field-insensitive electronic band structure, FeSn exhibits bounds states tied to specific impurities with large effective moments that strongly couple to the magnetic field. Our experiments provide microscopic insights necessary for theoretical modeling of FeSn and serve as a spring board for spin-polarized measurements of topological magnets in general.

preprint2022arXiv

Sports Video Analysis on Large-Scale Data

This paper investigates the modeling of automated machine description on sports video, which has seen much progress recently. Nevertheless, state-of-the-art approaches fall quite short of capturing how human experts analyze sports scenes. There are several major reasons: (1) The used dataset is collected from non-official providers, which naturally creates a gap between models trained on those datasets and real-world applications; (2) previously proposed methods require extensive annotation efforts (i.e., player and ball segmentation at pixel level) on localizing useful visual features to yield acceptable results; (3) very few public datasets are available. In this paper, we propose a novel large-scale NBA dataset for Sports Video Analysis (NSVA) with a focus on captioning, to address the above challenges. We also design a unified approach to process raw videos into a stack of meaningful features with minimum labelling efforts, showing that cross modeling on such features using a transformer architecture leads to strong performance. In addition, we demonstrate the broad application of NSVA by addressing two additional tasks, namely fine-grained sports action recognition and salient player identification. Code and dataset are available at https://github.com/jackwu502/NSVA.

preprint2021arXiv

Explain2Attack: Text Adversarial Attacks via Cross-Domain Interpretability

Training robust deep learning models for down-stream tasks is a critical challenge. Research has shown that down-stream models can be easily fooled with adversarial inputs that look like the training data, but slightly perturbed, in a way imperceptible to humans. Understanding the behavior of natural language models under these attacks is crucial to better defend these models against such attacks. In the black-box attack setting, where no access to model parameters is available, the attacker can only query the output information from the targeted model to craft a successful attack. Current black-box state-of-the-art models are costly in both computational complexity and number of queries needed to craft successful adversarial examples. For real world scenarios, the number of queries is critical, where less queries are desired to avoid suspicion towards an attacking agent. In this paper, we propose Explain2Attack, a black-box adversarial attack on text classification task. Instead of searching for important words to be perturbed by querying the target model, Explain2Attack employs an interpretable substitute model from a similar domain to learn word importance scores. We show that our framework either achieves or out-performs attack rates of the state-of-the-art models, yet with lower queries cost and higher efficiency.

preprint2021arXiv

Nanoscale decoupling of electronic nematicity and structural anisotropy in FeSe thin films

In a material prone to a nematic instability, anisotropic strain in principle provides a preferred symmetry-breaking direction for the electronic nematic state to follow. This is consistent with experimental observations, where electronic nematicity and structural anisotropy typically appear hand-in-hand. In this work, we discover that electronic nematicity can be locally decoupled from the underlying structural anisotropy in strain-engineered iron-selenide (FeSe) thin films. We use heteroepitaxial molecular beam epitaxy to grow FeSe with a nanoscale network of modulations that give rise to spatially varying strain. We map local anisotropic strain by analyzing scanning tunneling microscopy topographs, and visualize electronic nematic domains from concomitant spectroscopic maps. While the domains form so that the energy of nemato-elastic coupling is minimized, we observe distinct regions where electronic nematic ordering fails to flip direction, even though the underlying structural anisotropy is locally reversed. The findings point towards a nanometer-scale stiffness of the nematic order parameter.

preprint2021arXiv

Rotation symmetry breaking in the normal state of a kagome superconductor KV3Sb5

Recently discovered kagome superconductors AV3Sb5 (A=K, Rb, Cs) provide a fresh opportunity to realize and study correlation-driven electronic phenomena on a kagome lattice. The observation of a 2a0 by 2a0 charge density wave (CDW) in the normal state of all members of AV3Sb5 kagome family has generated an enormous amount of interest, in an effort to uncover the nature of this CDW state, and identify any "hidden" broken symmetries. We use spectroscopic-imaging scanning tunneling microscopy to reveal a pronounced intensity anisotropy between different 2a0 CDW directions in KV3Sb5. In particular, by examining the strength of ordering wave vectors as a function of energy in Fourier transforms of differential conductance maps, we find that one of the CDW directions is distinctly different compared to the other two. This observation points towards an intrinsic rotation symmetry broken electronic ground state, where the symmetry is reduced from C6 to C2. Furthermore, in contrast to previous reports, we find that the CDW phase is insensitive to magnetic field direction, regardless of the presence or absence of atomic defects. Our experiments, combined with earlier observations of a stripe 4a0 charge ordering in CsV3Sb5, establish correlation-driven rotation symmetry breaking as a unifying feature of AV3Sb5 kagome superconductors.

preprint2021arXiv

The Dust Mass of Supernova Remnants in M31

The dust temperature and mass of the supernova remnants (SNRs) in M31 are estimated by fitting the infrared spectral energy distribution calculated from the images in the Spitzer/IRAC4 and MIPS24, Herschel/PACS70, 100, 160, and Herschel/SPIRE250, 350$μ$m band. Twenty SNRs with relatively reliable photometry exhibit an average dust temperature of $20.1^{+1.8}_{-1.5}$K, which is higher than the surrounding and indicating the heating effect of supernova explosion. The dust mass of these SNRs ranges from about 100 to 800$ M_{\odot}$, much bigger than the SNRs in the Milky Way. On the other hand, this yields the dust surface density of $0.10^{+0.07}_{-0.04}{ M_{\odot} \rm pc^{-2}}$, about half of the surrounding area, which implies that about half dust in the SNRs is destroyed by the supernova explosion. The dust temperature, the radius, and thus the dust mass all demonstrate that the studied SNRs are old and very likely in the snowplow or even fade away phase because of the limitation by the far distance and observation resolution of M31, and the results can serve as a reference to the final effect of supernova explosion on the surrounding dust.

preprint2021arXiv

Topic Modelling Meets Deep Neural Networks: A Survey

Topic modelling has been a successful technique for text analysis for almost twenty years. When topic modelling met deep neural networks, there emerged a new and increasingly popular research area, neural topic models, with over a hundred models developed and a wide range of applications in neural language understanding such as text generation, summarisation and language models. There is a need to summarise research developments and discuss open problems and future directions. In this paper, we provide a focused yet comprehensive overview of neural topic models for interested researchers in the AI community, so as to facilitate them to navigate and innovate in this fast-growing research area. To the best of our knowledge, ours is the first review focusing on this specific topic.

preprint2020arXiv

A Cleanroom in a Glovebox

The exploration of new materials, novel quantum phases, and devices requires ways to prepare cleaner samples with smaller feature sizes. Initially, this meant the use of a cleanroom that limits the amount and size of dust particles. However, many materials are highly sensitive to oxygen and water in the air. Furthermore, the ever-increasing demand for a quantum workforce, trained and able to use the equipment for creating and characterizing materials, calls for a dramatic reduction in the cost to create and operate such facilities. To this end, we present our cleanroom-in-a-glovebox, a system which allows for the fabrication and characterization of devices in an inert argon atmosphere. We demonstrate the ability to perform a wide range of characterization as well as fabrication steps, without the need for a dedicated room, all in an argon environment. Connection to a vacuum suitcase is also demonstrated to enable receiving from and transfer to various ultra-high vacuum (UHV) equipment including molecular-beam epitaxy (MBE) and scanning tunneling microscopy (STM).

preprint2020arXiv

A Systematic Study of the dust of Galactic Supernova Remnants I. The Distance and the Extinction

By combining the photometric, spectroscopic, and astrometric information of the stars in the sightline of SNRs, the distances to and the extinctions of 32 Galactic supernova remnants (SNRs) are investigated. The stellar atmospheric parameters are from the SDSS$-$DR14$/$APOGEE and LAMOST$-$DR5$/$LEGUE spectroscopic surveys. The multi-band photometry, from optical to infrared, are collected from the {\it Gaia}, APASS, Pan--STARRS1, 2MASS, and {\it WISE} surveys. With the calibrated {\it Gaia} distances of individual stars, the distances to 15 of 32 SNRs are well determined from their produced extinction and association with molecular clouds. The upper limits of distance are derived for 3 SNRs. The color excess ratios $E(g_{\rm P1}-λ) / E(g_{\rm P1}-r_{\rm P1})$ of 32 SNRs are calculated, and their variation with wavebands is fitted by a simple dust model. The inferred dust grain size distribution bifurcates: while the graphite grains have comparable size to the average ISM dust, the silicate grains are generally larger. Along the way, the average extinction law from optical to near-infrared of the Milky Way is derived from the 1.3 million star sample and found to agree with the CCM89 law with $R_{\rm V}=3.15$.

preprint2020arXiv

Distances to the Supernova Remnants in the Inner Disk

Distance measurements of supernova remnants (SNRs) are essential and important. Accurate estimates of physical size, dust masses, and some other properties of SNRs depend critically on accurate distance measurements. However, the determination of SNR distances is still a tough task. Red clump stars (RCs) have a long history been used as standard candles. In this work, we take RCs as tracers to determine the distances to a large group of SNRs in the inner disk. We first select RC stars based on the near-infrared (IR) color-magnitude diagram (CMD). Then, the distance to and extinction of RC stars are calculated. To extend the measurable range of distance, we combine near-IR photometric data from the 2MASS survey with the deeper UKIDSS and VVV surveys. With the help of the Gaia parallaxes, we also remove contaminants including dwarfs and giants. Because an SN explosion compresses the surrounding interstellar medium, the SNR region would become denser and exhibit higher extinction than the surroundings. The distance of a SNR is then recognized by the position where the extinction and its gradient is higher than that of the ambient medium. A total of 63 SNRs' distances in the Galactic inner disk are determined and divided into three Levels A, B, and C with decreasing reliability. The distances to 43 SNRs are well determined with reliability A or B. The diameters and dust masses of SNRs are estimated with the obtained distance and extinction.

preprint2020arXiv

Improving Adversarial Robustness by Enforcing Local and Global Compactness

The fact that deep neural networks are susceptible to crafted perturbations severely impacts the use of deep learning in certain domains of application. Among many developed defense models against such attacks, adversarial training emerges as the most successful method that consistently resists a wide range of attacks. In this work, based on an observation from a previous study that the representations of a clean data example and its adversarial examples become more divergent in higher layers of a deep neural net, we propose the Adversary Divergence Reduction Network which enforces local/global compactness and the clustering assumption over an intermediate layer of a deep neural network. We conduct comprehensive experiments to understand the isolating behavior of each component (i.e., local/global compactness and the clustering assumption) and compare our proposed model with state-of-the-art adversarial training methods. The experimental results demonstrate that augmenting adversarial training with our proposed components can further improve the robustness of the network, leading to higher unperturbed and adversarial predictive performances.

preprint2020arXiv

SummPip: Unsupervised Multi-Document Summarization with Sentence Graph Compression

Obtaining training data for multi-document summarization (MDS) is time consuming and resource-intensive, so recent neural models can only be trained for limited domains. In this paper, we propose SummPip: an unsupervised method for multi-document summarization, in which we convert the original documents to a sentence graph, taking both linguistic and deep representation into account, then apply spectral clustering to obtain multiple clusters of sentences, and finally compress each cluster to generate the final summary. Experiments on Multi-News and DUC-2004 datasets show that our method is competitive to previous unsupervised methods and is even comparable to the neural supervised approaches. In addition, human evaluation shows our system produces consistent and complete summaries compared to human written ones.

preprint2019arXiv

Atomic-scale fragmentation and collapse of antiferromagnetic order in a doped Mott insulator

Disentangling the relationship between the insulating state with a charge gap and the magnetic order in an antiferromagnetic (AF) Mott insulator remains difficult due to inherent phase separation as the Mott state is perturbed. Measuring magnetic and electronic properties at the atomic length scales would provide crucial insight, but this is yet to be experimentally achieved. Here we use spectroscopic-imaging spin-polarized scanning tunneling microscopy (SP-STM) to visualize periodic spin-resolved modulations originating from the AF order in a relativistic Mott insulator Sr2IrO4, and study these as a function of doping. We find that near insulator-to-metal transition (IMT), the long-range AF order melts into a fragmented state with short-range AF correlations. Crucially, we discover that the short-range AF order is locally uncorrelated with the observed spectral gap magnitude. This strongly suggests that short range AF correlations are unlikely to be the culprit behind inhomogeneous gap closing and the emergence of pseudogap regions near IMT. Our work establishes SP-STM as a powerful tool for revealing atomic-scale magnetic information in complex oxides.

preprint2019arXiv

Proximity-Induced Superconductivity in a Topological Crystalline Insulator

Superconducting topological crystalline insulators (TCI) are predicted to host new topological phases protected by crystalline symmetries, but available materials are insufficiently suitable for surface studies. To induce superconductivity at the surface of a prototypical TCI SnTe, we use molecular beam epitaxy to grow a heterostructure of SnTe and a high-Tc superconductor Fe(Te,Se), utilizing a 'buffer' layer to bridge the large lattice mismatch between SnTe and Fe(Te,Se). Using low-temperature scanning tunneling microscopy and spectroscopy, we measure a prominent spectral gap on the surface of SnTe, and demonstrate its superconducting origin by its dependence on temperature and magnetic field. Our work provides a new platform for atomic-scale investigations of emergent topological phenomena in superconducting TCIs.