Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
36works
0followers
19topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

36 published item(s)

preprint2026arXiv

SpaAct: Spatially-Activated Transition Learning with Curriculum Adaptation for Vision-Language Navigation

Vision-and-Language Navigation (VLN) aims to enable an embodied agent to follow natural-language instructions and navigate to a target location in unseen 3D environments. We argue that adapting VLMs to VLN requires endowing them with two complementary capabilities for acquiring such awareness, namely backward action reasoning (why) and forward transition prediction~(how). Based on this insight, we propose SpaAct, a simple yet effective training framework that activates the dynamic spatial awareness in VLMs. Specifically, SpaAct introduces two spatial activation tasks: Action Retrospection, which asks the model to infer the executed action sequence from visual transitions, and Future Frame Selection, which forces the model to predict the visual transitions conditioned on history and action. These two objectives provide lightweight supervision on both backward action reasoning and forward transition prediction, encouraging the model to build dynamic spatial awareness in a VLM-friendly way. To further stabilize adaptation, we design TriPA, a Tri-factor Progressive Adaptive curriculum learning method that organizes training samples from easy to hard, allowing the model to gradually acquire navigation skills from basic locomotion to long-horizon reasoning. Experiments on standard VLN-CE benchmarks show that SpaAct consistently improves VLM-based navigation and achieves state-of-the-art performance. We will release the code and models to support future research.

preprint2024arXiv

A comprehensive framework for occluded human pose estimation

Occlusion presents a significant challenge in human pose estimation. The challenges posed by occlusion can be attributed to the following factors: 1) Data: The collection and annotation of occluded human pose samples are relatively challenging. 2) Feature: Occlusion can cause feature confusion due to the high similarity between the target person and interfering individuals. 3) Inference: Robust inference becomes challenging due to the loss of complete body structural information. The existing methods designed for occluded human pose estimation usually focus on addressing only one of these factors. In this paper, we propose a comprehensive framework DAG (Data, Attention, Graph) to address the performance degradation caused by occlusion. Specifically, we introduce the mask joints with instance paste data augmentation technique to simulate occlusion scenarios. Additionally, an Adaptive Discriminative Attention Module (ADAM) is proposed to effectively enhance the features of target individuals. Furthermore, we present the Feature-Guided Multi-Hop GCN (FGMP-GCN) to fully explore the prior knowledge of body structure and improve pose estimation results. Through extensive experiments conducted on three benchmark datasets for occluded human pose estimation, we demonstrate that the proposed method outperforms existing methods. Code and data will be publicly available.

preprint2023arXiv

Towards High Performance One-Stage Human Pose Estimation

Making top-down human pose estimation method present both good performance and high efficiency is appealing. Mask RCNN can largely improve the efficiency by conducting person detection and pose estimation in a single framework, as the features provided by the backbone are able to be shared by the two tasks. However, the performance is not as good as traditional two-stage methods. In this paper, we aim to largely advance the human pose estimation results of Mask-RCNN and still keep the efficiency. Specifically, we make improvements on the whole process of pose estimation, which contains feature extraction and keypoint detection. The part of feature extraction is ensured to get enough and valuable information of pose. Then, we introduce a Global Context Module into the keypoints detection branch to enlarge the receptive field, as it is crucial to successful human pose estimation. On the COCO val2017 set, our model using the ResNet-50 backbone achieves an AP of 68.1, which is 2.6 higher than Mask RCNN (AP of 65.5). Compared to the classic two-stage top-down method SimpleBaseline, our model largely narrows the performance gap (68.1 AP vs. 68.9 AP) with a much faster inference speed (77 ms vs. 168 ms), demonstrating the effectiveness of the proposed method. Code is available at: https://github.com/lingl_space/maskrcnn_keypoint_refined.

preprint2022arXiv

A Unified and Biologically-Plausible Relational Graph Representation of Vision Transformers

Vision transformer (ViT) and its variants have achieved remarkable successes in various visual tasks. The key characteristic of these ViT models is to adopt different aggregation strategies of spatial patch information within the artificial neural networks (ANNs). However, there is still a key lack of unified representation of different ViT architectures for systematic understanding and assessment of model representation performance. Moreover, how those well-performing ViT ANNs are similar to real biological neural networks (BNNs) is largely unexplored. To answer these fundamental questions, we, for the first time, propose a unified and biologically-plausible relational graph representation of ViT models. Specifically, the proposed relational graph representation consists of two key sub-graphs: aggregation graph and affine graph. The former one considers ViT tokens as nodes and describes their spatial interaction, while the latter one regards network channels as nodes and reflects the information communication between channels. Using this unified relational graph representation, we found that: a) a sweet spot of the aggregation graph leads to ViTs with significantly improved predictive performance; b) the graph measures of clustering coefficient and average path length are two effective indicators of model prediction performance, especially when applying on the datasets with small samples; c) our findings are consistent across various ViT architectures and multiple datasets; d) the proposed relational graph representation of ViT has high similarity with real BNNs derived from brain science data. Overall, our work provides a novel unified and biologically-plausible paradigm for more interpretable and effective representation of ViT ANNs.

preprint2022arXiv

Conservation of the particle-hole symmetry in the pseudogap state in optimally-doped Bi2Sr2CuO6+δ superconductor

The pseudogap state is one of the most enigmatic characteristics in the anomalous normal state properties of the high temperature cuprate superconductors. A central issue is to reveal whether there is a symmetry breaking and which symmetries are broken across the pseudogap transition. By performing high resolution laser-based angle-resolved photoemission measurements on the optimally-doped Bi2Sr1.6La0.4CuO6+δ superconductor, we report the observations of the particle-hole symmetry conservation in both the superconducting state and the pseudogap state along the entire Fermi surface. These results provide key insights in understanding the nature of the pseudogap and its relation with high temperature superconductivity.

preprint2022arXiv

Coupling Visual Semantics of Artificial Neural Networks and Human Brain Function via Synchronized Activations

Artificial neural networks (ANNs), originally inspired by biological neural networks (BNNs), have achieved remarkable successes in many tasks such as visual representation learning. However, whether there exists semantic correlations/connections between the visual representations in ANNs and those in BNNs remains largely unexplored due to both the lack of an effective tool to link and couple two different domains, and the lack of a general and effective framework of representing the visual semantics in BNNs such as human functional brain networks (FBNs). To answer this question, we propose a novel computational framework, Synchronized Activations (Sync-ACT), to couple the visual representation spaces and semantics between ANNs and BNNs in human brain based on naturalistic functional magnetic resonance imaging (nfMRI) data. With this approach, we are able to semantically annotate the neurons in ANNs with biologically meaningful description derived from human brain imaging for the first time. We evaluated the Sync-ACT framework on two publicly available movie-watching nfMRI datasets. The experiments demonstrate a) the significant correlation and similarity of the semantics between the visual representations in FBNs and those in a variety of convolutional neural networks (CNNs) models; b) the close relationship between CNN's visual representation similarity to BNNs and its performance in image classification tasks. Overall, our study introduces a general and effective paradigm to couple the ANNs and BNNs and provides novel insights for future studies such as brain-inspired artificial intelligence.

preprint2022arXiv

Differentiable Moving Horizon Estimation for Robust Flight Control

Estimating and reacting to external disturbances is of fundamental importance for robust control of quadrotors. Existing estimators typically require significant tuning or training with a large amount of data, including the ground truth, to achieve satisfactory performance. This paper proposes a data-efficient differentiable moving horizon estimation (DMHE) algorithm that can automatically tune the MHE parameters online and also adapt to different scenarios. We achieve this by deriving the analytical gradient of the estimated trajectory from MHE with respect to the tuning parameters, enabling end-to-end learning for auto-tuning. Most interestingly, we show that the gradient can be calculated efficiently from a Kalman filter in a recursive form. Moreover, we develop a model-based policy gradient algorithm to learn the parameters directly from the trajectory tracking errors without the need for the ground truth. The proposed DMHE can be further embedded as a layer with other neural networks for joint optimization. Finally, we demonstrate the effectiveness of the proposed method via both simulation and experiments on quadrotors, where challenging scenarios such as sudden payload change and flying in downwash are examined.

preprint2022arXiv

Disentangling Spatial-Temporal Functional Brain Networks via Twin-Transformers

How to identify and characterize functional brain networks (BN) is fundamental to gain system-level insights into the mechanisms of brain organizational architecture. Current functional magnetic resonance (fMRI) analysis highly relies on prior knowledge of specific patterns in either spatial (e.g., resting-state network) or temporal (e.g., task stimulus) domain. In addition, most approaches aim to find group-wise common functional networks, individual-specific functional networks have been rarely studied. In this work, we propose a novel Twin-Transformers framework to simultaneously infer common and individual functional networks in both spatial and temporal space, in a self-supervised manner. The first transformer takes space-divided information as input and generates spatial features, while the second transformer takes time-related information as input and outputs temporal features. The spatial and temporal features are further separated into common and individual ones via interactions (weights sharing) and constraints between the two transformers. We applied our TwinTransformers to Human Connectome Project (HCP) motor task-fMRI dataset and identified multiple common brain networks, including both task-related and resting-state networks (e.g., default mode network). Interestingly, we also successfully recovered a set of individual-specific networks that are not related to task stimulus and only exist at the individual level.

preprint2022arXiv

Electronic Nature of Charge Density Wave and Electron-Phonon Coupling in Kagome Superconductor KV$_3$Sb$_5$

The Kagome superconductors AV3Sb5 (A=K, Rb, Cs) have received enormous attention due to their nontrivial topological electronic structure, anomalous physical properties and superconductivity. Unconventional charge density wave (CDW) has been detected in AV3Sb5. High-precision electronic structure determination is essential to understand its origin. Here we unveil electronic nature of the CDW phase in our high-resolution angle-resolved photoemission measurements on KV3Sb5. We have observed CDW-induced Fermi surface reconstruction and the associated band folding. The CDW-induced band splitting and the associated gap opening have been revealed at the boundary of the pristine and reconstructed Brillouin zones. The Fermi surface- and momentum-dependent CDW gap is measured and the strongly anisotropic CDW gap is observed for all the V-derived Fermi surface. In particular, we have observed signatures of the electron-phonon coupling in KV3Sb5. These results provide key insights in understanding the nature of the CDW state and its interplay with superconductivity in AV3Sb5 superconductors.

preprint2022arXiv

Exfoliation of 2D van der Waals crystals in ultrahigh vacuum for interface engineering

Two-dimensional (2D) materials and their heterostructures have been intensively studied in recent years due to their potential applications in electronic, optoelectronic, and spintronic devices. Nonetheless, the realization of 2D heterostructures with atomically flat and clean interfaces remains challenging, especially for air-sensitive materials, which hinders the in-depth investigation of interface-induced phenomena and the fabrication of high-quality devices. Here, we circumvented this challenge by exfoliating 2D materials in an ultrahigh vacuum. Remarkably, ultraflat and clean substrate surfaces can assist the exfoliation of 2D materials, regardless of the substrate and 2D material, thus providing a universal method for the preparation of heterostructures with ideal interfaces. In addition, we studied the properties of two prototypical systems that cannot be achieved previously, including the electronic structure of monolayer phospherene and optical responses of transition metal dichalcogenides on different metal substrates. Our work paves the way to engineer rich interface-induced phenomena, such as proximity effects and moiré superlattices.

preprint2022arXiv

Eye-gaze-guided Vision Transformer for Rectifying Shortcut Learning

Learning harmful shortcuts such as spurious correlations and biases prevents deep neural networks from learning the meaningful and useful representations, thus jeopardizing the generalizability and interpretability of the learned representation. The situation becomes even more serious in medical imaging, where the clinical data (e.g., MR images with pathology) are limited and scarce while the reliability, generalizability and transparency of the learned model are highly required. To address this problem, we propose to infuse human experts' intelligence and domain knowledge into the training of deep neural networks. The core idea is that we infuse the visual attention information from expert radiologists to proactively guide the deep model to focus on regions with potential pathology and avoid being trapped in learning harmful shortcuts. To do so, we propose a novel eye-gaze-guided vision transformer (EG-ViT) for diagnosis with limited medical image data. We mask the input image patches that are out of the radiologists' interest and add an additional residual connection in the last encoder layer of EG-ViT to maintain the correlations of all patches. The experiments on two public datasets of INbreast and SIIM-ACR demonstrate our EG-ViT model can effectively learn/transfer experts' domain knowledge and achieve much better performance than baselines. Meanwhile, it successfully rectifies the harmful shortcut learning and significantly improves the EG-ViT model's interpretability. In general, EG-ViT takes the advantages of both human expert's prior knowledge and the power of deep neural networks. This work opens new avenues for advancing current artificial intelligence paradigms by infusing human intelligence.

preprint2022arXiv

Giant and Reversible Electronic Structure Evolution in a Magnetic Topological Material EuCd2As2

The electronic structure and the physical properties of quantum materials can be significantly altered by charge carrier doping and magnetic state transition. Here we report a discovery of a giant and reversible electronic structure evolution with doping in a magnetic topological material. By performing high-resolution angle-resolved photoemission measurements on EuCd2As2,we found that a huge amount of hole doping can be introduced into the sample surface due to surface absorption. The electronic structure exhibits a dramatic change with the hole doping which can not be described by a rigid band shift. Prominent band splitting is observed at high doping which corresponds to a doping-induced magnetic transition at low temperature (below -15 K) from an antiferromagnetic state to a ferromagnetic state. These results have established a detailed electronic phase diagram of EuCd2As2 where the electronic structure and the magnetic structure change systematically and dramatically with the doping level. They further suggest that the transport, magnetic and topological properties of EuCd2As2 can be greatly modified by doping. These work will stimulate further investigations to explore for new phenomena and properties in doping this magnetic topological material.

preprint2022arXiv

Invertible Sharpening Network for MRI Reconstruction Enhancement

High-quality MRI reconstruction plays a critical role in clinical applications. Deep learning-based methods have achieved promising results on MRI reconstruction. However, most state-of-the-art methods were designed to optimize the evaluation metrics commonly used for natural images, such as PSNR and SSIM, whereas the visual quality is not primarily pursued. Compared to the fully-sampled images, the reconstructed images are often blurry, where high-frequency features might not be sharp enough for confident clinical diagnosis. To this end, we propose an invertible sharpening network (InvSharpNet) to improve the visual quality of MRI reconstructions. During training, unlike the traditional methods that learn to map the input data to the ground truth, InvSharpNet adapts a backward training strategy that learns a blurring transform from the ground truth (fully-sampled image) to the input data (blurry reconstruction). During inference, the learned blurring transform can be inverted to a sharpening transform leveraging the network's invertibility. The experiments on various MRI datasets demonstrate that InvSharpNet can improve reconstruction sharpness with few artifacts. The results were also evaluated by radiologists, indicating better visual quality and diagnostic confidence of our proposed method.

preprint2022arXiv

Mask-guided Vision Transformer (MG-ViT) for Few-Shot Learning

Learning with little data is challenging but often inevitable in various application scenarios where the labeled data is limited and costly. Recently, few-shot learning (FSL) gained increasing attention because of its generalizability of prior knowledge to new tasks that contain only a few samples. However, for data-intensive models such as vision transformer (ViT), current fine-tuning based FSL approaches are inefficient in knowledge generalization and thus degenerate the downstream task performances. In this paper, we propose a novel mask-guided vision transformer (MG-ViT) to achieve an effective and efficient FSL on ViT model. The key idea is to apply a mask on image patches to screen out the task-irrelevant ones and to guide the ViT to focus on task-relevant and discriminative patches during FSL. Particularly, MG-ViT only introduces an additional mask operation and a residual connection, enabling the inheritance of parameters from pre-trained ViT without any other cost. To optimally select representative few-shot samples, we also include an active learning based sample selection method to further improve the generalizability of MG-ViT based FSL. We evaluate the proposed MG-ViT on both Agri-ImageNet classification task and ACFR apple detection task with gradient-weighted class activation mapping (Grad-CAM) as the mask. The experimental results show that the MG-ViT model significantly improves the performance when compared with general fine-tuning based ViT models, providing novel insights and a concrete approach towards generalizing data-intensive and large-scale deep learning models for FSL.

preprint2022arXiv

One-step exfoliation method for plasmonic activation of large-area 2D crystals

Advanced exfoliation techniques are crucial for exploring the intrinsic properties and applications of 2D materials. Though the recently discovered Au-enhanced exfoliation technique provides an effective strategy for preparation of large-scale 2D crystals, the high cost of gold hinders this method from being widely adopted in industrial applications. In addition, direct Au contact could significantly quench photoluminescence (PL) emission in 2D semiconductors. It is therefore crucial to find alternative metals that can replace gold to achieve efficient exfoliation of 2D materials. Here, we present a one-step Ag-assisted method that can efficiently exfoliate many large-area 2D monolayers, where the yield ratio is comparable to Au-enhanced exfoliation method. Differing from Au film, however, the surface roughness of as-prepared Ag films on SiO2/Si substrate is much higher, which facilitates the generation of surface plasmons resulting from the nanostructures formed on the rough Ag surface. More interestingly, the strong coupling between 2D semiconductor crystals (e.g. MoS2, MoSe2) and Ag film leads to a unique PL enhancement that has not been observed in other mechanical exfoliation techniques, which can be mainly attributed to enhanced light-matter interaction as a result of extended propagation of surface plasmonic polariton (SPP). Our work provides a lower-cost and universal Ag-assisted exfoliation method, while at the same offering enhanced SPP-matter interactions.

preprint2022arXiv

Physical realization of topological Roman surface by spin-induced ferroelectric polarization in cubic lattice

Topology, a mathematical concept in geometry, has become an ideal theoretical tool for describing topological states and phase transitions. Many topological concepts have found their physical entities in real or reciprocal spaces identified by topological/geometrical invariants, which are usually defined on orientable surfaces such as torus and sphere. It is natural to quest whether it is possible to find the physical realization of more intriguing non-orientable surfaces. Herein, we show that the set of spin-induced ferroelectric polarizations in cubic perovskite oxides AMn3Cr4O12 (A = La and Tb) resides on the topological Roman surface, a non-orientable two-dimensional manifold formed by sewing a Mobius strip edge to that of a disc. The induced polarization may travel in a loop along the non-orientable Mobius strip or orientable disc depending on how the spin evolves as controlled by external magnetic field. Experimentally, the periodicity of polarization can be the same or the twice of the rotating magnetic field, being well consistent with the orientability of disc and Mobius strip, respectively. This path dependent topological magnetoelectric effect presents a way to detect the global geometry of the surface and deepens our understanding of topology in both mathematics and physics

preprint2022arXiv

Rectify ViT Shortcut Learning by Visual Saliency

Shortcut learning is common but harmful to deep learning models, leading to degenerated feature representations and consequently jeopardizing the model's generalizability and interpretability. However, shortcut learning in the widely used Vision Transformer framework is largely unknown. Meanwhile, introducing domain-specific knowledge is a major approach to rectifying the shortcuts, which are predominated by background related factors. For example, in the medical imaging field, eye-gaze data from radiologists is an effective human visual prior knowledge that has the great potential to guide the deep learning models to focus on meaningful foreground regions of interest. However, obtaining eye-gaze data is time-consuming, labor-intensive and sometimes even not practical. In this work, we propose a novel and effective saliency-guided vision transformer (SGT) model to rectify shortcut learning in ViT with the absence of eye-gaze data. Specifically, a computational visual saliency model is adopted to predict saliency maps for input image samples. Then, the saliency maps are used to distil the most informative image patches. In the proposed SGT, the self-attention among image patches focus only on the distilled informative ones. Considering this distill operation may lead to global information lost, we further introduce, in the last encoder layer, a residual connection that captures the self-attention across all the image patches. The experiment results on four independent public datasets show that our SGT framework can effectively learn and leverage human prior knowledge without eye gaze data and achieves much better performance than baselines. Meanwhile, it successfully rectifies the harmful shortcut learning and significantly improves the interpretability of the ViT model, demonstrating the promise of transferring human prior knowledge derived visual saliency in rectifying shortcut learning

preprint2022arXiv

Representing Brain Anatomical Regularity and Variability by Few-Shot Embedding

Effective representation of brain anatomical architecture is fundamental in understanding brain regularity and variability. Despite numerous efforts, it is still difficult to infer reliable anatomical correspondence at finer scale, given the tremendous individual variability in cortical folding patterns. It is even more challenging to disentangle common and individual patterns when comparing brains at different neuro-developmental stages. In this work, we developed a novel learning-based few-shot embedding framework to encode the cortical folding patterns into a latent space represented by a group of anatomically meaningful embedding vectors. Specifically, we adopted 3-hinge (3HG) network as the substrate and designed an autoencoder-based embedding framework to learn a common embedding vector for each 3HG's multi-hop feature: each 3HG can be represented as a combination of these feature embeddings via a set of individual specific coefficients to characterize individualized anatomical information. That is, the regularity of folding patterns is encoded into the embeddings, while the individual variations are preserved by the multi=hop combination coefficients. To effectively learn the embeddings for the population with very limited samples, few-shot learning was adopted. We applied our method on adult HCP and pediatric datasets with 1,000+ brains (from 34 gestational weeks to young adult). Our experimental results show that: 1) the learned embedding vectors can quantitatively encode the commonality and individuality of cortical folding patterns; 2) with the embeddings we can robustly infer the complicated many-to-many anatomical correspondences among different brains and 3) our model can be successfully transferred to new populations with very limited training samples.

preprint2022arXiv

Tracking the nematicity in cuprate superconductors: a resistivity study under uniaxial pressure

Overshadowing the superconducting dome in hole-doped cuprates, the pseudogap state is still one of the mysteries that no consensus can be achieved. It has been suggested that the rotational symmetry is broken in this state and may result in a nematic phase transition, whose temperature seems to coincide with the onset temperature of the pseudogap state $T^*$ around optimal doping level, raising the question whether the pseudogap results from the establishment of the nematic order. Here we report results of resistivity measurements under uniaxial pressure on several hole-doped cuprates, where the normalized slope of the elastoresistivity $ζ$ can be obtained as illustrated in iron-based superconductors. The temperature dependence of $ζ$ along particular lattice axis exhibits kink feature at $T_{k}$ and shows Curie-Weiss-like behavior above it, which may suggest a spontaneous nematic transition. While $T_{k}$ seems to be the same as $T^*$ around the optimal doping and in the overdoped region, they become very different in underdoped La$_{2-x}$Sr$_{x}$CuO$_4$. Our results suggest that the nematic order, if indeed existing, is an electronic phase within the pseudogap state.

preprint2021arXiv

Bayesian spectral density approach for identification and uncertainty quantification of bridge section's flutter derivatives operated in turbulent flow

This study presents a Bayesian spectral density approach for identification and uncertainty quantification of flutter derivatives of bridge sections utilizing buffeting displacement responses, where the wind tunnel test is conducted in turbulent flow. Different from traditional time-domain approaches (e.g., least square method and stochastic subspace identification), the newly-proposed approach is operated in frequency domain. Based on the affine invariant ensemble sampler algorithm, Markov chain Monte-Carlo sampling is employed to accomplish the Bayesian inference. The probability density function of flutter derivatives is modeled based on complex Wishart distribution, where probability serves as the measure. By the Bayesian spectral density approach, the most probable values and corresponding posterior distributions (namely identification uncertainty here) of each flutter derivative can be obtained at the same time. Firstly, numerical simulations are conducted and the identified results are accurate. Secondly, thin plate model, flutter derivatives of which have theoretical solutions, is chosen to be tested in turbulent flow for the sake of verification. The identified results of thin plate model are consistent with the theoretical solutions. Thirdly, the center-slotted girder model, which is widely-utilized long-span bridge sections in current engineering practice, is employed to investigate the applicability of the proposed approach on a general bridge section. For the center-slotted girder model, the flutter derivatives are also extracted by least square method in uniform flow to cross validate the newly-proposed approach. The identified results by two different approaches are compatible.

preprint2021arXiv

Evolution of Charge and Pair Density Modulations in Overdoped Bi2Sr2CuO6+delta

One of the central issues concerning the mechanism of high temperature superconductivity in cuprates is the nature of the ubiquitous charge order and its implications to superconductivity. Here we use scanning tunneling microscopy to investigate the evolution of charge order from the optimally doped to strongly overdoped Bi2Sr2CuO6+δ cuprates. We find that with increasing hole concentration, the long-range checkerboard order gradually evolves into short-range glassy patterns consisting of diluted charge puddles. Each charge puddle has a unidirectional nematic internal structure, and exhibits clear pair density modulations as revealed by the spatial variations of superconducting coherence peak and gap depth. Both the charge puddles and the nematicity vanish completely in the strongly overdoped non-superconducting regime, when another type of short-range order with root2 * root2 periodicity emerges. These results shed important new lights on the intricate interplay between the intertwined orders and the superconducting phase of cuprates.

preprint2021arXiv

Momentum-Resolved Visualization of Electronic Evolution in Doping a Mott Insulator

High temperature superconductivity in cuprates arises from doping a parent Mott insulator by electrons or holes. A central issue is how the Mott gap evolves and the low-energy states emerge with doping. Here we report angle-resolved photoemission spectroscopy measurements on a cuprate parent compound by sequential in situ electron doping. The chemical potential jumps to the bottom of the upper Hubbard band upon a slight electron doping, making it possible to directly visualize the charge transfer band and the full Mott gap region. With increasing doping, the Mott gap rapidly collapses due to the spectral weight transfer from the charge transfer band to the gapped region and the induced low-energy states emerge in a wide energy range inside the Mott gap. These results provide key information on the electronic evolution in doping a Mott insulator and establish a basis for developing microscopic theories for cuprate superconductivity.

preprint2021arXiv

Spectroscopic Evidence on Realization of a Genuine Topological Nodal Line Semimetal in LaSbTe

The nodal line semimetals have attracted much attention due to their unique topological electronic structure and exotic physical properties. A genuine nodal line semimetal is qualified by the presence of Dirac nodes along a line in the momentum space that are protected against the spin-orbit coupling. In addition, it requires that the Dirac points lie close to the Fermi level allowing to dictate the macroscopic physical properties. Although the material realization of nodal line semimetals have been theoretically predicted in numerous compounds, only a few of them have been experimentally verified and the realization of a genuine nodal line semimetal is particularly rare. Here we report the realization of a genuine nodal line semimetal in LaSbTe. We investigated the electronic structure of LaSbTe by band structure calculations and angle-resolved photoemission (ARPES) measurements. Taking spin-orbit coupling into account, our band structure calculations predict that a nodal line is formed in the boundary surface of the Brillouin zone which is robust and lies close to the Fermi level. The Dirac nodes along the X-R line in momentum space are directly observed in our ARPES measurements and the energies of these Dirac nodes are all close to the Fermi level. These results constitute clear evidence that LaSbTe is a genuine nodal line semimetal,providing a new platform to explore for novel phenomena and possible applications associated with the nodal line semimetals.

preprint2020arXiv

Direct Measurement of the Electronic Structure and band gap nature of atomic-layer-thick 2H-MoTe2

The millimeter sized monolayer and bilayer 2H-MoTe2 single crystal samples are prepared by a new mechanical exfoliation method. Based on such high-quality samples, we report the first direct electronic structure study on them, using standard high resolution angle-resolved photoemission spectroscopy (ARPES). A direct band gap of 0.924eV is found at K in the rubidium-doped monolayer MoTe2. Similar valence band alignment is also observed in bilayer MoTe2,supporting an assumption of a analogous direct gap semiconductor on it. Our measurements indicate a rather large band splitting of 212meV at the valence band maximum (VBM) in monolayer MoTe2, and the splitting is systematically enlarged with layer stacking, from monolayer to bilayer and to bulk. Meanwhile, our PBE band calculation on these materials show excellent agreement with ARPES results. Some fundamental electronic parameters are derived from the experimental and calculated electronic structures. Our findings lay a foundation for further application-related study on monolayer and bilayer MoTe2.

preprint2020arXiv

Electronic Evolution from the Parent Mott Insulator to a Superconductor in Lightly Hole-Doped Bi2Sr2CaCu2O8+delta

High temperature superconductivity in cuprates is realized by doping the Mott insulator with charge carriers. A central issue is how such an insulating state can evolve into a conducting or superconducting state when charge carriers are introduced. Here, by in situ vacuum annealing and Rb deposition on the Bi2Sr2Ca0.6Dy0.4Cu2O8+delta (Bi2212) sample surface to push its doping level continuously from deeply underdoped (Tc=25 K, doping level p-0.066) to the near zero doping parent Mott insulator, angle-resolved photoemission spectroscopy measurements are carried out to observe the detailed electronic structure evolution in lightly hole-doped region for the first time. Our results indicate that the chemical potential lies at about 1 eV above the charge transfer band for the parent state at zero doping which is quite close to the upper Hubbard band. With increasing hole doping, the chemical potential moves continuously towards the charge transfer band and the band structure evolution exhibits a rigid band shift-like behavior. When the chemical potential approaches the charge transfer band at a doping level of -0.05, the nodal spectral weight near the Fermi level increases, followed by the emergence of the coherent quasiparticle peak and the insulator-superconductor transition. Our observations provide key insights in understanding the insulator-superconductor transition in doping the parent cuprate compound and for establishing related theories.

preprint2020arXiv

Localization-aware Channel Pruning for Object Detection

Channel pruning is one of the important methods for deep model compression. Most of existing pruning methods mainly focus on classification. Few of them conduct systematic research on object detection. However, object detection is different from classification, which requires not only semantic information but also localization information. In this paper, based on discrimination-aware channel pruning (DCP) which is state-of-the-art pruning method for classification, we propose a localization-aware auxiliary network to find out the channels with key information for classification and regression so that we can conduct channel pruning directly for object detection, which saves lots of time and computing resources. In order to capture the localization information, we first design the auxiliary network with a contextual ROIAlign layer which can obtain precise localization information of the default boxes by pixel alignment and enlarges the receptive fields of the default boxes when pruning shallow layers. Then, we construct a loss function for object detection task which tends to keep the channels that contain the key information for classification and regression. Extensive experiments demonstrate the effectiveness of our method. On MS COCO, we prune 70\% parameters of the SSD based on ResNet-50 with modest accuracy drop, which outperforms the-state-of-art method.

preprint2020arXiv

Momentum Q-learning with Finite-Sample Convergence Guarantee

Existing studies indicate that momentum ideas in conventional optimization can be used to improve the performance of Q-learning algorithms. However, the finite-sample analysis for momentum-based Q-learning algorithms is only available for the tabular case without function approximations. This paper analyzes a class of momentum-based Q-learning algorithms with finite-sample guarantee. Specifically, we propose the MomentumQ algorithm, which integrates the Nesterov's and Polyak's momentum schemes, and generalizes the existing momentum-based Q-learning algorithms. For the infinite state-action space case, we establish the convergence guarantee for MomentumQ with linear function approximations and Markovian sampling. In particular, we characterize the finite-sample convergence rate which is provably faster than the vanilla Q-learning. This is the first finite-sample analysis for momentum-based Q-learning algorithms with function approximations. For the tabular case under synchronous sampling, we also obtain a finite-sample convergence rate that is slightly better than the SpeedyQ \citep{azar2011speedy} when choosing a special family of step sizes. Finally, we demonstrate through various experiments that the proposed MomentumQ outperforms other momentum-based Q-learning algorithms.

preprint2020arXiv

Neutron spin resonance in a quasi-two-dimensional iron-based superconductor

Magnetically mediated Cooper pairing is generally regarded as a key to establish the unified mechanism of unconventional superconductivity. One crucial evidence is the neutron spin resonance arising in the superconducting state, which is commonly interpreted as a spin-exciton from collective particle-hole excitations confined below the superconducting pair-breaking gap ($2Δ$). Here, on the basis of inelastic neutron scattering measurements on a quasi-two-dimensional iron-based superconductor KCa$_2$Fe$_4$As$_4$F$_2$, we have discovered a two-dimensional spin resonant mode with downward dispersions, a behavior closely resembling the low branch of the hour-glass-type spin resonance in cuprates. The resonant intensity is predominant by two broad incommensurate peaks near $Q=$(0.5, 0.5) with a sharp energy peak at $E_R=16$ meV. The overall energy dispersion of the mode exceeds the measured maximum total gap $Δ_{\rm tot}=|Δ_k|+|Δ_{k+Q}|$. These experimental results deeply challenge the conventional understanding of the resonance modes as magnetic excitons regardless of underlining pairing symmetry schemes, and it also points out that when the iron-based superconductivity becomes very quasi-two-dimensional, the electronic behaviors are similar to those in cuprates.

preprint2020arXiv

Simultaneous Generation of Direct- and Indirect-Gap Photoluminescence in Multilayer MoS2 Bubbles

Transition metal dichalcogenide (TMD) materials have received enormous attention due to their extraodinary optical and electrical properties, among which MoS2 is the most typical one. As thickness increases from monolayer to multilayer, the photoluminescence (PL) of MoS2 is gradually quenched due to the direct-to-indirect band gap transition. How to enhance PL response and decrease the layer dependence in multilayer MoS2 is still a challenging task. In this work, we report, for the first time, simultaneous generation of three PL peaks at around 1.3, 1.4 and 1.8 eV on multilayer MoS2 bubbles. The temperature dependent PL measurements indicate that the two peaks at 1.3 and 1.4 eV are phonon-assisted indirect-gap transitions while the peak at 1.8 eV is the direct-gap transition. Using first-principles calculations, the band structure evolution of multilayer MoS2 under strain is studied, from which the origin of the three PL peaks of MoS2 bubbles is further confirmed. Moreover, PL standing waves are observed in MoS2 bubbles that creates Newton-Ring-like patterns. This work demonstrates that the bubble structure may provide new opportunities for engineering the electronic structure and optical properties of layered materials.

preprint2020arXiv

Spectroscopic Evidence of Bilayer Splitting and Interlayer Pairing in an Iron Based Superconductor

In high temperature cuprate superconductors, the interlayer coupling between the CuO$_2$ planes plays an important role in dictating superconductivity, as indicated by the sensitive dependence of the critical temperature (T$_C$) on the number of CuO$_2$ planes in one structural unit. In Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$ superconductor with two CuO$_2$ planes in one structural unit, the interaction between the two CuO$_2$ planes gives rise to band splitting into two Fermi surface sheets (bilayer splitting) that have distinct superconducting gap. The iron based superconductors are composed of stacking of the FeAs/FeSe layers; whether the interlayer coupling can cause similar band splitting and its effect on superconductivity remain unclear. Here we report high resolution laser-based angle-resolved photoemission spectroscopy (ARPES) measurements on a newly discovered iron based superconductor, KCa$_2$Fe$_4$As$_4$F$_2$ (T$_C$=33.5\,K) which consists of stacking FeAs blocks with two FeAs layers separated by insulating Ca$_2$F$_2$ blocks. Bilayer splitting effect is observed for the first time that gives rise to totally five hole-like Fermi surface sheets around the Brilliouin zone center. Band structure calculations reproduce the observed bilayer splitting by identifying interlayer interorbital interaction between the two FeAs layers within one FeAs block. All the hole-like pockets around the zone center exhibit Fermi surface-dependent and nodeless superconducting gap. The gap functions with short-range antiferromagetic fluctuations are proposed and the gap symmetry can be well understood when the interlayer pairing is considered. The particularly strong interlayer pairing is observed for one of the bands. Our observations provide key information on the interlayer coupling and interlayer pairing in understanding superconductivity in iron based superconductors.

preprint2020arXiv

Study of pseudogap and superconducting quasiparticle dynamics in $\rm{Bi_2Sr_2CaCu_2O_{8+δ}}$ by time-resolved optical reflectivity

The relation between pseudogap (PG) and superconducting (SC) gap, whether PG is a precursor of SC or they coexist or compete, is a long-standing controversy in cuprate high-temperature supercondutors. Here, we report ultrafast time-resolved optical reflectivity investigation of the dynamic densities and relaxations of PG and SC quasiparticles (QPs) in the underdoped $\rm{Bi_2Sr_2CaCu_2O_{8+δ}}$ ($T_c$ = 82 K) single crystals. We find evidence of two distinct PG components in the positive reflectivity changes in the PG state, characterized by relaxation timescales of $τ_{fast}$ $\approx$ 0.2 ps and $τ_{slow}$ $\approx$ 2 ps with abrupt changes in both amplitudes $A_{fast}$ and $A_{slow}$ at the PG-opening temperature $T^*$. The former presents no obvious change at $T_c$ and coexists with the SC QP. The latter's amplitude starts decreasing at the SC phase fluctuation $T_p$ and vanishes at $T_c$ followed by a negative amplitude signifying the emergence of the SC QP, therefore suggesting a competition with superconductivity.

preprint2020arXiv

Universal mechanical exfoliation of large-area 2D crystals

Two-dimensional (2D) materials provide extraordinary opportunities for exploring phenomena arising in atomically thin crystals. Beginning with the first isolation of graphene, mechanical exfoliation has been a key to provide high-quality 2D materials but despite improvements it is still limited in yield, lateral size and contamination. Here we introduce a contamination-free, one-step and universal Au-assisted mechanical exfoliation method and demonstrate its effectiveness by isolating 40 types of single-crystalline monolayers, including elemental 2D crystals, metal-dichalcogenides, magnets and superconductors. Most of them are of millimeter-size and high-quality, as shown by transfer-free measurements of electron microscopy, photo spectroscopies and electrical transport. Large suspended 2D crystals and heterojunctions were also prepared with high-yield. Enhanced adhesion between the crystals and the substrates enables such efficient exfoliation, for which we identify a common rule that underpins a universal route for producing large-area monolayers and thus supports studies of fundamental properties and potential application of 2D materials.

preprint2019arXiv

Evidence for an Additional Symmetry Breaking from Direct Observation of Band Splitting in the Nematic State of FeSe Superconductor

The iron-based superconductor FeSe has attracted much recent attention because of its simple crystal structure, distinct electronic structure and rich physics exhibited by itself and its derivatives. Determination of its intrinsic electronic structure is crucial to understand its physical properties and superconductivity mechanism. Both theoretical and experimental studies so far have provided a picture that FeSe consists of one hole-like Fermi surface around the Brillouin zone center in its nematic state. Here we report direct observation of two hole-like Fermi surface sheets around the Brillouin zone center, and the splitting of the associated bands, in the nematic state of FeSe by taking high resolution laser-based angle-resolved photoemission measurements. These results indicate that, in addition to nematic order and spin-orbit coupling, there is an additional order in FeSe that breaks either inversion or time reversal symmetries. The new Fermi surface topology asks for reexamination of the existing theoretical and experimental understanding of FeSe and stimulates further efforts to identify the origin of the hidden order in its nematic state.

preprint2019arXiv

High Precision Determination of the Planck Constant by Modern Photoemission Spectroscopy

The Planck constant, with its mathematical symbol $h$, is a fundamental constant in quantum mechanics that is associated with the quantization of light and matter. It is also of fundamental importance to metrology, such as the definition of ohm and volt, and the latest definition of kilogram. One of the first measurements to determine the Planck constant is based on the photoelectric effect, however, the values thus obtained so far have exhibited a large uncertainty. The accepted value of the Planck constant, 6.62607015$\times$10$^{-34}$ J$\cdot$s, is obtained from one of the most precise methods, the Kibble balance, which involves quantum Hall effect, Josephson effect and the use of the International Prototype of the Kilogram (IPK) or its copies. Here we present a precise determination of the Planck constant by modern photoemission spectroscopy technique. Through the direct use of the Einstein's photoelectric equation, the Planck constant is determined by measuring accurately the energy position of the gold Fermi level using light sources with various photon wavelengths. The precision of the measured Planck constant, 6.62610(13)$\times$10$^{-34}$ J$\cdot$s, is four to five orders of magnitude improved from the previous photoelectric effect measurements. It has rendered photoemission method to become one of the most accurate methods in determining the Planck constant. We propose that this direct method of photoemission spectroscopy has advantages and a potential to further increase its measurement precision of the Planck constant to be comparable to the most accurate methods that are available at present.

preprint2019arXiv

Insulating Parent Phase and Distinct Doping Evolution to Superconductivity in Single-Layer FeSe/SrTiO3 Films

The single-layer FeSe/SrTiO3 (FeSe/STO) films have attracted much attention because of their simple crystal structure, distinct electronic structure and record high superconducting transition temperature (Tc). The origin of the dramatic Tc enhancement in single-layer FeSe/STO films and the dichotomy of superconductivity between single-layer and multiple-layer FeSe/STO films are still under debate. Here we report a comprehensive high resolution angle-resolved photoemission spectroscopy and scanning tunneling microscopy/spectroscopy measurements on the electronic structure evolution with doping in single-layer and multiple-layer FeSe/STO films. We find that the single-layer FeSe/STO films have a distinct parent phase and a route of doping evolution to superconductivity that are fundamentally different from multiple-layer FeSe/STO films. The parent phase of the single-layer FeSe/STO films is insulating, and its doping evolution is very similar to that of doping a Mott insulator in cuprate superconductors. In multiple-layer FeSe/STO films, high-Tc superconductivity occurs by suppressing the nematic order in the parent compound with electron doping. The single-layer FeSe/STO films represent the first clear case in the iron-based superconductors that the parent compound is an insulator. Our observations of the unique parent state and doping evolution in the single-layer FeSe/STO films provide key insight in understanding its record high-Tc superconductivity. They also provide a new route of realizing superconductivity in iron-based superconductors that is common in high temperature cuprate superconductors.

preprint2019arXiv

Selective Hybridization between Main Band and Superstructure Band in Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$ Superconductor

High-resolution laser-based angle-resolved photoemission measurements have been carried out on Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$ (Bi2212) and Bi$_2$Sr$_{2-x}$La$_x$CuO$_{6+δ}$ (Bi2201) superconductors. Unexpected hybridization between the main band and the superstructure band in Bi2212 is clearly revealed. In the momentum space where one main Fermi surface intersects with one superstructure Fermi surface, four bands are observed instead of two. The hybridization exists in both superconducting state and normal state, and in Bi2212 samples with different doping levels. Such a hybridization is not observed in Bi2201. This phenomenon can be understood by considering the bilayer splitting in Bi2212, the selective hybridization of two bands with peculiar combinations, and the altered matrix element effects of the hybridized bands. These observations provide strong evidence on the origin of the superstructure band which is intrinsic to the CuO$_2$ planes. Therefore, understanding physical properties and superconductivity mechanism in Bi2212 should consider the complete Fermi surface topology which involves the main bands, the superstructure bands and their interactions.