Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
70works
0followers
41topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

70 published item(s)

preprint2026arXiv

Breaking Dual Bottlenecks: Evolving Unified Multimodal Models into Self-Adaptive Interleaved Visual Reasoners

Recent unified models integrate multimodal understanding and generation within a single framework. However, an "understanding-generation gap" persists, where models can capture user intent but often fail to translate this semantic knowledge into precise pixel-level manipulation. This gap results in two bottlenecks in anything-to-image task (X2I): the attention entanglement bottleneck, where blind planning struggles with complex prompts, and the visual refinement bottleneck, where unstructured feedback fails to correct imperfections efficiently. In this paper, we propose a novel framework that empowers unified models to autonomously switch between generation strategies based on instruction complexity and model capability. To achieve this, we construct a hierarchical data pipeline that constructs execution paths across three adaptive modes: direct generation for simple cases, self-reflection for quality refinement, and multi-step planning for decomposing complex scenarios. Building on this pipeline, we contribute a high-quality dataset with over 50,000 samples and implement a two-stage training strategy comprising SFT and RL. Specifically, we design step-wise reasoning rewards to ensure logical consistency and intra-group complexity penalty to prevent redundant computational overhead. Extensive experiments demonstrate that our method outperforms existing baselines on X2I, achieving superior generation fidelity among simple-to-complex instructions. The code is released at https://github.com/WeChatCV/Interleaved_Visual_Reasoner.

preprint2026arXiv

Continuous Latent Diffusion Language Model

Large language models have achieved remarkable success under the autoregressive paradigm, yet high-quality text generation need not be tied to a fixed left-to-right order. Existing alternatives still struggle to jointly achieve generation efficiency, scalable representation learning, and effective global semantic modeling. We propose Cola DLM, a hierarchical latent diffusion language model that frames text generation through hierarchical information decomposition. Cola DLM first learns a stable text-to-latent mapping with a Text VAE, then models a global semantic prior in continuous latent space with a block-causal DiT, and finally generates text through conditional decoding. From a unified Markov-path perspective, its diffusion process performs latent prior transport rather than token-level observation recovery, thereby separating global semantic organization from local textual realization. This design yields a more flexible non-autoregressive inductive bias, supports semantic compression and prior fitting in continuous space, and naturally extends to other continuous modalities. Through experiments spanning 4 research questions, 8 benchmarks, strictly matched ~2B-parameter autoregressive and LLaDA baselines, and scaling curves up to about 2000 EFLOPs, we identify an effective overall configuration of Cola DLM and verify its strong scaling behavior for text generation. Taken together, the results establish hierarchical continuous latent prior modeling as a principled alternative to strictly token-level language modeling, where generation quality and scaling behavior may better reflect model capability than likelihood, while also suggesting a concrete path toward unified modeling across discrete text and continuous modalities.

preprint2026arXiv

Forbidden second harmonics in centrosymmetric bilayer crystals

Optical spectroscopy based on second-order nonlinearity is a critical technique for characterizing two-dimensional (2D) crystals as well as bioimaging and quantum optics. It is generally believed that second-harmonic generation (SHG) in centrosymmetric crystals, such as graphene and other bilayer 2D crystals, is negligible without externally breaking the inversion symmetry. Here, we show that with a new homodyne detection technique, we can apparently circumvent this symmetry-imposed constraint and observe robust SHG in pristine centrosymmetric crystals, without any symmetry-breaking field. With its exceptional sensitivity, we resolve polarization-resolved SHG in bilayer hexagonal boron nitride (h-BN), bilayer 2H-WSe$_2$, and remarkably, Bernal-stacked bilayer graphene, allowing us to unambiguously identify the crystallographic orientation in these crystals via SHG for the first time. We also demonstrate that the new technique can be used to non-invasively detect uniaxial strain and optical geometric phase in these crystals. The observed SHG in our experiments is attributed to second-order nonlinearity in the quadrupole channel, which is controlled by the presence of the $C_2$ symmetry instead of the inversion symmetry. Our new technique expands the capability of nonlinear optical spectroscopy to encompass a large class of centrosymmetric materials that could never be measured before, and can be used for quantum sensing of moiré materials and twisted epitaxial films.

preprint2026arXiv

GeoReason: Aligning Thinking And Answering In Remote Sensing Vision-Language Models Via Logical Consistency Reinforcement Learning

The evolution of Remote Sensing Vision-Language Models(RS-VLMs) emphasizes the importance of transitioning from perception-centric recognition toward high-level deductive reasoning to enhance cognitive reliability in complex spatial tasks. However, current models often suffer from logical hallucinations, where correct answers are derived from flawed reasoning chains or rely on positional shortcuts rather than spatial logic. This decoupling undermines reliability in strategic spatial decision-making. To address this, we present GeoReason, a framework designed to synchronize internal thinking with final decisions. We first construct GeoReason-Bench, a logic-driven dataset containing 4,000 reasoning trajectories synthesized from geometric primitives and expert knowledge. We then formulate a two-stage training strategy: (1) Supervised Knowledge Initialization to equip the model with reasoning syntax and domain expertise, and (2) Consistency-Aware Reinforcement Learning to refine deductive reliability. This second stage integrates a novel Logical Consistency Reward, which penalizes logical drift via an option permutation strategy to anchor decisions in verifiable reasoning traces. Experimental results demonstrate that our framework significantly enhances the cognitive reliability and interpretability of RS-VLMs, achieving state-of-the-art performance compared to other advanced methods.

preprint2026arXiv

PILIR: Physics-Informed Local Implicit Representation

Physics-Informed Neural Networks have become a powerful mesh-free method for solving partial differential equations, but their performance is often limited by spectral bias. Specifically, in standard MLPs used in PINNs, the global parameter coupling causes the model to prioritize learning low-frequency components, resulting in slow convergence for high-frequency details. To overcome this limitation, we introduce the Physics-Informed Local Implicit Representation (PILIR). Our approach separates the global physical domain into a discrete latent feature space and a continuous generative decoder. By using a learnable grid to encode explicit spatial locality, PILIR can capture high-frequency details locally, preventing dilution by global patterns. A generative neural operator then synthesizes these local latent features into continuous physical fields, allowing accurate reconstruction of fine-scale structures. Experiments on a range of challenging PDEs show that PILIR effectively mitigates spectral bias, thereby boosting the convergence of high-frequency details and achieving superior accuracy compared to state-of-the-art methods.

preprint2026arXiv

SLGNet: Synergizing Structural Priors and Language-Guided Modulation for Multimodal Object Detection

Multimodal object detection leveraging RGB and Infrared (IR) images is pivotal for robust perception in all-weather scenarios. While recent adapter-based approaches efficiently transfer RGB-pretrained foundation models to this task, they often prioritize model efficiency at the expense of cross-modal structural consistency. Consequently, critical structural cues are frequently lost when significant domain gaps arise, such as in high-contrast or nighttime environments. Moreover, conventional static multimodal fusion mechanisms typically lack environmental awareness, resulting in suboptimal adaptation and constrained detection performance under complex, dynamic scene variations. To address these limitations, we propose SLGNet, a parameter-efficient framework that synergizes hierarchical structural priors and language-guided modulation within a frozen Vision Transformer (ViT)-based foundation model. Specifically, we design a Structure-Aware Adapter to extract hierarchical structural representations from both modalities and dynamically inject them into the ViT to compensate for structural degradation inherent in ViT-based backbones. Furthermore, we propose a Language-Guided Modulation module that exploits VLM-driven structured captions to dynamically recalibrate visual features, thereby endowing the model with robust environmental awareness. Extensive experiments on the LLVIP, FLIR, KAIST, and DroneVehicle datasets demonstrate that SLGNet establishes new state-of-the-art performance. Notably, on the LLVIP benchmark, our method achieves an mAP of 66.1, while reducing trainable parameters by approximately 87% compared to traditional full fine-tuning. This confirms SLGNet as a robust and efficient solution for multimodal perception.

preprint2026arXiv

SRAW-Attack: Space-Reweighted Adversarial Warping Attack for SAR Target Recognition

Synthetic aperture radar (SAR) imagery exhibits intrinsic information sparsity due to its unique electromagnetic scattering mechanism. Despite the widespread adoption of deep neural network (DNN)-based SAR automatic target recognition (SAR-ATR) systems, they remain vulnerable to adversarial examples and tend to over-rely on background regions, leading to degraded adversarial robustness. Existing adversarial attacks for SAR-ATR often require visually perceptible distortions to achieve effective performance, thereby necessitating an attack method that balances effectiveness and stealthiness. In this paper, a novel attack method termed Space-Reweighted Adversarial Warping (SRAW) is proposed, which generates adversarial examples through optimized spatial deformation with reweighted budgets across foreground and background regions. Extensive experiments demonstrate that SRAW significantly degrades the performance of state-of-the-art SAR-ATR models and consistently outperforms existing methods in terms of imperceptibility and adversarial transferability. Code is made available at https://github.com/boremycin/SAR-ATR-TransAttack.

preprint2026arXiv

Video Generation with Predictive Latents

Video Variational Autoencoder (VAE) enables latent video generative modeling by mapping the visual world into compact spatiotemporal latent spaces, improving training efficiency and stability. While existing video VAEs achieve commendable reconstruction quality, continued optimization of reconstruction does not necessarily translate into improved generative performance. How to enhance the diffusability of video latents remains a critical and unresolved challenge. In this work, inspired by principles of predictive world modeling, we investigate the potential of predictive learning to improve the video generative modeling. To this end, we introduce a simple and effective predictive reconstruction objective that unifies predictive learning with video reconstruction. Specifically, we randomly discard future frames and encode only partial past observations, while training the decoder to reconstruct the observed frames and predict future ones simultaneously. This design encourages the latent space to encode temporally predictive structures and build a more coherent understanding of video dynamics, thereby improving generation quality. Our model, termed Predictive Video VAE (PV-VAE), achieves superior performance on video generation, with 52% faster convergence and a 34.42 FVD improvement over the Wan2.2 VAE on UCF101. Furthermore, comprehensive analyses demonstrate that PV-VAE not only exhibits favorable scalability, with generative performance improving alongside VAE training, but also yields consistent gains in downstream video understanding, underscoring a latent space that effectively captures temporal coherence and motion priors.

preprint2025arXiv

Freezing and ice aging dynamics in saline water under natural convection

Understanding the coupled dynamics of liquid-solid phase change and fluid flows is crucial in a wide range of geophysical and industrial applications. When freezing occurs in saline water, the newly formed ice is mushy, with a porous structure that traps the brine within the ice. In this work, which combines experiments and theoretical analyses, we investigate the long-term evolution of saline ice, comprehensively accounting for the coupled dynamics of multiscale fluid flow, heat and mass transfer, and phase change. We show that in a closed convective system the rapid formation of a mushy ice layer is followed by desalination (i.e, the expulsion of salt from the ice) processes that might lead to a slow asymptotic decrease of the ice thickness. Desalination of mushy ice reduces its porosity, which alters the dynamic thermal equilibrium and ice thickness by weakening buoyancy-driven convection within the mushy layer. In turn, changes in brine convection and ice thickness affect the further desalination of the ice. The long-term dynamics of the system can be accurately predicted by a one-dimensional model based on appropriate parameterizations of global heat and mass transfer properties. Furthermore, within the same theoretical model we explore the ice dynamics across a broader parameter space. Our findings advance the understanding of the coupled phase-change physics of saline solutions in the presence of convective fluid flows and provide a basis for explaining and predicting real-world phenomena such as the aging of sea ice.

preprint2025arXiv

Marangoni-driven freezing dynamics of supercooled binary droplets

Solidification of droplets is of great importance to various technological applications, drawing considerable attention from scientists aiming to unravel the fundamental physical mechanisms. In the case of multicomponent droplets undergoing solidification, the emergence of concentration gradients may trigger significant interfacial flows that dominate the freezing dynamics. Here, we experimentally investigate the fascinating interfacial freezing dynamics of supercooled ethanol-water droplets, accompanied with the migration and growth of massive ice particles. We reveal that these unique freezing dynamics are driven by solidification-induced solutal Marangoni flow within the droplets. Our model, which incorporates the temperature- and concentration-dependent properties of the ethanol-water mixture, quantitatively predicts both the migration velocity and the growth rate of the ice particles. The former is determined by the solutal Marangoni flow velocity, while the latter is governed by a balance between the latent heat release and the enhanced thermal dissipation by the Marangoni flow. Moreover, we show that the final wrapping state of droplets can be modulated by the concentration of ethanol. Our findings may pave the way for novel insights into the physicochemical hydrodynamics of multicomponent liquids undergoing phase transitions.

preprint2024arXiv

ClassWise-SAM-Adapter: Parameter Efficient Fine-tuning Adapts Segment Anything to SAR Domain for Semantic Segmentation

In the realm of artificial intelligence, the emergence of foundation models, backed by high computing capabilities and extensive data, has been revolutionary. Segment Anything Model (SAM), built on the Vision Transformer (ViT) model with millions of parameters and vast training dataset SA-1B, excels in various segmentation scenarios relying on its significance of semantic information and generalization ability. Such achievement of visual foundation model stimulates continuous researches on specific downstream tasks in computer vision. The ClassWise-SAM-Adapter (CWSAM) is designed to adapt the high-performing SAM for landcover classification on space-borne Synthetic Aperture Radar (SAR) images. The proposed CWSAM freezes most of SAM's parameters and incorporates lightweight adapters for parameter efficient fine-tuning, and a classwise mask decoder is designed to achieve semantic segmentation task. This adapt-tuning method allows for efficient landcover classification of SAR images, balancing the accuracy with computational demand. In addition, the task specific input module injects low frequency information of SAR images by MLP-based layers to improve the model performance. Compared to conventional state-of-the-art semantic segmentation algorithms by extensive experiments, CWSAM showcases enhanced performance with fewer computing resources, highlighting the potential of leveraging foundational models like SAM for specific downstream tasks in the SAR domain. The source code is available at: https://github.com/xypu98/CWSAM.

preprint2023arXiv

Phason-mediated interlayer exciton diffusion in WS2/WSe2 moiré heterostructure

Moiré potentials in two-dimensional materials have been proven to be of fundamental importance to fully understand the electronic structure of van der Waals heterostructures, from superconductivity to correlated excitonic states. However, understanding how the moiré phonons, so-called phasons, affect the properties of the system still remains an uncharted territory. In this work, we demonstrate how phasons are integral to properly describing and understanding low-temperature interlayer exciton diffusion in WS2/WSe2 heterostructure. We perform photoluminescence (PL) spectroscopy to understand how the coupling between the layers, affected by their relative orientation, impacts the excitonic properties of the system. Samples fabricated with stacking angles of 0° and 60° are investigated taking into account the stacking angle dependence of the two common moiré potential profiles. Additionally, we present spatially and time-resolved exciton diffusion measurements, looking at the photoluminescence emission in a temperature range from 30 K to 250 K. An accurate potential for the two configurations are computed via density functional theory (DFT) calculations. Finally, we perform molecular dynamics simulation in order to visualize the phasons motion, estimating the phason speed at different temperatures, providing novel insights into the mechanics of exciton propagation at low temperatures that cannot be explained within the frame of classical exciton diffusion alone.

preprint2023arXiv

Super Sparse 3D Object Detection

As the perception range of LiDAR expands, LiDAR-based 3D object detection contributes ever-increasingly to the long-range perception in autonomous driving. Mainstream 3D object detectors often build dense feature maps, where the cost is quadratic to the perception range, making them hardly scale up to the long-range settings. To enable efficient long-range detection, we first propose a fully sparse object detector termed FSD. FSD is built upon the general sparse voxel encoder and a novel sparse instance recognition (SIR) module. SIR groups the points into instances and applies highly-efficient instance-wise feature extraction. The instance-wise grouping sidesteps the issue of the center feature missing, which hinders the design of the fully sparse architecture. To further enjoy the benefit of fully sparse characteristic, we leverage temporal information to remove data redundancy and propose a super sparse detector named FSD++. FSD++ first generates residual points, which indicate the point changes between consecutive frames. The residual points, along with a few previous foreground points, form the super sparse input data, greatly reducing data redundancy and computational overhead. We comprehensively analyze our method on the large-scale Waymo Open Dataset, and state-of-the-art performance is reported. To showcase the superiority of our method in long-range detection, we also conduct experiments on Argoverse 2 Dataset, where the perception range ($200m$) is much larger than Waymo Open Dataset ($75m$). Code is open-sourced at https://github.com/tusen-ai/SST.

preprint2023arXiv

Towards A Unified Conformer Structure: from ASR to ASV Task

Transformer has achieved extraordinary performance in Natural Language Processing and Computer Vision tasks thanks to its powerful self-attention mechanism, and its variant Conformer has become a state-of-the-art architecture in the field of Automatic Speech Recognition (ASR). However, the main-stream architecture for Automatic Speaker Verification (ASV) is convolutional Neural Networks, and there is still much room for research on the Conformer based ASV. In this paper, firstly, we modify the Conformer architecture from ASR to ASV with very minor changes. Length-Scaled Attention (LSA) method and Sharpness-Aware Minimizationis (SAM) are adopted to improve model generalization. Experiments conducted on VoxCeleb and CN-Celeb show that our Conformer based ASV achieves competitive performance compared with the popular ECAPA-TDNN. Secondly, inspired by the transfer learning strategy, ASV Conformer is natural to be initialized from the pretrained ASR model. Via parameter transferring, self-attention mechanism could better focus on the relationship between sequence features, brings about 11% relative improvement in EER on test set of VoxCeleb and CN-Celeb, which reveals the potential of Conformer to unify ASV and ASR task. Finally, we provide a runtime in ASV-Subtools to evaluate its inference speed in production scenario. Our code is released at https://github.com/Snowdar/asv-subtools/tree/master/doc/papers/conformer.md.

preprint2022arXiv

A Robust Hot Subdwarfs Identification Method Based on Deep Learning

Hot subdwarf star is a particular type of star that is crucial for studying binary evolution and atmospheric diffusion processes. In recent years, identifying Hot subdwarfs by machine learning methods has become a hot topic, but there are still limitations in automation and accuracy. In this paper, we proposed a robust identification method based on the convolutional neural network (CNN). We first constructed the dataset using the spectral data of LAMOS DR7-V1. We then constructed a hybrid recognition model including an 8-class classification model and a binary classification model. The model achieved an accuracy of 96.17% on the testing set. To further validate the accuracy of the model, we selected 835 Hot subdwarfs that were not involved in the training process from the identified LAMOST catalog (2428, including repeated observations) as the validation set. An accuracy of 96.05% was achieved. On this basis, we used the model to filter and classify all 10,640,255 spectra of LAMOST DR7-V1, and obtained a catalog of 2393 Hot subdwarf candidates, of which 2067 have been confirmed. We found 25 new Hot subdwarfs among the remaining candidates by manual validation. The overall accuracy of the model is 87.42%. Overall, the model presented in this study can effectively identify specific spectra with robust results and high accuracy, and can be further applied to the classification of large-scale spectra and the search of specific targets.

preprint2022arXiv

Adaptive Algorithm for Quantum Amplitude Estimation

Quantum amplitude estimation is a key sub-routine of a number of quantum algorithms with various applications. We propose an adaptive algorithm for interval estimation of amplitudes. The quantum part of the algorithm is based only on Grover's algorithm. The key ingredient is the introduction of an adjustment factor, which adjusts the amplitude of good states such that the amplitude after the adjustment, and the original amplitude, can be estimated without ambiguity in the subsequent step. We show with numerical studies that the proposed algorithm uses a similar number of quantum queries to achieve the same level of precision $ε$ compared to state-of-the-art algorithms, but the classical part, i.e., the non-quantum part, has substantially lower computational complexity. We rigorously prove that the number of oracle queries achieves $O(1/ε)$, i.e., a quadratic speedup over classical Monte Carlo sampling, and the computational complexity of the classical part achieves $O(\log(1/ε))$, both up to a double-logarithmic factor.

preprint2022arXiv

Amplify-and-Forward Relaying for Hierarchical Over-the-Air Computation

This paper studies a hierarchical over-the-air computation (AirComp) network over a large area, in which multiple relays are exploited to facilitate data aggregation from massive WDs. We present a two-phase amplify-and-forward (AF) relaying protocol. In the first phase, the WDs simultaneously send their data to the relays, while in the second phase, the relays amplify the respectively received signals and concurrently forward them to the fusion center (FC) for aggregation. Our objective is to minimize the computational mean squared error (MSE) at the FC, by jointly optimizing the WD transmit coefficients, the relay AF coefficients, and the FC de-noising factor, subject to their individual transmit power constraints. First, we consider the centralized design with global channel state information (CSI), in which the inter-relay signals can be exploited beneficially for data aggregation. In this case, we develop an alternating-optimization-based algorithm to obtain a high-quality solution to the computational MSE minimization problem. Next, to reduce the signaling overhead caused by the centralized design, we consider an alternative decentralized design with partial CSI, in which the relays and the FC make their own decisions by only requiring the channel power gain information across different relays. In this case, the relays and FC need to treat the inter-relay signals as harmful interference or noise. Accordingly, we optimize the transmit coefficients of the WDs associated with each relay, and the relay AF coefficients (together with the FC de-noising factor) in an iterative manner, which can be implemented efficiently in a decentralized way.

preprint2022arXiv

An extended mixed finite element method for elliptic interface problems

In this paper, we propose an extended mixed finite element method for elliptic interface problems. By adding some stabilization terms, we present a mixed approximation form based on Brezzi-Douglas-Marini element space and the piecewise constant function space, and show that the discrete inf-sup constant is independent of how the interface intersects the triangulation. Furthermore, we derive that the optimal convergence holds independent of the location of the interface relative to the mesh. Finally, some numerical examples are presented to verify our theoretical results.

preprint2022arXiv

Charge transfer dynamics in MoSe$_{2}$/hBN/WSe$_{2}$ heterostructures

Ultrafast charge transfer processes provide a facile way to create interlayer excitons in directly contacted transition metal dichalcogenide (TMD) layers. More sophisticated heterostructures composed of TMD/hBN/TMD enable new ways to control interlayer exciton properties and achieve novel exciton phenomena, such as exciton insulators and condensates, where longer lifetimes are desired. In this work, we experimentally study the charge transfer dynamics in a heterostructure composed of a 1 nm thick hBN spacer between MoSe$_{2}$ and WSe$_{2}$ monolayers. We observe the hole transfer from MoSe$_{2}$ to WSe$_{2}$ through the hBN barrier with a time constant of 500 ps, which is over 3 orders of magnitude slower than that between TMD layers without a spacer. Furthermore, we observe strong competition between the interlayer charge transfer and intralayer exciton-exciton annihilation processes at high excitation densities. Our work opens possibilities to understand charge transfer pathways in TMD/hBN/TMD heterostructures for the efficient generation and control of interlayer excitons.

preprint2022arXiv

Communication-Efficient and Privacy-Preserving Feature-based Federated Transfer Learning

Federated learning has attracted growing interest as it preserves the clients' privacy. As a variant of federated learning, federated transfer learning utilizes the knowledge from similar tasks and thus has also been intensively studied. However, due to the limited radio spectrum, the communication efficiency of federated learning via wireless links is critical since some tasks may require thousands of Terabytes of uplink payload. In order to improve the communication efficiency, we in this paper propose the feature-based federated transfer learning as an innovative approach to reduce the uplink payload by more than five orders of magnitude compared to that of existing approaches. We first introduce the system design in which the extracted features and outputs are uploaded instead of parameter updates, and then determine the required payload with this approach and provide comparisons with the existing approaches. Subsequently, we analyze the random shuffling scheme that preserves the clients' privacy. Finally, we evaluate the performance of the proposed learning scheme via experiments on an image classification task to show its effectiveness.

preprint2022arXiv

CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation

Recent advances in self-supervised contrastive learning yield good image-level representation, which favors classification tasks but usually neglects pixel-level detailed information, leading to unsatisfactory transfer performance to dense prediction tasks such as semantic segmentation. In this work, we propose a pixel-wise contrastive learning method called CP2 (Copy-Paste Contrastive Pretraining), which facilitates both image- and pixel-level representation learning and therefore is more suitable for downstream dense prediction tasks. In detail, we copy-paste a random crop from an image (the foreground) onto different background images and pretrain a semantic segmentation model with the objective of 1) distinguishing the foreground pixels from the background pixels, and 2) identifying the composed images that share the same foreground.Experiments show the strong performance of CP2 in downstream semantic segmentation: By finetuning CP2 pretrained models on PASCAL VOC 2012, we obtain 78.6% mIoU with a ResNet-50 and 79.5% with a ViT-S.

preprint2022arXiv

DePS: An improved deep learning model for de novo peptide sequencing

De novo peptide sequencing from mass spectrometry data is an important method for protein identification. Recently, various deep learning approaches were applied for de novo peptide sequencing and DeepNovoV2 is one of the represetative models. In this study, we proposed an enhanced model, DePS, which can improve the accuracy of de novo peptide sequencing even with missing signal peaks or large number of noisy peaks in tandem mass spectrometry data. It is showed that, for the same test set of DeepNovoV2, the DePS model achieved excellent results of 74.22%, 74.21% and 41.68% for amino acid recall, amino acid precision and peptide recall respectively. Furthermore, the results suggested that DePS outperforms DeepNovoV2 on the cross species dataset.

preprint2022arXiv

Device-Cloud Collaborative Recommendation via Meta Controller

On-device machine learning enables the lightweight deployment of recommendation models in local clients, which reduces the burden of the cloud-based recommenders and simultaneously incorporates more real-time user features. Nevertheless, the cloud-based recommendation in the industry is still very important considering its powerful model capacity and the efficient candidate generation from the billion-scale item pool. Previous attempts to integrate the merits of both paradigms mainly resort to a sequential mechanism, which builds the on-device recommender on top of the cloud-based recommendation. However, such a design is inflexible when user interests dramatically change: the on-device model is stuck by the limited item cache while the cloud-based recommendation based on the large item pool do not respond without the new re-fresh feedback. To overcome this issue, we propose a meta controller to dynamically manage the collaboration between the on-device recommender and the cloud-based recommender, and introduce a novel efficient sample construction from the causal perspective to solve the dataset absence issue of meta controller. On the basis of the counterfactual samples and the extended training, extensive experiments in the industrial recommendation scenarios show the promise of meta controller in the device-cloud collaboration.

preprint2022arXiv

Edge-Cloud Polarization and Collaboration: A Comprehensive Survey for AI

Influenced by the great success of deep learning via cloud computing and the rapid development of edge chips, research in artificial intelligence (AI) has shifted to both of the computing paradigms, i.e., cloud computing and edge computing. In recent years, we have witnessed significant progress in developing more advanced AI models on cloud servers that surpass traditional deep learning models owing to model innovations (e.g., Transformers, Pretrained families), explosion of training data and soaring computing capabilities. However, edge computing, especially edge and cloud collaborative computing, are still in its infancy to announce their success due to the resource-constrained IoT scenarios with very limited algorithms deployed. In this survey, we conduct a systematic review for both cloud and edge AI. Specifically, we are the first to set up the collaborative learning mechanism for cloud and edge modeling with a thorough review of the architectures that enable such mechanism. We also discuss potentials and practical experiences of some on-going advanced edge AI topics including pretraining models, graph neural networks and reinforcement learning. Finally, we discuss the promising directions and challenges in this field.

preprint2022arXiv

Imaging Quantum Interference in Stadium-Shaped Monolayer and Bilayer Graphene Quantum Dots

Experimental realization of graphene-based stadium-shaped quantum dots (QDs) have been few and incompatible with scanned probe microscopy. Yet, direct visualization of electronic states within these QDs is crucial for determining the existence of quantum chaos in these systems. We report the fabrication and characterization of electrostatically defined stadium-shaped QDs in heterostructure devices composed of monolayer graphene (MLG) and bilayer graphene (BLG). To realize a stadium-shaped QD, we utilized the tip of a scanning tunneling microscope to charge defects in a supporting hexagonal boron nitride flake. The stadium states visualized are consistent with tight-binding-based simulations, but lack clear quantum chaos signatures. The absence of quantum chaos features in MLG-based stadium QDs is attributed to the leaky nature of the confinement potential due to Klein tunneling. In contrast, for BLG-based stadium QDs (which have stronger confinement) quantum chaos is precluded by the smooth confinement potential which reduces interference and mixing between states.

preprint2022arXiv

Nature of novel moiré exciton states in WSe$_2$/WS$_2$ heterobilayers

Moiré patterns of transition metal dichalcogenide (TMD) heterobilayers have proven to be an ideal platform to host unusual correlated electronic phases, emerging magnetism, and correlated exciton physics. While the existence of novel moiré excitonic states is established through optical measurements, the microscopic nature of these states is still poorly understood, often relying on empirically fit models. Here, combining large-scale first-principles GW-BSE calculations and micro-reflection spectroscopy, we identify the nature of the exciton resonances in WSe$_2$/WS$_2$ moiré superlattices, discovering a surprisingly rich set of moiré excitons that cannot be even qualitatively captured by prevailing continuum models. Our calculations reveal moiré excitons with distinct characters, including modulated Wannier excitons and previously unindentified intralayer charge-transfer excitons. Signatures of these distinct excitonic characters are confirmed experimentally via the unique carrier-density and magnetic-field dependences of different moiré exciton resonances. Our study highlights the highly non-trivial exciton states that can emerge in TMD moiré superlattices, and suggests novel ways of tuning many-body physics in moiré systems by engineering excited-states with specific spatial characters.

preprint2022arXiv

Optimized Design for IRS-Assisted Integrated Sensing and Communication Systems in Clutter Environments

In this paper, we investigate an intelligent reflecting surface (IRS)-assisted integrated sensing and communication (ISAC) system design in a clutter environment. Assisted by an IRS equipped with a uniform linear array (ULA), a multi-antenna base station (BS) is targeted for communicating with multiple communication users (CUs) and sensing multiple targets simultaneously. We consider the IRS-assisted ISAC design in the case with Type-I or Type-II CUs, where each Type-I and Type-II CU can and cannot cancel the interference from sensing signals, respectively. In particular, we aim to maximize the minimum sensing beampattern gain among multiple targets, by jointly optimizing the BS transmit beamforming vectors and the IRS phase shifting matrix, subject to the signal-to-interference-plus-noise ratio (SINR) constraint for each Type-I/Type-II CU, the interference power constraint per clutter, the transmission power constraint at the BS, and the cross-correlation pattern constraint. Due to the coupling of the BS's transmit design variables and the IRS's phase shifting matrix, the formulated max-min IRS-assisted ISAC design problem in the case with Type-I/Type-II CUs is highly non-convex. As such, we propose an efficient algorithm based on the alternating-optimization and semi-definite relaxation (SDR) techniques. In the case with Type-I CUs, we show that the dedicated sensing signal at the BS is always beneficial to improve the sensing performance. By contrast, the dedicated sensing signal at the BS is not required in the case with Type-II CUs. Numerical results are provided to show that the proposed IRS-assisted ISAC design schemes achieve a significant gain over the existing benchmark schemes.

preprint2022arXiv

Performance evaluation of baseline-dependent averaging based onfull-scale SKA1-LOW simulation

The Square Kilometre Array (SKA) is the largest radio interferometer under construction in the world. Its immense amount of visibility data poses a considerable challenge to the subsequent processing by the science data processor (SDP). Baseline dependent averaging (BDA), which reduces the amount of visibility data based on the baseline distribution of the radio interferometer, has become a focus of SKA SDP development. This paper developed and implemented a full-featured BDA module based on Radio Astronomy Simulation, Calibration and Imaging Library (RASCIL). Simulated observations were then performed with RASCIL based on a full-scale SKA1-LOW configuration. The performance of the BDA was systematically investigated and evaluated based on the simulated data. The experimental results presented that the amount of visibility data is reduced by about 50\% to 85\% for different time intervals ($Δt_{max}$). In addition, different $Δt_{max}$ have a significant effect on the imaging quality. The smaller the $Δt_{max}$, the smaller the degradation of the imaging quality.

preprint2022arXiv

Personalizing Intervened Network for Long-tailed Sequential User Behavior Modeling

In an era of information explosion, recommendation systems play an important role in people's daily life by facilitating content exploration. It is known that user activeness, i.e., number of behaviors, tends to follow a long-tail distribution, where the majority of users are with low activeness. In practice, we observe that tail users suffer from significantly lower-quality recommendation than the head users after joint training. We further identify that a model trained on tail users separately still achieve inferior results due to limited data. Though long-tail distributions are ubiquitous in recommendation systems, improving the recommendation performance on the tail users still remains challenge in both research and industry. Directly applying related methods on long-tail distribution might be at risk of hurting the experience of head users, which is less affordable since a small portion of head users with high activeness contribute a considerate portion of platform revenue. In this paper, we propose a novel approach that significantly improves the recommendation performance of the tail users while achieving at least comparable performance for the head users over the base model. The essence of this approach is a novel Gradient Aggregation technique that learns common knowledge shared by all users into a backbone model, followed by separate plugin prediction networks for the head users and the tail users personalization. As for common knowledge learning, we leverage the backward adjustment from the causality theory for deconfounding the gradient estimation and thus shielding off the backbone training from the confounder, i.e., user activeness. We conduct extensive experiments on two public recommendation benchmark datasets and a large-scale industrial datasets collected from the Alipay platform. Empirical studies validate the rationality and effectiveness of our approach.

preprint2022arXiv

RFI Identification Based On Deep-Learning]{A Robust RFI Identification For Radio Interferometry based on a Convolutional Neural Network

The rapid development of new generation radio interferometers such as the Square Kilometer Array (SKA) has opened up unprecedented opportunities for astronomical research. However, anthropogenic Radio Frequency Interference (RFI) from communication technologies and other human activities severely affects the fidelity of observational data. It also significantly reduces the sensitivity of the telescopes. We proposed a robust Convolutional Neural Network (CNN) model to identify RFI based on machine learning methods. We overlaid RFI on the simulation data of SKA1-LOW to construct three visibility function datasets. One dataset was used for modeling, and the other two were used for validating the model's usability. The experimental results show that the Area Under the Curve (AUC) reaches 0.93, with satisfactory accuracy and precision. We then further investigated the effectiveness of the model by identifying the RFI in the actual observational data from LOFAR and MeerKAT. The results show that the model performs well. The overall effectiveness is comparable to AOFlagger software and provides an improvement over existing methods in some instances.

preprint2022arXiv

Sequential Offloading for Distributed DNN Computation in Multiuser MEC Systems

This paper studies a sequential task offloading problem for a multiuser mobile edge computing (MEC) system. We consider a dynamic optimization approach, which embraces wireless channel fluctuations and random deep neural network (DNN) task arrivals over an infinite horizon. Specifically, we introduce a local CPU workload queue (WD-QSI) and an MEC server workload queue (MEC-QSI) to model the dynamic workload of DNN tasks at each WD and the MEC server, respectively. The transmit power and the partitioning of the local DNN task at each WD are dynamically determined based on the instantaneous channel conditions (to capture the transmission opportunities) and the instantaneous WD-QSI and MEC-QSI (to capture the dynamic urgency of the tasks) to minimize the average latency of the DNN tasks. The joint optimization can be formulated as an ergodic Markov decision process (MDP), in which the optimality condition is characterized by a centralized Bellman equation. However, the brute force solution of the MDP is not viable due to the curse of dimensionality as well as the requirement for knowledge of the global state information. To overcome these issues, we first decompose the MDP into multiple lower dimensional sub-MDPs, each of which can be associated with a WD or the MEC server. Next, we further develop a parametric online Q-learning algorithm, so that each sub-MDP is solved locally at its associated WD or the MEC server. The proposed solution is completely decentralized in the sense that the transmit power for sequential offloading and the DNN task partitioning can be determined based on the local channel state information (CSI) and the local WD-QSI at the WD only. Additionally, no prior knowledge of the distribution of the DNN task arrivals or the channel statistics will be needed for the MEC server.

preprint2022arXiv

SHREC'22 Track: Sketch-Based 3D Shape Retrieval in the Wild

Sketch-based 3D shape retrieval (SBSR) is an important yet challenging task, which has drawn more and more attention in recent years. Existing approaches address the problem in a restricted setting, without appropriately simulating real application scenarios. To mimic the realistic setting, in this track, we adopt large-scale sketches drawn by amateurs of different levels of drawing skills, as well as a variety of 3D shapes including not only CAD models but also models scanned from real objects. We define two SBSR tasks and construct two benchmarks consisting of more than 46,000 CAD models, 1,700 realistic models, and 145,000 sketches in total. Four teams participated in this track and submitted 15 runs for the two tasks, evaluated by 7 commonly-adopted metrics. We hope that, the benchmarks, the comparative results, and the open-sourced evaluation code will foster future research in this direction among the 3D object retrieval community.

preprint2022arXiv

Spectroscopy Signatures of Electron Correlations in a Trilayer Graphene/hBN Moiré Superlattice

ABC-stacked trilayer graphene/hBN moiré superlattice (TLG/hBN) has emerged as a playground for correlated electron physics. We report spectroscopy measurements of dual-gated TLG/hBN using Fourier transformed infrared photocurrent spectroscopy. We observed a strong optical transition between moiré mini-bands that narrows continuously as a bandgap is opened by gating, indicating a reduction of the single particle bandwidth. At half-filling of the valence flat band, a broad absorption peak emerges at ~18 meV, indicating direct optical excitation across an emerging Mott gap. Similar photocurrent spectra are observed in two other correlated insulating states at quarter- and half-filling of the first conduction band. Our findings provide key parameters of the Hubbard model for the understanding of electron correlation in TLG/hBN.

preprint2022arXiv

The Temporal and Spatial Behaviors of CME Occurrence Rate at Different Latitudes

The statistical study of the Coronal Mass Ejections (CMEs) is a hot topic in solar physics. To further reveal the temporal and spatial behaviors of the CMEs at different latitudes and heights, we analyzed the correlation and phase relationships between the occurrence rate of CMEs, the Coronal Brightness Index (CBI), and the 10.7-cm solar radio flux (F10.7). We found that the occurrence rate of the CMEs correlates with CBI relatively stronger at high latitudes (>=60) than at low latitudes (<=50). At low latitudes, the occurrence rate of the CMEs correlates relatively weaker with CBI than F10.7. There is a relatively stronger correlation relationship between CMEs, F10.7, and CBI during Solar Cycle 24(SC24) than Solar Cycle 23 (SC23). During SC23, the high-latitude CME occurrence rate lags behind F10.7 by three months, and during SC24, the low-latitude CME occurrence rate leads to the low-latitude CBI by one month. The correlation coefficient values turn out to be larger when the very faint CMEsare removed from the samples of the CDAW catalog. Based on our results, we may speculate that the source regions of the high/low-latitude CMEs may vary in height, and the process of magnetic energy accumulation and dissipation is from the lower to the upper atmosphere of the Sun. The temporal offsets between different indicators could help us better understand the physical processes responsible for the solar-terrestrial interactions.

preprint2022arXiv

Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition

Speaker recognition systems (SRSs) have recently been shown to be vulnerable to adversarial attacks, raising significant security concerns. In this work, we systematically investigate transformation and adversarial training based defenses for securing SRSs. According to the characteristic of SRSs, we present 22 diverse transformations and thoroughly evaluate them using 7 recent promising adversarial attacks (4 white-box and 3 black-box) on speaker recognition. With careful regard for best practices in defense evaluations, we analyze the strength of transformations to withstand adaptive attacks. We also evaluate and understand their effectiveness against adaptive attacks when combined with adversarial training. Our study provides lots of useful insights and findings, many of them are new or inconsistent with the conclusions in the image and speech recognition domains, e.g., variable and constant bit rate speech compressions have different performance, and some non-differentiable transformations remain effective against current promising evasion techniques which often work well in the image domain. We demonstrate that the proposed novel feature-level transformation combined with adversarial training is rather effective compared to the sole adversarial training in a complete white-box setting, e.g., increasing the accuracy by 13.62% and attack cost by two orders of magnitude, while other transformations do not necessarily improve the overall defense capability. This work sheds further light on the research directions in this field. We also release our evaluation platform SPEAKERGUARD to foster further research.

preprint2022arXiv

WS-Snapshot: An effective algorithm for wide-field and large-scale imaging

The Square Kilometre Array (SKA) is the largest radio interferometer under construction in the world. The high accuracy, wide-field and large size imaging significantly challenge the construction of the Science Data Processor (SDP) of SKA. We propose a hybrid imaging method based on improved W-Stacking and snapshots. The w range is reduced by fitting the snapshot $uv$ plane, thus effectively enhancing the performance of the improved W-Stacking algorithm. We present a detailed implementation of WS-Snapshot. With full-scale SKA1-LOW simulations, we present the imaging performance and imaging quality results for different parameter cases. The results show that the WS-Snapshot method enables more efficient distributed processing and significantly reduces the computational time overhead within an acceptable accuracy range, which would be crucial for subsequent SKA science studies.

preprint2021arXiv

A Catalog of LAMOST Variable Sources Based on Time-domain Photometry of ZTF

The identification and analysis of different variable sources is a hot issue in astrophysical research. The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) spectroscopic survey has accumulated massive spectral data but contains no information about variable sources. Although a few related studies present variable source catalogs for the LAMOST, the studies still have a few deficiencies regarding the type and number of variable sources identified. In this study, we presented a statistical modeling approach to identify variable source candidates. We first crossed the Kepler, Sloan Digital Sky Survey (SDSS), and Zwicky Transient Facility (ZTF) catalogs to obtain light curves data of variable and non-variable sources. The data are then modeled statistically using commonly used variability parameters, respectively. And then, an optimal variable source identification model is determined using the Receiver Operating Characteristic (ROC) curve and four credible evaluation indices such as precision, accuracy, recall, and F1score. Based on this identification model, a catalog of LAMOST variable sources (including 631,769 variable source candidates with a probability greater than 95% and so on) is obtained. To validate the correctness of the catalog, we performed a two-by-two cross-comparison with the GAIA catalog and other published variable source catalogs. We achieved the correct rate ranging from 50% to 100%. Among the 123,756 sources cross-matched, our variable source catalog identifies 85,669 with a correct rate of 69%, which indicates that the variable source catalog presented in this study is credible.

preprint2021arXiv

A two-species competition model with mixed dispersal and free boundaries in time-periodic environment

This paper is concerned with a Lotka-Volterra type competition model with free boundaries in time-periodic environment. One species is assumed to adopt nonlocal dispersal and the other one adopts mixed dispersal, which is a combination of both random dispersal and nonlocal dispersal. We show that this free boundary problem with more general growth functions admits a unique solution defined for all time. A spreading-vanishing dichotomy is obtained and criteria for spreading and vanishing are provided. Moreover, under the weak competition condition we provide the long-time asymptotic behavior of solution when spreading occurs.

preprint2021arXiv

Correlated interlayer exciton insulator in double layers of monolayer WSe2 and moiré WS2/WSe2

Moiré superlattices in van der Waals heterostructures have emerged as a powerful tool for engineering novel quantum phenomena. Here we report the observation of a correlated interlayer exciton insulator in a double-layer heterostructure composed of a WSe2 monolayer and a WS2/WSe2 moiré bilayer that are separated by an ultrathin hexagonal boron nitride (hBN). The moiré WS2/WSe2 bilayer features a Mott insulator state at hole density p/p0 = 1, where p0 corresponds to one hole per moiré lattice site. When electrons are added to the Mott insulator in the WS2/WSe2 moiré bilayer and an equal number of holes are injected into the WSe2 monolayer, a new interlayer exciton insulator emerges with the holes in the WSe2 monolayer and the electrons in the doped Mott insulator bound together through interlayer Coulomb interactions. The excitonic insulator is stable up to a critical hole density of ~ 0.5p0 in the WSe2 monolayer, beyond which the system becomes metallic. Our study highlights the opportunities for realizing novel quantum phases in double-layer moiré systems due to the interplay between the moiré flat band and strong interlayer electron interactions.

preprint2021arXiv

Correlation-Driven Electron-Hole Asymmetry in Graphene Field Effect Devices

Electron-hole asymmetry is a fundamental property in solids that can determine the nature of quantum phase transitions and the regime of operation for devices. The observation of electron-hole asymmetry in graphene and recently in the phase diagram of bilayer graphene has spurred interest into whether it stems from disorder or from fundamental interactions such as correlations. Here, we report an effective new way to access electron-hole asymmetry in 2D materials by directly measuring the quasiparticle self-energy in graphene/Boron Nitride field effect devices. As the chemical potential moves from the hole to the electron doped side, we see an increased strength of electronic correlations manifested by an increase in the band velocity and inverse quasiparticle lifetime. These results suggest that electronic correlations play an intrinsic role in driving electron hole asymmetry in graphene and provide a new insight for asymmetries in more strongly correlated materials.

preprint2021arXiv

Imaging gate-tunable Tomonaga-Luttinger liquids in 1H-MoSe$_2$ mirror twin boundaries

One-dimensional electron systems (1DESs) exhibit properties that are fundamentally different from higher-dimensional systems. For example, electron-electron interactions in 1DESs have been predicted to induce Tomonaga-Luttinger liquid behavior. Naturally-occurring grain boundaries in single-layer semiconducting transition metal dichalcogenides provide 1D conducting channels that have been proposed to host Tomonaga-Luttinger liquids, but charge density wave physics has also been suggested to explain their behavior. Clear identification of the electronic ground state of this system has been hampered by an inability to electrostatically gate such boundaries and thereby tune their charge carrier concentration. Here we present a scanning tunneling microscopy/spectroscopy study of gate-tunable mirror twin boundaries (MTBs) in single-layer 1H-MoSe$_2$ devices. Gating here enables STM spectroscopy to be performed for different MTB electron densities, thus allowing precise characterization of electron-electron interaction effects. Visualization of MTB electronic structure under these conditions allows unambiguous identification of collective density wave excitations having two distinct velocities, in quantitative agreement with the spin-charge separation predicted by finite-length Tomonaga-Luttinger-liquid theory.

preprint2020arXiv

A Hybrid method of accurate classification for Blazars Of Uncertain Type in Fermi LAT Catalogs

Significant progress in the classification of Fermi unassociated sources , has led to an increasing number of blazars are being found. The optical spectrum is effectively used to classify the blazars into two groups such as BL Lacs and flat spectrum radio quasars (FSRQs). However, the accurate classification of the blazars without optical spectrum information, i.e., blazars of uncertain type (BCUs), remains a significant challenge. In this paper, we present a principal component analysis (PCA) and machine learning hybrid blazars classification method. The method, based on the data from Fermi LAT 3FGL Catalog, first used the PCA to extract the primary features of the BCUs and then used a machine learning algorithm to further classify the BCUs. Experimental results indicate that the that the use of PCA algorithms significantly improved the classification. More importantly, comparison with the Fermi LAT 4FGL Catalog, which contains the spectral classification of those BCUs in the Fermi-LAT 3FGL Catalog, reveals that the proposed classification method in the study exhibits higher accuracy than currently established methods; specifically, 151 out of 171 BL Lacs and 19 out of 24 FSRQs are correctly classified.

preprint2020arXiv

Adversarial jamming attacks and defense strategies via adaptive deep reinforcement learning

As the applications of deep reinforcement learning (DRL) in wireless communications grow, sensitivity of DRL based wireless communication strategies against adversarial attacks has started to draw increasing attention. In order to address such sensitivity and alleviate the resulting security concerns, we in this paper consider a victim user that performs DRL-based dynamic channel access, and an attacker that executes DRLbased jamming attacks to disrupt the victim. Hence, both the victim and attacker are DRL agents and can interact with each other, retrain their models, and adapt to opponents&#39; policies. In this setting, we initially develop an adversarial jamming attack policy that aims at minimizing the accuracy of victim&#39;s decision making on dynamic channel access. Subsequently, we devise defense strategies against such an attacker, and propose three defense strategies, namely diversified defense with proportional-integral-derivative (PID) control, diversified defense with an imitation attacker, and defense via orthogonal policies. We design these strategies to maximize the attacked victim&#39;s accuracy and evaluate their performances.

preprint2020arXiv

An Augmented Regression Model for Tensors with Missing Values

Heterogeneous but complementary sources of data provide an unprecedented opportunity for developing accurate statistical models of systems. Although the existing methods have shown promising results, they are mostly applicable to situations where the system output is measured in its complete form. In reality, however, it may not be feasible to obtain the complete output measurement of a system, which results in observations that contain missing values. This paper introduces a general framework that integrates tensor regression with tensor completion and proposes an efficient optimization framework that alternates between two steps for parameter estimation. Through multiple simulations and a case study, we evaluate the performance of the proposed method. The results indicate the superiority of the proposed method in comparison to a benchmark.

preprint2020arXiv

An end-to-end CNN framework for polarimetric vision tasks based on polarization-parameter-constructing network

Pixel-wise operations between polarimetric images are important for processing polarization information. For the lack of such operations, the polarization information cannot be fully utilized in convolutional neural network(CNN). In this paper, a novel end-to-end CNN framework for polarization vision tasks is proposed, which enables the networks to take full advantage of polarimetric images. The framework consists of two sub-networks: a polarization-parameter-constructing network (PPCN) and a task network. PPCN implements pixel-wise operations between images in the CNN form with 1x1 convolution kernels. It takes raw polarimetric images as input, and outputs polarization-parametric images to task network so as to complete a vison task. By training together, the PPCN can learn to provide the most suitable polarization-parametric images for the task network and the dataset. Taking faster R-CNN as task network, the experimental results show that compared with existing methods, the proposed framework achieves much higher mean-average-precision (mAP) in object detection task

preprint2020arXiv

Convolutional neural network based deep-learning architecture for intraprostatic tumour contouring on PSMA PET images in patients with primary prostate cancer

Accurate delineation of the intraprostatic gross tumour volume (GTV) is a prerequisite for treatment approaches in patients with primary prostate cancer (PCa). Prostate-specific membrane antigen positron emission tomography (PSMA-PET) may outperform MRI in GTV detection. However, visual GTV delineation underlies interobserver heterogeneity and is time consuming. The aim of this study was to develop a convolutional neural network (CNN) for automated segmentation of intraprostatic tumour (GTV-CNN) in PSMA-PET. Methods: The CNN (3D U-Net) was trained on [68Ga]PSMA-PET images of 152 patients from two different institutions and the training labels were generated manually using a validated technique. The CNN was tested on two independent internal (cohort 1: [68Ga]PSMA-PET, n=18 and cohort 2: [18F]PSMA-PET, n=19) and one external (cohort 3: [68Ga]PSMA-PET, n=20) test-datasets. Accordance between manual contours and GTV-CNN was assessed with Dice-Sørensen coefficient (DSC). Sensitivity and specificity were calculated for the two internal test-datasets by using whole-mount histology. Results: Median DSCs for cohorts 1-3 were 0.84 (range: 0.32-0.95), 0.81 (range: 0.28-0.93) and 0.83 (range: 0.32-0.93), respectively. Sensitivities and specificities for GTV-CNN were comparable with manual expert contours: 0.98 and 0.76 (cohort 1) and 1 and 0.57 (cohort 2), respectively. Computation time was around 6 seconds for a standard dataset. Conclusion: The application of a CNN for automated contouring of intraprostatic GTV in [68Ga]PSMA- and [18F]PSMA-PET images resulted in a high concordance with expert contours and in high sensitivities and specificities in comparison with histology reference. This robust, accurate and fast technique may be implemented for treatment concepts in primary PCa. The trained model and the study&#39;s source code are available in an open source repository.

preprint2020arXiv

Cooling-Aware Resource Allocation and Load Management for Mobile Edge Computing Systems

Driven by explosive computation demands of Internet of Things (IoT), mobile edge computing (MEC) provides a promising technique to enhance the computation capability for mobile users. In this paper, we propose a joint resource allocation and load management mechanism in an MEC system with wireless power transfer (WPT), by jointly optimizing the transmit power for WPT, the local/edge computing load, the offloading time, and the frequencies of the central processing units (CPUs) at the access point (AP) and the users. To achieve an energy-efficient and sustainable WPT-MEC system, we minimize the total energy consumption of the AP, while meeting computation latency requirements. Cooling energy which is non-negligible, is taken into account in minimizing the energy consumption of the MEC system. By rigorously orchestrating the state-of-the-art optimization techniques, we design an iterative algorithm and obtain the optimal solution in a semi-closed form. Based on the solution, interesting properties and insights are summarized. Extensive numerical tests show that the proposed algorithm can save up to 90.4% the energy of existing benchmarks.

preprint2020arXiv

COVID-19 Docking Server: A meta server for docking small molecules, peptides and antibodies against potential targets of COVID-19

Motivation: The coronavirus disease 2019 (COVID-19) caused by a new type of coronavirus has been emerging from China and led to thousands of death globally since December 2019. Despite many groups have engaged in studying the newly emerged virus and searching for the treatment of COVID-19, the understanding of the COVID-19 target-ligand interactions represents a key chal-lenge. Herein, we introduce COVID-19 Docking Server, a web server that predicts the binding modes between COVID-19 targets and the ligands including small molecules, peptides and anti-bodies. Results: Structures of proteins involved in the virus life cycle were collected or constructed based on the homologs of coronavirus, and prepared ready for docking. The meta platform provides a free and interactive tool for the prediction of COVID-19 target-ligand interactions and following drug discovery for COVID-19.

preprint2020arXiv

Frozen Patterns of Impacted Droplets: From Conical Tips to Toroidal Shapes

We report frozen patterns for the water droplets impacting on a cold substrate through fast-speed images. These patterns can be manipulated by several physical parameters (the droplet size, falling height, and substrate temperature), and the scaling analysis has a remarkable agreement with the phase diagram. The observed double-concentric toroidal shape is attributed to the correlation between the impacting dynamics and freezing process, as confirmed by the spatiotemporal evolution of the droplet temperature, the identified timescale associated with the morphology and solidification ($t_{inn}\simeq τ_{sol}$), and the ice front-advection model. These results for frozen patterns provide insight into the complex interplay of the rapid impacting hydrodynamics, the transient heat transfer, and the intricate solidification process.

preprint2020arXiv

High Yield Growth and Doping of Black Phosphorus with Tunable Electronic Properties

Black phosphorus (BP) has recently attracted significant interest due to its unique electronic and optical properties. Doping is an effective strategy to tune a material&#39;s electronic structures, however, the direct and controllable growth of BP with a high yield and its doping remain a great challenge. Here we report an efficient short-distance transport (SDT) growth approach and achieve the controlled growth of high quality BP with the highest yield so far, where 98% of the red phosphorus is converted to BP. The doping of BP by As, Sb, Bi, Se and Te are also achieved by this SDT growth approach. Spectroscopic results show that doping systematically changes its electronic structures including band gap, work function, and energy band position. As a result, we have found that the air-stability of doped BP samples (Sb and Te-doped BP) improves compared with pristine BP, due to the downshift of the conduction band minimum with doping. This work develops a new method to grow BP and doped BP with tunable electronic structures and improved stability, and should extend the uses of these class of materials in various areas.

preprint2020arXiv

Joint Optimization of User Association, Subchannel Allocation, and Power Allocation in Multi-cell Multi-association OFDMA Heterogeneous Networks

Heterogeneous network is a novel network architecture proposed in Long-Term-Evolution~(LTE), which highly increases the capacity and coverage compared with the conventional networks. However, in order to provide the best services, appropriate resource management must be applied. In this paper, we consider the joint optimization problem of user association, subchannel allocation, and power allocation for downlink transmission in Multi-cell Multi-association Orthogonal Frequency Division Multiple Access (OFDMA) heterogeneous networks. To solve the optimization problem, we first divide it into two subproblems: 1) user association and subchannel allocation for fixed power allocation; 2) power allocation for fixed user association and subchannel allocation. Subsequently, we obtain a locally optimal solution for the joint optimization problem by solving these two subproblems alternately. For the first subproblem, we derive the globally optimal solution based on graph theory. For the second subproblem, we obtain a Karush-Kuhn-Tucker (KKT) optimal solution by a low complexity algorithm based on the difference of two convex functions approximation (DCA) method. In addition, the multi-antenna receiver case and the proportional fairness case are also discussed. Simulation results demonstrate that the proposed algorithms can significantly enhance the overall network throughput.

preprint2020arXiv

Optimal Energy Allocation and Task Offloading Policy for Wireless Powered Mobile Edge Computing Systems

This paper studies a wireless powered mobile edge computing (MEC) system with fluctuating channels and dynamic task arrivals over time. We jointly optimize the transmission energy allocation at the energy transmitter (ET) for WPT and the task allocation at the user for local computing and offloading over a particular finite horizon, with the objective of minimizing the total transmission energy consumption at the ET while ensuring the user&#39;s successful task execution. First, in order to characterize the fundamental performance limit, we consider the offline optimization by assuming that the perfect knowledge of channel state information and task state information (i.e., task arrival timing and amounts) is known a-priori. In this case, we obtain the well-structured optimal solution in a closed form to the energy minimization problem via convex optimization techniques. Next, inspired by the structured offline solutions obtained above, we develop heuristic online designs for the joint energy and task allocation when the knowledge of CSI/TSI is only causally known. Finally, numerical results are provided to show that the proposed joint designs achieve significantly smaller energy consumption than benchmark schemes with only local computing or full offloading at the user, and the proposed heuristic online designs perform close to the optimal offline solutions.

preprint2020arXiv

Real-Time Resource Allocation for Wireless Powered Multiuser Mobile Edge Computing With Energy and Task Causality

This paper considers a wireless powered multiuser mobile edge computing (MEC) system, in which a multi-antenna hybrid access point (AP) wirelessly charges multiple users, and each user relies on the harvested energy to execute computation tasks. We jointly optimize the energy beamforming and remote task execution at the AP, as well as the local computing and task offloading, aiming to minimize the total system energy consumption over a finite time horizon, subject to causality constraints for both energy harvesting and task arrival at the users. In particular, we consider a practical scenario with casual task state information (TSI) and channel state information (CSI), i.e., only the current and previous TSI and CSI are available, but the future TSI and CSI can only be predicted subject to certain errors. To solve this real-time resource allocation problem, we propose an offline-optimization inspired online design approach. First, we consider the offline optimization case by assuming that the TSI and CSI are perfectly known a-priori. In this case, the energy minimization problem corresponds to a convex problem, for which the semi-closed-form optimal solution is obtained via the Lagrange duality method. Next, inspired by the optimal offline solution, we propose a sliding-window based online resource allocation design in practical cases by integrating with the sequential optimization. Finally, numerical results show that the proposed joint wireless powered MEC designs significantly improve the system&#39;s energy efficiency, as compared with the benchmark schemes that consider a sliding window of size one or without such joint optimization.

preprint2020arXiv

Robust Automated Photometry Pipeline for Blurred Images

The primary task of the 1.26-m telescope jointly operated by the National Astronomical Observatory and Guangzhou University is photometric observations of the g, r, and i bands. A data processing pipeline system was set up with mature software packages, such as IRAF, SExtractor, and SCAMP, to process approximately 5 GB of observational data automatically every day. However, the success ratio was significantly reduced when processing blurred images owing to telescope tracking error; this, in turn, significantly constrained the output of the telescope. We propose a robust automated photometric pipeline (RAPP) software that can correctly process blurred images. Two key techniques are presented in detail: blurred star enhancement and robust image matching. A series of tests proved that RAPP not only achieves a photometric success ratio and precision comparable to those of IRAF but also significantly reduces the data processing load and improves the efficiency.

preprint2020arXiv

SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spelling Check

Chinese Spelling Check (CSC) is a task to detect and correct spelling errors in Chinese natural language. Existing methods have made attempts to incorporate the similarity knowledge between Chinese characters. However, they take the similarity knowledge as either an external input resource or just heuristic rules. This paper proposes to incorporate phonological and visual similarity knowledge into language models for CSC via a specialized graph convolutional network (SpellGCN). The model builds a graph over the characters, and SpellGCN is learned to map this graph into a set of inter-dependent character classifiers. These classifiers are applied to the representations extracted by another network, such as BERT, enabling the whole network to be end-to-end trainable. Experiments (The dataset and all code for this paper are available at https://github.com/ACL2020SpellGCN/SpellGCN) are conducted on three human-annotated datasets. Our method achieves superior performance against previous models by a large margin.

preprint2020arXiv

Study of Thermal Expansion Coefficients of 2D Materials via Raman Micro-spectroscopy: Revisited

We report a joint study, using Raman micro-spectroscopy and molecular dynamics simulations, on the substrate effect on thermal properties of 2D materials and revisit measurement of thermal expansion coefficient (TEC) of supported 2D film. Graphene is employed as a representative. We find that the out-of-plane coupling between graphene and substrate strongly affects the temperature-dependent vibrational modes and TEC of graphene. Density of states for long-wavelength out-of-plane oscillations is significantly reduced when graphene is supported on an alkane substrate. To account for the contribution of the out-of-plane coupling to TEC, a Raman micro-spectroscopic scheme is developed. The TEC of graphene on octadecyltrichlorosilane substrate is found to be (-0.6+-0.5)*10-6/K at room temperature, which is fundamentally smaller than that of free-standing graphene. Our results shed light on the understanding of the interaction between 2D material and substrate, and offer a general recipe for optical measurement of TEC of a supported 2D film.

preprint2020arXiv

Tunable ferromagnetism at non-integer filling of a moiré superlattice

The flat bands resulting from moiré superlattices in magic-angle twisted bilayer graphene (MATBG) and ABC-trilayer graphene aligned with hexagonal boron nitride (ABC-TLG/hBN) have been shown to give rise to fascinating correlated electron phenomena such as correlated insulators and superconductivity. More recently, orbital magnetism associated with correlated Chern insulators was found in this class of layered structures centered at integer multiples of n0, the density corresponding to one electron per moiré superlattice unit cell. Here we report the experimental observation of ferromagnetism at fractional filling of a flat Chern band in an ABC-TLG/hBN moirésuperlattice. The ferromagnetic state exhibits prominent ferromagnetic hysteresis behavior with large anomalous Hall resistivity in a broad region of densities, centered in the valence miniband at n = -2.3 n0. This ferromagnetism depends very sensitively on the control parameters in the moiré system: not only the magnitude of the anomalous Hall signal, but also the sign of the hysteretic ferromagnetic response can be modulated by tuning the carrier density and displacement field. Our discovery of electrically tunable ferromagnetism in a moiré Chern band at non-integer filling highlights the opportunities for exploring new correlated ferromagnetic states in moiré heterostructures.

preprint2020arXiv

Ultrafast photocurrent and absorption microscopy of few-layer TMD devices isolate rate-limiting dynamics driving fast and efficient photoresponse

Despite inherently poor interlayer conductivity, photodetectors made from few-layer devices of 2D transition metal dichalcogenides (TMDs) such as WSe$_2$ and MoS$_2$ can still yield a desirably fast ($\leq$90 ps) and efficient ($ε$$>$40\%) photoresponse. By combining ultrafast photocurrent (U-PC) and transient absorption (TA) microscopy, the competing electronic escape and recombination rates are unambiguously identified in otherwise complex kinetics. Both the U-PC and TA response of WSe$_2$ yield matching interlayer electronic escape times that accelerate from 1.6 ns to 86 ns with applied $E$-field to predict the maximum device PC-efficiency realized of $\sim$44\%. The slope of the escape rates versus $E$-field suggests out-of-plane electron and hole mobilities of 0.129 and 0.031 cm$^2$/V$s$ respectively. Above $\sim$10$^{11}$ photons/cm$^{2}$ incident flux, defect-assisted Auger scattering greatly decreases efficiency by trapping carriers at vacancy defects. Both TA and PC spectra identify a metal-vacancy sub-gap peak with $\sim$5.6 ns lifetime as a primary trap capturing carriers as they hop between layers. Synchronous TA and U-PC microscopy show the\ net PC collected is modelled by a kinetic rate-law of electronic escape competing against the linear and nonlinear Auger recombination rates. This simple rate-model further predicts the PC-based dynamics, nonlinear amplitude and efficiency, $ε$ over a 10$^5$ range of incident photon flux in few-layer WSe$_2$ and MoS$_2$ devices.

preprint2020arXiv

Uniformly strong convergence of Kähler-Ricci flows on a Fano manifold

In this paper, we study the uniformly strong convergence of Kähler-Ricci flow on a Fano manifold with varied initial metrics and smooth deformation complex structures. As an application, we prove the uniqueness of Kähler-Ricci solitons in sense of diffeomorphism orbits. The result generalizes Tian-Zhu&#39;s theorem for the uniqueness of of Kähler-Ricci solitons on a compact complex manifold, and it is also a generalization of Chen-Sun&#39;s result of for the uniqueness of of Kähler-Einstein metric orbits.

preprint2019arXiv

Experimental observation of the gate-controlled reversal of the anomalous Hall effect in the intrinsic magnetic topological insulator MnBi2Te4 device

Here we report the reserved anomalous Hall effect (AHE) in the 5-septuple-layer van der Waals device of the intrinsic magnetic topological insulator MnBi2Te4. By employing the top/bottom gate, a negative AHE loop gradually decreases to zero and changes to a reversed sign. The reversed AHE exhibits distinct coercive fields and temperature dependence from the previous AHE. It reaches the maximum inside the gap of the Dirac cone. The newly-seen reversed AHE is attributed to the competition of the intrinsic Berry curvature and the Dirac-gap enhanced extrinsic skew scattering. Its gate-controlled switching contributes a scheme for the topological spin field-effect transistors.

preprint2019arXiv

Optical detection of Mott and generalized Wigner crystal states in WSe2/WS2 moiré superlattices

Moiré superlattices are emerging as a new route for engineering strongly correlated electronic states in two-dimensional van der Waals heterostructures, as recently demonstrated in the correlated insulating and superconducting states in magic-angle twisted bilayer graphene and ABC trilayer graphene/boron nitride moiré superlattices. Transition metal dichalcogenide (TMDC) moiré heterostructures provide another exciting model system to explore correlated quantum phenomena, with the addition of strong light-matter interactions and large spin-orbital coupling. Here we report the optical detection of strongly correlated phases in semiconducting WSe2/WS2 moiré superlattices. Our sensitive optical detection technique reveals a Mott insulator state at one hole per superlattice site (ν = 1), and surprising insulating phases at fractional filling factors ν = 1/3 and 2/3, which we assign to generalized Wigner crystallization on an underlying lattice. Furthermore, the unique spin-valley optical selection rules of TMDC heterostructures allow us to optically create and investigate low-energy spin excited states in the Mott insulator. We reveal an especially slow spin relaxation lifetime of many microseconds in the Mott insulating state, orders-of-magnitude longer than that of charge excitations. Our studies highlight novel correlated physics that can emerge in moiré superlattices beyond graphene.

preprint2019arXiv

Perfect absorption by an atomically thin crystal

Optical absorption is one of fundamental light-matter interactions. In most materials, optical absorption is a weak perturbation to the light. In this regime, absorption and emission are irreversible, incoherent processes due to strong damping. Excitons in monolayer transition metal dichalcogenides, however, interact strongly with light, leading to optical absorption in the non-perturbative regime where coherent re-emission of the light has to be considered. Between the incoherent and coherent limits, we show that a robust critical coupling condition exists, leading to perfect optical absorption. Up to 99.6% absorption is measured in a sub-nanometer thick MoSe2 monolayer placed in front of a mirror. The perfect absorption is controlled by tuning the exciton-phonon, exciton-exciton, and exciton-photon interactions by temperature, pulsed laser excitation, and a movable mirror, respectively. Our work suggests unprecedented opportunities for engineering exciton-light interactions using two-dimensional atomically thin crystals, enabling novel photonic applications including ultrafast light modulators and sensitive optical sensing.

preprint2019arXiv

Periodic variation and phase analysis of grouped solar flare with sunspot activity

Studies on the periodic variation and the phase relationship between different solar activity indicators are useful for understanding the long-term evolution of solar activity cycle. Here we report the statistical analysis of grouped solar flare (GSF) and sunspot number (SN) during the time interval from January 1965 to March 2009. We find that, 1) the significant periodicities of both GSF and SN are related to the differential rotation periodicity, the quasi-biennial oscillation (QBO), and the eleven-year Schwabe cycle (ESC), but the specific values are not absolutely identical; 2) the ESC signal of GSF lags behind that of SN with an average of 7.8 months during the considered time interval, implying that the systematic phase delays between GSF and SN originate from the inter-solar-cycle signal. Our results may provide evidence about the storage of magnetic energy in the corona.

preprint2019arXiv

Resolving spin, valley, and moiré quasi-angular momentum of interlayer excitons in WSe2/WS2 heterostructures

Moiré superlattices provide a powerful way to engineer properties of electrons and excitons in two-dimensional van der Waals heterostructures. The moiré effect can be especially strong for interlayer excitons, where electrons and holes reside in different layers and can be addressed separately. In particular, it was recently proposed that the moiré superlattice potential not only localizes interlayer exciton states at different superlattice positions, but also hosts an emerging moiré quasi-angular momentum (QAM) that periodically switches the optical selection rules for interlayer excitons at different moiré sites. Here we report the observation of multiple interlayer exciton states coexisting in a WSe2/WS2 moiré superlattice and unambiguously determine their spin, valley, and moiré QAM through novel resonant optical pump-probe spectroscopy and photoluminescence excitation spectroscopy. We demonstrate that interlayer excitons localized at different moiré sites can exhibit opposite optical selection rules due to the spatially-varying moiré QAM. Our observation reveals new opportunities to engineer interlayer exciton states and valley physics with moiré superlattices for optoelectronic and valleytronic applications.

preprint2019arXiv

The $ΔI$=2 bands in $^{109}$In: possible antimagnetic rotation

The high-spin structure of $^{109}$In was investigated with the $^{100}$Mo($^{14}$N, 5$n$)$^{109}$In fusion-evaporation reaction at CIAE, Beijing. Eleven new $γ$-rays of $^{109}$In were identified, by which the bandheads of the $ΔI$=2 rotational bands were confirmed. The configurations were assigned with the help of the systematic discussion. Furthermore, the rotational bands are compared with the tilted-axis cranking calculations based on a relativistic mean-field approach. The rotational bands involving the $1p1h$ excitation to the $π$$d_{5/2}$ and $π$$g_{7/2}$ orbitals are suggested as candidates for antimagnetic rotation based on the theoretical results.

preprint2019arXiv

Tunable Correlated Chern Insulator and Ferromagnetism in Trilayer Graphene/Boron Nitride Moiré Superlattice

Studies on two-dimensional electron systems in a strong magnetic field first revealed the quantum Hall (QH) effect, a topological state of matter featuring a finite Chern number (C) and chiral edge states. Haldane later theorized that Chern insulators with integer QH effects could appear in lattice models with complex hopping parameters even at zero magnetic field. The ABC-trilayer graphene/hexagonal boron nitride (TLG/hBN) moiré superlattice provides an attractive platform to explore Chern insulators because it features nearly flat moiré minibands with a valley-dependent electrically tunable Chern number. Here we report the experimental observation of a correlated Chern insulator in a TLG/hBN moiré superlattice. We show that reversing the direction of the applied vertical electric field switches TLG/hBN&#39;s moiré minibands between zero and finite Chern numbers, as revealed by dramatic changes in magneto-transport behavior. For topological hole minibands tuned to have a finite Chern number, we focus on 1/4 filling, corresponding to one hole per moiré unit cell. The Hall resistance is well quantized at h/2e2, i.e. C = 2, for |B| > 0.4 T. The correlated Chern insulator is ferromagnetic, exhibiting significant magnetic hysteresis and a large anomalous Hall signal at zero magnetic field. Our discovery of a C = 2 Chern insulator at zero magnetic field should open up exciting opportunities for discovering novel correlated topological states, possibly with novel topological excitations, in nearly flat and topologically nontrivial moiré minibands.

preprint2019arXiv

Visualization of the flat electronic band in twisted bilayer graphene near the magic angle twist

Bilayer graphene was theorized to host a moire miniband with flat dispersion if the layers are stacked at specific twist angles known as the magic angles. Recently, such twisted bilayer graphene (tBLG) with the first magic angle twist was reported to exhibit correlated insulating state and superconductivity, where the presence of the flat miniband in the system is thought to be essential for the emergence of these ordered phases in the transport measurements. Tunneling spectroscopy and electronic compressibility measurements in tBLG have revealed a van Hove singularity that is consistent with the presence of the flat miniband. However, a direct observation of the flat dispersion in the momentum-space of such moire miniband in tBLG is still elusive. Here, we report the visualization of the flat moire miniband by using angle-resolved photoemission spectroscopy with nanoscale resolution (nanoARPES). The high spatial resolution in nanoARPES enabled the measurement of the local electronic structure of the tBLG. We clearly demonstrate the existence of the flat moire band near the charge neutrality for tBLG close to the magic angle at room temperature.

preprint2016arXiv

Observation of charge density wave order in 1D mirror twin boundaries of single-layer MoSe2

Properties of two-dimensional transition metal dichalcogenides are highly sensitive to the presence of defects in the crystal structure. A detailed understanding of defect structure may lead to control of material properties through defect engineering. Here we provide direct evidence for the existence of isolated, one-dimensional charge density waves at mirror twin boundaries in single-layer MoSe2. Our low-temperature scanning tunneling microscopy/spectroscopy measurements reveal a substantial bandgap of 60 - 140 meV opening at the Fermi level in the otherwise one dimensional metallic structure. We find an energy-dependent periodic modulation in the density of states along the mirror twin boundary, with a wavelength of approximately three lattice constants. The modulations in the density of states above and below the Fermi level are spatially out of phase, consistent with charge density wave order. In addition to the electronic characterization, we determine the atomic structure and bonding configuration of the one-dimensional mirror twin boundary by means of high-resolution non-contact atomic force microscopy. Density functional theory calculations reproduce both the gap opening and the modulations of the density of states.