Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
54works
0followers
24topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

54 published item(s)

preprint2026arXiv

GR-Dexter Technical Report

Vision-language-action (VLA) models have enabled language-conditioned, long-horizon robot manipulation, but most existing systems are limited to grippers. Scaling VLA policies to bimanual robots with high degree-of-freedom (DoF) dexterous hands remains challenging due to the expanded action space, frequent hand-object occlusions, and the cost of collecting real-robot data. We present GR-Dexter, a holistic hardware-model-data framework for VLA-based generalist manipulation on a bimanual dexterous-hand robot. Our approach combines the design of a compact 21-DoF robotic hand, an intuitive bimanual teleoperation system for real-robot data collection, and a training recipe that leverages teleoperated robot trajectories together with large-scale vision-language and carefully curated cross-embodiment datasets. Across real-world evaluations spanning long-horizon everyday manipulation and generalizable pick-and-place, GR-Dexter achieves strong in-domain performance and improved robustness to unseen objects and unseen instructions. We hope GR-Dexter serves as a practical step toward generalist dexterous-hand robotic manipulation.

preprint2026arXiv

Towards Universal Gene Regulatory Network Inference: Unlocking Generalizable Regulatory Knowledge in Single-cell Foundation Models

Gene Regulatory Network (GRN) inference is essential for understanding complex cellular mechanisms, rendered tractable through single-cell transcriptomic data. With the emergence of single-cell Foundation Models (scFMs), enhanced transcriptomic encoding is widely expected to revolutionize GRN inference. However, we observe that their performance remains far from satisfactory. The primary reason is that the standard reconstruction-based pre-training objectives often fail to explicitly capture latent regulatory signals. To bridge this gap, we first introduce a GRN generalization benchmark designed to evaluate regulatory predictions on unseen genes and datasets, which relies on the zero-shot capabilities of scFMs and is inherently challenging for traditional methods. Furthermore, to unlock the regulatory knowledge within the foundation models, we propose two novel methods, Virtual Value Perturbation and Gradient Trajectory, to distill implicit regulatory information from scFMs into highly generalizable inter-gene features. Extensive experiments demonstrate that our approach significantly outperforms existing methods, establishing a new paradigm for leveraging the potential of scFMs in universal GRN inference.

preprint2024arXiv

Engineering topological chiral transport in a flat-band lattice of ultracold atoms

The manipulation of particle transport in synthetic quantum matter is an active research frontier for its theoretical importance and potential applications. Here we experimentally demonstrate an engineered topological transport in a synthetic flat-band lattice of ultracold $^{87}$Rb atoms. We implement a quasi-one-dimensional rhombic chain with staggered flux in the momentum space of the atomic condensate and observe biased local oscillations that originate from the interplay of the staggered flux and flat-band localization under the mechanism of Aharonov-Bohm caging. Based on these features, we design and experimentally confirm a state-dependent chiral transport under the periodic modulation of the synthetic flux. We show that the phenomenon is topologically protected by the winding of the Floquet Bloch bands of a coarse-grained effective Hamiltonian. The observed chiral transport offers a strategy for efficient quantum device design where topological robustness is ensured by fast Floquet driving and flat-band localization.

preprint2024arXiv

Roles of non-axisymmetric perturbations in free drift vertical displacement events on EAST

The safe operation of most tokamaks, especially the largen sized ones, rely on the feedback control of the vertical displacement events (VDEs). However, most these feedback control systems are based on the axisymmetric VDE models. In this work, we use NIMROD simulations to study the roles of non-axisymmetric perturbations in free drift vertical displacement events on EAST. The high-$n$ modes in non-axisymmetric VDE grow first, which drive the formation of high-$n$ magnetic island chains. Subsequently, the magnetic island chains grow and overlap with each other, leading to the destruction of the magnetic flux surface, which induces a minor disruption and accelerates the start of the following major disruption. The magnetic island and the stochastic magnetic field allow the toroidally asymmetric poloidal plasma current to jet towards the hoop force direction, forming the finger and filamentary structures. Such a plasma current asymmetry strongly depends on the anisotropy in thermal transport coefficients.

preprint2023arXiv

A semi-discrete first-order low regularity exponential integrator for the "good" Boussinesq equation without loss of regularity

In this paper, we propose a semi-discrete first-order low regularity exponential-type integrator (LREI) for the ``good" Boussinesq equation. It is shown that the method is convergent linearly in the space $H^r$ for solutions belonging to $H^{r+p(r)}$ where $0\le p(r)\le 1$ is non-increasing with respect to $r$, which means less additional derivatives might be needed when the numerical solution is measured in a more regular space. Particularly, the LREI presents the first-order accuracy in $H^{r}$ with no assumptions of additional derivatives when $r>5/2$. This is the first time to propose a low regularity method which achieves the optimal first-order accuracy without loss of regularity for the GB equation. The convergence is confirmed by extensive numerical experiments.

preprint2023arXiv

Disentangled Representation for Diversified Recommendations

Accuracy and diversity have long been considered to be two conflicting goals for recommendations. We point out, however, that as the diversity is typically measured by certain pre-selected item attributes, e.g., category as the most popularly employed one, improved diversity can be achieved without sacrificing recommendation accuracy, as long as the diversification respects the user's preference about the pre-selected attributes. This calls for a fine-grained understanding of a user's preferences over items, where one needs to recognize the user's choice is driven by the quality of the item itself, or the pre-selected attributes of the item. In this work, we focus on diversity defined on item categories. We propose a general diversification framework agnostic to the choice of recommendation algorithms. Our solution disentangles the learnt user representation in the recommendation module into category-independent and category-dependent components to differentiate a user's preference over items from two orthogonal perspectives. Experimental results on three benchmark datasets and online A/B test demonstrate the effectiveness of our solution in improving both recommendation accuracy and diversity. In-depth analysis suggests that the improvement is due to our improved modeling of users' categorical preferences and refined ranking within item categories.

preprint2022arXiv

A special cross-tie domain wall in helimagnet

A special cross-tie (SCT) domain wall was discovered in the helimagnet MnCoSi alloy via the magnetic vector field tomography in Lorentz transmission electron microscopy (LTEM). Different to the traditional cross-tie (TCT) domain wall where the convergent/divergent magnetic moment configuration line up one by one, the relative large Bloch type sub-walls emerge in this brand-new SCT domain wall and two mutually perpendicular rotation axes coexist in this special feature. The straight magnetic stripes accompanied with the unraveled domain walls hint the complex mechanism to form this SCT structure. Interestingly, different orientation of this domain wall in LTEM can easily exhibit various magnetic features, including meron/antimeron chains or bimeron strings.

preprint2022arXiv

Biologically Inspired Neural Path Finding

The human brain can be considered to be a graphical structure comprising of tens of billions of biological neurons connected by synapses. It has the remarkable ability to automatically re-route information flow through alternate paths in case some neurons are damaged. Moreover, the brain is capable of retaining information and applying it to similar but completely unseen scenarios. In this paper, we take inspiration from these attributes of the brain, to develop a computational framework to find the optimal low cost path between a source node and a destination node in a generalized graph. We show that our framework is capable of handling unseen graphs at test time. Moreover, it can find alternate optimal paths, when nodes are arbitrarily added or removed during inference, while maintaining a fixed prediction time. Code is available here: https://github.com/hangligit/pathfinding

preprint2022arXiv

Bridge the Gap between Supervised and Unsupervised Learning for Fine-Grained Classification

Unsupervised learning technology has caught up with or even surpassed supervised learning technology in general object classification (GOC) and person re-identification (re-ID). However, it is found that the unsupervised learning of fine-grained visual classification (FGVC) is more challenging than GOC and person re-ID. In order to bridge the gap between unsupervised and supervised learning for FGVC, we investigate the essential factors (including feature extraction, clustering, and contrastive learning) for the performance gap between supervised and unsupervised FGVC. Furthermore, we propose a simple, effective, and practical method, termed as UFCL, to alleviate the gap. Three key issues are concerned and improved: First, we introduce a robust and powerful backbone, ResNet50-IBN, which has an ability of domain adaptation when we transfer ImageNet pre-trained models to FGVC tasks. Next, we propose to introduce HDBSCAN instead of DBSCAN to do clustering, which can generate better clusters for adjacent categories with fewer hyper-parameters. Finally, we propose a weighted feature agent and its updating mechanism to do contrastive learning by using the pseudo labels with inevitable noise, which can improve the optimization process of learning the parameters of the network. The effectiveness of our UFCL is verified on CUB-200-2011, Oxford-Flowers, Oxford-Pets, Stanford-Dogs, Stanford-Cars and FGVC-Aircraft datasets. Under the unsupervised FGVC setting, we achieve state-of-the-art results, and analyze the key factors and the important parameters to provide a practical guidance.

preprint2022arXiv

Comparison of skyrmion phases between poly and single-crystal MnSi by composite magnetoelectric method

We have explored the skyrmion phases and phase diagram of poly and single-crystal MnSi by the measurements of the magnetoelectric coefficient alfaE and ac magnetic susceptibility of the MnSi/PMN-PT composite. We found that the regular skyrmion lattice phase in single crystal sample has been averaged in the MnSi polycrystal due to random grain orientations which results in an extended skyrmion lattice-conical mixture phase down to 25 K. The magnitude of the out-of-phase component in alfaE of the polycrystal, not single crystal, decreases gradually with decreasing frequency. With the changing of the driven ac field, it reveals a depinning threshold behavior in both samples. The depinning field is stronger in the polycrystal than that in single crystal and maybe responsible for the diminishing of dissipative behavior at lower frequency due to grain boundaries and defects. The composite magnetoelectric method provides a unique approach to probe topological phase dynamics.

preprint2022arXiv

Covert Beamforming Design for Integrated Radar Sensing and Communication Systems

We propose covert beamforming design frameworks for integrated radar sensing and communication (IRSC) systems, where the radar can covertly communicate with legitimate users under the cover of the probing waveforms without being detected by the eavesdropper. Specifically, by jointly designing the target detection beamformer and communication beamformer, we aim to maximize the radar detection mutual information (MI) (or the communication rate) subject to the covert constraint, the communication rate constraint (or the radar detection MI constraint), and the total power constraint. For the perfect eavesdropper's channel state information (CSI) scenario, we transform the covert beamforming design problems into a series of convex subproblems, by exploiting semidefinite relaxation, which can be solved via the bisection search method. Considering the high complexity of iterative optimization, we further propose a single-iterative covert beamformer design scheme based on the zero-forcing criterion. For the imperfect eavesdropper's CSI scenario, we develop a relaxation and restriction method to tackle the robust covert beamforming design problems. Simulation results demonstrate the effectiveness of the proposed covert beamforming schemes for perfect and imperfect CSI scenarios.

preprint2022arXiv

Directed Acyclic Transformer for Non-Autoregressive Machine Translation

Non-autoregressive Transformers (NATs) significantly reduce the decoding latency by generating all tokens in parallel. However, such independent predictions prevent NATs from capturing the dependencies between the tokens for generating multiple possible translations. In this paper, we propose Directed Acyclic Transfomer (DA-Transformer), which represents the hidden states in a Directed Acyclic Graph (DAG), where each path of the DAG corresponds to a specific translation. The whole DAG simultaneously captures multiple translations and facilitates fast predictions in a non-autoregressive fashion. Experiments on the raw training data of WMT benchmark show that DA-Transformer substantially outperforms previous NATs by about 3 BLEU on average, which is the first NAT model that achieves competitive results with autoregressive Transformers without relying on knowledge distillation.

preprint2022arXiv

Forgetting Fast in Recommender Systems

Users of a recommender system may want part of their data being deleted, not only from the data repository but also from the underlying machine learning model, for privacy or utility reasons. Such right-to-be-forgotten requests could be fulfilled by simply retraining the recommendation model from scratch, but that would be too slow and too expensive in practice. In this paper, we investigate fast machine unlearning techniques for recommender systems that can remove the effect of a small amount of training data from the recommendation model without incurring the full cost of retraining. A natural idea to speed this process up is to fine-tune the current recommendation model on the remaining training data instead of starting from a random initialization. This warm-start strategy indeed works for neural recommendation models using standard 1st-order neural network optimizers (like AdamW). However, we have found that even greater acceleration could be achieved by employing 2nd-order (Newton or quasi-Newton) optimization methods instead. To overcome the prohibitively high computational cost of 2nd-order optimizers, we propose a new recommendation unlearning approach AltEraser which divides the optimization problem of unlearning into many small tractable sub-problems. Extensive experiments on three real-world recommendation datasets show promising results of AltEraser in terms of consistency (forgetting thoroughness), accuracy (recommendation effectiveness), and efficiency (unlearning speed). To our knowledge, this work represents the first attempt at fast approximate machine unlearning for state-of-the-art neural recommendation models.

preprint2022arXiv

How does Feedback Signal Quality Impact Effectiveness of Pseudo Relevance Feedback for Passage Retrieval?

Pseudo-Relevance Feedback (PRF) assumes that the top results retrieved by a first-stage ranker are relevant to the original query and uses them to improve the query representation for a second round of retrieval. This assumption however is often not correct: some or even all of the feedback documents may be irrelevant. Indeed, the effectiveness of PRF methods may well depend on the quality of the feedback signal and thus on the effectiveness of the first-stage ranker. This aspect however has received little attention before. In this paper we control the quality of the feedback signal and measure its impact on a range of PRF methods, including traditional bag-of-words methods (Rocchio), and dense vector-based methods (learnt and not learnt). Our results show the important role the quality of the feedback signal plays on the effectiveness of PRF methods. Importantly, and surprisingly, our analysis reveals that not all PRF methods are the same when dealing with feedback signals of varying quality. These findings are critical to gain a better understanding of the PRF methods and of which and when they should be used, depending on the feedback signal quality, and set the basis for future research in this area.

preprint2022arXiv

Implicit Feedback for Dense Passage Retrieval: A Counterfactual Approach

In this paper we study how to effectively exploit implicit feedback in Dense Retrievers (DRs). We consider the specific case in which click data from a historic click log is available as implicit feedback. We then exploit such historic implicit interactions to improve the effectiveness of a DR. A key challenge that we study is the effect that biases in the click signal, such as position bias, have on the DRs. To overcome the problems associated with the presence of such bias, we propose the Counterfactual Rocchio (CoRocchio) algorithm for exploiting implicit feedback in Dense Retrievers. We demonstrate both theoretically and empirically that dense query representations learnt with CoRocchio are unbiased with respect to position bias and lead to higher retrieval effectiveness. We make available the implementations of the proposed methods and the experimental framework, along with all results at https://github.com/ielab/Counterfactual-DR.

preprint2022arXiv

Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts

Most existing methods in vision language pre-training rely on object-centric features extracted through object detection and make fine-grained alignments between the extracted features and texts. It is challenging for these methods to learn relations among multiple objects. To this end, we propose a new method called X-VLM to perform `multi-grained vision language pre-training.' The key to learning multi-grained alignments is to locate visual concepts in the image given the associated texts, and in the meantime align the texts with the visual concepts, where the alignments are in multi-granularity. Experimental results show that X-VLM effectively leverages the learned multi-grained alignments to many downstream vision language tasks and consistently outperforms state-of-the-art methods.

preprint2022arXiv

Multimodal Entity Tagging with Multimodal Knowledge Base

To enhance research on multimodal knowledge base and multimodal information processing, we propose a new task called multimodal entity tagging (MET) with a multimodal knowledge base (MKB). We also develop a dataset for the problem using an existing MKB. In an MKB, there are entities and their associated texts and images. In MET, given a text-image pair, one uses the information in the MKB to automatically identify the related entity in the text-image pair. We solve the task by using the information retrieval paradigm and implement several baselines using state-of-the-art methods in NLP and CV. We conduct extensive experiments and make analyses on the experimental results. The results show that the task is challenging, but current technologies can achieve relatively high performance. We will release the dataset, code, and models for future research.

preprint2022arXiv

Observation of Non-Hermitian Skin Effect and Topology in Ultracold Atoms

The non-Hermitian skin effect (NHSE), the accumulation of eigen wavefunctions at boundaries of open systems, underlies a variety of exotic properties that defy conventional wisdom. While NHSE and its intriguing impact on band topology and dynamics have been observed in classical or photonic systems, their demonstration in a quantum many-body setting remains elusive. Here we report the experimental realization of a dissipative Aharonov-Bohm chain -- a non-Hermitian topological model with NHSE -- in the momentum space of a two-component Bose-Einstein condensate. We identify unique signatures of NHSE in the condensate dynamics, and perform Bragg spectroscopy to resolve topological edge states against a background of localized bulk states. Our work sets the stage for further investigation on the interplay of many-body statistics and interactions with NHSE, and is a significant step forward in the quantum control and simulation of non-Hermitian physics.

preprint2022arXiv

Observation of short-period helical spin order and magnetic transition in a non-chiral centrosymmetric helimagnet

The search for materials exhibiting nanoscale spiral order continues to be fuelled by the promise of emergent inductors. Although such spin textures have been reported in many materials, most of them exhibit long periods or are limited to operate far below room temperature. Here, we present the real-space observation of an ordered helical spin order with a period of 3.2 nm in a non-chiral centrosymmetric helimagnet MnCoSi at room temperature via multi-angle and multi-azimuth approach of Lorentz transmission electron microscopy (TEM). A magnetic transition from the ordered helical spin order to a cycloidal spin order below 228 K is clearly revealed by in situ neutron powder diffraction and Lorentz TEM, which is closely correlated with temperature-induced variation in magneto-crystalline anisotropy. These results reveal the origin of spiral ordered spin textures in non-chiral centrosymmetric helimagnet, which can serve as a new strategy for searching materials with nanoscale spin order with potential applications in emergent electromagnetism.

preprint2022arXiv

On Calibration of Graph Neural Networks for Node Classification

Graphs can model real-world, complex systems by representing entities and their interactions in terms of nodes and edges. To better exploit the graph structure, graph neural networks have been developed, which learn entity and edge embeddings for tasks such as node classification and link prediction. These models achieve good performance with respect to accuracy, but the confidence scores associated with the predictions might not be calibrated. That means that the scores might not reflect the ground-truth probabilities of the predicted events, which would be especially important for safety-critical applications. Even though graph neural networks are used for a wide range of tasks, the calibration thereof has not been sufficiently explored yet. We investigate the calibration of graph neural networks for node classification, study the effect of existing post-processing calibration methods, and analyze the influence of model capacity, graph density, and a new loss function on calibration. Further, we propose a topology-aware calibration method that takes the neighboring nodes into account and yields improved calibration compared to baseline methods.

preprint2022arXiv

Optimal Probabilistic Constellation Shaping for Covert Communications

In this paper, we investigate the optimal probabilistic constellation shaping design for covert communication systems from a practical view. Different from conventional covert communications with equiprobable constellations modulation, we propose nonequiprobable constellations modulation schemes to further enhance the covert rate. Specifically, we derive covert rate expressions for practical discrete constellation inputs for the first time. Then, we study the covert rate maximization problem by jointly optimizing the constellation distribution and power allocation. In particular, an approximate gradient descent method is proposed for obtaining the optimal probabilistic constellation shaping. To strike a balance between the computational complexity and the transmission performance, we further develop a framework that maximizes a lower bound on the achievable rate where the optimal probabilistic constellation shaping problem can be solved efficiently using the Frank-Wolfe method. Extensive numerical results show that the optimized probabilistic constellation shaping strategies provide significant gains in the achievable covert rate over the state-of-the-art schemes.

preprint2022arXiv

Pseudo Relevance Feedback with Deep Language Models and Dense Retrievers: Successes and Pitfalls

Pseudo Relevance Feedback (PRF) is known to improve the effectiveness of bag-of-words retrievers. At the same time, deep language models have been shown to outperform traditional bag-of-words rerankers. However, it is unclear how to integrate PRF directly with emergent deep language models. In this article, we address this gap by investigating methods for integrating PRF signals into rerankers and dense retrievers based on deep language models. We consider text-based and vector-based PRF approaches, and investigate different ways of combining and scoring relevance signals. An extensive empirical evaluation was conducted across four different datasets and two task settings (retrieval and ranking). Text-based PRF results show that the use of PRF had a mixed effect on deep rerankers across different datasets. We found that the best effectiveness was achieved when (i) directly concatenating each PRF passage with the query, searching with the new set of queries, and then aggregating the scores; (ii) using Borda to aggregate scores from PRF runs. Vector-based PRF results show that the use of PRF enhanced the effectiveness of deep rerankers and dense retrievers over several evaluation metrics. We found that higher effectiveness was achieved when (i) the query retains either the majority or the same weight within the PRF mechanism, and (ii) a shallower PRF signal (i.e., a smaller number of top-ranked passages) was employed, rather than a deeper signal. Our vector-based PRF method is computationally efficient; thus this represents a general PRF method others can use with deep rerankers and dense retrievers.

preprint2022arXiv

Reconstruction of low dimensional electronic states by altering the chemical arrangement at the SrTiO3 surface

Developing reliable methods for modulating the electronic structure of the two-dimensional electron gas (2DEG) in SrTiO3 is crucial for utilizing its full potential and inducing novel properties. Here, we show that relatively simple surface preparation reconstructs the 2DEG of SrTiO3 (STO) surface, leading to a Lifshitz-like transition. Combining experimental methods, such as angle-resolved photoemission spectroscopy (ARPES) and X-ray photoemission spectroscopy (XPS) with ab initio calculations, we find that the modulation of the surface band structures is primarily attributed to the reorganization of the chemical composition. In addition, ARPES experiments demonstrate that vacuum ultraviolet (VUV) light can be efficiently employed to alter the band renormalization of the 2DEG system and control the electron-phonon interaction (EPI). Our study provides a robust and straightforward route to stabilize and tune the low-dimensional electronic structure via the chemical degeneracy of the STO surface.

preprint2022arXiv

Self-Supervised Audio-and-Text Pre-training with Extremely Low-Resource Parallel Data

Multimodal pre-training for audio-and-text has recently been proved to be effective and has significantly improved the performance of many downstream speech understanding tasks. However, these state-of-the-art pre-training audio-text models work well only when provided with large amount of parallel audio-and-text data, which brings challenges on many languages that are rich in unimodal corpora but scarce of parallel cross-modal corpus. In this paper, we investigate whether it is possible to pre-train an audio-text multimodal model with extremely low-resource parallel data and extra non-parallel unimodal data. Our pre-training framework consists of the following components: (1) Intra-modal Denoising Auto-Encoding (IDAE), which is able to reconstruct input text (audio) representations from a noisy version of itself. (2) Cross-modal Denoising Auto-Encoding (CDAE), which is pre-trained to reconstruct the input text (audio), given both a noisy version of the input text (audio) and the corresponding translated noisy audio features (text embeddings). (3) Iterative Denoising Process (IDP), which iteratively translates raw audio (text) and the corresponding text embeddings (audio features) translated from previous iteration into the new less-noisy text embeddings (audio features). We adapt a dual cross-modal Transformer as our backbone model which consists of two unimodal encoders for IDAE and two cross-modal encoders for CDAE and IDP. Our method achieves comparable performance on multiple downstream speech understanding tasks compared with the model pre-trained on fully parallel data, demonstrating the great potential of the proposed method. Our code is available at: \url{https://github.com/KarlYuKang/Low-Resource-Multimodal-Pre-training}.

preprint2022arXiv

Spectral and Energy Efficiency of DCO-OFDM in Visible Light Communication Systems with Finite-Alphabet Inputs

The bound of the information transmission rate of direct current biased optical orthogonal frequency division multiplexing (DCO-OFDM) for visible light communication (VLC) with finite-alphabet inputs is yet unknown, where the corresponding spectral efficiency (SE) and energy efficiency (EE) stems out as the open research problems. In this paper, we derive the exact achievable rate of {the} DCO-OFDM system with finite-alphabet inputs for the first time. Furthermore, we investigate SE maximization problems of {the} DCO-OFDM system subject to both electrical and optical power constraints. By exploiting the relationship between the mutual information and the minimum mean-squared error, we propose a multi-level mercury-water-filling power allocation scheme to achieve the maximum SE. Moreover, the EE maximization problems of {the} DCO-OFDM system are studied, and the Dinkelbach-type power allocation scheme is developed for the maximum EE. Numerical results verify the effectiveness of the proposed theories and power allocation schemes.

preprint2022arXiv

Text-to-Table: A New Way of Information Extraction

We study a new problem setting of information extraction (IE), referred to as text-to-table. In text-to-table, given a text, one creates a table or several tables expressing the main content of the text, while the model is learned from text-table pair data. The problem setting differs from those of the existing methods for IE. First, the extraction can be carried out from long texts to large tables with complex structures. Second, the extraction is entirely data-driven, and there is no need to explicitly define the schemas. As far as we know, there has been no previous work that studies the problem. In this work, we formalize text-to-table as a sequence-to-sequence (seq2seq) problem. We first employ a seq2seq model fine-tuned from a pre-trained language model to perform the task. We also develop a new method within the seq2seq approach, exploiting two additional techniques in table generation: table constraint and table relation embeddings. We consider text-to-table as an inverse problem of the well-studied table-to-text, and make use of four existing table-to-text datasets in our experiments on text-to-table. Experimental results show that the vanilla seq2seq model can outperform the baseline methods of using relation extraction and named entity extraction. The results also show that our method can further boost the performances of the vanilla seq2seq model. We further discuss the main challenges of the proposed task. The code and data are available at https://github.com/shirley-wu/text_to_table.

preprint2021arXiv

Coupled cavity-waveguide system based on topological corner state and edge state

Topological corner state (TCS) and topological edge state (TES) have provided new approaches to control the propagation of light. The construction of topological coupled cavity-waveguide system (TCCWS) based on TCS and TES is worth looking forward to, due to its research prospects in realizing high-performance micro-nano integrated photonic devices. In this Letter, TCCWS is proposed in two-dimensional (2D) photonic crystal (PC), which possesses strong optical localization, high quality factor and excellent robustness compared with the conventional coupled cavity-waveguide system (CCCWS). This work will provide the possibility to design high-performance logic gates, lasers, filters and other micro-nano integrated photonics devices and expand their applications.

preprint2021arXiv

Deep Convolutional Neural Networks to Predict Mutual Coupling Effects in Metasurfaces

Metasurfaces have provided a novel and promising platform for the realization of compact and large-scale optical devices. The conventional metasurface design approach assumes periodic boundary conditions for each element, which is inaccurate in most cases since the near-field coupling effects between elements will change when surrounded by non-identical structures. In this paper, we propose a deep learning approach to predict the actual electromagnetic (EM) responses of each target meta-atom placed in a large array with near-field coupling effects taken into account. The predicting neural network takes the physical specifications of the target meta-atom and its neighbors as input, and calculates its phase and amplitude in milliseconds. This approach can be applied to explain metasurfaces' performance deterioration caused by mutual coupling and further used to optimize their efficiencies once combined with optimization algorithms. To demonstrate the efficacy of this methodology, we obtain large improvements in efficiency for a beam deflector and a metalens over the conventional design approach. Moreover, we show the correlations between a metasurface's performance and its design errors caused by mutual coupling are not bound to certain specifications (materials, shapes, etc.). As such, we envision that this approach can be readily applied to explore the mutual coupling effects and improve the performance of various metasurface designs.

preprint2021arXiv

High-Order Nonreciprocal Add-Drop Filter

Topological photonics have led to the robust optical behavior of the device, which has solved the problem of the influence of manufacturing defects and perturbations on the device performance. Meanwhile, temporal coupled-mode theory (t-CMT) has been developed and applied widely. However, the t-CMT of cascaded coupling cavities (CCC) system and its corresponding high-order filter has yet to be established. Here the t-CMT of CCC system is established based on the existing t-CMT. By combining the CCC with the topological waveguides, a versatile design scheme of the high-order nonreciprocal add-drop filter (HONAF) is proposed. The relationship between coupling effect of cavities and transmission and filtering performance of HONAF is analyzed quantitatively, then a method to improve the transmission efficiency and quality factor of the filter is given. Based on the combination of gyromagnetic photonic crystals and decagonal Penrose-type photonic quasicrystals, a HONAF is proposed. The transmission and filtering performance of the HONAF are numerically analyzed, which verifies the consistency between the theoretical prediction and the numerical simulation. The t-CMT of CCC system established can be widely used in the coupled resonator optical waveguides and their related systems. The designed HONAF can also be applied and compatible to microwave communication system.

preprint2020arXiv

A Freeform Dielectric Metasurface Modeling Approach Based on Deep Neural Networks

Metasurfaces have shown promising potentials in shaping optical wavefronts while remaining compact compared to bulky geometric optics devices. Design of meta-atoms, the fundamental building blocks of metasurfaces, relies on trial-and-error method to achieve target electromagnetic responses. This process includes the characterization of an enormous amount of different meta-atom designs with different physical and geometric parameters, which normally demands huge computational resources. In this paper, a deep learning-based metasurface/meta-atom modeling approach is introduced to significantly reduce the characterization time while maintaining accuracy. Based on a convolutional neural network (CNN) structure, the proposed deep learning network is able to model meta-atoms with free-form 2D patterns and different lattice sizes, material refractive indexes and thicknesses. Moreover, the presented approach features the capability to predict meta-atoms' wide spectrum responses in the timescale of milliseconds, which makes it attractive for applications such as fast meta-atom/metasurface on-demand designs and optimizations.

preprint2020arXiv

A Full Quantum Eigensolver for Quantum Chemistry Simulations

Quantum simulation of quantum chemistry is one of the most compelling applications of quantum computing. It is of particular importance in areas ranging from materials science, biochemistry and condensed matter physics. Here, we propose a full quantum eigensolver (FQE) algorithm to calculate the molecular ground energies and electronic structures using quantum gradient descent. Compared to existing classical-quantum hybrid methods such as variational quantum eigensolver (VQE), our method removes the classical optimizer and performs all the calculations on a quantum computer with faster convergence. The gradient descent iteration depth has a favorable complexity that is logarithmically dependent on the system size and inverse of the precision. Moreover, the FQE can be further simplified by exploiting perturbation theory for the calculations of intermediate matrix elements, and obtain results with a precision that satisfies the requirement of chemistry application. The full quantum eigensolver can be implemented on a near-term quantum computer. With the rapid development of quantum computing hardware, FQE provides an efficient and powerful tool to solve quantum chemistry problems.

preprint2020arXiv

A Hybrid Quantum Memory Enabled Network at Room Temperature

Quantum memory capable of storage and retrieval of flying photons on demand is crucial for developing quantum information technologies. However, the devices needed for long-distance links are quite different from those envisioned for local processing. Here, we present the first hybrid quantum memory enabled network by demonstrating the interconnection and simultaneous operation of two types of quantum memory: an atomic-ensemble-based memory and an all-optical loop memory. The former generates and stores single atomic excitations that can then be converted to single photons; and the latter maps incoming photons in and out on demand, at room-temperature and with a broad acceptance bandwidth. Interfacing these two types of quantum memories, we observe a well-preserved quantum cross-correlation, reaching a value of 22, and a violation of the Cauchy-Schwarz inequality up to 549 standard deviations. Furthermore, we demonstrate the creation and storage of a fully operable heralded photon chain state that can achieve memory-built-in combining, swapping, splitting, tuning and chopping single photons in a chain temporally. Such a quantum network allows atomic excitations to be generated, stored, and converted to broadband photons, which are then transferred to the next node, stored, and faithfully retrieved, all at high speed and in a programmable fashion.

preprint2020arXiv

Angular momentum conservation in counter-propagating vectorially structured light

It is well-known that electric spin angular momentum and electric orbital angular momentum are conserved under paraxial propagation of travelling waves in free-space. Here we study the electric and magnetic angular momentum in counter-propagating waves and show both theoretically and experimentally that neither component alone is conserved except in special cases. We attribute this non-conservation to spin-spin and orbit-orbit coupling between the electric and magnetic fields. This work generalises previous findings based on travelling waves, explains the apparent spin-orbit coupling in counter-propagating paraxial light, and broadens our understanding of angular momentum conservation in arbitrary structured light waves.

preprint2020arXiv

Conversational Contextual Bandit: Algorithm and Application

Contextual bandit algorithms provide principled online learning solutions to balance the exploitation-exploration trade-off in various applications such as recommender systems. However, the learning speed of the traditional contextual bandit algorithms is often slow due to the need for extensive exploration. This poses a critical issue in applications like recommender systems, since users may need to provide feedbacks on a lot of uninterested items. To accelerate the learning speed, we generalize contextual bandit to conversational contextual bandit. Conversational contextual bandit leverages not only behavioral feedbacks on arms (e.g., articles in news recommendation), but also occasional conversational feedbacks on key-terms from the user. Here, a key-term can relate to a subset of arms, for example, a category of articles in news recommendation. We then design the Conversational UCB algorithm (ConUCB) to address two challenges in conversational contextual bandit: (1) which key-terms to select to conduct conversation, (2) how to leverage conversational feedbacks to accelerate the speed of bandit learning. We theoretically prove that ConUCB can achieve a smaller regret upper bound than the traditional contextual bandit algorithm LinUCB, which implies a faster learning speed. Experiments on synthetic data, as well as real datasets from Yelp and Toutiao, demonstrate the efficacy of the ConUCB algorithm.

preprint2020arXiv

Feature Statistics Guided Efficient Filter Pruning

Building compact convolutional neural networks (CNNs) with reliable performance is a critical but challenging task, especially when deploying them in real-world applications. As a common approach to reduce the size of CNNs, pruning methods delete part of the CNN filters according to some metrics such as $l1$-norm. However, previous methods hardly leverage the information variance in a single feature map and the similarity characteristics among feature maps. In this paper, we propose a novel filter pruning method, which incorporates two kinds of feature map selections: diversity-aware selection (DFS) and similarity-aware selection (SFS). DFS aims to discover features with low information diversity while SFS removes features that have high similarities with others. We conduct extensive empirical experiments with various CNN architectures on publicly available datasets. The experimental results demonstrate that our model obtains up to 91.6% parameter decrease and 83.7% FLOPs reduction with almost no accuracy loss.

preprint2020arXiv

Hacking Quantum Key Distribution via Injection Locking

Unconditionally secure communication, being pursued for thousands of years, however, hasn't been reached yet due to continuous competitions between encryption and hacking. Quantum key distribution (QKD), harnessing the quantum mechanical nature of superposition and non-cloning, may promise unconditional security by incorporating the one-time pad algorithm rigorously proved by Claude Shannon. Massive efforts have been made in building practical and commercial QKD systems, in particular, decoy states are employed to detect photon-number splitting attack against single-photon source loophole, and measurement-device-independent (MDI) QKD has further closed all loopholes in detection side, which leads to a seemingly real-life application. Here, we propose and experimentally demonstrate an MDI-QKD hacking strategy on the trusted source assumption by using injection locking technique. Eve injects near off-resonance photons in randomly chosen polarization into sender's laser, where injection locking in a shifted frequency can happen only when Eve's choice matches with sender's state. By setting a shifted window and switching the frequency of photons back afterwards, Eve in principle can obtain all the keys without terminating the real-time QKD. We observe the dynamics of a semiconductor laser with injected photons, and obtain a hacking success rate reaching 60.0% of raw keys. Our results suggest that the spear-and-shield competitions on unconditional security may continue until all potential loopholes are discovered and closed ultimately.

preprint2020arXiv

Heralding Quantum Entanglement between Two Room-Temperature Atomic Ensembles

Establishing quantum entanglement between individual nodes is crucial for building large-scale quantum networks, enabling secure quantum communication, distributed quantum computing, enhanced quantum metrology and fundamental tests of quantum mechanics. However, the shared entanglements have been merely observed in either extremely low-temperature or well-isolated systems, which limits the quantum networks for the real-life applications. Here, we report the realization of heralding quantum entanglement between two atomic ensembles at room temperature, where each of them contains billions of motional atoms. By measuring the mapped-out entangled state with quantum interference, concurrence and correlation, we strongly verify the existence of a single excitation delocalized in two atomic ensembles. Remarkably, the heralded quantum entanglement of atomic ensembles can be operated with the feature of delay-choice, which illustrates the essentiality of the built-in quantum memory. The demonstrated building block paves the way for constructing quantum networks and distributing entanglement across multiple remote nodes at ambient conditions.

preprint2020arXiv

Identifying At-Risk K-12 Students in Multimodal Online Environments: A Machine Learning Approach

With the rapid emergence of K-12 online learning platforms, a new era of education has been opened up. It is crucial to have a dropout warning framework to preemptively identify K-12 students who are at risk of dropping out of the online courses. Prior researchers have focused on predicting dropout in Massive Open Online Courses (MOOCs), which often deliver higher education, i.e., graduate level courses at top institutions. However, few studies have focused on developing a machine learning approach for students in K-12 online courses. In this paper, we develop a machine learning framework to conduct accurate at-risk student identification specialized in K-12 multimodal online environments. Our approach considers both online and offline factors around K-12 students and aims at solving the challenges of (1) multiple modalities, i.e., K-12 online environments involve interactions from different modalities such as video, voice, etc; (2) length variability, i.e., students with different lengths of learning history; (3) time sensitivity, i.e., the dropout likelihood is changing with time; and (4) data imbalance, i.e., only less than 20\% of K-12 students will choose to drop out the class. We conduct a wide range of offline and online experiments to demonstrate the effectiveness of our approach. In our offline experiments, we show that our method improves the dropout prediction performance when compared to state-of-the-art baselines on a real-world educational dataset. In our online experiments, we test our approach on a third-party K-12 online tutoring platform for two months and the results show that more than 70\% of dropout students are detected by the system.

preprint2020arXiv

Large anisotropic topological Hall effect in a hexagonal non-collinear magnet Fe5Sn3

We report the observation of a large anisotropic topological Hall effect (THE) in the hexagonal non-collinear magnet Fe5Sn3 single crystals. It is found that the sign of the topological Hall resistivity is negative when a magnetic field H perpendicular to the bc-plane (H\perp bc-plane), however, it changes form negative to positive when H parallel to the c-axis (H\parallel c-axis). The value of topological Hall resistivity increased with the increasing temperature and reached approximately -2.12 μΩcm (H\perp bc-plane) and 0.5 μΩcm (H\parallel c-axis) at 350 K, respectively. Quantitative analyses of the measured data suggest that the observed anisotropic THE may originate from the opposite scalar spin chirality induced by the magnetic fields perpendicular and parallel to the c-axis, respectively.

preprint2020arXiv

Large anomalous Hall angle in a topological semimetal candidate TbPtBi

The magnetotransport properties in antiferromagnetic half-Heusler single crystals of TbPtBi, a magnetic-field-induced topological semimetal with simple band structure, are investigated. We found that a nonmonotonic magnetic field dependence of the anomalous Hall resistivity in a high magnetic field (B>7T), which come from the change of band structure induced by the Zeeman-like splitting when applying the external magnetic field. The experiment results show that credible anomalous Hall resistivity and conductivity reach up to 0.6798mΩcm and 125Ω-1cm-1, respectively. A large AHA up to 33% is obtained in TbPtBi, which is comparable to typical ferromagnetic Weyl semimetal. The analysis of results show it should be attributed to topological band around EF and low carrier density.

preprint2020arXiv

Large anomalous Hall effect in a hexagonal ferromagnetic Fe5Sn3 single crystal

In this paper, we report an experimental observation of the large anomalous Hall effect (AHE) in a hexagonal ferromagnetic Fe5Sn3 single crystal with current along the b axis and a magnetic field normal to the bc plane. The intrinsic contribution of the anomalous Hall conductance sigma_AH^int was approximately 613 Ω-1 cm-1, which was more than 3 times the maximum value in the frustrated kagome magnet Fe3Sn2 and nearly independent of the temperature over a wide range between 5 and 350 K. The analysis results revealed that the large AHE was dominated by a common, intrinsic term, while the extrinsic contribution, i.e., the skew scattering and side jump, turned out to be small. In addition to the large AHE, it was found the types of majority carriers changed at approximately 275 and 30 K, consistent with the critical temperatures of the spin reorientation. These findings suggest that the hexagonal ferromagnetic Fe5Sn3 single crystal is an excellent candidate to use for the study of the topological features in ferromagnets.

preprint2020arXiv

Magnetic topological insulator MnBi6Te10 with zero-field ferromagnetic state and gapped Dirac surface states

Magnetic topological insulators (TIs) with nontrivial topological electronic structure and broken time-reversal symmetry exhibit various exotic topological quantum phenomena. The realization of such exotic phenomena at high temperature is one of central topics in this area. We reveal that MnBi6Te10 is a magnetic TI with an antiferromagnetic ground state below 10.8 K whose nontrivial topology is manifested by Dirac-like surface states. The ferromagnetic axion insulator state with Z4 = 2 emerges once spins polarized at field as low as 0.1 T, accompanied with saturated anomalous Hall resistivity up to 10 K. Such a ferromagnetic state is preserved even external field down to zero at 2 K. Theoretical calculations indicate that the few-layer ferromagnetic MnBi6Te10 is also topologically nontrivial with a non-zero Chern number. Angle-resolved photoemission spectroscopy experiments further reveal three types of Dirac surface states arising from different terminations on the cleavage surfaces, one of which has insulating behavior with an energy gap of ~ 28 meV at the Dirac point. These outstanding features suggest that MnBi6Te10 is a promising system to realize various topological quantum effects at zero field and high temperature.

preprint2020arXiv

Many-body Resonance in a Correlated Topological Kagome Antiferromagnet

We use scanning tunneling microscopy/spectroscopy (STM/S) to elucidate the atomically resolved electronic structure in strongly correlated topological kagome magnet Mn$_3$Sn. In stark contrast to its broad single-particle electronic structure, we observe a pronounced resonance with a Fano line shape at the Fermi level resembling the many-body Kondo resonance. We find that this resonance does not arise from the step edges or atomic impurities, but the intrinsic kagome lattice. Moreover, the resonance is robust against the perturbation of a vector magnetic field, but broadens substantially with increasing temperature, signaling strongly interacting physics. We show that this resonance can be understood as the result of geometrical frustration and strong correlation based on the kagome lattice Hubbard model. Our results point to the emergent many-body resonance behavior in a topological kagome magnet.

preprint2020arXiv

Multifunctional Metasurface Design with a Generative Adversarial Network

Metasurfaces have enabled precise electromagnetic wave manipulation with strong potential to obtain unprecedented functionalities and multifunctional behavior in flat optical devices. These advantages in precision and functionality come at the cost of tremendous difficulty in finding individual meta-atom structures based on specific requirements (commonly formulated in terms of electromagnetic responses), which makes the design of multifunctional metasurfaces a key challenge in this field. In this paper, we present a Generative Adversarial Networks (GAN) that can tackle this problem and generate meta-atom/metasurface designs to meet multifunctional design goals. Unlike conventional trial-and-error or iterative optimization design methods, this new methodology produces on-demand free-form structures involving only a single design iteration. More importantly, the network structure and the robust training process are independent of the complexity of design objectives, making this approach ideal for multifunctional device design. Additionally, the ability of the network to generate distinct classes of structures with similar electromagnetic responses but different physical features could provide added latitude to accommodate other considerations such as fabrication constraints and tolerances. We demonstrate the network's ability to produce a variety of multifunctional metasurface designs by presenting a bifocal metalens, a polarization-multiplexed beam deflector, a polarization-multiplexed metalens and a polarization-independent metalens.

preprint2020arXiv

Multimodal Learning For Classroom Activity Detection

Classroom activity detection (CAD) focuses on accurately classifying whether the teacher or student is speaking and recording both the length of individual utterances during a class. A CAD solution helps teachers get instant feedback on their pedagogical instructions. This greatly improves educators' teaching skills and hence leads to students' achievement. However, CAD is very challenging because (1) the CAD model needs to be generalized well enough for different teachers and students; (2) data from both vocal and language modalities has to be wisely fused so that they can be complementary; and (3) the solution shouldn't heavily rely on additional recording device. In this paper, we address the above challenges by using a novel attention based neural framework. Our framework not only extracts both speech and language information, but utilizes attention mechanism to capture long-term semantic dependence. Our framework is device-free and is able to take any classroom recording as input. The proposed CAD learning framework is evaluated in two real-world education applications. The experimental results demonstrate the benefits of our approach on learning attention based neural network from classroom data with different modalities, and show our approach is able to outperform state-of-the-art baselines in terms of various evaluation metrics.

preprint2020arXiv

Multipartite Entanglement of Billions of Motional Atoms Heralded by Single Photon

Quantum entanglement is of central importance to quantum computing, quantum metrology, quantum information as well as the nature of quantum physics. Quantum theory does not prevent entanglement from being created and observed in macroscopic physical systems, in reality however, the accessible scale of entanglement is still very limited due to decoherence effects. Recently, entanglement has been observed among atoms from thousands to millions level in extremely low-temperature and well-isolated systems. Here, we create multipartite entanglement of billions of motional atoms in a quantum memory at room temperature, and certify the genuine entanglement via $M$-separability witness associated with photon statistics. The information contained in a single photon is found strongly correlated with the excitation shared by the motional atoms, which intrinsically address the large system and therefore stimulate the multipartite entanglement. Remarkably, our heralded and quantum memory built-in entanglement generation allows us to directly observe the dynamic evolution of entanglement depth and further to reveal the effects of decoherence. Our results verify the existence of genuine multipartite entanglement among billions of motional atoms at ambient condition, significantly extending the boundary of the accessible scale of entanglement. Besides probing the quantum-to-classical transition in an entirely new realm, the developed abilities of manipulating such a large-scale entanglement may enhance a wide spectrum of applications for emerging quantum technologies.

preprint2020arXiv

Scene Graph Reasoning for Visual Question Answering

Visual question answering is concerned with answering free-form questions about an image. Since it requires a deep linguistic understanding of the question and the ability to associate it with various objects that are present in the image, it is an ambitious task and requires techniques from both computer vision and natural language processing. We propose a novel method that approaches the task by performing context-driven, sequential reasoning based on the objects and their semantic and spatial relationships present in the scene. As a first step, we derive a scene graph which describes the objects in the image, as well as their attributes and their mutual relationships. A reinforcement agent then learns to autonomously navigate over the extracted scene graph to generate paths, which are then the basis for deriving answers. We conduct a first experimental study on the challenging GQA dataset with manually curated scene graphs, where our method almost reaches the level of human performance.

preprint2020arXiv

Siamese Neural Networks for Class Activity Detection

Classroom activity detection (CAD) aims at accurately recognizing speaker roles (either teacher or student) in classrooms. A CAD solution helps teachers get instant feedback on their pedagogical instructions. However, CAD is very challenging because (1) classroom conversations contain many conversational turn-taking overlaps between teachers and students; (2) the CAD model needs to be generalized well enough for different teachers and students; and (3) classroom recordings may be very noisy and low-quality. In this work, we address the above challenges by building a Siamese neural framework to automatically identify teacher and student utterances from classroom recordings. The proposed model is evaluated on real-world educational datasets. The results demonstrate that (1) our approach is superior on the prediction tasks for both online and offline classroom environments; and (2) our framework exhibits robustness and generalization ability on new teachers (i.e., teachers never appear in training data).

preprint2020arXiv

Spelling Error Correction with Soft-Masked BERT

Spelling error correction is an important yet challenging task because a satisfactory solution of it essentially needs human-level language understanding ability. Without loss of generality we consider Chinese spelling error correction (CSC) in this paper. A state-of-the-art method for the task selects a character from a list of candidates for correction (including non-correction) at each position of the sentence on the basis of BERT, the language representation model. The accuracy of the method can be sub-optimal, however, because BERT does not have sufficient capability to detect whether there is an error at each position, apparently due to the way of pre-training it using mask language modeling. In this work, we propose a novel neural architecture to address the aforementioned issue, which consists of a network for error detection and a network for error correction based on BERT, with the former being connected to the latter with what we call soft-masking technique. Our method of using `Soft-Masked BERT' is general, and it may be employed in other language detection-correction problems. Experimental results on two datasets demonstrate that the performance of our proposed method is significantly better than the baselines including the one solely based on BERT.

preprint2020arXiv

Superpixel-Guided Label Softening for Medical Image Segmentation

Segmentation of objects of interest is one of the central tasks in medical image analysis, which is indispensable for quantitative analysis. When developing machine-learning based methods for automated segmentation, manual annotations are usually used as the ground truth toward which the models learn to mimic. While the bulky parts of the segmentation targets are relatively easy to label, the peripheral areas are often difficult to handle due to ambiguous boundaries and the partial volume effect, etc., and are likely to be labeled with uncertainty. This uncertainty in labeling may, in turn, result in unsatisfactory performance of the trained models. In this paper, we propose superpixel-based label softening to tackle the above issue. Generated by unsupervised over-segmentation, each superpixel is expected to represent a locally homogeneous area. If a superpixel intersects with the annotation boundary, we consider a high probability of uncertain labeling within this area. Driven by this intuition, we soften labels in this area based on signed distances to the annotation boundary and assign probability values within [0, 1] to them, in comparison with the original "hard", binary labels of either 0 or 1. The softened labels are then used to train the segmentation models together with the hard labels. Experimental results on a brain MRI dataset and an optical coherence tomography dataset demonstrate that this conceptually simple and implementation-wise easy method achieves overall superior segmentation performances to baseline and comparison methods for both 3D and 2D medical images.

preprint2020arXiv

Thermally induced generation and annihilation of magnetic chiral skyrmion bubbles and achiral bubbles in Mn-Ni-Ga Magnets

Magnetic chiral skyrmion bubbles and achiral bubbles are two independent magnetic domain structures, in which the former with equivalent winding number to skyrmions offers great promise as information carriers for further spintronic devices. Here, in this work, we experimentally investigate the generation and annihilation of magnetic chiral skyrmion bubbles and achiral bubbles in the Mn-Ni-Ga thin plate by using the Lorentz transmission electron microscopy (L-TEM). The two independent magnetic domain structures can be directly controlled after the field cooling manipulation by varying the titled angles of external magnetic fields. By imaging the magnetization reversal with increasing temperature, we found an extraordinary annihilation mode of magnetic chiral skyrmion bubbles and a non-linear frequency for the winding number reversal. Quantitative analysis of such dynamics was performed by using L-TEM to directly determine the barrier energy for the magnetization reversal of magnetic chiral skyrmion bubbles.

preprint2019arXiv

Current-Induced Helicity Reversal of a Single Skyrmionic Bubble Chain in a Nanostructured Frustrated Magnet

Helicity indicates the in-plane magnetic-moment swirling direction of a skyrmionic configuration. The ability to reverse the helicity of a skyrmionic bubble via purely electrical means has been predicted in frustrated magnetic systems, however its experimental observation has remained challenging. Here, we experimentally demonstrate the current-driven helicity reversal of the skyrmionic bubble in a nanostructured frustrated Fe3Sn2 magnet. The critical current density required to trigger the helicity reversal is 109 - 1010 A/m2, with a corresponding pulse-width varying from 1 μs to 100 ns. Computational simulations reveal that both the pinning effect and dipole-dipole interaction play a crucial role in the helicity-reversal process.

preprint2019arXiv

Frequency tunable topological edge states of two-dimensional honeycomb lattice photonic crystals

In this paper, the photonic quantum spin Hall effect (PQSHE) is realized in dielectric two-dimensional (2D) honeycomb lattice photonic crystal (PC) by stretching and shrinking the honeycomb unit cell. Combining two honeycomb lattice PCs with a common photonic band gap (PBG) but different band topologies can generate a topologically protected edge state at the combined junction. The topological edge states and their unidirectional transmission as the scatterers with triangular, pentagonal, and heptagonal shapes are researched. Meanwhile, the unidirectional transmission in an inverted Ω-shaped waveguide with large bending angle is realized, and verifies the characteristics of the topological protection by adding different kind of defects. Moreover, the frequency varies significantly when changing the scatterers shape, which shows that the PC with various scatterers shape can tune the frequency range of the topological edge state significantly. In other words, it can adjust the frequency of unidirectional transmission and increase the adjustability of the topological edge state.

preprint2019arXiv

Observation of Magnetic Skyrmion Bubbles in a van der Waals ferromagnet Fe3GeTe2

Two-dimensional (2D) van der Waals (vdW) magnetic materials have recently been introduced as a new horizon in materials science and enable the potential applications for next-generation spintronic devices. Here, in this communication, the observations of stable Bloch-type magnetic skyrmions in single crystals of 2D vdW Fe3GeTe2 (FGT) are reported by using in-situ Lorentz transmission electron microscopy (TEM). We find the ground-state magnetic stripe domains in FGT transform into skyrmion bubbles when an external magnetic field is applied perpendicularly to the (001) thin plate with temperatures below the Curie-temperature TC. Most interestingly, a hexagonal lattice of skyrmion bubbles is obtained via field cooling manipulation with magnetic field applied along the [001] direction. Owing to their topological stability, the skyrmion bubble lattices are stable to large field-cooling tilted angles and further reproduced by utilizing the micromagnetic simulations. These observations directly demonstrate that the 2D vdW FGT possesses a rich variety of topological spin textures, being of a great promise candidate for future applications in the field of spintronics.