Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
35works
0followers
26topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

35 published item(s)

preprint2026arXiv

Causal Invariance Learning via Efficient Nonconvex Optimization

Identifying the causal relationship among variables from observational data is an important yet challenging task. This work focuses on identifying the direct causes of an outcome and estimating their magnitude, i.e., learning the causal outcome model. Data from multiple environments provide valuable opportunities to uncover causality by exploiting the invariance principle that the causal outcome model holds across heterogeneous environments. Based on the invariance principle, we propose the Negative Weighted Distributionally Robust Optimization (NegDRO) framework to learn an invariant prediction model. NegDRO minimizes the worst-case combination of risks across multiple environments and enforces invariance by allowing potential negative weights. Under the additive interventions regime, we establish three major contributions: (i) On the statistical side, we provide sufficient and nearly necessary identification conditions under which the invariant prediction model coincides with the causal outcome model; (ii) On the optimization side, despite the nonconvexity of NegDRO, we establish its benign optimization landscape, where all stationary points lie close to the true causal outcome model; (iii) On the computational side, we develop a gradient-based algorithm that provably converges to the causal outcome model, with non-asymptotic convergence rates in both sample size and gradient-descent iterations. In particular, our method avoids exhaustive combinatorial searches over exponentially many subsets of covariates found in the literature, ensuring scalability even when the dimension of the covariates is large. To our knowledge, this is the first causal invariance learning method that finds the approximate global optimality for a nonconvex optimization problem efficiently.

preprint2026arXiv

SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection

Despite significant advances in vision-language understanding, implementing image segmentation within multimodal architectures remains a fundamental challenge in modern artificial intelligence systems. Existing vision-language models, which primarily rely on backbone architectures or CLIP-based embedding learning, demonstrate inherent limitations in fine-grained spatial localization and operational capabilities. This paper introduces SJTU: Spatial Judgments in Multimodal Models - Towards Unified Segmentation through Coordinate Detection, a framework that leverages spatial coordinate understanding to bridge vision-language interaction and precise segmentation, enabling accurate target identification through natural language instructions. The framework presents an approach for integrating segmentation techniques with vision-language models through spatial inference in multimodal space. By utilizing normalized coordinate detection for bounding boxes and transforming them into actionable segmentation outputs, we establish a connection between spatial and language representations in multimodal architectures. Experimental results demonstrate superior performance across benchmark datasets, achieving IoU scores of 0.5958 on COCO 2017 and 0.6758 on Pascal VOC. Testing on a single NVIDIA RTX 3090 GPU with 512x512 resolution images yields an average inference time of 7 seconds per image, demonstrating the framework's effectiveness in both accuracy and practical deployability. The project code is available at https://github.com/jw-chae/SJTU

preprint2025arXiv

Atomic-scale spin sensing of a 2D $d$-wave altermagnet via helical tunneling

Altermagnetism simultaneously possesses nonrelativistic spin responses and zero net magnetization, thus combining advantages of ferromagnetism and antiferromagnetism. This superiority originates from its unique dual feature, i.e., opposite-magnetic sublattices in real space and alternating spin polarization in momentum space enforced by the same crystal symmetry. Therefore, the determination of an altermagnetic order and its unique spin response inherently necessitates atomic-scale spin-resolved measurements in real and momentum spaces, an experimental milestone yet to be achieved. Here, via utilizing the helical edge (hinge) modes of a higher order topological insulator as the spin sensor, we realize spin-resolved scanning tunneling microscopy which enables us to pin down the dual-space feature of a layered $d$-wave altermagnet, KV$_2$Se$_2$O. In real space, atomic-registered mapping demonstrates the checkerboard antiferromagnetic order together with density-wave lattice modulation, and in momentum space, spin-resolved spectroscopic imaging provides a direct visualization of d-wave spin splitting of the band structure. Critically, using this new topology-guaranteed spin filter we directly reveal the unidirectional, spin-polarized quasiparticle excitations originating from the crystal symmetry-paired X and Y valleys around opposite magnetic sublattices simultaneously --the unique spin response for $d$-wave altermagnetism. Our experiments establish a solid basis for the exploration and utilization of altermagnetism in layered materials and further facilitate access to atomic-scale spin sensing and manipulating of 2D quantum materials.

preprint2025arXiv

Domain wall skyrmion-based magnonic crystal

Magnonic waveguide based on domain wall (DW) is considered as a crucial breakthrough toward the realization of magnonic nanocircuits. However, the effective control of spin waves propagating in DWs remains to be explored. Here, we construct a magnonic crystal (MC) by using a chain of the domain wall skyrmions (DWSKs) to manipulate the spin-wave propagation in DWs. We show that the DWSK chain can be created by leveraging voltage-controlled Dzyaloshinskii-Moriya interaction. The DWSK-based MC opens magnonic bandgaps, which can be dynamically adjusted through magnetic fields modulating the DWSK size. Furthermore, the manipulation of spin waves by the DWSK-based MC maintains robust in curved DW, demonstrating its adaptability to complex device architectures. Our work provides an effective method to control the spin-wave propagation in DWs and paves the way for designing energy-efficient magnonic nanocircuits.

preprint2025arXiv

Observation of robust one-dimensional edge channels in a three-dimensional quantum spin Hall insulator

Topologically protected edge channels show prospects for quantum devices. They have been found experimentally in two-dimensional (2D) quantum spin Hall insulators (QSHIs), weak topological insulators and higher-order topological insulators (HOTIs), but the number of materials realizing these topologies is still quite limited. Here, we provide evidence for topological edge states within a novel topology named three-dimensional (3D) QSHIs. Its topology originates solely from a nonzero $S_z$ spin Chern number for each $k_z$ plane of the crystal and is realized in bulk $α$-Bi$_4$I$_4$ with trivial symmetry indicators, as we show by density functional theory calculations. We experimentally observe the related edge states at each type of monolayer and bilayer step of this material by scanning tunneling microscopy. Consistently, the edge states are neither interrupted, nor backscattered by defects at the step edges corroborating their helical character as expected from the nontrivial topology. Furthermore, two individual edge channels are directly observed at bilayer steps without visible interaction gap opening, demonstrating the robustness of these edge modes against vertical stacking. Our results establish $α$-Bi$_4$I$_4$ as the first material realization of a 3D QSHI whose definition goes beyond the scope of topological symmetry indicators, and provide a pathway for realizing nearly-quantized spin Hall conductivity per unit cell in a bulk crystal.

preprint2023arXiv

Nonlinear Topological Magnon Spin Hall Effect

When a magnon passes through two-dimensional magnetic textures, it will experience a fictitious magnetic field originating from the $3\times 3$ skew-symmetric gauge fields. To date, only one of the three independent components of the gauge fields has been found to play a role in generating the fictitious magnetic field while the rest two are perfectly hidden. In this work, we show that they are concealed in the nonlinear magnon transport in magnetic textures. Without loss of generality, we theoretically study the nonlinear magnon-skyrmion interaction in antiferromagnets. By analyzing the scattering features of three-magnon processes between the circularly-polarized incident magnon and breathing skyrmion, we predict a giant Hall angle of both the confluence and splitting modes. Furthermore, we find that the Hall angle reverses its sign when one switches the handedness of the incident magnons. We dub it nonlinear topological magnon spin Hall effect. Our findings are deeply rooted in the bosonic nature of magnons that the particle number is not conserved, which has no counterpart in low-energy fermionic systems, and may open the door for probing gauge fields by nonlinear means.

preprint2022arXiv

A collaborative decomposition-based evolutionary algorithm integrating normal and penalty-based boundary intersection for many-objective optimization

Decomposition-based evolutionary algorithms have become fairly popular for many-objective optimization in recent years. However, the existing decomposition methods still are quite sensitive to the various shapes of frontiers of many-objective optimization problems (MaOPs). On the one hand, the cone decomposition methods such as the penalty-based boundary intersection (PBI) are incapable of acquiring uniform frontiers for MaOPs with very convex frontiers. On the other hand, the parallel reference lines of the parallel decomposition methods including the normal boundary intersection (NBI) might result in poor diversity because of under-sampling near the boundaries for MaOPs with concave frontiers. In this paper, a collaborative decomposition method is first proposed to integrate the advantages of parallel decomposition and cone decomposition to overcome their respective disadvantages. This method inherits the NBI-style Tchebycheff function as a convergence measure to heighten the convergence and uniformity of distribution of the PBI method. Moreover, this method also adaptively tunes the extent of rotating an NBI reference line towards a PBI reference line for every subproblem to enhance the diversity of distribution of the NBI method. Furthermore, a collaborative decomposition-based evolutionary algorithm (CoDEA) is presented for many-objective optimization. A collaborative decomposition-based environmental selection mechanism is primarily designed in CoDEA to rank all the individuals associated with the same PBI reference line in the boundary layer and pick out the best ranks. CoDEA is compared with several popular algorithms on 85 benchmark test instances. The experimental results show that CoDEA achieves high competitiveness benefiting from the collaborative decomposition maintaining a good balance among the convergence, uniformity, and diversity of distribution.

preprint2022arXiv

A new class of bilayer kagome lattice compounds with Dirac nodal lines and pressure-induced superconductivity

Kagome lattice composed of transition-metal ions provides a great opportunity to explore the intertwining between geometry, electronic orders and band topology. The discovery of multiple competing orders that connect intimately with the underlying topological band structure in nonmagnetic kagome metals $A$V$_3$Sb$_5$ ($A$ = K, Rb, Cs) further pushes this topic to the quantum frontier. Here we report the discovery and characterization of a new class of vanadium-based compounds with kagome bilayers, namely $A$V$_6$Sb$_6$ ($A$ = K, Rb, Cs) and V$_6$Sb$_4$, which, together with $A$V$_3$Sb$_5$, compose a series of kagome compounds with a generic chemical formula ($A_{m-1}$Sb$_{2m}$)(V$_3$Sb)$_n$ (m = 1, 2; n = 1, 2). Theoretical calculations combined with angle-resolved photoemission measurements reveal that these compounds feature Dirac nodal lines in close vicinity to the Fermi level. Pressure-induced superconductivity in $A$V$_6$Sb$_6$ further suggests promising emergent phenomena in these materials. The establishment of a new family of layered kagome materials paves the way for designer of fascinating kagome systems with diverse topological nontrivialities and collective ground states.

preprint2022arXiv

A Review on Serious Games for Exercise Rehabilitation

Disability is an important factor affecting todays society. At the same time, more and more sub-healthy people are sick due to reduced body functions and cognitive functions. Exercise rehabilitation is a kind of physical therapy, which can recover the motor ability, cognitive ability, and mental state of them through exercise. But the traditional exercise rehabilitation has some drawbacks so that people who need exercise rehabilitation cannot stick to it. Therefore, many researchers improved the drawbacks of traditional exercise rehabilitation by serious games for exercise rehabilitation. Although there were abundant achievements in the games, its relevant technologies and representative games are not be summarized systematically. To fill this gap, we introduced the significance of the convergence of exercise rehabilitation and serious games. Then, our paper sorted out the development of the games based on interaction mode between games and players. Besides, we analyzed the characteristics of different user groups and the specific functions of the games corresponding to them, and gave our classification based on this. Based on the classification, we reviewed related studies of the games in the past decade years and gave some suggestions on game design and development. Finally, we proposed serval research directions worth studying about the games technology development, functional design and social popularization.

preprint2022arXiv

All-magnonic Stern-Gerlach effect in antiferromagnets

The Stern-Gerlach (SG) effect is well known as the spin-dependent splitting of a beam of atoms carrying magnetic moments by a magnetic-field gradient, leading to the concept of electron spin. Antiferromagnets can accommodate two magnon modes with opposite spin polarizations, which is equivalent to the spin property of electrons. Here, we propose the existence of an all-magnonic SG effect in antiferromagnetic magnonic system, where a linearly polarized spin-wave beam is deflected by a straight Dzyaloshinskii-Moriya interaction (DMI) interface into two opposite polarized spin-wave beams propagating in two discrete directions. Moreover, we observe bi-focusing of antiferromagnetic spin waves induced by a curved DMI interface, which can also spatially separate thermal magnons with opposite polarizations. Our findings provide a unique perspective to understand the rich phenomena associated with antiferromagnetic magnon spin and would be helpful for polarization-dependent application of antiferromagnetic spintronic devices.

preprint2022arXiv

Generation of twisted magnons via spin-to-orbital angular momentum conversion

Twisted magnons (TMs) carrying orbital angular momentum (OAM) have attracted much attention from the magnonic community. The fabrication of such novel magnon state however is still challenging. Here we present a simple method to generate TMs with arbitrary radial and azimuthal quantum numbers through the spin-to-orbital angular momentum conversion. The conversion rate from plane-wave magnons to twisted ones is shown to be insensitive to the quantum index. The spectrum of TMs in thin nanodisks is solved analytically, showing a good agreement with micromagnetic simulations. Moreover, we numerically study the propagation of TMs in magnetic nanodisk arrays and obtain the quantitative dependence of the decay length on quantum indexes. Our results are helpful for realizing TMs with large OAMs that are indispensable for future high-capacity magnonic communications and computings.

preprint2022arXiv

Hybrid Physical Metric For 6-DoF Grasp Pose Detection

6-DoF grasp pose detection of multi-grasp and multi-object is a challenge task in the field of intelligent robot. To imitate human reasoning ability for grasping objects, data driven methods are widely studied. With the introduction of large-scale datasets, we discover that a single physical metric usually generates several discrete levels of grasp confidence scores, which cannot finely distinguish millions of grasp poses and leads to inaccurate prediction results. In this paper, we propose a hybrid physical metric to solve this evaluation insufficiency. First, we define a novel metric is based on the force-closure metric, supplemented by the measurement of the object flatness, gravity and collision. Second, we leverage this hybrid physical metric to generate elaborate confidence scores. Third, to learn the new confidence scores effectively, we design a multi-resolution network called Flatness Gravity Collision GraspNet (FGC-GraspNet). FGC-GraspNet proposes a multi-resolution features learning architecture for multiple tasks and introduces a new joint loss function that enhances the average precision of the grasp detection. The network evaluation and adequate real robot experiments demonstrate the effectiveness of our hybrid physical metric and FGC-GraspNet. Our method achieves 90.5\% success rate in real-world cluttered scenes. Our code is available at https://github.com/luyh20/FGC-GraspNet.

preprint2022arXiv

Impact of Naturalistic Field Acoustic Environments on Forensic Text-independent Speaker Verification System

Audio analysis for forensic speaker verification offers unique challenges in system performance due in part to data collected in naturalistic field acoustic environments where location/scenario uncertainty is common in the forensic data collection process. Forensic speech data as potential evidence can be obtained in random naturalistic environments resulting in variable data quality. Speech samples may include variability due to vocal efforts such as yelling over 911 emergency calls, whereas others might be whisper or situational stressed voice in a field location or interview room. Such speech variability consists of intrinsic and extrinsic characteristics and makes forensic speaker verification a complicated and daunting task. Extrinsic properties include recording equipment such as microphone type and placement, ambient noise, room configuration including reverberation, and other environmental scenario-based issues. Some factors, such as noise and non-target speech, will impact the verification system performance by their mere presence. To investigate the impact of field acoustic environments, we performed a speaker verification study based on the CRSS-Forensic corpus with audio collected from 8 field locations including police interviews. This investigation includes an analysis of the impact of seven unseen acoustic environments on speaker verification system performance using an x-Vector system.

preprint2022arXiv

Noisy Boundaries: Lemon or Lemonade for Semi-supervised Instance Segmentation?

Current instance segmentation methods rely heavily on pixel-level annotated images. The huge cost to obtain such fully-annotated images restricts the dataset scale and limits the performance. In this paper, we formally address semi-supervised instance segmentation, where unlabeled images are employed to boost the performance. We construct a framework for semi-supervised instance segmentation by assigning pixel-level pseudo labels. Under this framework, we point out that noisy boundaries associated with pseudo labels are double-edged. We propose to exploit and resist them in a unified manner simultaneously: 1) To combat the negative effects of noisy boundaries, we propose a noise-tolerant mask head by leveraging low-resolution features. 2) To enhance the positive impacts, we introduce a boundary-preserving map for learning detailed information within boundary-relevant regions. We evaluate our approach by extensive experiments. It behaves extraordinarily, outperforming the supervised baseline by a large margin, more than 6% on Cityscapes, 7% on COCO and 4.5% on BDD100k. On Cityscapes, our method achieves comparable performance by utilizing only 30% labeled images.

preprint2022arXiv

Nonreciprocal transport in a bilayer of MnBi2Te4 and Pt

MnBi2Te4 (MBT) is the first intrinsic magnetic topological insulator with the interaction of spin-momentum locked surface electrons and intrinsic magnetism, and it exhibits novel magnetic and topological phenomena. Recent studies suggested that the interaction of electrons and magnetism can be affected by the Mn-doped Bi2Te3 phase at the surface due to inevitable structural defects. Here we report an observation of nonreciprocal transport, i.e. current-direction-dependent resistance, in a bilayer composed of antiferromagnetic MBT and nonmagnetic Pt. The emergence of the nonreciprocal response below the Néel temperature confirms a correlation between nonreciprocity and intrinsic magnetism in the surface state of MBT. The angular dependence of the nonreciprocal transport indicates that nonreciprocal response originates from the asymmetry scattering of electrons at the surface of MBT mediated by magnon. Our work provides an insight into nonreciprocity arising from the correlation between magnetism and Dirac surface electrons in intrinsic magnetic topological insulators.

preprint2022arXiv

Pressure-induced dimensional crossover in a kagome superconductor

The recently discovered kagome superconductors AV3Sb5 exhibit tantalizing high-pressure phase diagrams, in which a new dome-like superconducting phase emerges under moderate pressure. However, its origin is as yet unknown. Here, we carried out the high-pressure electrical measurements up to 150 GPa, together with the high-pressure X-ray diffraction measurements and first-principles calculations on CsV3Sb5. We find the new superconducting phase to be rather robust and inherently linked to the interlayer Sb2-Sb2 interactions. The formation of Sb2-Sb2 bonds at high pressure tunes the system from two-dimensional to three-dimensional and pushes the Pz orbital of Sb2 upward across the Fermi level, resulting in enhanced density of states and increase of TC. Our work demonstrates that the dimensional crossover at high pressure can induce a topological phase transition and is related to the abnormal high-pressure TC evolution. Our findings should apply for other layered materials.

preprint2022arXiv

Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation

Depth estimation is solved as a regression or classification problem in existing learning-based multi-view stereo methods. Although these two representations have recently demonstrated their excellent performance, they still have apparent shortcomings, e.g., regression methods tend to overfit due to the indirect learning cost volume, and classification methods cannot directly infer the exact depth due to its discrete prediction. In this paper, we propose a novel representation, termed Unification, to unify the advantages of regression and classification. It can directly constrain the cost volume like classification methods, but also realize the sub-pixel depth prediction like regression methods. To excavate the potential of unification, we design a new loss function named Unified Focal Loss, which is more uniform and reasonable to combat the challenge of sample imbalance. Combining these two unburdened modules, we present a coarse-to-fine framework, that we call UniMVSNet. The results of ranking first on both DTU and Tanks and Temples benchmarks verify that our model not only performs the best but also has the best generalization ability.

preprint2022arXiv

Twisted Magnon Frequency Comb and Penrose Superradiance

Quantization effects of the nonlinear magnon-vortex interaction in ferromagnetic nanodisks are studied. We show that the circular geometry twists the spin-wave fields with spiral phase dislocations carrying quantized orbital angular momentum (OAM). Meanwhile, the confluence and splitting scattering of twisted magnons off the gyrating vortex core (VC) generates a frequency comb consisting of discrete and equally spaced spectral lines, dubbed as twisted magnon frequency comb (tMFC). It is found that the mode spacing of the tMFC is equal to the gyration frequency of the VC and the OAM quantum numbers between adjacent spectral lines differ by one. By applying a magnetic field perpendicular to the plane of a thick nanodisk, we observe a magnonic Penrose superradiance inside the cone vortex state, which mimics the amplification of waves scattered from a rotating black hole. It is demonstrated that the higher-order modes of tMFC are significantly amplified while the lower-order ones are trapped within the VC gyrating orbit which manifests as the ergoregion. These results suggest a promising way to generate twisted magnons with large OAM and to drastically improve the flatness of the magnon comb.

preprint2022arXiv

Zero Bias Power Detector Circuits based on MoS$_2$ Field Effect Transistors on Wafer-Scale Flexible Substrates

We demonstrate the design, fabrication, and characterization of wafer-scale, zero-bias power detectors based on two-dimensional MoS$_2$ field effect transistors (FETs). The MoS$_2$ FETs are fabricated using a wafer-scale process on 8 $μ$m thick polyimide film, which in principle serves as flexible substrate. The performances of two CVD-MoS$_2$ sheets, grown with different processes and showing different thicknesses, are analyzed and compared from the single device fabrication and characterization steps to the circuit level. The power detector prototypes exploit the nonlinearity of the transistors above the cut-off frequency of the devices. The proposed detectors are designed employing a transistor model based on measurement results. The fabricated circuits operate in Ku-band between 12 and 18 GHz, with a demonstrated voltage responsivity of 45 V/W at 18 GHz in the case of monolayer MoS2 and 104 V/W at 16 GHz in the case of multilayer MoS$_2$, both achieved without applied DC bias. They are the best performing power detectors fabricated on flexible substrate reported to date. The measured dynamic range exceeds 30 dB outperforming other semiconductor technologies like silicon complementary metal oxide semiconductor (CMOS) circuits and GaAs Schottky diodes.

preprint2021arXiv

Consensus-Based Decentralized Energy Trading for Distributed Energy Resources

In smart grids, distributed energy resources (DERs) have penetrated residential zones to provide a new form of electricity supply, mainly from renewable energy. Residential households and commercial buildings with DERs have become prosumers in the local grids, since they can sell surplus power to others. Researches have been initiated to integrate and utilize DERs through better control and communication strategies. With the advances in the Internet of Things (IoT) technology, unprecedented coordination among DERs can be achieved to facilitate energy trading and transactive energy management. However, preventing leakage of users' information during the optimization process keeps challenging researchers, which drives them to develop privacy-preserving energy management systems. In this paper, we develop a fully decentralized transactive energy management using the consensus-based algorithm. To be specific, we design a virtual pool for prosumers to trade energy and exchange information with IoT technologies' support. The consensus-based algorithm enables prosumers to obtain the optimal energy schedule independently in a coordinated manner without revealing any personal data. We use real-world data to perform simulations and validate our developed algorithm. The results show that our consensus-based decentralized transactive energy management strategy is feasible and can significantly reduce the overall system cost.

preprint2021arXiv

Magnetic skyrmion generation by reflective spin-wave focusing

We propose a method to generate magnetic skyrmions by focusing spin waves totally reflected by a curved film edge. Based on the principle of identical magnonic path length, we derive the edge contour that is parabolic and frequency-independent. Micromagnetic simulations are performed to verify our theoretical design. It is found that under proper conditions, magnetic droplet first emerges near the focal point where the spin-wave intensity has been significantly enhanced, and then converts to magnetic skyrmion accompanied by a change of the topological charge. The phase diagram about the amplitude and frequency of the driving field for skyrmion generation is obtained. Our finding would be helpful for the designment of spintronic devices combing the advantage of skyrmionics and magnonics.

preprint2021arXiv

Parallel selective nuclear spin addressing for fast high-fidelity quantum gates

Due to their long coherence times, nuclear spins have gained considerable attention as physical qubits. Two-qubit gates between nuclear spins of distinct resonance frequencies can be mediated by electron spins, usually employing a sequence of electron-nuclear gates. Here we present a different approach inspired by, but not limited to, NV centers in diamond and discuss possible applications. To this end we generalize external electron spin control sequences for nuclear spin initialization and hyperpolarization to achieve the simultaneous control of distinct nuclear spins via an electron spin. This approach results in efficient entangling gates that, compared to standard techniques, reduce the gate time by more than 50% when the gate time is limited by off-resonant coupling to other spins, and by up to 22% when the gate time is limited by small electron-nuclear coupling.

preprint2020arXiv

A multi-view approach for Mandarin non-native mispronunciation verification

Traditionally, the performance of non-native mispronunciation verification systems relied on effective phone-level labelling of non-native corpora. In this study, a multi-view approach is proposed to incorporate discriminative feature representations which requires less annotation for non-native mispronunciation verification of Mandarin. Here, models are jointly learned to embed acoustic sequence and multi-source information for speech attributes and bottleneck features. Bidirectional LSTM embedding models with contrastive losses are used to map acoustic sequences and multi-source information into fixed-dimensional embeddings. The distance between acoustic embeddings is taken as the similarity between phones. Accordingly, examples of mispronounced phones are expected to have a small similarity score with their canonical pronunciations. The approach shows improvement over GOP-based approach by +11.23% and single-view approach by +1.47% in diagnostic accuracy for a mispronunciation verification task.

preprint2020arXiv

AIBench: An Agile Domain-specific Benchmarking Methodology and an AI Benchmark Suite

Domain-specific software and hardware co-design is encouraging as it is much easier to achieve efficiency for fewer tasks. Agile domain-specific benchmarking speeds up the process as it provides not only relevant design inputs but also relevant metrics, and tools. Unfortunately, modern workloads like Big data, AI, and Internet services dwarf the traditional one in terms of code size, deployment scale, and execution path, and hence raise serious benchmarking challenges. This paper proposes an agile domain-specific benchmarking methodology. Together with seventeen industry partners, we identify ten important end-to-end application scenarios, among which sixteen representative AI tasks are distilled as the AI component benchmarks. We propose the permutations of essential AI and non-AI component benchmarks as end-to-end benchmarks. An end-to-end benchmark is a distillation of the essential attributes of an industry-scale application. We design and implement a highly extensible, configurable, and flexible benchmark framework, on the basis of which, we propose the guideline for building end-to-end benchmarks, and present the first end-to-end Internet service AI benchmark. The preliminary evaluation shows the value of our benchmark suite---AIBench against MLPerf and TailBench for hardware and software designers, micro-architectural researchers, and code developers. The specifications, source code, testbed, and results are publicly available from the web site \url{http://www.benchcouncil.org/AIBench/index.html}.

preprint2020arXiv

Cross-domain Adaptation with Discrepancy Minimization for Text-independent Forensic Speaker Verification

Forensic audio analysis for speaker verification offers unique challenges due to location/scenario uncertainty and diversity mismatch between reference and naturalistic field recordings. The lack of real naturalistic forensic audio corpora with ground-truth speaker identity represents a major challenge in this field. It is also difficult to directly employ small-scale domain-specific data to train complex neural network architectures due to domain mismatch and loss in performance. Alternatively, cross-domain speaker verification for multiple acoustic environments is a challenging task which could advance research in audio forensics. In this study, we introduce a CRSS-Forensics audio dataset collected in multiple acoustic environments. We pre-train a CNN-based network using the VoxCeleb data, followed by an approach which fine-tunes part of the high-level network layers with clean speech from CRSS-Forensics. Based on this fine-tuned model, we align domain-specific distributions in the embedding space with the discrepancy loss and maximum mean discrepancy (MMD). This maintains effective performance on the clean set, while simultaneously generalizes the model to other acoustic domains. From the results, we demonstrate that diverse acoustic environments affect the speaker verification performance, and that our proposed approach of cross-domain adaptation can significantly improve the results in this scenario.

preprint2020arXiv

Effect of Dzyaloshinskii-Moriya interaction on magnetic vortex switching driven by radial spin waves

We theoretically investigate the radial-spin-wave induced magnetic vortex switching in the presence of Dzyaloshinskii-Moriya interaction (DMI). From micromagnetic simulations, we observe a circular-to-radial vortex phase transition by increasing the DMI strength. The radial spin-wave excitation spectrum for each magnetization configuration is analyzed, showing that the frequency of spin-wave mode with a given radial node number monotonically increases (decreases) with the DMI parameter of the radial (circular) vortex. Interestingly, we find that the DMI can significantly facilitate the polarity switching of the circular vortex driven by radial spin waves. Our work provides a new insight into the DMI effect on the vortex dynamics and is helpful for designing fast all-magnonic memory devices.

preprint2020arXiv

Momentum Resolved Superconducting Energy Gaps of Sr$_2$RuO$_4$ from Quasiparticle Interference Imaging

Sr$_2$RuO$_4$ has long been the focus of intense research interest because of conjectures that it is a correlated topological superconductor. It is the momentum space (k-space) structure of the superconducting energy gap $Δ_i(\mathbf{k})$ on each band $i$ that encodes its unknown superconducting order-parameter. But, because the energy scales are so low, it has never been possible to directly measure the $Δ_i(\mathbf{k})$ of Sr$_2$RuO$_4$. Here we implement Bogoliubov quasiparticle interference (BQPI) imaging, a technique capable of high-precision measurement of multiband $Δ_i(\mathbf{k})$. At T=90 mK we visualize a set of Bogoliubov scattering interference wavevectors $q_j:j=1-5$ consistent with eight gap nodes/minima, that are all closely aligned to the $(\pm1,\pm1)$ crystal-lattice directions on both the $α$-and $β$-bands. Taking these observations in combination with other very recent advances in directional thermal conductivity (E. Hassinger et al. Phys. Rev. X 7, 011032 (2017)), temperature dependent Knight shift (A. Pustogow et al. Nature 574, 72 (2019)), time-reversal symmetry conservation (S. Kashiwaya et al. arXiv:1907.030939) and theory (A.T. Romer et al. Phys. Rev. Lett. 123, 247001 (2019); H. S. Roising et al. Phys. Rev. Research 1, 033108 (2019),O. Gingras et al. Phys. Rev. Lett. 123, 217005 (2019)), the BQPI signature of Sr$_2$RuO$_4$ appears most consistent with $Δ_i(\mathbf{k})$ having $d_{x^2-y^2}$ $(B_{1g})$ symmetry.

preprint2020arXiv

Off-axial focusing of spin-wave lens in the presence of Dzyaloshinskii-Moriya interaction

We theoretically study the effect of Dzyaloshinskii-Moriya interaction (DMI) on the focusing of a spin-wave lens that is constructed by a circular interface between two magnetic films. We analytically derive the generalized Snell's law in the curved geometry and the position of the focal point which exhibits a peculiar off-axial focusing behavior. We uncover a strong dependence of the focal point on both the material parameters and the frequency of incident spin waves. Full micromagnetic simulations compare well with theoretical predictions. Our findings would be helpful to manipulate spin waves in chiral magnets and to design functional magnonic devices.

preprint2020arXiv

Robust edge states in magnetic domain-wall racetrack

Controllable artificial pinning is indispensable in numerous domain-wall (DW) devices, such as memory, sensor, logic gate, and neuromorphic computing hardware. The high-accuracy determination of the effective spring constant of the pinning potential, however, remains challenging, because the extrinsic pinning is often mixed up with intrinsic ones caused by materials defects and randomness. Here, we study the collective dynamics of interacting DWs in a racetrack with pinning sites of alternate distances. By mapping the governing equations of DW motion to the Su-Schrieffer-Heeger model and evaluating the quantized Zak phase, we predict two topologically distinct phases in the racetrack. Robust edge state emerges at either one or both ends depending on the parity of the DW number and the ratio of alternating intersite lengths. We show that the in-gap DW oscillation frequency has a fixed value which depends only on the geometrical shape of the pinning notch, and is insensitive to device imperfections and inhomogeneities. We propose to accurately quantify the spring coefficient that equals the square of the robust DW frequency multiplied by its constant mass. Our findings suggest as well that the DW racetrack is an ideal platform to study the topological phase transition.

preprint2020arXiv

Super-resolved optical mapping of reactive sulfur-vacancy in 2D transition metal dichalcogenides

Transition metal dichalcogenides (TMDs) represent an entire new class of semiconducting 2D materials with exciting properties. Defects in 2D TMDs can crucially affect their physical and chemical properties. However, characterization of the presence and spatial distribution of defects is limited either in throughput or in resolution. Here, we demonstrate large area mapping of reactive sulfur-deficient defects in 2D-TMDs coupling single-molecule localization microscopy with fluorescence labeling using thiol chemistry. Our method, reminiscent of PAINT strategies, relies on the specific binding by reversible physisorption of fluorescent probes to sulfur-vacancies via a thiol group and their intermittent emission to apply localization of the labeled defects with a precision down to 15 nm. Tuning the distance between the fluorophore and the docking thiol site allows us to control Föster Resonance Energy Transfer (FRET) process and reveal large structural defects such as grain boundaries and line defects, due to the local irregular lattice structure. Our methodology provides a simple and fast alternative for large-scale mapping of non-radiative defects in 2D materials and paves the way for in-situ and spatially resolved monitoring of the interaction between chemical agent and the defects in 2D materials that has general implications for defect engineering in aqueous condition.

preprint2020arXiv

Twisted magnon as a magnetic tweezer

Wave fields with spiral phase dislocations carrying orbital angular momentum (OAM) have been realized in many branches of physics, such as for photons, sound waves, electron beams, and neutrons. However, the OAM states of magnons (spin waves)$-$the building block of modern magnetism$-$and particularly their implications have yet to be addressed. Here, we theoretically investigate the twisted spin-wave generation and propagation in magnetic nanocylinders. The OAM nature of magnons is uncovered by showing that the spin-wave eigenmode is also the eigenstate of the OAM operator in the confined geometry. Inspired by optical tweezers, we predict an exotic "magnetic tweezer" effect by showing skyrmion gyrations under twisted magnons in exchange coupled nanocylinder$|$nanodisk heterostructure, as a practical demonstration of magnonic OAM to manipulate topological spin defects. Our study paves the way for the emerging magnetic manipulations by harnessing the OAM degree of freedom of magnons.

preprint2019arXiv

Microscopic evidence for a chiral superconducting order parameter in the heavy fermion superconductor UTe2

Spin-triplet superconductivity is a condensate of electron pairs with spin-1 and an odd-parity wavefunction. A particularly interesting manifestation of triplet pairing is a chiral p-wave state which is topologically non-trivial and a natural platform for realizing Majorana edge modes. Triplet pairing is however rare in solid state systems and so far, no unambiguous identification has been made in any bulk compound. Since pairing is most naturally mediated by ferromagnetic spin fluctuations, uranium based heavy fermion systems containing f electron elements that can harbor both strong correlations and magnetism are considered ideal candidate spin-triplet superconductors. In this work we present scanning tunneling microscopy (STM) studies of the newly discovered heavy fermion superconductor, UTe2 with a T$_{SC}$ of 1.6 K. We find signatures of coexisting Kondo effect and superconductivity which show competing spatial modulations within one unit-cell. STM spectroscopy at step edges show signatures of chiral in-gap states, predicted to exist at the boundaries of a topological superconductor. Combined with existing data indicating triplet pairing, the presence of chiral edge states suggests that UTe2 is a strong candidate material for chiral-triplet topological superconductivity.

preprint2019arXiv

Signature of Dispersing 1D Majorana Channels in an Iron-based Superconductor

The possible realization of Majorana fermions as quasiparticle excitations in condensed matter physics has created much excitement. Most recent studies have focused on Majorana bound states which can serve as topological qubits. More generally, akin to elementary particles, Majorana fermions can propagate and display linear dispersion. These excitations have not yet been directly observed, and can also be used for quantum information processing. One route to realizing this is in a line junction between two phase-shifted superconductors coupled to topological surface states. Recent theory indicates that in iron-based superconductors, a particular type of crystalline defect, i.e., a domain wall (DW) between two regions with a half-unit cell shift between them, should create a $π$-phase shift in the superconducting order parameter. Combined with recent data showing topological surface states in FeSe$_x$Te$_{1-x}$ we find that this is the ideal system to realize helical 1D-dispersing Majorana modes. Here we report scanning tunneling spectroscopic (STS) measurements of crystalline DWs in FeSe$_{0.45}$Te$_{0.55}$. By analyzing large-area superconducting gap maps, we identify the gap in the topological surface state, demonstrating that our sample is an effective Fu-Kane proximitized topological system. We further locate DWs across which the atoms shift by half a unit cell. STS data on these DWs reveal a flat density of states inside the superconducting gap, a hallmark of linearly dispersing modes in 1D. This unique signature is absent in DWs in the related superconductor, FeSe which is not in the topological phase. Our combined data are consistent with the observation of dispersing Majorana states at a $π$-phase shift DW in a proximitized topological material.

preprint2017arXiv

Large deformation and instability of soft hollow cylinder with surface effects

Surface stress, which is always neglected in classical elastic theories, has recently emerged as a key role in the mechanics of highly deformable soft solids. In this paper, the effect of surface stress on the deformation and instability of soft hollow cylinder are analyzed. By incorporating surface energy density function into the constitutive model of a hyper-elastic theory, explicit solutions are obtained for the deformation of soft hollow cylinder under the conditions of uniform pressure loading and geometric everting. It is found that surface tension evidently alters the deformation of the soft cylinder. Specifically, the surface stiffness resists the deformation, but the residual surface stress is inclined to larger deformation. Effects of surface stress on the instability of the soft hollow cylinder is also explored. For both the pressure loading and geometric everting conditions, significant changes in critical condition of the creases are found by varying the surface parameter. The results in this work reveal that surface energy obviously influences both the deformation and the instability of soft hollow cylinder at finite deformation. The obtained results will be helpful for understanding and predicting the mechanical behavior of soft structures accurately.