Researcher profile

Ran Cheng

Ran Cheng contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
29works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

29 published item(s)

preprint2026arXiv

A Systematic Survey on Large Language Models for Evolutionary Optimization: From Modeling to Solving

Large Language Models (LLMs) possess substantial reasoning capabilities and are increasingly applied to optimization tasks, particularly in synergy with evolutionary computation. However, while recent surveys have explored specific aspects of this domain, they lack an integrative perspective that connects problem modeling with solving workflows. To address this gap, we present a systematic review of recent developments and organize them within a structured framework. First, we classify existing research into two primary stages: LLMs for optimization modeling and LLMs for optimization solving. Second, we divide the latter into three paradigms based on the role of the LLM: stand-alone optimizers, low-level components embedded within algorithms, and high-level managers for algorithm selection and generation. Third, for each category, we analyze representative methods, distill technical challenges, and examine their interplay with traditional approaches. Finally, we review interdisciplinary applications across the natural sciences, engineering, and machine learning. Based on this analysis, we highlight key limitations and point toward future directions for developing self-evolving agentic ecosystems. An up-to-date collection of related literature is maintained at https://github.com/ishmael233/LLM4OPT.

preprint2026arXiv

The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents

Recently, with the rapid development of robot learning and imitation learning, numerous datasets and methods have emerged. However, these datasets and their task designs often lack systematic consideration and principles. This raises important questions: Do the current datasets and task designs truly advance the capabilities of robotic agents? Do evaluations on a few common tasks accurately reflect the differentiated performance of various methods proposed by different teams and evaluated on different tasks? To address these issues, we introduce the Great March 100 (\textbf{GM-100}) as the first step towards a robot learning Olympics. GM-100 consists of 100 carefully designed tasks that cover a wide range of interactions and long-tail behaviors, aiming to provide a diverse and challenging set of tasks to comprehensively evaluate the capabilities of robotic agents and promote diversity and complexity in robot dataset task designs. These tasks are developed through systematic analysis and expansion of existing task designs, combined with insights from human-object interaction primitives and object affordances. We collect a large amount of trajectory data on different robotic platforms and evaluate several baseline models. Experimental results demonstrate that the GM-100 tasks are 1) feasible to execute and 2) sufficiently challenging to effectively differentiate the performance of current VLA models. Our data and code are available at https://rhos.ai/research/gm-100.

preprint2026arXiv

VAR-MATH: Probing True Mathematical Reasoning in LLMS via Symbolic Multi-Instance Benchmarks

Recent advances in reinforcement learning (RL) have led to substantial improvements in the mathematical reasoning abilities of LLMs, as measured by standard benchmarks. Yet these gains often persist even when models are trained with flawed signals, such as random or inverted rewards. This raises a fundamental question: do such improvements reflect genuine reasoning, or are they merely artifacts of overfitting to benchmark-specific patterns? To answer this question, we adopt an evaluation-centric perspective and highlight two critical shortcomings in existing protocols. First, benchmark contamination arises because test problems are publicly available, thereby increasing the risk of data leakage. Second, evaluation fragility results from reliance on single-instance assessments, which are sensitive to stochastic outputs and fail to capture reasoning consistency. These limitations suggest the need for a new evaluation paradigm that can probe reasoning ability beyond memorization and one-off success. As response, we propose VAR-MATH, a symbolic evaluation framework that converts fixed numerical problems into parameterized templates and requires models to solve multiple instantiations of each. This design enforces consistency across structurally equivalent variants, mitigates contamination, and enhances robustness through bootstrapped metrics. We apply VAR-MATH to transform three popular benchmarks, AMC23, AIME24, and AIME25, into their symbolic counterparts, VAR-AMC23, VAR-AIME24, and VAR-AIME25. Experimental results show substantial performance drops for RL-trained models on these variabilized benchmarks, especially for smaller models, with average declines of 47.9\% on AMC23, 58.8\% on AIME24, and 72.9\% on AIME25. These findings indicate that some existing RL methods rely on superficial heuristics and fail to generalize beyond specific numerical forms.

preprint2025arXiv

Characterizing Spin-Orbit Torques by Tensorial Spin Hall Magnetoresistance

Magnetoresistance (MR) provides a crucial tool for experimentally studying spin torques. While MR is well established in the device geometry of the spin Hall effect (SHE), as exemplified by the magnet/heavy-metal heterostructures, its role and manifestation beyond the SHE paradigm remain elusive. We propose a hitherto unknown form of MR where the underlying charge-to-spin conversion and its inverse process violate the simple geometry of the SHE, calling for tensorial descriptions. This MR can generate a series of unique harmonic responses essential for the experimental characterization of unconventional spin-orbit torques in non-SHE materials. We demonstrate these harmonic signals with semimetal WTe$_2$ in mind but the results are not restricted to specific materials.

preprint2022arXiv

A Perspective on Magnon Spin Nernst Effect in Antiferromagnets

Magnon excitations in antiferromagnetic materials and their physical implications enable novel device concepts not available in ferromagnets, emerging as a new area of active research. A unique characteristic of antiferromagnetic magnons is the coexistence of opposite spin polarization, which mimics the electron spin in a variety of transport phenomena. Among them, the most prominent spin-contrasting phenomenon is the magnon spin Nernst effect (SNE), which refers to the generation of transverse pure magnon spin current through a longitudinal temperature gradient. We introduce selected recent progress in the study of magnon SNE in collinear antiferromagnets with a focus on its underlying physical mechanism entailing profound topological features of the magnon band structures. By reviewing how the magnon SNE has inspired and enriched the exploration of topological magnons, we offer our perspectives on this emerging frontier that holds potential in future spintronic nano-technology.

preprint2022arXiv

Bi-fidelity Evolutionary Multiobjective Search for Adversarially Robust Deep Neural Architectures

Deep neural networks have been found vulnerable to adversarial attacks, thus raising potentially concerns in security-sensitive contexts. To address this problem, recent research has investigated the adversarial robustness of deep neural networks from the architectural point of view. However, searching for architectures of deep neural networks is computationally expensive, particularly when coupled with adversarial training process. To meet the above challenge, this paper proposes a bi-fidelity multiobjective neural architecture search approach. First, we formulate the NAS problem for enhancing adversarial robustness of deep neural networks into a multiobjective optimization problem. Specifically, in addition to a low-fidelity performance predictor as the first objective, we leverage an auxiliary-objective -- the value of which is the output of a surrogate model trained with high-fidelity evaluations. Secondly, we reduce the computational cost by combining three performance estimation methods, i.e., parameter sharing, low-fidelity evaluation, and surrogate-based predictor. The effectiveness of the proposed approach is confirmed by extensive experiments conducted on CIFAR-10, CIFAR-100 and SVHN datasets.

preprint2022arXiv

Exploiting Context Information for Generic Event Boundary Captioning

Generic Event Boundary Captioning (GEBC) aims to generate three sentences describing the status change for a given time boundary. Previous methods only process the information of a single boundary at a time, which lacks utilization of video context information. To tackle this issue, we design a model that directly takes the whole video as input and generates captions for all boundaries parallelly. The model could learn the context information for each time boundary by modeling the boundary-boundary interactions. Experiments demonstrate the effectiveness of context information. The proposed method achieved a 72.84 score on the test set, and we reached the $2^{nd}$ place in this challenge. Our code is available at: \url{https://github.com/zjr2000/Context-GEBC}

preprint2022arXiv

Field-Assisted Sub-Terahertz Spin Pumping and Auto-Oscillation in NiO

Spin pumping converting sub-terahertz electromagnetic waves to DC spin currents has recently been demonstrated in antiferromagnets (AFMs) with easy-axis magnetic anisotropy. However, easy-plane AFMs such as NiO, which are easier to prepare experimentally, are considered to be bad candidates for spin pumping because the Néel vector oscillation is linearly polarized, placing a major restriction on the material choice for practical applications. Through a case study of NiO, we show that an applied magnetic field below the spin-flop transition can substantially modify the polarization of the resonance eigenmodes, which enables coherent sub-terahertz spin pumping as strong as that in easy-axis AFMs. In addition, we find that an applied magnetic field can significantly reduce the threshold of Néel vector auto-oscillation triggered by spin-transfer torques. These prominent field-assisted effects can greatly facilitate spintronic device engineering in the sub-terahertz frequency regime.

preprint2022arXiv

Manipulating Ferrimagnets by Fields and Currents

Ferrimagnets (FIMs) can function as high-frequency antiferromagnets while being easy to detect as ferromagnets, offering unique opportunities for ultrafast device applications. While the physical behavior of FIMs near the compensation point has been widely studied, there lacks a generic understanding of FIMs where the ratio of sublattice spins can vary freely between the ferromagnetic and antiferromagnetic limits. Here we investigate the physical properties of a model two-sublattice FIM manipulated by static magnetic fields and current-induced torques. By continuously varying the ratio of sublattice spins, we clarify how the dynamical chiral modes in an FIM are intrinsically connected to their ferro- and antiferromagnetic counterparts, which reveals unique features not visible near the compensation point. In particular, we find that current-induced torques can trigger spontaneous oscillation of the terahertz exchange mode. Compared with its realization in antiferromagnets, a spin-torque oscillator using FIMs not only has a reduced threshold current density but also can be self-stabilized, obviating the need for dynamic feedback.

preprint2022arXiv

Multiobjective Test Problems with Degenerate Pareto Fronts

In multiobjective optimisation, a set of scalable test problems with a variety of features allow researchers to investigate and evaluate the abilities of different optimisation algorithms, and thus can help them to design and develop more effective and efficient approaches. Existing test problem suites mainly focus on situations where all the objectives are fully conflicting with each other. In such cases, an m-objective optimisation problem has an (m-1)-dimensional Pareto front in the objective space. However, in some optimisation problems, there may be unexpected characteristics among objectives, e.g., redundancy. The redundancy of some objectives can lead to the multiobjective problem having a degenerate Pareto front, i.e., the dimension of the Pareto front of the $m$-objective problem be less than (m-1). In this paper, we systematically study degenerate multiobjective problems. We abstract three general characteristics of degenerate problems, which are not formulated and systematically investigated in the literature. Based on these characteristics, we present a set of test problems to support the investigation of multiobjective optimisation algorithms under situations with redundant objectives. To the best of our knowledge, this work is the first one that explicitly formulates these three characteristics of degenerate problems, thus allowing the resulting test problems to be featured by their generality, in contrast to existing test problems designed for specific purposes (e.g., visualisation).

preprint2022arXiv

S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization

Camera relocalization is the key component of simultaneous localization and mapping (SLAM) systems. This paper proposes a learning-based approach, named Sparse Spatial Scene Embedding with Graph Neural Networks (S3E-GNN), as an end-to-end framework for efficient and robust camera relocalization. S3E-GNN consists of two modules. In the encoding module, a trained S3E network encodes RGB images into embedding codes to implicitly represent spatial and semantic embedding code. With embedding codes and the associated poses obtained from a SLAM system, each image is represented as a graph node in a pose graph. In the GNN query module, the pose graph is transformed to form a embedding-aggregated reference graph for camera relocalization. We collect various scene datasets in the challenging environments to perform experiments. Our results demonstrate that S3E-GNN method outperforms the traditional Bag-of-words (BoW) for camera relocalization due to learning-based embedding and GNN powered scene matching mechanism.

preprint2022arXiv

Semantic-Aware Pretraining for Dense Video Captioning

This report describes the details of our approach for the event dense-captioning task in ActivityNet Challenge 2021. We present a semantic-aware pretraining method for dense video captioning, which empowers the learned features to recognize high-level semantic concepts. Diverse video features of different modalities are fed into an event captioning module to generate accurate and meaningful sentences. Our final ensemble model achieves a 10.00 METEOR score on the test set.

preprint2022arXiv

SoloGAN: Multi-domain Multimodal Unpaired Image-to-Image Translation via a Single Generative Adversarial Network

Despite significant advances in image-to-image (I2I) translation with generative adversarial networks (GANs), it remains challenging to effectively translate an image to a set of diverse images in multiple target domains using a single pair of generator and discriminator. Existing I2I translation methods adopt multiple domain-specific content encoders for different domains, where each domain-specific content encoder is trained with images from the same domain only. Nevertheless, we argue that the content (domain-invariance) features should be learned from images among all of the domains. Consequently, each domain-specific content encoder of existing schemes fails to extract the domain-invariant features efficiently. To address this issue, we present a flexible and general SoloGAN model for efficient multimodal I2I translation among multiple domains with unpaired data. In contrast to existing methods, the SoloGAN algorithm uses a single projection discriminator with an additional auxiliary classifier and shares the encoder and generator for all domains. Consequently, the SoloGAN can be trained effectively with images from all domains such that the domain-invariance content representation can be efficiently extracted. Qualitative and quantitative results over a wide range of datasets against several counterparts and variants of the SoloGAN demonstrate the merits of the method, especially for challenging I2I translation datasets, i.e., datasets involving extreme shape variations or need to keep the complex backgrounds unchanged after translations. Furthermore, we demonstrate the contribution of each component in SoloGAN by ablation studies.

preprint2022arXiv

Surrogate-assisted Multi-objective Neural Architecture Search for Real-time Semantic Segmentation

The architectural advancements in deep neural networks have led to remarkable leap-forwards across a broad array of computer vision tasks. Instead of relying on human expertise, neural architecture search (NAS) has emerged as a promising avenue toward automating the design of architectures. While recent achievements in image classification have suggested opportunities, the promises of NAS have yet to be thoroughly assessed on more challenging tasks of semantic segmentation. The main challenges of applying NAS to semantic segmentation arise from two aspects: (i) high-resolution images to be processed; (ii) additional requirement of real-time inference speed (i.e., real-time semantic segmentation) for applications such as autonomous driving. To meet such challenges, we propose a surrogate-assisted multi-objective method in this paper. Through a series of customized prediction models, our method effectively transforms the original NAS task into an ordinary multi-objective optimization problem. Followed by a hierarchical pre-screening criterion for in-fill selection, our method progressively achieves a set of efficient architectures trading-off between segmentation accuracy and inference speed. Empirical evaluations on three benchmark datasets together with an application using Huawei Atlas 200 DK suggest that our method can identify architectures significantly outperforming existing state-of-the-art architectures designed both manually by human experts and automatically by other NAS methods.

preprint2022arXiv

Theory of Harmonic Hall Responses of Spin-Torque Driven Antiferromagnets

Harmonic analysis is a powerful tool to characterize and quantify current-induced torques acting on magnetic materials, but so far it remains an open question in studying antiferromagnets. Here we formulate a general theory of harmonic Hall responses of collinear antiferromagnets driven by current-induced torques including both field-like and damping-like components. By scanning a magnetic field of variable strength in three orthogonal planes, we are able to distinguish the contributions from field-like torque, damping-like torque, and concomitant thermal effects by analyzing the second harmonic signals in the Hall voltage. The analytical expressions of the first and second harmonics as functions of the magnetic field direction and strength are confirmed by numerical simulations with good agreement. We demonstrate our predictions in two prototype antiferromagnets, $α-$Fe$_{2}$O$_{3}$ and NiO, providing direct and general guidance to current and future experiments.

preprint2022arXiv

VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix

Existing vision-language pre-training (VLP) methods primarily rely on paired image-text datasets, which are either annotated by enormous human labors, or crawled from the internet followed by elaborate data cleaning techniques. To reduce the dependency on well-aligned image-text pairs, it is promising to directly leverage the large-scale text-only and image-only corpora. This paper proposes a data augmentation method, namely cross-modal CutMix (CMC), for implicit cross-modal alignment learning in unpaired VLP. Specifically, CMC transforms natural sentences from the textual view into a multi-modal view, where visually-grounded words in a sentence are randomly replaced by diverse image patches with similar semantics. There are several appealing proprieties of the proposed CMC. First, it enhances the data diversity while keeping the semantic meaning intact for tackling problems where the aligned data are scarce; Second, by attaching cross-modal noise on uni-modal data, it guides models to learn token-level interactions across modalities for better denoising. Furthermore, we present a new unpaired VLP method, dubbed as VLMixer, that integrates CMC with contrastive learning to pull together the uni-modal and multi-modal views for better instance-level alignments among different modalities. Extensive experiments on five downstream tasks show that VLMixer could surpass previous state-of-the-art unpaired VLP methods.

preprint2022arXiv

Voltage-driven exchange resonance achieving 100\% mechanical efficiency

Magnetic resonances driven by current-induced torques are crucial tools to study magnetic materials but are very limited in frequency and mechanical efficiency. We propose an alternative mechanism, voltage-induced torque, to realize high efficiency in generating high-frequency magnetization dynamics. When a ferromagnet-topological insulator-ferromagnet trilayer heterostructure is operated as an adiabatic quantum motor, voltage-induced torque arises from the adiabatic motion of gapped topological electrons on the two interfaces and act oppositely on the two ferromagnetic layers, which can excite the exchange mode where the two ferromagnetic layers precess with a $π$-phase difference. The exchange mode resonance, bearing a much higher frequency than the ferromagnetic resonance, is accompanied by topological charge pumping, leading to a sharp peak in electrical admittance at the resonance point. Because the output current is purely adiabatic while dissipative current vanishes identically, the proposed voltage-driven exchange resonance entails a remarkably high mechanical efficiency close to unity, which is impossible in any current-driven systems.

preprint2021arXiv

(AF)2-S3Net: Attentive Feature Fusion with Adaptive Feature Selection for Sparse Semantic Segmentation Network

Autonomous robotic systems and self driving cars rely on accurate perception of their surroundings as the safety of the passengers and pedestrians is the top priority. Semantic segmentation is one the essential components of environmental perception that provides semantic information of the scene. Recently, several methods have been introduced for 3D LiDAR semantic segmentation. While, they can lead to improved performance, they are either afflicted by high computational complexity, therefore are inefficient, or lack fine details of smaller instances. To alleviate this problem, we propose AF2-S3Net, an end-to-end encoder-decoder CNN network for 3D LiDAR semantic segmentation. We present a novel multi-branch attentive feature fusion module in the encoder and a unique adaptive feature selection module with feature map re-weighting in the decoder. Our AF2-S3Net fuses the voxel based learning and point-based learning into a single framework to effectively process the large 3D scene. Our experimental results show that the proposed method outperforms the state-of-the-art approaches on the large-scale SemanticKITTI benchmark, ranking 1st on the competitive public leaderboard competition upon publication.

preprint2021arXiv

Quantifying Spin-Orbit Torques in Antiferromagnet/Heavy Metal Heterostructures

The effect of spin currents on the magnetic order of insulating antiferromagnets (AFMs) is of fundamental interest and can enable new applications. Toward this goal, characterizing the spin-orbit torques (SOT) associated with AFM/heavy metal (HM) interfaces is important. Here we report the full angular dependence of the harmonic Hall voltages in a predominantly easy-plane AFM, epitaxial c-axis oriented $α$-Fe$_2$O$_3$ films, with an interface to Pt. By modeling the harmonic Hall signals together with the $α$-Fe$_2$O$_3$ magnetic parameters, we determine the amplitudes of field-like and damping-like SOT. Out-of-plane field scans are shown to be essential to determining the damping-like component of the torques. In contrast to ferromagnetic/heavy metal heterostructures, our results demonstrate that the field-like torques are significantly larger than the damping-like torques, which we correlate with the presence of a large imaginary component of the interface spin-mixing conductance. Our work demonstrates a direct way of characterizing SOT in AFM/HM heterostructures.

preprint2020arXiv

Current-induced CrI3 surface spin-flop transition probed by proximity magnetoresistance in Pt

By exploiting proximity coupling, we probe the spin state of the surface layers of CrI3, a van der Waals magnetic semiconductor, by measuring the induced magnetoresistance (MR) of Pt in Pt/CrI3 nano-devices. We fabricate the devices with clean and stable interfaces by placing freshly exfoliated CrI3 flake atop pre-patterned thin Pt strip and encapsulating the Pt/CrI3 heterostructure with hexagonal boron nitride (hBN) in a protected environment. In devices consisting of a wide range of CrI3 thicknesses (30 to 150 nm), we observe that an abrupt upward jump in Pt MR emerge at a 2 T magnetic field applied perpendicularly to the layers when the current density exceeds 2.5x10^10 A/m2, followed by a gradual decrease over a range of 5 T. These distinct MR features suggest a spin-flop transition which reveals strong antiferromagnetic interlayer coupling in the surface layers of CrI3. We study the current dependence by holding the Pt/CrI3 sample at approximately the same temperature to exclude the joule heating effect, and find that the MR jump increases with the current density, indicating a spin current origin. This spin current effect provides a new route to control spin configurations in insulating antiferromagnets, which is potentially useful for spintronic applications.

preprint2020arXiv

Evolutionary Multi-Objective Optimization Driven by Generative Adversarial Networks

Recently, more and more works have proposed to drive evolutionary algorithms using machine learning models.Usually, the performance of such model based evolutionary algorithms is highly dependent on the training qualities of the adopted models.Since it usually requires a certain amount of data (i.e. the candidate solutions generated by the algorithms) for model training, the performance deteriorates rapidly with the increase of the problem scales, due to the curse of dimensionality.To address this issue, we propose a multi-objective evolutionary algorithm driven by the generative adversarial networks (GANs).At each generation of the proposed algorithm, the parent solutions are first classified into \emph{real} and \emph{fake} samples to train the GANs; then the offspring solutions are sampled by the trained GANs.Thanks to the powerful generative ability of the GANs, our proposed algorithm is capable of generating promising offspring solutions in high-dimensional decision space with limited training data.The proposed algorithm is tested on 10 benchmark problems with up to 200 decision variables.Experimental results on these test problems demonstrate the effectiveness of the proposed algorithm.

preprint2020arXiv

Evolutionary Multiobjective Optimization Driven by Generative Adversarial Networks (GANs)

Recently, increasing works have proposed to drive evolutionary algorithms using machine learning models. Usually, the performance of such model based evolutionary algorithms is highly dependent on the training qualities of the adopted models. Since it usually requires a certain amount of data (i.e. the candidate solutions generated by the algorithms) for model training, the performance deteriorates rapidly with the increase of the problem scales, due to the curse of dimensionality. To address this issue, we propose a multi-objective evolutionary algorithm driven by the generative adversarial networks (GANs). At each generation of the proposed algorithm, the parent solutions are first classified into real and fake samples to train the GANs; then the offspring solutions are sampled by the trained GANs. Thanks to the powerful generative ability of the GANs, our proposed algorithm is capable of generating promising offspring solutions in high-dimensional decision space with limited training data. The proposed algorithm is tested on 10 benchmark problems with up to 200 decision variables. Experimental results on these test problems demonstrate the effectiveness of the proposed algorithm.

preprint2020arXiv

Hybrid Attention-Based Transformer Block Model for Distant Supervision Relation Extraction

With an exponential explosive growth of various digital text information, it is challenging to efficiently obtain specific knowledge from massive unstructured text information. As one basic task for natural language processing (NLP), relation extraction aims to extract the semantic relation between entity pairs based on the given text. To avoid manual labeling of datasets, distant supervision relation extraction (DSRE) has been widely used, aiming to utilize knowledge base to automatically annotate datasets. Unfortunately, this method heavily suffers from wrong labelling due to the underlying strong assumptions. To address this issue, we propose a new framework using hybrid attention-based Transformer block with multi-instance learning to perform the DSRE task. More specifically, the Transformer block is firstly used as the sentence encoder to capture syntactic information of sentences, which mainly utilizes multi-head self-attention to extract features from word level. Then, a more concise sentence-level attention mechanism is adopted to constitute the bag representation, aiming to incorporate valid information of each sentence to effectively represent the bag. Experimental results on the public dataset New York Times (NYT) demonstrate that the proposed approach can outperform the state-of-the-art algorithms on the evaluation dataset, which verifies the effectiveness of our model for the DSRE task.

preprint2020arXiv

Magnonic Su-Schrieffer-Heeger Model in Honeycomb Ferromagnets

Topological electronics has extended its richness to non-electronic systems where phonons and magnons can play the role of electrons. In particular, topological phases of magnons can be enabled by the Dzyaloshinskii-Moriya interaction (DMI) which acts as an effective spin-orbit coupling. We show that besides DMI, an alternating arrangement of Heisenberg exchange interactions critically determines the magnon band topology, realizing a magnonic analog of the Su-Schrieffer-Heeger model. On a honeycomb ferromagnet with perpendicular anisotropy, we calculate the topological phase diagram, the chiral edge states, and the associated magnon Hall effect by allowing the relative strength of exchange interactions on different links to be tunable. Including weak phonon-magnon hybridization does not change the result. Candidate materials are discussed.

preprint2020arXiv

Moiré magnons in twisted bilayer magnets with collinear order

We explore the moiré magnon bands in twisted bilayer magnets with next-nearest neighboring Dzyaloshinskii-Moriya interactions, assuming that the out-of-plane collinear magnetic order is preserved under weak interlayer coupling. By calculating the magnonic band structures and the topological Chern numbers for four representative cases, we find that (i) the valley moiré bands are extremely flat over a wide range of continuous twist angles; (ii) the topological Chern numbers of the lowest few flat bands vary significantly with the twist angle; and (iii) the lowest few topological flat bands in bilayer antiferromagnets entail nontrivial thermal spin transport in the transverse direction; These properties make twisted bilayer magnets an ideal platform to study the magnonic counterparts of moiré electrons, where the statistical distinction between magnons and electrons leads to fundamentally new physical behavior.

preprint2020arXiv

PDA: Progressive Data Augmentation for General Robustness of Deep Neural Networks

Adversarial images are designed to mislead deep neural networks (DNNs), attracting great attention in recent years. Although several defense strategies achieved encouraging robustness against adversarial samples, most of them fail to improve the robustness on common corruptions such as noise, blur, and weather/digital effects (e.g. frost, pixelate). To address this problem, we propose a simple yet effective method, named Progressive Data Augmentation (PDA), which enables general robustness of DNNs by progressively injecting diverse adversarial noises during training. In other words, DNNs trained with PDA are able to obtain more robustness against both adversarial attacks as well as common corruptions than the recent state-of-the-art methods. We also find that PDA is more efficient than prior arts and able to prevent accuracy drop on clean samples without being attacked. Furthermore, we theoretically show that PDA can control the perturbation bound and guarantee better generalization ability than existing work. Extensive experiments on many benchmarks such as CIFAR-10, SVHN, and ImageNet demonstrate that PDA significantly outperforms its counterparts in various experimental setups.

preprint2020arXiv

Sampled Training and Node Inheritance for Fast Evolutionary Neural Architecture Search

The performance of a deep neural network is heavily dependent on its architecture and various neural architecture search strategies have been developed for automated network architecture design. Recently, evolutionary neural architecture search (ENAS) has received increasing attention due to the attractive global optimization capability of evolutionary algorithms. However, ENAS suffers from extremely high computation costs because a large number of performance evaluations is usually required in evolutionary optimization and training deep neural networks is itself computationally very intensive. To address this issue, this paper proposes a new evolutionary framework for fast ENAS based on directed acyclic graph, in which parents are randomly sampled and trained on each mini-batch of training data. In addition, a node inheritance strategy is adopted to generate offspring individuals and their fitness is directly evaluated without training. To enhance the feature processing capability of the evolved neural networks, we also encode a channel attention mechanism in the search space. We evaluate the proposed algorithm on the widely used datasets, in comparison with 26 state-of-the-art peer algorithms. Our experimental results show the proposed algorithm is not only computationally much more efficiently, but also highly competitive in learning performance.

preprint2020arXiv

Spin fluctuations in quantized transport of magnetic topological insulators

In magnetic topological insulators, quantized electronic transport is interwined with spontaneous magnetic ordering, as magnetization controls band gaps, hence band topology, through the exchange interaction. We show that considering the exchange gaps at the mean-field level is inadequate to predict phase transitions between electronic states of distinct topology. Thermal spin fluctuations disturbing the magnetization can act as frozen disorders that strongly scatter electrons, reducing the onset temperature of quantized transport appreciably even in the absence of structural impurities. This effect, which has hitherto been overlooked, provides an alternative explanation of recent experiments on intrinsic magnetic topological insulators.

preprint2020arXiv

Subterahertz spin pumping from an insulating antiferromagnet

Spin-transfer torque and spin Hall effects combined with their reciprocal phenomena, spin-pumping and inverse spin Hall (ISHE) effects, enable the reading and control of magnetic moments in spintronics. The direct observation of these effects remains elusive in antiferromagnetic-based devices. We report sub-terahertz spin-pumping at the interface of a uniaxial insulating antiferromagnet MnF2 and platinum. The measured ISHE voltage arising from spin-charge conversion in the platinum layer depends on the chirality of the dynamical modes of the antiferromagnet, which is selectively excited and modulated by the handedness of the circularly polarized sub-THz irradiation. Our results open the door to the controlled generation of coherent pure spin currents at THz frequencies.