Source author record

Ran Cheng

Ran Cheng appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.mes-hall cond-mat.mtrl-sci quant-ph Computer Vision Neural and Evolutionary Computing Artificial Intelligence Machine Learning cond-mat.other Robotics hep-th Computation and Language cond-mat.stat-mech cond-mat.str-el math-ph math.MP math.OC physics.optics

Catalog footprint

What is connected

43works

17topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A Systematic Survey on Large Language Models for Evolutionary Optimization: From Modeling to Solving

Large Language Models (LLMs) possess substantial reasoning capabilities and are increasingly applied to optimization tasks, particularly in synergy with evolutionary computation. However, while recent surveys have explored specific aspects of this domain, they lack an integrative perspective that connects problem modeling with solving workflows. To address this gap, we present a systematic review of recent developments and organize them within a structured framework. First, we classify existing research into two primary stages: LLMs for optimization modeling and LLMs for optimization solving. Second, we divide the latter into three paradigms based on the role of the LLM: stand-alone optimizers, low-level components embedded within algorithms, and high-level managers for algorithm selection and generation. Third, for each category, we analyze representative methods, distill technical challenges, and examine their interplay with traditional approaches. Finally, we review interdisciplinary applications across the natural sciences, engineering, and machine learning. Based on this analysis, we highlight key limitations and point toward future directions for developing self-evolving agentic ecosystems. An up-to-date collection of related literature is maintained at https://github.com/ishmael233/LLM4OPT.

preprint2026arXiv

The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents

Recently, with the rapid development of robot learning and imitation learning, numerous datasets and methods have emerged. However, these datasets and their task designs often lack systematic consideration and principles. This raises important questions: Do the current datasets and task designs truly advance the capabilities of robotic agents? Do evaluations on a few common tasks accurately reflect the differentiated performance of various methods proposed by different teams and evaluated on different tasks? To address these issues, we introduce the Great March 100 (\textbf{GM-100}) as the first step towards a robot learning Olympics. GM-100 consists of 100 carefully designed tasks that cover a wide range of interactions and long-tail behaviors, aiming to provide a diverse and challenging set of tasks to comprehensively evaluate the capabilities of robotic agents and promote diversity and complexity in robot dataset task designs. These tasks are developed through systematic analysis and expansion of existing task designs, combined with insights from human-object interaction primitives and object affordances. We collect a large amount of trajectory data on different robotic platforms and evaluate several baseline models. Experimental results demonstrate that the GM-100 tasks are 1) feasible to execute and 2) sufficiently challenging to effectively differentiate the performance of current VLA models. Our data and code are available at https://rhos.ai/research/gm-100.

preprint2026arXiv

VAR-MATH: Probing True Mathematical Reasoning in LLMS via Symbolic Multi-Instance Benchmarks

Recent advances in reinforcement learning (RL) have led to substantial improvements in the mathematical reasoning abilities of LLMs, as measured by standard benchmarks. Yet these gains often persist even when models are trained with flawed signals, such as random or inverted rewards. This raises a fundamental question: do such improvements reflect genuine reasoning, or are they merely artifacts of overfitting to benchmark-specific patterns? To answer this question, we adopt an evaluation-centric perspective and highlight two critical shortcomings in existing protocols. First, benchmark contamination arises because test problems are publicly available, thereby increasing the risk of data leakage. Second, evaluation fragility results from reliance on single-instance assessments, which are sensitive to stochastic outputs and fail to capture reasoning consistency. These limitations suggest the need for a new evaluation paradigm that can probe reasoning ability beyond memorization and one-off success. As response, we propose VAR-MATH, a symbolic evaluation framework that converts fixed numerical problems into parameterized templates and requires models to solve multiple instantiations of each. This design enforces consistency across structurally equivalent variants, mitigates contamination, and enhances robustness through bootstrapped metrics. We apply VAR-MATH to transform three popular benchmarks, AMC23, AIME24, and AIME25, into their symbolic counterparts, VAR-AMC23, VAR-AIME24, and VAR-AIME25. Experimental results show substantial performance drops for RL-trained models on these variabilized benchmarks, especially for smaller models, with average declines of 47.9\% on AMC23, 58.8\% on AIME24, and 72.9\% on AIME25. These findings indicate that some existing RL methods rely on superficial heuristics and fail to generalize beyond specific numerical forms.

preprint2025arXiv

Characterizing Spin-Orbit Torques by Tensorial Spin Hall Magnetoresistance

Magnetoresistance (MR) provides a crucial tool for experimentally studying spin torques. While MR is well established in the device geometry of the spin Hall effect (SHE), as exemplified by the magnet/heavy-metal heterostructures, its role and manifestation beyond the SHE paradigm remain elusive. We propose a hitherto unknown form of MR where the underlying charge-to-spin conversion and its inverse process violate the simple geometry of the SHE, calling for tensorial descriptions. This MR can generate a series of unique harmonic responses essential for the experimental characterization of unconventional spin-orbit torques in non-SHE materials. We demonstrate these harmonic signals with semimetal WTe$_2$ in mind but the results are not restricted to specific materials.

preprint2022arXiv

A Perspective on Magnon Spin Nernst Effect in Antiferromagnets

Magnon excitations in antiferromagnetic materials and their physical implications enable novel device concepts not available in ferromagnets, emerging as a new area of active research. A unique characteristic of antiferromagnetic magnons is the coexistence of opposite spin polarization, which mimics the electron spin in a variety of transport phenomena. Among them, the most prominent spin-contrasting phenomenon is the magnon spin Nernst effect (SNE), which refers to the generation of transverse pure magnon spin current through a longitudinal temperature gradient. We introduce selected recent progress in the study of magnon SNE in collinear antiferromagnets with a focus on its underlying physical mechanism entailing profound topological features of the magnon band structures. By reviewing how the magnon SNE has inspired and enriched the exploration of topological magnons, we offer our perspectives on this emerging frontier that holds potential in future spintronic nano-technology.

preprint2022arXiv

Bi-fidelity Evolutionary Multiobjective Search for Adversarially Robust Deep Neural Architectures

Deep neural networks have been found vulnerable to adversarial attacks, thus raising potentially concerns in security-sensitive contexts. To address this problem, recent research has investigated the adversarial robustness of deep neural networks from the architectural point of view. However, searching for architectures of deep neural networks is computationally expensive, particularly when coupled with adversarial training process. To meet the above challenge, this paper proposes a bi-fidelity multiobjective neural architecture search approach. First, we formulate the NAS problem for enhancing adversarial robustness of deep neural networks into a multiobjective optimization problem. Specifically, in addition to a low-fidelity performance predictor as the first objective, we leverage an auxiliary-objective -- the value of which is the output of a surrogate model trained with high-fidelity evaluations. Secondly, we reduce the computational cost by combining three performance estimation methods, i.e., parameter sharing, low-fidelity evaluation, and surrogate-based predictor. The effectiveness of the proposed approach is confirmed by extensive experiments conducted on CIFAR-10, CIFAR-100 and SVHN datasets.

preprint2022arXiv

Exploiting Context Information for Generic Event Boundary Captioning

Generic Event Boundary Captioning (GEBC) aims to generate three sentences describing the status change for a given time boundary. Previous methods only process the information of a single boundary at a time, which lacks utilization of video context information. To tackle this issue, we design a model that directly takes the whole video as input and generates captions for all boundaries parallelly. The model could learn the context information for each time boundary by modeling the boundary-boundary interactions. Experiments demonstrate the effectiveness of context information. The proposed method achieved a 72.84 score on the test set, and we reached the $2^{nd}$ place in this challenge. Our code is available at: \url{https://github.com/zjr2000/Context-GEBC}

preprint2022arXiv

Field-Assisted Sub-Terahertz Spin Pumping and Auto-Oscillation in NiO

Spin pumping converting sub-terahertz electromagnetic waves to DC spin currents has recently been demonstrated in antiferromagnets (AFMs) with easy-axis magnetic anisotropy. However, easy-plane AFMs such as NiO, which are easier to prepare experimentally, are considered to be bad candidates for spin pumping because the Néel vector oscillation is linearly polarized, placing a major restriction on the material choice for practical applications. Through a case study of NiO, we show that an applied magnetic field below the spin-flop transition can substantially modify the polarization of the resonance eigenmodes, which enables coherent sub-terahertz spin pumping as strong as that in easy-axis AFMs. In addition, we find that an applied magnetic field can significantly reduce the threshold of Néel vector auto-oscillation triggered by spin-transfer torques. These prominent field-assisted effects can greatly facilitate spintronic device engineering in the sub-terahertz frequency regime.

preprint2022arXiv

Manipulating Ferrimagnets by Fields and Currents

Ferrimagnets (FIMs) can function as high-frequency antiferromagnets while being easy to detect as ferromagnets, offering unique opportunities for ultrafast device applications. While the physical behavior of FIMs near the compensation point has been widely studied, there lacks a generic understanding of FIMs where the ratio of sublattice spins can vary freely between the ferromagnetic and antiferromagnetic limits. Here we investigate the physical properties of a model two-sublattice FIM manipulated by static magnetic fields and current-induced torques. By continuously varying the ratio of sublattice spins, we clarify how the dynamical chiral modes in an FIM are intrinsically connected to their ferro- and antiferromagnetic counterparts, which reveals unique features not visible near the compensation point. In particular, we find that current-induced torques can trigger spontaneous oscillation of the terahertz exchange mode. Compared with its realization in antiferromagnets, a spin-torque oscillator using FIMs not only has a reduced threshold current density but also can be self-stabilized, obviating the need for dynamic feedback.

preprint2022arXiv

Multiobjective Test Problems with Degenerate Pareto Fronts

In multiobjective optimisation, a set of scalable test problems with a variety of features allow researchers to investigate and evaluate the abilities of different optimisation algorithms, and thus can help them to design and develop more effective and efficient approaches. Existing test problem suites mainly focus on situations where all the objectives are fully conflicting with each other. In such cases, an m-objective optimisation problem has an (m-1)-dimensional Pareto front in the objective space. However, in some optimisation problems, there may be unexpected characteristics among objectives, e.g., redundancy. The redundancy of some objectives can lead to the multiobjective problem having a degenerate Pareto front, i.e., the dimension of the Pareto front of the $m$-objective problem be less than (m-1). In this paper, we systematically study degenerate multiobjective problems. We abstract three general characteristics of degenerate problems, which are not formulated and systematically investigated in the literature. Based on these characteristics, we present a set of test problems to support the investigation of multiobjective optimisation algorithms under situations with redundant objectives. To the best of our knowledge, this work is the first one that explicitly formulates these three characteristics of degenerate problems, thus allowing the resulting test problems to be featured by their generality, in contrast to existing test problems designed for specific purposes (e.g., visualisation).

preprint2022arXiv

S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization

Camera relocalization is the key component of simultaneous localization and mapping (SLAM) systems. This paper proposes a learning-based approach, named Sparse Spatial Scene Embedding with Graph Neural Networks (S3E-GNN), as an end-to-end framework for efficient and robust camera relocalization. S3E-GNN consists of two modules. In the encoding module, a trained S3E network encodes RGB images into embedding codes to implicitly represent spatial and semantic embedding code. With embedding codes and the associated poses obtained from a SLAM system, each image is represented as a graph node in a pose graph. In the GNN query module, the pose graph is transformed to form a embedding-aggregated reference graph for camera relocalization. We collect various scene datasets in the challenging environments to perform experiments. Our results demonstrate that S3E-GNN method outperforms the traditional Bag-of-words (BoW) for camera relocalization due to learning-based embedding and GNN powered scene matching mechanism.

preprint2022arXiv

Semantic-Aware Pretraining for Dense Video Captioning

This report describes the details of our approach for the event dense-captioning task in ActivityNet Challenge 2021. We present a semantic-aware pretraining method for dense video captioning, which empowers the learned features to recognize high-level semantic concepts. Diverse video features of different modalities are fed into an event captioning module to generate accurate and meaningful sentences. Our final ensemble model achieves a 10.00 METEOR score on the test set.

preprint2022arXiv

SoloGAN: Multi-domain Multimodal Unpaired Image-to-Image Translation via a Single Generative Adversarial Network

Despite significant advances in image-to-image (I2I) translation with generative adversarial networks (GANs), it remains challenging to effectively translate an image to a set of diverse images in multiple target domains using a single pair of generator and discriminator. Existing I2I translation methods adopt multiple domain-specific content encoders for different domains, where each domain-specific content encoder is trained with images from the same domain only. Nevertheless, we argue that the content (domain-invariance) features should be learned from images among all of the domains. Consequently, each domain-specific content encoder of existing schemes fails to extract the domain-invariant features efficiently. To address this issue, we present a flexible and general SoloGAN model for efficient multimodal I2I translation among multiple domains with unpaired data. In contrast to existing methods, the SoloGAN algorithm uses a single projection discriminator with an additional auxiliary classifier and shares the encoder and generator for all domains. Consequently, the SoloGAN can be trained effectively with images from all domains such that the domain-invariance content representation can be efficiently extracted. Qualitative and quantitative results over a wide range of datasets against several counterparts and variants of the SoloGAN demonstrate the merits of the method, especially for challenging I2I translation datasets, i.e., datasets involving extreme shape variations or need to keep the complex backgrounds unchanged after translations. Furthermore, we demonstrate the contribution of each component in SoloGAN by ablation studies.

preprint2022arXiv

Surrogate-assisted Multi-objective Neural Architecture Search for Real-time Semantic Segmentation

The architectural advancements in deep neural networks have led to remarkable leap-forwards across a broad array of computer vision tasks. Instead of relying on human expertise, neural architecture search (NAS) has emerged as a promising avenue toward automating the design of architectures. While recent achievements in image classification have suggested opportunities, the promises of NAS have yet to be thoroughly assessed on more challenging tasks of semantic segmentation. The main challenges of applying NAS to semantic segmentation arise from two aspects: (i) high-resolution images to be processed; (ii) additional requirement of real-time inference speed (i.e., real-time semantic segmentation) for applications such as autonomous driving. To meet such challenges, we propose a surrogate-assisted multi-objective method in this paper. Through a series of customized prediction models, our method effectively transforms the original NAS task into an ordinary multi-objective optimization problem. Followed by a hierarchical pre-screening criterion for in-fill selection, our method progressively achieves a set of efficient architectures trading-off between segmentation accuracy and inference speed. Empirical evaluations on three benchmark datasets together with an application using Huawei Atlas 200 DK suggest that our method can identify architectures significantly outperforming existing state-of-the-art architectures designed both manually by human experts and automatically by other NAS methods.

preprint2022arXiv

Theory of Harmonic Hall Responses of Spin-Torque Driven Antiferromagnets

Harmonic analysis is a powerful tool to characterize and quantify current-induced torques acting on magnetic materials, but so far it remains an open question in studying antiferromagnets. Here we formulate a general theory of harmonic Hall responses of collinear antiferromagnets driven by current-induced torques including both field-like and damping-like components. By scanning a magnetic field of variable strength in three orthogonal planes, we are able to distinguish the contributions from field-like torque, damping-like torque, and concomitant thermal effects by analyzing the second harmonic signals in the Hall voltage. The analytical expressions of the first and second harmonics as functions of the magnetic field direction and strength are confirmed by numerical simulations with good agreement. We demonstrate our predictions in two prototype antiferromagnets, $α-$Fe$_{2}$O$_{3}$ and NiO, providing direct and general guidance to current and future experiments.

preprint2022arXiv

VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix

Existing vision-language pre-training (VLP) methods primarily rely on paired image-text datasets, which are either annotated by enormous human labors, or crawled from the internet followed by elaborate data cleaning techniques. To reduce the dependency on well-aligned image-text pairs, it is promising to directly leverage the large-scale text-only and image-only corpora. This paper proposes a data augmentation method, namely cross-modal CutMix (CMC), for implicit cross-modal alignment learning in unpaired VLP. Specifically, CMC transforms natural sentences from the textual view into a multi-modal view, where visually-grounded words in a sentence are randomly replaced by diverse image patches with similar semantics. There are several appealing proprieties of the proposed CMC. First, it enhances the data diversity while keeping the semantic meaning intact for tackling problems where the aligned data are scarce; Second, by attaching cross-modal noise on uni-modal data, it guides models to learn token-level interactions across modalities for better denoising. Furthermore, we present a new unpaired VLP method, dubbed as VLMixer, that integrates CMC with contrastive learning to pull together the uni-modal and multi-modal views for better instance-level alignments among different modalities. Extensive experiments on five downstream tasks show that VLMixer could surpass previous state-of-the-art unpaired VLP methods.

preprint2022arXiv

Voltage-driven exchange resonance achieving 100\% mechanical efficiency

Magnetic resonances driven by current-induced torques are crucial tools to study magnetic materials but are very limited in frequency and mechanical efficiency. We propose an alternative mechanism, voltage-induced torque, to realize high efficiency in generating high-frequency magnetization dynamics. When a ferromagnet-topological insulator-ferromagnet trilayer heterostructure is operated as an adiabatic quantum motor, voltage-induced torque arises from the adiabatic motion of gapped topological electrons on the two interfaces and act oppositely on the two ferromagnetic layers, which can excite the exchange mode where the two ferromagnetic layers precess with a $π$-phase difference. The exchange mode resonance, bearing a much higher frequency than the ferromagnetic resonance, is accompanied by topological charge pumping, leading to a sharp peak in electrical admittance at the resonance point. Because the output current is purely adiabatic while dissipative current vanishes identically, the proposed voltage-driven exchange resonance entails a remarkably high mechanical efficiency close to unity, which is impossible in any current-driven systems.

preprint2021arXiv

(AF)2-S3Net: Attentive Feature Fusion with Adaptive Feature Selection for Sparse Semantic Segmentation Network

Autonomous robotic systems and self driving cars rely on accurate perception of their surroundings as the safety of the passengers and pedestrians is the top priority. Semantic segmentation is one the essential components of environmental perception that provides semantic information of the scene. Recently, several methods have been introduced for 3D LiDAR semantic segmentation. While, they can lead to improved performance, they are either afflicted by high computational complexity, therefore are inefficient, or lack fine details of smaller instances. To alleviate this problem, we propose AF2-S3Net, an end-to-end encoder-decoder CNN network for 3D LiDAR semantic segmentation. We present a novel multi-branch attentive feature fusion module in the encoder and a unique adaptive feature selection module with feature map re-weighting in the decoder. Our AF2-S3Net fuses the voxel based learning and point-based learning into a single framework to effectively process the large 3D scene. Our experimental results show that the proposed method outperforms the state-of-the-art approaches on the large-scale SemanticKITTI benchmark, ranking 1st on the competitive public leaderboard competition upon publication.

preprint2021arXiv

Quantifying Spin-Orbit Torques in Antiferromagnet/Heavy Metal Heterostructures

The effect of spin currents on the magnetic order of insulating antiferromagnets (AFMs) is of fundamental interest and can enable new applications. Toward this goal, characterizing the spin-orbit torques (SOT) associated with AFM/heavy metal (HM) interfaces is important. Here we report the full angular dependence of the harmonic Hall voltages in a predominantly easy-plane AFM, epitaxial c-axis oriented $α$-Fe$_2$O$_3$ films, with an interface to Pt. By modeling the harmonic Hall signals together with the $α$-Fe$_2$O$_3$ magnetic parameters, we determine the amplitudes of field-like and damping-like SOT. Out-of-plane field scans are shown to be essential to determining the damping-like component of the torques. In contrast to ferromagnetic/heavy metal heterostructures, our results demonstrate that the field-like torques are significantly larger than the damping-like torques, which we correlate with the presence of a large imaginary component of the interface spin-mixing conductance. Our work demonstrates a direct way of characterizing SOT in AFM/HM heterostructures.

preprint2020arXiv

Current-induced CrI3 surface spin-flop transition probed by proximity magnetoresistance in Pt

By exploiting proximity coupling, we probe the spin state of the surface layers of CrI3, a van der Waals magnetic semiconductor, by measuring the induced magnetoresistance (MR) of Pt in Pt/CrI3 nano-devices. We fabricate the devices with clean and stable interfaces by placing freshly exfoliated CrI3 flake atop pre-patterned thin Pt strip and encapsulating the Pt/CrI3 heterostructure with hexagonal boron nitride (hBN) in a protected environment. In devices consisting of a wide range of CrI3 thicknesses (30 to 150 nm), we observe that an abrupt upward jump in Pt MR emerge at a 2 T magnetic field applied perpendicularly to the layers when the current density exceeds 2.5x10^10 A/m2, followed by a gradual decrease over a range of 5 T. These distinct MR features suggest a spin-flop transition which reveals strong antiferromagnetic interlayer coupling in the surface layers of CrI3. We study the current dependence by holding the Pt/CrI3 sample at approximately the same temperature to exclude the joule heating effect, and find that the MR jump increases with the current density, indicating a spin current origin. This spin current effect provides a new route to control spin configurations in insulating antiferromagnets, which is potentially useful for spintronic applications.

preprint2020arXiv

Evolutionary Multi-Objective Optimization Driven by Generative Adversarial Networks

Recently, more and more works have proposed to drive evolutionary algorithms using machine learning models.Usually, the performance of such model based evolutionary algorithms is highly dependent on the training qualities of the adopted models.Since it usually requires a certain amount of data (i.e. the candidate solutions generated by the algorithms) for model training, the performance deteriorates rapidly with the increase of the problem scales, due to the curse of dimensionality.To address this issue, we propose a multi-objective evolutionary algorithm driven by the generative adversarial networks (GANs).At each generation of the proposed algorithm, the parent solutions are first classified into \emph{real} and \emph{fake} samples to train the GANs; then the offspring solutions are sampled by the trained GANs.Thanks to the powerful generative ability of the GANs, our proposed algorithm is capable of generating promising offspring solutions in high-dimensional decision space with limited training data.The proposed algorithm is tested on 10 benchmark problems with up to 200 decision variables.Experimental results on these test problems demonstrate the effectiveness of the proposed algorithm.

preprint2020arXiv

Evolutionary Multiobjective Optimization Driven by Generative Adversarial Networks (GANs)

Recently, increasing works have proposed to drive evolutionary algorithms using machine learning models. Usually, the performance of such model based evolutionary algorithms is highly dependent on the training qualities of the adopted models. Since it usually requires a certain amount of data (i.e. the candidate solutions generated by the algorithms) for model training, the performance deteriorates rapidly with the increase of the problem scales, due to the curse of dimensionality. To address this issue, we propose a multi-objective evolutionary algorithm driven by the generative adversarial networks (GANs). At each generation of the proposed algorithm, the parent solutions are first classified into real and fake samples to train the GANs; then the offspring solutions are sampled by the trained GANs. Thanks to the powerful generative ability of the GANs, our proposed algorithm is capable of generating promising offspring solutions in high-dimensional decision space with limited training data. The proposed algorithm is tested on 10 benchmark problems with up to 200 decision variables. Experimental results on these test problems demonstrate the effectiveness of the proposed algorithm.

preprint2020arXiv

Hybrid Attention-Based Transformer Block Model for Distant Supervision Relation Extraction

With an exponential explosive growth of various digital text information, it is challenging to efficiently obtain specific knowledge from massive unstructured text information. As one basic task for natural language processing (NLP), relation extraction aims to extract the semantic relation between entity pairs based on the given text. To avoid manual labeling of datasets, distant supervision relation extraction (DSRE) has been widely used, aiming to utilize knowledge base to automatically annotate datasets. Unfortunately, this method heavily suffers from wrong labelling due to the underlying strong assumptions. To address this issue, we propose a new framework using hybrid attention-based Transformer block with multi-instance learning to perform the DSRE task. More specifically, the Transformer block is firstly used as the sentence encoder to capture syntactic information of sentences, which mainly utilizes multi-head self-attention to extract features from word level. Then, a more concise sentence-level attention mechanism is adopted to constitute the bag representation, aiming to incorporate valid information of each sentence to effectively represent the bag. Experimental results on the public dataset New York Times (NYT) demonstrate that the proposed approach can outperform the state-of-the-art algorithms on the evaluation dataset, which verifies the effectiveness of our model for the DSRE task.

preprint2020arXiv

Magnonic Su-Schrieffer-Heeger Model in Honeycomb Ferromagnets

Topological electronics has extended its richness to non-electronic systems where phonons and magnons can play the role of electrons. In particular, topological phases of magnons can be enabled by the Dzyaloshinskii-Moriya interaction (DMI) which acts as an effective spin-orbit coupling. We show that besides DMI, an alternating arrangement of Heisenberg exchange interactions critically determines the magnon band topology, realizing a magnonic analog of the Su-Schrieffer-Heeger model. On a honeycomb ferromagnet with perpendicular anisotropy, we calculate the topological phase diagram, the chiral edge states, and the associated magnon Hall effect by allowing the relative strength of exchange interactions on different links to be tunable. Including weak phonon-magnon hybridization does not change the result. Candidate materials are discussed.

preprint2020arXiv

Moiré magnons in twisted bilayer magnets with collinear order

We explore the moiré magnon bands in twisted bilayer magnets with next-nearest neighboring Dzyaloshinskii-Moriya interactions, assuming that the out-of-plane collinear magnetic order is preserved under weak interlayer coupling. By calculating the magnonic band structures and the topological Chern numbers for four representative cases, we find that (i) the valley moiré bands are extremely flat over a wide range of continuous twist angles; (ii) the topological Chern numbers of the lowest few flat bands vary significantly with the twist angle; and (iii) the lowest few topological flat bands in bilayer antiferromagnets entail nontrivial thermal spin transport in the transverse direction; These properties make twisted bilayer magnets an ideal platform to study the magnonic counterparts of moiré electrons, where the statistical distinction between magnons and electrons leads to fundamentally new physical behavior.

preprint2020arXiv

PDA: Progressive Data Augmentation for General Robustness of Deep Neural Networks

Adversarial images are designed to mislead deep neural networks (DNNs), attracting great attention in recent years. Although several defense strategies achieved encouraging robustness against adversarial samples, most of them fail to improve the robustness on common corruptions such as noise, blur, and weather/digital effects (e.g. frost, pixelate). To address this problem, we propose a simple yet effective method, named Progressive Data Augmentation (PDA), which enables general robustness of DNNs by progressively injecting diverse adversarial noises during training. In other words, DNNs trained with PDA are able to obtain more robustness against both adversarial attacks as well as common corruptions than the recent state-of-the-art methods. We also find that PDA is more efficient than prior arts and able to prevent accuracy drop on clean samples without being attacked. Furthermore, we theoretically show that PDA can control the perturbation bound and guarantee better generalization ability than existing work. Extensive experiments on many benchmarks such as CIFAR-10, SVHN, and ImageNet demonstrate that PDA significantly outperforms its counterparts in various experimental setups.

preprint2020arXiv

Sampled Training and Node Inheritance for Fast Evolutionary Neural Architecture Search

The performance of a deep neural network is heavily dependent on its architecture and various neural architecture search strategies have been developed for automated network architecture design. Recently, evolutionary neural architecture search (ENAS) has received increasing attention due to the attractive global optimization capability of evolutionary algorithms. However, ENAS suffers from extremely high computation costs because a large number of performance evaluations is usually required in evolutionary optimization and training deep neural networks is itself computationally very intensive. To address this issue, this paper proposes a new evolutionary framework for fast ENAS based on directed acyclic graph, in which parents are randomly sampled and trained on each mini-batch of training data. In addition, a node inheritance strategy is adopted to generate offspring individuals and their fitness is directly evaluated without training. To enhance the feature processing capability of the evolved neural networks, we also encode a channel attention mechanism in the search space. We evaluate the proposed algorithm on the widely used datasets, in comparison with 26 state-of-the-art peer algorithms. Our experimental results show the proposed algorithm is not only computationally much more efficiently, but also highly competitive in learning performance.

preprint2020arXiv

Spin fluctuations in quantized transport of magnetic topological insulators

In magnetic topological insulators, quantized electronic transport is interwined with spontaneous magnetic ordering, as magnetization controls band gaps, hence band topology, through the exchange interaction. We show that considering the exchange gaps at the mean-field level is inadequate to predict phase transitions between electronic states of distinct topology. Thermal spin fluctuations disturbing the magnetization can act as frozen disorders that strongly scatter electrons, reducing the onset temperature of quantized transport appreciably even in the absence of structural impurities. This effect, which has hitherto been overlooked, provides an alternative explanation of recent experiments on intrinsic magnetic topological insulators.

preprint2020arXiv

Subterahertz spin pumping from an insulating antiferromagnet

Spin-transfer torque and spin Hall effects combined with their reciprocal phenomena, spin-pumping and inverse spin Hall (ISHE) effects, enable the reading and control of magnetic moments in spintronics. The direct observation of these effects remains elusive in antiferromagnetic-based devices. We report sub-terahertz spin-pumping at the interface of a uniaxial insulating antiferromagnet MnF2 and platinum. The measured ISHE voltage arising from spin-charge conversion in the platinum layer depends on the chirality of the dynamical modes of the antiferromagnet, which is selectively excited and modulated by the handedness of the circularly polarized sub-THz irradiation. Our results open the door to the controlled generation of coherent pure spin currents at THz frequencies.

preprint2016arXiv

Anomalous Feedback and Negative Domain Wall Resistance

Magnetic induction can be regarded as a negative feedback effect, where the motive-force opposes the change of magnetic flux that generates the motive-force. In artificial electromagnetics emerging from spintronics, however, this is not necessarily the case. By studying the current-induced domain wall dynamics in a cylindrical nanowire, we show that the spin motive-force exerting on electrons can either oppose or support the applied current that drives the domain wall. The switching into the anomalous feedback regime occurs when the strength of the dissipative torque β is about twice the value of the Gilbert damping constant α. The anomalous feedback manifests as a negative domain wall resistance, which has an analogy with the water turbine.

preprint2016arXiv

Antiferromagnetic Spin Wave Field-Effect Transistor

In a collinear antiferromagnet with easy-axis anisotropy, symmetry dictates that the spin wave modes must be doubly degenerate. Theses two modes, distinguished by their opposite polarization and available only in antiferromagnets, give rise to a novel degree of freedom to encode and process information. We show that the spin wave polarization can be manipulated by an electric field induced Dzyaloshinskii-Moriya interaction and magnetic anisotropy. We propose a prototype spin wave field-effect transistor which realizes a gate-tunable magnonic analog of the Faraday effect, and demonstrate its application in THz signal modulation. Our findings open up the exciting possibility of digital data processing utilizing antiferromagnetic spin waves and enable the direct projection of optical computing concepts onto the mesoscopic scale.

preprint2016arXiv

Dynamic Feedback in Ferromagnet/Spin Hall Metal Heterostructures

In ferromagnet/normal metal heterostructures, spin pumping and spin-transfer torques are two reciprocal processes that occur concomitantly. Their interplay introduces a dynamic feedback effect interconnecting energy dissipation channels of both magnetization and current. By solving the spin diffusion process in the presence of the spin Hall effect in the normal metal, we show that the dynamic feedback gives rise to: (i) a nonlinear magnetic damping that is crucial to sustain uniform steady-state oscillations of a spin Hall oscillator at large angles. (ii) a frequency dependent spin Hall magnetoimpedance that reduces to the spin Hall magnetoresistance in the dc limit.

preprint2016arXiv

Spin Nernst Effect of Magnons in Collinear Antiferromagnets

In a collinear antiferromagnet with easy-axis anisotropy, symmetry guarantees that the spin wave modes are doubly degenerate. The two modes carry opposite spin angular momentum and exhibit opposite chirality. Using a honeycomb antiferromagnet in the presence of the Dzyaloshinskii-Moriya interaction, we show that a longitudinal temperature gradient can drive the two modes to opposite transverse directions, realizing a spin Nernst effect of magnons with vanishing thermal Hall current. We find that magnons around the Γ-point and the K-point contribute oppositely to the transverse spin transport, and their competition leads to a sign change of the spin Nernst coefficient at finite temperature. Possible material candidates are discussed.

preprint2016arXiv

Terahertz Antiferromagnetic Spin Hall Nano-Oscillator

We consider the current-induced dynamics of insulating antiferromagnets in a spin Hall geometry. Sufficiently large in-plane currents perpendicular to the Néel order trigger spontaneous oscillations at frequencies between the acoustic and the optical eigenmodes. The direction of the driving current determines the chirality of the excitation. When the current exceeds a threshold, the combined effect of spin pumping and current-induced torques introduces a dynamic feedback that sustains steady-state oscillations with amplitudes controllable via the applied current. The ac voltage output is calculated numerically as a function of the dc current input for different feedback strengths. Our findings open a route towards terahertz antiferromagnetic spin-torque oscillators.

preprint2015arXiv

Ultrafast Switching of Antiferromagnets via Spin-transfer Torque

Picosecond switching of the staggered antiferromagnetic order is shown to be realizable through spin-transfer torques from a short current pulse. The coupled dynamics of sublattice magnetization is mapped onto a classical pendulum subject to gravity and a driving pulse, where switching occurs if the pendulum acquires sufficient kinetic energy during the pulse to overcome the maximum of the effective gravity potential. The optimal switching scheme is explored through the dependence of switch angle and magnetic loss on the duration and strength of the current pulse. The physics discussed here provides a general route towards multi-functional THz applications via the spin-transfer torque in antiferromagnetic materials.

preprint2014arXiv

Dynamics of Antiferromagnets Driven by Spin Current

When a spin-polarized current flows through a ferromagnetic (FM) metal, angular momentum is transferred to the background magnetization via spin-transfer torques. In antiferromagnetic (AFM) materials, however, the corresponding problem is unsolved. We derive microscopically the dynamics of an AFM system driven by spin current generated by an attached FM polarizer, and find that the spin current exerts a driving force on the local staggered order parameter. The mechanism does not rely on the conservation of spin angular momentum, nor does it depend on the induced FM moments on top the AFM background. Two examples are studied: (i) A domain wall is accelerated to a terminal velocity by purely adiabatic effect where the Walker's break-down is avoided; and (ii) Spin injection modifies the AFM resonance frequency, and spin current injection triggers spin wave instability of local moments above a threshold.

preprint2014arXiv

Spin pumping and spin-transfer torques in antiferromagnets

Spin pumping and spin-transfer torques are two reciprocal phenomena widely studied in ferromagnetic materials. However, pumping from antiferromagnets and its relation to current-induced torques have not been explored. By calculating how electrons scatter off a normal metal-antiferromagnetic interface, we derive pumped spin and staggered spin currents in terms of the staggered field, the magnetization, and their rates of change. For both compensated and uncompensated interfaces, spin pumping is of a similar magnitude as in ferromagnets with a direction controlled by the polarization of the driving microwave. The pumped currents are connected to current-induced torques via Onsager reciprocity relations.

preprint2013arXiv

Microscopic derivation of Spin-transfer torques in ferromagnets

Spin-transfer torque (STT) provides key mechanisms for current-induced phenomena in ferromagnets. While it is widely accepted that STT involves both adiabatic and non-adiabatic contributions, their underlying physics and range of validity are quite controversial. By computing microscopically the response of conduction electron spins to a time varying and spatially inhomogeneous magnetic background, we derive the adiabatic and non-adiabatic STT in a unified fashion. Our result confirms the macroscopic theory [Phys. Rev. Lett. \textbf{93},~127204 (2004)] with all coefficients matched exactly. Our derivation also reveals a benchmark on the validity of the result, which is used to explain three recent measurements of the non-adiabatic STT in quite different settings.

preprint2013arXiv

Quantum Geometric Tensor (Fubini-Study Metric) in Simple Quantum System: A pedagogical Introduction

Geometric Quantum Mechanics is a novel and prospecting approach motivated by the belief that our world is ultimately geometrical. At the heart of that is a quantity called Quantum Geometric Tensor (or Fubini-Study metric), which is a complex tensor with the real part serving as the Riemannian metric that measures the `quantum distance', and the imaginary part being the Berry curvature. Following a physical introduction of the basic formalism, we illustrate its physical significance in both the adiabatic and non-adiabatic systems.

preprint2012arXiv

Adiabatic Electron Dynamics in Antiferromagnetic Texture

Adiabatic dynamics of conduction electrons in antiferromagnetic (AFM) materials with slowly varying spin texture is developed. Quite different from the ferromagnetic (FM) case, adiabaticity in AFM texture does not imply perfect alignment of conduction electron spins with background profile, instead, it introduces an internal dynamics between degenerate bands. As a result, the orbital motion of conduction electrons becomes spin-dependent and is affected by two emergent gauge fields: one of them is the non-Abelian version of what has been discovered in FM systems; the other leads to an anomalous velocity that has no FM counterpart. Two examples with experimental predictions are provided.

preprint2012arXiv

Electron Dynamics in Slowly Varying Antiferromagnetic Texture

Effective dynamics of conduction electrons in antiferromagnetic (AFM) materials with slowly varying spin texture is developed via non-Abelian gauge theory. Quite different from the ferromagnetic (FM) case, the spin of a conduction electron does not follow the background texture even in the adiabatic limit due to the accumulation of a SU(2) non-Abelian Berry phase. Correspondingly, it is found that the orbital dynamics becomes spin-dependent and is affected by two emergent gauge fields. While one of them is the non-Abelian generalization of what has been discovered in FM systems, the other leads to an anomalous velocity that has no FM counterpart. Two examples are provided to illustrate the distinctive spin dynamics of a conduction electron.

preprint2011arXiv

Brownian motion in superfluid $^4$He

We propose to study the Brownian motion of a classical microsphere submerged in superfluid $^4$He using the recent laser technology as a direct investigation of the thermal fluctuations of quasiparticles in the quantum fluid. By calculating the temperature dependence of both the friction coefficient and the strength of the random force, we show that the resonant mode of the fluctuational motion can be fully resolved by the present technology. Contrary to the previous work, it is found that the roton contribution is not negligible, and it even becomes dominant when the temperature is above 0.76\,K.

preprint2010arXiv

Equivalence of O(3) nonlinear sigma model and the CP1 model: A path integral approach

A rigorous proof is given on the equivalence of the O(3) nonlinear sigma model and the CP1 model via path integral approach.

Ran Cheng

What is connected

Connect this record

See the researcher in context

Building this map preview

43 published item(s)

A Systematic Survey on Large Language Models for Evolutionary Optimization: From Modeling to Solving

The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents

VAR-MATH: Probing True Mathematical Reasoning in LLMS via Symbolic Multi-Instance Benchmarks

Characterizing Spin-Orbit Torques by Tensorial Spin Hall Magnetoresistance

A Perspective on Magnon Spin Nernst Effect in Antiferromagnets

Bi-fidelity Evolutionary Multiobjective Search for Adversarially Robust Deep Neural Architectures

Exploiting Context Information for Generic Event Boundary Captioning

Field-Assisted Sub-Terahertz Spin Pumping and Auto-Oscillation in NiO

Manipulating Ferrimagnets by Fields and Currents

Multiobjective Test Problems with Degenerate Pareto Fronts

S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization

Semantic-Aware Pretraining for Dense Video Captioning

SoloGAN: Multi-domain Multimodal Unpaired Image-to-Image Translation via a Single Generative Adversarial Network

Surrogate-assisted Multi-objective Neural Architecture Search for Real-time Semantic Segmentation

Theory of Harmonic Hall Responses of Spin-Torque Driven Antiferromagnets

VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix

Voltage-driven exchange resonance achieving 100\% mechanical efficiency

(AF)2-S3Net: Attentive Feature Fusion with Adaptive Feature Selection for Sparse Semantic Segmentation Network

Quantifying Spin-Orbit Torques in Antiferromagnet/Heavy Metal Heterostructures

Current-induced CrI3 surface spin-flop transition probed by proximity magnetoresistance in Pt

Evolutionary Multi-Objective Optimization Driven by Generative Adversarial Networks

Evolutionary Multiobjective Optimization Driven by Generative Adversarial Networks (GANs)

Hybrid Attention-Based Transformer Block Model for Distant Supervision Relation Extraction

Magnonic Su-Schrieffer-Heeger Model in Honeycomb Ferromagnets

Moiré magnons in twisted bilayer magnets with collinear order

PDA: Progressive Data Augmentation for General Robustness of Deep Neural Networks

Sampled Training and Node Inheritance for Fast Evolutionary Neural Architecture Search

Spin fluctuations in quantized transport of magnetic topological insulators

Subterahertz spin pumping from an insulating antiferromagnet

Anomalous Feedback and Negative Domain Wall Resistance

Antiferromagnetic Spin Wave Field-Effect Transistor

Dynamic Feedback in Ferromagnet/Spin Hall Metal Heterostructures

Spin Nernst Effect of Magnons in Collinear Antiferromagnets

Terahertz Antiferromagnetic Spin Hall Nano-Oscillator

Ultrafast Switching of Antiferromagnets via Spin-transfer Torque

Dynamics of Antiferromagnets Driven by Spin Current

Spin pumping and spin-transfer torques in antiferromagnets

Microscopic derivation of Spin-transfer torques in ferromagnets

Quantum Geometric Tensor (Fubini-Study Metric) in Simple Quantum System: A pedagogical Introduction

Adiabatic Electron Dynamics in Antiferromagnetic Texture

Electron Dynamics in Slowly Varying Antiferromagnetic Texture

Brownian motion in superfluid $^4$He

Equivalence of O(3) nonlinear sigma model and the CP1 model: A path integral approach