Source author record

Lei Han

Lei Han appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision cond-mat.mtrl-sci Artificial Intelligence Computation and Language Machine Learning physics.ao-ph hep-ph physics.gen-ph physics.geo-ph physics.optics

Catalog footprint

What is connected

15works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2024arXiv

Identification of Regulatory Requirements Relevant to Business Processes: A Comparative Study on Generative AI, Embedding-based Ranking, Crowd and Expert-driven Methods

Organizations face the challenge of ensuring compliance with an increasing amount of requirements from various regulatory documents. Which requirements are relevant depends on aspects such as the geographic location of the organization, its domain, size, and business processes. Considering these contextual factors, as a first step, relevant documents (e.g., laws, regulations, directives, policies) are identified, followed by a more detailed analysis of which parts of the identified documents are relevant for which step of a given business process. Nowadays the identification of regulatory requirements relevant to business processes is mostly done manually by domain and legal experts, posing a tremendous effort on them, especially for a large number of regulatory documents which might frequently change. Hence, this work examines how legal and domain experts can be assisted in the assessment of relevant requirements. For this, we compare an embedding-based NLP ranking method, a generative AI method using GPT-4, and a crowdsourced method with the purely manual method of creating relevancy labels by experts. The proposed methods are evaluated based on two case studies: an Australian insurance case created with domain experts and a global banking use case, adapted from SAP Signavio's workflow example of an international guideline. A gold standard is created for both BPMN2.0 processes and matched to real-world textual requirements from multiple regulatory documents. The evaluation and discussion provide insights into strengths and weaknesses of each method regarding applicability, automation, transparency, and reproducibility and provide guidelines on which method combinations will maximize benefits for given characteristics such as process usage, impact, and dynamics of an application scenario.

preprint2022arXiv

Controllable anomalous Nernst effect in an antiperovskite antiferromagnet

Anomalous Nernst effect (ANE), the generation of a transverse electric voltage by a longitudinal temperature gradient, has attracted increasing interests of researchers recently, due to its potential in the thermoelectric power conversion and close relevance to the Berry curvature of the band structure. Avoiding the stray field of ferromagnets, ANE in antiferromagnets (AFM) has the advantage of realizing highly efficient and densely integrated thermopiles. Here, we report the observation of ANE in an antiperovskite noncollinear AFM Mn3SnN experimentally, which is triggered by the enhanced Berry curvature from Weyl points located close to the Fermi level. Considering that antiperovskite Mn3SnN has rich magnetic phase transition, we modulate the noncollinear AFM configurations by the biaxial strain, which enables us to control its ANE. Our findings provide a potential class of materials to explore the Weyl physics of noncollinear AFM as well as realizing antiferromagnetic spin caloritronics that exhibits promising prospects for energy conversion and information processing.

preprint2022arXiv

Observation of spin splitting torque in a collinear antiferromagnet RuO2

Current-induced spin torques provide efficient data writing approaches for magnetic memories. Recently, the spin splitting torque (SST) was theoretically predicted (R. González-Hernández et al. Phys. Rev. Lett. 126, 127701 (2021)), which combines advantages of conventional spin transfer torque (STT) and spin-orbit torque (SOT) as well as enables controllable spin polarization. Here we provide the experimental evidence of SST in collinear antiferromagnet RuO2 films. The spin current direction is found to be correlated to the crystal orientation of RuO2 and the spin polarization direction is dependent on (parallel to) the Néel vector. These features are quite characteristic for the predicted SST. Our finding not only present a new member for the spin torques besides traditional STT and SOT, but also proposes a promising spin source RuO2 for spintronics.

preprint2022arXiv

Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL

Solving goal-conditioned tasks with sparse rewards using self-supervised learning is promising because of its simplicity and stability over current reinforcement learning (RL) algorithms. A recent work, called Goal-Conditioned Supervised Learning (GCSL), provides a new learning framework by iteratively relabeling and imitating self-generated experiences. In this paper, we revisit the theoretical property of GCSL -- optimizing a lower bound of the goal reaching objective, and extend GCSL as a novel offline goal-conditioned RL algorithm. The proposed method is named Weighted GCSL (WGCSL), in which we introduce an advanced compound weight consisting of three parts (1) discounted weight for goal relabeling, (2) goal-conditioned exponential advantage weight, and (3) best-advantage weight. Theoretically, WGCSL is proved to optimize an equivalent lower bound of the goal-conditioned RL objective and generates monotonically improved policies via an iterated scheme. The monotonic property holds for any behavior policies, and therefore WGCSL can be applied to both online and offline settings. To evaluate algorithms in the offline goal-conditioned RL setting, we provide a benchmark including a range of point and simulated robot domains. Experiments in the introduced benchmark demonstrate that WGCSL can consistently outperform GCSL and existing state-of-the-art offline methods in the fully offline goal-conditioned setting.

preprint2021arXiv

Cluster magnetic octupole induced out-of-plane spin polarization in antiperovskite antiferromagnet

Out-of-plane spin polarization σ_z has attracted increasing interests of researchers recently, due to its potential in high-density and low-power spintronic devices. Noncollinear antiferromagnet (AFM), which has unique 120° triangular spin configuration, has been discovered to possess σ_z. However, the physical origin of σ_z in noncollinear AFM is still not clear, and the external magnetic field-free switching of perpendicular magnetic layer using the corresponding σ_z has not been reported yet. Here, we use the cluster magnetic octupole in antiperovskite AFM Mn3SnN to demonstrate the generation of σ_z. σ_z is induced by the precession of carrier spins when currents flow through the cluster magnetic octupole, which also relies on the direction of the cluster magnetic octupole in conjunction with the applied current. With the aid of σ_z, current induced spin-orbit torque (SOT) switching of adjacent perpendicular ferromagnet is realized without external magnetic field. Our findings present a new perspective to the generation of out-of-plane spin polarizations via noncollinear AFM spin structure, and provide a potential path to realize ultrafast high-density applications.

preprint2021arXiv

Magnon-mediated interlayer coupling in an all-antiferromagnetic junction

The interlayer coupling mediated by fermions in ferromagnets brings about parallel and anti-parallel magnetization orientations of two magnetic layers, resulting in the giant magnetoresistance, which forms the foundation in spintronics and accelerates the development of information technology. However, the interlayer coupling mediated by another kind of quasi-particle, boson, is still lacking. Here we demonstrate such a static interlayer coupling at room temperature in an antiferromagnetic junction Fe2O3/Cr2O3/Fe2O3, where the two antiferromagnetic Fe2O3 layers are functional materials and the antiferromagnetic Cr2O3 layer serves as a spacer. The Néel vectors in the top and bottom Fe2O3 are strongly orthogonally coupled, which is bridged by a typical bosonic excitation (magnon) in the Cr2O3 spacer. Such an orthogonally coupling exceeds the category of traditional collinear interlayer coupling via fermions in ground state, reflecting the fluctuating nature of the magnons, as supported by our magnon quantum well model. Besides the fundamental significance on the quasi-particle-mediated interaction, the strong coupling in an antiferromagnetic magnon junction makes it a realistic candidate for practical antiferromagnetic spintronics and magnonics with ultrahigh-density integration.

preprint2020arXiv

A Random Gossip BMUF Process for Neural Language Modeling

Neural network language model (NNLM) is an essential component of industrial ASR systems. One important challenge of training an NNLM is to leverage between scaling the learning process and handling big data. Conventional approaches such as block momentum provides a blockwise model update filtering (BMUF) process and achieves almost linear speedups with no performance degradation for speech recognition. However, it needs to calculate the model average from all computing nodes (e.g., GPUs) and when the number of computing nodes is large, the learning suffers from the severe communication latency. As a consequence, BMUF is not suitable under restricted network conditions. In this paper, we present a decentralized BMUF process, in which the model is split into different components, each of which is updated by communicating to some randomly chosen neighbor nodes with the same component, followed by a BMUF-like process. We apply this method to several LSTM language modeling tasks. Experimental results show that our approach achieves consistently better performance than conventional BMUF. In particular, we obtain a lower perplexity than the single-GPU baseline on the wiki-text-103 benchmark using 4 GPUs. In addition, no performance degradation is observed when scaling to 8 and 16 GPUs.

preprint2020arXiv

Convolutional Neural Network for Convective Storm Nowcasting Using 3D Doppler Weather Radar Data

Convective storms are one of the severe weather hazards found during the warm season. Doppler weather radar is the only operational instrument that can frequently sample the detailed structure of convective storm which has a small spatial scale and short lifetime. For the challenging task of short-term convective storm forecasting, 3-D radar images contain information about the processes in convective storm. However, effectively extracting such information from multisource raw data has been problematic due to a lack of methodology and computation limitations. Recent advancements in deep learning techniques and graphics processing units now make it possible. This article investigates the feasibility and performance of an end-to-end deep learning nowcasting method. The nowcasting problem was transformed into a classification problem first, and then, a deep learning method that uses a convolutional neural network was presented to make predictions. On the first layer of CNN, a cross-channel 3D convolution was proposed to fuse 3D raw data. The CNN method eliminates the handcrafted feature engineering, i.e., the process of using domain knowledge of the data to manually design features. Operationally produced historical data of the Beijing-Tianjin-Hebei region in China was used to train the nowcasting system and evaluate its performance; 3737332 samples were collected in the training data set. The experimental results show that the deep learning method improves nowcasting skills compared with traditional machine learning methods.

preprint2020arXiv

GFF: Gated Fully Fusion for Semantic Segmentation

Semantic segmentation generates comprehensive understanding of scenes through densely predicting the category for each pixel. High-level features from Deep Convolutional Neural Networks already demonstrate their effectiveness in semantic segmentation tasks, however the coarse resolution of high-level features often leads to inferior results for small/thin objects where detailed information is important. It is natural to consider importing low level features to compensate for the lost detailed information in high-level features.Unfortunately, simply combining multi-level features suffers from the semantic gap among them. In this paper, we propose a new architecture, named Gated Fully Fusion (GFF), to selectively fuse features from multiple levels using gates in a fully connected way. Specifically, features at each level are enhanced by higher-level features with stronger semantics and lower-level features with more details, and gates are used to control the propagation of useful information which significantly reduces the noises during fusion. We achieve the state of the art results on four challenging scene parsing datasets including Cityscapes, Pascal Context, COCO-stuff and ADE20K.

preprint2020arXiv

Hybrid vector beams with non-uniform orbital angular momentum density induced by designed azimuthal polarization gradient

Based on angular amplitude modulation of orthogonal base vectors in common-path interference method, we propose an interesting type of hybrid vector beams with unprecedented azimuthal polarization gradient and demonstrate in experiment. Distinct to previously reported types, the synthetic hybrid vector beams exhibit geometrically intriguing projection tracks of angular polarization state on Poincare sphere, more than just conventional circles. More noteworthily, the designed azimuthal polarization gradients are found to be able to induce azimuthally non-uniform orbital angular momentum density, while generally uniform for circle-track cases, immersing in homogenous intensity background whatever base states are. Moreover, via tailoring relevant parameters, more special polarization mapping tracks can be handily achieved. These peculiar features may open alternative routes for new optical effects and applications.

preprint2020arXiv

OccuSeg: Occupancy-aware 3D Instance Segmentation

3D instance segmentation, with a variety of applications in robotics and augmented reality, is in large demands these days. Unlike 2D images that are projective observations of the environment, 3D models provide metric reconstruction of the scenes without occlusion or scale ambiguity. In this paper, we define "3D occupancy size", as the number of voxels occupied by each instance. It owns advantages of robustness in prediction, on which basis, OccuSeg, an occupancy-aware 3D instance segmentation scheme is proposed. Our multi-task learning produces both occupancy signal and embedding representations, where the training of spatial and feature embeddings varies with their difference in scale-aware. Our clustering scheme benefits from the reliable comparison between the predicted occupancy size and the clustered occupancy size, which encourages hard samples being correctly clustered and avoids over segmentation. The proposed approach achieves state-of-the-art performance on 3 real-world datasets, i.e. ScanNetV2, S3DIS and SceneNN, while maintaining high efficiency.

preprint2020arXiv

Triaging moderate COVID-19 and other viral pneumonias from routine blood tests

The COVID-19 is sweeping the world with deadly consequences. Its contagious nature and clinical similarity to other pneumonias make separating subjects contracted with COVID-19 and non-COVID-19 viral pneumonia a priority and a challenge. However, COVID-19 testing has been greatly limited by the availability and cost of existing methods, even in developed countries like the US. Intrigued by the wide availability of routine blood tests, we propose to leverage them for COVID-19 testing using the power of machine learning. Two proven-robust machine learning model families, random forests (RFs) and support vector machines (SVMs), are employed to tackle the challenge. Trained on blood data from 208 moderate COVID-19 subjects and 86 subjects with non-COVID-19 moderate viral pneumonia, the best result is obtained in an SVM-based classifier with an accuracy of 84%, a sensitivity of 88%, a specificity of 80%, and a precision of 92%. The results are found explainable from both machine learning and medical perspectives. A privacy-protected web portal is set up to help medical personnel in their practice and the trained models are released for developers to further build other applications. We hope our results can help the world fight this pandemic and welcome clinical verification of our approach on larger populations.

preprint2019arXiv

Application of Multi-channel 3D-cube Successive Convolution Network for Convective Storm Nowcasting

Convective storm nowcasting has attracted substantial attention in various fields. Existing methods under a deep learning framework rely primarily on radar data. Although they perform nowcast storm advection well, it is still challenging to nowcast storm initiation and growth, due to the limitations of the radar observations. This paper describes the first attempt to nowcast storm initiation, growth, and advection simultaneously under a deep learning framework using multi-source meteorological data. To this end, we present a multi-channel 3D-cube successive convolution network (3D-SCN). As real-time re-analysis meteorological data can now provide valuable atmospheric boundary layer thermal dynamic information, which is essential to predict storm initiation and growth, both raw 3D radar and re-analysis data are used directly without any handcraft feature engineering. These data are formulated as multi-channel 3D cubes, to be fed into our network, which are convolved by cross-channel 3D convolutions. By stacking successive convolutional layers without pooling, we build an end-to-end trainable model for nowcasting. Experimental results show that deep learning methods achieve better performance than traditional extrapolation methods. The qualitative analyses of 3D-SCN show encouraging results of nowcasting of storm initiation, growth, and advection.

preprint2016arXiv

Action2Activity: Recognizing Complex Activities from Sensor Data

As compared to simple actions, activities are much more complex, but semantically consistent with a human's real life. Techniques for action recognition from sensor generated data are mature. However, there has been relatively little work on bridging the gap between actions and activities. To this end, this paper presents a novel approach for complex activity recognition comprising of two components. The first component is temporal pattern mining, which provides a mid-level feature representation for activities, encodes temporal relatedness among actions, and captures the intrinsic properties of activities. The second component is adaptive Multi-Task Learning, which captures relatedness among activities and selects discriminant features. Extensive experiments on a real-world dataset demonstrate the effectiveness of our work.

preprint2015arXiv

Structure Group and Fermion-Mass-Term in General Nonlocality

In our previous work [J. Math. Phys. 49, 033513 (2008)] two problems remain to be resolved. One is that we lack a minimal group to replace GL(4,C), the other is that the Equation of Motion (EoM) for fermion has no mass term. After careful investigation we find these two problems are linked by conformal group, a subgroup of GL(4,C) group. The Weyl group, a subgroup of conformal group, can bring about the running of mass, charge etc. while making it responsible for the transformation of interaction vertex. However, once concerning the generation of the mass term in EoM, we have to resort to the whole conformal group, in which the generators $K_μ$ play a crucial role in making vacuum vary from space-like (or light-cone-like)to time-like. Physically the starting points are our previous conclusion, $\vec E^2-\vec B^2\neq 0$ for massive bosons, and the two-photon process yielding $e^+ e^-$ pair. Finally we get to the conclusion that the mass term of strong interaction is linearly relevant to (chromo-)magnetic flux as well as angular momentum.

Lei Han

What is connected

Connect this record

See the researcher in context

Building this map preview

15 published item(s)

Identification of Regulatory Requirements Relevant to Business Processes: A Comparative Study on Generative AI, Embedding-based Ranking, Crowd and Expert-driven Methods

Controllable anomalous Nernst effect in an antiperovskite antiferromagnet

Observation of spin splitting torque in a collinear antiferromagnet RuO2

Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL

Cluster magnetic octupole induced out-of-plane spin polarization in antiperovskite antiferromagnet

Magnon-mediated interlayer coupling in an all-antiferromagnetic junction

A Random Gossip BMUF Process for Neural Language Modeling

Convolutional Neural Network for Convective Storm Nowcasting Using 3D Doppler Weather Radar Data

GFF: Gated Fully Fusion for Semantic Segmentation

Hybrid vector beams with non-uniform orbital angular momentum density induced by designed azimuthal polarization gradient

OccuSeg: Occupancy-aware 3D Instance Segmentation

Triaging moderate COVID-19 and other viral pneumonias from routine blood tests

Application of Multi-channel 3D-cube Successive Convolution Network for Convective Storm Nowcasting

Action2Activity: Recognizing Complex Activities from Sensor Data

Structure Group and Fermion-Mass-Term in General Nonlocality