Researcher profile

Lu Han

Lu Han contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2026arXiv

Integrated Multivariate Segmentation Tree for Heterogeneous Credit Data Analysis in Small- and Medium-Sized Enterprises

Traditional decision tree models, which rely exclusively on numerical variables, often face challenges in handling high-dimensional data and are limited in their ability to incorporate textual information effectively. To address these limitations, we propose the integrated multivariate segmentation tree (IMST), a comprehensive framework designed to improve credit evaluation for small- and medium-sized enterprises (SMEs) by integrating financial data with textual sources. This method comprises three core stages: (1) transforming textual data into numerical matrices through matrix factorization, (2) selecting salient financial features using Lasso regression, and (3) constructing a multivariate segmentation tree based on either the Gini index or entropy, with weakest-link pruning applied to control model complexity. Experimental results based on a dataset of 1,428 Chinese SMEs demonstrated that IMST achieved an accuracy rate of 88.9%, surpassing both baseline decision trees (87.4%) and conventional models such as support vector machines and neural networks. Furthermore, the proposed model demonstrated superior interpretability and computational efficiency, featuring a more streamlined architecture and improved risk detection capabilities.

preprint2023arXiv

On Pseudo-Labeling for Class-Mismatch Semi-Supervised Learning

When there are unlabeled Out-Of-Distribution (OOD) data from other classes, Semi-Supervised Learning (SSL) methods suffer from severe performance degradation and even get worse than merely training on labeled data. In this paper, we empirically analyze Pseudo-Labeling (PL) in class-mismatched SSL. PL is a simple and representative SSL method that transforms SSL problems into supervised learning by creating pseudo-labels for unlabeled data according to the model's prediction. We aim to answer two main questions: (1) How do OOD data influence PL? (2) What is the proper usage of OOD data with PL? First, we show that the major problem of PL is imbalanced pseudo-labels on OOD data. Second, we find that OOD data can help classify In-Distribution (ID) data given their OOD ground truth labels. Based on the findings, we propose to improve PL in class-mismatched SSL with two components -- Re-balanced Pseudo-Labeling (RPL) and Semantic Exploration Clustering (SEC). RPL re-balances pseudo-labels of high-confidence data, which simultaneously filters out OOD data and addresses the imbalance problem. SEC uses balanced clustering on low-confidence data to create pseudo-labels on extra classes, simulating the process of training with ground truth. Experiments show that our method achieves steady improvement over supervised baseline and state-of-the-art performance under all class mismatch ratios on different benchmarks.

preprint2022arXiv

Abnormal Signal Recognition with Time-Frequency Spectrogram: A Deep Learning Approach

With the increasingly complex and changeable electromagnetic environment, wireless communication systems are facing jamming and abnormal signal injection, which significantly affects the normal operation of a communication system. In particular, the abnormal signals may emulate the normal signals, which makes it very challenging for abnormal signal recognition. In this paper, we propose a new abnormal signal recognition scheme, which combines time-frequency analysis with deep learning to effectively identify synthetic abnormal communication signals. Firstly, we emulate synthetic abnormal communication signals including seven jamming patterns. Then, we model an abnormal communication signals recognition system based on the communication protocol between the transmitter and the receiver. To improve the performance, we convert the original signal into the time-frequency spectrogram to develop an image classification algorithm. Simulation results demonstrate that the proposed method can effectively recognize the abnormal signals under various parameter configurations, even under low signal-to-noise ratio (SNR) and low jamming-to-signal ratio (JSR) conditions.

preprint2022arXiv

Approximate the individually fair k-center with outliers

In this paper, we propose and investigate the individually fair $k$-center with outliers (IF$k$CO). In the IF$k$CO, we are given an $n$-sized vertex set in a metric space, as well as integers $k$ and $q$. At most $k$ vertices can be selected as the centers and at most $q$ vertices can be selected as the outliers. The centers are selected to serve all the not-an-outlier (i.e., served) vertices. The so-called individual fairness constraint restricts that every served vertex must have a selected center not too far way. More precisely, it is supposed that there exists at least one center among its $\lceil (n-q) / k \rceil$ closest neighbors for every served vertex. Because every center serves $(n-q) / k$ vertices on the average. The objective is to select centers and outliers, assign every served vertex to some center, so as to minimize the maximum fairness ratio over all served vertices, where the fairness ratio of a vertex is defined as the ratio between its distance with the assigned center and its distance with a $\lceil (n - q )/k \rceil_{\rm th}$ closest neighbor. As our main contribution, a 4-approximation algorithm is presented, based on which we develop an improved algorithm from a practical perspective.

preprint2022arXiv

Electron transfer under the Floquet modulation in donor-bridge-acceptor systems

Electron transfer (ET) processes are of broad interest in modern chemistry. With the advancements of experimental techniques, one may modulate the ET via such as the light-matter interactions. In this work, we study the ET under a Floquet modulation occurring in the donor-bridge-acceptor systems, with the rate kernels projected out from the exact disspaton equation of motion formalism. This together with the Floquet theorem enables us to investigate the interplay between the intrinsic non-Markovianity and the driving periodicity. The observed rate kernel exhibits a Herzberg-Teller-like mechanism induced by the bridge fluctuation subject to effective modulation.

preprint2022arXiv

Revisiting Unsupervised Meta-Learning via the Characteristics of Few-Shot Tasks

Meta-learning has become a practical approach towards few-shot image classification, where "a strategy to learn a classifier" is meta-learned on labeled base classes and can be applied to tasks with novel classes. We remove the requirement of base class labels and learn generalizable embeddings via Unsupervised Meta-Learning (UML). Specifically, episodes of tasks are constructed with data augmentations from unlabeled base classes during meta-training, and we apply embedding-based classifiers to novel tasks with labeled few-shot examples during meta-test. We observe two elements play important roles in UML, i.e., the way to sample tasks and measure similarities between instances. Thus we obtain a strong baseline with two simple modifications -- a sufficient sampling strategy constructing multiple tasks per episode efficiently together with a semi-normalized similarity. We then take advantage of the characteristics of tasks from two directions to get further improvements. First, synthesized confusing instances are incorporated to help extract more discriminative embeddings. Second, we utilize an additional task-specific embedding transformation as an auxiliary component during meta-training to promote the generalization ability of the pre-adapted embeddings. Experiments on few-shot learning benchmarks verify that our approaches outperform previous UML methods and achieve comparable or even better performance than its supervised variants.

preprint2020arXiv

Giant Polarization and Abnormal Flexural Deformation in Bent Freestanding Perovskite Oxides

Recent realizations of ultrathin freestanding perovskite oxides offer a unique platform to probe novel properties in two-dimensional oxides. Here, we observed a giant flexoelectric response in freestanding BiFeO3 and SrTiO3 in their bent state arising from strain gradients up to 4x10e7/m, suggesting a promising approach for realizing extremely large polarizations. Additionally, a substantial reversible change in thickness was discovered in bent freestanding BiFeO3, which implies an unusual bending-expansion/shrinkage and thickness-dependence Poisson's ratios in this ferroelectric membrane that has never been seen before in crystalline materials. Our theoretical modeling reveals that this unprecedented flexural deformation within the membrane is attributable to a flexoelectricity-piezoelectricity interplay. The finding unveils intriguing nanoscale electromechanical properties and provides guidance for their practical applications in flexible nanoelectromechanical systems.

preprint2019arXiv

Stochastic Equation of Motion Approach to Fermionic Dissipative Dynamics. I. Formalism

In this work, we establish formally exact stochastic equations of motion (SEOM) theory to describe the dissipative dynamics of fermionic open systems. The construction of the SEOM is based on a stochastic decoupling of the dissipative interaction between the system and fermionic environment, and the influence of environmental fluctuations on the reduced system dynamics is characterized by stochastic Grassmann fields. Meanwhile, numerical realization of the time-dependent Grassmann fields has remained a long-standing challenge. To solve this problem, we propose a minimal auxiliary space (MAS) mapping scheme, with which the stochastic Grassmann fields are represented by conventional c-number fields along with a set of pseudo-levels. This eventually leads to a numerically feasible MAS-SEOM method. The important properties of the MAS-SEOM are analyzed by making connection to the well-established time-dependent perturbation theory and the hierarchical equations of motion (HEOM) theory. The MAS-SEOM method provides a potentially promising approach for accurate and efficient simulation of fermionic open systems at ultra-low temperatures.

preprint2019arXiv

Stochastic Equation of Motion Approach to Fermionic Dissipative Dynamics. II. Numerical Implementation

This paper provides a detailed account of the numerical implementation of the stochastic equation of motion (SEOM) method for the dissipative dynamics of fermionic open quantum systems. To enable direct stochastic calculations, a minimal auxiliary space (MAS) mapping scheme is adopted, with which the time-dependent Grassmann fields are represented by c-numbers noises and a set of pseudo-operators. We elaborate on the construction of the system operators and pseudo-operators involved in the MAS-SEOM, along with the analytic expression for the particle current. The MASSEOM is applied to study the relaxation and voltage-driven dynamics of quantum impurity systems described by the single-level Anderson impurity model, and the numerical results are benchmarked against those of the highly accurate hierarchical equations of motion (HEOM) method. The advantages and limitations of the present MAS-SEOM approach are discussed extensively.