Source author record

Longfei Li

Longfei Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Computer Vision math.NA Numerical Analysis Artificial Intelligence Biological Physics eess.IV Information Retrieval math.KT math.RA Quantitative Methods

Catalog footprint

What is connected

13works

11topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

$A_{\infty}$-structures on the additive decomposition of the Tate-Hochschild cohomology of a finite group algebra

Firstly, for a finite group algebra, we provide a computational framework $\widehat{m}_n$ for the Tate-Hochschild cochain complex in terms of the additive decomposition, by decomposing each planar n-ary tree into local two children and local three children. Secondly, we give all $\widehat{m}_2$ formulas of the Tate-Hochschild cochain complex in terms of the additive decomposition. Thirdly, we give explicit $A_{\infty}$-multiplication formulas for both the Hochschild cochain complex and the Hochschild chain complex under additive decompositions. Finally, we give $A_{\infty}$-multiplication formulas in the context of abelian groups.

preprint2026arXiv

SpatialTree: How Spatial Abilities Branch Out in MLLMs

Cognitive science suggests that spatial ability develops progressively-from perception to reasoning and interaction. Yet in multimodal LLMs (MLLMs), this hierarchy remains poorly understood, as most studies focus on a narrow set of tasks. We introduce SpatialTree, a cognitive-science-inspired hierarchy that organizes spatial abilities into four levels: low-level perception (L1), mental mapping (L2), simulation (L3), and agentic competence (L4). Based on this taxonomy, we construct the first capability-centric hierarchical benchmark, thoroughly evaluating mainstream MLLMs across 27 sub-abilities. The evaluation results reveal a clear structure: L1 skills are largely orthogonal, whereas higher-level skills are strongly correlated, indicating increasing interdependency. Through targeted supervised fine-tuning, we uncover a surprising transfer dynamic-negative transfer within L1, but strong cross-level transfer from low- to high-level abilities with notable synergy. Finally, we explore how to improve the entire hierarchy. We find that naive RL that encourages extensive "thinking" is unreliable: it helps complex reasoning but hurts intuitive perception. We propose a simple auto-think strategy that suppresses unnecessary deliberation, enabling RL to consistently improve performance across all levels. By building SpatialTree, we provide a proof-of-concept framework for understanding and systematically scaling spatial abilities in MLLMs.

preprint2022arXiv

Expert Knowledge-guided Geometric Representation Learning for Magnetic Resonance Imaging-based Glioma Grading

Radiomics and deep learning have shown high popularity in automatic glioma grading. Radiomics can extract hand-crafted features that quantitatively describe the expert knowledge of glioma grades, and deep learning is powerful in extracting a large number of high-throughput features that facilitate the final classification. However, the performance of existing methods can still be improved as their complementary strengths have not been sufficiently investigated and integrated. Furthermore, lesion maps are usually needed for the final prediction at the testing phase, which is very troublesome. In this paper, we propose an expert knowledge-guided geometric representation learning (ENROL) framework . Geometric manifolds of hand-crafted features and learned features are constructed to mine the implicit relationship between deep learning and radiomics, and therefore to dig mutual consent and essential representation for the glioma grades. With a specially designed manifold discrepancy measurement, the grading model can exploit the input image data and expert knowledge more effectively in the training phase and get rid of the requirement of lesion segmentation maps at the testing phase. The proposed framework is flexible regarding deep learning architectures to be utilized. Three different architectures have been evaluated and five models have been compared, which show that our framework can always generate promising results.

preprint2021arXiv

Cross-Domain Recommendation: Challenges, Progress, and Prospects

To address the long-standing data sparsity problem in recommender systems (RSs), cross-domain recommendation (CDR) has been proposed to leverage the relatively richer information from a richer domain to improve the recommendation performance in a sparser domain. Although CDR has been extensively studied in recent years, there is a lack of a systematic review of the existing CDR approaches. To fill this gap, in this paper, we provide a comprehensive review of existing CDR approaches, including challenges, research progress, and future directions. Specifically, we first summarize existing CDR approaches into four types, including single-target CDR, multi-domain recommendation, dual-target CDR, and multi-target CDR. We then present the definitions and challenges of these CDR approaches. Next, we propose a full-view categorization and new taxonomies on these approaches and report their research progress in detail. In the end, we share several promising research directions in CDR.

preprint2020arXiv

A split-step finite-element method for incompressible Navier-Stokes equations with high-order accuracy up-to the boundary

An efficient and accurate finite-element algorithm is described for the numerical solution of the incompressible Navier-Stokes (INS) equations. The new algorithm that solves the INS equations in a velocity-pressure reformulation is based on a split-step scheme in conjunction with the standard finite-element method. The split-step scheme employed for the temporal discretization of our algorithm completely separates the pressure updates from the solution of velocity variables. When the pressure equation is formed explicitly, the algorithm avoids solving a saddle-point problem; therefore, our algorithm has more flexibility in choosing finite-element spaces. For efficiency and robustness, Lagrange finite elements of equal order for both velocity and pressure are used. Motivated by a post-processing technique that calculates derivatives of a finite element solution with super-convergent error estimates, an alternative numerical boundary condition is proposed for the pressure equation at the discrete level. The new numerical pressure boundary condition that can be regarded as a better implementation of the compatibility boundary condition improves the boundary-layer errors of the pressure solution. Normal-mode analysis is performed using a simplified model problem on a uniform mesh to demonstrate the numerical properties of our methods. Convergence study using $\mathbb{P}_1$ elements confirms the analytical results and demonstrates that our algorithm with the new numerical boundary condition achieves the optimal second-order accuracy for both velocity and pressure up-to the boundary. Benchmark problems are also computed and carefully compared with existing studies. Finally, as an example to illustrate that our approach can be easily adapted for higher-order finite elements, we solve the classical flow-past-a-cylinder problem using $\mathbb{P}_n$ finite elements with $n\geq 1$.

preprint2020arXiv

A Time Attention based Fraud Transaction Detection Framework

With online payment platforms being ubiquitous and important, fraud transaction detection has become the key for such platforms, to ensure user account safety and platform security. In this work, we present a novel method for detecting fraud transactions by leveraging patterns from both users' static profiles and users' dynamic behaviors in a unified framework. To address and explore the information of users' behaviors in continuous time spaces, we propose to use \emph{time attention based recurrent layers} to embed the detailed information of the time interval, such as the durations of specific actions, time differences between different actions and sequential behavior patterns,etc., in the same latent space. We further combine the learned embeddings and users' static profiles altogether in a unified framework. Extensive experiments validate the effectiveness of our proposed methods over state-of-the-art methods on various evaluation metrics, especially on \emph{recall at top percent} which is an important metric for measuring the balance between service experiences and risk of potential losses.

preprint2020arXiv

Distributed Deep Forest and its Application to Automatic Detection of Cash-out Fraud

Internet companies are facing the need for handling large-scale machine learning applications on a daily basis and distributed implementation of machine learning algorithms which can handle extra-large scale tasks with great performance is widely needed. Deep forest is a recently proposed deep learning framework which uses tree ensembles as its building blocks and it has achieved highly competitive results on various domains of tasks. However, it has not been tested on extremely large scale tasks. In this work, based on our parameter server system, we developed the distributed version of deep forest. To meet the need for real-world tasks, many improvements are introduced to the original deep forest model, including MART (Multiple Additive Regression Tree) as base learners for efficiency and effectiveness consideration, the cost-based method for handling prevalent class-imbalanced data, MART based feature selection for high dimension data and different evaluation metrics for automatically determining of the cascade level. We tested the deep forest model on an extra-large scale task, i.e., automatic detection of cash-out fraud, with more than 100 millions of training samples. Experimental results showed that the deep forest model has the best performance according to the evaluation metrics from different perspectives even with very little effort for parameter tuning. This model can block fraud transactions in a large amount of money each day. Even compared with the best-deployed model, the deep forest model can additionally bring into a significant decrease in economic loss each day.

preprint2020arXiv

Interpretable MTL from Heterogeneous Domains using Boosted Tree

Multi-task learning (MTL) aims at improving the generalization performance of several related tasks by leveraging useful information contained in them. However, in industrial scenarios, interpretability is always demanded, and the data of different tasks may be in heterogeneous domains, making the existing methods unsuitable or unsatisfactory. In this paper, following the philosophy of boosted tree, we proposed a two-stage method. In stage one, a common model is built to learn the commonalities using the common features of all instances. Different from the training of conventional boosted tree model, we proposed a regularization strategy and an early-stopping mechanism to optimize the multi-task learning process. In stage two, started by fitting the residual error of the common model, a specific model is constructed with the task-specific instances to further boost the performance. Experiments on both benchmark and real-world datasets validate the effectiveness of the proposed method. What's more, interpretability can be naturally obtained from the tree based method, satisfying the industrial needs.

preprint2020arXiv

Knowledge Consistency between Neural Networks and Beyond

This paper aims to analyze knowledge consistency between pre-trained deep neural networks. We propose a generic definition for knowledge consistency between neural networks at different fuzziness levels. A task-agnostic method is designed to disentangle feature components, which represent the consistent knowledge, from raw intermediate-layer features of each neural network. As a generic tool, our method can be broadly used for different applications. In preliminary experiments, we have used knowledge consistency as a tool to diagnose representations of neural networks. Knowledge consistency provides new insights to explain the success of existing deep-learning techniques, such as knowledge distillation and network compression. More crucially, knowledge consistency can also be used to refine pre-trained networks and boost performance.

preprint2020arXiv

SAFE: Scalable Automatic Feature Engineering Framework for Industrial Tasks

Machine learning techniques have been widely applied in Internet companies for various tasks, acting as an essential driving force, and feature engineering has been generally recognized as a crucial tache when constructing machine learning systems. Recently, a growing effort has been made to the development of automatic feature engineering methods, so that the substantial and tedious manual effort can be liberated. However, for industrial tasks, the efficiency and scalability of these methods are still far from satisfactory. In this paper, we proposed a staged method named SAFE (Scalable Automatic Feature Engineering), which can provide excellent efficiency and scalability, along with requisite interpretability and promising performance. Extensive experiments are conducted and the results show that the proposed method can provide prominent efficiency and competitive effectiveness when comparing with other methods. What's more, the adequate scalability of the proposed method ensures it to be deployed in large scale industrial tasks.

preprint2020arXiv

Stable and accurate numerical methods for generalized Kirchhoff-Love plates

Efficient and accurate numerical algorithms are developed to solve a generalized Kirchhoff-Love plate model subject to three common physical boundary conditions: (i) clamped; (ii) simply supported; and (iii) free. We solve the model equation by discretizing the spatial derivatives using second-order finite-difference schemes, and then advancing the semi-discrete problem in time with either an explicit predictor-corrector or an implicit Newmark-Beta time-stepping algorithm. Stability analysis is conducted for the schemes and the results are used to determine stable time steps in practice. A series of carefully chosen test problems are solved to demonstrate the properties and applications of our numerical approaches. The numerical results confirm the stability and 2nd-order accuracy of the algorithms, and are also comparable with experiments for similar thin plates. As an application, we illustrate a strategy to identify the natural frequencies of a plate using our numerical methods in conjunction with a fast Fourier transformation (FFT) power spectrum analysis of the computed data. Then we take advantage of one of the computed natural frequencies to simulate the interesting physical phenomena known as resonance and beat for a generalized Kirchhoff-Love plate.

preprint2017arXiv

Numerical methods for thermally stressed shallow shell equations

We develop efficient and accurate numerical methods to solve a class of shallow shell problems of the von Karman type. The governing equations form a fourth-order coupled system of nonlinear biharnomic equations for the transverse deflection and Airy's stress function. A second-order finite difference discretization with three iterative methods (Picard, Newton and Trust-Region Dogleg) are proposed for the numerical solution of the nonlinear PDE system. Three simple boundary conditions and two application-motivated mixed boundary conditions are considered. Along with the nonlinearity of the system, boundary singularities that appear when mixed boundary conditions are specified are the main numerical challenges. Two approaches that use either a transition function or local corrections are developed to deal with these boundary singularities. All the proposed numerical methods are validated using carefully designed numerical tests, where expected orders of accuracy and rates of convergence are observed. A rough run-time performance comparison is also conducted to illustrate the efficiency of our methods. As an application of the methods, a snap-through thermal buckling problem is considered. The critical thermal loads of shell buckling with various boundary conditions are numerically calculated, and snap-through bifurcation curves are also obtained using our numerical methods together with a pseudo-arclength continuation method. Our results are consistent with previous studies.

preprint2016arXiv

Computed Flow and Fluorescence Over the Ocular Surface

Fluorescein is perhaps the most commonly used substance to visualize tear film thickness and dynamics; better understanding of this process aids understanding of dry eye syndrome which afflicts millions of people. We study a mathematical model for tear film flow, evaporation, solutal transport and fluorescence over the exposed ocular surface during the interblink. Transport of the fluorescein ion by fluid flow in the tear film affects the intensity of fluorescence via changes in concentration and tear film thickness. Evaporation causes increased osmolarity and potential irritation over the ocular surface; it also alters fluorescein concentration and thus fluorescence. Using thinning rates from in vivo measurements together with thin film equations for flow and transport of multiple solutes, we compute dynamic results for tear film quantities of interest. We compare our computed intensity distributions with in vivo observations. A number of experimental features are recovered by the model.

Longfei Li

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

$A_{\infty}$-structures on the additive decomposition of the Tate-Hochschild cohomology of a finite group algebra

SpatialTree: How Spatial Abilities Branch Out in MLLMs

Expert Knowledge-guided Geometric Representation Learning for Magnetic Resonance Imaging-based Glioma Grading

Cross-Domain Recommendation: Challenges, Progress, and Prospects

A split-step finite-element method for incompressible Navier-Stokes equations with high-order accuracy up-to the boundary

A Time Attention based Fraud Transaction Detection Framework

Distributed Deep Forest and its Application to Automatic Detection of Cash-out Fraud

Interpretable MTL from Heterogeneous Domains using Boosted Tree

Knowledge Consistency between Neural Networks and Beyond

SAFE: Scalable Automatic Feature Engineering Framework for Industrial Tasks

Stable and accurate numerical methods for generalized Kirchhoff-Love plates

Numerical methods for thermally stressed shallow shell equations

Computed Flow and Fluorescence Over the Ocular Surface