Researcher profile

Wei Yuan

Wei Yuan contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
16topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2026arXiv

Reasoning Pattern Alignment Merging for Adaptive Reasoning

Recent large reasoning models (LRMs) have made substantial progress in complex reasoning tasks, yet they often generate lengthy reasoning paths for every query, incurring unnecessary computation and latency. Existing speed-up approaches typically rely on retraining the model or designing sophisticated prompting, which are either prohibitively expensive or highly sensitive to the input and prompt formulation. In this work, we study model merging as a lightweight alternative for efficient reasoning: by combining a long chain-of-thought (Long-CoT) reasoning model with a Short-CoT instruction model, we obtain an adaptive reasoner without training from scratch or requiring large-scale additional data. Building on this idea, we propose Reasoning Pattern Alignment Merging (RPAM), a layer-wise model merging framework based on feature alignment to facilitate query-adaptive reasoning. RPAM first constructs a small pattern-labeled calibration set that assigns each query an appropriate reasoning pattern. It then optimizes layer-wise merging coefficients by aligning the merged model's intermediate representations with those of the selected model, while a contrastive objective explicitly pushes them away from the non-selected model. Experiments on seven widely used reasoning benchmarks show that RPAM substantially reduces inference cost while maintaining strong performance. Upon article acceptance, we will provide open-source code to reproduce experiments for RPAM.

preprint2024arXiv

Co-universal $C^{\ast}$-algebras for product systems over finite aligned subcategories of groupoids

The product systems over left cancellative small categories are introduced and studied in this paper. We also introduce the notion of compactly aligned product systems over finite aligned left cancellative small categories and its Nica covariant representations. The existence of co-universal algebras for injective, gauge-compatible, Nica covariant representations of compactly aligned product systems over finite aligned subcategories of groupoids is proved in this paper.

preprint2024arXiv

ROIC-DM: Robust Text Inference and Classification via Diffusion Model

While language models have made many milestones in text inference and classification tasks, they remain susceptible to adversarial attacks that can lead to unforeseen outcomes. Existing works alleviate this problem by equipping language models with defense patches. However, these defense strategies often rely on impractical assumptions or entail substantial sacrifices in model performance. Consequently, enhancing the resilience of the target model using such defense mechanisms is a formidable challenge. This paper introduces an innovative model for robust text inference and classification, built upon diffusion models (ROIC-DM). Benefiting from its training involving denoising stages, ROIC-DM inherently exhibits greater robustness compared to conventional language models. Moreover, ROIC-DM can attain comparable, and in some cases, superior performance to language models, by effectively incorporating them as advisory components. Extensive experiments conducted with several strong textual adversarial attacks on three datasets demonstrate that (1) ROIC-DM outperforms traditional language models in robustness, even when the latter are fortified with advanced defense mechanisms; (2) ROIC-DM can achieve comparable and even better performance than traditional language models by using them as advisors.

preprint2022arXiv

CSCNet: Contextual Semantic Consistency Network for Trajectory Prediction in Crowded Spaces

Trajectory prediction aims to predict the movement trend of the agents like pedestrians, bikers, vehicles. It is helpful to analyze and understand human activities in crowded spaces and widely applied in many areas such as surveillance video analysis and autonomous driving systems. Thanks to the success of deep learning, trajectory prediction has made significant progress. The current methods are dedicated to studying the agents' future trajectories under the social interaction and the sceneries' physical constraints. Moreover, how to deal with these factors still catches researchers' attention. However, they ignore the \textbf{Semantic Shift Phenomenon} when modeling these interactions in various prediction sceneries. There exist several kinds of semantic deviations inner or between social and physical interactions, which we call the "\textbf{Gap}". In this paper, we propose a \textbf{C}ontextual \textbf{S}emantic \textbf{C}onsistency \textbf{Net}work (\textbf{CSCNet}) to predict agents' future activities with powerful and efficient context constraints. We utilize a well-designed context-aware transfer to obtain the intermediate representations from the scene images and trajectories. Then we eliminate the differences between social and physical interactions by aligning activity semantics and scene semantics to cross the Gap. Experiments demonstrate that CSCNet performs better than most of the current methods quantitatively and qualitatively.

preprint2022arXiv

Curved versions of the Ovsienko-Redou operators

We give a complete classification of tangential bidifferential operators of total order at most $n$ which are expressed purely in terms of the Laplacian on the ambient space of an $n$-dimensional manifold. This gives a curved analogue of the classification, due to Ovsienko--Redou and Clerc, of conformally invariant bidifferential operators on the sphere. As an application, we construct a large class of formally self-adjoint conformally invariant differential operators.

preprint2022arXiv

Decorrelated Local Linear Estimator: Inference for Non-linear Effects in High-dimensional Additive Models

Additive models play an essential role in studying non-linear relationships. Despite many recent advances in estimation, there is a lack of methods and theories for inference in high-dimensional additive models, including confidence interval construction and hypothesis testing. Motivated by inference for non-linear treatment effects, we consider the high-dimensional additive model and make inference for the derivative of the function of interest. We propose a novel decorrelated local linear estimator and establish its asymptotic normality. The main novelty is the construction of the decorrelation weights, which is instrumental in reducing the error inherited from estimating the nuisance functions in the high-dimensional additive model. We construct the confidence interval for the function derivative and conduct the related hypothesis testing. We demonstrate our proposed method over large-scale simulation studies and apply it to identify non-linear effects in the motif regression problem. Our proposed method is implemented in the R package \texttt{DLL} available from CRAN.

preprint2022arXiv

Hardware Acceleration of Sampling Algorithms in Sample and Aggregate Graph Neural Networks

Sampling is an important process in many GNN structures in order to train larger datasets with a smaller computational complexity. However, compared to other processes in GNN (such as aggregate, backward propagation), the sampling process still costs tremendous time, which limits the speed of training. To reduce the time of sampling, hardware acceleration is an ideal choice. However, state of the art GNN acceleration proposal did not specify how to accelerate the sampling process. What's more, directly accelerating traditional sampling algorithms will make the structure of the accelerator very complicated. In this work, we made two contributions: (1) Proposed a new neighbor sampler: CONCAT Sampler, which can be easily accelerated on hardware level while guaranteeing the test accuracy. (2) Designed a CONCAT-sampler-accelerator based on FPGA, with which the neighbor sampling process boosted to about 300-1000 times faster compared to the sampling process without it.

preprint2022arXiv

Kernelized Similarity Learning and Embedding for Dynamic Texture Synthesis

Dynamic texture (DT) exhibits statistical stationarity in the spatial domain and stochastic repetitiveness in the temporal dimension, indicating that different frames of DT possess a high similarity correlation that is critical prior knowledge. However, existing methods cannot effectively learn a promising synthesis model for high-dimensional DT from a small number of training data. In this paper, we propose a novel DT synthesis method, which makes full use of similarity prior knowledge to address this issue. Our method bases on the proposed kernel similarity embedding, which not only can mitigate the high-dimensionality and small sample issues, but also has the advantage of modeling nonlinear feature relationship. Specifically, we first raise two hypotheses that are essential for DT model to generate new frames using similarity correlation. Then, we integrate kernel learning and extreme learning machine into a unified synthesis model to learn kernel similarity embedding for representing DT. Extensive experiments on DT videos collected from the internet and two benchmark datasets, i.e., Gatech Graphcut Textures and Dyntex, demonstrate that the learned kernel similarity embedding can effectively exhibit the discriminative representation for DT. Accordingly, our method is capable of preserving the long-term temporal continuity of the synthesized DT sequences with excellent sustainability and generalization. Meanwhile, it effectively generates realistic DT videos with fast speed and low computation, compared with the state-of-the-art methods. The code and more synthesis videos are available at our project page https://shiming-chen.github.io/Similarity-page/Similarit.html.

preprint2022arXiv

On cleanness of von Neumann algebras

A unital ring is called clean (resp. strongly clean) if every element can be written as the sum of an invertible element and an idempotent (resp. an invertible element and an idempotent that commutes). T.Y. Lam proposed a question: which von Neumann algebras are clean as rings? In this paper, we characterize strongly clean von Neumann algebras and prove that all finite von Neumann algebras and all separable infinite factors are clean.

preprint2022arXiv

Online Trajectory Prediction for Metropolitan Scale Mobility Digital Twin

Knowing "what is happening" and "what will happen" of the mobility in a city is the building block of a data-driven smart city system. In recent years, mobility digital twin that makes a virtual replication of human mobility and predicting or simulating the fine-grained movements of the subjects in a virtual space at a metropolitan scale in near real-time has shown its great potential in modern urban intelligent systems. However, few studies have provided practical solutions. The main difficulties are four-folds. 1) The daily variation of human mobility is hard to model and predict; 2) the transportation network enforces a complex constraints on human mobility; 3) generating a rational fine-grained human trajectory is challenging for existing machine learning models; and 4) making a fine-grained prediction incurs high computational costs, which is challenging for an online system. Bearing these difficulties in mind, in this paper we propose a two-stage human mobility predictor that stratifies the coarse and fine-grained level predictions. In the first stage, to encode the daily variation of human mobility at a metropolitan level, we automatically extract citywide mobility trends as crowd contexts and predict long-term and long-distance movements at a coarse level. In the second stage, the coarse predictions are resolved to a fine-grained level via a probabilistic trajectory retrieval method, which offloads most of the heavy computations to the offline phase. We tested our method using a real-world mobile phone GPS dataset in the Kanto area in Japan, and achieved good prediction accuracy and a time efficiency of about 2 min in predicting future 1h movements of about 220K mobile phone users on a single machine to support more higher-level analysis of mobility prediction.

preprint2022arXiv

Some constructions of formally self-adjoint conformally covariant polydifferential operators

We introduce the notion of formally self-adjoint conformally covariant polydifferential operators and give some constructions of families of such operators. In one direction, we show that any homogeneous conformally variational scalar Riemannian invariant (CVI) induces one of these operators. In another direction, we use the ambient metric to give alternative constructions of certain operators produced this way, one of which is a formally self-adjoint, fourth-order, conformally covariant tridifferential operator which should be regarded as the simplest fully nonlinear analogue of the Paneitz operator.

preprint2022arXiv

View Vertically: A Hierarchical Network for Trajectory Prediction via Fourier Spectrums

Understanding and forecasting future trajectories of agents are critical for behavior analysis, robot navigation, autonomous cars, and other related applications. Previous methods mostly treat trajectory prediction as time sequence generation. Different from them, this work studies agents' trajectories in a "vertical" view, i.e., modeling and forecasting trajectories from the spectral domain. Different frequency bands in the trajectory spectrums could hierarchically reflect agents' motion preferences at different scales. The low-frequency and high-frequency portions could represent their coarse motion trends and fine motion variations, respectively. Accordingly, we propose a hierarchical network V$^2$-Net, which contains two sub-networks, to hierarchically model and predict agents' trajectories with trajectory spectrums. The coarse-level keypoints estimation sub-network first predicts the "minimal" spectrums of agents' trajectories on several "key" frequency portions. Then the fine-level spectrum interpolation sub-network interpolates the spectrums to reconstruct the final predictions. Experimental results display the competitiveness and superiority of V$^2$-Net on both ETH-UCY benchmark and the Stanford Drone Dataset.

preprint2021arXiv

Deformations of Q-curvature II

This is the second article of a sequence of research on deformations of Q-curvature. In the previous one, we studied local stability and rigidity phenomena of Q-curvature. In this article, we mainly investigate the volume comparison with respect to Q-curvature. In particular, we show that volume comparison theorem holds for metrics close to strictly stable positive Einstein metrics. This result shows that Q-curvature can still control the volume of manifolds under certain conditions, which provides a fundamental geometric characterization of Q-curvature. Applying the same technique, we derive the local rigidity of strictly stable Ricci-flat manifolds with respect to Q-curvature, which shows the non-existence of metrics with positive Q-curvature near the reference metric.

preprint2021arXiv

Volume comparison with respect to scalar curvature

In this article, we investigate the volume comparison with respect to scalar curvature. In particular, we show volume comparison holds for small geodesic balls of metrics near a V-static metric. For closed manifold, we prove the volume comparison for metrics near a strictly stable Einstein metric. As applications, we give a partial answer to a conjecture of Bray and recover a result of Besson, Courtois and Gallot, which partially confirms a conjecture of Schoen about closed hyperbolic manifold. Applying analogous techniques, we obtain a different proof of a local rigidity result due to Dai, Wang and Wei, which shows it admits no metric with positive scalar curvature near strictly stable Ricci-flat metrics.

preprint2020arXiv

Spin Seebeck Effect near the Antiferromagnetic Spin-Flop Transition

We develop a low-temperature, long-wavelength theory for the interfacial spin Seebeck effect (SSE) in easy-axis antiferromagnets. The field-induced spin-flop (SF) transition of Néel order is associated with a qualitative change in SSE behavior: Below SF, there are two spin carriers with opposite magnetic moments, with the carriers polarized along the field forming a majority magnon band. Above SF, the low-energy, ferromagnetic-like mode has magnetic moment opposite the field. This results in a sign change of the SSE across SF, which agrees with recent measurements on Cr$_2$O$_3$/Pt and Cr$_2$O$_3$/Ta devices [Li $\textit{et al.,}$ $\textit{Nature}$ $\textbf{578,}$ 70 (2020)]. In our theory, SSE is due to a Néel spin current below SF and a magnetic spin current above SF. Using the ratio of the associated Néel to magnetic spin-mixing conductances as a single constant fitting parameter, we reproduce the field dependence of the experimental data and partially the temperature dependence of the relative SSE jump across SF.

preprint2019arXiv

Pointed Drinfeld center functor

In this work, using the functoriality of Drinfeld center of fusion categories, we generalize an earlier result on the functoriality of full center of simple separable algebras in a fixed fusion category to all fusion categories. This generalization produces a new center functor, which involves both Drinfeld center and full center and will be called the pointed Drinfeld center functor. We prove that this pointed Drinfeld center functor is a symmetric monoidal equivalence. It turns out that this functor provides a precise and rather complete mathematical formulation of the boundary-bulk relation of 1+1D rational conformal field theories (RCFT). In this process, we solve an old problem of computing the fusion of two 0D (or 1D) wall CFT's along a non-trivial 1+1D bulk RCFT.