Source author record

Yifei Sun

Yifei Sun appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence cond-mat.soft Methodology Computation and Language Computer Vision cond-mat.mtrl-sci eess.SP Applications cond-mat.mes-hall cond-mat.str-el econ.TH eess.SY physics.app-ph physics.geo-ph physics.optics physics.soc-ph Populations and Evolution Social and Information Networks Systems and Control

Catalog footprint

What is connected

19works

19topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A self-evolving multi-role collaborative framework with fine-grained difficulty guidance for innovative mathematical problem generation

Mathematical problem generation (MPG) is a significant research direction in the field of intelligent education. In recent years, the rapid development of large language models (LLMs) has enabled new technological approaches to problem-generation tasks. Although existing LLMs can achieve high correctness rates, they generally lack innovation and exhibit poor discrimination. In this paper, we propose the task of innovative math problem generation (IMPG). To solve the IMPG task, this paper proposes a self-evolving, multi-role collaborative framework with fine-grained difficulty guidance. First, a multi-role collaborative mechanism comprising a sampler, generator, evaluator, state machine, and memory is constructed, ensuring the correctness of generated problems through iterative optimization informed by self-assessment and external feedback. Second, we introduce an improved difficulty model to quantify difficulty and provide fine-grained guidance. We adopt the data-driven association-guided path sampling (DAPS) algorithm to enhance the semantic rationality of sampled encodings. Third, we construct the HSM3K-CN dataset, which comprises high-quality high school math problems. A multi-stage training pipeline is adopted, incorporating continual pre-training (CPT), supervised fine-tuning (SFT), and group relative policy optimization (GRPO), to enhance the generation and evaluation capabilities of the base model. Finally, system self-evolution is achieved by transferring evaluation capabilities from the expert model to the apprentice model via distillation. Experiments show that, compared to baseline models, our proposed method significantly improves the innovation of the generated problems while maintaining a high correctness rate.

preprint2026arXiv

ReBrain: Brain MRI Reconstruction from Sparse CT Slice via Retrieval-Augmented Diffusion

Magnetic Resonance Imaging (MRI) plays a crucial role in brain disease diagnosis, but it is not always feasible for certain patients due to physical or clinical constraints. Recent studies attempt to synthesize MRI from Computed Tomography (CT) scans; however, low-dose protocols often result in highly sparse CT volumes with poor through-plane resolution, making accurate reconstruction of the full brain MRI volume particularly challenging. To address this, we propose ReBrain, a retrieval-augmented diffusion framework for brain MRI reconstruction. Given any 3D CT scan with limited slices, we first employ a Brownian Bridge Diffusion Model (BBDM) to synthesize MRI slices along the 2D dimension. Simultaneously, we retrieve structurally and pathologically similar CT slices from a comprehensive prior database via a fine-tuned retrieval model. These retrieved slices are used as references, incorporated through a ControlNet branch to guide the generation of intermediate MRI slices and ensure structural continuity. We further account for rare retrieval failures when the database lacks suitable references and apply spherical linear interpolation to provide supplementary guidance. Extensive experiments on SynthRAD2023 and BraTS demonstrate that ReBrain achieves state-of-the-art performance in cross-modal reconstruction under sparse conditions.

preprint2026arXiv

Self-Evolving Spatial Reasoning in Vision Language Models via Geometric Logic Consistency

Vision-Language Models (VLMs) have made striking progress, yet their spatial reasoning remains fragile: models that answer an original input correctly can still fail under paired transformations with predictable answer mappings, revealing a gap between instance-level correctness and robust spatial reasoning. To address this, we propose Spatial Alignment via Geometric Evolution (SAGE), a self-evolving framework that enforces logical consistency in VLMs through geometric and linguistic duality operations. SAGE incorporates duality consistency as an auxiliary reward within GRPO training, encouraging models to produce logically coherent answers across original and transformed inputs. A dynamic operation pool continuously probes for inconsistencies, promoting challenging operations and retiring mastered ones, so that training focuses on the most informative signals. SAGE is model-agnostic, data-efficient compared to prior GRPO methods, and can be applied as a lightweight post-training stage to any existing VLM. Experiments on video and spatial reasoning benchmarks demonstrate consistent improvements over strong baselines and enhanced generalization to unseen data.

preprint2026arXiv

Table as a Modality for Large Language Models

To migrate the remarkable successes of Large Language Models (LLMs), the community has made numerous efforts to generalize them to the table reasoning tasks for the widely deployed tabular data. Despite that, in this work, by showing a probing experiment on our proposed StructQA benchmark, we postulate that even the most advanced LLMs (such as GPTs) may still fall short of coping with tabular data. More specifically, the current scheme often simply relies on serializing the tabular data, together with the meta information, then inputting them through the LLMs. We argue that the loss of structural information is the root of this shortcoming. In this work, we further propose TAMO, which bears an ideology to treat the tables as an independent modality integrated with the text tokens. The resulting model in TAMO is a multimodal framework consisting of a hypergraph neural network as the global table encoder seamlessly integrated with the mainstream LLM. Empirical results on various benchmarking datasets, including HiTab, WikiTQ, WikiSQL, FeTaQA, and StructQA, have demonstrated significant improvements on generalization with an average relative gain of 42.65%.

preprint2023arXiv

An Indoor Environment Sensing and Localization System via mmWave Phased Array

An indoor layout sensing and localization system in 60GHz millimeter wave (mmWave) band, named mmReality, is elaborated in this paper. The mmReality system consists of one transmitter and one mobile receiver, each with a phased array and a single radio frequency (RF) chain. To reconstruct the room layout, the pilot signal is delivered from the transmitter to the receiver via different pairs of transmission and receiving beams, so that the signals at all antenna elements can be resolved. Then, the spatial smoothing and two-dimensional multiple signal classification (MUSIC) algorithm is applied to detect the angle-of-arrival (AoAs) and angle-of-departure (AoDs) of the rays from the transmitter to the receiver. Moreover, the technique of multi-carrier ranging is adopted to measure the distance of each propagation path. Synthesizing the above geometrical parameters, the location of receiver relative to the transmitter can be pinpointed, both line-of-sight (LoS) and non-line-of-sight (NLoS) paths can also be determined. Therefore, the room layout can be reconstructed by moving the receiver and repeating the above measurement in different locations of the room. At the end, we show that the reconstructed room layout can be utilized to locate a mobile device according to its AoA spectrum, even with single access point.

preprint2022arXiv

Dynamic Risk Prediction Triggered by Intermediate Events Using Survival Tree Ensembles

With the availability of massive amounts of data from electronic health records and registry databases, incorporating time-varying patient information to improve risk prediction has attracted great attention. To exploit the growing amount of predictor information over time, we develop a unified framework for landmark prediction using survival tree ensembles, where an updated prediction can be performed when new information becomes available. Compared to conventional landmark prediction with fixed landmark times, our methods allow the landmark times to be subject-specific and triggered by an intermediate clinical event. Moreover, the nonparametric approach circumvents the thorny issue of model incompatibility at different landmark times. In our framework, both the longitudinal predictors and the event time outcome are subject to right censoring, and thus existing tree-based approaches cannot be directly applied. To tackle the analytical challenges, we propose a risk-set-based ensemble procedure by averaging martingale estimating equations from individual trees. Extensive simulation studies are conducted to evaluate the performance of our methods. The methods are applied to the Cystic Fibrosis Patient Registry (CFFPR) data to perform dynamic prediction of lung disease in cystic fibrosis patients and to identify important prognosis factors.

preprint2022arXiv

Energy-efficient transmission policies for the linear quadratic control of scalar systems

This paper considers controlled scalar systems relying on a lossy wireless feedback channel. In contrast with the existing literature, the focus is not on the system controller but on the wireless transmit power controller that is implemented at the system side for reporting the state to the controller. Such a problem may be of interest, \emph{e.g.}, for the remote control of drones, where communication costs may have to be considered. Determining the power control policy that minimizes the combination of the dynamical system cost and the wireless transmission energy is shown to be a non-trivial optimization problem. It turns out that the recursive structure of the problem can be exploited to determine the optimal power control policy. As illustrated in the numerical performance analysis, in the scenario of a dynamics without perturbations, the optimal power control policy consists in decreasing the transmit power at the right pace. This allows a significant performance gain compared to conventional policies such as the full transmit power policy or the open-loop policy.

preprint2022arXiv

Maskin Meets Abreu and Matsushima

The theory of full implementation has been criticized for using integer/modulo games which admit no equilibrium (Jackson (1992)). To address the critique, we revisit the classical Nash implementation problem due to Maskin (1999) but allow for the use of lotteries and monetary transfers as in Abreu and Matsushima (1992, 1994). We unify the two well-established but somewhat orthogonal approaches in full implementation theory. We show that Maskin monotonicity is a necessary and sufficient condition for (exact) mixed-strategy Nash implementation by a finite mechanism. In contrast to previous papers, our approach possesses the following features: finite mechanisms (with no integer or modulo game) are used; mixed strategies are handled explicitly; neither undesirable outcomes nor transfers occur in equilibrium; the size of transfers can be made arbitrarily small; and our mechanism is robust to information perturbations. Finally, our result can be extended to infinite/continuous settings and ordinal settings.

preprint2022arXiv

Passive Motion Detection via mmWave Communication System

In this paper, an integrated passive sensing and communication system working in 60 GHz band is elaborated, and the sensing performance is investigated in an application of hand gesture recognition. Specifically, in this integrated system, there are two radio frequency (RF) chains at the receiver and one at the transmitter. Each RF chain is connected with one phased array for analog beamforming. To facilitate simultaneous sensing and communication, the transmitter delivers one stream of information-bearing signals via two beam lobes, one is aligned with the main signal propagation path and the other is directed to the sensing target. Signals from the two lobes are received by the two RF chains at the receiver, respectively. By cross ambiguity coherent processing, the time-Doppler spectrograms of hand gestures can be obtained. Relying on the passive sensing system, a dataset of received signals, where three types of hand gestures are sensed, is collected by using Line-of-Sight (LoS) and Non-Line-of-Sight (NLoS) paths as the reference channel respectively. Then a neural network is trained by the dataset for motion detection. It is shown that the classification accuracy rate is high as long as sufficient sensing time is assured. Finally, an empirical model characterizing the relation between the classification accuracy and sensing duration is derived analytically.

preprint2022arXiv

Wavelength-by-wavelength temperature-independent thermal radiation utilizing an insulator-metal transition

Both the magnitude and spectrum of the blackbody-radiation distribution change with temperature. Here, we designed the temperature-dependent spectral emissivity of a coating to counteract all the changes in the blackbody-radiation distribution over a certain temperature range, enabled by the nonhysteretic insulator-to-metal phase transition of SmNiO3. At each wavelength within the long-wave infrared atmospheric-transparency window, the thermal radiance of our coating remains nearly constant over a temperature range of at least 20 °C. Our approach can conceal thermal gradients and transient temperature changes from infrared imaging systems, including those that discriminate by wavelength, such as multispectral and hyperspectral cameras.

preprint2022arXiv

Who is next: rising star prediction via diffusion of user interest in social networks

Finding items with potential to increase sales is of great importance in online market. In this paper, we propose to study this novel and practical problem: rising star prediction. We call these potential items Rising Star, which implies their ability to rise from low-turnover items to best-sellers in the future. Rising stars can be used to help with unfair recommendation in e-commerce platform, balance supply and demand to benefit the retailers and allocate marketing resources rationally. Although the study of rising star can bring great benefits, it also poses challenges to us. The sales trend of rising star fluctuates sharply in the short-term and exhibits more contingency caused by some external events (e.g., COVID-19 caused increasing purchase of the face mask) than other items, which cannot be solved by existing sales prediction methods. To address above challenges, in this paper, we observe that the presence of rising stars is closely correlated with the early diffusion of user interest in social networks, which is validated in the case of Taocode (an intermediary that diffuses user interest in Taobao). Thus, we propose a novel framework, RiseNet, to incorporate the user interest diffusion process with the item dynamic features to effectively predict rising stars. Specifically, we adopt a coupled mechanism to capture the dynamic interplay between items and user interest, and a special designed GNN based framework to quantify user interest. Our experimental results on large-scale real-world datasets provided by Taobao demonstrate the effectiveness of our proposed framework.

preprint2021arXiv

X-ray nanoimaging of crystal defects in single grains of solid-state electrolyte AlxLi7-3xLa3Zr2O12

All-solid-state lithium batteries promise significant improvements in energy density and safety over traditional liquid electrolyte batteries. The Al-doped AlxLi7-3xLa3Zr2O12 (LLZO) solid-state electrolyte shows excellent potential given its high ionic conductivity and good thermal, chemical, and electrochemical stability. Nevertheless, further improvements on LLZOs electrochemical and mechanical properties call for an incisive understanding of its local microstructure. Here, we employ Bragg Coherent Diffractive Imaging to investigate the atomic displacements inside single grains of LLZO with various Al-doping concentrations, resulting in cubic, tetragonal, and cubic-tetragonal mixed structures. We observe coexisting domains of different crystallographic orientations in the tetragonal structure. We further show that Al doping leads to crystal defects such as dislocations and phase boundary in the mixed- and cubic-phase grain. This study addresses the effect of Al-doping on the nanoscale structure within individual grains of LLZO, which is informative for the future development of solid-state batteries.

preprint2020arXiv

A simple stress-dilatancy equation for sand

The stress-dilatancy relation is of critical importance for constitutive modelling of sand. A new fractional-order stress-dilatancy equation is analytically developed in this study, based on stress-fractional operators. An apparent linear response of the stress-dilatancy behaviour of soil after sufficient shearing is obtained. As the fractional order varies, the derived stress-dilatancy curve and the associated phase transformation state stress ratio shift. But, unlike existing researches, no other specific parameters, except the fractional order, concerning such shift and the state-dependence are required. The developed stress-dilatancy equation is then incorporated into an existing constitutive model for validation. Test results of different sands are simulated and compared, where a good model performance is observed.

preprint2020arXiv

Effects of free-ranging livestock on sympatric herbivores at fine spatiotemporal scales

Understanding wildlife-livestock interactions is crucial for the design and management of protected areas that aim to conserve large mammal communities undergoing conflicts with humans worldwide. An example of the need to quantify the strength and direction of species interactions is the conservation of big cats in newly established protected areas in China. Currently, free-ranging livestock degrade the food and habitat of the endangered Amur tiger and Amur leopard in the forest landscapes of Northeast China, but quantitative assessments of how livestock affect the use of habitat by the major ungulate prey of these predators are very limited. Here, we examined livestock-ungulate interactions using large-scale camera-trap data in the newly established Tiger and Leopard National Park in Northeast China, which borders Russia. We used N-mixture models, two-species occupancy models and activity pattern overlap to understand the effects of cattle grazing on three ungulate species (wild boar, roe deer and sika deer) at a fine spatiotemporal scale. Our results showed that incorporating the biotic interactions with cattle had significant negative effects on encounters with three ungulates; sika deer were particularly displaced as more cattle encroached on forest habitat, as they exhibited low levels of co-occurrence with cattle in terms of habitat use. These results, combined with spatiotemporal overlap, suggested fine-scale avoidance behaviours, and they can help to refine strategies for the conservation of tigers, leopards and their prey in human-dominated transboundary landscapes. Progressively controlling cattle and the impact of cattle on biodiversity while simultaneously addressing the economic needs of local communities should be key priority actions for the Chinese government.

preprint2020arXiv

Recurrent Events Analysis With Data Collected at Informative Clinical Visits in Electronic Health Records

Although increasingly used as a data resource for assembling cohorts, electronic health records (EHRs) pose many analytic challenges. In particular, a patient's health status influences when and what data are recorded, generating sampling bias in the collected data. In this paper, we consider recurrent event analysis using EHR data. Conventional regression methods for event risk analysis usually require the values of covariates to be observed throughout the follow-up period. In EHR databases, time-dependent covariates are intermittently measured during clinical visits, and the timing of these visits is informative in the sense that it depends on the disease course. Simple methods, such as the last-observation-carried-forward approach, can lead to biased estimation. On the other hand, complex joint models require additional assumptions on the covariate process and cannot be easily extended to handle multiple longitudinal predictors. By incorporating sampling weights derived from estimating the observation time process, we develop a novel estimation procedure based on inverse-rate-weighting and kernel-smoothing for the semiparametric proportional rate model of recurrent events. The proposed methods do not require model specifications for the covariate processes and can easily handle multiple time-dependent covariates. Our methods are applied to a kidney transplant study for illustration.

preprint2020arXiv

ROC-Guided Survival Trees and Ensembles

Tree-based methods are popular nonparametric tools in studying time-to-event outcomes. In this article, we introduce a novel framework for survival trees and ensembles, where the trees partition the dynamic survivor population and can handle time-dependent covariates. Using the idea of randomized tests, we develop generalized time-dependent Receiver Operating Characteristic (ROC) curves for evaluating the performance of survival trees. The tree-building algorithm is guided by decision-theoretic criteria based on ROC, targeting specifically for prediction accuracy. To address the instability issue of a single tree, we propose a novel ensemble procedure based on averaging martingale estimating equations, which is different from existing methods that average the predicted survival or cumulative hazard functions from individual trees. Extensive simulation studies are conducted to examine the performance of the proposed methods. We apply the methods to a study on AIDS for illustration.

preprint2016arXiv

Estimating matching affinity matrix under low-rank constraints

In this paper, we address the problem of estimating transport surplus (a.k.a. matching affinity) in high dimensional optimal transport problems. Classical optimal transport theory specifies the matching affinity and determines the optimal joint distribution. In contrast, we study the inverse problem of estimating matching affinity based on the observation of the joint distribution, using an entropic regularization of the problem. To accommodate high dimensionality of the data, we propose a novel method that incorporates a nuclear norm regularization which effectively enforces a rank constraint on the affinity matrix. The low-rank matrix estimated in this way reveals the main factors which are relevant for matching.

preprint2015arXiv

Dynamic scaling for the growth of non-equilibrium fluctuations during thermophoretic diffusion in microgravity

Diffusion processes are widespread in biological and chemical systems, where they play a fundamental role in the exchange of substances at the cellular level and in determining the rate of chemical reactions. Recently, the classical picture that portrays diffusion as random uncorrelated motion of molecules has been revised, when it was shown that giant non-equilibrium fluctuations develop during diffusion processes. Under microgravity conditions and at steady-state, non-equilibrium fluctuations exhibit scale invariance and their size is only limited by the boundaries of the system. In this work, we investigate the onset of non-equilibrium concentration fluctuations induced by thermophoretic diffusion in microgravity, a regime not accessible to analytical calculations but of great relevance for the understanding of several natural and technological processes. A combination of state of the art simulations and experiments allows us to attain a fully quantitative description of the development of fluctuations during transient diffusion in microgravity. Both experiments and simulations show that during the onset the fluctuations exhibit scale invariance at large wave vectors. In a broader range of wave vectors simulations predict a spinodal-like growth of fluctuations, where the amplitude and length-scale of the dominant mode are determined by the thickness of the diffuse layer.

preprint2015arXiv

Slowing-down of non-equilibrium concentration fluctuations in confinement

Fluctuations in a fluid are strongly affected by the presence of a macroscopic gradient making them long-ranged and enhancing their amplitude. While small-scale fluctuations exhibit diffusive lifetimes, larger-scale fluctuations live shorter because of gravity, as theoretically and experimentally well-known. We explore here fluctuations of even larger size, comparable to the extent of the system in the direction of the gradient, and find experimental evidence of a dramatic slowing-down in their dynamics. We recover diffusive behaviour for these strongly-confined fluctuations, but with a diffusion coefficient that depends on the solutal Rayleigh number. Results from dynamic shadowgraph experiments are complemented by theoretical calculations and numerical simulations based on fluctuating hydrodynamics, and excellent agreement is found. The study of the dynamics of non-equilibrium fluctuations allows to probe and measure the competition of physical processes such as diffusion, buoyancy and confinement.

Yifei Sun

What is connected

Connect this record

See the researcher in context

Building this map preview

19 published item(s)

A self-evolving multi-role collaborative framework with fine-grained difficulty guidance for innovative mathematical problem generation

ReBrain: Brain MRI Reconstruction from Sparse CT Slice via Retrieval-Augmented Diffusion

Self-Evolving Spatial Reasoning in Vision Language Models via Geometric Logic Consistency

Table as a Modality for Large Language Models

An Indoor Environment Sensing and Localization System via mmWave Phased Array

Dynamic Risk Prediction Triggered by Intermediate Events Using Survival Tree Ensembles

Energy-efficient transmission policies for the linear quadratic control of scalar systems

Maskin Meets Abreu and Matsushima

Passive Motion Detection via mmWave Communication System

Wavelength-by-wavelength temperature-independent thermal radiation utilizing an insulator-metal transition

Who is next: rising star prediction via diffusion of user interest in social networks

X-ray nanoimaging of crystal defects in single grains of solid-state electrolyte AlxLi7-3xLa3Zr2O12

A simple stress-dilatancy equation for sand

Effects of free-ranging livestock on sympatric herbivores at fine spatiotemporal scales

Recurrent Events Analysis With Data Collected at Informative Clinical Visits in Electronic Health Records

ROC-Guided Survival Trees and Ensembles

Estimating matching affinity matrix under low-rank constraints

Dynamic scaling for the growth of non-equilibrium fluctuations during thermophoretic diffusion in microgravity

Slowing-down of non-equilibrium concentration fluctuations in confinement