Source author record

Xin Lin

Xin Lin appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.mtrl-sci Computer Vision Computation and Language physics.soc-ph Artificial Intelligence cond-mat.soft Neural and Evolutionary Computing Populations and Evolution cond-mat.mes-hall cs.CY eess.IV Machine Learning math.DS math.NT math.OC Multimedia nlin.CG physics.plasm-ph Social and Information Networks

Catalog footprint

What is connected

21works

19topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

SOD: Step-wise On-policy Distillation for Small Language Model Agents

Tool-integrated reasoning (TIR) is difficult to scale to small language models due to instability in long-horizon tool interactions and limited model capacity. While reinforcement learning methods like group relative policy optimization provide only sparse outcome-level rewards. Recently, on-policy distillation (OPD) has gained popularity by supplying dense token-level supervision from a teacher on student-generated trajectories. However, our experiments indicate that applying OPD to TIR leads to a critical failure mode: erroneous tool calls tend to cascade across subsequent reasoning steps, progressively amplifying student-teacher divergence and rendering the teacher's token-level supervision increasingly unreliable. To address this, we propose SOD, a step-wise on-policy distillation framework for small language model agents, which adaptively reweights distillation strength at each step based on step-level divergence. Therefore, SOD can attenuate potentially misleading teacher signals in high-divergence regions while preserving dense guidance in well-aligned states. Experiments on challenging math, science, and code benchmarks show that SOD achieves up to 20.86% improvement over the second-best baseline. Notably, our 0.6B student achieves 26.13% on AIME 2025, demonstrating effective transfer of agentic reasoning to lightweight models. Our code is available at https://github.com/YoungZ365/SOD.

preprint2023arXiv

On inverted Kloosterman sums over finite fields

The classical $n$-variable Kloosterman sums over finite fields are well understood by Deligne's theorem from complex point of view and by Sperber's theorem from $p$-adic point of view. In this paper, we study the complex and $p$-adic estimates of inverted $n$-variable Kloosterman sums, addressing a question of N. Katz (1995). We shall give two complex estimates. The first one is elementary based on Gauss sums. The second estimate is deeper, depending on the cohomological results of Adolphson-Sperber, Denef-Loeser and Fu for twisted toric exponential sums. This deeper result assumes that the characteristic $p$ does not divide $n+1$. Combining with Dwork's $p$-adic theory, we also determine the exact $p$-adic valuations for zeros and poles of the L-function associated to inverted $n$-variable Kloosterman sums in the case $p \equiv 1 \mod (n+1)$. As we shall see, the inverted $n$-variable Kloosterman sum is more complicated than the classical $n$-variable Kloosterman sum in all aspects in the sense that our understanding is less complete, partly because the Hodge numbers are now mostly $2$ instead of $1$.

preprint2023arXiv

Towards a Holistic Understanding of Mathematical Questions with Contrastive Pre-training

Understanding mathematical questions effectively is a crucial task, which can benefit many applications, such as difficulty estimation. Researchers have drawn much attention to designing pre-training models for question representations due to the scarcity of human annotations (e.g., labeling difficulty). However, unlike general free-format texts (e.g., user comments), mathematical questions are generally designed with explicit purposes and mathematical logic, and usually consist of more complex content, such as formulas, and related mathematical knowledge (e.g., Function). Therefore, the problem of holistically representing mathematical questions remains underexplored. To this end, in this paper, we propose a novel contrastive pre-training approach for mathematical question representations, namely QuesCo, which attempts to bring questions with more similar purposes closer. Specifically, we first design two-level question augmentations, including content-level and structure-level, which generate literally diverse question pairs with similar purposes. Then, to fully exploit hierarchical information of knowledge concepts, we propose a knowledge hierarchy-aware rank strategy (KHAR), which ranks the similarities between questions in a fine-grained manner. Next, we adopt a ranking contrastive learning task to optimize our model based on the augmented and ranked questions. We conduct extensive experiments on two real-world mathematical datasets. The experimental results demonstrate the effectiveness of our model.

preprint2022arXiv

Benchmark Functions for CEC 2022 Competition on Seeking Multiple Optima in Dynamic Environments

Dynamic and multimodal features are two important properties and widely existed in many real-world optimization problems. The former illustrates that the objectives and/or constraints of the problems change over time, while the latter means there is more than one optimal solution (sometimes including the accepted local solutions) in each environment. The dynamic multimodal optimization problems (DMMOPs) have both of these characteristics, which have been studied in the field of evolutionary computation and swarm intelligence for years, and attract more and more attention. Solving such problems requires optimization algorithms to simultaneously track multiple optima in the changing environments. So that the decision makers can pick out one optimal solution in each environment according to their experiences and preferences, or quickly turn to other solutions when the current one cannot work well. This is very helpful for the decision makers, especially when facing changing environments. In this competition, a test suit about DMMOPs is given, which models the real-world applications. Specifically, this test suit adopts 8 multimodal functions and 8 change modes to construct 24 typical dynamic multimodal optimization problems. Meanwhile, the metric is also given to measure the algorithm performance, which considers the average number of optimal solutions found in all environments. This competition will be very helpful to promote the development of dynamic multimodal optimization algorithms.

preprint2022arXiv

Evolutionary Multiparty Distance Minimization

In the field of evolutionary multiobjective optimization, the decision maker (DM) concerns conflicting objectives. In the real-world applications, there usually exist more than one DM and each DM concerns parts of these objectives. Multiparty multiobjective optimization problems (MPMOPs) are proposed to depict the MOP with multiple decision makers involved, where each party concerns about certain some objectives of all. However, in the evolutionary computation field, there is not much attention paid on MPMOPs. This paper constructs a series of MPMOPs based on distance minimization problems (DMPs), whose Pareto optimal solutions can be vividly visualized. To address MPMOPs, the new proposed algorithm OptMPNDS3 uses the multiparty initializing method to initialize the population and takes JADE2 operator to generate the offsprings. OptMPNDS3 is compared with OptAll, OptMPNDS and OptMPNDS2 on the problem suite. The result shows that OptMPNDS3 is strongly comparable to other algorithms

preprint2022arXiv

Harmonizing Pathological and Normal Pixels for Pseudo-healthy Synthesis

Synthesizing a subject-specific pathology-free image from a pathological image is valuable for algorithm development and clinical practice. In recent years, several approaches based on the Generative Adversarial Network (GAN) have achieved promising results in pseudo-healthy synthesis. However, the discriminator (i.e., a classifier) in the GAN cannot accurately identify lesions and further hampers from generating admirable pseudo-healthy images. To address this problem, we present a new type of discriminator, the segmentor, to accurately locate the lesions and improve the visual quality of pseudo-healthy images. Then, we apply the generated images into medical image enhancement and utilize the enhanced results to cope with the low contrast problem existing in medical image segmentation. Furthermore, a reliable metric is proposed by utilizing two attributes of label noise to measure the health of synthetic images. Comprehensive experiments on the T2 modality of BraTS demonstrate that the proposed method substantially outperforms the state-of-the-art methods. The method achieves better performance than the existing methods with only 30\% of the training data. The effectiveness of the proposed method is also demonstrated on the LiTS and the T1 modality of BraTS. The code and the pre-trained model of this study are publicly available at https://github.com/Au3C2/Generator-Versus-Segmentor.

preprint2022arXiv

HL-Net: Heterophily Learning Network for Scene Graph Generation

Scene graph generation (SGG) aims to detect objects and predict their pairwise relationships within an image. Current SGG methods typically utilize graph neural networks (GNNs) to acquire context information between objects/relationships. Despite their effectiveness, however, current SGG methods only assume scene graph homophily while ignoring heterophily. Accordingly, in this paper, we propose a novel Heterophily Learning Network (HL-Net) to comprehensively explore the homophily and heterophily between objects/relationships in scene graphs. More specifically, HL-Net comprises the following 1) an adaptive reweighting transformer module, which adaptively integrates the information from different layers to exploit both the heterophily and homophily in objects; 2) a relationship feature propagation module that efficiently explores the connections between relationships by considering heterophily in order to refine the relationship representation; 3) a heterophily-aware message-passing scheme to further distinguish the heterophily and homophily between objects/relationships, thereby facilitating improved message passing in graphs. We conducted extensive experiments on two public datasets: Visual Genome (VG) and Open Images (OI). The experimental results demonstrate the superiority of our proposed HL-Net over existing state-of-the-art approaches. In more detail, HL-Net outperforms the second-best competitors by 2.1$\%$ on the VG dataset for scene graph classification and 1.2$\%$ on the IO dataset for the final score. Code is available at https://github.com/siml3/HL-Net.

preprint2022arXiv

Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding

Visual grounding focuses on establishing fine-grained alignment between vision and natural language, which has essential applications in multimodal reasoning systems. Existing methods use pre-trained query-agnostic visual backbones to extract visual feature maps independently without considering the query information. We argue that the visual features extracted from the visual backbones and the features really needed for multimodal reasoning are inconsistent. One reason is that there are differences between pre-training tasks and visual grounding. Moreover, since the backbones are query-agnostic, it is difficult to completely avoid the inconsistency issue by training the visual backbone end-to-end in the visual grounding framework. In this paper, we propose a Query-modulated Refinement Network (QRNet) to address the inconsistent issue by adjusting intermediate features in the visual backbone with a novel Query-aware Dynamic Attention (QD-ATT) mechanism and query-aware multiscale fusion. The QD-ATT can dynamically compute query-dependent visual attention at the spatial and channel levels of the feature maps produced by the visual backbone. We apply the QRNet to an end-to-end visual grounding framework. Extensive experiments show that the proposed method outperforms state-of-the-art methods on five widely used datasets.

preprint2021arXiv

A knowledge transfer model for COVID-19 predicting and non-pharmaceutical intervention simulation

Since December 2019, A novel coronavirus (2019-nCoV) has been breaking out in China, which can cause respiratory diseases and severe pneumonia. Mathematical and empirical models relying on the epidemic situation scale for forecasting disease outbreaks have received increasing attention. Given its successful application in the evaluation of infectious diseases scale, we propose a Susceptible-Undiagnosed-Infected-Removed (SUIR) model to offer the effective prediction, prevention, and control of infectious diseases. Our model is a modified susceptible-infected-recovered (SIR) model that injects undiagnosed state and offers pre-training effective reproduction number. Our SUIR model is more precise than the traditional SIR model. Moreover, we combine domain knowledge of the epidemic to estimate effective reproduction number, which addresses the initial susceptible population of the infectious disease model approach to the ground truth. These findings have implications for the forecasting of epidemic trends in COVID-19 as these could help the growth of estimating epidemic situation.

preprint2020arXiv

ECNU-SenseMaker at SemEval-2020 Task 4: Leveraging Heterogeneous Knowledge Resources for Commonsense Validation and Explanation

This paper describes our system for SemEval-2020 Task 4: Commonsense Validation and Explanation (Wang et al., 2020). We propose a novel Knowledge-enhanced Graph Attention Network (KEGAT) architecture for this task, leveraging heterogeneous knowledge from both the structured knowledge base (i.e. ConceptNet) and unstructured text to better improve the ability of a machine in commonsense understanding. This model has a powerful commonsense inference capability via utilizing suitable commonsense incorporation methods and upgraded data augmentation techniques. Besides, an internal sharing mechanism is cooperated to prohibit our model from insufficient and excessive reasoning for commonsense. As a result, this model performs quite well in both validation and explanation. For instance, it achieves state-of-the-art accuracy in the subtask called Commonsense Explanation (Multi-Choice). We officially name the system as ECNU-SenseMaker. Code is publicly available at https://github.com/ECNU-ICA/ECNU-SenseMaker.

preprint2020arXiv

GPS-Net: Graph Property Sensing Network for Scene Graph Generation

Scene graph generation (SGG) aims to detect objects in an image along with their pairwise relationships. There are three key properties of scene graph that have been underexplored in recent works: namely, the edge direction information, the difference in priority between nodes, and the long-tailed distribution of relationships. Accordingly, in this paper, we propose a Graph Property Sensing Network (GPS-Net) that fully explores these three properties for SGG. First, we propose a novel message passing module that augments the node feature with node-specific contextual information and encodes the edge direction information via a tri-linear model. Second, we introduce a node priority sensitive loss to reflect the difference in priority between nodes during training. This is achieved by designing a mapping function that adjusts the focusing parameter in the focal loss. Third, since the frequency of relationships is affected by the long-tailed distribution problem, we mitigate this issue by first softening the distribution and then enabling it to be adjusted for each subject-object pair according to their visual appearance. Systematic experiments demonstrate the effectiveness of the proposed techniques. Moreover, GPS-Net achieves state-of-the-art performance on three popular databases: VG, OI, and VRD by significant gains under various settings and metrics. The code and models are available at \url{https://github.com/taksau/GPS-Net}.

preprint2020arXiv

The short-term seasonal analyses between atmospheric environment and COVID-19 in epidemic areas of Cities in Australia, South Korea, and Italy

The impact of the outbreak of COVID-19 on health has been widely concerned. Disease risk assessment, prediction, and early warning have become a significant research field. Previous research suggests that there is a relationship between air quality and the disease. This paper investigated the impact of the atmospheric environment on the basic reproduction number (R$_0$) in Australia, South Korea, and Italy by using atmospheric environment data, confirmed case data, and the distributed lag non-linear model (DLNM) model based on Quasi-Poisson regression. The results show that the air temperature and humidity have lag and persistence on short-term R$_0$, and seasonal factors have an apparent decorating effect on R$_0$. PM$_{10}$ is the primary pollutant that affects the excess morbidity rate. Moreover, O$_3$, PM$_{2.5}$, and SO$_2$ as perturbation factors have an apparent cumulative effect. These results present beneficial knowledge for correlation between environment and COVID-19, which guiding prospective analyses of disease data.

preprint2019arXiv

Stabilizing effect of enhanced resistivity on peeling-ballooning instabilities on EAST

Previous stability analysis of NSTX equilibrium with lithium-conditioning demonstrates that the enhanced resistivity due to the increased effective charge number Zeff (i.e. increased impurity level) can provide a stabilizing effect on low-n edge localized modes (Banerjee et al 2017 Nucl. Fusion 24 054501). This paper extends the resistivity stabilizing effect to the intermediate-n peeling-ballooning (PB) instabilities with the linear stability analysis of EAST high-confinement mode equilibria in NIMROD two-fluid calculations. However, the resistivity stabilizing effect on PB instabilities in the EAST tokamak appears weaker than that found in NSTX. This work may give better insight into the physical mechanism behind the beneficial effects of impurity on the pedestal stability.

preprint2016arXiv

Can secondary nucleation exist in ice banding of freezing colloidal suspensions?

The formation mechanism of ice banding in the system of freezing colloidal suspensions, which is of significance in frost heaving, ice-templating porous materials and biological materials, still remains a mystery. Recently, the theory of secondary nucleation and growth of ice has been proposed to explain the emergence of a new ice lens. However, this theory has not been quantitatively examined. Here, we quantitatively measured the initial interfacial undercooling of a new ice lens and the nucleation undercoolings of suspensions. We found that the interfacial undercooling can not satisfy the nucleation undercooling of ice and hence disprove the secondary nucleation mechanism for ice banding.

preprint2016arXiv

Interface instability modes in freezing colloidal suspensions - revealed from onset of planar instability

Freezing colloidal suspensions widely exists in nature and industry. Interface instability has attracted much attention for the understandings of the pattern formation in freezing colloidal suspensions. However, the interface instability modes, the origin of the ice banding or ice lamellae, are still unclear. In-situ experimental observation of the onset of interface instability is still absent up to now. Here, by directly imaging the initial transient stage of planar interface instability in directional freezing colloidal suspensions, we proposed three interface instability modes, Mullins-Sekerka instability, global split instability and local split instability. All the three instability modes come from the competition of the solute boundary layer and the particle boundary layer, which only can be revealed from the initial transient stage of planar instability in directional freezing.

preprint2016arXiv

Interfacial undercooling in the solidification of colloidal suspensions-analyses with quantitative measurements

Interfacial undercooling is of significant importance on microscopic pattern formation in the solidification of colloidal suspensions. Two kinds of interfacial undercooling are supposed to be involved in freezing colloidal suspensions, i.e. solute constitutional supercooling (SCS) caused by additives in the solvent and particulate constitutional supercooling (PCS) caused by particles. However, quantitatively identification of the interfacial undercooling of freezing colloidal suspensions is still absent and it is still unknown which undercooling is dominant. The revealing of interfacial undercooling is closely related to the design of ice-templating porous materials. Based on quantitative experimental measurements, we show that the interfacial undercooling mainly comes from SCS caused by the additives in the solvent, while the PCS can be ignored. This finding implies that the PCS theory is not the fundamental physical mechanism for patterning in the solidification of colloidal suspensions. Instead, the patterns in ice-templating method can be controlled effectively by adjusting the additives.

preprint2015arXiv

Failure dynamics of the global risk network

Risks threatening modern societies form an intricately interconnected network that often underlies crisis situations. Yet, little is known about how risk materializations in distinct domains influence each other. Here we present an approach in which expert assessments of risks likelihoods and influence underlie a quantitative model of the global risk network dynamics. The modeled risks range from environmental to economic and technological and include difficult to quantify risks, such as geo-political or social. Using the maximum likelihood estimation, we find the optimal model parameters and demonstrate that the model including network effects significantly outperforms the others, uncovering full value of the expert collected data. We analyze the model dynamics and study its resilience and stability. Our findings include such risk properties as contagion potential, persistence, roles in cascades of failures and the identity of risks most detrimental to system stability. The model provides quantitative means for measuring the adverse effects of risk interdependence and the materialization of risks in the network.

preprint2015arXiv

In situ observation the interface undercooling of freezing colloidal suspensions with differential visualization method

Interface undercooling is one of the most significant parameters in the solidification of colloidal suspensions. However, quantitative measurement of interface undercooling of colloidal suspensions is still a challenge. Here, a new experimental facility and gauging method are designed to directly reveal the interface undercooling on both static and dynamic cases. The interface undercooling is visualized through the discrepancy of solid/liquid interface positions between the suspensions and its solvent in a thermal gradient apparatus. The resolutions of the experimental facility and gauging method are proved to be 0.01 K. The high precision of the method comes from the principle of converting temperature measurement into distance measurement in the thermal gradient platform. Moreover, both static and dynamic interface undercoolings can be quantitatively measured.

preprint2014arXiv

Low artificial anisotropy cellular automaton model and its applications to the cell-to-dendrite transition in directional solidification

A low artificial anisotropy cellular automaton (CA) model is developed for the simulation of microstructure evolution in directional solidification. The CA model's capture rule was modified by a limited neighbor solid fraction (LNSF) method. Various interface curvature calculation methods have been compared. The simulated equilibrium shapes agree with the theoretical shapes, when the interface energy anisotropy coefficient is ε=0.01, ε=0.03 and ε=0.05, respectively.The low artificial anisotropy CA model is used in the numerical simulation of the cell-to-dendrite transition (CDT) in directional solidification. The influence of physical parameters (Γ, Dl, k0, ml) on CDT has been investigated. The main finding in this paper is the discovery of the changing behavior of the Vcd when the solute partition coefficient k0 is larger than a critical value. When k0 is less than 0.125, the Vcd follows the Kurz and Fisher criterion Vc/k0; while when k0>0.125, the Vcd equals to 8Vc. The experimental data of succinonitrile-acetone (SCN-ace, k0=0.1) and SCN-camphor (k0=0.33) support the conclusion from CA simulations.

preprint2011arXiv

Gap opening of single-layer graphene under the continuum model

Gap opening at the Dirac point of the single-layer graphene with periodic scalar and vector potentials has been theoretically investigated under the continuum model. The symmetry analysis indicates that the two-fold degeneracy at the Dirac point can be lifted when the potentials break both the chiral symmetry and the time-reversal symmetry. A gap equation at the Dirac point is obtained analytically with perturbation theory. It is shown that a mass term at the Dirac point would be generated by coupling of vector and scalar potentials. This gap equation could be considered as a criterion for gap opening at the Dirac point, which is confirmed by the numerical calculation. Furthermore, the bandgap from the gap equation agrees well with the exact result, when the applied potentials are weak.

preprint2011arXiv

Orientation selection of equiaxed dendritic growth by three-dimensional cellular automaton model

A three-dimensional (3-D) adaptive mesh refinement (AMR) cellular automata (CA) model is developed to simulate the equiaxed dendritic growth of pure substance. In order to reduce the mesh induced anisotropy by CA capture rules, a limited neighbor solid fraction (LNSF) method is presented. An expansion description using two interface free energy anisotropy parameters (\epsilon1, \epsilon2) is used in present 3-D CA model. The dendrite growths with the orientation selection between <100> and <110> are discussed using the different \epsilon1 with \epsilon2=-0.02. It is found that the simulated morphologies by present CA model are as expected from the minimum stiffness criterion.

Xin Lin

What is connected

Connect this record

See the researcher in context

Building this map preview

21 published item(s)

SOD: Step-wise On-policy Distillation for Small Language Model Agents

On inverted Kloosterman sums over finite fields

Towards a Holistic Understanding of Mathematical Questions with Contrastive Pre-training

Benchmark Functions for CEC 2022 Competition on Seeking Multiple Optima in Dynamic Environments

Evolutionary Multiparty Distance Minimization

Harmonizing Pathological and Normal Pixels for Pseudo-healthy Synthesis

HL-Net: Heterophily Learning Network for Scene Graph Generation

Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding

A knowledge transfer model for COVID-19 predicting and non-pharmaceutical intervention simulation

ECNU-SenseMaker at SemEval-2020 Task 4: Leveraging Heterogeneous Knowledge Resources for Commonsense Validation and Explanation

GPS-Net: Graph Property Sensing Network for Scene Graph Generation

The short-term seasonal analyses between atmospheric environment and COVID-19 in epidemic areas of Cities in Australia, South Korea, and Italy

Stabilizing effect of enhanced resistivity on peeling-ballooning instabilities on EAST

Can secondary nucleation exist in ice banding of freezing colloidal suspensions?

Interface instability modes in freezing colloidal suspensions - revealed from onset of planar instability

Interfacial undercooling in the solidification of colloidal suspensions-analyses with quantitative measurements

Failure dynamics of the global risk network

In situ observation the interface undercooling of freezing colloidal suspensions with differential visualization method

Low artificial anisotropy cellular automaton model and its applications to the cell-to-dendrite transition in directional solidification

Gap opening of single-layer graphene under the continuum model

Orientation selection of equiaxed dendritic growth by three-dimensional cellular automaton model