Researcher profile

Zhennan Zhou

Zhennan Zhou contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
15works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

15 published item(s)

preprint2026arXiv

STOP: Structured On-Policy Pruning of Long-Form Reasoning in Low-Data Regimes

Long chain-of-thought (Long CoT) reasoning improves performance on multi-step problems, but it also induces overthinking: models often generate low-yield reasoning that increases inference cost and latency. This inefficiency is especially problematic in low-data fine-tuning regimes, where real applications adapt reasoning models with limited supervision and cannot rely on large-scale teacher distillation or heavy test-time control. To address this, we propose STOP (Structured On-policy Pruning), an on-policy algorithm for analyzing and pruning long-form reasoning traces. STOP constructs self-distilled traces from the model. Then it maps each trace into a structured reasoning interface through node segmentation, taxonomy annotation, and reasoning-tree construction. On top of this interface, we introduce ECN (Earliest Correct Node), which retains the shortest prefix ending at the earliest node that both functions as an answering conclusion and yields the correct final answer, removing redundant post-solution reasoning while preserving semantic continuity. Experiments on DeepSeek-R1-Distill-Qwen-7B and DeepSeek-R1-Distill-LLaMA-3-8B across GSM8K, Math 500, and AIME 2024 show that STOP reduces generated tokens by 19.4-42.4% while largely preserving accuracy in low-data fine-tuning. Beyond efficiency, our analyses show that STOP induces much smaller distributional shift than teacher-guided pruning, improves the structural efficiency of generated reasoning, and reallocates reasoning effort away from redundant verification and backtracking toward more productive exploration.

preprint2022arXiv

A simplified voltage-conductance kinetic model for interacting neurons and its asymptotic limit

The voltage-conductance kinetic model for the collective behavior of neurons has been studied by scientists and mathematicians for two decades, but the rigorous analysis of its solution structure has been only partially obtained in spite of plenty of numerical evidence in various scenarios. In this work, we consider a simplified voltage-conductance model in which the velocity field in the voltage variable is in a separable form. The long time behavior of the simplified model is fully investigated leading to the following dichotomy: either the density function converges to the global equilibrium, or the firing rate diverges as time goes to infinity. Besides, the fast conductance asymptotic limit is justified and analyzed, where the solution to the limit model either blows up in finite time, or globally exists leading to time periodic solutions. An important implication of these results is that the non-separable velocity field, or physically the leaky mechanism, is a key element for the emergence of periodic solutions in the original model based on the available numerical evidence.

preprint2022arXiv

A structure preserving numerical scheme for Fokker-Planck equations of structured neural networks with learning rules

In this work, we are concerned with a Fokker-Planck equation related to the nonlinear noisy leaky integrate-and-fire model for biological neural networks which are structured by the synaptic weights and equipped with the Hebbian learning rule. The equation contains a small parameter $\varepsilon$ separating the time scales of learning and reacting behavior of the neural system, and an asymptotic limit model can be derived by letting $\varepsilon\to 0$, where the microscopic quasi-static states and the macroscopic evolution equation are coupled through the total firing rate. To handle the endowed flux-shift structure and the multi-scale dynamics in a unified framework, we propose a numerical scheme for this equation that is mass conservative, unconditionally positivity preserving, and asymptotic preserving. We provide extensive numerical tests to verify the schemes' properties and carry out a set of numerical experiments to investigate the model's learning ability, and explore the solution's behavior when the neural network is excitatory.

preprint2022arXiv

Bounds and long term convergence for the voltage-conductance kinetic system arising in neuroscience

The voltage-conductance equation determines the probability distribution of a stochastic process describing a fluctuation-driven neuronal network arising in the visual cortex. Its structure and degeneracy share many common features with the kinetic Fokker-Planck equation, which has attracted much attention recently. We prove an L $\infty$ bound on the steady state solution and the long term convergence of the evolution problem towards this stationary state. Despite the hypoellipticity property, the difficulty is to treat the boundary conditions that change type along the boundary. This leads us to use specific weights in the Alikakos iterations and adapt the relative entropy method.

preprint2022arXiv

Dilating blow-up time: A generalized solution of the NNLIF neuron model and its global well-posedness

The nonlinear noisy leaky integrate-and-fire (NNLIF) model is a popular mean-field description of a large number of interacting neurons, which has attracted mathematicians to study from various aspects. A core property of this model is the finite time blow-up of the firing rate, which scientifically corresponds to the synchronization of a neuron network, and mathematically prevents the existence of a global classical solution. In this work, we propose a new generalized solution based on reformulating the PDE model with a specific change of variable in time. A firing rate dependent timescale is introduced, in which the transformed equation can be shown to be globally well-posed for any connectivity parameter even in the event of the blow-up. The generalized solution is then defined via the backward change of timescale, and it may have a jump when the firing rate blows up. We establish properties of the generalized solution including the characterization of blow-ups and the global well-posedness in the original timescale. The generalized solution provides a new perspective to understand the dynamics when the firing rate blows up as well as the continuation of the solution after a blow-up.

preprint2022arXiv

Ergodicity and long-time behavior of the Random Batch Method for interacting particle systems

We study the geometric ergodicity and the long time behavior of the Random Batch Method for interacting particle systems, which exhibits superior numerical performance in recent large-scale scientific computing experiments. We show that for both the interacting particle system (IPS) and the random batch interacting particle system (RB-IPS), the distribution laws converge to their respective invariant distributions exponentially, and the convergence rate does not depend on the number of particles $N$, the time step $τ$ for batch divisions or the batch size $p$. Moreover, the Wasserstein distance between the invariant distributions of the IPS and the RB-IPS is bounded by $O(\sqrtτ)$, showing that the RB-IPS can be used to sample the invariant distribution of the IPS accurately with greatly reduced computational cost.

preprint2022arXiv

Error Analysis of Time-Discrete Random Batch Method for Interacting Particle Systems and Associated Mean-Field Limits

The random batch method provides an efficient algorithm for computing statistical properties of a canonical ensemble of interacting particles. In this work, we study the error estimates of the fully discrete random batch method, especially in terms of approximating the invariant distribution. Using a triangle inequality framework, we show that the long-time error of the method is $O(\sqrtτ + e^{-λt})$, where $τ$ is the time step and $λ$ is the convergence rate which does not depend on the time step $τ$ or the number of particles $N$. Our results also apply to the McKean-Vlasov process, which is the mean-field limit of the interacting particle system as the number of particles $N\rightarrow\infty$.

preprint2022arXiv

Exponential convergence to equilibrium for a two-speed model with variant drift fields via the resolvent estimate

We study a two-speed model with variant drift fields, which generalizes the Goldstein-Taylor model but the two advection fields are not necessarily in proportion. Due to the lack of a local equilibrium structure of the steady state, prevailing hypocoercivity methods could not be directly applied to such a system. To prove the exponential convergence to equilibrium for this model, we use a recent generalization of the Gearhart-Prüss theorem [Sci. China Math. 64 (2021), no. 3, 507-518] to gain semigroup bounds, where the relative entropy estimate plays a central role in gaining desired resolvent estimates via a compactness argument.

preprint2021arXiv

A Mean Field Game Analysis of Consensus Protocol Design

A decentralized blockchain is a distributed ledger that is often used as a platform for exchanging goods and services. This ledger is maintained by a network of nodes that obeys a set of rules, called a consensus protocol, which helps to resolve inconsistencies among local copies of a blockchain. In this paper, we build a mathematical framework for the consensus protocol designer, specifying (a) the measurement of a resource which nodes strategically invest in and compete for to win the right to build new blocks in the blockchain; and (b) a payoff function for such efforts. Thus, the equilibrium of an associated stochastic differential game can be implemented by selecting nodes in proportion to this specified resource and penalizing dishonest nodes by its loss. This associated, induced game can be further analyzed using mean field games. The problem can be broken down into two coupled PDEs, where an individual node's optimal control path is solved using a Hamilton-Jacobi-Bellman equation, and where the evolution of states distribution is characterized by a Fokker-Planck equation. We develop numerical methods to compute the mean field equilibrium for both steady states at the infinite time horizon and evolutionary dynamics. As an example, we show how the mean field equilibrium can be applied to the Bitcoin blockchain mechanism design. We demonstrate that a blockchain can be viewed as a mechanism that operates in a decentralized setup and propagates properties of the mean field equilibrium over time, such as the underlying security of the blockchain.

preprint2021arXiv

Investigating the integrate and fire model as the limit of a random discharge model: a stochastic analysis perspective

In the mean field integrate-and-fire model, the dynamics of a typical neuron within a large network is modeled as a diffusion-jump stochastic process whose jump takes place once the voltage reaches a threshold. In this work, the main goal is to establish the convergence relationship between the regularized process and the original one where in the regularized process, the jump mechanism is replaced by a Poisson dynamic, and jump intensity within the classically forbidden domain goes to infinity as the regularization parameter vanishes. On the macroscopic level, the Fokker-Planck equation for the process with random discharges (i.e. Poisson jumps) are defined on the whole space, while the equation for the limit process is on the half space. However, with the iteration scheme, the difficulty due to the domain differences has been greatly mitigated and the convergence for the stochastic process and the firing rates can be established. Moreover, we find a polynomial-order convergence for the distribution by a re-normalization argument in probability theory. Finally, by numerical experiments, we quantitatively explore the rate and the asymptotic behavior of the convergence for both linear and nonlinear models.

preprint2020arXiv

A unified structure preserving scheme for a multi-species model with a gradient flow structure and nonlocal interactions via singular kernels

In this paper, we consider a nonlinear and nonlocal parabolic model for multi-species ionic fluids and introduce a semi-implicit finite volume scheme, which is second order accurate in space, first order in time and satisfies the following properties: positivity preserving, mass conservation and energy dissipation. Besides, our scheme involves a fast algorithm on the convolution terms with singular but integrable kernels, which otherwise impedes the accuracy and efficiency of the whole scheme. Error estimates on the fast convolution algorithm are shown next. Numerous numerical tests are provided to demonstrate the properties, such as unconditional stability, order of convergence, energy dissipation and the complexity of the fast convolution algorithm. Furthermore, extensive numerical experiments are carried out to explore the modeling effects in specific examples, such as, the steric repulsion, the concentration of ions at the boundary and the blowup phenomenon of the Keller-Segel equations.

preprint2020arXiv

Bloch dynamics with second order Berry phase correction

We derive the semiclassical Bloch dynamics with the second-order Berry phase correction in the presence of the slow-varying scalar potential as perturbation. Our mathematical derivation is based on a two-scale WKB asymptotic analysis. For a uniform external electric field, the bi-characteristics system after a positional shift introduced by Berry connections agrees with the recent result in previous works. Moreover, for the case with a linear external electric field, we show that the extra terms arising in the bi-characteristics system after the positional shift are also gauge independent.

preprint2020arXiv

Continuum limit and preconditioned Langevin sampling of the path integral molecular dynamics

We investigate the continuum limit that the number of beads goes to infinity in the ring polymer representation of thermal averages. Studying the continuum limit of the trajectory sampling equation sheds light on possible preconditioning techniques for sampling ring polymer configurations with large number of beads. We propose two preconditioned Langevin sampling dynamics, which are shown to have improved stability and sampling accuracy. We present a careful mode analysis of the preconditioned dynamics and show their connections to the normal mode, the staging coordinate and the Matsubara mode representation for ring polymers. In the case where the potential is quadratic, we show that the continuum limit of the preconditioned mass modified Langevin dynamics converges to its equilibrium exponentially fast, which suggests that the finite-dimensional counterpart has a dimension-independent convergence rate. In addition, the preconditioning techniques can be naturally applied to the multi-level quantum systems in the nonadiabatic regime, which are compatible with various numerical approaches.

preprint2020arXiv

The Bayesian Inversion Problem for Thermal Average Sampling of Quantum Systems

In this article, we propose a novel method for sampling potential functions based on noisy observation data of a finite number of observables in quantum canonical ensembles, which leads to the accurate sampling of a wide class of test observables. The method is based on the Bayesian inversion framework, which provides a platform for analyzing the posterior distribution and naturally leads to an efficient numerical sampling algorithm. We highlight that, the stability estimate is obtained by treating the potential functions as intermediate variables in the following way: the discrepancy between two sets of observation data of training observables can bound the distance between corresponding posterior distributions of potential functions, while the latter naturally leads to a bound of the discrepancies between corresponding thermal averages of test observables. Besides, the training observables can be more flexible than finite samples of the local density function, which are mostly used in previous researches. The method also applies to the multi-level quantum systems in the non-adiabatic regime. In addition, we provide extensive numerical tests to verify the accuracy and efficiency of the proposed algorithm.

preprint2019arXiv

Second-order semi-implicit projection methods for micromagnetics simulations

Micromagnetics simulations require accurate approximation of the magnetization dynamics described by the Landau-Lifshitz-Gilbert equation, which is nonlinear, nonlocal, and has a non-convex constraint, posing interesting challenges in developing numerical methods. In this paper, we propose two second-order semi-implicit projection methods based on the second-order backward differentiation formula and the second-order interpolation formula using the information at previous two temporal steps. Unconditional unique solvability of both methods is proved, with their second-order accuracy verified through numerical examples in both 1D and 3D. The efficiency of both methods is compared to that of another two popular methods. In addition, we test the robustness of both methods for the first benchmark problem with a ferromagnetic thin film material from National Institute of Standards and Technology.