Source author record

Fei Yu

Fei Yu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.AG astro-ph.CO gr-qc physics.optics Applications Computer Vision math.DS Computation and Language cond-mat.mtrl-sci hep-ph hep-th Information Theory Machine Learning math.IT math.NT math.RT

Catalog footprint

What is connected

20works

16topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

SEIF: Self-Evolving Reinforcement Learning for Instruction Following

Instruction following is a fundamental capability of large language models (LLMs), yet continuously improving this capability remains challenging. Existing methods typically rely either on costly external supervision from humans or strong teacher models, or on self-play training with static-difficulty instructions that cannot evolve as the model's capabilities improve. To address these limitations, we propose SEIF (Self-Evolving Reinforcement Learning for Instruction Following), a self-evolving framework for enhancing the instruction-following ability of LLMs. SEIF forms a closed self-evolution loop that improves the model's instruction-following ability, where instruction difficulty evolution and model capability evolution reinforce each other. SEIF consists of four roles: an Instructor that generates increasingly challenging instructions, a Filter that removes conflicting or invalid instructions to ensure data quality, a Follower that learns to follow evolved instructions, and a Judger that provides reward signals for reinforcement learning. The Instructor and Follower are alternately trained and co-evolve throughout the process. Experiments across multiple model scales and architectures show that SEIF consistently improves instruction-following performance, suggesting strong generality. Further analyses reveal the sources of improvement and identify an effective training strategy for self-evolution on open-ended tasks: sufficient early-stage training to build a solid foundation, followed by moderate late-stage training to mitigate overfitting and achieve better final performance. The code and data are publicly available at https://github.com/Rainier-rq1/SEIF.

preprint2025arXiv

Towards Comprehensive Interactive Change Understanding in Remote Sensing: A Large-scale Dataset and Dual-granularity Enhanced VLM

Remote sensing change understanding (RSCU) is essential for analyzing remote sensing images and understanding how human activities affect the environment. However, existing datasets lack deep understanding and interactions in the diverse change captioning, counting, and localization tasks. To tackle these gaps, we construct ChangeIMTI, a new large-scale interactive multi-task instruction dataset that encompasses four complementary tasks including change captioning, binary change classification, change counting, and change localization. Building upon this new dataset, we further design a novel vision-guided vision-language model (ChangeVG) with dual-granularity awareness for bi-temporal remote sensing images (i.e., two remote sensing images of the same area at different times). The introduced vision-guided module is a dual-branch architecture that synergistically combines fine-grained spatial feature extraction with high-level semantic summarization. These enriched representations further serve as the auxiliary prompts to guide large vision-language models (VLMs) (e.g., Qwen2.5-VL-7B) during instruction tuning, thereby facilitating the hierarchical cross-modal learning. We extensively conduct experiments across four tasks to demonstrate the superiority of our approach. Remarkably, on the change captioning task, our method outperforms the strongest method Semantic-CC by 1.39 points on the comprehensive S*m metric, which integrates the semantic similarity and descriptive accuracy to provide an overall evaluation of change caption. Moreover, we also perform a series of ablation studies to examine the critical components of our method. The source code and associated data for this work are publicly available at Github.

preprint2023arXiv

Chiral Topological superconductivity in the OAI/SC/FMI heterostructure avoiding the subband problem

Implementing topological superconductivity (TSC) and Majorana states (MSs) is one of the most significant and challenging tasks in both fundamental physics and topological quantum computations. In this work, taking the obstructed atomic insulator (OAI) Nb3Br8, s-wave superconductor (SC) NbSe2 and ferromagnetic insulator (FMI) as example, we propose a new setup to realize the 2D chiral TSC and MSs in the OAI/SC/FMI heterostructure, which could avoid the subband problem effectively and has the advantage of huge Rashba spin-orbit coupling. As a result, the TSC phase can be stabilized in a wide region of chemical potential and Zeeman field, and four distinct TSC phases with superconducting Chern number N= -1, -2, -3, 3 can be achieved. Moreover, a 2D BdG Hamiltonian based on the triangular lattice of obstructed Wannier charge centers, combined with the s-wave superconductivity paring and Zeeman field, is constructed to understand the whole topological phase diagram analytically. These results expand the application of OAIs and pave a new way to realize the TSC and MSs with unique advantages.

preprint2022arXiv

Region-Aware Metric Learning for Open World Semantic Segmentation via Meta-Channel Aggregation

As one of the most challenging and practical segmentation tasks, open-world semantic segmentation requires the model to segment the anomaly regions in the images and incrementally learn to segment out-of-distribution (OOD) objects, especially under a few-shot condition. The current state-of-the-art (SOTA) method, Deep Metric Learning Network (DMLNet), relies on pixel-level metric learning, with which the identification of similar regions having different semantics is difficult. Therefore, we propose a method called region-aware metric learning (RAML), which first separates the regions of the images and generates region-aware features for further metric learning. RAML improves the integrity of the segmented anomaly regions. Moreover, we propose a novel meta-channel aggregation (MCA) module to further separate anomaly regions, forming high-quality sub-region candidates and thereby improving the model performance for OOD objects. To evaluate the proposed RAML, we have conducted extensive experiments and ablation studies on Lost And Found and Road Anomaly datasets for anomaly segmentation and the CityScapes dataset for incremental few-shot learning. The results show that the proposed RAML achieves SOTA performance in both stages of open world segmentation. Our code and appendix are available at https://github.com/czifan/RAML.

preprint2021arXiv

Photoionization-induced broadband dispersive wave generated in an Ar-filled hollow-core photonic crystal fiber

The resonance band in hollow-core photonic crystal fiber (HC-PCF), while leading to high-loss region in the fiber transmission spectrum, has been successfully used for generating phase-matched dispersive wave (DW). Here, we report that the spectral width of the resonance-induced DW can be largely broadened due to plasma-driven blueshifting soliton. In the experiment, we observed that in a short length of Ar-filled single-ring HC-PCF the soliton self-compression and photoionization effects caused a strong spectral blueshift of the pump pulse, changing the phase-matching condition of the DW emission process. Therefore, broadening of DW spectrum to the longer-wavelength side was obtained with several spectral peaks, which correspond to the generation of DW at different positions along the fiber. In the simulation, we used super-Gauss windows with different central wavelengths to filter out these DW spectral peaks, and studied the time-domain characteristics of these peaks respectively using Fourier transform method. The simulation results verified that these multiple-peaks on the DW spectrum have different delays in the time domain, agreeing well with our theoretical prediction. Remarkably, we found that the whole time-domain DW trace can be compressed to ~29 fs using proper chirp compensation. The experimental and numerical results reported here provide some insight into the resonance-induced DW generation process in gas-filled HC-PCFs, they could also pave the way to ultrafast pulse generation using DW-emission mechanism.

preprint2021arXiv

Temperature-Dependent Group Delay of Photonic-Bandgap Hollow-Core Fiber Tuned by Surface-Mode Coupling

Surface modes (SM) are highly spatially localized modes existing at the core-cladding interface of photonic-bandgap hollow-core fiber (PBG-HCF). When coupling with SM, the air modes (AM) in the core would suffer a higher loss despite being spectrally within the cladding photonic bandgap, and would be highly dispersive around the avoided crossing (anti-crossing) wavelength. In this paper, we numerically demonstrate that such avoided crossings can play an important role in the tuning of the temperature dependence of group delay of AM of PBG-HCF. At higher temperatures, both the thermal-optic effect and thermal expansion contribute to the redshift of avoided crossing wavelength, giving rise to a temperature dependence of the AM dispersion. Numerical simulations show that the redshift of avoided crossing can significantly tune the thermal coefficient of delay (TCD) of PBG-HCF from -400 ps/km/K to 400 ps/km/K, approximately -120 ppm/K to 120 ppm/K. In comparison with the known tuning mechanism by the thermal-induced redshift of photonic bandgap [Fokoua et al., Optica 4, 659, 2017], the tuning of TCD by SM coupling presents a much broader tuning range and higher efficiency. Our finding would provide a new route to design PBG-HCF for propagation time sensitive applications.

preprint2020arXiv

Photoionization-assisted, high-efficiency emission of dispersive wave in gas-filled hollow-core photonic crystal fibers

We demonstrate that the phase-matched dispersive wave (DW) emission within the resonance band of a 25-cm-long gas-filled hollow-core photonic crystal fiber (HC-PCF) can be strongly enhanced by the photoionization effect of the pump pulse. In the experiments we observe that as the pulse energy increases, the pump pulse gradually shifts to shorter wavelengths due to soliton-plasma interactions. When the central wavelength of the blueshifting soliton is close to the resonance band of the HC-PCF, high-efficiency energy transfer from the pump light to the DW in the visible region can be obtained. During this DW emission process, we also observe that the spectral center of the DW gradually shifts to longer wavelengths leading to a slightly-increased DW bandwidth, which can be well explained as the consequence of phase-matched coupling between the pump pulse and the DW. In particular, at an input pulse energy of 6 uJ, the spectral ratio of the DW at the fiber output is measured to be as high as ~53% together with a conversion efficiency of ~19%. These experimental results, explained by numerical simulations, pave the way to high-brightness light sources based on high-efficiency frequency-upconversion processes in gas-filled HC-PCFs.

preprint2016arXiv

Weierstrass filtration on Teichmüller curves and Lyapunov exponents: Upper bounds

We get an upper bound of the slope of each graded quotient for the Harder-Narasimhan filtration of the Hodge bundle of a Teichmüller curve. As an application, we show that the sum of Lyapunov exponents of a Teichmüller curve does not exceed ${(g+1)}/{2}$, with equality reached if and only if the curve lies in the hyperelliptic locus induced from $\mathcal{Q}(2k_1,...,2k_n,-1^{2g+2})$ or it is a special Teichmüller curve in $Ω\mathcal{M}_g(1^{2g-2})$. It also gives an unified interpretation for many known results about the special partial sums of Lyapunov exponents on Teichmüller curves.

preprint2015arXiv

On Okounkov's conjecture connecting Hilbert schemes of points and multiple q-zeta values

We compute the generating series for the intersection pairings between the total Chern classes of the tangent bundles of the Hilbert schemes of points on a smooth projective surface and the Chern characters of tautological bundles over these Hilbert schemes. Modulo the lower weight term, we verify Okounkov's conjecture [Oko] connecting these Hilbert schemes and multiple $q$-zeta values. In addition, this conjecture is completely proved when the surface is abelian. We also determine some universal constants in the sense of Boissi\' ere and Nieper-Wisskirchen [Boi, BN] regarding the total Chern classes of the tangent bundles of these Hilbert schemes. The main approach of this paper is to use the set-up of Carlsson and Okounkov outlined in [Car, CO] and the structure of the Chern character operators proved in [LQW2].

preprint2015arXiv

Statefinder hierarchy exploration of the extended Ricci dark energy

We apply the statefinder hierarchy plus the fractional growth parameter to explore the extended Ricci dark energy (ERDE) model, in which there are two independent coefficients $α$ and $β$. By adjusting them, we plot evolution trajectories of some typical parameters, including Hubble expansion rate $E$, deceleration parameter $q$, the third and fourth order hierarchy $S_3^{(1)}$ and $S_4^{(1)}$ and fractional growth parameter $ε$, respectively, as well as several combinations of them. For the case of variable $α$ and constant $β$, in the low-redshift region the evolution trajectories of $E$ are in high degeneracy and that of $q$ separate somewhat. However, the $Λ$CDM model is confounded with ERDE in both of these two cases. $S_3^{(1)}$ and $S_4^{(1)}$, especially the former, perform much better. They can differentiate well only varieties of cases within ERDE except $Λ$CDM in the low-redshift region. For high-redshift region, combinations $\{S_n^{(1)},ε\}$ can break the degeneracy. Both of $\{S_3^{(1)},ε\}$ and $\{S_4^{(1)},ε\}$ have the ability to discriminate ERDE with $α=1$ from $Λ$CDM, of which the degeneracy cannot be broken by all the before-mentioned parameters. For the case of variable $β$ and constant $α$, $S_3^{(1)}(z)$ and $S_4^{(1)}(z)$ can only discriminate ERDE from $Λ$CDM. Nothing but pairs $\{S_3^{(1)},ε\}$ and $\{S_4^{(1)},ε\}$ can discriminate not only within ERDE but also ERDE from $Λ$CDM. Finally we find that $S_3^{(1)}$ is surprisingly a better choice to discriminate within ERDE itself, and ERDE from $Λ$CDM as well, rather than $S_4^{(1)}$.

preprint2014arXiv

Differentially-Private Logistic Regression for Detecting Multiple-SNP Association in GWAS Databases

Following the publication of an attack on genome-wide association studies (GWAS) data proposed by Homer et al., considerable attention has been given to developing methods for releasing GWAS data in a privacy-preserving way. Here, we develop an end-to-end differentially private method for solving regression problems with convex penalty functions and selecting the penalty parameters by cross-validation. In particular, we focus on penalized logistic regression with elastic-net regularization, a method widely used to in GWAS analyses to identify disease-causing genes. We show how a differentially private procedure for penalized logistic regression with elastic-net regularization can be applied to the analysis of GWAS data and evaluate our method's performance.

preprint2014arXiv

Scalable Privacy-Preserving Data Sharing Methodology for Genome-Wide Association Studies

The protection of privacy of individual-level information in genome-wide association study (GWAS) databases has been a major concern of researchers following the publication of "an attack" on GWAS data by Homer et al. (2008) Traditional statistical methods for confidentiality and privacy protection of statistical databases do not scale well to deal with GWAS data, especially in terms of guarantees regarding protection from linkage to external information. The more recent concept of differential privacy, introduced by the cryptographic community, is an approach that provides a rigorous definition of privacy with meaningful privacy guarantees in the presence of arbitrary external information, although the guarantees may come at a serious price in terms of data utility. Building on such notions, Uhler et al. (2013) proposed new methods to release aggregate GWAS data without compromising an individual's privacy. We extend the methods developed in Uhler et al. (2013) for releasing differentially-private $χ^2$-statistics by allowing for arbitrary number of cases and controls, and for releasing differentially-private allelic test statistics. We also provide a new interpretation by assuming the controls' data are known, which is a realistic assumption because some GWAS use publicly available data as controls. We assess the performance of the proposed methods through a risk-utility analysis on a real data set consisting of DNA samples collected by the Wellcome Trust Case Control Consortium and compare the methods with the differentially-private release mechanism proposed by Johnson and Shmatikov (2013).

preprint2014arXiv

Weierstrass filtration on Teichmuller curves and Lyapunov exponents

We define the Weierstrass filtration for Teichmuller curves and construct the Harder-Narasimhan filtration of the Hodge bundle of a Teichmuller curve in hyperelliptic loci and low-genus nonvarying strata. As a result we obtain the sum of Lyapunov exponents of Teichmuller curves in these strata.

preprint2013arXiv

A new inequality on the Hodge number $h^{1,1}$ of algebraic surfaces

We get a new inequality on the Hodge number $h^{1,1}(S)$ of fibred algebraic complex surfaces $S$, which is a generalization of an inequality of Beauville. Our inequality implies the Arakelov type inequalities due to Arakelov, Faltings, Viehweg and Zuo, respectively.

preprint2013arXiv

Max-Min Energy Efficient Beamforming for Multicell Multiuser Joint Transmission Systems

Energy efficient communication technology has attracted much attention due to the explosive growth of energy consumption in current wireless communication systems. In this letter we focus on fairness-based energy efficiency and aim to maximize the minimum user energy efficiency in the multicell multiuser joint beamforming system, taking both dynamic and static power consumptions into account. This optimization problem is a non-convex fractional programming problem and hard to tackle. In order to find its solution, the original problem is transformed into a parameterized polynomial subtractive form by exploiting the relationship between the user rate and the minimum mean square error, and using the fractional programming theorem. Furthermore, an iterative algorithm with proved convergence is developed to achieve a near-optimal performance. Numerical results validate the effectiveness of the proposed solution and show that our algorithm significantly outperforms the max-min rate optimization algorithm in terms of maximizing the minimum energy efficiency.

preprint2013arXiv

Statefinder diagnosis for the extended holographic Ricci dark energy model without and with interaction

We apply the statefinder diagnostic to the extended holographic Ricci dark energy (ERDE) model without and with interaction to study their behaviors. We plot the trajectories of various parameters for different cases. It is shown that the non-interacting model does not reach the LCDM point $\{1,0\}$ and the interacting one is favored, because the interaction makes the evolution of the statefinder pair $\{r,s\}$ quite different.

preprint2012arXiv

Instability of Truncated Symmetric Powers of sheaves

Let $X$ be a smooth projective variety of dimension $n$ over an algebraically closed field $k$ of characteristic $p>0$. Let $F_X:X\rightarrow X$ be the absolute Frobenius morphism, and $\E$ a torsion free sheaf on $X$. We give a upper bound of instability of truncated symmetric powers $\mathrm{T}^l(\E)(0\leq l\leq\rk(\E)(p-1))$ in terms of $L_{\max}(\Omg^1_X)$, $\mathrm{I}(\Omg^1_X)$ and $\mathrm{I}(\E)$ (Theorem \ref{InstabTl}). As an application, We obtain a upper bound of Frobenius direct image ${F_X}_*(\E)$ and some sufficient conditions of slope semi-stability of ${F_X}_*(\E)$. In addition, we study the slope (semi)-stability of sheaves of locally exact (closed) forms $B^i_X$ ($Z^i_X$).

preprint2010arXiv

A more general interacting model of holographic dark energy

So far, there have been no theories or observational data that deny the presence of interaction between dark energy and dark matter. We extend naturally the holographic dark energy (HDE) model, proposed by Granda and Oliveros, in which the dark energy density includes not only the square of the Hubble scale, but also the time derivative of the Hubble scale to the case with interaction and the analytic forms for the cosmic parameters are obtained under the specific boundary conditions. The various behaviors concerning the cosmic expansion depend on the introduced numerical parameters which are also constrained. The more general interacting model inherits the features of the previous ones of HDE, keeping the consistency of the theory.

preprint2010arXiv

The Fourth Gravity Test and Quintessence Matter Field

After the previous work on gravitational frequency shift, light deflection (arXiv:1003.5296) and perihelion advance (arXiv:0812.2332), we calculate carefully the fourth gravity test, i.e. radar echo delay in a central gravity field surrounded by static free quintessence matter, in this paper. Through the Lagrangian method, we find the influence of the quintessence matter on the time delay of null particle is presence by means of an additional integral term. When the quintessence field vanishes, it reduces to the usual Schwarzschild case naturally. Meanwhile, we also use the data of the Viking lander from the Mars and Cassini spacecraft to Saturn to constrain the quintessence field. For the Viking case, the field parameter $α$ is under the order of $10^{-9}$. However, $α$ is under $10^{-18}$ for the Cassini case.

preprint2010arXiv

The influence of quintessence on the motion of a binary system in cosmology

We employ the metric of Schwarzschild space surrounded by quintessential matter to study the trajectories of test masses on the motion of a binary system. The results, which are obtained through the gradually approximate approach, can be used to search for dark energy via the difference of the azimuth angle of the pericenter. The classification of the motion is discussed.

Fei Yu

What is connected

Connect this record

See the researcher in context

Building this map preview

20 published item(s)

SEIF: Self-Evolving Reinforcement Learning for Instruction Following

Towards Comprehensive Interactive Change Understanding in Remote Sensing: A Large-scale Dataset and Dual-granularity Enhanced VLM

Chiral Topological superconductivity in the OAI/SC/FMI heterostructure avoiding the subband problem

Region-Aware Metric Learning for Open World Semantic Segmentation via Meta-Channel Aggregation

Photoionization-induced broadband dispersive wave generated in an Ar-filled hollow-core photonic crystal fiber

Temperature-Dependent Group Delay of Photonic-Bandgap Hollow-Core Fiber Tuned by Surface-Mode Coupling

Photoionization-assisted, high-efficiency emission of dispersive wave in gas-filled hollow-core photonic crystal fibers

Weierstrass filtration on Teichmüller curves and Lyapunov exponents: Upper bounds

On Okounkov's conjecture connecting Hilbert schemes of points and multiple q-zeta values

Statefinder hierarchy exploration of the extended Ricci dark energy

Differentially-Private Logistic Regression for Detecting Multiple-SNP Association in GWAS Databases

Scalable Privacy-Preserving Data Sharing Methodology for Genome-Wide Association Studies

Weierstrass filtration on Teichmuller curves and Lyapunov exponents

A new inequality on the Hodge number $h^{1,1}$ of algebraic surfaces

Max-Min Energy Efficient Beamforming for Multicell Multiuser Joint Transmission Systems

Statefinder diagnosis for the extended holographic Ricci dark energy model without and with interaction

Instability of Truncated Symmetric Powers of sheaves

A more general interacting model of holographic dark energy

The Fourth Gravity Test and Quintessence Matter Field

The influence of quintessence on the motion of a binary system in cosmology