Researcher profile

Fei Ye

Fei Ye contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
14 works
0 followers
11 topics
4 close collaborators

Published work

14 published items

preprint, 2025, arXiv

SeedFold: Scaling Biomolecular Structure Prediction

Highly accurate biomolecular structure prediction is a key component of developing biomolecular foundation models, and one of the most critical aspects of building foundation models is identifying the recipes for scaling the model. In this work, we present SeedFold, a folding model that successfully scales up the model capacity. Our contributions are threefold: first, we identify an effective width-scaling strategy for the Pairformer to increase representation capacity; second, we introduce a novel linear triangular attention that reduces computational complexity to enable efficient scaling; finally, we construct a large-scale distillation dataset to substantially enlarge the training set. Experiments on FoldBench show that SeedFold outperforms AlphaFold3 on most protein-related tasks.

preprint, 2022, arXiv

Continual Variational Autoencoder Learning via Online Cooperative Memorization

Due to their inference, data representation and reconstruction properties, Variational Autoencoders (VAE) have been successfully used in continual learning classification tasks. However, their ability to generate images with specifications corresponding to the classes and databases learned during Continual Learning (CL) is not well understood and catastrophic forgetting remains a significant challenge. In this paper, we first analyze the forgetting behaviour of VAEs by developing a new theoretical framework that formulates CL as a dynamic optimal transport problem. This framework proves approximate bounds to the data likelihood without requiring the task information and explains how the prior knowledge is lost during the training process. We then propose a novel memory buffering approach, namely the Online Cooperative Memorization (OCM) framework, which consists of a Short-Term Memory (STM) that continually stores recent samples to provide future information for the model, and a Long-Term Memory (LTM) aiming to preserve a wide diversity of samples. The proposed OCM transfers certain samples from STM to LTM according to the information diversity selection criterion without requiring any supervised signals. The OCM framework is then combined with a dynamic VAE expansion mixture network to further enhance its performance.

preprint, 2022, arXiv

Lattice dynamics in the charge-density-wave metal at a van-Hove-singularity filling

The charge-density-wave (CDW) order with macroscopically occupied electrons distorts the underlying lattice and usually causes the softening of the associated phonon mode. However, previous studies demonstrated that the spin-Peierls transition does not always induce an associated phonon softening; instead, the central-peak scenario applies in the quasi-one-dimensional compound CuGeO$_3$. We generalize the lattice-dynamics studies to the two-dimensional CDW state at van-Hove-singularity (VHS) filling and find that the CDW ordering could develop a central peak at zero frequency while the associated phonon undergoes hardening. The particle-hole scatterings between VHS points give rise to an increased low-energy charge-density susceptibility, and their coupling to the lattice dynamics induces two poles in the Green function for the CDW-associated phonon mode. The zero-frequency pole corresponds to the collective charge-density and phonon coupling mode. The high-frequency one is related to the high-temperature phonon mode that hardens on approaching the CDW transition. Our result may have implications for the recently discovered Kagome metal $A$V$_3$Sb$_5$ ($A$ = K, Rb, Cs), in which no soft phonon is observed during the CDW transition.

preprint, 2022, arXiv

Learning an evolved mixture model for task-free continual learning

Recently, continual learning (CL) has gained significant interest because it enables deep learning models to acquire new knowledge without forgetting previously learnt information. However, most existing works require knowing the task identities and boundaries, which is not realistic in a real context. In this paper, we address a more challenging and realistic setting in CL, namely Task-Free Continual Learning (TFCL), in which a model is trained on non-stationary data streams with no explicit task information. To address TFCL, we introduce an evolved mixture model whose network architecture is dynamically expanded to adapt to the data distribution shift. We implement this expansion mechanism by evaluating the probability distance between the knowledge stored in each mixture model component and the current memory buffer using the Hilbert-Schmidt Independence Criterion (HSIC). We further introduce two simple dropout mechanisms to selectively remove stored examples in order to avoid memory overload while preserving memory diversity. Empirical results demonstrate that the proposed approach achieves excellent performance.

preprint, 2022, arXiv

Supplemental Material: Lifelong Generative Modelling Using Dynamic Expansion Graph Model

In this article, we provide the appendix for Lifelong Generative Modelling Using Dynamic Expansion Graph Model. This appendix includes additional visual results as well as numerical results on the challenging datasets. In addition, we provide detailed proofs for the proposed theoretical analysis framework. The source code can be found at https://github.com/dtuzi123/Expansion-Graph-Model.

preprint, 2021, arXiv

Facet Dependent Topological Phase Transition in Bi4Br4

The realization of the coexistence of various topologically nontrivial surface states in one material is expected to lay a foundation for new electric applications with selective robust spin current. Here we apply the magnetoconductivity characteristic and angle-resolved photoemission spectroscopy (ARPES) to visualize the evolution of the surface-selected electronic features of the quasi-one-dimensional material Bi4Br4. The transport measurements indicate that the quantum interference correction to conductivity possesses a symbolic spin rotational characteristic correlated to the value of the Berry phase, with the effects of weak localization and weak antilocalization for the (001) and (100) surfaces, respectively. The ARPES spectra provide experimental evidence for a quasi-one-dimensional massless Dirac surface state at the side (100) surface and an anisotropic massive Dirac surface state at the top (001) surface, which is highly consistent with the angle-dependent scaling behavior of the magnetoconductivity. Our results reveal the facet dependent topological phases in quasi-one-dimensional Bi4Br4, stimulating further investigation of these dual topology classes and of feasible technologies for topological spintronics.

preprint, 2020, arXiv

Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning

Lane-change maneuvers are commonly executed by drivers to follow a certain routing plan, overtake a slower vehicle, adapt to a merging lane ahead, etc. However, improper lane change behaviors can be a major cause of traffic flow disruptions and even crashes. While many rule-based methods have been proposed to solve lane change problems for autonomous driving, they tend to exhibit limited performance due to the uncertainty and complexity of the driving environment. Machine learning-based methods offer an alternative approach, as deep reinforcement learning (DRL) has shown promising success in many application domains including robotic manipulation, navigation, and playing video games. However, applying DRL to autonomous driving still faces many practical challenges in terms of slow learning rates, sample inefficiency, and safety concerns. In this study, we propose an automated lane change strategy using proximal policy optimization-based deep reinforcement learning, which shows great advantages in learning efficiency while still maintaining stable performance. The trained agent is able to learn a smooth, safe, and efficient driving policy to make lane-change decisions (i.e. when and how) in challenging situations such as dense traffic scenarios. The effectiveness of the proposed policy is validated using the metrics of task success rate and collision rate. The simulation results demonstrate that lane change maneuvers can be efficiently learned and executed in a safe, smooth, and efficient manner.

preprint, 2020, arXiv

Core-level x-ray photoemission and Raman spectroscopy studies on electronic structures in Mott-Hubbard type nickelate oxide NdNiO$_2$

We perform core-level X-ray photoemission spectroscopy (XPS) and electronic Raman scattering studies of electronic structures and spin fluctuations in the bulk samples of the nickelate oxide NdNiO$_2$. According to Nd $3d$ and O $1s$ XPS spectra, we conclude that NdNiO$_2$ has a large transfer energy. From the analysis of the main line of the Ni $2p_{3/2}$ XPS, we confirm the NiO$_2$ planes in NdNiO$_2$ are of Mott-Hubbard type in the Zaanen-Sawatzky-Allen scheme. The two-magnon peak in the Raman scattering provides direct evidence for the strong spin-fluctuation in NdNiO$_2$. The peak position determines the antiferromagnetic exchange $J=25$~meV. Our experimental results agree well with our previous theoretical results.

preprint, 2020, arXiv

Learning latent representations across multiple data domains using Lifelong VAEGAN

The problem of catastrophic forgetting occurs in deep learning models trained on multiple databases in a sequential manner. Recently, generative replay mechanisms (GRM) have been proposed to reproduce previously learned knowledge, aiming to reduce forgetting. However, such approaches lack an appropriate inference model and therefore cannot provide latent representations of data. In this paper, we propose a novel lifelong learning approach, namely the Lifelong VAEGAN (L-VAEGAN), which not only induces a powerful generative replay network but also learns meaningful latent representations, benefiting representation learning. L-VAEGAN can automatically embed the information associated with different domains into several clusters in the latent space, while also capturing semantically meaningful shared latent variables across different data domains. The proposed model supports many downstream tasks that traditional generative replay methods cannot, including interpolation and inference across different data domains.

preprint, 2020, arXiv

Meta Reinforcement Learning-Based Lane Change Strategy for Autonomous Vehicles

Recent advances in supervised learning and reinforcement learning have provided new opportunities to apply related methodologies to automated driving. However, there are still challenges in achieving automated driving maneuvers in dynamically changing environments. Supervised learning algorithms such as imitation learning can generalize to new environments by training on a large amount of labeled data; however, it is often impractical or cost-prohibitive to obtain sufficient data for each new environment. Although reinforcement learning methods can mitigate this data-dependency issue by training the agent in a trial-and-error way, they still need to re-train policies from scratch when adapting to new environments. In this paper, we thus propose a meta reinforcement learning (MRL) method to improve the agent's generalization capabilities to make automated lane-changing maneuvers in different traffic environments, which are formulated as different traffic congestion levels. Specifically, we train the model at light to moderate traffic densities and test it at a new heavy traffic density condition. We use both collision rate and success rate to quantify the safety and effectiveness of the proposed model. A benchmark model is developed based on a pretraining method, which uses the same network structure and training tasks as our proposed model for fair comparison. The simulation results show that the proposed method achieves an overall success rate up to 20% higher than the benchmark model when it is generalized to the new environment of heavy traffic density. The collision rate is also reduced by up to 18% relative to the benchmark model. Finally, the proposed model shows more stable and efficient generalization capabilities when adapting to the new environment, and it can achieve a 100% success rate and a 0% collision rate with only a few steps of gradient updates.

preprint, 2020, arXiv

Superconductivity in a hole-doped Mott-insulating triangular adatom layer on a silicon surface

Adsorption of one-third monolayer of Sn on an atomically-clean Si(111) substrate produces a two-dimensional triangular adatom lattice with one unpaired electron per site. This dilute adatom reconstruction is an antiferromagnetic Mott insulator; however, the system can be modulation-doped and metallized using heavily-doped p-type Si(111) substrates. Here, we show that the hole-doped dilute adatom layer on a degenerately doped p-type Si(111) wafer is superconducting with a critical temperature of 4.7 ± 0.3 K. While a phonon-mediated coupling scenario would be consistent with the observed $T_c$, Mott correlations in the Sn-derived dangling-bond surface state could suppress the s-wave pairing channel. The latter suggests that the superconductivity in this triangular adatom lattice may be unconventional.

preprint, 2020, arXiv

Using observed bacteria concentration and modeled transit time under an analytical framework to estimate overall removal rate of fecal coliform in an estuary

Abundance of fecal coliform (FC) is widely used to indicate the potential presence of pathogens, the No. 1 cause of water impairments in the U.S. Despite extensive monitoring efforts, assessing and modeling FC pollution still face challenges, largely owing to the uncertainties in estimation of the overall removal rate (K). This study proposes an alternative method to estimate in situ K by combining observational data, hydrodynamic simulation, and an analytical solution. The method requires the observed spatial distribution of FC concentration along an estuarine channel and the numerically-simulated transit time, and converts the K estimation from a temporal problem into a spatial problem, potentially reducing survey duration, effort, and cost. Application of the method gave an estimate of K = 0.5 d$^{-1}$ on average for the Nassawadox Creek in Chesapeake Bay. The numerical and analytical model results with the estimated K agreed well with the observations, demonstrating the credibility of the method.

preprint, 2019, arXiv

Effective Hamiltonian for superconducting Ni oxides Nd$_{1-x}$Sr$_x$NiO$_2$

We derive the effective single-band Hamiltonian in the flat NiO$_2$ planes for nickelate compounds Nd$_{1-x}$Sr$_x$NiO$_2$. We first implement the first-principles calculation to study electronic structures of nickelates using the Heyd-Scuseria-Ernzerhof hybrid density functional and derive a three-band Hubbard model for Ni-O $pd\sigma$ bands of Ni$^+$ $3d_{x^2-y^2}$ and O$^{2-}$ $2p_{x/y}$ orbitals in the NiO$_2$ planes. To obtain the effective one-band $t$-$t'$-$J$ model Hamiltonian, we perform the exact diagonalization of the three-band Hubbard model for the Ni$_5$O$_{16}$ cluster and map the low-energy spectra onto the effective one-band models. We find that the undoped NiO$_2$ plane is a Hubbard Mott insulator, and the doped holes are primarily located on Ni sites. The physics of the NiO$_2$ plane is that of a doped Mott insulator, described by the one-band $t$-$t'$-$J$ model with $t=265$~meV, $t'=-21$~meV and $J=28.6$~meV. We also discuss the electronic structure for the "self-doping" effect and heavy fermion behavior of electron pockets of Nd$^{3+}$ $5d$ character in Nd$_{1-x}$Sr$_x$NiO$_2$.

preprint, 2019, arXiv

Magnetic Raman continuum in single crystalline H$_3$LiIr$_2$O$_6$

Recently H$_3$LiIr$_2$O$_6$ has been reported as a spin-orbital entangled quantum spin liquid (QSL) [K. Kitagawa et al., Nature 554, 341 (2018)], although its connection to the Kitaev QSL has not yet been identified. To unveil the related Kitaev physics, we perform the first Raman spectroscopy studies on single crystalline H$_3$LiIr$_2$O$_6$ samples. We implement a soft chemical replacement of Li$^+$ with H$^+$ from $\alpha$-Li$_2$IrO$_3$ single crystals to synthesize the single crystal samples of the second-generation iridate H$_3$LiIr$_2$O$_6$. Raman spectroscopy can be used to diagnose the QSL state since the magnetic Raman continuum arises from a process involving pairs of fractionalized Majorana fermionic excitations in a pure Kitaev model. We observe a broad dome-shaped magnetic continuum in H$_3$LiIr$_2$O$_6$, in line with theoretical expectations for the two-spin process in the Kitaev QSL. Our results establish the close connection to the Kitaev QSL physics in H$_3$LiIr$_2$O$_6$.