Researcher profile

Yufei Huang

Yufei Huang contributes to research discovery and scholarly infrastructure.

Researcher, Affiliation not imported, Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging, Verification L1, Unclaimed author
9 works
0 followers
11 topics
4 close collaborators

Published work

9 published items

preprint 2026 arXiv

UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward

While Large Language Models (LLMs) have demonstrated significant potential in natural language processing, complex general-purpose reasoning requiring multi-step logic, planning, and verification remains a critical bottleneck. Although Reinforcement Learning with Verifiable Rewards (RLVR) has succeeded in specific domains, the field lacks large-scale, high-quality, and difficulty-calibrated data for general reasoning. To address this, we propose UltraLogic, a framework that decouples the logical core of a problem from its natural language expression through a Code-based Solving methodology to automate high-quality data production. The framework comprises hundreds of unique task types and an automated calibration pipeline across ten difficulty levels. Furthermore, to mitigate binary reward sparsity and the Non-negative Reward Trap, we introduce the Bipolar Float Reward (BFR) mechanism, utilizing graded penalties to effectively distinguish perfect responses from those with logical flaws. Our experiments demonstrate that task diversity is the primary driver for reasoning enhancement, and that BFR, combined with a difficulty matching strategy, significantly improves training efficiency, guiding models toward global logical optima.

preprint 2023 arXiv

Risk-aware Vehicle Motion Planning Using Bayesian LSTM-Based Model Predictive Control

Understanding the probabilistic traffic environment is a vital challenge for the motion planning of autonomous vehicles. To make feasible control decisions, forecasting future trajectories of adjacent cars is essential for intelligent vehicles to assess potential conflicts and react to reduce the risk. This paper first introduces a Bayesian Long Short-term Memory (BLSTM) model to learn human drivers' behaviors and habits from their historical trajectory data. The model predicts the probability distribution of surrounding vehicles' positions, which are used to estimate dynamic conflict risks. Next, a hybrid automaton is built to model the basic motions of a car, and the conflict risks are assessed for real-time state-space transitions based on environmental information. Finally, a BLSTM-based Model Predictive Control (MPC) is built to navigate vehicles through safe paths with the least predicted conflict risk. By merging BLSTM with MPC, the designed neural-based MPC overcomes the difficulty that traditional MPC has in modeling uncertain conflict risks. The simulation results show that our proposed BLSTM-based MPC performs better than human drivers because it can foresee potential conflicts and take action to avoid them.

preprint 2022 arXiv

A Survey on Protein Representation Learning: Retrospect and Prospect

Proteins are fundamental biological entities that play a key role in life activities. The amino acid sequences of proteins can be folded into stable 3D structures in the real physicochemical world, forming a special kind of sequence-structure data. With the development of Artificial Intelligence (AI) techniques, Protein Representation Learning (PRL) has recently emerged as a promising research topic for extracting informative knowledge from massive protein sequences or structures. To pave the way for AI researchers with little bioinformatics background, we present a timely and comprehensive review of PRL formulations and existing PRL methods from the perspective of model architectures, pretext tasks, and downstream applications. We first briefly introduce the motivations for protein representation learning and formulate it in a general and unified framework. Next, we divide existing PRL methods into three main categories: sequence-based, structure-based, and sequence-structure co-modeling. Finally, we discuss some technical challenges and potential directions for improving protein representation learning. The latest advances in PRL methods are summarized in a GitHub repository https://github.com/LirongWu/awesome-protein-representation-learning.

preprint 2022 arXiv

Sharp bounds on the symmetric division deg index of graphs and line graphs

For a graph $G$ with vertex set $V_{G}$ and edge set $E_{G}$, the symmetric division deg index is defined as $SDD(G)=\sum\limits_{uv\in E_{G}}\left(\frac{d_{u}}{d_{v}}+\frac{d_{v}}{d_{u}}\right)$, where $d_{u}$ denotes the degree of vertex $u$ in $G$. In 2018, Furtula et al. confirmed that the quality of the SDD index exceeds that of some more popular VDB indices, in particular that of the GA index. They showed a close connection between the SDD index and the earlier well-established GA index. Thus it is meaningful and important to consider the chemical and mathematical properties of the SDD index. In this paper, we determine some sharp bounds on the symmetric division deg index of graphs and line graphs and characterize the corresponding extremal graphs.
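
The definition of $SDD(G)$ in the abstract above can be evaluated directly from an edge list. The following is a minimal sketch, not code from the paper: the function name `sdd_index` and the edge-list input format are assumptions made for illustration, and the graph is assumed to be simple (no loops or repeated edges).

```python
from collections import Counter


def sdd_index(edges):
    """Symmetric division deg index of a simple graph given as (u, v) pairs."""
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    # Each edge uv contributes d_u/d_v + d_v/d_u to the sum.
    return sum(degree[u] / degree[v] + degree[v] / degree[u] for u, v in edges)


# Illustrative inputs (not taken from the paper).
star = [(0, 1), (0, 2), (0, 3)]            # K_{1,3}: centre degree 3, leaves degree 1
cycle = [(0, 1), (1, 2), (2, 3), (3, 0)]   # C_4: every vertex has degree 2

print(sdd_index(star))
print(sdd_index(cycle))
```

In a regular graph every edge contributes exactly 2, so the index equals twice the number of edges (8 for the 4-cycle). In the star each edge contributes 3 + 1/3, so the three edges sum to 10 up to floating-point rounding.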

preprint 2022 arXiv

The Sombor index and coindex of two-trees

The Sombor index of a graph $G$, introduced by Ivan Gutman, is defined as the sum of the weights $\sqrt{d_G(u)^2+d_G(v)^2}$ of all edges $uv$ of $G$, where $d_G(u)$ denotes the degree of vertex $u$ in $G$. The Sombor coindex was recently defined as $\overline{SO}(G)=\sum \limits_{uv\notin E(G)}\sqrt{d_G(u)^2+d_G(v)^2}$. In this paper, the maximum and second-maximum Sombor index and the minimum and second-minimum Sombor coindex among two-trees are determined.
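
Both quantities defined in the abstract above can be computed from the degree sequence and the edge set. The sketch below is illustrative only and is not taken from the paper: the function names, the convention that vertices are labelled `0..n-1`, and the example graph are assumptions. The coindex sum is taken over unordered pairs of distinct non-adjacent vertices.

```python
import math
from itertools import combinations


def degrees(n, edges):
    """Degree of each vertex 0..n-1 in a simple graph."""
    deg = [0] * n
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    return deg


def sombor_index(n, edges):
    """Sum of sqrt(d(u)^2 + d(v)^2) over all edges uv."""
    deg = degrees(n, edges)
    return sum(math.hypot(deg[u], deg[v]) for u, v in edges)


def sombor_coindex(n, edges):
    """Sum of sqrt(d(u)^2 + d(v)^2) over all non-adjacent pairs u != v."""
    deg = degrees(n, edges)
    adjacent = {frozenset(e) for e in edges}
    return sum(
        math.hypot(deg[u], deg[v])
        for u, v in combinations(range(n), 2)
        if frozenset((u, v)) not in adjacent
    )


# A two-tree on 4 vertices: triangle 0-1-2 plus vertex 3 joined to edge 1-2.
diamond = [(0, 1), (0, 2), (1, 2), (1, 3), (2, 3)]

print(sombor_index(4, diamond))
print(sombor_coindex(4, diamond))
```

For this graph the degrees are 2, 3, 3, 2, so the index is $4\sqrt{13}+\sqrt{18}$ and the coindex, coming from the single non-edge between vertices 0 and 3, is $\sqrt{8}$.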

preprint 2022 arXiv

The spread of generalized reciprocal distance matrix

The generalized reciprocal distance matrix $RD_\alpha(G)$ was defined as $RD_\alpha(G)=\alpha RT(G)+(1-\alpha)RD(G),\quad 0\leq \alpha\leq 1.$ Let $\lambda_{1}(RD_\alpha(G))\geq \lambda_{2}(RD_\alpha(G))\geq \cdots \geq \lambda_{n}(RD_\alpha(G))$ be the eigenvalues of the $RD_\alpha$ matrix of a graph $G$. Then the $RD_\alpha$-spread of $G$ can be defined as $S_{RD_\alpha}(G)=\lambda_{1}(RD_\alpha(G))-\lambda_{n}(RD_\alpha(G))$. In this paper, we first obtain some sharp lower and upper bounds for the $RD_\alpha$-spread of graphs. Then we determine the lower bounds for the $RD_\alpha$-spread of bipartite graphs and graphs with given clique number. Finally, we give the $RD_\alpha$-spread of double star graphs. Our results generalize the related results of the reciprocal distance matrix and reciprocal distance signless Laplacian matrix.
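
As a rough numerical illustration of the spread defined in the abstract above, the sketch below builds the matrix for a small connected graph and takes the difference between its largest and smallest eigenvalues. Several things here are assumptions, since the abstract does not spell them out: $RD(G)$ is taken to have entries $1/d(u,v)$ off the diagonal and zeros on it, $RT(G)$ is taken to be the diagonal matrix of its row sums, and all function names are hypothetical. Eigenvalues are computed with a plain cyclic Jacobi iteration to keep the example dependency-free; a numerical library would normally be used instead.

```python
import math
from collections import deque


def distances_from(adj, source):
    """Breadth-first search distances in an unweighted graph."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w not in dist:
                dist[w] = dist[u] + 1
                queue.append(w)
    return dist


def rd_alpha_matrix(adj, alpha):
    """alpha * RT + (1 - alpha) * RD for a connected graph (assumed definitions)."""
    n = len(adj)
    rd = [[0.0] * n for _ in range(n)]
    for u in range(n):
        dist = distances_from(adj, u)
        for v in range(n):
            if v != u:
                rd[u][v] = 1.0 / dist[v]  # raises KeyError if G is disconnected
    m = [[(1 - alpha) * rd[u][v] for v in range(n)] for u in range(n)]
    for u in range(n):
        m[u][u] = alpha * sum(rd[u])  # reciprocal transmission of u
    return m


def symmetric_eigenvalues(matrix, sweeps=50, tol=1e-22):
    """Eigenvalues of a real symmetric matrix via cyclic Jacobi rotations."""
    a = [row[:] for row in matrix]
    n = len(a)
    for _ in range(sweeps):
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(i + 1, n))
        if off < tol:
            break
        for p in range(n):
            for q in range(p + 1, n):
                if abs(a[p][q]) < 1e-15:
                    continue
                theta = 0.5 * math.atan2(2 * a[p][q], a[q][q] - a[p][p])
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):  # rotate columns p and q
                    akp, akq = a[k][p], a[k][q]
                    a[k][p], a[k][q] = c * akp - s * akq, s * akp + c * akq
                for k in range(n):  # rotate rows p and q
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k], a[q][k] = c * apk - s * aqk, s * apk + c * aqk
    return sorted((a[i][i] for i in range(n)), reverse=True)


def rd_alpha_spread(adj, alpha):
    """Largest minus smallest eigenvalue of the RD_alpha matrix."""
    eig = symmetric_eigenvalues(rd_alpha_matrix(adj, alpha))
    return eig[0] - eig[-1]


k4 = [[1, 2, 3], [0, 2, 3], [0, 1, 3], [0, 1, 2]]  # complete graph K_4
print(rd_alpha_spread(k4, 0.5))
```

Under these assumed definitions the complete graph $K_n$ has eigenvalues $n-1$ (once) and $\alpha n-1$ (with multiplicity $n-1$), so its spread is $n(1-\alpha)$, which gives a quick sanity check for the sketch.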

preprint 2022 arXiv

Toward Deep Learning Based Access Control

A common trait of current access control approaches is the challenging need to engineer abstract and intuitive access control models. This entails designing access control information in the form of roles (RBAC), attributes (ABAC), or relationships (ReBAC) as the case may be, and subsequently, designing access control rules. This framework has its benefits but has significant limitations in the context of modern systems that are dynamic, complex, and large-scale, where it is difficult for a human administrator to maintain an accurate access control state in the system. This paper proposes Deep Learning Based Access Control (DLBAC) by leveraging significant advances in deep learning technology as a potential solution to this problem. We envision that DLBAC could complement and, in the long term, potentially even replace classical access control models with a neural network that reduces the burden of access control model engineering and updates. Without loss of generality, we conduct a thorough investigation of a candidate DLBAC model, called DLBAC_alpha, using both real-world and synthetic datasets. We demonstrate the feasibility of the proposed approach by addressing issues related to accuracy, generalization, and explainability. We also discuss challenges and future research directions.

preprint 2021 arXiv

Engineered Raman Lasing in Photonic Integrated Chalcogenide Microresonators

Chalcogenide glass (ChG) is an attractive material for integrated nonlinear photonics due to its wide transparency and high nonlinearity, and its capability of being directly deposited and patterned on silicon wafer substrates. It has a singular Raman effect among amorphous materials. Yet, the Raman lasing performance in high-quality, chip-integrated ChG microresonators remains unexplored. Here, we demonstrate engineered Raman lasing dynamics based on home-developed photonic integrated high-Q ChG microresonators. With a quality factor above 10^6, we achieve a record-low lasing threshold of 3.25 mW among integrated planar photonic platforms. Both single-mode Raman lasers and a broadband Raman-Kerr comb are observed and characterized; these depend on the dispersion of our flexible photonic platform and are engineered by tuning the waveguide geometric size. The tunability of such a chip-scale Raman laser is also demonstrated through tuning the pump wavelength and tuning the operating temperature on the chip. This allows access to single-mode lasing at arbitrary wavelengths in the range 1615-1755 nm. Our results may contribute to the understanding of rich Raman and Kerr nonlinear interactions in dissipative and nonlinear microresonators and, on the application side, may pave the way to efficient chip-scale Raman lasers that are highly desired in spectroscopic applications in the infrared.

preprint 2020 arXiv

Modeling EEG data distribution with a Wasserstein Generative Adversarial Network to predict RSVP Events

Electroencephalography (EEG) data are difficult to obtain due to complex experimental setups and reduced comfort with prolonged wearing. This poses challenges to training powerful deep learning models with the limited EEG data. Being able to generate EEG data computationally could address this limitation. We propose a novel Wasserstein Generative Adversarial Network with gradient penalty (WGAN-GP) to synthesize EEG data. This network addresses several modeling challenges of simulating time-series EEG data including frequency artifacts and training instability. We further extended this network to a class-conditioned variant that also includes a classification branch to perform event-related classification. We trained the proposed networks to generate one- and 64-channel data resembling EEG signals routinely seen in a rapid serial visual presentation (RSVP) experiment and demonstrated the validity of the generated samples. We also tested intra-subject cross-session classification performance for classifying the RSVP target events and showed that class-conditioned WGAN-GP can achieve improved event-classification performance over EEGNet.