Source author record

Yangfan Hu

Yangfan Hu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Machine Learning, Artificial Intelligence, Computer Vision, cond-mat.mes-hall, cond-mat.mtrl-sci, cond-mat.stat-mech, Neural and Evolutionary Computing

Catalog footprint

What is connected

3 works

7 topics

4 close collaborators


preprint · 2026 · arXiv

Understanding and Preserving Safety in Fine-Tuned LLMs

Fine-tuning is an essential and pervasive way to apply large language models (LLMs) to downstream tasks. However, it can substantially degrade safety alignment, for example by greatly increasing susceptibility to jailbreak attacks, even when the fine-tuning data is entirely harmless. Although defenses at the fine-tuning stage have drawn growing attention, existing methods struggle with a persistent safety-utility dilemma: emphasizing safety compromises task performance, whereas prioritizing utility typically requires deep fine-tuning that inevitably leads to a steep decline in safety. In this work, we address this dilemma by shedding new light on the geometric interaction between safety- and utility-oriented gradients in safety-aligned LLMs. Through systematic empirical analysis, we uncover three key insights: (I) safety gradients lie in a low-rank subspace, while utility gradients span a broader high-dimensional space; (II) these subspaces are often negatively correlated, causing directional conflicts during fine-tuning; and (III) the dominant safety direction can be efficiently estimated from a single sample. Building on these insights, we propose safety-preserving fine-tuning (SPF), a lightweight approach that explicitly removes gradient components conflicting with the low-rank safety subspace. Theoretically, we show that SPF guarantees utility convergence while bounding safety drift. Empirically, SPF consistently maintains downstream task performance and recovers nearly all pre-trained safety alignment, even under adversarial fine-tuning scenarios. Furthermore, SPF exhibits robust resistance to both deep fine-tuning and dynamic jailbreak attacks. Together, our findings provide new mechanistic understanding and practical guidance toward always-aligned LLM fine-tuning.
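The abstract's core step, removing gradient components that conflict with the safety subspace, can be illustrated with a minimal sketch. It is not the paper's SPF implementation: it assumes a rank-one safety subspace spanned by a single safety gradient, treats "conflict" as a negative inner product with that gradient, and uses plain Python lists in place of model parameters. The helper names `dot`, `unit` and `remove_conflict` are hypothetical.

```python
import math

def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def unit(v):
    """Return v scaled to unit length (v must be non-zero)."""
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]

def remove_conflict(utility_grad, safety_grad):
    """Drop the part of utility_grad that opposes safety_grad.

    Hypothetical helper, assuming a rank-one safety subspace.
    A descent step along -utility_grad raises the safety loss to first
    order only when the two gradients have a negative inner product,
    so only that component is projected out.
    """
    u = unit(safety_grad)
    c = dot(utility_grad, u)
    if c >= 0:
        # No conflict: leave the utility gradient untouched.
        return list(utility_grad)
    # Conflict: subtract the component along the safety direction.
    return [g - c * ui for g, ui in zip(utility_grad, u)]

# The second coordinate opposes the safety direction and is removed.
conflicting = remove_conflict([1.0, -2.0, 0.5], [0.0, 1.0, 0.0])
# Here the gradients agree, so nothing changes.
aligned = remove_conflict([1.0, 2.0, 0.5], [0.0, 1.0, 0.0])
```

After the projection the update is orthogonal to the assumed safety direction, so to first order it leaves the safety loss unchanged while keeping every other component of the utility gradient.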

preprint · 2020 · arXiv

Spiking Deep Residual Network

Spiking neural networks (SNNs) have received significant attention for their biological plausibility. SNNs theoretically have at least the same computational power as traditional artificial neural networks (ANNs). They have the potential to achieve energy efficiency while keeping performance comparable to deep neural networks (DNNs). However, training a very deep SNN remains a major challenge. In this paper, we propose an efficient approach to building a spiking version of the deep residual network (ResNet). ResNet is considered a state-of-the-art convolutional neural network (CNN). We convert a trained ResNet into a network of spiking neurons, named Spiking ResNet (S-ResNet). We propose a shortcut conversion model to appropriately scale continuous-valued activations to match firing rates in the SNN, and a compensation mechanism to reduce the error caused by discretisation. Experimental results demonstrate that, compared with state-of-the-art SNN approaches, the proposed Spiking ResNet achieves the best performance on CIFAR-10, CIFAR-100, and ImageNet 2012. Our work is the first to build an SNN deeper than 40 layers with performance comparable to ANNs on a large-scale dataset.
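The idea of matching continuous activations to firing rates can be illustrated with a generic rate-coding sketch. It does not reproduce the paper's shortcut conversion model or compensation mechanism, which the abstract does not specify. It assumes an integrate-and-fire neuron with a constant input, a threshold of 1.0 and reset by subtraction; `scale_activation` and `if_firing_rate` are hypothetical names.

```python
def scale_activation(activation, max_activation):
    """Apply ReLU and scale into [0, 1] so the value can act as a firing rate.

    max_activation is an assumed normalisation constant, e.g. the largest
    activation observed for the layer on training data.
    """
    return max(activation, 0.0) / max_activation

def if_firing_rate(drive, timesteps=1000, threshold=1.0):
    """Simulate an integrate-and-fire neuron with constant input drive.

    The membrane potential accumulates the drive each step; crossing the
    threshold emits a spike and subtracts the threshold (reset by
    subtraction). Returns spikes per timestep.
    """
    membrane = 0.0
    spikes = 0
    for _ in range(timesteps):
        membrane += drive
        if membrane >= threshold:
            membrane -= threshold
            spikes += 1
    return spikes / timesteps

# An activation of 2.0 in a layer whose assumed maximum is 8.0
# is scaled to a drive of 0.25.
rate = if_firing_rate(scale_activation(2.0, 8.0))
# Negative pre-activations are clipped by the ReLU and never spike.
silent = if_firing_rate(scale_activation(-3.0, 8.0))
```

Over a finite number of timesteps the firing rate is only a discretised approximation of the scaled activation, which is the kind of error the abstract's compensation mechanism is meant to reduce.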

preprint · 2015 · arXiv

Nature of Spontaneous Curvature in Suspended Graphene

The nature of the intrinsic ripples in suspended graphene is the key to understanding its stability and to resolving the long-standing theoretical debate over the existence of low-dimensional crystalline states. The rippling morphology of graphene, also found in other 2D materials, has a profound impact on its electronic, mechanical and chemical properties. In fact, rippling phenomena were widely observed before the discovery of graphene: for example, the roughening transition of crystalline interfaces, the rippled phase in biomembranes, and the crumpling of flexible sheet polymers modeled by tethered surfaces. That ripples exist in so many different membrane-like materials suggests a universal physical mechanism, which has remained unclear. We consider the ripples in suspended graphene as two parts, characterizing the first part by the spontaneous curvature k, which stabilizes the possible soft ZA modes, and the second part by the thermal curvature kt, which is caused directly by height fluctuation. By choosing k as the order parameter of the system, we establish a Landau theory, modified by thermal fluctuation, for the wrinkling transition of large-sized graphene. We find that as temperature rises from 0 K, a second-order phase transition occurs at a size-dependent critical temperature Tc, which corresponds to a change of equilibrium configuration from a flat state to a rippled state. Interestingly, the order parameter is stabilized as temperature increases, and the phase transition is associated with a jump in equilibrium bond length as well as a vanishing intrinsic bending rigidity. The results suggest that the interplay between the rippling morphology and the elementary excitations is vital for understanding the behavior of 2D materials. The concepts and theory developed here are of general significance, at least for tethered membranes.
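For orientation, a generic textbook Landau expansion with k as the order parameter shows how a flat state can give way to a rippled state as temperature rises through Tc. This is only a sketch: the coefficients a and b, the reference energy F_0, and the linear temperature dependence are assumptions, and the paper's actual free energy, which is modified by thermal fluctuation, is not given in the abstract.

```latex
% Assumed generic form; a, b > 0 and F_0 are illustrative, not from the paper.
F(k, T) = F_0 + \tfrac{1}{2}\, a\, (T_c - T)\, k^2 + \tfrac{1}{4}\, b\, k^4

% Minimising over k:
\frac{\partial F}{\partial k} = a\, (T_c - T)\, k + b\, k^3 = 0
\quad\Longrightarrow\quad
k_{\mathrm{eq}} =
\begin{cases}
0, & T \le T_c \quad \text{(flat state)} \\[4pt]
\sqrt{\dfrac{a\, (T - T_c)}{b}}, & T > T_c \quad \text{(rippled state)}
\end{cases}
```

In this assumed form the equilibrium curvature grows continuously from zero above Tc, which is the signature of a second-order transition.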