Researcher profile

Song Li

Song Li contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
11works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

11 published item(s)

preprint2026arXiv

How Label Imbalance Shapes Geometry: A General Spectral Analysis of Multi-Label Neural Collapse

This work investigates the phenomenon of Neural Collapse (NC) in multi-label classification, extending its conceptual framework from multi-class learning to general correlated and imbalanced multi-label settings. Although recent studies have identified a ''tag-wise averaging'' structure for multi-label features, this view relies on implicit assumptions of label balance and combinatorial symmetry. Consequently, it fails to account for the geometrical distortions caused by intrinsic label correlations and data imbalance, which are common in practice. We resolve the multiplicity-one imbalance conjecture raised by Li et al. (2024), showing that higher-multiplicity prototypes obey a class-frequency-weighted synthesis rule rather than uniform averaging. To address this, we propose a rigorous spectral-control framework to analyze the terminal phase of multi-label learning under general imbalanced conditions. We introduce the label covariance spectrum $κ_m$, a scalar controlling the distribution-dependent lower-bound geometry, derived from the second-order moment matrix of the label distribution. Contrary to the averaging perspective, our analysis reveals that the centered label covariance spectrum controls the stability of terminal geometry by quantifying the weakest centered inter-class contrast directions. We prove that the classical Tag-wise Averaging emerges only as a special case under perfect orthogonality. Numerical experiments on synthetic distributions validate our theoretical bounds. This work resolves the scaled-average aspect of the imbalance conjecture and establishes a unifying theoretical framework that extends Neural Collapse to complex, imbalanced multi-label settings.

preprint2022arXiv

An Algorithm for Solving Solvable Polynomial Equations of Arbitrary Degree by Radicals

This work provides a method(an algorithm) for solving the solvable unary algebraic equation $f(x)=0$ ($f(x)\in\mathbb{Q}[x]$) of arbitrary degree and obtaining the exact radical roots. This method requires that we know the Galois group as the permutation group of the roots of $f(x)$ and the approximate roots with sufficient precision beforehand. Of course, the approximate roots are not necessary but can help reduce the quantity of computation. The algorithm complexity is approximately proportional to the 4th power of the size of the Galois group of $f(x)$. The whole algorithm doesn't need to deal with tremendous polynomials or reduce symmetric polynomials.

preprint2022arXiv

An Interpretable Federated Learning-based Network Intrusion Detection Framework

Learning-based Network Intrusion Detection Systems (NIDSs) are widely deployed for defending various cyberattacks. Existing learning-based NIDS mainly uses Neural Network (NN) as a classifier that relies on the quality and quantity of cyberattack data. Such NN-based approaches are also hard to interpret for improving efficiency and scalability. In this paper, we design a new local-global computation paradigm, FEDFOREST, a novel learning-based NIDS by combining the interpretable Gradient Boosting Decision Tree (GBDT) and Federated Learning (FL) framework. Specifically, FEDFOREST is composed of multiple clients that extract local cyberattack data features for the server to train models and detect intrusions. A privacy-enhanced technology is also proposed in FEDFOREST to further defeat the privacy of the FL systems. Extensive experiments on 4 cyberattack datasets of different tasks demonstrate that FEDFOREST is effective, efficient, interpretable, and extendable. FEDFOREST ranks first in the collaborative learning and cybersecurity competition 2021 for Chinese college students.

preprint2022arXiv

Can electron and muon $g-2$ anomalies be jointly explained in SUSY?

The FNAL+BNL measurements for muon $g-2$ is $4.2σ$ above the SM prediction, and the Berkeley $^{133}$Cs measurement for the fine-structure constant $α_{\rm em}$ leads to the SM prediction for electron $g-2$ which is $2.4σ$ above the experimental value. Hence, a joint explanation of both anomalies requires a positive contribution to muon $g-2$ and a negative contribution to electron $g-2$, which is rather challenging. In this work we explore the possibility of such a joint explanation in the minimal supersymmetric standard model (MSSM). Assuming no universality between smuon and selectron soft masses, we find out a part of parameter space for a joint explanation at $2σ$ level, i.e., $μM_1,μM_2<0$, $m_{L1}, m_{E2}<200$ GeV, $m_{L2}$ being much larger than the soft masses of other sleptons, $|M_1|<125$ GeV and $μ<400$ GeV. This part of parameter space can survive LHC and LEP constraints, but gives an over-abundance for dark matter if the bino-like lightest neutralino is assumed to be the dark matter candidate. With the assumption that the dark matter candidate is a superWIMP (say a pseudo-goldstino in multi-sector SUSY breaking scenarios, whose mass can be as light as GeV and produced from the late-decay of the thermally freeze-out lightest neutralino), the dark matter problem can be avoided. So, we conclude that the MSSM may give a joint explanation for the muon and electron $g-2$ anomalies at $2σ$ level (the muon $g-2$ anomaly can be even ameliorated to $1σ$).

preprint2022arXiv

Dynamic Differential-Privacy Preserving SGD

The vanilla Differentially-Private Stochastic Gradient Descent (DP-SGD), including DP-Adam and other variants, ensures the privacy of training data by uniformly distributing privacy costs across training steps. The equivalent privacy costs controlled by maintaining the same gradient clipping thresholds and noise powers in each step result in unstable updates and a lower model accuracy when compared to the non-DP counterpart. In this paper, we propose the dynamic DP-SGD (along with dynamic DP-Adam, and others) to reduce the performance loss gap while maintaining privacy by dynamically adjusting clipping thresholds and noise powers while adhering to a total privacy budget constraint. Extensive experiments on a variety of deep learning tasks, including image classification, natural language processing, and federated learning, demonstrate that the proposed dynamic DP-SGD algorithm stabilizes updates and, as a result, significantly improves model accuracy in the strong privacy protection region when compared to the vanilla DP-SGD. We also conduct theoretical analysis to better understand the privacy-utility trade-off with dynamic DP-SGD, as well as to learn why Dynamic DP-SGD can outperform vanilla DP-SGD.

preprint2022arXiv

Explanation of electron and muon $g-2$ anomalies in AMSB

We propose to jointly explain the electron/muon $g-2$ anomalies in the framework of anomaly mediated SUSY breaking (AMSB) scenario. Two Yukawa deflected AMSB models are proposed and discussed in depth: one with lepton-specific interactions and the other one with messenger-matter interactions. Both models are found to be able to jointly explain the anomalies at $2 σ$ level by naturally realizing the preferred parameter space with $μM_1,μM_2<0$ and very heavy left-handed smuon.

preprint2022arXiv

GraphTrack: A Graph-based Cross-Device Tracking Framework

Cross-device tracking has drawn growing attention from both commercial companies and the general public because of its privacy implications and applications for user profiling, personalized services, etc. One particular, wide-used type of cross-device tracking is to leverage browsing histories of user devices, e.g., characterized by a list of IP addresses used by the devices and domains visited by the devices. However, existing browsing history based methods have three drawbacks. First, they cannot capture latent correlations among IPs and domains. Second, their performance degrades significantly when labeled device pairs are unavailable. Lastly, they are not robust to uncertainties in linking browsing histories to devices. We propose GraphTrack, a graph-based cross-device tracking framework, to track users across different devices by correlating their browsing histories. Specifically, we propose to model the complex interplays among IPs, domains, and devices as graphs and capture the latent correlations between IPs and between domains. We construct graphs that are robust to uncertainties in linking browsing histories to devices. Moreover, we adapt random walk with restart to compute similarity scores between devices based on the graphs. GraphTrack leverages the similarity scores to perform cross-device tracking. GraphTrack does not require labeled device pairs and can incorporate them if available. We evaluate GraphTrack on two real-world datasets, i.e., a publicly available mobile-desktop tracking dataset (around 100 users) and a multiple-device tracking dataset (154K users) we collected. Our results show that GraphTrack substantially outperforms the state-of-the-art on both datasets.

preprint2021arXiv

The Observation of Ferroelastic and Ferrielectric Domains in AgNbO3 Single Crystal

Compared to AgNbO3 based ceramics, the experimental investigations on the single crystalline AgNbO3, especially the ground state and ferroic domain structures, are not on the same level. Here in this work, based on successfully synthesized AgNbO3 single crystal using flux method, we observed the coexistence of ferroelastic and ferrielectric domain structures by a combination study of polarized light microscopy and piezoresponse force microscope, this finding may provide a new aspect for studying AgNbO3. The result also suggests a weak electromechanical response from the ferrielectric phase of AgNbO3 which is also supported by the transmission electron microscope characterization. Our results reveal that the AgNbO3 single crystal is in a polar ferrielectric phase at room temperature, clarifying its ground state which is controversial from the AgNbO3 ceramic materials.

preprint2020arXiv

Giant shift upon strain on the fluorescence spectrum of V$_{\rm N}$N$_{\rm B}$ color centers in $h$-BN

We study the effect of strain on the physical properties of the nitrogen antisite-vacancy pair in hexagonal boron nitride ($h$-BN), a color center that may be employed as a quantum bit in a two-dimensional material. With group theory and ab-initio analysis we show that strong electron-phonon coupling plays a key role in the optical activation of this color center. We find a giant shift on the zero-phonon-line (ZPL) emission of the nitrogen antisite-vacancy pair defect upon applying strain that is typical of $h$-BN samples. Our results provide a plausible explanation for the experimental observation of quantum emitters with similar optical properties but widely scattered ZPL wavelengths and the experimentally observed dependence of the ZPL on the strain.

preprint2018arXiv

Statefinder diagnostic and constraints on the Palatini f(R) gravity theories

We focus on a series of $f(R)$ gravity theories in Palatini formalism to investigate the probabilities of producing the late-time acceleration for the flat Friedmann-Robertson-Walker (FRW) universe. We apply statefinder diagnostic to these cosmological models for chosen series of parameters to see if they distinguish from one another. The diagnostic involves the statefinder pair $\{r,s\}$, where $r$ is derived from the scale factor $a$ and its higher derivatives with respect to the cosmic time $t$, and $s$ is expressed by $r$ and the deceleration parameter $q$. In conclusion, we find that although two types of $f(R)$ theories: (i) $f(R) = R + αR^m - βR^{-n}$ and (ii) $f(R) = R + α\ln R - β$ can lead to late-time acceleration, their evolutionary trajectories in the $r-s$ and $r-q$ planes reveal different evolutionary properties, which certainly justify the merits of statefinder diagnostic. Additionally, we utilize the observational Hubble parameter data (OHD) to constrain these models of $f(R)$ gravity. As a result, except for $m=n=1/2$ of (i) case, $α=0$ of (i) case and (ii) case allow $Λ$CDM model to exist in 1$σ$ confidence region. After adopting statefinder diagnostic to the best-fit models, we find that all the best-fit models are capable of going through deceleration/acceleration transition stage with late-time acceleration epoch, and all these models turn to de-Sitter point ($\{r,s\}=\{1,0\}$) in the future. Also, the evolutionary differences between these models are distinct, especially in $r-s$ plane, which makes the statefinder diagnostic more reliable in discriminating cosmological models.

preprint2010arXiv

Statefinder diagnosis for the Palatini $f(R)$ gravity theories

The Palatini $f(R)$ gravity, is able to probably explain the late time cosmic acceleration without the need for dark energy, is studied. In this paper, we investigate a number of $f(R)$ gravity theories in Palatini formalism by means of statefinder diagnosis. We consider two types of $f(R)$ theories: (i) $f(R)=R+αR^{m}-βR^{-n}$ and (ii) $f(R)=R+αln R+β$. We find that the evolutionary trajectories in the $s-r$ and $q-r$ planes for various types of the Palatini $f(R)$ theories reveal different evolutionary properties of the universe. Additionally, we use the observational $H(z)$ data to constrain models of $f(R)$ gravity.