Researcher profile

Sihan Feng

Sihan Feng contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2022arXiv

How and what to learn:The modes of machine learning

Despite their great success, neural networks still remain as black-boxes due to the lack of interpretability. Here we propose a new analyzing method, namely the weight pathway analysis (WPA), to make them transparent. We consider weights in pathways that link neurons longitudinally from input neurons to output neurons, or simply weight pathways, as the basic units for understanding a neural network, and decompose a neural network into a series of subnetworks of such weight pathways. A visualization scheme of the subnetworks is presented that gives longitudinal perspectives of the network like radiographs, making the internal structures of the network visible. Impacts of parameter adjustments or structural changes to the network can be visualized via such radiographs. Characteristic maps are established for subnetworks to characterize the enhancement or suppression of the influence of input samples on each output neuron. Using WPA, we discover that neural network store and utilize information in a holographic way, that is, subnetworks encode all training samples in a coherent structure and thus only by investigating the weight pathways can one explore samples stored in the network. Furthermore, with WPA, we reveal fundamental learning modes of a neural network: the linear learning mode and the nonlinear learning mode. The former extracts linearly separable features while the latter extracts linearly inseparable features. The hidden-layer neurons self-organize into different classes for establishing learning modes and for reaching the training goal. The finding of learning modes provides us the theoretical ground for understanding some of the fundamental problems of machine learning, such as the dynamics of learning process, the role of linear and nonlinear neurons, as well as the role of network width and depth.

preprint2022arXiv

The anti-Fermi-Pasta-Ulam-Tsingou problem in one-dimensional diatomic lattices

We study the thermalization dynamics of one-dimensional diatomic lattices (which represents the simplest system possessing multi-branch phonons), exemplified by the famous Fermi-Pasta-Ulam-Tsingou (FPUT)-$β$ and the Toda models. Here we focus on how the system relaxes to the equilibrium state when part of highest-frequency optical modes are initially excited, which is called the anti-FPUT problem comparing with the original FPUT problem (low frequency excitations of the monatomic lattice). It is shown numerically that the final thermalization time $T_{\rm eq}$ of the diatomic FPUT-$β$ chain depends on whether its acoustic modes are thermalized, whereas the $T_{\rm eq}$ of the diatomic Toda chain depends on the optical ones; in addition, the metastable state of both models have different energy distributions and lifetimes. Despite these differences, in the near-integrable region, the $T_{\rm eq}$ of both models still follows the same scaling law, i.e., $T_{\rm eq}$ is inversely proportional to the square of the perturbation strength. Finally, comparisons of the thermalization behavior between different models under various initial conditions are briefly summarized.