Source author record

Sihan Feng

Sihan Feng appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.dis-nn cond-mat.stat-mech Machine Learning physics.data-an

Catalog footprint

What is connected

2works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

How and what to learn:The modes of machine learning

Despite their great success, neural networks still remain as black-boxes due to the lack of interpretability. Here we propose a new analyzing method, namely the weight pathway analysis (WPA), to make them transparent. We consider weights in pathways that link neurons longitudinally from input neurons to output neurons, or simply weight pathways, as the basic units for understanding a neural network, and decompose a neural network into a series of subnetworks of such weight pathways. A visualization scheme of the subnetworks is presented that gives longitudinal perspectives of the network like radiographs, making the internal structures of the network visible. Impacts of parameter adjustments or structural changes to the network can be visualized via such radiographs. Characteristic maps are established for subnetworks to characterize the enhancement or suppression of the influence of input samples on each output neuron. Using WPA, we discover that neural network store and utilize information in a holographic way, that is, subnetworks encode all training samples in a coherent structure and thus only by investigating the weight pathways can one explore samples stored in the network. Furthermore, with WPA, we reveal fundamental learning modes of a neural network: the linear learning mode and the nonlinear learning mode. The former extracts linearly separable features while the latter extracts linearly inseparable features. The hidden-layer neurons self-organize into different classes for establishing learning modes and for reaching the training goal. The finding of learning modes provides us the theoretical ground for understanding some of the fundamental problems of machine learning, such as the dynamics of learning process, the role of linear and nonlinear neurons, as well as the role of network width and depth.

preprint2022arXiv

The anti-Fermi-Pasta-Ulam-Tsingou problem in one-dimensional diatomic lattices

We study the thermalization dynamics of one-dimensional diatomic lattices (which represents the simplest system possessing multi-branch phonons), exemplified by the famous Fermi-Pasta-Ulam-Tsingou (FPUT)-$β$ and the Toda models. Here we focus on how the system relaxes to the equilibrium state when part of highest-frequency optical modes are initially excited, which is called the anti-FPUT problem comparing with the original FPUT problem (low frequency excitations of the monatomic lattice). It is shown numerically that the final thermalization time $T_{\rm eq}$ of the diatomic FPUT-$β$ chain depends on whether its acoustic modes are thermalized, whereas the $T_{\rm eq}$ of the diatomic Toda chain depends on the optical ones; in addition, the metastable state of both models have different energy distributions and lifetimes. Despite these differences, in the near-integrable region, the $T_{\rm eq}$ of both models still follows the same scaling law, i.e., $T_{\rm eq}$ is inversely proportional to the square of the perturbation strength. Finally, comparisons of the thermalization behavior between different models under various initial conditions are briefly summarized.

Sihan Feng

What is connected

Connect this record

See the researcher in context

Building this map preview

2 published item(s)

How and what to learn:The modes of machine learning

The anti-Fermi-Pasta-Ulam-Tsingou problem in one-dimensional diatomic lattices