Graph explorer

Quiver neural networks

We develop a uniform theoretical approach towards the analysis of various neural network connectivity architectures by introducing the notion of a quiver neural network. Inspired by quiver representation theory in mathematics, this approach gives a compact way to capture elaborate data flows in complex network architectures. As an application, we use parameter space symmetries to prove a lossless model compression algorithm for quiver neural networks with certain non-pointwise activations known as rescaling activations. In the case of radial rescaling activations, we prove that training the compressed model with gradient descent is equivalent to training the original model with projected gradient descent.

5 nodes5 linksoverview previewQuiver neural networks
5 nodes5 links
Quiver neural networks5 visible / 5 total nodes / 6 links
Co-authorshipAuthorshipAuthorshipTopic signalTopic signalRelated contextWQuiver neural networkspreprint / 2022AIordan GanevResearcherARobin WaltersResearcherTMachine Learning49008 worksTmath.RT2974 works
PaperSignal 104 links

Quiver neural networks

preprint / 2022

Open