Source author record

Alberto Fachechi

Alberto Fachechi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.dis-nn Machine Learning hep-th Artificial Intelligence math-ph math.MP Neurons and Cognition

Catalog footprint

What is connected

9works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Pavlov Learning Machines

As well known, Hebb's learning traces its origin in Pavlov's Classical Conditioning, however, while the former has been extensively modelled in the past decades (e.g., by Hopfield model and countless variations on theme), as for the latter modelling has remained largely unaddressed so far; further, a bridge between these two pillars is totally lacking. The main difficulty towards this goal lays in the intrinsically different scales of the information involved: Pavlov's theory is about correlations among \emph{concepts} that are (dynamically) stored in the synaptic matrix as exemplified by the celebrated experiment starring a dog and a ring bell; conversely, Hebb's theory is about correlations among pairs of adjacent neurons as summarized by the famous statement {\em neurons that fire together wire together}. In this paper we rely on stochastic-process theory and model neural and synaptic dynamics via Langevin equations, to prove that -- as long as we keep neurons' and synapses' timescales largely split -- Pavlov mechanism spontaneously takes place and ultimately gives rise to synaptic weights that recover the Hebbian kernel.

preprint2021arXiv

Pattern recognition in Deep Boltzmann machines

We consider a multi-layer Sherrington-Kirkpatrick spin-glass as a model for deep restricted Boltzmann machines and we solve for its quenched free energy, in the thermodynamic limit and allowing for a first step of replica symmetry breaking. This result is accomplished rigorously exploiting interpolating techniques and recovering the expression already known for the replica-symmetry case. Further, we drop the restriction constraint by introducing intra-layer connections among spins and we show that the resulting system can be mapped into a modular Hopfield network, which is also addressed rigorously via interpolating techniques up to the first step of replica symmetry breaking.

preprint2020arXiv

Generalized Guerra's interpolation schemes for dense associative neural networks

In this work we develop analytical techniques to investigate a broad class of associative neural networks set in the high-storage regime. These techniques translate the original statistical-mechanical problem into an analytical-mechanical one which implies solving a set of partial differential equations, rather than tackling the canonical probabilistic route. We test the method on the classical Hopfield model - where the cost function includes only two-body interactions (i.e., quadratic terms) - and on the "relativistic" Hopfield model - where the (expansion of the) cost function includes p-body (i.e., of degree p) contributions. Under the replica symmetric assumption, we paint the phase diagrams of these models by obtaining the explicit expression of their free energy as a function of the model parameters (i.e., noise level and memory storage). Further, since for non-pairwise models ergodicity breaking is non necessarily a critical phenomenon, we develop a fluctuation analysis and find that criticality is preserved in the relativistic model.

preprint2019arXiv

Interpolating between boolean and extremely high noisy patterns through Minimal Dense Associative Memories

Recently, Hopfield and Krotov introduced the concept of {\em dense associative memories} [DAM] (close to spin-glasses with $P$-wise interactions in a disordered statistical mechanical jargon): they proved a number of remarkable features these networks share and suggested their use to (partially) explain the success of the new generation of Artificial Intelligence. Thanks to a remarkable ante-litteram analysis by Baldi \& Venkatesh, among these properties, it is known these networks can handle a maximal amount of stored patterns $K$ scaling as $K \sim N^{P-1}$.\\ In this paper, once introduced a {\em minimal dense associative network} as one of the most elementary cost-functions falling in this class of DAM, we sacrifice this high-load regime -namely we force the storage of {\em solely} a linear amount of patterns, i.e. $K = αN$ (with $α>0$)- to prove that, in this regime, these networks can correctly perform pattern recognition even if pattern signal is $O(1)$ and is embedded in a sea of noise $O(\sqrt{N})$, also in the large $N$ limit. To prove this statement, by extremizing the quenched free-energy of the model over its natural order-parameters (the various magnetizations and overlaps), we derived its phase diagram, at the replica symmetric level of description and in the thermodynamic limit: as a sideline, we stress that, to achieve this task, aiming at cross-fertilization among disciplines, we pave two hegemon routes in the statistical mechanics of spin glasses, namely the replica trick and the interpolation technique.\\ Both the approaches reach the same conclusion: there is a not-empty region, in the noise-$T$ vs load-$α$ phase diagram plane, where these networks can actually work in this challenging regime; in particular we obtained a quite high critical (linear) load in the (fast) noiseless case resulting in $\lim_{β\to \infty}α_c(β)=0.65$.

preprint2019arXiv

Neural networks with redundant representation: detecting the undetectable

We consider a three-layer Sejnowski machine and show that features learnt via contrastive divergence have a dual representation as patterns in a dense associative memory of order P=4. The latter is known to be able to Hebbian-store an amount of patterns scaling as N^{P-1}, where N denotes the number of constituting binary neurons interacting P-wisely. We also prove that, by keeping the dense associative network far from the saturation regime (namely, allowing for a number of patterns scaling only linearly with N, while P>2) such a system is able to perform pattern recognition far below the standard signal-to-noise threshold. In particular, a network with P=4 is able to retrieve information whose intensity is O(1) even in the presence of a noise O(\sqrt{N}) in the large N limit. This striking skill stems from a redundancy representation of patterns -- which is afforded given the (relatively) low-load information storage -- and it contributes to explain the impressive abilities in pattern recognition exhibited by new-generation neural networks. The whole theory is developed rigorously, at the replica symmetric level of approximation, and corroborated by signal-to-noise analysis and Monte Carlo simulations.

preprint2018arXiv

Dreaming neural networks: rigorous results

Recently a daily routine for associative neural networks has been proposed: the network Hebbian-learns during the awake state (thus behaving as a standard Hopfield model), then, during its sleep state, optimizing information storage, it consolidates pure patterns and removes spurious ones: this forces the synaptic matrix to collapse to the projector one (ultimately approaching the Kanter-Sompolinksy model). This procedure keeps the learning Hebbian-based (a biological must) but, by taking advantage of a (properly stylized) sleep phase, still reaches the maximal critical capacity (for symmetric interactions). So far this emerging picture (as well as the bulk of papers on unlearning techniques) was supported solely by mathematically-challenging routes, e.g. mainly replica-trick analysis and numerical simulations: here we rely extensively on Guerra's interpolation techniques developed for neural networks and, in particular, we extend the generalized stochastic stability approach to the case. Confining our description within the replica symmetric approximation (where the previous ones lie), the picture painted regarding this generalization (and the previously existing variations on theme) is here entirely confirmed. Further, still relying on Guerra's schemes, we develop a systematic fluctuation analysis to check where ergodicity is broken (an analysis entirely absent in previous investigations). We find that, as long as the network is awake, ergodicity is bounded by the Amit-Gutfreund-Sompolinsky critical line (as it should), but, as the network sleeps, sleeping destroys spin glass states by extending both the retrieval as well as the ergodic region: after an entire sleeping session the solely surviving regions are retrieval and ergodic ones and this allows the network to achieve the perfect retrieval regime (the number of storable patterns equals the number of neurons in the network).

preprint2016arXiv

Exact partition functions for deformed $\mathcal{N}=2$ theories with $N_{f}=4$ flavours

We consider the $Ω$-deformed $\mathcal{N}=2$ $SU(2)$ gauge theory in four dimensions with $N_{f}=4$ massive fundamental hypermultiplets. The low energy effective action depends on the deformation parameters $\varepsilon_{1}, \varepsilon_{2}$, the scalar field expectation value $a$, and the hypermultiplet masses ${\bf m}=(m_{1}, m_{2}, m_{3}, m_{4})$. Motivated by recent findings in the $\mathcal{N}=2^{*}$ theory, we explore the theories that are characterized by special fixed ratios $\varepsilon_{2}/\varepsilon_{1}$ and ${\bf m}/\varepsilon_{1}$ and propose a simple condition on the structure of the multi-instanton contributions to the prepotential determining the effective action. This condition determines a finite set $Π_{N}$ of special points such that the prepotential has $N$ poles at fixed positions independent on the instanton number. In analogy with what happens in the $\mathcal{N}=2^{*}$ gauge theory, the full prepotential of the $Π_{N}$ theories may be given in closed form as an explicit function of $a$ and the modular parameter $q$ appearing in special combinations of Eisenstein series and Jacobi theta functions with well defined modular properties. The resulting finite pole partition functions are related by AGT correspondence to special 4-point spherical conformal blocks of the Virasoro algebra. We examine in full details special cases where the closed expression of the block is known and confirms our Ansatz. We systematically study the special features of Zamolodchikov's recursion for the $Π_{N}$ conformal blocks. As a result, we provide a novel effective recursion relation that can be exactly solved and allows to prove the conjectured closed expressions analytically in the case of the $Π_{1}$ and $Π_{2}$ conformal blocks.

preprint2016arXiv

On the cusp anomalous dimension in the ladder limit of $\mathcal N=4$ SYM

We analyze the cusp anomalous dimension in the (leading) ladder limit of $\mathcal N=4$ SYM and present new results for its higher-order perturbative expansion. We study two different limits with respect to the cusp angle $ϕ$. The first is the light-like regime where $x = e^{i\,ϕ}\to 0$. This limit is characterised by a non-trivial expansion of the cusp anomaly as a sum of powers of $\log x$, where the maximum exponent increases with the loop order. The coefficients of this expansion have remarkable transcendentality features and can be expressed by products of single zeta values. We show that the whole logarithmic expansion is fully captured by a solvable Woods-Saxon like one-dimensional potential. From the exact solution, we extract generating functions for the cusp anomaly as well as for the various specific transcendental structures appearing therein. The second limit that we discuss is the regime of small cusp angle. In this somewhat simpler case, we show how to organise the quantum mechanical perturbation theory in a novel efficient way by means of a suitable all-order Ansatz for the ground state of the associated Schrödinger problem. Our perturbative setup allows to systematically derive higher-order perturbative corrections in powers of the cusp angle as explicit non-perturbative functions of the effective coupling. This series approximation is compared with the numerical solution of the Schrödinger equation to show that we can achieve very good accuracy over the whole range of coupling and cusp angle. Our results have been obtained by relatively simple techniques. Nevertheless, they provide several non-trivial tests useful to check the application of Quantum Spectral Curve methods to the ladder approximation at non zero $ϕ$, in the two limits we studied.

preprint2015arXiv

Virasoro vacuum block at next-to-leading order in the heavy-light limit

We consider the semiclassical limit of the vacuum Virasoro block describing the diagonal 4-point correlation functions on the sphere. At large central charge c, after exponentiation, it depends on two fixed ratios h_H/c and h_L/c, where h_{H, L} are the conformal dimensions of the 4-point function operators. The semiclassical block may be expanded in powers of the light ratio h_L/c and the leading non-trivial (linear) order is known in closed form as a function of h_H/c. Recently, this contribution has been matched against AdS_3 gravity calculations where heavy operators build up a classical geometry corresponding to a BTZ black hole, while the light operators are described by a geodesic in this background. Here, we compute for the first time the next-to-leading quadratic correction O((h_L/c)^{2}), again in closed form for generic heavy operator ratio h_H/c. The result is a highly non-trivial extension of the leading order and may be relevant for further refined AdS_{3}/CFT_{2} tests. Applications to the two-interval Rényi entropy are also presented.

Alberto Fachechi

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

Pavlov Learning Machines

Pattern recognition in Deep Boltzmann machines

Generalized Guerra's interpolation schemes for dense associative neural networks

Interpolating between boolean and extremely high noisy patterns through Minimal Dense Associative Memories

Neural networks with redundant representation: detecting the undetectable

Dreaming neural networks: rigorous results

Exact partition functions for deformed $\mathcal{N}=2$ theories with $N_{f}=4$ flavours

On the cusp anomalous dimension in the ladder limit of $\mathcal N=4$ SYM

Virasoro vacuum block at next-to-leading order in the heavy-light limit