Source author record

Alejandro Lage-Castellanos

Alejandro Lage-Castellanos appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


cond-mat.dis-nn, cond-mat.stat-mech, Populations and Evolution, physics.soc-ph, Applications, math.DS, physics.data-an, Quantitative Methods, Social and Information Networks

Catalog footprint

What is connected

10 works

9 topics

4 close collaborators


preprint, 2021, arXiv

Ancestral Sequence Reconstruction for Co-evolutionary models

The ancestral sequence reconstruction problem is the inference, back in time, of the properties of common sequence ancestors from measured properties of contemporary populations. Standard algorithms for this problem assume independent (factorized) evolution of the characters of the sequences, an assumption that generally fails (e.g. for proteins and genome sequences). In this work, we studied this problem for sequences described by global co-evolutionary models, which reproduce the global pattern of cooperative interactions between the elements that compose them. For this, we first modeled the temporal evolution of correlated real-valued characters by a multivariate Ornstein-Uhlenbeck process on a finite tree. This represents sequences as Gaussian vectors evolving in a quadratic potential, which describes the selection forces acting on the evolving entities. Under a Bayesian framework, we developed a reconstruction algorithm for these sequences and obtained an analytical expression to quantify the quality of our estimation. We extended this formalism to discrete-valued sequences by applying our method to a Potts model. We showed that for both continuous and discrete configurations, there is a wide range of parameters where, to properly reconstruct the ancestral sequences, intra-species correlations must be taken into account. We also demonstrated that, for sequences with discrete elements, our reconstruction algorithm outperforms traditional schemes based on independent-site approximations.
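As a rough illustration of Bayesian reconstruction under an Ornstein-Uhlenbeck process, the sketch below treats the simplest case one could imagine: a single real-valued character on a star tree. Everything in it is an assumption made for illustration and not taken from the paper: the scalar dynamics dx = -k x dt + sqrt(2 D) dW, the stationary prior N(0, D/k) on the root, equal branch lengths t, and the function name `ou_root_posterior`. The multivariate, general-tree algorithm of the abstract is not reproduced here.

```python
import math

def ou_root_posterior(leaves, k, D, t):
    """Posterior mean and variance of the root of a star tree.

    Illustrative assumptions (not from the paper): scalar OU process
    dx = -k x dt + sqrt(2 D) dW, root drawn from the stationary law
    N(0, D/k), each leaf evolved independently for the same time t.
    """
    s2 = D / k                    # stationary variance
    a = math.exp(-k * t)          # decay factor of the mean along a branch
    n = len(leaves)
    # Each leaf is N(a * root, s2 * (1 - a^2)); combine with the Gaussian prior.
    denom = (1.0 - a * a) + n * a * a
    mean = a * sum(leaves) / denom
    var = s2 * (1.0 - a * a) / denom
    return mean, var

# One leaf observed at 2.0 after a branch of length ln 2 (so a = 0.5).
mean, var = ou_root_posterior([2.0], k=1.0, D=1.0, t=math.log(2))
```

With a single leaf the posterior mean is simply the leaf value shrunk by the factor a, and adding leaves reduces the posterior variance, which is the kind of quality measure the abstract refers to in far greater generality.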

preprint, 2021, arXiv

Dynamics of epidemic models from cavity master equations

We apply the cavity master equation (CME) approach to epidemic models. We mostly explore the susceptible-infectious-susceptible (SIS) model, which can be readily treated with the CME as a two-state model. We show that this approach is more accurate than individual-based and pair-based mean-field methods, and than a previously published dynamic message passing scheme. We explore average-case predictions and extend the cavity master equation to the SIR and SIRS models.
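For orientation, here is a minimal sketch of the individual-based mean-field baseline for SIS that the abstract compares against; it is not the cavity master equation itself. It assumes the usual closure dp_i/dt = -mu p_i + beta (1 - p_i) sum_j p_j over the neighbors j of node i, integrated with a plain Euler step. The function name `sis_ibmf`, the parameter names and the step size are hypothetical choices.

```python
def sis_ibmf(adj, beta, mu, p0, dt=0.01, steps=5000):
    """Euler integration of individual-based mean field for SIS.

    adj  : adjacency lists, adj[i] holds the neighbors of node i
    beta : infection rate per edge (assumed name)
    mu   : recovery rate (assumed name)
    p0   : initial infection probabilities, one per node
    """
    p = list(p0)
    for _ in range(steps):
        new_p = []
        for i, nbrs in enumerate(adj):
            pressure = sum(p[j] for j in nbrs)
            dp = -mu * p[i] + beta * (1.0 - p[i]) * pressure
            # Keep each probability inside [0, 1] after the Euler step.
            new_p.append(min(1.0, max(0.0, p[i] + dt * dp)))
        p = new_p
    return p

# Complete graph on 5 nodes, homogeneous initial condition.
n = 5
adj = [[j for j in range(n) if j != i] for i in range(n)]
p_final = sis_ibmf(adj, beta=0.5, mu=1.0, p0=[0.1] * n)
```

On a complete graph with a homogeneous start, this closure has the endemic fixed point p = 1 - mu / (beta (n - 1)), which gives a quick sanity check on the integration.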

preprint, 2020, arXiv

Estimating undocumented Covid-19 infections in Cuba by means of a hybrid mechanistic-statistical approach

We adapt the hybrid mechanistic-statistical approach of Ref. [1] to estimate the total number of undocumented Covid-19 infections in Cuba. This scheme is based on maximum likelihood estimation of the parameters of an SIR-like model for the infected population, assuming that the detection process matches a Bernoulli trial. Our estimates show that (a) 60% of the infections were undocumented, (b) the real epidemic behind the data peaked ten days before the reports suggested, and (c) the reproduction number swiftly vanishes after 80 epidemic days.

preprint, 2015, arXiv

Random Field Ising Model in two dimensions: Bethe approximation, Cluster Variational Method and message passing algorithms

We study two free energy approximations (Bethe and plaquette-CVM) for the Random Field Ising Model in two dimensions. We compare results obtained by these two methods in single instances of the model on the square grid, showing the difficulties arising in defining a robust critical line. We also attempt average-case calculations using a replica-symmetric ansatz, and compare the results with single instances. Both the Bethe and plaquette-CVM approximations present a similar picture of the phase space, predicting long-range order at low temperatures and fields. We show that plaquette-CVM is more precise, in the sense that it predicts a lower critical line (the truth being no line at all). Furthermore, we give some insight into the non-trivial structure of the fixed points of different message passing algorithms.
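The Bethe approximation mentioned above corresponds to the fixed points of belief propagation. The sketch below writes the standard cavity-field update for an Ising model with site-dependent fields, h_{i->j} = h_i + sum over k in the neighborhood of i except j of u(h_{k->i}), with u(h) = atanh(tanh(beta J) tanh(beta h)) / beta, and checks it against brute-force enumeration on a small tree, where the approximation is exact. The function names, the uniform coupling J and the test instance are assumptions for illustration; on the square grid studied in the abstract the same updates are only approximate.

```python
import math
from itertools import product

def bp_magnetizations(edges, fields, J, beta, sweeps=50):
    """Bethe/BP magnetizations for an Ising model with local fields."""
    n = len(fields)
    nbrs = [[] for _ in range(n)]
    for i, j in edges:
        nbrs[i].append(j)
        nbrs[j].append(i)
    # cav[(i, j)] is the cavity field acting on spin i when j is removed.
    cav = {(i, j): 0.0 for i in range(n) for j in nbrs[i]}

    def u(h):
        # Effective field passed across one coupling J.
        return math.atanh(math.tanh(beta * J) * math.tanh(beta * h)) / beta

    for _ in range(sweeps):
        # Parallel update: the right-hand side reads the previous sweep.
        cav = {(i, j): fields[i] + sum(u(cav[(k, i)]) for k in nbrs[i] if k != j)
               for (i, j) in cav}
    return [math.tanh(beta * (fields[i] + sum(u(cav[(k, i)]) for k in nbrs[i])))
            for i in range(n)]

def exact_magnetizations(edges, fields, J, beta):
    """Brute-force magnetizations by summing over all 2^n configurations."""
    n = len(fields)
    Z = 0.0
    m = [0.0] * n
    for s in product((-1, 1), repeat=n):
        energy = -J * sum(s[i] * s[j] for i, j in edges)
        energy -= sum(h * si for h, si in zip(fields, s))
        w = math.exp(-beta * energy)
        Z += w
        for i in range(n):
            m[i] += w * s[i]
    return [x / Z for x in m]

# Small tree (a star with one extra leaf) with arbitrary fields.
edges = [(0, 1), (1, 2), (1, 3)]
fields = [0.3, -0.5, 0.8, -0.1]
m_bp = bp_magnetizations(edges, fields, J=1.0, beta=0.7)
m_exact = exact_magnetizations(edges, fields, J=1.0, beta=0.7)
```

On a loopy lattice the two results would differ, and the iteration may reach several fixed points or fail to converge, which is the regime the abstract explores.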

preprint, 2014, arXiv

A cavity approach to optimization and inverse dynamical problems

In these two lectures we shall discuss how the cavity approach can be used efficiently to study optimization problems with global (topological) constraints and how the same techniques can be generalized to study inverse problems in irreversible dynamical processes. These two classes of problems are formally very similar: they both require an efficient procedure to trace over all trajectories of either auxiliary variables which enforce global constraints, or directly dynamical variables defining the inverse dynamical problems. We will mention three basic examples, namely the Minimum Steiner Tree problem, the inverse threshold linear dynamical problem, and the patient-zero problem in epidemic cascades. All these examples are root problems in optimization and inference over networks. They appear in many modern applications and in a variety of different contexts. Credit for these results should be shared with A. Braunstein, A. Ramezanpour, F. Altarelli, L. Dall'Asta, I. Biazzo and A. Lage-Castellanos.

preprint, 2014, arXiv

Bayesian inference of epidemics on networks via Belief Propagation

We study several Bayesian inference problems for irreversible stochastic epidemic models on networks from a statistical physics viewpoint. We derive equations that allow accurate computation of the posterior distribution of the time evolution of the state of each node given some observations. In contrast to most existing methods, we allow very general observation models, including unobserved nodes, state observations made at different or unknown times, and observations of infection times, possibly mixed together. Our method, which is based on the Belief Propagation algorithm, is efficient, naturally distributed, and exact on trees. As a particular case, we consider the problem of finding the "patient zero" of an SIR or SI epidemic given a snapshot of the state of the network at a later unknown time. Numerical simulations show that our method outperforms previous ones on both synthetic and real networks, often by a very large margin.

preprint, 2012, arXiv

Replica Cluster Variational Method: the Replica Symmetric solution for the 2D random bond Ising model

We present and solve the Replica Symmetric equations in the context of the Replica Cluster Variational Method for the 2D random bond Ising model (including the 2D Edwards-Anderson spin glass model). First we solve a linearized version of these equations to obtain the phase diagrams of the model on the square and triangular lattices. In both cases the estimates of the spin-glass transition temperatures and of the tricritical point improve substantially over the Bethe predictions. Moreover, we show that this phase diagram is consistent with the behavior of inference algorithms on single instances of the problem. Finally, we present a method to consistently find approximate solutions to the equations in the glassy phase. The method is applied to the triangular lattice down to T=0, also in the presence of an external field.

preprint, 2012, arXiv

Stability of the replica symmetric solution in diluted perceptron learning

We use the replica method to study the role played by dilution in the average behavior of a perceptron model with continuous couplings. We analyze the stability of the replica symmetric solution as a function of the dilution field for the generalization and memorization problems. Using a Gardner-like stability analysis, we show that at any fixed ratio $α$ between the number of patterns M and the dimension N of the perceptron ($α=M/N$), there exists a critical dilution field $h_c$ above which the replica symmetric ansatz becomes unstable.

preprint, 2011, arXiv

A very fast inference algorithm for finite-dimensional spin glasses: Belief Propagation on the dual lattice

Starting from a Cluster Variational Method, and inspired by the correctness of the paramagnetic Ansatz (at high temperatures in general, and at any temperature in the 2D Edwards-Anderson model), we propose a novel message passing algorithm, the Dual algorithm, to estimate the marginal probabilities of spin glasses on finite-dimensional lattices. We show that in a wide range of temperatures our algorithm compares very well with Monte Carlo simulations, with the Double Loop algorithm and with exact calculation of the ground state of 2D systems with bimodal and Gaussian interactions. Moreover, it is usually 100 times faster than other provably convergent methods, such as the Double Loop algorithm.

preprint, 2009, arXiv

Statistical mechanics of sparse generalization and model selection

One of the crucial tasks in many inference problems is the extraction of sparse information out of a given number of high-dimensional measurements. In machine learning, this is frequently achieved using, as a penalty term, the $L_p$ norm of the model parameters, with $p\leq 1$ for efficient dilution. Here we propose a statistical-mechanics analysis of the problem in the setting of perceptron memorization and generalization. Using a replica approach, we are able to evaluate the relative performance of naive dilution (obtained by learning without dilution, followed by applying a threshold to the model parameters), $L_1$ dilution (which is frequently used in convex optimization) and $L_0$ dilution (which is optimal but computationally hard to implement). Whereas both $L_p$ diluted approaches clearly outperform the naive approach, we find a small region where $L_0$ works almost perfectly and strongly outperforms the simpler to implement $L_1$ dilution.
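To make the contrast between thresholding and an $L_1$ penalty concrete, the sketch below applies two elementwise maps to an already learned weight vector: hard thresholding, which is one reading of the naive dilution step described above, and soft thresholding, which is the proximal map of lam * ||w||_1 commonly used with $L_1$ penalties in convex optimization. The function names, the threshold values and the example weights are hypothetical, and the replica analysis of the abstract is not reproduced.

```python
import math

def hard_threshold(w, t):
    """Naive dilution step: zero every weight whose magnitude is at most t."""
    return [x if abs(x) > t else 0.0 for x in w]

def soft_threshold(w, lam):
    """Proximal map of lam * ||w||_1: shrink each weight by lam, clip at zero."""
    return [math.copysign(max(abs(x) - lam, 0.0), x) for x in w]

def sparsity(w):
    """Fraction of exactly zero weights (one minus the L_0 norm over the length)."""
    return sum(1 for x in w if x == 0.0) / len(w)

weights = [1.5, -0.2, 0.4, -2.0]          # made-up example weights
w_hard = hard_threshold(weights, 0.5)     # surviving weights keep their values
w_soft = soft_threshold(weights, 0.5)     # surviving weights are also shrunk
```

Both maps zero the same small weights here, but soft thresholding also biases the surviving weights toward zero, which is one intuitive reason $L_1$ and $L_0$ dilution can behave differently.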