Researcher profile

François Bavaud

François Bavaud contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
7works
0followers
9topics
1close collaborators

Actions

Decide how to stay connected

Follow researcher0

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2016arXiv

Non-parametric latent modeling and network clustering

The paper exposes a non-parametric approach to latent and co-latent modeling of bivariate data, based upon alternating minimization of the Kullback-Leibler divergence (EM algorithm) for complete log-linear models. For categorical data, the iterative algorithm generates a soft clustering of both rows and columns of the contingency table. Well-known results are systematically revisited, and some variants are presumably original. In particular, the consideration of square contingency tables induces a clustering algorithm for weighted networks, differing from spectral clustering or modularity maximization techniques. Also, we present a co-clustering algorithm applicable to HMM models of general kind, distinct from the Baum-Welch algorithm. Three case studies illustrate the theory.

preprint2012arXiv

Interpolating between Random Walks and Shortest Paths: a Path Functional Approach

General models of network navigation must contain a deterministic or drift component, encouraging the agent to follow routes of least cost, as well as a random or diffusive component, enabling free wandering. This paper proposes a thermodynamic formalism involving two path functionals, namely an energy functional governing the drift and an entropy functional governing the diffusion. A freely adjustable parameter, the temperature, arbitrates between the conflicting objectives of minimising travel costs and maximising spatial exploration. The theory is illustrated on various graphs and various temperatures. The resulting optimal paths, together with presumably new associated edges and nodes centrality indices, are analytically and numerically investigated.

preprint2011arXiv

Robust Estimation through Schoenberg transformations

Schoenberg transformations, mapping Euclidean configurations into Euclidean configurations, define in turn a transformed inertia, whose minimization produces robust location estimates. The procedure only depends upon Euclidean distances between observations, and applies equivalently to univariate and multivariate data. The choice of the family of transformations and their parameters defines a flexible location strategy, generalizing M-estimators. Two regimes of solutions are identified. Theoretical results on their existence and stability are provided, and illustrated on two data sets.

preprint2010arXiv

Euclidean Distances, soft and spectral Clustering on Weighted Graphs

We define a class of Euclidean distances on weighted graphs, enabling to perform thermodynamic soft graph clustering. The class can be constructed form the "raw coordinates" encountered in spectral clustering, and can be extended by means of higher-dimensional embeddings (Schoenberg transformations). Geographical flow data, properly conditioned, illustrate the procedure as well as visualization aspects.

preprint2010arXiv

On the Schoenberg Transformations in Data Analysis: Theory and Illustrations

The class of Schoenberg transformations, embedding Euclidean distances into higher dimensional Euclidean spaces, is presented, and derived from theorems on positive definite and conditionally negative definite matrices. Original results on the arc lengths, angles and curvature of the transformations are proposed, and visualized on artificial data sets by classical multidimensional scaling. A simple distance-based discriminant algorithm illustrates the theory, intimately connected to the Gaussian kernels of Machine Learning.

preprint2010arXiv

Relative Entropy and Statistics

Formalising the confrontation of opinions (models) to observations (data) is the task of Inferential Statistics. Information Theory provides us with a basic functional, the relative entropy (or Kullback-Leibler divergence), an asymmetrical measure of dissimilarity between the empirical and the theoretical distributions. The formal properties of the relative entropy turn out to be able to capture every aspect of Inferential Statistics, as illustrated here, for simplicity, on dices (= i.i.d. process with finitely many outcomes): refutability (strict or probabilistic): the asymmetry data / models; small deviations: rejecting a single hypothesis; competition between hypotheses and model selection; maximum likelihood: model inference and its limits; maximum entropy: reconstructing partially observed data; EM-algorithm; flow data and gravity modelling; determining the order of a Markov chain.

preprint2010arXiv

Stereotype bias: a simple formal model

Minimizing the relative inertia of a statistical group with respect to the inertia of the overall sample defines an unique point, the in-focus, which constitutes a context-dependent measure of typical group tendency, biased in comparison to the group centroid. Maximizing the relative inertia yields an unique out-focal point, polarized in the reverse direction. This mechanism evokes the relative variability reduction of the outgroup reported in Social Psychology, and the stereotypic-like behavior of the in-focus, whose bias vanishes if the outgroup is constituted of a single individual. In this picture, the out-focus plays the role of an anti-stereotypical position, identical to the in-focus of the complementary group.