Source author record

João Xavier

João Xavier appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

math.OC; Information Theory; math.IT; physics.soc-ph; Social and Information Networks; Distributed, Parallel, and Cluster Computing; eess.SP; Machine Learning; math.ST; Populations and Evolution; Statistics Theory

Catalog footprint

What is connected

12 works

11 topics

4 close collaborators


preprint, 2022, arXiv

Decentralized EM to Learn Gaussian Mixtures from Datasets Distributed by Features

Expectation Maximization (EM) is the standard method to learn Gaussian mixtures. Yet its classic, centralized form is often infeasible, due to privacy concerns and computational and communication bottlenecks. Prior work dealt with data distributed by examples (horizontal partitioning), but we lack a counterpart for data scattered by features, an increasingly common scheme (e.g. user profiling with data from multiple entities). To fill this gap, we provide an EM-based algorithm to fit Gaussian mixtures to Vertically Partitioned data (VP-EM). In federated learning setups, our algorithm matches the centralized EM fitting of Gaussian mixtures constrained to a subspace. In arbitrary communication graphs, consensus averaging allows VP-EM to run on large peer-to-peer networks as an EM approximation. This mismatch comes from consensus error only, which vanishes exponentially fast with the number of consensus rounds. We demonstrate VP-EM on various topologies for both synthetic and real data, evaluating its approximation of centralized EM and seeing that it outperforms the available benchmark.
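The consensus step mentioned in the abstract can be illustrated in isolation. The following is a minimal sketch of plain consensus averaging, not of VP-EM itself; the 5-node ring, the initial values and the mixing weight of 1/3 are assumptions chosen for illustration.

```python
def consensus_round(values, neighbors, weight):
    """One synchronous round: each node moves toward its neighbors' values."""
    return [
        x + weight * sum(values[j] - x for j in neighbors[i])
        for i, x in enumerate(values)
    ]


def max_error(values, target):
    """Largest deviation of any node's value from the target."""
    return max(abs(x - target) for x in values)


# Hypothetical 5-node ring: node i talks to nodes i-1 and i+1 (mod 5).
n = 5
neighbors = {i: [(i - 1) % n, (i + 1) % n] for i in range(n)}
values = [1.0, 4.0, 2.0, 8.0, 5.0]
average = sum(values) / n  # the quantity every node should agree on

errors = []
for _ in range(30):
    errors.append(max_error(values, average))
    values = consensus_round(values, neighbors, weight=1 / 3)
errors.append(max_error(values, average))
```

Each round replaces a node's value with a convex combination of its own and its neighbors' values, so the network average is preserved and the worst-case deviation from it shrinks geometrically with the number of rounds.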

preprint, 2022, arXiv

Distributed Banach-Picard Iteration: Application to Distributed EM and Distributed PCA

In recent work, we proposed a distributed Banach-Picard iteration (DBPI) that allows a set of agents, linked by a communication network, to find a fixed point of a locally contractive (LC) map that is the average of individual maps held by said agents. In this work, we build upon the DBPI and its local linear convergence (LLC) guarantees to make several contributions. We show that Sanger's algorithm for principal component analysis (PCA) corresponds to the iteration of an LC map that can be written as the average of local maps, each map known to each agent holding a subset of the data. Similarly, we show that a variant of the expectation-maximization (EM) algorithm for parameter estimation from noisy and faulty measurements in a sensor network can be written as the iteration of an LC map that is the average of local maps, each available at just one node. Consequently, via the DBPI, we derive two distributed algorithms - distributed EM and distributed PCA - whose LLC guarantees follow from those that we proved for the DBPI. The verification of the LC condition for EM is challenging, as the underlying operator depends on random samples, thus the LC condition is of probabilistic nature.
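As a point of reference for the fixed-point setting described above, here is a minimal sketch of the classical (centralized) Banach-Picard iteration applied to the average of local maps. It does not implement the DBPI; the three scalar affine maps are hypothetical stand-ins for the agents' local maps, chosen so that the averaged map is a contraction with a known fixed point.

```python
def average_map(local_maps, x):
    """Evaluate the average of the agents' local maps at x."""
    return sum(f(x) for f in local_maps) / len(local_maps)


def banach_picard(local_maps, x0, iterations):
    """Iterate x <- F(x), where F is the average of the local maps."""
    x = x0
    for _ in range(iterations):
        x = average_map(local_maps, x)
    return x


# Hypothetical local maps f_i(x) = a_i * x + b_i, one per agent.
coefficients = [(0.5, 1.0), (0.2, 2.0), (-0.1, 3.0)]
local_maps = [lambda x, a=a, b=b: a * x + b for a, b in coefficients]

# The average map is F(x) = 0.2 * x + 2, a contraction with fixed point 2.5.
fixed_point = banach_picard(local_maps, x0=0.0, iterations=60)
```

A centralized loop like this needs every local map at each step; the distributed setting in the abstract is the case where each map is available at only one agent.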

preprint, 2022, arXiv

Robust Localization with Bounded Noise: Creating a Superset of the Possible Target Positions via Linear-Fractional Representations

Locating a target is key in many applications, namely in high-stakes real-world scenarios, like detecting humans or obstacles in vehicular networks. In scenarios where precise statistics of the measurement noise are unavailable, applications require localization methods that assume minimal knowledge on the noise distribution. We present a scalable algorithm delimiting a tight superset of all possible target locations, assuming range measurements to known landmarks, contaminated with bounded noise and unknown distributions. This superset is of primary interest in robust statistics since it is a tight majorizer of the set of Maximum-Likelihood (ML) estimates parametrized by noise densities respecting two main assumptions: (1) the noise distribution is supported on an ellipsoidal uncertainty region and (2) the measurements are non-negative with probability one. We create the superset through convex relaxations that use Linear Fractional Representations (LFRs), a well-known technique in robust control. For low noise regimes, the supersets created by our method double the accuracy of a standard semidefinite relaxation. For moderate to high noise regimes, our method still improves the benchmark but the benefit tends to be less significant, as both supersets tend to have the same size (area).

preprint, 2016, arXiv

Robustness Properties in Fictitious-Play-Type Algorithms

Fictitious play (FP) is a canonical game-theoretic learning algorithm which has been deployed extensively in decentralized control scenarios. However, standard treatments of FP, and of many other game-theoretic models, assume rather idealistic conditions which rarely hold in realistic control scenarios. This paper considers a broad class of best response learning algorithms, that we refer to as FP-type algorithms. In such an algorithm, given some (possibly limited) information about the history of actions, each individual forecasts the future play and chooses a (myopic) best action given their forecast. We provide a unified analysis of the behavior of FP-type algorithms under an important class of perturbations, thus demonstrating robustness to deviations from the idealistic operating conditions that have been previously assumed. This robustness result is then used to derive convergence results for two control-relevant relaxations of standard game-theoretic applications: distributed (network-based) implementation without full observability and asynchronous deployment (including in continuous time). In each case the results follow as a direct consequence of the main robustness result.
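For readers unfamiliar with the baseline, the sketch below runs classical two-player fictitious play, not the perturbed FP-type variants analyzed in the paper. The game (matching pennies), the initial actions, the tie-breaking rule and the number of rounds are assumptions made for illustration.

```python
def best_response(payoff, opponent_counts):
    """Myopic best action against the opponent's empirical action counts.

    payoff[a][b] is the payoff of own action a against opponent action b.
    Ties are broken in favor of the lowest action index.
    """
    scores = [
        sum(row[b] * count for b, count in enumerate(opponent_counts))
        for row in payoff
    ]
    return scores.index(max(scores))


# Matching pennies: player 0 wins on a match, player 1 wins on a mismatch.
payoff_0 = [[1, -1], [-1, 1]]
payoff_1 = [[-1, 1], [1, -1]]

# counts[p][a] = number of times player p has played action a so far.
counts = [[1, 0], [1, 0]]  # assumed initial play: both choose action 0

for _ in range(5000):
    # Each player forecasts the opponent from the history of play so far.
    action_0 = best_response(payoff_0, counts[1])
    action_1 = best_response(payoff_1, counts[0])
    counts[0][action_0] += 1
    counts[1][action_1] += 1

# Empirical frequency of each action, per player.
frequencies = [[c / sum(player) for c in player] for player in counts]
```

In this zero-sum game the empirical frequencies approach the mixed equilibrium (1/2, 1/2), even though the actions themselves keep cycling.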

preprint, 2015, arXiv

Distributed inference over directed networks: Performance limits and optimal design

We find large deviations rates for consensus-based distributed inference for directed networks. When the topology is deterministic, we establish the large deviations principle and find exactly the corresponding rate function, equal at all nodes. We show that the dependence of the rate function on the stochastic weight matrix associated with the network is fully captured by its left eigenvector corresponding to the unit eigenvalue. Further, when the sensors' observations are Gaussian, the rate function admits a closed form expression. Motivated by these observations, we formulate the optimal network design problem of finding the left eigenvector which achieves the highest value of the rate function, for a given target accuracy. This eigenvector therefore minimizes the time that the inference algorithm needs to reach the desired accuracy. For Gaussian observations, we show that the network design problem can be formulated as a semidefinite (convex) program, and hence can be solved efficiently. When observations are identically distributed across agents, the system exhibits an interesting property: the graph of the rate function always lies between the graphs of the rate function of an isolated node and the rate function of a fusion center that has access to all observations. We prove that this fundamental property holds even when the topology and the associated system matrices change randomly over time, with arbitrary distribution. Due to the generality of its assumptions, the latter result requires more subtle techniques than the standard large deviations tools, contributing to the general theory of large deviations.
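The role of the left eigenvector can be seen in a small numerical sketch. The 3-node directed network and its row-stochastic weight matrix W below are assumptions for illustration, and power iteration is used only as a convenient way to compute the eigenvector; the paper's rate-function analysis and design problem are not reproduced.

```python
def left_eigenvector(W, iterations=200):
    """Power iteration pi <- pi W for a primitive row-stochastic matrix W."""
    n = len(W)
    pi = [1.0 / n] * n
    for _ in range(iterations):
        pi = [sum(pi[i] * W[i][j] for i in range(n)) for j in range(n)]
    return pi


def consensus(W, x, iterations=200):
    """Repeated local averaging x <- W x."""
    n = len(W)
    for _ in range(iterations):
        x = [sum(W[i][j] * x[j] for j in range(n)) for i in range(n)]
    return x


# Hypothetical 3-node directed network with self-loops; each row sums to 1.
W = [
    [0.50, 0.50, 0.00],
    [0.00, 0.50, 0.50],
    [0.25, 0.00, 0.75],
]

pi = left_eigenvector(W)   # approximately [0.25, 0.25, 0.5]
x0 = [4.0, 8.0, 2.0]
limit = consensus(W, x0)   # every entry approaches sum(pi[i] * x0[i])
```

Because this W is not doubly stochastic, the nodes agree on a weighted average rather than the plain average, with the weights given by the left eigenvector for the unit eigenvalue.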

preprint, 2015, arXiv

Massive MIMO Full-Duplex Relaying with Optimal Power Allocation for Independent Multipairs

With the help of an in-band full-duplex relay station, it is possible to simultaneously transmit and receive signals from multiple users. The performance of such a system can be greatly increased when the relay station is equipped with a large number of antennas on both transmitter and receiver sides. In this paper, we exploit the use of massive arrays to effectively suppress the loopback interference (LI) of a decode-and-forward (DF) relay and evaluate the performance of the end-to-end (e2e) transmission. This paper assumes imperfect channel state information is available at the relay and designs a minimum mean-square error (MMSE) filter to mitigate the interference. Subsequently, we adopt zero-forcing (ZF) filters for both detection and beamforming. The performance of such a system is evaluated in terms of bit error rate (BER) at both relay and destinations, and an optimal choice for the transmission power at the relay is shown. We then propose a complexity-efficient optimal power allocation (OPA) algorithm that, using the channel statistics, computes the minimum power that satisfies the rate constraints of each pair. The results obtained via simulation show that when both MMSE filtering and the OPA method are used, better values for the energy efficiency are attained.

preprint, 2015, arXiv

Simple and fast convex relaxation method for cooperative localization in sensor networks using range measurements

We address the sensor network localization problem given noisy range measurements between pairs of nodes. We approach the non-convex maximum-likelihood formulation via a known simple convex relaxation. We exploit its favorable optimization properties to the full to obtain an approach that is completely distributed, has a simple implementation at each node, and capitalizes on an optimal gradient method to attain fast convergence. We offer a parallel but also an asynchronous flavor, both with theoretical convergence guarantees and iteration complexity analysis. Experimental results establish leading performance. Our algorithms top the accuracy of a comparable state-of-the-art method by one order of magnitude, using one order of magnitude fewer communications.

preprint, 2013, arXiv

Emergent Behavior in Multipartite Large Networks: Multi-virus Epidemics

The study of epidemics in large complete networks is well established. In contrast, we consider epidemics in non-complete networks. We establish the fluid limit macroscopic dynamics of a multi-virus spread over a multipartite network as the number of nodes at each partite or island grows large. The virus spread follows a peer-to-peer random rule of infection in line with the Harris contact process. The model conforms to an SIS (susceptible-infected-susceptible) type, where a node is either infected or it is healthy and prone to be infected. The local (at node level) random infection model induces the emergence of structured dynamics at the macroscale. Namely, we prove that, as the multipartite network grows large, the normalized Markov jump vector process $\left(\bar{\mathbf{Y}}^\mathbf{N}(t)\right) = \left(\bar{Y}_1^\mathbf{N}(t),\ldots, \bar{Y}_M^\mathbf{N}(t)\right)$ collecting the fraction of infected nodes at each island $i=1,\ldots,M$, converges weakly (with respect to the Skorokhod topology on the space of càdlàg sample paths) to the solution of an $M$-dimensional vector nonlinear coupled ordinary differential equation. In the case of multi-virus diffusion with $K\in\mathbb{N}$ distinct strains of virus, the Markov jump matrix process $\left(\bar{\mathbf{Y}}^\mathbf{N}(t)\right)$, stacking the fraction of nodes infected with virus type $j$, $j=1,\ldots,K$, at each island $i=1,\ldots,M$, converges weakly as well to the solution of a $\left(K\times M\right)$-dimensional vector differential equation that is also characterized.

preprint, 2013, arXiv

Epidemics in Multipartite Networks: Emergent Dynamics

Single virus epidemics over complete networks are widely explored in the literature as the fraction of infected nodes is, under appropriate microscopic modeling of the virus infection, a Markov process. With non-complete networks, this macroscopic variable is no longer Markov. In this paper, we study virus diffusion, in particular, multi-virus epidemics, over non-complete stochastic networks. We focus on multipartite networks. In companion work http://arxiv.org/abs/1306.6198, we show that the peer-to-peer local random rules of virus infection lead, in the limit of large multipartite networks, to the emergence of structured dynamics at the macroscale. The exact fluid limit evolution of the fraction of nodes infected by each virus strain across islands obeys a set of nonlinear coupled differential equations, see http://arxiv.org/abs/1306.6198. In this paper, we develop methods to analyze the qualitative behavior of these limiting dynamics, establishing conditions on the virus micro characteristics and network structure under which a virus persists or a natural selection phenomenon is observed.

preprint, 2013, arXiv

Filter Design with Secrecy Constraints: The MIMO Gaussian Wiretap Channel

This paper considers the problem of filter design with secrecy constraints, where two legitimate parties (Alice and Bob) communicate in the presence of an eavesdropper (Eve), over a Gaussian multiple-input-multiple-output (MIMO) wiretap channel. This problem involves designing, subject to a power constraint, the transmit and the receive filters which minimize the mean-squared error (MSE) between the legitimate parties whilst assuring that the eavesdropper MSE remains above a certain threshold. We consider a general MIMO Gaussian wiretap scenario, where the legitimate receiver uses a linear Zero-Forcing (ZF) filter and the eavesdropper receiver uses either a ZF or an optimal linear Wiener filter. We provide a characterization of the optimal filter designs by demonstrating the convexity of the optimization problems. We also provide generalizations of the filter designs from the scenario where the channel state is known to all the parties to the scenario where there is uncertainty in the channel state. A set of numerical results illustrates the performance of the novel filter designs, including the robustness to channel modeling errors. In particular, we assess the efficacy of the designs in guaranteeing not only a certain MSE level at the eavesdropper, but also in limiting the error probability at the eavesdropper. We also assess the impact of the filter designs on the achievable secrecy rates. The penalty induced by the fact that the eavesdropper may use the optimal non-linear receive filter rather than the optimal linear one is also explored in the paper.

preprint, 2011, arXiv

Approximate Maximum Likelihood Source Localization from Range Measurements Through Convex Relaxation

This work considers the problem of locating a single source from noisy range measurements to a set of nodes in a wireless sensor network. We propose two new techniques that we designate as Source Localization with Nuclear Norm (SLNN) and Source Localization with l1-norm (SL-l1), which extend to arbitrary real dimensions, including 3D, our prior work on 2D source localization formulated in the complex plane. Broadly, our approach is based on formulating a Maximum-Likelihood (ML) estimation problem for the source position, and then using convex relaxation techniques to obtain a semidefinite program (SDP) that can be globally and efficiently solved. SLNN directly approximates the Gaussian ML solution, and the relaxation is shown to be tighter than in other methods in the same class. We present an analysis of the convexity properties of the constraint set for the 2D complex version of SLNN (SLCP) to justify the observed tightness of the relaxation. In terms of global accuracy of localization, SLNN outperforms state-of-the-art optimization-based methods with either iterative or closed-form formulations. We propose the SL-l1 algorithm to address the Laplacian noise case, which models the presence of outliers in range measurements. We overcome the nondifferentiability of the Laplacian likelihood function by rewriting the ML problem as an exact weighted version of the Gaussian case, and compare two solution strategies. One of them is iterative, based on block coordinate descent, and uses SLNN as a subprocessing block. The other, attaining only slightly worse performance, is noniterative and based on an SDP relaxation of the weighted ML problem.
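To make the estimation problem concrete, the sketch below evaluates the two range-based cost functions that the abstract refers to: the sum of squared range residuals (the Gaussian ML cost for i.i.d. noise, up to scaling) and the sum of absolute residuals (the Laplacian case). It only evaluates the nonconvex costs; the SLNN and SL-l1 relaxations are not implemented, and the anchor positions and source location are made-up values.

```python
import math


def range_residuals(position, anchors, ranges):
    """Residuals ||position - a_i|| - r_i, one per anchor a_i."""
    return [math.dist(position, a) - r for a, r in zip(anchors, ranges)]


def gaussian_ml_cost(position, anchors, ranges):
    """Sum of squared range residuals (Gaussian noise model)."""
    return sum(e * e for e in range_residuals(position, anchors, ranges))


def laplacian_ml_cost(position, anchors, ranges):
    """Sum of absolute range residuals (Laplacian noise model)."""
    return sum(abs(e) for e in range_residuals(position, anchors, ranges))


# Hypothetical 2D setup: three anchors and a source, noiseless ranges.
anchors = [(0.0, 0.0), (6.0, 0.0), (0.0, 8.0)]
source = (3.0, 4.0)
ranges = [math.dist(source, a) for a in anchors]

cost_at_source = gaussian_ml_cost(source, anchors, ranges)
cost_elsewhere = gaussian_ml_cost((0.0, 0.0), anchors, ranges)
l1_cost_elsewhere = laplacian_ml_cost((0.0, 0.0), anchors, ranges)
```

With noiseless ranges both costs vanish at the true source and are positive at the other candidate position evaluated here; with noisy ranges the minimizer has to be searched for, which is where the convex relaxations come in.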

preprint, 2010, arXiv

Robust Simultaneous Localization of Nodes and Targets in Sensor Networks Using Range-Only Measurements

Simultaneous localization and tracking (SLAT) in sensor networks aims to determine the positions of sensor nodes and a moving target in a network, given incomplete and inaccurate range measurements between the target and each of the sensors. One of the established methods for achieving this is to iteratively maximize a likelihood function (ML), which requires initialization with an approximate solution to avoid convergence towards local extrema. This paper develops methods for handling both Gaussian and Laplacian noise, the latter modeling the presence of outliers in some practical ranging systems that adversely affect the performance of localization algorithms designed for Gaussian noise. A modified Euclidean Distance Matrix (EDM) completion problem is solved for a block of target range measurements to approximately set up initial sensor/target positions, and the likelihood function is then iteratively refined through Majorization-Minimization (MM). To avoid the computational burden of repeatedly solving increasingly large EDM problems in time-recursive operation, an incremental scheme is exploited whereby a new target/node position is estimated from previously available node/target locations to set up the iterative ML initial point for the full spatial configuration. The above methods are first derived under Gaussian noise assumptions, and modifications for Laplacian noise are then considered. Analytically, the main challenges to be overcome in the Laplacian case stem from the non-differentiability of $\ell_1$ norms that arise in the various cost functions. Simulation results confirm that the proposed algorithms significantly outperform existing methods for SLAT in the presence of outliers, while offering comparable performance for Gaussian noise.
