Source author record

Cláudia Soares

Cláudia Soares appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.OC Machine Learning Computer Vision eess.SP math.ST Statistics Theory

Catalog footprint

What is connected

7works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

A Cluster-Based Trip Prediction Graph Neural Network Model for Bike Sharing Systems

Bike Sharing Systems (BSSs) are emerging as an innovative transportation service. Ensuring the proper functioning of a BSS is crucial given that these systems are committed to eradicating many of the current global concerns, by promoting environmental and economic sustainability and contributing to improving the life quality of the population. Good knowledge of users' transition patterns is a decisive contribution to the quality and operability of the service. The analogous and unbalanced users' transition patterns cause these systems to suffer from bicycle imbalance, leading to a drastic customer loss in the long term. Strategies for bicycle rebalancing become important to tackle this problem and for this, bicycle traffic prediction is essential, as it allows to operate more efficiently and to react in advance. In this work, we propose a bicycle trips predictor based on Graph Neural Network embeddings, taking into consideration station groupings, meteorology conditions, geographical distances, and trip patterns. We evaluated our approach in the New York City BSS (CitiBike) data and compared it with four baselines, including the non-clustered approach. To address our problem's specificities, we developed the Adaptive Transition Constraint Clustering Plus (AdaTC+) algorithm, eliminating shortcomings of previous work. Our experiments evidence the clustering pertinence (88% accuracy compared with 83% without clustering) and which clustering technique best suits this problem. Accuracy on the Link Prediction task is always higher for AdaTC+ than benchmark clustering methods when the stations are the same, while not degrading performance when the network is upgraded, in a mismatch with the trained model.

preprint2022arXiv

Decentralized EM to Learn Gaussian Mixtures from Datasets Distributed by Features

Expectation Maximization (EM) is the standard method to learn Gaussian mixtures. Yet its classic, centralized form is often infeasible, due to privacy concerns and computational and communication bottlenecks. Prior work dealt with data distributed by examples, horizontal partitioning, but we lack a counterpart for data scattered by features, an increasingly common scheme (e.g. user profiling with data from multiple entities). To fill this gap, we provide an EM-based algorithm to fit Gaussian mixtures to Vertically Partitioned data (VP-EM). In federated learning setups, our algorithm matches the centralized EM fitting of Gaussian mixtures constrained to a subspace. In arbitrary communication graphs, consensus averaging allows VP-EM to run on large peer-to-peer networks as an EM approximation. This mismatch comes from consensus error only, which vanishes exponentially fast with the number of consensus rounds. We demonstrate VP-EM on various topologies for both synthetic and real data, evaluating its approximation of centralized EM and seeing that it outperforms the available benchmark.

preprint2022arXiv

From noisy point clouds to complete ear shapes: unsupervised pipeline

Ears are a particularly difficult region of the human face to model, not only due to the non-rigid deformations existing between shapes but also to the challenges in processing the retrieved data. The first step towards obtaining a good model is to have complete scans in correspondence, but these usually present a higher amount of occlusions, noise and outliers when compared to most face regions, thus requiring a specific procedure. Therefore, we propose a complete pipeline taking as input unordered 3D point clouds with the aforementioned problems, and producing as output a dataset in correspondence, with completion of the missing data. We provide a comparison of several state-of-the-art registration methods and propose a new approach for one of the steps of the pipeline, with better performance for our data.

preprint2022arXiv

Robust Localization with Bounded Noise: Creating a Superset of the Possible Target Positions via Linear-Fractional Representations

Locating a target is key in many applications, namely in high-stakes real-world scenarios, like detecting humans or obstacles in vehicular networks. In scenarios where precise statistics of the measurement noise are unavailable, applications require localization methods that assume minimal knowledge on the noise distribution. We present a scalable algorithm delimiting a tight superset of all possible target locations, assuming range measurements to known landmarks, contaminated with bounded noise and unknown distributions. This superset is of primary interest in robust statistics since it is a tight majorizer of the set of Maximum-Likelihood (ML) estimates parametrized by noise densities respecting two main assumptions: (1) the noise distribution is supported on a ellipsoidal uncertainty region and (2) the measurements are non-negative with probability one. We create the superset through convex relaxations that use Linear Fractional Representations (LFRs), a well-known technique in robust control. For low noise regimes the supersets created by our method double the accuracy of a standard semidefinite relaxation. For moderate to high noise regimes our method still improves the benchmark but the benefit tends to be less significant, as both supersets tend to have the same size (area).

preprint2016arXiv

Dealing with bad apples: Robust range-based network localization via distributed relaxation methods

Real-world network applications must cope with failing nodes, malicious attacks, or, somehow, nodes facing corrupted data --- classified as outliers. One enabling application is the geographic localization of the network nodes. However, despite excellent work on the network localization problem, prior research seldom considered outlier data --- even now, when already deployed networks cry out for robust procedures. We propose robust, fast, and distributed network localization algorithms, resilient to high-power noise, but also precise under regular Gaussian noise. We use the Huber M-estimator as a difference measure between the distance of estimated nodes and noisy range measurements, thus obtaining a robust (but nonconvex) optimization problem. We then devise a convex underestimator solvable in polynomial time, and tight in the inter-node terms. We also provide an optimality bound for the convex underestimator. We put forward a new representation of the Huber function composed with a norm, enabling distributed robust localization algorithms to minimize the proposed underestimator. The synchronous distributed method has optimal convergence rate and the asynchronous one converges in finite time, for a given precision. The main highlight of our contribution lies on the fact that we pay no price for distributed computation nor in accuracy, nor in communication cost or convergence speed. Simulations show the advantage of using our proposed algorithms, both in the presence of outliers and under regular Gaussian noise: our method exceeds the accuracy of an alternative robust approach based on L1 norms by at least 100m in an area of 1Km sides.

preprint2015arXiv

Simple and fast convex relaxation method for cooperative localization in sensor networks using range measurements

We address the sensor network localization problem given noisy range measurements between pairs of nodes. We approach the non-convex maximum-likelihood formulation via a known simple convex relaxation. We exploit its favorable optimization properties to the full to obtain an approach that: is completely distributed, has a simple implementation at each node, and capitalizes on an optimal gradient method to attain fast convergence. We offer a parallel but also an asynchronous flavor, both with theoretical convergence guarantees and iteration complexity analysis. Experimental results establish leading performance. Our algorithms top the accuracy of a comparable state of the art method by one order of magnitude, using one order of magnitude fewer communications.

preprint2014arXiv

Robust dissimilarity measure for Network Localization

In practice, network applications have to deal with failing nodes, malicious attacks, or, somehow, nodes facing highly corrupted data --- generally classified as outliers. This calls for robust, uncomplicated, and efficient methods. We propose a dissimilarity model for network localization which is robust to high-power noise, but also discriminative in the presence of regular gaussian noise. We capitalize on the known properties of the M-estimator Huber penalty function to obtain a robust, but nonconvex, problem, and devise a convex underestimator, tight in the function terms, that can be minimized in polynomial time. Simulations show the performance advantage of using this dissimilarity model in the presence of outliers and under regular gaussian noise.

Cláudia Soares

What is connected

Connect this record

See the researcher in context

Building this map preview

7 published item(s)

A Cluster-Based Trip Prediction Graph Neural Network Model for Bike Sharing Systems

Decentralized EM to Learn Gaussian Mixtures from Datasets Distributed by Features

From noisy point clouds to complete ear shapes: unsupervised pipeline

Robust Localization with Bounded Noise: Creating a Superset of the Possible Target Positions via Linear-Fractional Representations

Dealing with bad apples: Robust range-based network localization via distributed relaxation methods

Simple and fast convex relaxation method for cooperative localization in sensor networks using range measurements

Robust dissimilarity measure for Network Localization