Researcher profile

Bruno Ribeiro

Bruno Ribeiro contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
13works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

13 published item(s)

preprint2026arXiv

Bridging Input Feature Spaces Towards Graph Foundation Models

Unlike vision and language domains, graph learning lacks a shared input space, as input features differ across graph datasets not only in semantics, but also in value ranges and dimensionality. This misalignment prevents graph models from generalizing across datasets, limiting their use as foundation models. In this work, we propose ALL-IN, a simple and theoretically grounded method that enables transferability across datasets with different input features. Our approach projects node features into a shared random space and constructs representations via covariance-based statistics, thus eliminating dependence on the original feature space. We show that the computed node-covariance operators and the resulting node representations are invariant in distribution to permutations of the input features. We further demonstrate that the expected operator exhibits invariance to general orthogonal transformations of the input features. Empirically, ALL-IN achieves strong performance across diverse node- and graph-level tasks on unseen datasets with new input features, without requiring architecture changes or retraining. These results point to a promising direction for input-agnostic, transferable graph models.

preprint2026arXiv

Imitative Membership Inference Attack

A Membership Inference Attack (MIA) assesses how much a target machine learning model reveals about its training data by determining whether specific query instances were part of the training set. State-of-the-art MIAs rely on training hundreds of shadow models that are independent of the target model, leading to significant computational overhead. In this paper, we introduce Imitative Membership Inference Attack (IMIA), which employs a novel imitative training technique to strategically construct a small number of target-informed imitative models that closely replicate the target model's behavior for inference. Extensive experimental results demonstrate that IMIA substantially outperforms existing MIAs in various attack settings while only requiring less than 5% of the computational cost of state-of-the-art approaches.

preprint2022arXiv

Bias Challenges in Counterfactual Data Augmentation

Deep learning models tend not to be out-of-distribution robust primarily due to their reliance on spurious features to solve the task. Counterfactual data augmentations provide a general way of (approximately) achieving representations that are counterfactual-invariant to spurious features, a requirement for out-of-distribution (OOD) robustness. In this work, we show that counterfactual data augmentations may not achieve the desired counterfactual-invariance if the augmentation is performed by a context-guessing machine, an abstract machine that guesses the most-likely context of a given input. We theoretically analyze the invariance imposed by such counterfactual data augmentations and describe an exemplar NLP task where counterfactual data augmentation by a context-guessing machine does not lead to robust OOD classifiers.

preprint2022arXiv

Veritas: Answering Causal Queries from Video Streaming Traces

In this paper, we seek to answer what-if questions - i.e., given recorded data of an existing deployed networked system, what would be the performance impact if we changed the design of the system (a task also known as causal inference). We make three contributions. First, we expose the complexity of causal inference in the context of adaptive bit rate video streaming, a challenging domain where the network conditions during the session act as a sequence of latent and confounding variables, and a change at any point in the session has a cascading impact on the rest of the session. Second, we present Veritas, a novel framework that tackles causal reasoning for video streaming without resorting to randomised trials. Integral to Veritas is an easy to interpret domain-specific ML model (an embedded Hidden Markov Model) that relates the latent stochastic process (intrinsic bandwidth that the video session can achieve) to actual observations (download times) while exploiting control variables such as the TCP state (e.g., congestion window) observed at the start of the download of video chunks. We show through experiments on an emulation testbed that Veritas can answer both counterfactual queries (e.g., the performance of a completed video session had it used a different buffer size) and interventional queries (e.g., estimating the download time for every possible video quality choice for the next chunk in a session in progress). In doing so, Veritas achieves accuracy close to an ideal oracle, while significantly outperforming both a commonly used baseline approach, and Fugu (an off-the-shelf neural network) neither of which account for causal effects.

preprint2021arXiv

Membership Inference Attacks and Defenses in Classification Models

We study the membership inference (MI) attack against classifiers, where the attacker's goal is to determine whether a data instance was used for training the classifier. Through systematic cataloging of existing MI attacks and extensive experimental evaluations of them, we find that a model's vulnerability to MI attacks is tightly related to the generalization gap -- the difference between training accuracy and test accuracy. We then propose a defense against MI attacks that aims to close the gap by intentionally reduces the training accuracy. More specifically, the training process attempts to match the training and validation accuracies, by means of a new {\em set regularizer} using the Maximum Mean Discrepancy between the softmax output empirical distributions of the training and validation sets. Our experimental results show that combining this approach with another simple defense (mix-up training) significantly improves state-of-the-art defense against MI attacks, with minimal impact on testing accuracy.

preprint2020arXiv

ALMA reveals the molecular gas properties of 5 star-forming galaxies across the main sequence at 3 < z < 3.5

We present the detection of CO(5-4) with S/N> 7 - 13 and a lower CO transition with S/N > 3 (CO(4-3) for 4 galaxies, and CO(3-2) for one) with ALMA in band 3 and 4 in five main sequence star-forming galaxies with stellar masses 3-6x10^10 M/M_sun at 3 < z < 3.5. We find a good correlation between the total far-infrared luminosity LFIR and the luminosity of the CO(5-4) transition L&#39;CO(5-4), where L&#39;CO(5-4) increases with SFR, indicating that CO(5-4) is a good tracer of the obscured SFR in these galaxies. The two galaxies that lie closer to the star-forming main sequence have CO SLED slopes that are comparable to other star-forming populations, such as local SMGs and BzK star-forming galaxies; the three objects with higher specific star formation rates (sSFR) have far steeper CO SLEDs, which possibly indicates a more concentrated episode of star formation. By exploiting the CO SLED slopes to extrapolate the luminosity of the CO(1-0) transition, and using a classical conversion factor for main sequence galaxies of alpha_CO = 3.8 M_sun(K km s^-1 pc^-2)^-1, we find that these galaxies are very gas rich, with molecular gas fractions between 60 and 80%, and quite long depletion times, between 0.2 and 1 Gyr. Finally, we obtain dynamical masses that are comparable with the sum of stellar and gas mass (at least for four out of five galaxies), allowing us to put a first constraint on the alpha_CO parameter for main sequence galaxies at an unprecedented redshift.

preprint2020arXiv

Deceptive Deletions for Protecting Withdrawn Posts on Social Platforms

Over-sharing poorly-worded thoughts and personal information is prevalent on online social platforms. In many of these cases, users regret posting such content. To retrospectively rectify these errors in users&#39; sharing decisions, most platforms offer (deletion) mechanisms to withdraw the content, and social media users often utilize them. Ironically and perhaps unfortunately, these deletions make users more susceptible to privacy violations by malicious actors who specifically hunt post deletions at large scale. The reason for such hunting is simple: deleting a post acts as a powerful signal that the post might be damaging to its owner. Today, multiple archival services are already scanning social media for these deleted posts. Moreover, as we demonstrate in this work, powerful machine learning models can detect damaging deletions at scale. Towards restraining such a global adversary against users&#39; right to be forgotten, we introduce Deceptive Deletion, a decoy mechanism that minimizes the adversarial advantage. Our mechanism injects decoy deletions, hence creating a two-player minmax game between an adversary that seeks to classify damaging content among the deleted posts and a challenger that employs decoy deletions to masquerade real damaging deletions. We formalize the Deceptive Game between the two players, determine conditions under which either the adversary or the challenger provably wins the game, and discuss the scenarios in-between these two extremes. We apply the Deceptive Deletion mechanism to a real-world task on Twitter: hiding damaging tweet deletions. We show that a powerful global adversary can be beaten by a powerful challenger, raising the bar significantly and giving a glimmer of hope in the ability to be really forgotten on social platforms.

preprint2020arXiv

Infinity Learning: Learning Markov Chains from Aggregate Steady-State Observations

We consider the task of learning a parametric Continuous Time Markov Chain (CTMC) sequence model without examples of sequences, where the training data consists entirely of aggregate steady-state statistics. Making the problem harder, we assume that the states we wish to predict are unobserved in the training data. Specifically, given a parametric model over the transition rates of a CTMC and some known transition rates, we wish to extrapolate its steady state distribution to states that are unobserved. A technical roadblock to learn a CTMC from its steady state has been that the chain rule to compute gradients will not work over the arbitrarily long sequences necessary to reach steady state ---from where the aggregate statistics are sampled. To overcome this optimization challenge, we propose $\infty$-SGD, a principled stochastic gradient descent method that uses randomly-stopped estimators to avoid infinite sums required by the steady state computation, while learning even when only a subset of the CTMC states can be observed. We apply $\infty$-SGD to a real-world testbed and synthetic experiments showcasing its accuracy, ability to extrapolate the steady state distribution to unobserved states under unobserved conditions (heavy loads, when training under light loads), and succeeding in difficult scenarios where even a tailor-made extension of existing methods fails.

preprint2020arXiv

On the Equivalence between Positional Node Embeddings and Structural Graph Representations

This work provides the first unifying theoretical framework for node (positional) embeddings and structural graph representations, bridging methods like matrix factorization and graph neural networks. Using invariant theory, we show that the relationship between structural representations and node embeddings is analogous to that of a distribution and its samples. We prove that all tasks that can be performed by node embeddings can also be performed by structural representations and vice-versa. We also show that the concept of transductive and inductive learning is unrelated to node embeddings and graph representations, clearing another source of confusion in the literature. Finally, we introduce new practical guidelines to generating and using node embeddings, which fixes significant shortcomings of standard operating procedures used today.

preprint2020arXiv

Random Spiking and Systematic Evaluation of Defenses Against Adversarial Examples

Image classifiers often suffer from adversarial examples, which are generated by strategically adding a small amount of noise to input images to trick classifiers into misclassification. Over the years, many defense mechanisms have been proposed, and different researchers have made seemingly contradictory claims on their effectiveness. We present an analysis of possible adversarial models, and propose an evaluation framework for comparing different defense mechanisms. As part of the framework, we introduce a more powerful and realistic adversary strategy. Furthermore, we propose a new defense mechanism called Random Spiking (RS), which generalizes dropout and introduces random noises in the training process in a controlled manner. Evaluations under our proposed framework suggest RS delivers better protection against adversarial examples than many existing schemes.

preprint2020arXiv

The evolution of rest-frame UV properties, Lya EWs and the SFR-Stellar mass relation at z~2-6 for SC4K LAEs

We explore deep rest-frame UV to FIR data in the COSMOS field to measure the individual spectral energy distributions (SED) of the ~4000 SC4K (Sobral et al. 2018) Lyman-alpha (Lya) emitters (LAEs) at z~2-6. We find typical stellar masses of 10$^{9.3\pm0.6}$ M$_{\odot}$ and star formation rates (SFR) of SFR$_{SED}=4.4^{+10.5}_{-2.4}$ M$_{\odot}$/yr and SFR$_{Lya}=5.9^{+6.3}_{-2.6}$ M$_{\odot}$/yr, combined with very blue UV slopes of beta=-2.1$^{+0.5}_{-0.4}$, but with significant variations within the population. M$_{UV}$ and beta are correlated in a similar way to UV-selected sources, but LAEs are consistently bluer. This suggests that LAEs are the youngest and/or most dust-poor subset of the UV-selected population. We also study the Lya rest-frame equivalent width (EW$_0$) and find 45 &#34;extreme&#34; LAEs with EW$_0>240$ A (3 $σ$), implying a low number density of $(7\pm1)\times10^{-7}$ Mpc$^{-3}$. Overall, we measure little to no evolution of the Lya EW$_0$ and scale length parameter ($w_0$) which are consistently high (EW$_0=140^{+280}_{-70}$ A, $w_0=129^{+11}_{-11}$ A) from z~6 to z~2 and below. However, $w_0$ is anti-correlated with M$_{UV}$ and stellar mass. Our results imply that sources selected as LAEs have a high Lya escape fraction (f$_{esc, Lya}$) irrespective of cosmic time, but f$_{esc, Lya}$ is still higher for UV-fainter and lower mass LAEs. The least massive LAEs ($<10^{9.5}$ M$_{\odot}$) are typically located above the star formation &#34;Main Sequence&#34; (MS), but the offset from the MS decreases towards z~6 and towards $10^{10}$ M$_{\odot}$. Our results imply a lack of evolution in the properties of LAEs across time and reveals the increasing overlap in properties of LAEs and UV-continuum selected galaxies as typical star-forming galaxies at high redshift effectively become LAEs.

preprint2020arXiv

Towards Studying Hierarchical Assembly in Real Time: A Milky Way Progenitor Galaxy at z = 2.36 under the Microscope

We use Hubble Space Telescope (HST) imaging and near-infrared spectroscopy from Keck/MOSFIRE to study the sub-structure around the progenitor of a Milky Way-mass galaxy in the Hubble Frontier Fields (HFF). Specifically, we study an $r_e = 40^{+70}_{-30}$pc, $M_{\star} \sim 10^{8.2} M_{\odot}$ rest-frame ultra-violet luminous &#34;clump&#34; at a projected distance of $\sim$100~pc from a $M_{\star} \sim 10^{9.8}$M$_{\odot}$ galaxy at $z = 2.36$ with a magnification $μ= 5.21$. We measure the star formation history of the clump and galaxy by jointly modeling the broadband spectral energy distribution from HST photometry and H$α$ from MOSFIRE spectroscopy. Given our inferred properties (e.g., mass, metallicity, dust) of the clump and galaxy, we explore scenarios in which the clump formed \emph{in-situ} (e.g., a star forming complex) or \emph{ex-situ} (e.g., a dwarf galaxy being accreted). If it formed \emph{in-situ}, we conclude that the clump is likely a single entity as opposed to a aggregation of smaller star clusters, making it one of the most dense star clusters cataloged. If it formed \emph{ex-situ}, then we are witnessing an accretion event with a 1:40 stellar mass ratio. However, our data alone are not informative enough to distinguish between \emph{in-situ} and \emph{ex-situ} scenarios to a high level of significance. We posit that the addition of high-fidelity metallicity information, such as [OIII]4363Å, which can be detected at modest S/N with only a few hours of JWST/NIRSpec time, may be a powerful discriminant. We suggest that studying larger samples of moderately lensed sub-structures across cosmic time can provide unique insight into the hierarchical formation of galaxies like the Milky Way.

preprint2019arXiv

VIS3COS: III. environmental effects on the star formation histories of galaxies at z~0.8 seen in [OII], H$δ$, and Dn4000

[ABRIDGED] We present spectroscopic observations of 466 galaxies in and around a superstructure at $z\sim0.84$ targeted by the VIMOS Spectroscopic Survey of a Supercluster in the COSMOS field (VIS$^{3}$COS). We use [OII]$λ$3727, H$δ$, and $D_n4000$ to trace the recent, mid-, and long-term star formation histories and investigate how stellar mass and the local environment impacts those. By studying trends both in individual and composite galaxy spectra, we find that both stellar mass and environment play a role in the observed galactic properties. We find that the median [OII] equivalent width (|EW$_\mathrm{[OII]}|$) decreases from $27\pm2$ Å to $2.0_{-0.4}^{+0.5}$ Å and $D_n4000$ increases from $1.09\pm0.01$ to $1.56\pm0.03$ with increasing stellar mass (from $\sim10^{9.25}$ to $\sim10^{11.35}\ \mathrm{M_\odot}$). Concerning the dependence on the environment, we find that at fixed stellar mass |EW$_\mathrm{[OII]}|$ is tentatively lower in higher density environments. Regarding $D_n4000$, we find that the increase with stellar mass is sharper in denser environments, hinting that such environments may accelerate galaxy evolution. Moreover, we find larger $D_n4000$ values in denser environments at fixed stellar mass, suggesting that galaxies are on average older and/or more metal-rich in such dense environments. This set of tracers depicts a scenario where the most massive galaxies have, on average, the lowest sSFRs and the oldest stellar populations (age $\gtrsim1$ Gyr, showing a mass-downsizing effect). We also hypothesize that the observed increase in star formation (higher EW$_\mathrm{[OII]|}$, higher sSFR) at intermediate densities may lead to quenching since we find the quenched fraction to increase sharply from the filament to cluster-like regions at similar stellar masses.