Researcher profile

Roberto Gonzalez

Roberto Gonzalez contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2022arXiv

syslrn: Learning What to Monitor for Efficient Anomaly Detection

While monitoring system behavior to detect anomalies and failures is important, existing methods based on log-analysis can only be as good as the information contained in the logs, and other approaches that look at the OS-level software state introduce high overheads. We tackle the problem with syslrn, a system that first builds an understanding of a target system offline, and then tailors the online monitoring instrumentation based on the learned identifiers of normal behavior. While our syslrn prototype is still preliminary and lacks many features, we show in a case study for the monitoring of OpenStack failures that it can outperform state-of-the-art log-analysis systems with little overhead.

preprint2013arXiv

Google+ or Google-?: Dissecting the Evolution of the New OSN in its First Year

In the era when Facebook and Twitter dominate the market for social media, Google has introduced Google+ (G+) and reported a significant growth in its size while others called it a ghost town. This begs the question that "whether G+ can really attract a significant number of connected and active users despite the dominance of Facebook and Twitter?". This paper tackles the above question by presenting a detailed characterization of G+ based on large scale measurements. We identify the main components of G+ structure, characterize the key features of their users and their evolution over time. We then conduct detailed analysis on the evolution of connectivity and activity among users in the largest connected component (LCC) of G+ structure, and compare their characteristics with other major OSNs. We show that despite the dramatic growth in the size of G+, the relative size of LCC has been decreasing and its connectivity has become less clustered. While the aggregate user activity has gradually increased, only a very small fraction of users exhibit any type of activity. To our knowledge, our study offers the most comprehensive characterization of G+ based on the largest collected data sets.

preprint2012arXiv

TorrentGuard: stopping scam and malware distribution in the BitTorrent ecosystem

In this paper we conduct a large scale measurement study in order to analyse the fake content publishing phenomenon in the BitTorrent Ecosystem. Our results reveal that fake content represents an important portion (35%) of those files shared in BitTorrent and just a few tens of users are responsible for 90% of this content. Furthermore, more than 99% of the analysed fake files are linked to either malware or scam websites. This creates a serious threat for the BitTorrent ecosystem. To address this issue, we present a new detection tool named TorrentGuard for the early detection of fake content. Based on our evaluation this tool may prevent the download of more than 35 millions of fake files per year. This could help to reduce the number of computer infections and scams suffered by BitTorrent users. TorrentGuard is already available and it can be accessed through both a webpage or a Vuze plugin.

preprint2011arXiv

Where are my followers? Understanding the Locality Effect in Twitter

Twitter is one of the most used applications in the current Internet with more than 200M accounts created so far. As other large-scale systems Twitter can obtain enefit by exploiting the Locality effect existing among its users. In this paper we perform the first comprehensive study of the Locality effect of Twitter. For this purpose we have collected the geographical location of around 1M Twitter users and 16M of their followers. Our results demonstrate that language and cultural characteristics determine the level of Locality expected for different countries. Those countries with a different language than English such as Brazil typically show a high intra-country Locality whereas those others where English is official or co-official language suffer from an external Locality effect. This is, their users have a larger number of followers in US than within their same country. This is produced by two reasons: first, US is the dominant country in Twitter counting with around half of the users, and second, these countries share a common language and cultural characteristics with US.

preprint2010arXiv

Local and global environmental effects on galaxies and active galactic nuclei

We study the properties of SDSS galaxies with and without AGN detection as a function of the local and global environment measured via the local density, the mass of the galaxy host group (parameterised by the group luminosity) and distance to massive clusters. Our results can be divided in two main subjects, the environments of galaxies and their relation to the assembly of their host haloes, and the environments of AGN. (i) For the full SDSS sample, we find indications that the local galaxy density is the most efficient parameter to separate galaxy populations, but we also find that galaxies at fixed local density show some remaining variation of their properties as a function of the distance to the nearest cluster of galaxies (in a range of 0 to 10 cluster virial radii). These differences seem to become less significant if the galaxy samples are additionally constrained to be hosted by groups of similar total luminosity. (ii) In AGN host galaxies, the morphology-density relation is much less noticeable when compared to the behaviour of the full SDSS sample. In order to interpret this result we analyse control samples constructed using galaxies with no detected AGN activity with matching distributions of redshifts, stellar masses, r-band luminosities, g-r colours, concentrations, local densities, host group luminosities, and fractions of central and satellite galaxies. The control samples also show a similar small dependence on the local density indicating an influence from the AGN selection, but their colours are slightly bluer compared to the AGN hosts regardless of local density. Furthermore, even when the local density is held fixed at intermediate or high values, and the distance to the closest cluster of galaxies is allowed to vary, AGN control galaxies away from clusters tend to be bluer than the AGN hosts. (ABRIDGED)