Source author record

Valerio Gemmetto

Valerio Gemmetto appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

5works
5topics
4close collaborators

Actions

Connect this record

Log in to claim

Research graph

See the researcher in context

Open full explorer

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2016arXiv

Ground truth? Concept-based communities versus the external classification of physics manuscripts

Community detection techniques are widely used to infer hidden structures within interconnected systems. Despite demonstrating high accuracy on benchmarks, they reproduce the external classification for many real-world systems with a significant level of discrepancy. A widely accepted reason behind such outcome is the unavoidable loss of non-topological information (such as node attributes) encountered when the original complex system is represented as a network. In this article we emphasize that the observed discrepancies may also be caused by a different reason: the external classification itself. For this end we use scientific publication data which i) exhibit a well defined modular structure and ii) hold an expert-made classification of research articles. Having represented the articles and the extracted scientific concepts both as a bipartite network and as its unipartite projection, we applied modularity optimization to uncover the inner thematic structure. The resulting clusters are shown to partly reflect the author-made classification, although some significant discrepancies are observed. A detailed analysis of these discrepancies shows that they carry essential information about the system, mainly related to the use of similar techniques and methods across different (sub)disciplines, that is otherwise omitted when only the external classification is considered.

preprint2016arXiv

Multiplexity and multireciprocity in directed multiplexes

Real-world multi-layer networks feature nontrivial dependencies among links of different layers. Here we argue that, if links are directed, dependencies are twofold. Besides the ordinary tendency of links of different layers to align as the result of `multiplexity', there is also a tendency to anti-align as the result of what we call `multireciprocity', i.e. the fact that links in one layer can be reciprocated by \emph{opposite} links in a different layer. Multireciprocity generalizes the scalar definition of single-layer reciprocity to that of a square matrix involving all pairs of layers. We introduce multiplexity and multireciprocity matrices for both binary and weighted multiplexes and validate their statistical significance against maximum-entropy null models that filter out the effects of node heterogeneity. We then perform a detailed empirical analysis of the World Trade Multiplex (WTM), representing the import-export relationships between world countries in different commodities. We show that the WTM exhibits strong multiplexity and multireciprocity, an effect which is however largely encoded into the degree or strength sequences of individual layers. The residual effects are still significant and allow to classify pairs of commodities according to their tendency to be traded together in the same direction and/or in opposite ones. We also find that the multireciprocity of the WTM is significantly lower than the usual reciprocity measured on the aggregate network. Moreover, layers with low (high) internal reciprocity are embedded within sets of layers with comparably low (high) mutual multireciprocity. This suggests that, in the WTM, reciprocity is inherent to groups of related commodities rather than to individual commodities. We discuss the implications for international trade research focusing on product taxonomies, the product space, and fitness/complexity metrics.

preprint2016arXiv

ScienceWISE: Topic Modeling over Scientific Literature Networks

We provide an up-to-date view on the knowledge management system ScienceWISE (SW) and address issues related to the automatic assignment of articles to research topics. So far, SW has been proven to be an effective platform for managing large volumes of technical articles by means of ontological concept-based browsing. However, as the publication of research articles accelerates, the expressivity and the richness of the SW ontology turns into a double-edged sword: a more fine-grained characterization of articles is possible, but at the cost of introducing more spurious relations among them. In this context, the challenge of continuously recommending relevant articles to users lies in tackling a network partitioning problem, where nodes represent articles and co-occurring concepts create edges between them. In this paper, we discuss the three research directions we have taken for solving this issue: i) the identification of generic concepts to reinforce inter-article similarities; ii) the adoption of a bipartite network representation to improve scalability; iii) the design of a clustering algorithm to identify concepts for cross-disciplinary articles and obtain fine-grained topics for all articles.

preprint2014arXiv

Mitigation of infectious disease at school: targeted class closure vs school closure

School environments are thought to play an important role in the community spread of airborne infections (e.g., influenza) because of the high mixing rates of school children. The closure of schools has therefore been proposed as efficient mitigation strategy, with however high social and economic costs: alternative, less disruptive interventions are highly desirable. The recent availability of high-resolution contact networks in school environments provides an opportunity to design micro-interventions and compare the outcomes of alternative mitigation measures. We consider mitigation measures that involve the targeted closure of school classes or grades based on readily available information such as the number of symptomatic infectious children in a class. We focus on the case of a primary school for which we have high-resolution data on the close-range interactions of children and teachers. We simulate the spread of an influenza-like illness in this population by using an SEIR model with asymptomatics and compare the outcomes of different mitigation strategies. We find that targeted class closure affords strong mitigation effects: closing a class for a fixed period of time -equal to the sum of the average infectious and latent durations- whenever two infectious individuals are detected in that class decreases the attack rate by almost 70% and strongly decreases the probability of a severe outbreak. The closure of all classes of the same grade mitigates the spread almost as much as closing the whole school. Targeted class closure strategies based on readily available information on symptomatic subjects and on limited information on mixing patterns, such as the grade structure of the school, can be almost as effective as whole-school closure, at a much lower cost. This may inform public health policies for the management and mitigation of influenza-like outbreaks in the community.

preprint2014arXiv

Multiplexity versus correlation: the role of local constraints in real multiplexes

Several real-world systems can be represented as multi-layer complex networks, i.e. in terms of a superposition of various graphs, each related to a different mode of connection between nodes. Hence, the definition of proper mathematical quantities aiming at capturing the level of complexity of those systems is required. Various attempts have been made to measure the empirical dependencies between the layers of a multiplex, for both binary and weighted networks. In the simplest case, such dependencies are measured via correlation-based metrics: we show that this is equivalent to the use of completely homogeneous benchmarks specifying only global constraints, such as the total number of links in each layer. However, these approaches do not take into account the heterogeneity in the degree and strength distributions, which are instead a fundamental feature of real-world multiplexes. In this work, we compare the observed dependencies between layers with the expected values obtained from reference models that appropriately control for the observed heterogeneity in the degree and strength distributions. This leads to novel multiplexity measures that we test on different datasets, i.e. the International Trade Network (ITN) and the European Airport Network (EAN). Our findings confirm that the use of homogeneous benchmarks can lead to misleading results, and furthermore highlight the important role played by the distribution of hubs across layers.