Source author record

Fabio Porto

Fabio Porto appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Databases · Artificial Intelligence · Computational Engineering, Finance, and Science · Computer Vision

Catalog footprint

What is connected

6 works

4 topics

4 close collaborators

preprint · 2022 · arXiv

Hyperspherical embedding for novel class classification

Deep learning models have become increasingly useful in many different industries. In the domain of image classification, convolutional neural networks have proved able to learn robust features for the closed set problem, as shown on many different datasets, such as MNIST, FASHIONMNIST, CIFAR10, CIFAR100, and IMAGENET. These approaches use deep neural networks with dense layers and softmax activation functions to learn features that can separate classes in a latent space. However, this traditional approach is not useful for identifying classes unseen in the training set, known as the open set problem. A similar problem occurs in scenarios involving learning on small data. To tackle both problems, few-shot learning has been proposed. In particular, metric learning learns features that obey constraints of a metric distance in the latent space in order to perform classification. However, while this approach proves useful for the open set problem, current implementations require pair-wise training, where both positive and negative examples of similar images are presented during the training phase, which limits the applicability of these approaches in large data or large class scenarios given the combinatorial nature of the possible inputs. In this paper, we present a constraint-based approach applied to the representations in the latent space under the normalized softmax loss, proposed by [18]. We experimentally validate the proposed approach for the classification of unseen classes on different datasets using both metric learning and the normalized softmax loss, in disjoint and joint scenarios. Our results show that our proposed strategy not only can be trained efficiently on a larger set of classes, as it does not require pairwise learning, but also yields better classification results than the metric learning strategies, surpassing their accuracy by a significant margin.
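For readers unfamiliar with the normalized softmax loss mentioned in this abstract, the following is a minimal sketch of the generic idea only: embeddings and class weight vectors are projected onto the unit hypersphere, so the logits are scaled cosine similarities and no pairwise sampling is involved. It does not reproduce the paper's latent-space constraints. The function names and the temperature value are assumptions made for illustration.

```python
import math


def l2_normalize(vec):
    """Project a vector onto the unit hypersphere."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def normalized_softmax_loss(embedding, class_weights, label, temperature=0.05):
    """Cross-entropy over temperature-scaled cosine similarities.

    `temperature` is an illustrative value, not one taken from the paper.
    """
    z = l2_normalize(embedding)
    logits = []
    for w in class_weights:
        w_hat = l2_normalize(w)
        cosine = sum(a * b for a, b in zip(z, w_hat))
        logits.append(cosine / temperature)
    # Numerically stable log-sum-exp.
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(v - m) for v in logits))
    return log_sum_exp - logits[label]


weights = [[1.0, 0.0], [0.0, 1.0]]   # one weight vector per class
sample = [3.0, 0.1]                  # embedding pointing towards class 0
loss_correct = normalized_softmax_loss(sample, weights, label=0)
loss_wrong = normalized_softmax_loss(sample, weights, label=1)
```

Because both sides are normalized, rescaling the embedding leaves the loss unchanged; only its direction on the hypersphere matters.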

preprint · 2016 · arXiv

A note on the complexity of the causal ordering problem

In this note we provide a concise report on the complexity of the causal ordering problem, originally introduced by Simon to reason about causal dependencies implicit in systems of mathematical equations. We show that Simon's classical algorithm to infer causal ordering is NP-Hard, an intractability previously guessed but never proven. We then present a detailed account based on Nayak's suggested algorithmic solution (the best available), which is dominated by computing transitive closure, bounded in time by $O(|\mathcal V|\cdot |\mathcal S|)$, where $\mathcal S(\mathcal E, \mathcal V)$ is the input system structure composed of a set $\mathcal E$ of equations over a set $\mathcal V$ of variables with number of variable appearances (density) $|\mathcal S|$. We also comment on the potential of causal ordering for emerging applications in large-scale hypothesis management and analytics.
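The two steps named in this abstract can be illustrated with a small sketch. It assumes a structure given as a dictionary from equation names to the variables they contain, matches each equation to one variable it determines (a total causal mapping, found here with simple augmenting paths), and then computes the transitive closure with one graph traversal per variable, which is where a bound of the form $O(|\mathcal V|\cdot |\mathcal S|)$ comes from. The function names, the input encoding and the choice of matching routine are assumptions for illustration, not the note's own algorithms.

```python
def total_causal_mapping(structure):
    """Match every equation to one distinct variable it contains."""
    owner = {}  # variable -> equation currently matched to it

    def try_assign(eq, seen):
        for var in sorted(structure[eq]):
            if var in seen:
                continue
            seen.add(var)
            # Take a free variable, or re-route the equation that holds it.
            if var not in owner or try_assign(owner[var], seen):
                owner[var] = eq
                return True
        return False

    for eq in sorted(structure):
        if not try_assign(eq, set()):
            raise ValueError(f"equation {eq} cannot be matched to a variable")
    return {eq: var for var, eq in owner.items()}


def causal_dependencies(structure):
    """Return the mapping and, for each variable, all variables it influences."""
    mapping = total_causal_mapping(structure)
    direct = {var: set() for variables in structure.values() for var in variables}
    for eq, determined in mapping.items():
        for var in structure[eq]:
            if var != determined:
                direct[var].add(determined)  # var directly influences determined
    closure = {}
    for start in direct:  # one traversal per variable
        reached, stack = set(), list(direct[start])
        while stack:
            var = stack.pop()
            if var not in reached:
                reached.add(var)
                stack.extend(direct[var])
        closure[start] = reached
    return mapping, closure


# Hypothetical structure: f1(x) = 0, f2(x, y) = 0, f3(y, z) = 0.
structure = {"f1": {"x"}, "f2": {"x", "y"}, "f3": {"y", "z"}}
mapping, closure = causal_dependencies(structure)
```

In this toy structure, `f1` determines `x`, `f2` determines `y` and `f3` determines `z`, so `x` influences both `y` and `z`.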

preprint · 2015 · arXiv

Managing large-scale scientific hypotheses as uncertain and probabilistic data with support for predictive analytics

The sheer scale of high-resolution raw data generated by simulation has motivated non-conventional approaches for data exploration, referred to as 'immersive' and 'in situ' query processing of the raw simulation data. Another step towards supporting scientific progress is to enable data-driven hypothesis management and predictive analytics out of simulation results. We present a synthesis method and tool for encoding and managing competing hypotheses as uncertain data in a probabilistic database that can be conditioned in the presence of observations.
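As a rough illustration of what conditioning competing hypotheses on observations can mean, the sketch below applies Bayes' rule to a prior distribution over hypotheses. The Gaussian error model, the `sigma` parameter, the function name and the example numbers are all assumptions made for illustration; the abstract does not specify how the tool computes its conditioning.

```python
import math


def condition_on_observations(priors, predictions, observations, sigma=1.0):
    """Posterior probability of each hypothesis given the observed values.

    Assumes strictly positive priors and independent Gaussian observation
    errors with standard deviation `sigma` (an illustrative error model).
    """
    log_posterior = {}
    for name, prior in priors.items():
        squared_error = sum(
            (p - o) ** 2 for p, o in zip(predictions[name], observations)
        )
        log_posterior[name] = math.log(prior) - squared_error / (2 * sigma ** 2)
    # Normalize in log space for numerical stability.
    m = max(log_posterior.values())
    unnormalized = {n: math.exp(v - m) for n, v in log_posterior.items()}
    total = sum(unnormalized.values())
    return {n: u / total for n, u in unnormalized.items()}


# Two hypothetical competing hypotheses with equal prior probability.
priors = {"h1": 0.5, "h2": 0.5}
predictions = {"h1": [1.0, 2.0], "h2": [1.5, 3.0]}
observations = [1.0, 2.1]
posterior = condition_on_observations(priors, predictions, observations, sigma=0.5)
```

The hypothesis whose predictions lie closer to the observations ends up with the larger posterior probability, which is the sense in which hypotheses can be rated or ranked as probabilistic data.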

preprint · 2014 · arXiv

$Υ$-DB: A system for data-driven hypothesis management and analytics

The vision of $Υ$-DB introduces deterministic scientific hypotheses as a kind of uncertain and probabilistic data, and opens some key technical challenges for enabling data-driven hypothesis management and analytics. The $Υ$-DB system addresses those challenges through a design-by-synthesis pipeline that defines its architecture. It processes hypotheses from their XML-based extraction to encoding as uncertain and probabilistic U-relational data, and eventually to their conditioning in the presence of observations. In this demo we present a first prototype of the $Υ$-DB system. We showcase its core innovative features by means of use case scenarios in computational science in which the hypotheses are extracted from a model repository on the web and evaluated (rated/ranked) as probabilistic data.

preprint · 2014 · arXiv

$Υ$-DB: Managing scientific hypotheses as uncertain data

In view of the paradigm shift that makes science ever more data-driven, we consider deterministic scientific hypotheses as uncertain data. This vision comprises a probabilistic database (p-DB) design methodology for the systematic construction and management of U-relational hypothesis DBs, viz., $Υ$-DBs. It introduces hypothesis management as a promising new class of applications for p-DBs. We illustrate the potential of $Υ$-DB as a tool for deep predictive analytics.

preprint · 2014 · arXiv

Design-theoretic encoding of deterministic hypotheses as constraints and correlations into U-relational databases

In view of the paradigm shift that makes science ever more data-driven, in this paper we consider deterministic scientific hypotheses as uncertain data. In the form of mathematical equations, hypotheses symmetrically relate aspects of the studied phenomena. For computing predictions, however, deterministic hypotheses are used asymmetrically as functions. We refer to Simon's notion of structural equations in order to extract the (so-called) causal ordering embedded in a hypothesis. Then we encode it into a set of functional dependencies (fd's) that is the basic input to a design-theoretic method for the synthesis of U-relational databases (DB's). The causal ordering captured from a formally-specified system of mathematical equations into fd's determines not only the constraints (structure), but also the correlations (uncertainty chaining) hidden in the hypothesis predictive data. We show how to process it effectively through original algorithms for encoding and reasoning on the given hypotheses as constraints and correlations into U-relational DB's. The method is applicable to both quantitative and qualitative hypotheses and has undergone initial tests in a realistic use case from computational science.
