Source author record

Daniel Ramos

Daniel Ramos appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Topics: Machine Learning, math.DG, Applications, eess.AS, Software Engineering, Sound

Catalog footprint

What is connected

8 works

6 topics

4 close collaborators


Preprint, 2022, arXiv

Adaptive Temperature Scaling for Robust Calibration of Deep Neural Networks

In this paper, we study the post-hoc calibration of modern neural networks, a problem that has drawn a lot of attention in recent years. Many calibration methods of varying complexity have been proposed for the task, but there is no consensus about how expressive these should be. We focus on the task of confidence scaling, specifically on post-hoc methods that generalize Temperature Scaling, which we call the Adaptive Temperature Scaling family. We analyse expressive functions that improve calibration and propose interpretable methods. We show that when there is plenty of data, complex models like neural networks yield better performance, but they are prone to fail when the amount of data is limited, a common situation in certain post-hoc calibration applications like medical diagnosis. We study the functions that expressive methods learn under ideal conditions and design simpler methods with a strong inductive bias towards these well-performing functions. Concretely, we propose Entropy-based Temperature Scaling, a simple method that scales the confidence of a prediction according to its entropy. Results show that our method obtains state-of-the-art performance when compared to others and, unlike complex models, it is robust against data scarcity. Moreover, our proposed model enables a deeper interpretation of the calibration process.
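The idea of scaling confidence by entropy can be illustrated with a minimal sketch. The abstract does not state the formula, so the parametrization below, T = softplus(w · log H + b) with H the normalized entropy of the uncalibrated prediction, and the parameter names `w`, `b` and `min_temp` are assumptions chosen for illustration, not the authors' implementation.

```python
import math


def softmax(logits, temperature=1.0):
    """Softmax of logits divided by a positive temperature."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def normalized_entropy(probs):
    """Shannon entropy divided by log(K), so the result lies in [0, 1]."""
    h = -sum(p * math.log(p) for p in probs if p > 0.0)
    return h / math.log(len(probs))


def softplus(x):
    """Numerically stable log(1 + exp(x)); always non-negative."""
    return math.log1p(math.exp(-abs(x))) + max(x, 0.0)


def entropy_temperature(logits, w, b, eps=1e-12, min_temp=1e-6):
    """Per-sample temperature from the prediction's entropy.

    ASSUMPTION: this functional form is hypothetical, not taken
    from the paper. w and b would be fitted on a validation set.
    """
    h = normalized_entropy(softmax(logits))
    return max(softplus(w * math.log(h + eps) + b), min_temp)


def calibrate(logits, w, b):
    """Rescale the logits with an entropy-dependent temperature."""
    return softmax(logits, entropy_temperature(logits, w, b))
```

Because the temperature is positive, the predicted class is unchanged and only the confidence moves. With `w = 0` the sketch reduces to ordinary Temperature Scaling with a single temperature `softplus(b)`.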

Preprint, 2021, arXiv

SOAR: A Synthesis Approach for Data Science API Refactoring

With the growth of the open-source data science community, both the number of data science libraries and the number of versions for the same library are increasing rapidly. To match the evolving APIs from those libraries, open-source organizations often have to exert manual effort to refactor the APIs used in the code base. Moreover, due to the abundance of similar open-source libraries, data scientists working on a certain application may have many libraries to choose from, maintain and migrate between. The manual refactoring between APIs is a tedious and error-prone task. Although recent research efforts have addressed automatic API refactoring between different languages, previous work relies on statistical learning with collected pairwise training data for the API matching and migration. Using large statistical data for refactoring is not ideal because such training data will not be available for a new library or a new version of the same library. We introduce Synthesis for Open-Source API Refactoring (SOAR), a novel technique that requires no training data to achieve API migration and refactoring. SOAR relies only on the documentation that is readily available at the release of the library to learn API representations and mapping between libraries. Using program synthesis, SOAR automatically computes the correct configuration of arguments to the APIs and any glue code required to invoke those APIs. SOAR also uses the interpreter's error messages when running refactored code to generate logical constraints that can be used to prune the search space. Our empirical evaluation shows that SOAR can successfully refactor 80% of our benchmarks corresponding to deep learning models with up to 44 layers with an average run time of 97.23 seconds, and 90% of the data wrangling benchmarks with an average run time of 17.31 seconds.

Preprint, 2020, arXiv

Calibration of Deep Probabilistic Models with Decoupled Bayesian Neural Networks

Deep Neural Networks (DNNs) have achieved state-of-the-art accuracy in many tasks. However, recent works have pointed out that the outputs provided by these models are not well-calibrated, seriously limiting their use in critical decision scenarios. In this work, we propose to use a decoupled Bayesian stage, implemented with a Bayesian Neural Network (BNN), to map the uncalibrated probabilities provided by a DNN to calibrated ones, consistently improving calibration. Our results show that incorporating uncertainty provides more reliable probabilistic models, a critical condition for achieving good calibration. We report a generous collection of experimental results using high-accuracy DNNs in standardized image classification benchmarks, showing the good performance, flexibility and robust behavior of our approach with respect to several state-of-the-art calibration methods. Code for reproducibility is provided.

Preprint, 2020, arXiv

Statistical Models in Forensic Voice Comparison

This chapter describes a number of signal-processing and statistical-modeling techniques that are commonly used to calculate likelihood ratios in human-supervised automatic approaches to forensic voice comparison. Techniques described include mel-frequency cepstral coefficients (MFCCs) feature extraction, Gaussian mixture model - universal background model (GMM-UBM) systems, i-vector - probabilistic linear discriminant analysis (i-vector PLDA) systems, deep neural network (DNN) based systems (including senone posterior i-vectors, bottleneck features, and embeddings / x-vectors), mismatch compensation, and score-to-likelihood-ratio conversion (aka calibration). Empirical validation of forensic-voice-comparison systems is also covered. The aim of the chapter is to bridge the gap between general introductions to forensic voice comparison and the highly technical automatic-speaker-recognition literature from which the signal-processing and statistical-modeling techniques are mostly drawn. Knowledge of the likelihood-ratio framework for the evaluation of forensic evidence is assumed. It is hoped that the material presented here will be of value to students of forensic voice comparison and to researchers interested in learning about statistical modeling techniques that could potentially also be applied to data from other branches of forensic science.
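As a rough illustration of score-to-likelihood-ratio conversion, the sketch below fits an affine map from raw comparison scores to log likelihood ratios with logistic regression. Logistic regression is one common choice for this step, but the abstract does not say which method the chapter uses, so the technique, the function names and the training settings here are all assumptions.

```python
import math


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def fit_affine_calibration(same_scores, diff_scores, lr=0.1, steps=2000):
    """Fit log LR = a * score + b by gradient descent on the logistic loss.

    ASSUMPTION: illustrative helper, not taken from the chapter.
    same_scores come from same-speaker pairs, diff_scores from
    different-speaker pairs.
    """
    a, b = 1.0, 0.0
    data = [(s, 1.0) for s in same_scores] + [(s, 0.0) for s in diff_scores]
    n = len(data)
    for _ in range(steps):
        grad_a = grad_b = 0.0
        for s, y in data:
            err = sigmoid(a * s + b) - y
            grad_a += err * s
            grad_b += err
        a -= lr * grad_a / n
        b -= lr * grad_b / n
    # The regression learns posterior log-odds; removing the training-set
    # prior log-odds leaves a log likelihood ratio.
    prior_log_odds = math.log(len(same_scores) / len(diff_scores))
    return a, b - prior_log_odds


def log_likelihood_ratio(score, a, b):
    """Convert a raw score to a natural-log likelihood ratio."""
    return a * score + b
```

A positive output supports the same-speaker hypothesis and a negative output supports the different-speaker hypothesis; the magnitude expresses the strength of that support.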

Preprint, 2019, arXiv

Generative Models For Deep Learning with Very Scarce Data

The goal of this paper is to deal with a data scarcity scenario where deep learning techniques tend to fail. We compare the use of two well-established techniques, Restricted Boltzmann Machines and Variational Auto-encoders, as generative models in order to enlarge the training set in a classification framework. Essentially, we rely on Markov Chain Monte Carlo (MCMC) algorithms for generating new samples. We show that this methodology can improve generalization compared with other state-of-the-art techniques, e.g. semi-supervised learning with ladder networks. Furthermore, we show that the RBM is better than the VAE at generating new samples for training a classifier with good generalization capabilities.

Preprint, 2013, arXiv

An asymptotically cusped three dimensional expanding gradient Ricci soliton

We construct an expanding gradient Ricci soliton in dimension three over the topological manifold R x T^2 (the product of a line and a torus) that asymptotically approaches a constant curvature cusp at one end, and a flat manifold at the other end. We prove that this is the only gradient soliton with this topology, provided the curvature is negatively pinched, -1/4 < sec < 0, at the time-zero manifold (normalizing the soliton to be born at time -1).
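For readers unfamiliar with the term, the standard definition of a gradient Ricci soliton may help. It is not quoted from the abstract, and sign and normalization conventions vary between authors, so the constant used in the paper may differ by a factor from the λ below.

```latex
% A Riemannian metric g with a potential function f is a gradient Ricci soliton if
\operatorname{Ric}(g) + \nabla^2 f = \lambda\, g, \qquad \lambda \in \mathbb{R},
% and, in this convention, the soliton is
%   shrinking if \lambda > 0, steady if \lambda = 0, expanding if \lambda < 0.
```

Under the Ricci flow such a metric changes only by scaling and by the diffeomorphisms generated by the gradient of f, which is why solitons serve as self-similar models for the flow.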

Preprint, 2013, arXiv

Gradient Ricci solitons on surfaces

We classify and describe all the gradient Ricci solitons on complete surfaces, open or closed, with curvature bounded below, and possibly with a discrete set of cone-like singular points that arise naturally. We give a precise qualitative description of each metric in terms of a phase portrait, which is the most accurate description for all cases that do not admit an explicit expression in terms of elementary functions. Our classification contains examples of smooth and conic solitons that were not described in the classic literature. We add some visual embeddings in R^3 for aesthetics.

Preprint, 2011, arXiv

Smoothening cone points with Ricci flow

We consider Ricci flow on a closed surface with cone points. The main result is: given a (nonsmooth) cone metric g_0 over a closed surface there is a smooth Ricci flow g(t) defined for (0,T], with curvature unbounded above, such that g(t) tends to g_0 as t tends to 0. This result means that Ricci flow provides a way for instantaneously smoothening cone points. We follow an argument of P. Topping modifying his reasoning for cusps of negative curvature; in that sense we can consider cusps as a limiting zero-angle cone, and we generalize to any angle between 0 and 2π.
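The objects named in this abstract can be written down explicitly. The equations below are the standard textbook forms, given here as an aid and not taken from the paper; the paper may use a normalized flow or a conformal description of the cone metric instead.

```latex
% Ricci flow; on a surface Ric = K g, with K the Gauss curvature:
\frac{\partial g}{\partial t} = -2\,\operatorname{Ric}\bigl(g(t)\bigr) = -2K\,g(t).

% Flat model cone of angle \alpha near a cone point, in polar coordinates (r, \theta):
g_{\alpha} = dr^{2} + \Bigl(\frac{\alpha}{2\pi}\Bigr)^{2} r^{2}\, d\theta^{2},
\qquad 0 < \alpha < 2\pi, \quad \theta \in [0, 2\pi).
```

For α = 2π the model metric is the smooth flat plane, and letting α tend to 0 gives the degenerate case that the abstract compares to a cusp.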
