Source author record

Daniel Andrés Díaz-Pachón

Daniel Andrés Díaz-Pachón appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Methodology Populations and Evolution Information Theory math.IT physics.data-an Quantitative Methods astro-ph.CO Biological Physics math.PR physics.hist-ph

Catalog footprint

What is connected

7works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

"Back to the future" projections for COVID-19 surges

We argue that information from countries who had earlier COVID-19 surges can be used to inform another country's current model, then generating what we call back-to-the-future (BTF) projections. We show that these projections can be used to accurately predict future COVID-19 surges prior to an inflection point of the daily infection curve. We show, across 12 different countries from all populated continents around the world, that our method can often predict future surges in scenarios where the traditional approaches would always predict no future surges. However, as expected, BTF projections cannot accurately predict a surge due to the emergence of a new variant. To generate BTF projections, we make use of a matching scheme for asynchronous time series combined with a response coaching SIR model.

preprint2022arXiv

Active information, missing data and prevalence estimation

The topic of this paper is prevalence estimation from the perspective of active information. Prevalence among tested individuals has an upward bias under the assumption that individuals' willingness to be tested for the disease increases with the strength of their symptoms. Active information due to testing bias quantifies the degree at which the willingness to be tested correlates with infection status. Interpreting incomplete testing as a missing data problem, the missingness mechanism impacts the degree at which the bias of the original prevalence estimate can be removed. The reduction in prevalence, when testing bias is adjusted for, translates into an active information due to bias correction, with opposite sign to active information due to testing bias. Prevalence and active information estimates are asymptotically normal, a behavior also illustrated through simulations.

preprint2022arXiv

High Dimensional Mode Hunting Using Pettiest Components Analysis

Principal components analysis has been used to reduce the dimensionality of datasets for a long time. In this paper, we will demonstrate that in mode detection the components of smallest variance, the pettiest components, are more important. We prove that for a multivariate normal or Laplace distribution, we obtain boxes of optimal volume by implementing "pettiest component analysis", in the sense that their volume is minimal over all possible boxes with the same number of dimensions and fixed probability. This reduction in volume produces an information gain that is measured using active information. We illustrate our results with a simulation and a search for modal patterns of digitized images of hand-written numbers using the famous MNIST database; in both cases pettiest components work better than their competitors. In fact, we show that modes obtained with pettiest components generate better written digits for MNIST than principal components.

preprint2022arXiv

Sometimes size does not matter

Cosmological fine-tuning has traditionally been associated with the narrowness of the intervals in which the parameters of the physical models must be located to make life possible. A more thorough approach focuses on the probability of the interval, not on its size. Most attempts to measure the probability of the life-permitting interval for a given parameter rely on a Bayesian statistical approach for which the prior distribution of the parameter is uniform. However, the parameters in these models often take values in spaces of infinite size, so that a uniformity assumption is not possible. This is known as the normalization problem. This paper explains a framework to measure tuning that, among others, deals with normalization, assuming that the prior distribution belongs to a class of maximum entropy (maxent) distributions. By analyzing an upper bound of the tuning probability for this class of distributions the method solves the so-called weak anthropic principle, and offer a solution, at least in this context, to the well-known lack of invariance of maxent distributions. The implication of this approach is that, since all mathematical models need parameters, tuning is not only a question of natural science, but also a problem of mathematical modeling. Cosmological tuning is thus a particular instantiation of a more general scenario. Therefore, whenever a mathematical model is used to describe nature, not only in physics but in all of science, tuning is present. And the question of whether the tuning is fine or coarse for a given parameter -- if the interval in which the parameter is located has low or high probability, respectively -- depends crucially not only on the interval but also on the assumed class of prior distributions. Novel upper bounds for tuning probabilities are presented.

preprint2021arXiv

A simple correction for COVID-19 sampling bias

COVID-19 testing has become a standard approach for estimating prevalence which then assist in public health decision making to contain and mitigate the spread of the disease. The sampling designs used are often biased in that they do not reflect the true underlying populations. For instance, individuals with strong symptoms are more likely to be tested than those with no symptoms. This results in biased estimates of prevalence (too high). Typical post-sampling corrections are not always possible. Here we present a simple bias correction methodology derived and adapted from a correction for publication bias in meta analysis studies. The methodology is general enough to allow a wide variety of customization making it more useful in practice. Implementation is easily done using already collected information. Via a simulation and two real datasets, we show that the bias corrections can provide dramatic reductions in estimation error.

preprint2021arXiv

Active information requirements for fixation on the Wright-Fisher model of population genetics

In the context of population genetics, active information can be extended to measure the change of information of a given event (e.g., fixation of an allele) from a neutral model in which only genetic drift is taken into account to a non-neutral model that includes other sources of frequency variation (e.g., selection and mutation). In this paper we illustrate active information in population genetics through the Wright-Fisher model.

preprint2015arXiv

$F$ tests for the strip-split plot design

In this article we present the structure of the $F$ tests, the variance components and the approximate degrees of freedom for each of the eight possible mixed models of the strip-split plot design. We present an example to illustrate the model and compare it to more traditional settings like a three-way factorial design and a split-split plot model.

Daniel Andrés Díaz-Pachón

What is connected

Connect this record

See the researcher in context

Building this map preview

7 published item(s)

"Back to the future" projections for COVID-19 surges

Active information, missing data and prevalence estimation

High Dimensional Mode Hunting Using Pettiest Components Analysis

Sometimes size does not matter

A simple correction for COVID-19 sampling bias

Active information requirements for fixation on the Wright-Fisher model of population genetics

$F$ tests for the strip-split plot design