Source author record

Georg M. Goerg

Georg M. Goerg appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Catalog footprint

What is connected

9 works
9 topics
4 close collaborators

Published work

9 published items

Preprint, 2016, arXiv

Rebuttal of the 'Letter to the Editor' of Annals of Applied Statistics on Lambert W x F Distributions and the IGMM Algorithm

I discuss comments and claims made in Stehlik and Hermann (2015) about skewed Lambert W x F random variables and the IGMM algorithm. I clarify misunderstandings about the definition and use of Lambert W x F distributions and show that most of their empirical results cannot be reproduced. I also introduce a variant of location-scale Lambert W x F distributions that are well-defined for random variables X ~ F with non-finite mean and variance.

Preprint, 2015, arXiv

Acknowledgment of priority: Usage of the Lambert W function in statistics

In my 2011 Annals of Applied Statistics article [Goerg (2011)] I wrote that "Whereas the Lambert $W$ function plays an important role in mathematics, physics, chemistry, biology and other fields, it has not yet been used in statistics." This was incorrect. At the time of publication I was unaware of Stehlík (2003), who used the Lambert $W$ function to derive the exact distribution of the likelihood ratio test statistic. He has also used it in more recent work such as Stehlík (2006), Stehlík et al. (2010), or Stehlík (2014) amongst others. While Stehlík's use of the Lambert $W$ function was focused on the distribution of the likelihood ratio test statistic, my work dealt with the modeling of skewed random variables and symmetrizing data using the Lambert $W$ function as a variable transformation.

Preprint, 2014, arXiv

Escaping the poverty trap: modeling the interplay between economic growth and the ecology of infectious disease

The dynamics of economies and infectious disease are inexorably linked: economic well-being influences health (sanitation, nutrition, treatment capacity, etc.) and health influences economic well-being (labor productivity lost to sickness and disease). Often societies are locked into "poverty traps" of poor health and poor economy. Here, using a simplified coupled disease-economic model with endogenous capital growth we demonstrate the formation of poverty traps, as well as ways to escape them. We suggest two possible mechanisms of escape both motivated by empirical data: one, through an influx of capital (development aid), and another through changing the percentage of GDP spent on healthcare. We find that a large influx of capital is successful in escaping the poverty trap, but increasing health spending alone is not. Our results demonstrate that escape from a poverty trap may be possible, and carry important policy implications in the world-wide distribution of aid and within-country healthcare spending.

Preprint, 2013, arXiv

Forecastable Component Analysis (ForeCA)

I introduce Forecastable Component Analysis (ForeCA), a novel dimension reduction technique for temporally dependent signals. Based on a new forecastability measure, ForeCA finds an optimal transformation to separate a multivariate time series into a forecastable and an orthogonal white noise space. I present a converging algorithm with a fast eigenvector solution. Applications to financial and macro-economic time series show that ForeCA can successfully discover informative structure, which can be used for forecasting as well as classification. The R package ForeCA (http://cran.r-project.org/web/packages/ForeCA/index.html) accompanies this work and is publicly available on CRAN.

Preprint, 2013, arXiv

Mixed LICORS: A Nonparametric Algorithm for Predictive State Reconstruction

We introduce 'mixed LICORS', an algorithm for learning nonlinear, high-dimensional dynamics from spatio-temporal data, suitable for both prediction and simulation. Mixed LICORS extends the recent LICORS algorithm (Goerg and Shalizi, 2012) from hard clustering of predictive distributions to a non-parametric, EM-like soft clustering. This retains the asymptotic predictive optimality of LICORS, but, as we show in simulations, greatly improves out-of-sample forecasts with limited data. The new method is implemented in the publicly-available R package "LICORS" (http://cran.r-project.org/web/packages/LICORS/).

Preprint, 2012, arXiv

LICORS: Light Cone Reconstruction of States for Non-parametric Forecasting of Spatio-Temporal Systems

We present a new, non-parametric forecasting method for data where continuous values are observed discretely in space and time. Our method, "light-cone reconstruction of states" (LICORS), uses physical principles to identify predictive states which are local properties of the system, both in space and time. LICORS discovers the number of predictive states and their predictive distributions automatically, and consistently, under mild assumptions on the data source. We provide an algorithm to implement our method, along with a cross-validation scheme to pick control settings. Simulations show that CV-tuned LICORS outperforms standard methods in forecasting challenging spatio-temporal dynamics. Our work provides applied researchers with a new, highly automatic method to analyze and forecast spatio-temporal data.

Preprint, 2012, arXiv

The Lambert Way to Gaussianize heavy tailed data with the inverse of Tukey's h as a special case

I present a parametric, bijective transformation to generate heavy tail versions Y of arbitrary RVs X ~ F. The tail behavior of the so-called 'heavy tail Lambert W x F' RV Y depends on a tail parameter delta >= 0: for delta = 0, Y = X, for delta > 0 Y has heavier tails than X. For X being Gaussian, this meta-family of heavy-tailed distributions reduces to Tukey's h distribution. Lambert's W function provides an explicit inverse transformation, which can be estimated by maximum likelihood. This inverse can remove heavy tails from data, and also provide analytical expressions for the cumulative distribution (cdf) and probability density function (pdf). As a special case, these yield explicit formulas for Tukey's h pdf and cdf - to the author's knowledge for the first time in the literature. Simulations and applications to S&P 500 log-returns and solar flares data demonstrate the usefulness of the introduced methodology. The R package "LambertW" (cran.r-project.org/web/packages/LambertW) implementing the presented methodology is publicly available at CRAN.
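As a rough illustration of the transformation this abstract describes, the sketch below assumes the heavy-tail map takes the form z = u * exp(delta/2 * u^2) for a standardized input u, with inverse sign(z) * sqrt(W(delta * z^2) / delta). That formula is not stated in the abstract itself, so treat it as an assumption. The function names are hypothetical and are not the API of the LambertW R package; Lambert's W is approximated here with a plain Newton iteration.

```python
import math


def lambert_w0(x, tol=1e-14, max_iter=100):
    """Principal branch of Lambert's W for x >= 0, via Newton's method."""
    if x < 0:
        raise ValueError("this sketch only handles x >= 0")
    if x == 0:
        return 0.0
    # log(1 + x) lies at or above the root, so Newton descends monotonically.
    w = math.log1p(x)
    for _ in range(max_iter):
        ew = math.exp(w)
        step = (w * ew - x) / (ew * (w + 1.0))
        w -= step
        if abs(step) <= tol * (1.0 + abs(w)):
            break
    return w


def heavy_tail_transform(u, delta):
    """Assumed forward map: delta = 0 returns u, delta > 0 stretches the tails."""
    return u * math.exp(0.5 * delta * u * u)


def heavy_tail_inverse(z, delta):
    """Assumed inverse map, which pulls heavy tails back in."""
    if delta == 0:
        return z
    magnitude = math.sqrt(lambert_w0(delta * z * z) / delta)
    return math.copysign(magnitude, z)


if __name__ == "__main__":
    delta = 0.2
    for u in (-3.0, -1.0, 0.0, 0.5, 2.5):
        z = heavy_tail_transform(u, delta)
        print(u, z, heavy_tail_inverse(z, delta))
```

In this sketch the round trip recovers u up to floating-point error, which mirrors the abstract's point that the transformation is bijective and has an explicit inverse.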

Preprint, 2011, arXiv

A Nonparametric Frequency Domain EM Algorithm for Time Series Classification with Applications to Spike Sorting and Macro-Economics

I propose a frequency domain adaptation of the Expectation Maximization (EM) algorithm to group a family of time series in classes of similar dynamic structure. It does this by viewing the magnitude of the discrete Fourier transform (DFT) of each signal (or power spectrum) as a probability density/mass function (pdf/pmf) on the unit circle: signals with similar dynamics have similar pdfs; distinct patterns have distinct pdfs. An advantage of this approach is that it does not rely on any parametric form of the dynamic structure, but can be used for non-parametric, robust and model-free classification. This new method works for non-stationary signals of similar shape as well as stationary signals with similar auto-correlation structure. Applications to neural spike sorting (non-stationary) and pattern-recognition in socio-economic time series (stationary) demonstrate the usefulness and wide applicability of the proposed method.
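The central idea in this abstract, reading a signal's power spectrum as a probability mass function, can be sketched as follows. This is only the normalization step, not the EM classification algorithm. The function name `spectral_pmf` is hypothetical, and using the squared DFT magnitude without detrending or smoothing is an assumption of this sketch.

```python
import cmath
import math


def spectral_pmf(signal):
    """Normalize the squared DFT magnitudes of a signal so they sum to one."""
    n = len(signal)
    power = []
    for k in range(n):
        # Naive O(n^2) DFT coefficient at frequency index k.
        coef = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(signal)
        )
        power.append(abs(coef) ** 2)
    total = sum(power)
    if total == 0:
        raise ValueError("signal has zero power; the pmf is undefined")
    return [p / total for p in power]


if __name__ == "__main__":
    n = 64
    sine = [math.sin(2 * math.pi * 5 * t / n) for t in range(n)]
    pmf = spectral_pmf(sine)
    # Frequency indices carrying most of the probability mass.
    print(sorted(range(n), key=lambda k: pmf[k], reverse=True)[:2])
```

Two signals with similar dynamics would produce similar pmfs under this normalization, which is the property the abstract relies on for grouping time series.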

Preprint, 2011, arXiv

Lambert W random variables - a new family of generalized skewed distributions with applications to risk estimation

Originating from a system theory and an input/output point of view, I introduce a new class of generalized distributions. A parametric nonlinear transformation converts a random variable $X$ into a so-called Lambert $W$ random variable $Y$, which allows a very flexible approach to model skewed data. Its shape depends on the shape of $X$ and a skewness parameter $γ$. In particular, for symmetric $X$ and nonzero $γ$ the output $Y$ is skewed. Its distribution and density function are particular variants of their input counterparts. Maximum likelihood and method of moments estimators are presented, and simulations show that in the symmetric case additional estimation of $γ$ does not affect the quality of other parameter estimates. Applications in finance and biomedicine show the relevance of this class of distributions, which is particularly useful for slightly skewed data. A practical by-result of the Lambert $W$ framework: data can be "unskewed." The R package LambertW (http://cran.r-project.org/web/packages/LambertW) developed by the author is publicly available on CRAN (http://cran.r-project.org).