Source author record

Seth Strimas-Mackey

Seth Strimas-Mackey appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher | Unclaimed source record

cond-mat.stat-mech, hep-lat, hep-ph, hep-th, Machine Learning, math.ST, Methodology, Statistics Theory

Catalog footprint

What is connected

2 works

8 topics

4 close collaborators

Preprint, 2022, arXiv

Likelihood estimation of sparse topic distributions in topic models and its applications to Wasserstein document distance calculations

This paper studies the estimation of high-dimensional, discrete, possibly sparse, mixture models in topic models. The data consists of observed multinomial counts of $p$ words across $n$ independent documents. In topic models, the $p\times n$ expected word frequency matrix is assumed to be factorized as a $p\times K$ word-topic matrix $A$ and a $K\times n$ topic-document matrix $T$. Since columns of both matrices represent conditional probabilities belonging to probability simplices, columns of $A$ are viewed as $p$-dimensional mixture components that are common to all documents while columns of $T$ are viewed as the $K$-dimensional mixture weights that are document specific and are allowed to be sparse. The main interest is to provide sharp, finite-sample, $\ell_1$-norm convergence rates for estimators of the mixture weights $T$ when $A$ is either known or unknown. For known $A$, we suggest maximum likelihood estimation (MLE) of $T$. Our non-standard analysis of the MLE not only establishes its $\ell_1$ convergence rate, but reveals a remarkable property: the MLE, with no extra regularization, can be exactly sparse and contain the true zero pattern of $T$. We further show that the MLE is both minimax optimal and adaptive to the unknown sparsity in a large class of sparse topic distributions. When $A$ is unknown, we estimate $T$ by optimizing the likelihood function corresponding to a generic plug-in estimator $\hat{A}$ of $A$. For any estimator $\hat{A}$ that satisfies carefully detailed conditions for proximity to $A$, the resulting estimator of $T$ is shown to retain the properties established for the MLE. The ambient dimensions $K$ and $p$ are allowed to grow with the sample sizes. Our application is to the estimation of 1-Wasserstein distances between document-generating distributions. We propose, estimate and analyze new 1-Wasserstein distances between two probabilistic document representations.
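To make the known-$A$ setting concrete, the following minimal sketch computes the maximum likelihood estimate of one document's mixture weights with the standard EM (multiplicative) update for mixture proportions. It is an illustration under stated assumptions, not the authors' algorithm: the abstract does not say how the MLE is computed, and the function name `mle_topic_weights`, the list-of-rows layout of `A`, and the iteration count are all hypothetical choices made here.

```python
def mle_topic_weights(A, counts, iters=500):
    """EM iterations for the MLE of mixture weights t, given a known A.

    A:      list of p rows with K entries each; every column sums to 1
            (hypothetical layout chosen for this sketch).
    counts: list of p non-negative word counts for a single document.
    """
    p, K = len(A), len(A[0])
    total = sum(counts)
    freq = [c / total for c in counts]  # observed word frequencies
    t = [1.0 / K] * K                   # start in the interior of the simplex
    for _ in range(iters):
        # Fitted word probabilities (A t)_j under the current weights.
        fitted = [sum(A[j][k] * t[k] for k in range(K)) for j in range(p)]
        # Multiplicative EM update; the new weights again sum to 1.
        t = [
            t[k] * sum(freq[j] * A[j][k] / fitted[j]
                       for j in range(p) if freq[j] > 0)
            for k in range(K)
        ]
    return t


# Toy example: 3 words, 2 topics; the counts are consistent with topic 1 only.
A = [[0.5, 0.0],
     [0.5, 0.5],
     [0.0, 0.5]]
t_hat = mle_topic_weights(A, counts=[50, 50, 0])
```

In this toy example the weight of the second topic shrinks geometrically toward zero, which is consistent with the abstract's remark that the MLE can be sparse without extra regularization. Note that EM iterates stay strictly positive until they underflow, so recovering exact zeros in practice would need a different solver or a thresholding step, neither of which is described in the abstract.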

Preprint, 2013, arXiv

Deconfinement in N=1 super Yang-Mills theory on R^3 x S^1 via dual-Coulomb gas and "affine" XY-model

We study finite-temperature N=1 SU(2) super Yang-Mills theory, compactified on a spatial circle of size L with supersymmetric boundary conditions. In the semiclassical small-L regime, a deconfinement transition occurs at T_c << 1/L. The transition is due to a competition between non-perturbative topological "molecules" (magnetic and neutral bion-instantons) and electrically charged W-bosons and superpartners. Compared to deconfinement in non-supersymmetric QCD(adj) [arXiv:1112.6389], the novelty is the relevance of the light modulus scalar field. It mediates interactions between neutral bions (and W-bosons), serves as an order parameter for the Z_2^{L} center symmetry associated with the non-thermal circle, and explicitly breaks the electric-magnetic (Kramers-Wannier) duality enjoyed by non-supersymmetric QCD(adj) near T_c. We show that deconfinement can be studied using an effective two-dimensional gas of electric and magnetic charges with (dual) Coulomb and Aharonov-Bohm interactions, or, equivalently, via an XY-spin model with a symmetry-breaking perturbation, where each system couples to the scalar field. To study the realization of the discrete R-symmetry and the Z_2^{beta} thermal and Z_2^{L} non-thermal center symmetries, we perform Monte Carlo simulations of both systems. The dual-Coulomb gas simulations are a novel way to analyze deconfinement and provide a new venue to study the phase structure of a class of two-dimensional condensed matter models that can be mapped into dual-Coulomb gases. Our results indicate a continuous deconfinement transition, with Z_2^{L} remaining unbroken at the transition. Thus, the SYM transition appears similar to the one in SU(2) QCD(adj) [arXiv:1112.6389] and is also likely to be characterized by continuously varying critical exponents.
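As a rough illustration of what a Monte Carlo simulation of an XY-spin model with a symmetry-breaking perturbation can look like, here is a generic Metropolis sketch on a small periodic square lattice. It is not the authors' simulation: the energy function, the perturbation order `p`, the couplings `J` and `h`, and all function names are assumptions made for this example, and the coupling to the scalar field mentioned in the abstract is omitted entirely.

```python
import math
import random


def energy(theta, J, h, p):
    """Total energy: -J * sum over bonds of cos(difference) - h * sum of cos(p * angle)."""
    L = len(theta)
    e = 0.0
    for i in range(L):
        for j in range(L):
            t = theta[i][j]
            # Count each bond once via the right and down neighbours.
            e -= J * (math.cos(t - theta[i][(j + 1) % L])
                      + math.cos(t - theta[(i + 1) % L][j]))
            e -= h * math.cos(p * t)  # symmetry-breaking term (assumed form)
    return e


def local_energy(theta, i, j, t, J, h, p):
    """Energy terms that involve site (i, j) when its angle is t."""
    L = len(theta)
    nbrs = (theta[(i + 1) % L][j], theta[(i - 1) % L][j],
            theta[i][(j + 1) % L], theta[i][(j - 1) % L])
    return -J * sum(math.cos(t - n) for n in nbrs) - h * math.cos(p * t)


def metropolis_sweep(theta, beta, J, h, p, rng, step=1.0):
    """One sweep of L*L single-site Metropolis updates; returns the energy change."""
    L = len(theta)
    delta_total = 0.0
    for _ in range(L * L):
        i, j = rng.randrange(L), rng.randrange(L)
        old = theta[i][j]
        new = (old + rng.uniform(-step, step)) % (2 * math.pi)
        dE = (local_energy(theta, i, j, new, J, h, p)
              - local_energy(theta, i, j, old, J, h, p))
        # Accept downhill moves always, uphill moves with Boltzmann probability.
        if dE <= 0 or rng.random() < math.exp(-beta * dE):
            theta[i][j] = new
            delta_total += dE
    return delta_total


# Usage: random start on a 4 x 4 lattice, a few sweeps at inverse temperature 2.
rng = random.Random(0)
L = 4
theta = [[rng.uniform(0, 2 * math.pi) for _ in range(L)] for _ in range(L)]
for _ in range(20):
    metropolis_sweep(theta, beta=2.0, J=1.0, h=0.5, p=2, rng=rng)
```

A study of the kind the abstract describes would measure order parameters for the relevant discrete symmetries across a range of temperatures and lattice sizes; this sketch only shows the basic update step.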