Source author record

John Hoffman

John Hoffman appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, Unclaimed source record

Artificial Intelligence, astro-ph.IM, Machine Learning, Applications, astro-ph.HE, astro-ph.SR, Computation and Language, Computer Science and Game Theory, math.AP, math.CA, math.MG, physics.soc-ph

Catalog footprint

What is connected

8 works

12 topics

4 close collaborators


preprint, 2022, arXiv

No Language Left Behind: Scaling Human-Centered Machine Translation

Driven by the goal of eradicating language barriers on a global scale, machine translation has solidified itself as a key focus of artificial intelligence research today. However, such efforts have coalesced around a small subset of languages, leaving behind the vast majority of mostly low-resource languages. What does it take to break the 200 language barrier while ensuring safe, high quality results, all while keeping ethical considerations in mind? In No Language Left Behind, we took on this challenge by first contextualizing the need for low-resource language translation support through exploratory interviews with native speakers. Then, we created datasets and models aimed at narrowing the performance gap between low and high-resource languages. More specifically, we developed a conditional compute model based on Sparsely Gated Mixture of Experts that is trained on data obtained with novel and effective data mining techniques tailored for low-resource languages. We propose multiple architectural and training improvements to counteract overfitting while training on thousands of tasks. Critically, we evaluated the performance of over 40,000 different translation directions using a human-translated benchmark, Flores-200, and combined human evaluation with a novel toxicity benchmark covering all languages in Flores-200 to assess translation safety. Our model achieves an improvement of 44% BLEU relative to the previous state-of-the-art, laying important groundwork towards realizing a universal translation system. Finally, we open source all contributions described in this work, accessible at https://github.com/facebookresearch/fairseq/tree/nllb.

preprint, 2021, arXiv

A Fast Template Periodogram for Detecting Non-sinusoidal Fixed-shape Signals in Irregularly Sampled Time Series

Astrophysical time series often contain periodic signals. The large and growing volume of time series data from photometric surveys demands computationally efficient methods for detecting and characterizing such signals. The most efficient algorithms available for this purpose are those that exploit the $\mathcal{O}(N\log N)$ scaling of the Fast Fourier Transform (FFT). However, these methods are not optimal for non-sinusoidal signal shapes. Template fits (or periodic matched filters) optimize sensitivity for a priori known signal shapes but at a significant computational cost. Current implementations of template periodograms scale as $\mathcal{O}(N_f N_{obs})$, where $N_f$ is the number of trial frequencies and $N_{obs}$ is the number of lightcurve observations, and due to non-convexity, they do not guarantee the best fit at each trial frequency, which can lead to spurious results. In this work, we present a non-linear extension of the Lomb-Scargle periodogram to obtain a template-fitting algorithm that is both accurate (globally optimal solutions are obtained except in pathological cases) and computationally efficient (scaling as $\mathcal{O}(N_f\log N_f)$ for a given template). The non-linear optimization of the template fit at each frequency is recast as a polynomial zero-finding problem, where the coefficients of the polynomial can be computed efficiently with the non-equispaced fast Fourier transform. We show that our method, which uses truncated Fourier series to approximate templates, is an order of magnitude faster than existing algorithms for small problems ($N\lesssim 10$ observations) and 2 orders of magnitude faster for long base-line time series with $N_{obs} \gtrsim 10^4$ observations. An open-source implementation of the fast template periodogram is available at https://www.github.com/PrincetonUniversity/FastTemplatePeriodogram.
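For orientation, the brute-force baseline that the abstract contrasts against can be sketched as follows. This is not the fast algorithm from the paper: it is a naive grid search over frequency and phase, with amplitude and offset fitted by linear least squares, so its cost grows with the number of trial frequencies times the number of observations. The function names, the three-harmonic example template, and the positive-amplitude convention are assumptions made for illustration.

```python
import math


def example_template(phase, harmonics=3):
    """Hypothetical template: a truncated Fourier series evaluated at phase in [0, 1)."""
    return sum(math.sin(2 * math.pi * n * phase) / n for n in range(1, harmonics + 1))


def fit_amplitude_offset(m, y):
    """Least-squares a, c for y ~ a * m + c; returns (a, c, residual sum of squares)."""
    n = len(y)
    m_mean, y_mean = sum(m) / n, sum(y) / n
    s_mm = sum((mi - m_mean) ** 2 for mi in m)
    s_my = sum((mi - m_mean) * (yi - y_mean) for mi, yi in zip(m, y))
    a = s_my / s_mm if s_mm > 0 else 0.0
    c = y_mean - a * m_mean
    rss = sum((yi - a * mi - c) ** 2 for mi, yi in zip(m, y))
    return a, c, rss


def template_periodogram(t, y, freqs, template, n_phase=32):
    """Fraction of variance explained by the best template fit at each trial frequency."""
    y_mean = sum(y) / len(y)
    tss = sum((yi - y_mean) ** 2 for yi in y)
    powers = []
    for f in freqs:
        best_rss = tss  # a constant model is the fallback
        for k in range(n_phase):
            shift = k / n_phase
            m = [template((f * ti - shift) % 1.0) for ti in t]
            a, _, rss = fit_amplitude_offset(m, y)
            # Assumed convention: only positive amplitudes count as a match,
            # so an inverted template is not reported as a detection.
            if a > 0:
                best_rss = min(best_rss, rss)
        powers.append(1.0 - best_rss / tss)
    return powers
```

The phase grid is what makes this baseline both slow and only approximately optimal; the abstract's polynomial zero-finding formulation is described as avoiding exactly that search.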

preprint, 2021, arXiv

On Big Pieces approximations of parabolic hypersurfaces

Let $\Sigma$ be a closed subset of $\mathbb{R}^{n+1}$ which is parabolic Ahlfors-David regular and assume that $\Sigma$ satisfies a 2-sided corkscrew condition. Assume, in addition, that $\Sigma$ is either time-forwards Ahlfors-David regular, time-backwards Ahlfors-David regular, or parabolic uniformly rectifiable. We first prove that $\Sigma$ satisfies a weak synchronized two cube condition. Based on this we are able to revisit the argument in [NS] and prove that $\Sigma$ contains uniform big pieces of Lip(1,1/2) graphs. When $\Sigma$ is parabolic uniformly rectifiable the construction can be refined, and in this case we prove that $\Sigma$ contains uniform big pieces of regular parabolic Lip(1,1/2) graphs. Similar results hold if $\Omega\subset\mathbb{R}^{n+1}$ is a connected component of $\mathbb{R}^{n+1}\setminus\Sigma$, and in this context we also give a parabolic counterpart of the main result in [AHMNT] by proving that if $\Omega$ is a one-sided parabolic chord arc domain, and if $\Sigma$ is parabolic uniformly rectifiable, then $\Omega$ is in fact a parabolic chord arc domain. Our results give a flexible parabolic version of the classical (elliptic) result of G. David and D. Jerison concerning the existence of uniform big pieces of Lipschitz graphs for sets satisfying a two disc condition.

preprint, 2021, arXiv

Online and Scalable Model Selection with Multi-Armed Bandits

Many online applications running on live traffic are powered by machine learning models, for which training, validation, and hyper-parameter tuning are conducted on historical data. However, it is common for models demonstrating strong performance in offline analysis to yield poorer performance when deployed online. This problem is a consequence of the difficulty of training on historical data in non-stationary environments. Moreover, the machine learning metrics used for model selection may not sufficiently correlate with real-world business metrics used to determine the success of the applications being tested. These problems are particularly prominent in the Real-Time Bidding (RTB) domain, in which ML models power bidding strategies, and a change in models will likely affect performance of the advertising campaigns. In this work, we present Automatic Model Selector (AMS), a system for scalable online selection of RTB bidding strategies based on real-world performance metrics. AMS employs Multi-Armed Bandits (MAB) to near-simultaneously run and evaluate multiple models against live traffic, allocating the most traffic to the best-performing models while decreasing traffic to those with poorer online performance, thereby minimizing the impact of inferior models on overall campaign performance. The reliance on offline data is avoided, instead making model selections on a case-by-case basis according to actionable business goals. AMS allows new models to be safely introduced into live campaigns as soon as they are developed, minimizing the risk to overall performance. In live-traffic tests on multiple ad campaigns, the AMS system proved highly effective at improving ad campaign performance.
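The abstract does not say which bandit algorithm AMS uses. As a minimal sketch of the general idea (shifting traffic toward the models that perform best online), the following uses Beta-Bernoulli Thompson sampling, which is an assumption chosen for illustration. The class name, the model names, and the simulated success rates are all hypothetical.

```python
import random


class ThompsonSelector:
    """Beta-Bernoulli Thompson sampling over a fixed set of candidate models."""

    def __init__(self, model_names, seed=0):
        self.rng = random.Random(seed)
        # One [alpha, beta] pair per model, starting from a uniform Beta(1, 1) prior.
        self.posteriors = {name: [1, 1] for name in model_names}

    def choose(self):
        # Draw a plausible success rate for each model and serve the largest draw.
        draws = {
            name: self.rng.betavariate(a, b)
            for name, (a, b) in self.posteriors.items()
        }
        return max(draws, key=draws.get)

    def update(self, name, success):
        # A success increments alpha, a failure increments beta.
        self.posteriors[name][0 if success else 1] += 1


def simulate(true_rates, rounds=5000, seed=0):
    """Route simulated requests and count how many each model served."""
    selector = ThompsonSelector(list(true_rates), seed=seed)
    environment = random.Random(seed + 1)
    served = {name: 0 for name in true_rates}
    for _ in range(rounds):
        name = selector.choose()
        served[name] += 1
        selector.update(name, environment.random() < true_rates[name])
    return served
```

Calling `simulate({"model_a": 0.05, "model_b": 0.30})` returns per-model request counts. In this toy setting the reward is a binary outcome per request; a production system would substitute whatever business metric defines success for the campaign.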

preprint, 2020, arXiv

Coronizations and big pieces in metric spaces

We prove that coronizations with respect to arbitrary d-regular sets (not necessarily graphs) imply big pieces squared of these (approximating) sets. This is known (due to David and Semmes in the case of sufficiently large co-dimension, and to Azzam and Schul in general) in the classical setting of Euclidean spaces with Hausdorff measure of integer dimension, where the approximating sets are Lipschitz graphs. Our result is a far-reaching generalization of these results, and we prove that "coronizations imply big pieces squared" is a generic property. In particular, our result applies, when suitably interpreted, in metric spaces having a fixed positive (perhaps non-integer) dimension, equipped with a Borel regular measure and with arbitrary approximating sets. As a novel application we highlight how to utilize this general setting in the context of parabolic uniform rectifiability.

preprint, 2020, arXiv

Dynamic Bidding Strategies with Multivariate Feedback Control for Multiple Goals in Display Advertising

Real-Time Bidding (RTB) display advertising is a method for purchasing display advertising inventory in auctions that occur within milliseconds. The performance of RTB campaigns is generally measured with a series of Key Performance Indicators (KPIs): measurements used to ensure that the campaign is cost-effective and that it is purchasing valuable inventory. While an RTB campaign should ideally meet all KPIs, simultaneous improvement tends to be very challenging, as an improvement to any one KPI risks a detrimental effect on the others. Here we present an approach to simultaneously controlling multiple KPIs with a PID-based feedback-control system. This method generates a control score for each KPI, based on both the output of a PID controller module and a metric that quantifies the importance of each KPI for internal business needs. At regular intervals, this algorithm, Sequential Control, will choose the KPI with the greatest overall need for improvement. In this way, our algorithm is able to continually seek the greatest marginal improvements to its current state. Multiple methods of control can be associated with each KPI, and can be triggered either simultaneously or chosen stochastically, in order to avoid local optima. In both offline ad bidding simulations and testing on live traffic, our methods proved to be effective in simultaneously controlling multiple KPIs, and bringing them toward their respective goals.
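The selection step described in this abstract can be sketched roughly as follows. The abstract does not give the exact scoring formula, so combining the magnitude of the PID output with an importance weight by multiplication is an assumption, as are the class name, the KPI names, the gains, and the use of a normalized error (distance from goal as a fraction of the goal).

```python
class PID:
    """Textbook discrete PID controller (proportional, integral, derivative terms)."""

    def __init__(self, kp, ki, kd):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.integral = 0.0
        self.prev_error = 0.0

    def step(self, error, dt=1.0):
        self.integral += error * dt
        derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def pick_kpi(controllers, errors, importance):
    """Return the KPI with the largest control score, plus all scores.

    Assumed score: |PID output| * importance weight. The abstract only states
    that the score depends on both quantities.
    """
    scores = {
        kpi: abs(controllers[kpi].step(errors[kpi])) * importance[kpi]
        for kpi in controllers
    }
    return max(scores, key=scores.get), scores


# Hypothetical KPIs with normalized errors and business weights.
controllers = {"cost_per_click": PID(1.0, 0.0, 0.0), "click_rate": PID(1.0, 0.0, 0.0)}
errors = {"cost_per_click": 0.5, "click_rate": 0.1}
importance = {"cost_per_click": 1.0, "click_rate": 2.0}
chosen, scores = pick_kpi(controllers, errors, importance)
```

Each call to `pick_kpi` advances every controller by one interval, so it is meant to be invoked once per control period with fresh error measurements.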

preprint, 2016, arXiv

The Astropy Problem

The Astropy Project (http://astropy.org) is, in its own words, "a community effort to develop a single core package for Astronomy in Python and foster interoperability between Python astronomy packages." For five years this project has been managed, written, and operated as a grassroots, self-organized, almost entirely volunteer effort while the software is used by the majority of the astronomical community. Despite this, the project has always been and remains to this day effectively unfunded. Further, contributors receive little or no formal recognition for creating and supporting what is now critical software. This paper explores the problem in detail, outlines possible solutions to correct this, and presents a few suggestions on how to address the sustainability of general purpose astronomical software.

preprint, 2012, arXiv

Constraints on the ubiquity of coronal X-ray cycles

Stellar activity cycles are known to be a widespread phenomenon amongst moderately active solar- and late-type stars from long-term periodic variations in chromospheric Ca II H and K emission lines, yet to date only a handful of coronal X-ray cycles are known. We have surveyed serendipitously observed stellar sources in fields observed multiple times in the last decade by XMM-Newton and present our analysis of 9 stars from 6 fields. Since our sample is flux-limited, it is strongly biased towards higher levels of X-ray activity. We fit a single-temperature APEC spectrum to each source and search for significant periodicities using a Lomb-Scargle Periodogram (LSP). We use a Monte Carlo (MC) algorithm to yield robust analysis of the statistical significance of cycle detections and non-detections. None of the 9 stellar lightcurves shows any convincing indication of periodicity. Using MC simulations, we characterize the detection capabilities of our methodology and, assuming a uniform distribution of cycle periods and strengths over the domain searched, we conclude with 95% confidence that fewer than 72% of the stars represented by our sample of active stars have 5-13 year coronal X-ray cycles.
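A rough sketch of the kind of analysis this abstract describes, pairing a Lomb-Scargle periodogram with a Monte Carlo significance estimate, is given below. The periodogram is the classical variance-normalized form. The significance test shuffles the measured values over the fixed observation times, which is a common generic choice and not necessarily the MC procedure used in the paper. Function names and parameters are illustrative.

```python
import math
import random


def lomb_scargle(t, y, omegas):
    """Classical Lomb-Scargle power at each nonzero angular frequency in omegas."""
    n = len(t)
    mean = sum(y) / n
    var = sum((v - mean) ** 2 for v in y) / (n - 1)
    resid = [v - mean for v in y]
    powers = []
    for w in omegas:
        # The time offset tau makes the sine and cosine terms orthogonal.
        s2 = sum(math.sin(2 * w * ti) for ti in t)
        c2 = sum(math.cos(2 * w * ti) for ti in t)
        tau = math.atan2(s2, c2) / (2 * w)
        cos_terms = [math.cos(w * (ti - tau)) for ti in t]
        sin_terms = [math.sin(w * (ti - tau)) for ti in t]
        yc = sum(r * c for r, c in zip(resid, cos_terms))
        ys = sum(r * s for r, s in zip(resid, sin_terms))
        cc = sum(c * c for c in cos_terms)
        ss = sum(s * s for s in sin_terms)
        powers.append((yc * yc / cc + ys * ys / ss) / (2 * var))
    return powers


def false_alarm_probability(t, y, omegas, trials=200, seed=0):
    """Estimate how often shuffled data produce a peak at least as high as observed."""
    rng = random.Random(seed)
    observed = max(lomb_scargle(t, y, omegas))
    shuffled = list(y)
    exceed = 0
    for _ in range(trials):
        rng.shuffle(shuffled)  # destroys any periodicity, keeps the sampling times
        if max(lomb_scargle(t, shuffled, omegas)) >= observed:
            exceed += 1
    # Add-one correction so the estimate is never exactly zero.
    return observed, (exceed + 1) / (trials + 1)
```

Taking the maximum over the whole frequency grid in each trial accounts for the number of frequencies searched, so the returned probability refers to the highest peak anywhere in the grid.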
