Researcher profile

John Hoffman

John Hoffman contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
6 works
0 followers
9 topics
4 close collaborators

Published work

6 published items

preprint, 2022, arXiv

No Language Left Behind: Scaling Human-Centered Machine Translation

Driven by the goal of eradicating language barriers on a global scale, machine translation has solidified itself as a key focus of artificial intelligence research today. However, such efforts have coalesced around a small subset of languages, leaving behind the vast majority of mostly low-resource languages. What does it take to break the 200 language barrier while ensuring safe, high quality results, all while keeping ethical considerations in mind? In No Language Left Behind, we took on this challenge by first contextualizing the need for low-resource language translation support through exploratory interviews with native speakers. Then, we created datasets and models aimed at narrowing the performance gap between low and high-resource languages. More specifically, we developed a conditional compute model based on Sparsely Gated Mixture of Experts that is trained on data obtained with novel and effective data mining techniques tailored for low-resource languages. We propose multiple architectural and training improvements to counteract overfitting while training on thousands of tasks. Critically, we evaluated the performance of over 40,000 different translation directions using a human-translated benchmark, Flores-200, and combined human evaluation with a novel toxicity benchmark covering all languages in Flores-200 to assess translation safety. Our model achieves an improvement of 44% BLEU relative to the previous state-of-the-art, laying important groundwork towards realizing a universal translation system. Finally, we open source all contributions described in this work, accessible at https://github.com/facebookresearch/fairseq/tree/nllb.
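
The "conditional compute" idea behind a sparsely gated mixture of experts can be illustrated with a toy example. The sketch below assumes top-2 gating with a softmax over the selected experts, which is one common formulation; the abstract does not specify the gating details, and the names `top_k_gate` and `moe_forward`, the scalar experts, and the hand-supplied gate logits are hypothetical simplifications (in a real model a learned router computes the logits from the input).

```python
import math

def top_k_gate(logits, k=2):
    """Keep the k largest gate logits and softmax over them only."""
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    peak = max(logits[i] for i in top)  # subtract the max for numerical stability
    exps = {i: math.exp(logits[i] - peak) for i in top}
    total = sum(exps.values())
    return {i: e / total for i, e in exps.items()}

def moe_forward(x, experts, gate_logits, k=2):
    """Evaluate only the selected experts and mix their outputs by gate weight."""
    weights = top_k_gate(gate_logits, k)
    # Experts outside the top k are never called: this is the conditional compute.
    return sum(w * experts[i](x) for i, w in weights.items())

# Hypothetical scalar "experts" standing in for feed-forward sub-networks.
experts = [lambda x: x, lambda x: 2 * x, lambda x: 10 * x]
output = moe_forward(4.0, experts, gate_logits=[0.0, 0.0, -5.0])
```

Because only k experts run per input, the parameter count can grow with the number of experts while the per-input compute stays roughly fixed.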

preprint, 2021, arXiv

A Fast Template Periodogram for Detecting Non-sinusoidal Fixed-shape Signals in Irregularly Sampled Time Series

Astrophysical time series often contain periodic signals. The large and growing volume of time series data from photometric surveys demands computationally efficient methods for detecting and characterizing such signals. The most efficient algorithms available for this purpose are those that exploit the $\mathcal{O}(N\log N)$ scaling of the Fast Fourier Transform (FFT). However, these methods are not optimal for non-sinusoidal signal shapes. Template fits (or periodic matched filters) optimize sensitivity for a priori known signal shapes but at a significant computational cost. Current implementations of template periodograms scale as $\mathcal{O}(N_f N_{obs})$, where $N_f$ is the number of trial frequencies and $N_{obs}$ is the number of lightcurve observations, and due to non-convexity, they do not guarantee the best fit at each trial frequency, which can lead to spurious results. In this work, we present a non-linear extension of the Lomb-Scargle periodogram to obtain a template-fitting algorithm that is both accurate (globally optimal solutions are obtained except in pathological cases) and computationally efficient (scaling as $\mathcal{O}(N_f\log N_f)$ for a given template). The non-linear optimization of the template fit at each frequency is recast as a polynomial zero-finding problem, where the coefficients of the polynomial can be computed efficiently with the non-equispaced fast Fourier transform. We show that our method, which uses truncated Fourier series to approximate templates, is an order of magnitude faster than existing algorithms for small problems ($N\lesssim 10$ observations) and 2 orders of magnitude faster for long base-line time series with $N_{obs} \gtrsim 10^4$ observations. An open-source implementation of the fast template periodogram is available at https://www.github.com/PrincetonUniversity/FastTemplatePeriodogram.
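
For orientation, the classical Lomb-Scargle periodogram that this abstract extends can be evaluated directly, doing work proportional to $N_f \times N_{obs}$. The sketch below is that sinusoidal baseline in pure Python, not the fast template method described above; the function name `lomb_scargle` and the choice to normalize the power to the range 0 to 1 are assumptions made for illustration.

```python
import math

def lomb_scargle(times, values, freqs):
    """Classical Lomb-Scargle power, normalized to [0, 1], at each trial frequency.

    Direct evaluation: one pass over all observations per (positive) frequency.
    """
    mean = sum(values) / len(values)
    y = [v - mean for v in values]
    yy = sum(v * v for v in y)
    powers = []
    for f in freqs:
        w = 2.0 * math.pi * f
        # Phase offset tau makes the sine and cosine terms orthogonal.
        s2 = sum(math.sin(2.0 * w * t) for t in times)
        c2 = sum(math.cos(2.0 * w * t) for t in times)
        tau = math.atan2(s2, c2) / (2.0 * w)
        yc = ys = cc = ss = 0.0
        for t, v in zip(times, y):
            c = math.cos(w * (t - tau))
            s = math.sin(w * (t - tau))
            yc += v * c
            ys += v * s
            cc += c * c
            ss += s * s
        powers.append((yc * yc / cc + ys * ys / ss) / yy)
    return powers
```

Calling `lomb_scargle(times, values, freqs)` on irregularly sampled data returns one power per trial frequency, and the frequency with the largest power is the best sinusoidal candidate. A template periodogram replaces the single sinusoid with a fixed non-sinusoidal shape, which is where the non-linear optimization described in the abstract comes in.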

preprint, 2021, arXiv

On Big Pieces approximations of parabolic hypersurfaces

Let $\Sigma$ be a closed subset of $\mathbb{R}^{n+1}$ which is parabolic Ahlfors-David regular and assume that $\Sigma$ satisfies a 2-sided corkscrew condition. Assume, in addition, that $\Sigma$ is either time-forwards Ahlfors-David regular, time-backwards Ahlfors-David regular, or parabolic uniformly rectifiable. We then first prove that $\Sigma$ satisfies a weak synchronized two cube condition. Based on this we are able to revisit the argument in [NS] and prove that $\Sigma$ contains uniform big pieces of Lip(1,1/2) graphs. When $\Sigma$ is parabolic uniformly rectifiable the construction can be refined and in this case we prove that $\Sigma$ contains uniform big pieces of regular parabolic Lip(1,1/2) graphs. Similar results hold if $\Omega\subset\mathbb{R}^{n+1}$ is a connected component of $\mathbb{R}^{n+1}\setminus\Sigma$ and in this context we also give a parabolic counterpart of the main result in [AHMNT] by proving that if $\Omega$ is a one-sided parabolic chord arc domain, and if $\Sigma$ is parabolic uniformly rectifiable, then $\Omega$ is in fact a parabolic chord arc domain. Our results give a flexible parabolic version of the classical (elliptic) result of G. David and D. Jerison concerning the existence of uniform big pieces of Lipschitz graphs for sets satisfying a two disc condition.

preprint, 2021, arXiv

Online and Scalable Model Selection with Multi-Armed Bandits

Many online applications running on live traffic are powered by machine learning models, for which training, validation, and hyper-parameter tuning are conducted on historical data. However, it is common for models demonstrating strong performance in offline analysis to yield poorer performance when deployed online. This problem is a consequence of the difficulty of training on historical data in non-stationary environments. Moreover, the machine learning metrics used for model selection may not sufficiently correlate with real-world business metrics used to determine the success of the applications being tested. These problems are particularly prominent in the Real-Time Bidding (RTB) domain, in which ML models power bidding strategies, and a change in models will likely affect performance of the advertising campaigns. In this work, we present Automatic Model Selector (AMS), a system for scalable online selection of RTB bidding strategies based on real-world performance metrics. AMS employs Multi-Armed Bandits (MAB) to near-simultaneously run and evaluate multiple models against live traffic, allocating the most traffic to the best-performing models while decreasing traffic to those with poorer online performance, thereby minimizing the impact of inferior models on overall campaign performance. The reliance on offline data is avoided, instead making model selections on a case-by-case basis according to actionable business goals. AMS allows new models to be safely introduced into live campaigns as soon as they are developed, minimizing the risk to overall performance. In live-traffic tests on multiple ad campaigns, the AMS system proved highly effective at improving ad campaign performance.
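
The traffic-allocation idea in this abstract can be illustrated with a small simulation. The sketch below assumes Thompson sampling over Bernoulli rewards, which is one common bandit policy; the abstract does not say which policy AMS uses, and the function name `thompson_allocate`, the simulated success rates, and the binary reward are hypothetical stand-ins for live campaign metrics.

```python
import random

def thompson_allocate(true_rates, rounds, seed=0):
    """Simulate routing `rounds` requests across models with Thompson sampling.

    true_rates[i] is the (hidden) success probability of model i.
    Returns the number of requests each model received.
    """
    rng = random.Random(seed)
    n = len(true_rates)
    successes = [0] * n
    failures = [0] * n
    pulls = [0] * n
    for _ in range(rounds):
        # Draw a plausible success rate for each model from its Beta posterior.
        samples = [rng.betavariate(successes[i] + 1, failures[i] + 1) for i in range(n)]
        arm = max(range(n), key=samples.__getitem__)
        pulls[arm] += 1
        # Observe a simulated binary outcome and update that model's posterior.
        if rng.random() < true_rates[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
    return pulls

# Three hypothetical models; the third performs best online.
pulls = thompson_allocate([0.1, 0.5, 0.9], rounds=2000)
```

Early on, every model receives some traffic; as evidence accumulates, weaker models are sampled less often, which limits their effect on overall performance while still allowing a newly added model to prove itself.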

preprint, 2020, arXiv

Coronizations and big pieces in metric spaces

We prove that coronizations with respect to arbitrary d-regular sets (not necessarily graphs) imply big pieces squared of these (approximating) sets. This is known (and due to David and Semmes in the case of sufficiently large co-dimension, and to Azzam and Schul in general) in the (classical) setting of Euclidean spaces with Hausdorff measure of integer dimension, where the approximating sets are Lipschitz graphs. Our result is a far-reaching generalization of these results, and we prove that the implication from coronizations to big pieces squared is a generic property. In particular, our result applies, when suitably interpreted, in metric spaces having a fixed positive (perhaps non-integer) dimension, equipped with a Borel regular measure and with arbitrary approximating sets. As a novel application we highlight how to utilize this general setting in the context of parabolic uniform rectifiability.

preprint, 2020, arXiv

Dynamic Bidding Strategies with Multivariate Feedback Control for Multiple Goals in Display Advertising

Real-Time Bidding (RTB) display advertising is a method for purchasing display advertising inventory in auctions that occur within milliseconds. The performance of RTB campaigns is generally measured with a series of Key Performance Indicators (KPIs) - measurements used to ensure that the campaign is cost-effective and that it is purchasing valuable inventory. While an RTB campaign should ideally meet all KPIs, simultaneous improvement tends to be very challenging, as an improvement to any one KPI risks a detrimental effect on the others. Here we present an approach to simultaneously controlling multiple KPIs with a PID-based feedback-control system. This method generates a control score for each KPI, based on both the output of a PID controller module and a metric that quantifies the importance of each KPI for internal business needs. At regular intervals, this algorithm - Sequential Control - will choose the KPI with the greatest overall need for improvement. In this way, our algorithm is able to continually seek the greatest marginal improvements to its current state. Multiple methods of control can be associated with each KPI, and can be triggered either simultaneously or chosen stochastically, in order to avoid local optima. In both offline ad bidding simulations and testing on live traffic, our methods proved to be effective in simultaneously controlling multiple KPIs, and bringing them toward their respective goals.
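
The selection step described in this abstract can be sketched in a few lines. The code below is an illustration under stated assumptions, not the authors' implementation: it assumes the control score is the importance weight multiplied by the absolute PID output on the relative error, and the names `PID` and `pick_kpi`, the gains, and the KPI values are all hypothetical. The abstract does not specify how the two quantities are combined.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple

@dataclass
class PID:
    """Textbook discrete PID controller acting on an error signal."""
    kp: float
    ki: float = 0.0
    kd: float = 0.0
    integral: float = 0.0
    prev_error: Optional[float] = None

    def update(self, error: float, dt: float = 1.0) -> float:
        self.integral += error * dt
        derivative = 0.0 if self.prev_error is None else (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative

def pick_kpi(
    goals: Dict[str, float],
    measured: Dict[str, float],
    importance: Dict[str, float],
    controllers: Dict[str, PID],
) -> Tuple[str, Dict[str, float]]:
    """Score every KPI and return the one with the greatest need for improvement."""
    scores = {}
    for name, goal in goals.items():
        # Relative error keeps KPIs with different units comparable (goal must be nonzero).
        error = (goal - measured[name]) / goal
        # Assumed combination: importance weight times the PID output magnitude.
        scores[name] = importance[name] * abs(controllers[name].update(error))
    return max(scores, key=scores.get), scores

# Hypothetical campaign: cost per click is on target, click-through rate is not.
goals = {"cpc": 2.0, "ctr": 0.01}
measured = {"cpc": 2.0, "ctr": 0.005}
importance = {"cpc": 1.0, "ctr": 1.0}
controllers = {name: PID(kp=1.0) for name in goals}
chosen, scores = pick_kpi(goals, measured, importance, controllers)
```

Calling `pick_kpi` once per control interval mirrors the "choose the KPI with the greatest overall need" step; raising a KPI's importance weight makes it win the selection at a smaller deviation from its goal.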