Source author record

John Miller

John Miller appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Machine Learning, gr-qc, physics.optics, astro-ph.IM, Computation and Language, physics.ins-det, Artificial Intelligence, astro-ph.CO, astro-ph.SR, Computer Vision, Databases, physics.ao-ph, physics.ed-ph, quant-ph

Catalog footprint

What is connected

16 works

14 topics

4 close collaborators

Preprint, 2022, arXiv

Retiring Adult: New Datasets for Fair Machine Learning

Although the fairness community has recognized the importance of data, researchers in the area primarily rely on UCI Adult when it comes to tabular data. Derived from a 1994 US Census survey, this dataset has appeared in hundreds of research papers where it served as the basis for the development and comparison of many algorithmic fairness interventions. We reconstruct a superset of the UCI Adult data from available US Census sources and reveal idiosyncrasies of the UCI Adult dataset that limit its external validity. Our primary contribution is a suite of new datasets derived from US Census surveys that extend the existing data ecosystem for research on fair machine learning. We create prediction tasks relating to income, employment, health, transportation, and housing. The data span multiple years and all states of the United States, allowing researchers to study temporal shift and geographic variation. We highlight a broad initial sweep of new empirical insights relating to trade-offs between fairness criteria, performance of algorithmic interventions, and the role of distribution shift based on our new datasets. Our findings inform ongoing debates, challenge some existing narratives, and point to future research directions. Our datasets are available at https://github.com/zykls/folktables.

Preprint, 2021, arXiv

Distance measures in gravitational-wave astrophysics and cosmology

We present quantities which characterize the sensitivity of gravitational-wave observatories to sources at cosmological distances. In particular, we introduce and generalize the horizon, range, response, and reach distances. These quantities incorporate a number of important effects, including cosmologically well-defined distances and volumes, cosmological redshift, cosmological time dilation, and rate density evolution. In addition, these quantities incorporate unique aspects of gravitational wave detectors, such as the variable sky sensitivity of the detectors and the scaling of the sensitivity with inverse distance. An online calculator (https://users.rcc.uchicago.edu/~dholz/gwc/) and python notebook (https://github.com/hsinyuc/distancetool) to determine GW distances are available. We provide answers to the question: "How far can gravitational-wave detectors hear?"
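The role of cosmological distance and redshift in the abstract above can be illustrated with a minimal sketch. It is not the authors' calculator or notebook: it assumes a flat universe with matter and a cosmological constant, with illustrative parameter values (H0 = 70 km/s/Mpc, matter density 0.3) chosen here, and all function names are hypothetical. It computes the luminosity distance, to which gravitational-wave amplitude is inversely proportional, and applies the (1 + z) stretch to a signal's duration and frequency.

```python
import math

C_KM_S = 299792.458  # speed of light in km/s


def hubble_ratio(z, omega_m=0.3):
    """E(z) = H(z)/H0 for a flat universe (assumed: matter + cosmological constant)."""
    return math.sqrt(omega_m * (1.0 + z) ** 3 + (1.0 - omega_m))


def comoving_distance_mpc(z, h0=70.0, omega_m=0.3, steps=1000):
    """Line-of-sight comoving distance in Mpc via the trapezoidal rule."""
    if z == 0:
        return 0.0
    dz = z / steps
    # Endpoints get half weight, interior points full weight.
    total = 0.5 * (1.0 / hubble_ratio(0.0, omega_m) + 1.0 / hubble_ratio(z, omega_m))
    for i in range(1, steps):
        total += 1.0 / hubble_ratio(i * dz, omega_m)
    return (C_KM_S / h0) * total * dz


def luminosity_distance_mpc(z, h0=70.0, omega_m=0.3):
    """Luminosity distance in Mpc; in a flat universe it is (1 + z) times comoving."""
    return (1.0 + z) * comoving_distance_mpc(z, h0, omega_m)


def observed_signal(z, source_duration_s, source_freq_hz):
    """Cosmological time dilation: duration stretches and frequency drops by (1 + z)."""
    return source_duration_s * (1.0 + z), source_freq_hz / (1.0 + z)


if __name__ == "__main__":
    for z in (0.01, 0.1, 0.5, 1.0):
        print(f"z = {z:4.2f}  d_L = {luminosity_distance_mpc(z):8.1f} Mpc")
```

The horizon, range, response, and reach distances defined in the paper additionally fold in detector sky sensitivity and rate density evolution, which this sketch leaves out.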

Preprint, 2021, arXiv

Gaussian Function On Response Surface Estimation

We propose a new framework for interpreting black-box machine learning models along two dimensions (features and samples) via a metamodeling technique, by which we study the output and input relationships of the underlying machine learning model. The metamodel can be estimated from data generated via a trained complex model by running the computer experiment on samples of data in the region of interest. We utilize a Gaussian process as a surrogate to capture the response surface of a complex model, in which we incorporate two parts in the process: interpolated values that are modeled by a stationary Gaussian process Z governed by a prior covariance function, and a mean function mu that captures the known trends in the underlying model. The optimization procedure for the variable importance parameter theta is to maximize the likelihood function. This theta corresponds to the correlation of individual variables with the target response. There is no need for any pre-assumed models since it depends on empirical observations. Experiments demonstrate the potential of the interpretable model through quantitative assessment of the predicted samples.

Preprint, 2020, arXiv

Strategic Classification is Causal Modeling in Disguise

Consequential decision-making incentivizes individuals to strategically adapt their behavior to the specifics of the decision rule. While a long line of work has viewed strategic adaptation as gaming and attempted to mitigate its effects, recent work has instead sought to design classifiers that incentivize individuals to improve a desired quality. Key to both accounts is a cost function that dictates which adaptations are rational to undertake. In this work, we develop a causal framework for strategic adaptation. Our causal perspective clearly distinguishes between gaming and improvement and reveals an important obstacle to incentive design. We prove any procedure for designing classifiers that incentivize improvement must inevitably solve a non-trivial causal inference problem. Moreover, we show a similar result holds for designing cost functions that satisfy the requirements of previous work. With the benefit of hindsight, our results show much of the prior work on strategic classification is causal modeling in disguise.

Preprint, 2020, arXiv

Test-Time Training with Self-Supervision for Generalization under Distribution Shifts

In this paper, we propose Test-Time Training, a general approach for improving the performance of predictive models when training and test data come from different distributions. We turn a single unlabeled test sample into a self-supervised learning problem, on which we update the model parameters before making a prediction. This also extends naturally to data in an online stream. Our simple approach leads to improvements on diverse image classification benchmarks aimed at evaluating robustness to distribution shifts.

Preprint, 2020, arXiv

The Effect of Natural Distribution Shift on Question Answering Models

We build four new test sets for the Stanford Question Answering Dataset (SQuAD) and evaluate the ability of question-answering systems to generalize to new data. Our first test set is from the original Wikipedia domain and measures the extent to which existing systems overfit the original test set. Despite several years of heavy test set re-use, we find no evidence of adaptive overfitting. The remaining three test sets are constructed from New York Times articles, Reddit posts, and Amazon product reviews and measure robustness to natural distribution shifts. Across a broad range of models, we observe average performance drops of 3.8, 14.0, and 17.4 F1 points, respectively. In contrast, a strong human baseline matches or exceeds the performance of SQuAD models on the original domain and exhibits little to no drop in new domains. Taken together, our results confirm the surprising resilience of the holdout method and emphasize the need to move towards evaluation metrics that incorporate robustness to natural distribution shifts.
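The F1 drops quoted above are differences in a token-overlap score between predicted and reference answers. A minimal sketch of that metric follows, assuming lowercase whitespace tokenization; the official SQuAD evaluation additionally strips punctuation and articles and takes the maximum over several reference answers, which is omitted here. The function names are hypothetical.

```python
from collections import Counter


def token_f1(prediction, reference):
    """Token-level F1 between a predicted and a reference answer string."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    if not pred_tokens or not ref_tokens:
        # Two empty answers agree; one empty answer scores zero.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often as it appears in both.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


def mean_f1_points(predictions, references):
    """Average F1 over a test set, on the 0-100 scale used for 'F1 points'."""
    scores = [token_f1(p, r) for p, r in zip(predictions, references)]
    return 100.0 * sum(scores) / len(scores)


def f1_drop(original_points, shifted_points):
    """Performance drop when moving from the original test set to a shifted one."""
    return original_points - shifted_points
```

Under this definition, a drop of 14.0 F1 points means the mean score on the shifted test set is 14.0 lower on the 0-100 scale than on the original one.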

Preprint, 2016, arXiv

Observing the carbon-climate system

Increases in atmospheric CO2 and CH4 result from a combination of forcing from anthropogenic emissions and Earth System feedbacks that reduce or amplify the effects of those emissions on atmospheric concentrations. Despite decades of research, carbon-climate feedbacks remain poorly quantified. The impact of these uncertainties on future climate is of increasing concern, especially in the wake of recent climate negotiations. Emissions, long concentrated in the developed world, are now shifting to developing countries, where the emissions inventories have larger uncertainties. The fraction of anthropogenic CO2 remaining in the atmosphere has remained remarkably constant over the last 50 years. Will this change in the future as the climate evolves? Concentrations of CH4, the second most important greenhouse gas, which had apparently stabilized, have recently resumed their increase, but the exact cause for this is unknown. While greenhouse gases affect the global atmosphere, their sources and sinks are remarkably heterogeneous in time and space, and traditional in situ observing systems do not provide the coverage and resolution to attribute the changes in these greenhouse gases to specific sources or sinks. In the past few years, space-based technologies have shown promise for monitoring carbon stocks and fluxes. Advanced versions of these capabilities could transform our understanding and provide the data needed to quantify carbon-climate feedbacks. A new observing system that resolves global fluxes at high resolution will capture variations on time and space scales that allow the attribution of these fluxes to underlying mechanisms.

Preprint, 2015, arXiv

Audio-band frequency-dependent squeezing

Quantum vacuum fluctuations impose strict limits on precision displacement measurements, those of interferometric gravitational-wave detectors among them. Introducing squeezed states into an interferometer's readout port can improve the sensitivity of the instrument, leading to richer astrophysical observations. However, optomechanical interactions dictate that the vacuum's squeezed quadrature must rotate by 90 degrees around 50 Hz. Here we use a 2-m-long, high-finesse optical resonator to produce frequency-dependent rotation around 1.2 kHz. This demonstration of audio-band frequency-dependent squeezing uses technology and methods that are scalable to the required rotation frequency, heralding application of the technique in future gravitational-wave detectors.

Preprint, 2015, arXiv

Observation of Parametric Instability in Advanced LIGO

Parametric instabilities have long been studied as a potentially limiting effect in high-power interferometric gravitational wave detectors. Until now, however, these instabilities have never been observed in a kilometer-scale interferometer. In this work we describe the first observation of parametric instability in an Advanced LIGO detector, and the means by which it has been removed as a barrier to progress.

Preprint, 2015, arXiv

Thermal noise of gram-scale cantilever flexures

We present measurements of thermal noise in niobium and aluminium flexures. Our measurements cover the audio frequency band from 10 Hz to 10 kHz, which is of particular relevance to ground-based interferometric gravitational wave detectors, and span up to an order of magnitude above and below the fundamental flexure resonances at 50 Hz to 300 Hz. Our results are well-explained by a simple model in which both structural and thermoelastic loss play a role. The ability of such a model to explain this interplay is important for investigations of quantum-radiation-pressure noise and the standard quantum limit.

Preprint, 2015, arXiv

Traversing Knowledge Graphs in Vector Space

Path queries on a knowledge graph can be used to answer compositional questions such as "What languages are spoken by people living in Lisbon?". However, knowledge graphs often have missing facts (edges) which disrupts path queries. Recent models for knowledge base completion impute missing facts by embedding knowledge graphs in vector spaces. We show that these models can be recursively applied to answer path queries, but that they suffer from cascading errors. This motivates a new "compositional" training objective, which dramatically improves all models' ability to answer path queries, in some cases more than doubling accuracy. On a standard knowledge base completion task, we also demonstrate that compositional training acts as a novel form of structural regularization, reliably improving performance across all base models (reducing errors by up to 43%) and achieving new state-of-the-art results.
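The idea of answering a path query by recursively applying an embedding model can be sketched with a toy example. The sketch assumes an additive, translation-style model in which each relation is a vector added to the current position; the abstract does not say which base models are used, so this is only one possible instance. The entity names, relation names, and 2-D vectors are hand-set for illustration and are not learned, and the function names are hypothetical.

```python
# Hand-set toy embeddings (hypothetical values, not learned from data).
ENTITIES = {
    "lisbon": (0.0, 0.0),
    "ana": (1.0, 0.0),
    "portuguese": (1.0, 1.0),
    "paris": (4.0, 0.0),
    "marc": (5.0, 0.0),
    "french": (5.0, 1.0),
}

# Each relation is a translation in the same vector space.
RELATIONS = {
    "resident": (1.0, 0.0),  # city -> person living there
    "speaks": (0.0, 1.0),    # person -> language
}


def traverse(start, path):
    """Apply each relation in turn, starting from the source entity's vector."""
    vec = list(ENTITIES[start])
    for relation in path:
        step = RELATIONS[relation]
        vec = [v + s for v, s in zip(vec, step)]
    return vec


def score(query_vec, entity):
    """Higher is better: negative squared Euclidean distance to the candidate."""
    return -sum((q - e) ** 2 for q, e in zip(query_vec, ENTITIES[entity]))


def answer(start, path):
    """Return the entity whose embedding lies closest to the traversed query vector."""
    query_vec = traverse(start, path)
    return max(ENTITIES, key=lambda name: score(query_vec, name))


if __name__ == "__main__":
    # "What languages are spoken by people living in Lisbon?"
    print(answer("lisbon", ["resident", "speaks"]))
```

With learned embeddings each step is only approximately right, so errors accumulate along the path; training directly on path queries, as the abstract describes, targets that accumulation.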

Preprint, 2014, arXiv

Length control of an optical resonator using second-order transverse modes

We present the analysis of an unorthodox technique for locking a laser to a resonant optical cavity. Error signals are derived from the interference between the fundamental cavity mode and higher-order spatial modes of order two excited by mode mismatch. This scheme is simple, inexpensive and, in contrast to similar techniques, first-order-insensitive to beam jitter. After mitigating sources of technical noise, performance is fundamentally limited by quantum shot-noise.

Preprint, 2014, arXiv

Prospects for doubling the range of Advanced LIGO

In the coming years, the gravitational wave community will be optimizing detector performance for a variety of astrophysical sources that make competing demands on the detector sensitivity in different frequency bands. In this paper we describe a number of technologies that are being developed as anticipated upgrades to the Advanced LIGO detector, and quantify the potential sensitivity improvement they offer. Specifically, we consider squeezed light injection for reduction of quantum noise, detector design and materials changes which reduce thermal noise, and mirrors with significantly increased mass. We explore how each of these technologies impacts the detection of the most promising gravitational wave sources, and suggest an effective progression of upgrades which culminate in a factor of two broadband sensitivity improvement.

Preprint, 2013, arXiv

Constructing a Multiple-Choice Assessment For Upper-Division Quantum Physics From An Open-Ended Tool

As part of an ongoing investigation of student learning in upper-division quantum mechanics, we needed a high-quality conceptual assessment instrument for comparing outcomes of different curricular approaches. The 14-item open-ended Quantum Mechanics Assessment Tool (QMAT) was previously developed for this purpose. However, open-ended tests require complex scoring rubrics, are difficult to score consistently, and demand substantial investment of faculty time to grade. Here, we present the process of converting open-ended questions to multiple-choice (MC) format. We highlight the construction of effective distractors and the use of student interviews to revise and validate questions and distractors. We examine other elements of the process, including results of a preliminary implementation of the MC assessment given at Cal Poly Pomona and CU Boulder.

Preprint, 2013, arXiv

Magnetic fields in Accretion Discs around Neutron Stars - Consequences for the change of spin

Accretion disks are ubiquitous in the universe and it is generally accepted that magnetic fields play a pivotal role in accretion-disk physics. The spin history of millisecond pulsars, which are usually classified as magnetized neutron stars spun up by an accretion disk, depends sensitively on the magnetic field structure, and yet highly idealized models from the 80s are still being used for calculating the magnetic field components. We present a possible way of improving the currently used models with a semi-analytic approach. The resulting magnetic field profile of both the poloidal and the toroidal component can be very different from the one suggested previously. This might dramatically change our picture of which parts of the disk tend to spin the star up or down.

Preprint, 2011, arXiv

Arm-length stabilisation for interferometric gravitational-wave detectors using frequency-doubled auxiliary lasers

Residual motion of the arm cavity mirrors is expected to prove one of the principal impediments to systematic lock acquisition in advanced gravitational-wave interferometers. We present a technique which overcomes this problem by employing auxiliary lasers at twice the fundamental measurement frequency to pre-stabilise the arm cavities' lengths. Applying this approach, we reduce the apparent length noise of a 1.3 m long, independently suspended Fabry-Perot cavity to 30 pm rms and successfully transfer longitudinal control of the system from the auxiliary laser to the measurement laser.
