Researcher profile

Peter K. G. Williams

Peter K. G. Williams contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
13works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

13 published item(s)

preprint2022arXiv

Automated Detection of Antenna Malfunctions in Large-N Interferometers: A Case Study with the Hydrogen Epoch of Reionization Array

We present a framework for identifying and flagging malfunctioning antennas in large radio interferometers. We outline two distinct categories of metrics designed to detect outliers along known failure modes of large arrays: cross-correlation metrics, based on all antenna pairs, and auto-correlation metrics, based solely on individual antennas. We define and motivate the statistical framework for all metrics used, and present tailored visualizations that aid us in clearly identifying new and existing systematics. We implement these techniques using data from 105 antennas in the Hydrogen Epoch of Reionization Array (HERA) as a case study. Finally, we provide a detailed algorithm for implementing these metrics as flagging tools on real data sets.

preprint2022arXiv

Figure and Figure Caption Extraction for Mixed Raster and Vector PDFs: Digitization of Astronomical Literature with OCR Features

Scientific articles published prior to the "age of digitization" in the late 1990s contain figures which are "trapped" within their scanned pages. While progress to extract figures and their captions has been made, there is currently no robust method for this process. We present a YOLO-based method for use on scanned pages, post-Optical Character Recognition (OCR), which uses both grayscale and OCR-features. When applied to the astrophysics literature holdings of the Astrophysics Data System (ADS), we find F1 scores of 90.9% (92.2%) for figures (figure captions) with the intersection-over-union (IOU) cut-off of 0.9 which is a significant improvement over other state-of-the-art methods.

preprint2022arXiv

The Astropy Project: Sustaining and Growing a Community-oriented Open-source Project and the Latest Major Release (v5.0) of the Core Package

The Astropy Project supports and fosters the development of open-source and openly-developed Python packages that provide commonly needed functionality to the astronomical community. A key element of the Astropy Project is the core package $\texttt{astropy}$, which serves as the foundation for more specialized projects and packages. In this article, we summarize key features in the core package as of the recent major release, version 5.0, and provide major updates for the Project. We then discuss supporting a broader ecosystem of interoperable packages, including connections with several astronomical observatories and missions. We also revisit the future outlook of the Astropy Project and the current status of Learn Astropy. We conclude by raising and discussing the current and future challenges facing the Project.

preprint2022arXiv

The First Short GRB Millimeter Afterglow: The Wide-Angled Jet of the Extremely Energetic SGRB 211106A

We present the discovery of the first millimeter afterglow of a short-duration $γ$-ray burst (SGRB) and the first confirmed afterglow of an SGRB localized by the GUANO system on Swift. Our Atacama Large Millimeter/Sub-millimeter Array (ALMA) detection of SGRB 211106A establishes an origin in a faint host galaxy detected in Hubble Space Telescope (HST) imaging at $0.7\lesssim z\lesssim1.4$. From the lack of a detectable optical afterglow, coupled with the bright millimeter counterpart, we infer a high extinction, $A_{\rm V}\gtrsim2.6$ mag along the line of sight, making this the one of the most highly dust-extincted SGRBs known to date. The millimeter-band light curve captures the passage of the synchrotron peak from the afterglow forward shock and reveals a jet break at $t_{\rm jet}=29.2^{+4.5}_{-4.0}$~days. For a presumed redshift of $z=1$, we infer an opening angle, $θ_{\rm jet}=(15.5\pm1.4)$~degrees, and beaming-corrected kinetic energy of $\log(E_{\rm K}/{\rm erg})=51.8\pm0.3$, making this one of the widest and most energetic SGRB jets known to date. Combining all published millimeter-band upper limits in conjunction with the energetics for a large sample of SGRBs, we find that energetic outflows in high density environments are more likely to have detectable millimeter counterparts. Concerted afterglow searches with ALMA should yield detection fractions of 24-40% on timescales of $\gtrsim2$~days at rates $\approx0.8$-1.6 per year, outpacing the historical discovery rate of SGRB centimeter-band afterglows.

preprint2021arXiv

First Results from HERA Phase I: Upper Limits on the Epoch of Reionization 21 cm Power Spectrum

We report upper-limits on the Epoch of Reionization (EoR) 21 cm power spectrum at redshifts 7.9 and 10.4 with 18 nights of data ($\sim36$ hours of integration) from Phase I of the Hydrogen Epoch of Reionization Array (HERA). The Phase I data show evidence for systematics that can be largely suppressed with systematic models down to a dynamic range of $\sim10^9$ with respect to the peak foreground power. This yields a 95% confidence upper limit on the 21 cm power spectrum of $Δ^2_{21} \le (30.76)^2\ {\rm mK}^2$ at $k=0.192\ h\ {\rm Mpc}^{-1}$ at $z=7.9$, and also $Δ^2_{21} \le (95.74)^2\ {\rm mK}^2$ at $k=0.256\ h\ {\rm Mpc}^{-1}$ at $z=10.4$. At $z=7.9$, these limits are the most sensitive to-date by over an order of magnitude. While we find evidence for residual systematics at low line-of-sight Fourier $k_\parallel$ modes, at high $k_\parallel$ modes we find our data to be largely consistent with thermal noise, an indicator that the system could benefit from deeper integrations. The observed systematics could be due to radio frequency interference, cable sub-reflections, or residual instrumental cross-coupling, and warrant further study. This analysis emphasizes algorithms that have minimal inherent signal loss, although we do perform a careful accounting in a companion paper of the small forms of loss or bias associated with the pipeline. Overall, these results are a promising first step in the development of a tuned, instrument-specific analysis pipeline for HERA, particularly as Phase II construction is completed en route to reaching the full sensitivity of the experiment.

preprint2021arXiv

The Galactic Faraday rotation sky 2020

This work gives an update to existing reconstructions of the Galactic Faraday rotation sky by processing almost all Faraday rotation data sets available at the end of the year 2020. Observations of extra-Galactic sources in recent years have, among other regions, further illuminated the previously under-constrained southern celestial sky, as well as parts of the inner disc of the Milky Way. This has culminated in an all-sky data set of 55,190 data points, which is a significant expansion on the 41,330 used in previous works, hence making an updated separation of the Galactic component a promising venture. The increased source density allows us to present our results in a resolution of about $1.3\cdot 10^{-2}\, \mathrm{deg}^2$ ($46.8\,\mathrm{arcmin}^2$), which is a twofold increase compared to previous works. As for previous Faraday rotation sky reconstructions, this work is based on information field theory, a Bayesian inference scheme for field-like quantities which handles noisy and incomplete data. In contrast to previous reconstructions, we find a significantly thinner and pronounced Galactic disc with small-scale structures exceeding values of several thousand $\mathrm{rad}\,\mathrm{m}^{-2}$. The improvements can mainly be attributed to the new catalog of Faraday data, but are also supported by advances in correlation structure modeling within numerical information field theory. We furthermore give a detailed discussion on statistical properties of the Faraday rotation sky and investigate correlations to other data sets.

preprint2021arXiv

Validation of the HERA Phase I Epoch of Reionization 21 cm Power Spectrum Software Pipeline

We describe the validation of the HERA Phase I software pipeline by a series of modular tests, building up to an end-to-end simulation. The philosophy of this approach is to validate the software and algorithms used in the Phase I upper limit analysis on wholly synthetic data satisfying the assumptions of that analysis, not addressing whether the actual data meet these assumptions. We discuss the organization of this validation approach, the specific modular tests performed, and the construction of the end-to-end simulations. We explicitly discuss the limitations in scope of the current simulation effort. With mock visibility data generated from a known analytic power spectrum and a wide range of realistic instrumental effects and foregrounds, we demonstrate that the current pipeline produces power spectrum estimates that are consistent with known analytic inputs to within thermal noise levels (at the 2 sigma level) for k > 0.2 h/Mpc for both bands and fields considered. Our input spectrum is intentionally amplified to enable a strong `detection' at k ~0.2 h/Mpc -- at the level of ~25 sigma -- with foregrounds dominating on larger scales, and thermal noise dominating at smaller scales. Our pipeline is able to detect this amplified input signal after suppressing foregrounds with a dynamic range (foreground to noise ratio) of > 10^7. Our validation test suite uncovered several sources of scale-independent signal loss throughout the pipeline, whose amplitude is well-characterized and accounted for in the final estimates. We conclude with a discussion of the steps required for the next round of data analysis.

preprint2020arXiv

$K2$ Ultracool Dwarfs Survey. VI. White light superflares observed on an L5 dwarf and flare rates of L dwarfs

Kepler K2 long cadence data are used to study white light flares in a sample of 45 L dwarfs. We identified 11 flares on 9 L dwarfs with equivalent durations of (1.3 - 198) hr and total (UV/optical/IR) energies of $\geq$0.9 $\times$ 10$^{32}$ erg. Two superflares with energies of $>$10$^{33}$ erg were detected on an L5 dwarf: this is the coolest object so far on which flares have been identified. The larger superflare on this L5 dwarf has an energy of 4.6$\times$ 10$^{34}$ ergs and an amplitude of $>$300 times the photospheric level: so far, this is the largest amplitude flare detected by the $Kepler/K2$ mission. The next coolest star on which we identified a flare was an L2 dwarf: 2MASS J08585891+1804463. Combining the energies of all the flares which we have identified on 9 L dwarfs with the total observation time which was dedicated by $Kepler$ to all 45 L dwarfs, we construct a composite flare frequency distribution (FFD). The FFD slope is quite shallow (-0.51$\pm$0.17), consistent with earlier results reported by Paudel et al. (2018) for one particular L0 dwarf, for which the FFD slope was found to be -0.34. Using the composite FFD, we predict that, in early and mid-L dwarfs, a superflare of energy 10$^{33}$ erg occurs every 2.4 years and a superflare of energy 10$^{34}$ erg occurs every 7.9 years. Analysis of our L dwarf flares suggests that magnetic fields of $\geq$0.13-1.3 kG are present on the stellar surface: such fields could suppress Type II radio bursts.

preprint2020arXiv

Absolute Calibration Strategies for the Hydrogen Epoch of Reionization Array and Their Impact on the 21 cm Power Spectrum

We discuss absolute calibration strategies for Phase I of the Hydrogen Epoch of Reionization Array (HERA), which aims to measure the cosmological 21 cm signal from the Epoch of Reionization (EoR). HERA is a drift-scan array with a 10 degree wide field of view, meaning bright, well-characterized point source transits are scarce. This, combined with HERA's redundant sampling of the uv plane and the modest angular resolution of the Phase I instrument, make traditional sky-based and self-calibration techniques difficult to implement with high dynamic range. Nonetheless, in this work we demonstrate calibration for HERA using point source catalogues and electromagnetic simulations of its primary beam. We show that unmodeled diffuse flux and instrumental contaminants can corrupt the gain solutions, and present a gain smoothing approach for mitigating their impact on the 21 cm power spectrum. We also demonstrate a hybrid sky and redundant calibration scheme and compare it to pure sky-based calibration, showing only a marginal improvement to the gain solutions at intermediate delay scales. Our work suggests that the HERA Phase I system can be well-calibrated for a foreground-avoidance power spectrum estimator by applying direction-independent gains with a small set of degrees of freedom across the frequency and time axes.

preprint2020arXiv

Detection of Cosmic Structures using the Bispectrum Phase. II. First Results from Application to Cosmic Reionization Using the Hydrogen Epoch of Reionization Array

Characterizing the epoch of reionization (EoR) at $z\gtrsim 6$ via the redshifted 21 cm line of neutral Hydrogen (HI) is critical to modern astrophysics and cosmology, and thus a key science goal of many current and planned low-frequency radio telescopes. The primary challenge to detecting this signal is the overwhelmingly bright foreground emission at these frequencies, placing stringent requirements on the knowledge of the instruments and inaccuracies in analyses. Results from these experiments have largely been limited not by thermal sensitivity but by systematics, particularly caused by the inability to calibrate the instrument to high accuracy. The interferometric bispectrum phase is immune to antenna-based calibration and errors therein, and presents an independent alternative to detect the EoR HI fluctuations while largely avoiding calibration systematics. Here, we provide a demonstration of this technique on a subset of data from the Hydrogen Epoch of Reionization Array (HERA) to place approximate constraints on the brightness temperature of the intergalactic medium (IGM). From this limited data, at $z=7.7$ we infer "$1σ$" upper limits on the IGM brightness temperature to be $\le 316$ "pseudo" mK at $κ_\parallel=0.33$ "pseudo" $h$ Mpc$^{-1}$ (data-limited) and $\le 1000$ "pseudo" mK at $κ_\parallel=0.875$ "pseudo" $h$ Mpc$^{-1}$ (noise-limited). The "pseudo" units denote only an approximate and not an exact correspondence to the actual distance scales and brightness temperatures. By propagating models in parallel to the data analysis, we confirm that the dynamic range required to separate the cosmic HI signal from the foregrounds is similar to that in standard approaches, and the power spectrum of the bispectrum phase is still data-limited (at $\gtrsim 10^6$ dynamic range) indicating scope for further improvement in sensitivity as the array build-out continues.

preprint2020arXiv

Foreground modelling via Gaussian process regression: an application to HERA data

The key challenge in the observation of the redshifted 21-cm signal from cosmic reionization is its separation from the much brighter foreground emission. Such separation relies on the different spectral properties of the two components, although, in real life, the foreground intrinsic spectrum is often corrupted by the instrumental response, inducing systematic effects that can further jeopardize the measurement of the 21-cm signal. In this paper, we use Gaussian Process Regression to model both foreground emission and instrumental systematics in $\sim 2$ hours of data from the Hydrogen Epoch of Reionization Array. We find that a simple co-variance model with three components matches the data well, giving a residual power spectrum with white noise properties. These consist of an "intrinsic" and instrumentally corrupted component with a coherence-scale of 20 MHz and 2.4 MHz respectively (dominating the line of sight power spectrum over scales $k_{\parallel} \le 0.2$ h cMpc$^{-1}$) and a baseline dependent periodic signal with a period of $\sim 1$ MHz (dominating over $k_{\parallel} \sim 0.4 - 0.8$h cMpc$^{-1}$) which should be distinguishable from the 21-cm EoR signal whose typical coherence-scales is $\sim 0.8$ MHz.

preprint2020arXiv

IDEAS: Immersive Dome Experiences for Accelerating Science

Astrophysics lies at the crossroads of big datasets (such as the Large Synoptic Survey Telescope and Gaia), open source software to visualize and interpret high dimensional datasets (such as Glue, WorldWide Telescope, and OpenSpace), and uniquely skilled software engineers who bridge data science and research fields. At the same time, more than 4,000 planetariums across the globe immerse millions of visitors in scientific data. We have identified the potential for critical synergy across data, software, hardware, locations, and content that -- if prioritized over the next decade -- will drive discovery in astronomical research. Planetariums can and should be used for the advancement of scientific research. Current facilities such as the Hayden Planetarium in New York City, Adler Planetarium in Chicago, Morrison Planetarium in San Francisco, the Iziko Planetarium and Digital Dome Research Consortium in Cape Town, and Visualization Center C in Norrkoping are already developing software which ingests catalogs of astronomical and multi-disciplinary data critical for exploration research primarily for the purpose of creating scientific storylines for the general public. We propose a transformative model whereby scientists become the audience and explorers in planetariums, utilizing software for their own investigative purposes. In this manner, research benefits from the authentic and unique experience of data immersion contained in an environment bathed in context and equipped for collaboration. Consequently, in this white paper we argue that over the next decade the research astronomy community should partner with planetariums to create visualization-based research opportunities for the field. Realizing this vision will require new investments in software and human capital.

preprint2019arXiv

Mitigating Internal Instrument Coupling II: A Method Demonstration with the Hydrogen Epoch of Reionization Array

We present a study of internal reflection and cross coupling systematics in Phase I of the Hydrogen Epoch of Reionization Array (HERA). In a companion paper, we outlined the mathematical formalism for such systematics and presented algorithms for modeling and removing them from the data. In this work, we apply these techniques to data from HERA&#39;s first observing season as a method demonstration. The data show evidence for systematics that, without removal, would hinder a detection of the 21 cm power spectrum for the targeted EoR line-of-sight modes in the range 0.2 < k_parallel < 0.5\ h^-1 Mpc. After systematic removal, we find we can recover these modes in the power spectrum down to the integrated noise-floor of a nightly observation, achieving a dynamic range in the EoR window of 10^-6 in power (mK^2 units) with respect to the bright galactic foreground signal. In the absence of other systematics and assuming the systematic suppression demonstrated here continues to lower noise levels, our results suggest that fully-integrated HERA Phase I may have the capacity to set competitive upper limits on the 21 cm power spectrum. For future observing seasons, HERA will have upgraded analog and digital hardware to better control these systematics in the field.