Researcher profile

Matthew Graham

Matthew Graham contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2026arXiv

AstroAlertBench: Evaluating the Accuracy, Reasoning, and Honesty of Multimodal LLMs in Astronomical Classification

Modern astronomical observatories generate a massive volume of multimodal data, creating a critical bottleneck for expert human review. While multimodal large language models (LLMs) have shown promise in interpreting complex visual and textual inputs, their ability to perform specialized scientific classification while providing interpretable reasoning remains understudied. We introduce AstroAlertBench, a comprehensive multimodal benchmark designed to evaluate LLM performance in astronomical event review along a three-stage logical chain: metadata grounding, scientific reasoning, and hierarchical classification over five categories. We use a pilot sample of 1,500 real-world alerts from the Zwicky Transient Facility (ZTF), a wide-field survey that scans the northern sky to detect transient astronomical events. On this dataset, we benchmark 13 frontier closed-source and open-weight LLMs that support visual input. Our results reveal that high accuracy does not always align with model ``honesty,'' defined as the ability to self-evaluate its reasoning, which affects its reliability as a real-world assistant. We further initialize a human-in-the-loop evaluation protocol as a precursor to future community-scale participation. Together, AstroAlertBench provides a framework for developing calibrated and interpretable astronomical assistants.

preprint2022arXiv

Bounded-degree plane geometric spanners in practice

The construction of bounded-degree plane geometric spanners has been a focus of interest since 2002 when Bose, Gudmundsson, and Smid proposed the first algorithm to construct such spanners. To date, eleven algorithms have been designed with various trade-offs in degree and stretch factor. We have implemented these sophisticated algorithms in C++ using the CGAL library and experimented with them using large synthetic and real-world pointsets. Our experiments have revealed their practical behavior and real-world efficacy. We share the implementations via GitHub for broader uses and future research. We present a simple practical algorithm, named AppxStretchFactor, that can estimate stretch factors (obtains lower bounds on the exact stretch factors) of geometric spanners - a challenging problem for which no practical algorithm is known yet. In our experiments with bounded-degree plane geometric spanners, we find that AppxStretchFactor estimates stretch factors almost precisely. Further, it gives linear runtime performance in practice for the pointset distributions considered in this work, making it much faster than the naive Dijkstra-based algorithm for calculating stretch factors

preprint2022arXiv

Data Science and Machine Learning in Education

The growing role of data science (DS) and machine learning (ML) in high-energy physics (HEP) is well established and pertinent given the complex detectors, large data, sets and sophisticated analyses at the heart of HEP research. Moreover, exploiting symmetries inherent in physics data have inspired physics-informed ML as a vibrant sub-field of computer science research. HEP researchers benefit greatly from materials widely available materials for use in education, training and workforce development. They are also contributing to these materials and providing software to DS/ML-related fields. Increasingly, physics departments are offering courses at the intersection of DS, ML and physics, often using curricula developed by HEP researchers and involving open software and data used in HEP. In this white paper, we explore synergies between HEP research and DS/ML education, discuss opportunities and challenges at this intersection, and propose community activities that will be mutually beneficial.

preprint2022arXiv

Experiments with Unit Disk Cover Algorithms for Covering Massive Pointsets

Given a set of $n$ points in the plane, the Unit Disk Cover (UDC) problem asks to compute the minimum number of unit disks required to cover the points, along with a placement of the disks. The problem is NP-hard and several approximation algorithms have been designed over the last three decades. In this paper, we have engineered and experimentally compared practical performances of some of these algorithms on massive pointsets. The goal is to investigate which algorithms run fast and give good approximation in practice. We present a simple $7$-approximation algorithm for UDC that runs in $O(n)$ expected time and uses $O(s)$ extra space, where $s$ denotes the size of the generated cover. In our experiments, it turned out to be the speediest of all. We also present two heuristics to reduce the sizes of covers generated by it without slowing it down by much. To our knowledge, this is the first work that experimentally compares geometric covering algorithms. Experiments with them using massive pointsets (in the order of millions) throw light on their practical uses. We share the engineered algorithms via GitHub - https://github.com/ghoshanirban/UnitDiskCoverAlgorithms for broader uses and future research in the domain of geometric optimization.

preprint2022arXiv

The Time Domain Spectroscopic Survey: Changing-Look Quasar Candidates from Multi-Epoch Spectroscopy in SDSS-IV

Active galactic nuclei (AGN) can vary significantly in their rest-frame optical/UV continuum emission, and with strong associated changes in broad line emission, on much shorter timescales than predicted by standard models of accretion disks around supermassive black holes. Most such ``changing-look'' or "changing-state" AGN -- and at higher luminosities, changing-look quasars (CLQs) -- have been found via spectroscopic follow-up of known quasars showing strong photometric variability. The Time Domain Spectroscopic Survey of SDSS-IV includes repeat spectroscopy of large numbers of previously-known quasars, many selected irrespective of photometric variability, and with spectral epochs separated by months to decades. Our visual examination of these repeat spectra for strong broad line variability yielded 61 newly-discovered CLQ candidates. We quantitatively compare spectral epochs to measure changes in continuum and H$β$ broad line emission, finding 19 CLQs, of which 15 are newly-recognized. The parent sample includes only broad line quasars, so our study tends to find objects that have dimmed, i.e., turn-off CLQs. However, we nevertheless find 4 turn-on CLQs that meet our criteria, albeit with broad lines in both dim and bright states. We study the response of H$β$ and MgII emission lines to continuum changes. The Eddington ratios of CLQs are low, and/or their H$β$ broad line width is large relative to the overall quasar population. Repeat quasar spectroscopy in the upcoming SDSS-V Black Hole Mapper program will reveal significant numbers of CLQs, enhancing our understanding of the frequency and duty-cycle of such strong variability, and the physics and dynamics of the phenomenon.

preprint2021arXiv

Discovery and confirmation of the shortest gamma ray burst from a collapsar

Gamma-ray bursts (GRBs) are among the brightest and most energetic events in the universe. The duration and hardness distribution of GRBs has two clusters, now understood to reflect (at least) two different progenitors. Short-hard GRBs (SGRBs; T90 <2 s) arise from compact binary mergers, while long-soft GRBs (LGRBs; T90 >2 s) have been attributed to the collapse of peculiar massive stars (collapsars). The discovery of SN 1998bw/GRB 980425 marked the first association of a LGRB with a collapsar and AT 2017gfo/GRB 170817A/GW170817 marked the first association of a SGRB with a binary neutron star merger, producing also gravitational wave (GW). Here, we present the discovery of ZTF20abwysqy (AT2020scz), a fast-fading optical transient in the Fermi Satellite and the InterPlanetary Network (IPN) localization regions of GRB 200826A; X-ray and radio emission further confirm that this is the afterglow. Follow-up imaging (at rest-frame 16.5 days) reveals excess emission above the afterglow that cannot be explained as an underlying kilonova (KN), but is consistent with being the supernova (SN). Despite the GRB duration being short (rest-frame T90 of 0.65 s), our panchromatic follow-up data confirms a collapsar origin. GRB 200826A is the shortest LGRB found with an associated collapsar; it appears to sit on the brink between a successful and a failed collapsar. Our discovery is consistent with the hypothesis that most collapsars fail to produce ultra-relativistic jets.

preprint2021arXiv

Initial Characterization of Active Transitioning Centaur, P/2019 LD2 (ATLAS), using Hubble, Spitzer, ZTF, Keck, APO and GROWTH Visible & Infrared Imaging and Spectroscopy

We present visible and mid-infrared imagery and photometry of temporary Jovian co-orbital comet P/2019 LD$_2$ taken with HST/WFC3, Spitzer/IRAC, the GROWTH telescope network, visible spectroscopy from Keck/LRIS and archival ZTF observations taken between 2019 April and 2020 August. Our observations indicate that the nucleus of LD$_2$ has a radius between 0.2-1.8 km assuming a 0.08 albedo and a coma dominated by $\sim$100$μ$ m-scale dust ejected at $\sim$1 m/s speeds with a $\sim$1&#39;&#39; jet pointing in the SW direction. LD$_2$ experienced a total dust mass loss of $\sim$10$^8$ kg at a loss rate of $\sim$6 kg/s with Af$ρ$/cross-section varying between $\sim$85 cm/125 km$^2$ and $\sim$200 cm/310 km$^2$ from 2019 April 9 to 2019 Nov 8. If the increase in Af$ρ$/cross-section remained constant, it implies LD$_2$&#39;s activity began $\sim$2018 November when within 4.8 au of the Sun, implying the onset of H$_2$O sublimation. We measure CO/CO$_2$ gas production of $\lesssim$10$^{27}$ mol/s /$\lesssim$10$^{26}$ mol/s from our 4.5 $μ$m Spitzer observations, $g$-$r$ = 0.59$\pm$0.03, $r$-$i$ = 0.18$\pm$0.05, $i$-$z$ = 0.01$\pm$0.07 from GROWTH observations, H$_2$O gas production of $\lesssim$80 kg/s scaling from our estimated $C_2$ production of $Q_{C_2}\lesssim$7.5$\times10^{24}$ mol/s from Keck/LRIS spectroscopy. We determine that the long-term orbit of LD$_2$ is similar to Jupiter family comets having close encounters with Jupiter within $\sim$0.5 Hill radius in the last $\sim$3 y, within 0.8 Hill radius in $\sim$9 y. Additionally, 78.8$\%$ of our orbital clones are ejected from the Solar System within $1 \times 10^{6}$ years having a dynamical half-life of 3.4 $\times 10^5$ years.

preprint2021arXiv

The nascent milliquasar VT J154843.06+220812.6: tidal disruption event or extreme accretion-state change?

We present detailed multiwavelength follow up of a nuclear radio flare, VT J154843.06+220812.6, hereafter VT J1548. VT J1548 was selected as a ${\sim}1$ mJy radio flare in 3 GHz observations from the VLA Sky Survey (VLASS). It is located in the nucleus of a low mass ($\log M_{\rm BH}/M_\odot \sim6$) host galaxy with weak or no past AGN activity. VT J1548 is associated with a slow rising (multiple year), bright mid IR flare in the WISE survey, peaking at ${\sim}10\%L_{\rm edd.}$. No associated optical transient is detected, although we cannot rule out a short, early optical flare given the limited data available. Constant late time (${\sim}3$ years post-flare) X-ray emission is detected at ${\sim}10^{42}$ erg s$^{-1}$. The radio SED is consistent with synchrotron emission from an outflow incident on an asymmetric medium. A follow-up, optical spectrum shows transient, bright, high-ionization coronal line emission ($[{\rm Fe\,X}]\,λ6375,[{\rm Fe\,XI}]\,λ7894,[{\rm S\,XII}]\,λ7612$). Transient broad H$α$ is also detected but without corresponding broad H$β$ emission, suggesting high nuclear extinction. We interpret this event as either a tidal disruption event or an extreme flare of an active galactic nucleus, in both cases obscured by a dusty torus. Although these individual properties have been observed in previous transients, the combination is unprecedented. This event highlights the importance of searches across all wave bands for assembling a sample of nuclear flares that spans the range of observable properties and possible triggers.

preprint2020arXiv

Characterization of Temporarily-Captured Minimoon 2020 CD$_3$ by Keck Time-resolved Spectrophotometry

We present time-resolved visible spectrophotometry of minimoon 2020 CD$_3$, the second asteroid known to become temporarily captured by the Earth-Moon system&#39;s gravitational field. The spectrophotometry was taken with Keck I/LRIS between wavelengths 434 nm and 912 nm in $B$, $g$, $V$, $R$, $I$ and RG850 filters as it was leaving the Earth-Moon system on 2020 March 23 UTC. The spectrophotometry of 2020 CD$_3$ most closely resembles the spectra of V-type asteroids and some Lunar rock samples with a reddish slope of ~18$\%$/100 nm between 434 nm and 761 nm corresponding to colors of $g$-$r$ = 0.62$\pm$0.08, $r$-$i$ = 0.21 $\pm$ 0.06 and an absorption band at ~900 nm corresponding to $i$-$z$ = -0.54$\pm$0.10. Combining our measured 31.9$\pm$0.1 absolute magnitude with an albedo of 0.35 typical for V-type asteroids, we determine 2020 CD$_3$&#39;s diameter to be ~0.9$\pm$0.1 m making it the first minimoon and one of the smallest asteroids to be spectrally studied. We use our time-series photometry to detect periodic lightcurve variations with a $<$10$^{-4}$ false alarm probability corresponding to a lightcurve period of ~573 s and a lightcurve amplitude of ~1 mag implying 2020 CD$_3$ possesses a $b/a$ axial ratio of ~2.5. In addition, we extend the observational arc of 2020 CD$_3$ to 37 days between 2020 February 15 UTC and 2020 March 23 UTC. From the improved orbital solution for 2020 CD$_3$, we estimate its likely duration of its capture to be ~2 y, and we measure the non-gravitation perturbation on its orbit due to radiation pressure with an area-to-mass ratio of 6.9$\pm$2.4$\times$10$^{-4}$ m$^2$/kg implying a density of 2.3$\pm$0.8 g/cm$^3$, broadly compatible with the densities of other meter-scale asteroids and Lunar rock. We searched for pre-discovery detections of 2020 CD$_3$ in the ZTF archive as far back as 2018 October, but were unable to locate any positive detections.

preprint2020arXiv

Characterization of the Nucleus, Morphology and Activity of Interstellar Comet 2I/Borisov by Optical and Near-Infrared GROWTH, Apache Point, IRTF, ZTF and Keck Observations

We present visible and near-infrared photometric and spectroscopic observations of interstellar object 2I/Borisov taken from 2019 September 10 to 2019 November 29 using the GROWTH, the APO ARC 3.5 m and the NASA/IRTF 3.0 m combined with post and pre-discovery observations of 2I obtained by ZTF from 2019 March 17 to 2019 May 5. Comparison with imaging of distant Solar System comets shows an object very similar to mildly active Solar System comets with an out-gassing rate of $\sim$10$^{27}$ mol/sec. The photometry, taken in filters spanning the visible and NIR range shows a gradual brightening trend of $\sim0.03$ mags/day since 2019 September 10 UTC for a reddish object becoming neutral in the NIR. The lightcurve from recent and pre-discovery data reveals a brightness trend suggesting the recent onset of significant H$_2$O sublimation with the comet being active with super volatiles such as CO at heliocentric distances $>$6 au consistent with its extended morphology. Using the advanced capability to significantly reduce the scattered light from the coma enabled by high-resolution NIR images from Keck adaptive optics taken on 2019 October 04, we estimate a diameter of 2I&#39;s nucleus of $\lesssim$1.4 km. We use the size estimates of 1I/&#39;Oumuamua and 2I/Borisov to roughly estimate the slope of the ISO size-distribution resulting in a slope of $\sim$3.4$\pm$1.2, similar to Solar System comets and bodies produced from collisional equilibrium.

preprint2020arXiv

TDE Hosts are Green and Centrally Concentrated: Signatures of a Post-Merger System

We study the properties of the galaxies hosting the first 19 tidal disruption events (TDEs) detected with the Zwicky Transient Facility (ZTF) within the context of a carefully constructed, representative host galaxy sample. We find that the ZTF sample of TDE hosts is dominated by compact &#34;green valley&#34; galaxies. After we restrict the comparison sample to galaxies with a similar concentration, as measured by Sersic index, we find this green valley over representation is even larger. That is, concentrated red sequence galaxies are not producing TDEs at elevated levels. We present host galaxy spectra which show that E+A galaxies are overrepresented in the ZTF sample by a factor of $\approx$22, which is lower than previous TDE host galaxy studies have found. We find that this overrepresentation can be fully accounted for when taking into account the masses, colors, and Sérsic indices of the ZTF TDE hosts. The combination of both green colors and high Sérsic index of the typical TDE host galaxy could be explained if the TDE rate is temporarily enhanced following a merger that leads to a higher central concentration of stars.

preprint2020arXiv

The Zwicky Transient Facility: Observing System

The Zwicky Transient Facility (ZTF) Observing System (OS) is the data collector for the ZTF project to study astrophysical phenomena in the time domain. ZTF OS is based upon the 48-inch aperture Schmidt-type design Samuel Oschin Telescope at the Palomar Observatory in Southern California. It incorporates new telescope aspheric corrector optics, dome and telescope drives, a large-format exposure shutter, a flat-field illumination system, a robotic bandpass filter exchanger, and the key element: a new 47-square-degree, 600 megapixel cryogenic CCD mosaic science camera, along with supporting equipment. The OS collects and delivers digitized survey data to the ZTF Data System (DS). Here, we describe the ZTF OS design, optical implementation, delivered image quality, detector performance, and robotic survey efficiency.