Source author record

Digvijay Wadekar

Digvijay Wadekar appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

astro-ph.CO astro-ph.GA Artificial Intelligence astro-ph.HE astro-ph.IM gr-qc Computer Vision hep-ph Machine Learning

Catalog footprint

What is connected

6works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Discovery of Interpretable Surrogates via Agentic AI: Application to Gravitational Waves

Fast surrogate models for expensive simulations are now essential across the sciences, yet they typically operate as black boxes. We present \texttt{GWAgent}, a large language model (LLM)-based workflow that constructs interpretable analytic surrogates directly from simulation data. Surrogate modeling is well suited to agentic workflows because candidate models can be quantitatively validated against ground-truth simulations at each iteration. As a demonstration, we build a surrogate for gravitational waveforms from eccentric binary black hole mergers. We show that providing the agent with a physics-informed domain ansatz substantially improves output model accuracy. The resulting analytic surrogate attains a median Advanced LIGO mismatch of $6.9\times10^{-4}$ together with an $\sim 8.4\times$ speedup in waveform evaluation, surpassing both symbolic regression and conventional machine learning baselines. Beyond producing an accurate model, the workflow identifies compact physical structure from the learned representation. As an astrophysical application, we use \texttt{GWAgent} to analyze the eccentricity of GW200129 and infer $e_{20\mathrm{Hz}}=0.099^{+0.063}_{-0.044}$. These results show that validation-constrained agentic workflows can produce accurate, fast, and interpretable surrogates for scientific simulations and inference.

preprint2026arXiv

gwBenchmarks: Stress-Testing LLM Agents on High-Precision Gravitational Wave Astronomy

Modern gravitational wave astronomy relies on modeling tasks that often require months of graduate-level effort, including building fast waveform surrogates from expensive numerical relativity simulations, modeling orbital dynamics of black holes, fitting merger remnant properties and constructing template banks. These problems demand extreme precision to support detection and parameter inference, with state-of-the-art models achieving $\lesssim 10^{-4}$ relative error. We study whether state-of-the-art LLM coding agents can perform such end-to-end scientific modeling, where success requires constructing models with stringent accuracy criteria and reasoning about physical systems. We introduce gwBenchmarks, a suite of eight tasks grounded in gravitational wave analytic calculations and numerical simulations collectively representing over $10^8$ core-hours of compute. The tasks span interpolation, regression, and high-dimensional time-series modeling, requiring a combination of numerical methods, machine learning, and physics-informed approaches. In preliminary experiments, agents frequently relied on proxy metrics, partial evaluation, or fabricated results to spuriously complete tasks. We therefore implement an external pre-defined framework to gauge agent progress. Evaluating twelve coding agents, we find no consistent winner. On the easiest task, multiple agents converge to the same cubic spline solution, with one rediscovering a coordinate transformation widely used in the literature. On harder tasks like analytic waveform modeling, all agents fall 1-2 orders of magnitude short of domain requirements and exhibit systematic failures, including metric misuse, constraint violations, and result fabrication. Our code, data, and website are publicly available.

preprint2022arXiv

Percent-level constraints on baryonic feedback with spectral distortion measurements

High-significance measurements of the monopole thermal Sunyaev-Zel'dovich CMB spectral distortions have the potential to tightly constrain poorly understood baryonic feedback processes. The sky-averaged Compton-y distortion and its relativistic correction are measures of the total thermal energy in electrons in the observable universe and their mean temperature. We use the CAMELS suite of hydrodynamic simulations to explore possible constraints on parameters describing the subgrid implementation of feedback from active galactic nuclei and supernovae, assuming a PIXIE-like measurement. The small 25 Mpc/h CAMELS boxes present challenges due to the significant cosmic variance. We utilize machine learning to construct interpolators through the noisy simulation data. Using the halo model, we translate the simulation halo mass functions into correction factors to reduce cosmic variance where required. Our results depend on the subgrid model. In the case of IllustrisTNG, we find that the best-determined parameter combination can be measured to ~2% and corresponds to a product of AGN and SN feedback. In the case of SIMBA, the tightest constraint is ~0.2% on a ratio between AGN and SN feedback. A second orthogonal parameter combination can be measured to ~8%. Our results demonstrate the significant constraining power a measurement of the late-time spectral distortion monopoles would have for baryonic feedback models.

preprint2021arXiv

The CAMELS Multifield Dataset: Learning the Universe's Fundamental Parameters with Artificial Intelligence

We present the Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) Multifield Dataset, CMD, a collection of hundreds of thousands of 2D maps and 3D grids containing many different properties of cosmic gas, dark matter, and stars from 2,000 distinct simulated universes at several cosmic times. The 2D maps and 3D grids represent cosmic regions that span $\sim$100 million light years and have been generated from thousands of state-of-the-art hydrodynamic and gravity-only N-body simulations from the CAMELS project. Designed to train machine learning models, CMD is the largest dataset of its kind containing more than 70 Terabytes of data. In this paper we describe CMD in detail and outline a few of its applications. We focus our attention on one such task, parameter inference, formulating the problems we face as a challenge to the community. We release all data and provide further technical details at https://camels-multifield-dataset.readthedocs.io.

preprint2020arXiv

Comment on the paper "Calorimetric Dark Matter Detection with Galactic Center Gas Clouds"

The paper "Calorimetric Dark Matter Detection with Galactic Center Gas Clouds" (Bhoonah et al. 2018) aims to derive limits on dark matter interactions by demanding that heat transfer due to DM interactions is less than that by astrophysical cooling, using clouds in the hot, high-velocity nuclear outflow wind of the Milky Way ($T_{wind} \sim 10^{6-7}$ K, $V_{wind} \sim$ 330 km/s). We argue that clouds in such an extreme environment cannot be assumed to be stable over the long timescales associated with their radiative cooling rates. Furthermore, Bhoonah et al. (2018) uses incorrect parameters for their clouds.

preprint2014arXiv

Zeldovich pancakes at redshift zero: the equilibration state and phase space properties

One of the components of the cosmic web are sheets, which are commonly referred to as Zeldovich pancakes. These are structures which have only collapsed along one dimension, as opposed to filaments or galaxies and cluster, which have collapsed along two or three dimensions. These pancakes have recently received renewed interest, since they have been shown to be useful tools for an independent method to determine galaxy cluster masses. We consider sheet-like structures resulting from cosmological simulations, which were previously used to establish the cluster mass determination method, and we show through their level of equilibration, that these structures have indeed only collapsed along the one dimension. We also extract the density profiles of these pancake, which agrees acceptably well with theoretical expectations. We derive the observable velocity distribution function (VDF) analytically by generalizing the Eddington method to one dimension, and we compare with the distribution function from the numerical simulation.