Researcher profile

Neven Caplar

Neven Caplar contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

Hyrax: An Extensible Framework for Rapid ML Experimentation and Unsupervised Discovery in the Era of Rubin, Roman, and Euclid

The NSF-DOE Vera C. Rubin Observatory, Roman Space Telescope, Euclid, and other next-generation surveys will deliver imaging, spectroscopic, and time-domain data at scales that increasingly shift the bottleneck in astronomical machine learning (ML) projects from model design to infrastructure. We present Hyrax, an open-source, modular, GPU-enabled Python framework that supports the full ML lifecycle in astronomy: from data acquisition and training to inference and experiment comparison, with capabilities including multimodal dataset support, integrated vector databases for similarity search, and interactive two- and three-dimensional latent-space exploration for unsupervised discovery. We demonstrate Hyrax's versatility through five representative applications on real survey data: (i) unsupervised representation learning on $\sim 4\times10^5$ Rubin Legacy Survey of Space and Time (LSST) Data Preview 1 (DP1) galaxies, surfacing new merger and low-surface-brightness candidates missing from reference Euclid and Dark Energy Survey catalogs, while also isolating imaging artifacts -- all without labeled training data; (ii) hybrid density-based clustering for identifying cluster-scale gravitational lens candidates in DP1 data; (iii) multimodal early-time transient classification in the Zwicky Transient Facility leveraging light curves, spectra, images, and metadata; (iv) supervised false-positive filtering in shift-and-stack searches for distant solar system objects in the Dark Energy Camera Ecliptic Exploration Project survey; and (v) supervised detection of semi-resolved dwarf galaxies in Hyper Suprime-Cam and LSST-like imaging using synthetic source injection. Together, these results demonstrate that Hyrax provides astronomy-specific ML infrastructure that enables systematic discovery and rapid methodological iteration across next-generation astronomical surveys.

preprint2022arXiv

From Data to Software to Science with the Rubin Observatory LSST

The Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST) dataset will dramatically alter our understanding of the Universe, from the origins of the Solar System to the nature of dark matter and dark energy. Much of this research will depend on the existence of robust, tested, and scalable algorithms, software, and services. Identifying and developing such tools ahead of time has the potential to significantly accelerate the delivery of early science from LSST. Developing these collaboratively, and making them broadly available, can enable more inclusive and equitable collaboration on LSST science. To facilitate such opportunities, a community workshop entitled "From Data to Software to Science with the Rubin Observatory LSST" was organized by the LSST Interdisciplinary Network for Collaboration and Computing (LINCC) and partners, and held at the Flatiron Institute in New York, March 28-30th 2022. The workshop included over 50 in-person attendees invited from over 300 applications. It identified seven key software areas of need: (i) scalable cross-matching and distributed joining of catalogs, (ii) robust photometric redshift determination, (iii) software for determination of selection functions, (iv) frameworks for scalable time-series analyses, (v) services for image access and reprocessing at scale, (vi) object image access (cutouts) and analysis at scale, and (vii) scalable job execution systems. This white paper summarizes the discussions of this workshop. It considers the motivating science use cases, identified cross-cutting algorithms, software, and services, their high-level technical specifications, and the principles of inclusive collaborations needed to develop them. We provide it as a useful roadmap of needs, as well as to spur action and collaboration between groups and individuals looking to develop reusable software for early LSST science.

preprint2020arXiv

Observational nonstationarity of AGN variability: The only way to go is down!

To gain insights into long-term Active Galactic Nuclei (AGN) variability, we analyze an AGN sample from the Sloan Digital Sky Survey (SDSS) and compare their photometry with observations from the Hyper Suprime-Cam survey (HSC) observed $\langle 14.85 \rangle$ years after SDSS. On average, the AGN are fainter in HSC than SDSS. We demonstrate that the difference is not due to subtle differences in the SDSS versus HSC filters or photometry. The decrease in mean brightness is redshift dependent, consistent with expectations for a change that is a function of the rest-frame time separation between observations. At a given redshift, the mean decrease in brightness is stronger for more luminous AGN and for objects with longer time separation between measurements. We demonstrate that the dependence on redshift and luminosity of measured mean brightness decrease is consistent with simple models of Eddington ratio variability in AGN on long (Myr, Gyr) timescales. We show how our results can be used to constrain the variability and demographic properties of AGN populations.

preprint2020arXiv

Stochastic modelling of star-formation histories II: star-formation variability from molecular clouds and gas inflow

A key uncertainty in galaxy evolution is the physics regulating star formation, ranging from small-scale processes related to the life-cycle of molecular clouds within galaxies to large-scale processes such as gas accretion onto galaxies. We study the imprint of such processes on the time-variability of star formation with an analytical approach tracking the gas mass of galaxies ("regulator model"). Specifically, we quantify the strength of the fluctuation in the star-formation rate (SFR) on different timescales, i.e. the power spectral density (PSD) of the star-formation history, and connect it to gas inflow and the life-cycle of molecular clouds. We show that in the general case the PSD of the SFR has three breaks, corresponding to the correlation time of the inflow rate, the equilibrium timescale of the gas reservoir of the galaxy, and the average lifetime of individual molecular clouds. On long and intermediate timescales (relative to the dynamical timescale of the galaxy), the PSD is typically set by the variability of the inflow rate and the interplay between outflows and gas depletion. On short timescales, the PSD shows an additional component related to the life-cycle of molecular clouds, which can be described by a damped random walk with a power-law slope of $β\approx2$ at high frequencies with a break near the average cloud lifetime. We discuss star-formation "burstiness" in a wide range of galaxy regimes, study the evolution of galaxies about the main sequence ridgeline, and explore the applicability of our method for understanding the star-formation process on cloud-scale from galaxy-integrated measurements.

preprint2020arXiv

The Diversity and Variability of Star Formation Histories in Models of Galaxy Evolution

Understanding the variability of galaxy star formation histories (SFHs) across a range of timescales provides insight into the underlying physical processes that regulate star formation within galaxies. We compile the SFHs of galaxies at $z=0$ from an extensive set of models, ranging from cosmological hydrodynamical simulations (Illustris, IllustrisTNG, Mufasa, Simba, EAGLE), zoom simulations (FIRE-2, g14, and Marvel/Justice League), semi-analytic models (Santa Cruz SAM) and empirical models (UniverseMachine), and quantify the variability of these SFHs on different timescales using the power spectral density (PSD) formalism. We find that the PSDs are well described by broken power-laws, and variability on long timescales ($\gtrsim1$ Gyr) accounts for most of the power in galaxy SFHs. Most hydrodynamical models show increased variability on shorter timescales ($\lesssim300$ Myr) with decreasing stellar mass. Quenching can induce $\sim0.4-1$ dex of additional power on timescales $>1$ Gyr. The dark matter accretion histories of galaxies have remarkably self-similar PSDs and are coherent with the in-situ star formation on timescales $>3$ Gyr. There is considerable diversity among the different models in their (i) power due to SFR variability at a given timescale, (ii) amount of correlation with adjacent timescales (PSD slope), (iii) evolution of median PSDs with stellar mass, and (iv) presence and locations of breaks in the PSDs. The PSD framework is a useful space to study the SFHs of galaxies since model predictions vary widely. Observational constraints in this space will help constrain the relative strengths of the physical processes responsible for this variability.