Researcher profile

Dan S. Taranu

Dan S. Taranu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2026arXiv

Hyrax: An Extensible Framework for Rapid ML Experimentation and Unsupervised Discovery in the Era of Rubin, Roman, and Euclid

The NSF-DOE Vera C. Rubin Observatory, Roman Space Telescope, Euclid, and other next-generation surveys will deliver imaging, spectroscopic, and time-domain data at scales that increasingly shift the bottleneck in astronomical machine learning (ML) projects from model design to infrastructure. We present Hyrax, an open-source, modular, GPU-enabled Python framework that supports the full ML lifecycle in astronomy: from data acquisition and training to inference and experiment comparison, with capabilities including multimodal dataset support, integrated vector databases for similarity search, and interactive two- and three-dimensional latent-space exploration for unsupervised discovery. We demonstrate Hyrax's versatility through five representative applications on real survey data: (i) unsupervised representation learning on $\sim 4\times10^5$ Rubin Legacy Survey of Space and Time (LSST) Data Preview 1 (DP1) galaxies, surfacing new merger and low-surface-brightness candidates missing from reference Euclid and Dark Energy Survey catalogs, while also isolating imaging artifacts -- all without labeled training data; (ii) hybrid density-based clustering for identifying cluster-scale gravitational lens candidates in DP1 data; (iii) multimodal early-time transient classification in the Zwicky Transient Facility leveraging light curves, spectra, images, and metadata; (iv) supervised false-positive filtering in shift-and-stack searches for distant solar system objects in the Dark Energy Camera Ecliptic Exploration Project survey; and (v) supervised detection of semi-resolved dwarf galaxies in Hyper Suprime-Cam and LSST-like imaging using synthetic source injection. Together, these results demonstrate that Hyrax provides astronomy-specific ML infrastructure that enables systematic discovery and rapid methodological iteration across next-generation astronomical surveys.

preprint2022arXiv

Faro: A framework for measuring the scientific performance of petascale Rubin Observatory data products

The Vera C. Rubin Observatory will advance many areas of astronomy over the next decade with its unique wide-fast-deep multi-color imaging survey, the Legacy Survey of Space and Time (LSST). The LSST will produce approximately 20TB of raw data per night, which will be automatically processed by the LSST Science Pipelines to generate science-ready data products -- processed images, catalogs and alerts. To ensure that these data products enable transformative science with LSST, stringent requirements have been placed on their quality and scientific fidelity, for example on image quality and depth, astrometric and photometric performance, and object recovery completeness. In this paper we introduce faro, a framework for automatically and efficiently computing scientific performance metrics on the LSST data products for units of data of varying granularity, ranging from single-detector to full-survey summary statistics. By measuring and monitoring metrics, we are able to evaluate trends in algorithmic performance and conduct regression testing during development, compare the performance of one algorithm against another, and verify that the LSST data products will meet performance requirements by comparing to specifications. We present initial results using faro to characterize the performance of the data products produced on simulated and precursor data sets, and discuss plans to use faro to verify the performance of the LSST commissioning data products.

preprint2022arXiv

Galaxy And Mass Assembly (GAMA): Bulge-disk decomposition of KiDS data in the nearby universe

We derive single Sérsic fits and bulge-disk decompositions for 13096 galaxies at redshifts z < 0.08 in the GAMA II equatorial survey regions in the Kilo-Degree Survey (KiDS) g, r and i bands. The surface brightness fitting is performed using the Bayesian two-dimensional profile fitting code ProFit. We fit three models to each galaxy in each band independently with a fully automated Markov-chain Monte Carlo analysis: a single Sérsic model, a Sérsic plus exponential and a point source plus exponential. After fitting the galaxies, we perform model selection and flag galaxies for which none of our models are appropriate (mainly mergers/Irregular galaxies). The fit quality is assessed by visual inspections, comparison to previous works, comparison of independent fits of galaxies in the overlap regions between KiDS tiles and bespoke simulations. The latter two are also used for a detailed investigation of systematic error sources. We find that our fit results are robust across various galaxy types and image qualities with minimal biases. Errors given by the MCMC underestimate the true errors typically by factors 2-3. Automated model selection criteria are accurate to > 90 % as calibrated by visual inspection of a subsample of galaxies. We also present g-r component colours and the corresponding colour-magnitude diagram, consistent with previous works despite our increased fit flexibility. Such reliable structural parameters for the components of a diverse sample of galaxies across multiple bands will be integral to various studies of galaxy properties and evolution. All results are integrated into the GAMA database.

preprint2022arXiv

The Astropy Project: Sustaining and Growing a Community-oriented Open-source Project and the Latest Major Release (v5.0) of the Core Package

The Astropy Project supports and fosters the development of open-source and openly-developed Python packages that provide commonly needed functionality to the astronomical community. A key element of the Astropy Project is the core package $\texttt{astropy}$, which serves as the foundation for more specialized projects and packages. In this article, we summarize key features in the core package as of the recent major release, version 5.0, and provide major updates for the Project. We then discuss supporting a broader ecosystem of interoperable packages, including connections with several astronomical observatories and missions. We also revisit the future outlook of the Astropy Project and the current status of Learn Astropy. We conclude by raising and discussing the current and future challenges facing the Project.

preprint2021arXiv

The SAMI Galaxy Survey: the third and final data release

We have entered a new era where integral-field spectroscopic surveys of galaxies are sufficiently large to adequately sample large-scale structure over a cosmologically significant volume. This was the primary design goal of the SAMI Galaxy Survey. Here, in Data Release 3 (DR3), we release data for the full sample of 3068 unique galaxies observed. This includes the SAMI cluster sample of 888 unique galaxies for the first time. For each galaxy, there are two primary spectral cubes covering the blue (370-570nm) and red (630-740nm) optical wavelength ranges at spectral resolving power of R=1808 and 4304 respectively. For each primary cube, we also provide three spatially binned spectral cubes and a set of standardized aperture spectra. For each galaxy, we include complete 2D maps from parameterized fitting to the emission-line and absorption-line spectral data. These maps provide information on the gas ionization and kinematics, stellar kinematics and populations, and more. All data are available online through Australian Astronomical Optics (AAO) Data Central.

preprint2021arXiv

Third Data Release of the Hyper Suprime-Cam Subaru Strategic Program

The paper presents the third data release of Hyper Suprime-Cam Subaru Strategic Program (HSC-SSP), a wide-field multi-band imaging survey with the Subaru 8.2m telescope. HSC-SSP has three survey layers (Wide, Deep, and UltraDeep) with different area coverages and depths, designed to address a wide array of astrophysical questions. This third release from HSC-SSP includes data from 278 nights of observing time and covers about 670 square degrees in all five broad-band filters at the full depth ($\sim26$~mag at $5σ$) in the Wide layer. If we include partially observed area, the release covers 1,470 square degrees. The Deep and UltraDeep layers have $\sim80\%$ of the originally planned integration times, and are considered done, as we have slightly changed the observing strategy in order to compensate for various time losses. There are a number of updates in the image processing pipeline. Of particular importance is the change in the sky subtraction algorithm; we subtract the sky on small scales before the detection and measurement stages, which has significantly reduced false detections. Thanks to this and other updates, the overall quality of the processed data has improved since the previous release. However, there are limitations in the data (for example, the pipeline is not optimized for crowded fields), and we encourage the user to check the quality assurance plots as well as a list of known issues before exploiting the data. The data release website is https://hsc-release.mtk.nao.ac.jp/.