Researcher profile

C. Tao

C. Tao contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2026arXiv

Euclid preparation. Calibrated intrinsic galaxy alignments in the Euclid Flagship simulation

Intrinsic alignments of galaxies are potentially a major contaminant of cosmological analyses of weak gravitational lensing. We construct a semi-analytic model of galaxy ellipticities and alignments in the \Euclid Flagship simulation to predict this contamination in Euclid's weak lensing observations. Galaxy shapes and orientations are determined by the corresponding properties of the host haloes in the underlying $N$-body simulation, as well as the relative positions of galaxies within their halo. Alignment strengths are moderated via stochastic misalignments, separately for central and satellite galaxies and conditional on the galaxy's redshift, luminosity, and rest-frame colour. The resulting model is calibrated against galaxy ellipticity statistics from the COSMOS Survey, selected alignment measurements based on Sloan Digital Sky Survey samples, and galaxy orientations extracted from the Horizon-AGN hydrodynamic simulation at redshift $z=1$. The best-fit model has a total of 12 alignment parameters and generally reproduces the calibration data sets well within the $1σ$ statistical uncertainties of the observations and the \flagship simulation, with notable exceptions for the most luminous sub-samples on small physical scales. The statistical power of the calibration data and the volume of the single \flagship realisation are still too small to provide informative prior ranges for intrinsic alignment amplitudes in relevant galaxy samples. As a first application, we predict that \Euclid end-of-mission tomographic weak gravitational lensing two-point statistics are modified by up to order $10\,\%$ due to intrinsic alignments.

preprint2026arXiv

Euclid preparation. Galaxy 2-point correlation function modelling in redshift space

The Euclid satellite will measure spectroscopic redshifts for tens of millions of emission-line galaxies. In the context of Stage-IV surveys, the 3-dimensional clustering of galaxies plays a key role in providing cosmological constraints. In this paper, we conduct a model comparison for the multipole moments of the galaxy 2-point correlation function (2PCF) in redshift space. We test state-of-the-art models, in particular the effective field theory of large-scale structure (EFT), one based on the velocity difference generating function (VDG$_{\infty}$), and different variants of Lagrangian perturbation theory (LPT) models, such as convolutional LPT (CLPT) and its effective-field-theory extension (CLEFT). We analyse the first three even multipoles of the 2PCF in the Flagship 1 simulation, which consists of four snapshots at $z\in\{0.9,1.2,1.5,1.8\}$. We study both template-fitting and full-shape approaches and find that with the template-fitting approach, only the VDG$_{\infty}$ model is able to reach a minimum fitting scale of $s_{\rm min}=20\,h^{-1}\,{\rm Mpc}$ at $z=0.9$ without biasing the recovered parameters. Indeed, the EFT model becomes inaccurate already at $s_{\rm min}=30\,h^{-1}\,{\rm Mpc}$. Conversely, in the full-shape analysis, the CLEFT and VDG$_{\infty}$ models perform similarly well, but only the CLEFT model can reach $s_{\rm min}=20\,h^{-1}\,{\rm Mpc}$ while the VDG$_{\infty}$ model is unbiased down to $s_{\rm min}=25\,h^{-1}\,{\rm Mpc}$ at the lowest redshift. Overall, in order to achieve the accuracy required by Euclid, non-perturbative modelling such as in the VDG$_{\infty}$ or CLEFT models should be considered. At $z=1.8$, the CLPT model is sufficient to describe the data with high figure of merit. This comparison selects baseline models that perform best in ideal conditions and sets the stage for an optimal analysis of Euclid data in configuration space.

preprint2026arXiv

Euclid preparation. Testing analytic models of galaxy intrinsic alignments in the Euclid Flagship simulation

We model intrinsic alignments (IA) in Euclid&#39;s Flagship simulation to investigate its impact on Euclid&#39;s weak lensing signal. Our IA implementation in the Flagship simulation takes into account photometric properties of galaxies as well as their dark matter host halos. We compare simulations against theory predictions, determining the parameters of two of the most widely used IA models: the Non Linear Alignment (NLA) and the Tidal Alignment and Tidal Torquing (TATT) models. We measure the amplitude of the simulated IA signal as a function of galaxy magnitude and colour in the redshift range $0.1<z<2.1$. We find that both NLA and TATT can accurately describe the IA signal in the simulation down to scales of $6$-$7 \,h^{-1}\,$Mpc. We measure alignment amplitudes for red galaxies comparable to those of the observations, with samples not used in the calibration procedure. For blue galaxies, our constraints are consistent with zero alignments in our first redshift bin $0.1 < z < 0.3$, but we detect a non-negligible signal at higher redshift, which is, however, consistent with the upper limits set by observational constraints. Additionally, several hydrodynamical simulations predict alignment for spiral galaxies, in agreement with our findings. Finally, the evolution of alignment with redshift is realistic and comparable to that determined in the observations. However, we find that the commonly adopted redshift power-law for IA fails to reproduce the simulation alignments above $z=1.1$. A significantly better agreement is obtained when a luminosity dependence is included, capturing the intrinsic luminosity evolution with redshift in magnitude-limited surveys. We conclude that the Flagship IA simulation is a useful tool for translating current IA constraints into predictions for IA contamination of Euclid-like samples.

preprint2026arXiv

Euclid: Galaxy SED reconstruction in the PHZ processing function: impact on the PSF and the role of medium-band filters

Weak lensing surveys require accurate correction for the point spread function (PSF) when measuring galaxy shapes. For a diffraction-limited PSF, as arises in space-based missions, this correction depends on each galaxy SED. In the Euclid mission, galaxy SED reconstruction, a tasks of the photometric-redshift processing function (PHZ PF), relies on broad- and medium-band ancillary photometry. The limited wavelength sampling of the Euclid VIS passband and signal-to-noise ratio may affect the reconstruction accuracy and translate into biases in the weak lensing measurements. In this study, we present the methodology, which is employed in the Euclid PHZ PF, for reconstructing galaxy SEDs at 55 wavelengths, sampling the VIS passband every 10 nm, and we assess whether it fulfils the accuracy requirements imposed on the Euclid PSF model. We employ both physics- and data-driven methods, focusing on a new approach of template-based flux correction and Gaussian processes, and we introduce an SED metric whose bias propagates into PSF quadrupole moment errors. Our findings demonstrate that Gaussian processes and template fitting meet the requirements only in specific, but complementary, redshift intervals. We therefore propose a hybrid approach, which leverages both methods. This solution proves to be effective in meeting the Euclid accuracy requirements for most of the redshift range of the survey. Finally, we investigate the impact on the SED reconstruction of a new set of 16 evenly-spaced medium-band filters for the Subaru telescope, providing quasi-spectroscopic coverage of the VIS passband. This study shows promising results, ensuring accurate SED reconstruction and meeting the mission PSF requirements. This work thus provides not only the methodological foundation of galaxy SED reconstruction in the Euclid PHZ PF, but also a roadmap for future improvements using a new medium-band survey.

preprint2025arXiv

Euclid preparation. Simulating thousands of Euclid spectroscopic skies

We present two extensive sets of 3500+1000 simulations of dark matter haloes on the past light cone, and two corresponding sets of simulated (`mock&#39;) galaxy catalogues that represent the Euclid spectroscopic sample. The simulations were produced with the latest version of the PINOCCHIO code, and provide the largest, public set of simulated skies. Mock galaxy catalogues were obtained by populating haloes with galaxies using an halo occupation distribution (HOD) model extracted from the Flagship galaxy catalogue provided by Euclid Collaboration. The Geppetto set of 3500 simulated skies was obtained by tiling a 1.2 Gpc/h box to cover a light-cone whose sky footprint is a circle of 30 deg radius, for an area of 2763 deg$^2$ and a minimum halo mass of $1.5\times10^{11}$ Msun/h. The relatively small box size makes this set unfit for measuring very large scales. The EuclidLargeBox set consists of 1000 simulations of 3.38 Gpc/h, with the same mass resolution and a footprint that covers half of the sky, excluding the Milky Way zone of avoidance. From this we produced a set of 1000 EuclidLargeMocks on the 30 deg radius footprint, whose comoving volume is fully contained in the simulation box. We validated the two sets of catalogues by analysing number densities, power spectra, and 2-point correlation functions, showing that the Flagship spectroscopic catalogue is consistent with being one of the realisations of the simulated sets, although we noticed small deviations limited to the quadrupole at k>0.2 h/Mpc. We show cosmological parameter inference from these catalogues and demonstrate that using one realisation of EuclidLargeMocks in place of the Flagship mock produces the same posteriors, to within the expected shift given by sample variance. These simulated skies will be used for the galaxy clustering analysis of Euclid&#39;s Data Release 1 (DR1).

preprint2022arXiv

A Probabilistic Autoencoder for Type Ia Supernovae Spectral Time Series

We construct a physically-parameterized probabilistic autoencoder (PAE) to learn the intrinsic diversity of type Ia supernovae (SNe Ia) from a sparse set of spectral time series. The PAE is a two-stage generative model, composed of an Auto-Encoder (AE) which is interpreted probabilistically after training using a Normalizing Flow (NF). We demonstrate that the PAE learns a low-dimensional latent space that captures the nonlinear range of features that exists within the population, and can accurately model the spectral evolution of SNe Ia across the full range of wavelength and observation times directly from the data. By introducing a correlation penalty term and multi-stage training setup alongside our physically-parameterized network we show that intrinsic and extrinsic modes of variability can be separated during training, removing the need for the additional models to perform magnitude standardization. We then use our PAE in a number of downstream tasks on SNe Ia for increasingly precise cosmological analyses, including automatic detection of SN outliers, the generation of samples consistent with the data distribution, and solving the inverse problem in the presence of noisy and incomplete data to constrain cosmological distance measurements. We find that the optimal number of intrinsic model parameters appears to be three, in line with previous studies, and show that we can standardize our test sample of SNe Ia with an RMS of $0.091 \pm 0.010$ mag, which corresponds to $0.074 \pm 0.010$ mag if peculiar velocity contributions are removed. Trained models and codes are released at \href{https://github.com/georgestein/suPAErnova}{github.com/georgestein/suPAErnova}

preprint2022arXiv

Uniform Recalibration of Common Spectrophotometry Standard Stars onto the CALSPEC System using the SuperNova Integral Field Spectrograph

We calibrate spectrophotometric optical spectra of 32 stars commonly used as standard stars, referenced to 14 stars already on the HST-based CALSPEC flux system. Observations of CALSPEC and non-CALSPEC stars were obtained with the SuperNova Integral Field Spectrograph over the wavelength range 3300 A to 9400 A as calibration for the Nearby Supernova Factory cosmology experiment. In total, this analysis used 4289 standard-star spectra taken on photometric nights. As a modern cosmology analysis, all pre-submission methodological decisions were made with the flux scale and external comparison results blinded. The large number of spectra per star allows us to treat the wavelength-by-wavelength calibration for all nights simultaneously with a Bayesian hierarchical model, thereby enabling a consistent treatment of the Type Ia supernova cosmology analysis and the calibration on which it critically relies. We determine the typical per-observation repeatability (median 14 mmag for exposures >~ 5 s), the Maunakea atmospheric transmission distribution (median dispersion of 7 mmag with uncertainty 1 mmag), and the scatter internal to our CALSPEC reference stars (median of 8 mmag). We also check our standards against literature filter photometry, finding generally good agreement over the full 12-magnitude range. Overall, the mean of our system is calibrated to the mean of CALSPEC at the level of ~ 3 mmag. With our large number of observations, careful crosschecks, and 14 reference stars, our results are the best calibration yet achieved with an integral-field spectrograph, and among the best calibrated surveys.

preprint2022arXiv

Void BAO measurements on quasars from eBOSS

We present the clustering of voids based on the quasar (QSO) sample of the extended Baryon Oscillation Spectroscopic Survey Data Release 16 in configuration space. We define voids as overlapping empty circumspheres computed by Delaunay tetrahedra spanned by quartets of quasars, allowing for an estimate of the depth of underdense regions. To maximise the BAO signal-to-noise ratio, we consider only voids with radii larger than 36$h^{-1}$Mpc. Our analysis shows a negative BAO peak in the cross-correlation of QSOs and voids. The joint BAO measurement of the QSO auto-correlation and the corresponding cross-correlation with voids shows an improvement in 70$\%$ of the QSO mocks with an average improvement of $\sim5\%$. However, on the SDSS data, we find no improvement compatible with cosmic variance. For both mocks and data, adding voids does not introduce any bias. We find under the flat $Λ$CDM assumption, a distance joint measurement on data at the effective redshift $z_{\rm eff}=1.48$ of $D_V(z_{\rm eff})=26.297\pm0.547$. A forecast of a DESI-like survey with 1000 boxes with a similar effective volume recovers the same results as for light-cone mocks with an average of 4.8$\%$ improvement in 68$\%$ of the boxes.

preprint2020arXiv

Strong Dependence of Type Ia Supernova Standardization on the Local Specific Star Formation Rate

As part of an on-going effort to identify, understand and correct for astrophysics biases in the standardization of Type Ia supernovae (SNIa) for cosmology, we have statistically classified a large sample of nearby SNeIa into those located in predominantly younger or older environments. This classification is based on the specific star formation rate measured within a projected distance of 1kpc from each SN location (LsSFR). This is an important refinement compared to using the local star formation rate directly as it provides a normalization for relative numbers of available SN progenitors and is more robust against extinction by dust. We find that the SNeIa in predominantly younger environments are DY=0.163\pm0.029 mag (5.7 sigma) fainter than those in predominantly older environments after conventional light-curve standardization. This is the strongest standardized SN Ia brightness systematic connected to host-galaxy environment measured to date. The well-established step in standardized brightnesses between SNeIa in hosts with lower or higher total stellar masses is smaller at DM=0.119\pm0.032 mag (4.5 sigma), for the same set of SNeIa. When fit simultaneously, the environment age offset remains very significant, with DY=0.129\pm0.032 mag (4.0 sigma), while the global stellar mass step is reduced to DM=0.064\pm0.029 mag (2.2 sigma). Thus, approximately 70% of the variance from the stellar mass step is due to an underlying dependence on environment-based progenitor age. Standardization using only the SNeIa in younger environments reduces the total dispersion from 0.142\pm0.008 mag to 0.120\pm0.010 mag. We show that as environment ages evolve with redshift a strong bias on measurement of the dark energy equation of state parameters can develop. Fortunately, data to measure and correct for this effect is likely to be available for many next-generation experiments. [abstract shorten]

preprint2020arXiv

Track length measurement of $^{19}$F$^+$ ions with the MIMAC Dark Matter directional detector prototype

Weakly Interacting Massive Particles (WIMPs) are one of the most preferred candidate for Dark Matter. WIMPs should interact with the nuclei of detectors. If a robust signal is eventually observed in direct detection experiments, the best signature to confirm its Galactic origin would be the nuclear recoil track direction. The MIMAC collaboration has developed a low pressure gas detector providing both the kinetic energy and three-dimensional track reconstruction of nuclear recoils. In this paper we report the first ever observations of $^{19}$F nuclei tracks in a $5$ cm drift prototype MIMAC detector, in the low kinetic energy range ($6$-$26$ keV), using specially developed ion beam facilities. We have measured the recoil track lengths and found significant differences between our measurements and standard simulations. In order to understand these differences, we have performed a series of complementary experiments and simulations to study the impact of the diffusion and eventual systematics. We show an unexpected dependence of the number of read-out corresponding to the track on the electric field applied to the $512\ \mathrm{μm}$ gap of the Micromegas detector. We have introduced, based on the flash-ADC observable, corrections in order to reconstruct the physical 3D track length of the primary electron clouds proposing the physics behind these corrections. We show that diffusion and space charge effects need to be taken into account to explain the differences between measurements and standard simulations. These measurements and simulations may shed a new light on the high-gain TPC ionization signals in general and particularly at low energy.

preprint2019arXiv

SUGAR: An improved empirical model of Type Ia Supernovae based on spectral features

Type Ia Supernovae (SNe Ia) are widely used to measure the expansion of the Universe. Improving distance measurements of SNe Ia is one technique to better constrain the acceleration of expansion and determine its physical nature. This document develops a new SNe Ia spectral energy distribution (SED) model, called the SUpernova Generator And Reconstructor (SUGAR), which improves the spectral description of SNe Ia, and consequently could improve the distance measurements. This model is constructed from SNe Ia spectral properties and spectrophotometric data from The Nearby Supernova Factory collaboration. In a first step, a PCA-like method is used on spectral features measured at maximum light, which allows us to extract the intrinsic properties of SNe Ia. Next, the intrinsic properties are used to extract the average extinction curve. Third, an interpolation using Gaussian Processes facilitates using data taken at different epochs during the lifetime of a SN Ia and then projecting the data on a fixed time grid. Finally, the three steps are combined to build the SED model as a function of time and wavelength. This is the SUGAR model. The main advancement in SUGAR is the addition of two additional parameters to characterize SNe Ia variability. The first is tied to the properties of SNe Ia ejecta velocity, the second is correlated with their calcium lines. The addition of these parameters, as well as the high quality the Nearby Supernova Factory data, makes SUGAR an accurate and efficient model for describing the spectra of normal SNe Ia as they brighten and fade. The performance of this model makes it an excellent SED model for experiments like ZTF, LSST or WFIRST.