Researcher profile

Markus Michael Rau

Markus Michael Rau contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2022arXiv

Galaxy Distribution Incompleteness Testing Using Self-Organizing Maps

The calibration of redshift distributions for photometric samples using spectroscopic surveys is plagued by the difficulty in modelling the selection functions of spectroscopic surveys. In this work, we analyse how these selection functions impact redshift inference and quantify the induced biases using local calibration tests in photometry space. The study is carried out using simulations that mimic the radial selection function of a spectroscopic survey and an accompanying mock catalog of a photometric galaxy survey catalog. We use a self-organizing map to partition the photometry space and perform a local $χ^2$ test to study the probability calibration of redshift inferences that use the spectroscopic data for calibration. The goal of this work is to investigate the effect of uncorrected selection functions in the calibration data on redshift prediction accuracy and critically discuss mitigation methods. In particular we test culling-based bias correction techniques, that aim to remove redshift calibration biases by identifying regions in photometry with few spectroscopic calibration data, and propose avenues for future research. We found that removing regions in color-magnitude space that are underpopulated with spectroscopic calibration data does not remove all biases in redshift inference induced by the selection function.

preprint2022arXiv

Photometric Redshifts from SDSS Images with an Interpretable Deep Capsule Network

Studies of cosmology, galaxy evolution, and astronomical transients with current and next-generation wide-field imaging surveys like the Rubin Observatory Legacy Survey of Space and Time (LSST) are all critically dependent on estimates of photometric redshifts. Capsule networks are a new type of neural network architecture that is better suited for identifying morphological features of the input images than traditional convolutional neural networks. We use a deep capsule network trained on $ugriz$ images, spectroscopic redshifts, and Galaxy Zoo spiral/elliptical classifications of $\sim$400,000 Sloan Digital Sky Survey (SDSS) galaxies to do photometric redshift estimation. We achieve a photometric redshift prediction accuracy and a fraction of catastrophic outliers that are comparable to or better than current methods for SDSS main galaxy sample-like data sets ($r\leq17.8$ and $z_{\mathrm{spec}}\leq0.4$) while requiring less data and fewer trainable parameters. Furthermore, the decision-making of our capsule network is much more easily interpretable as capsules act as a low-dimensional encoding of the image. When the capsules are projected on a 2-dimensional manifold, they form a single redshift sequence with the fraction of spirals in a region exhibiting a gradient roughly perpendicular to the redshift sequence. We perturb encodings of real galaxy images in this low-dimensional space to create synthetic galaxy images that demonstrate the image properties (e.g., size, orientation, and surface brightness) encoded by each dimension. We also measure correlations between galaxy properties (e.g., magnitudes, colours, and stellar mass) and each capsule dimension. We publicly release our code, estimated redshifts, and additional catalogues at https://biprateep.github.io/encapZulate-1 .

preprint2022arXiv

Re-calibrating Photometric Redshift Probability Distributions Using Feature-space Regression

Many astrophysical analyses depend on estimates of redshifts (a proxy for distance) determined from photometric (i.e., imaging) data alone. Inaccurate estimates of photometric redshift uncertainties can result in large systematic errors. However, probability distribution outputs from many photometric redshift methods do not follow the frequentist definition of a Probability Density Function (PDF) for redshift -- i.e., the fraction of times the true redshift falls between two limits $z_{1}$ and $z_{2}$ should be equal to the integral of the PDF between these limits. Previous works have used the global distribution of Probability Integral Transform (PIT) values to re-calibrate PDFs, but offsetting inaccuracies in different regions of feature space can conspire to limit the efficacy of the method. We leverage a recently developed regression technique that characterizes the local PIT distribution at any location in feature space to perform a local re-calibration of photometric redshift PDFs. Though we focus on an example from astrophysics, our method can produce PDFs which are calibrated at all locations in feature space for any use case.

preprint2022arXiv

The Dynamical Mass of the Coma Cluster from Deep Learning

In 1933, Fritz Zwicky's famous investigations of the mass of the Coma cluster led him to infer the existence of dark matter \cite{1933AcHPh...6..110Z}. His fundamental discoveries have proven to be foundational to modern cosmology; as we now know such dark matter makes up 85\% of the matter and 25\% of the mass-energy content in the universe. Galaxy clusters like Coma are massive, complex systems of dark matter in addition to hot ionized gas and thousands of galaxies, and serve as excellent probes of the dark matter distribution. However, empirical studies show that the total mass of such systems remains elusive and difficult to precisely constrain. Here, we present new estimates for the dynamical mass of the Coma cluster based on Bayesian deep learning methodologies developed in recent years. Using our novel data-driven approach, we predict Coma's $\mthc$ mass to be $10^{15.10 \pm 0.15}\ \hmsun$ within a radius of $1.78 \pm 0.03\ h^{-1}\mathrm{Mpc}$ of its center. We show that our predictions are rigorous across multiple training datasets and statistically consistent with historical estimates of Coma's mass. This measurement reinforces our understanding of the dynamical state of the Coma cluster and advances rigorous analyses and verification methods for empirical applications of machine learning in astronomy.

preprint2021arXiv

The Role of Machine Learning in the Next Decade of Cosmology

In recent years, machine learning (ML) methods have remarkably improved how cosmologists can interpret data. The next decade will bring new opportunities for data-driven cosmological discovery, but will also present new challenges for adopting ML methodologies and understanding the results. ML could transform our field, but this transformation will require the astronomy community to both foster and promote interdisciplinary research endeavors.

preprint2020arXiv

Dark Energy Survey Year 1 Results: Cosmological Constraints from Cluster Abundances and Weak Lensing

We perform a joint analysis of the counts and weak lensing signal of redMaPPer clusters selected from the Dark Energy Survey (DES) Year 1 dataset. Our analysis uses the same shear and source photometric redshifts estimates as were used in the DES combined probes analysis. Our analysis results in surprisingly low values for $S_8 =σ_8(Ω_{\rm m}/0.3)^{0.5}= 0.65\pm 0.04$, driven by a low matter density parameter, $Ω_{\rm m}=0.179^{+0.031}_{-0.038}$, with $σ_8-Ω_{\rm m}$ posteriors in $2.4σ$ tension with the DES Y1 3x2pt results, and in $5.6σ$ with the Planck CMB analysis. These results include the impact of post-unblinding changes to the analysis, which did not improve the level of consistency with other data sets compared to the results obtained at the unblinding. The fact that multiple cosmological probes (supernovae, baryon acoustic oscillations, cosmic shear, galaxy clustering and CMB anisotropies), and other galaxy cluster analyses all favor significantly higher matter densities suggests the presence of systematic errors in the data or an incomplete modeling of the relevant physics. Cross checks with X-ray and microwave data, as well as independent constraints on the observable--mass relation from SZ selected clusters, suggest that the discrepancy resides in our modeling of the weak lensing signal rather than the cluster abundance. Repeating our analysis using a higher richness threshold ($λ\ge 30$) significantly reduces the tension with other probes, and points to one or more richness-dependent effects not captured by our model.

preprint2020arXiv

Estimating redshift distributions using Hierarchical Logistic Gaussian processes

This work uses hierarchical logistic Gaussian processes to infer true redshift distributions of samples of galaxies, through their cross-correlations with spatially overlapping spectroscopic samples. We demonstrate that this method can accurately estimate these redshift distributions in a fully Bayesian manner jointly with galaxy-dark matter bias models. We forecast how systematic biases in the redshift-dependent galaxy-dark matter bias model affect redshift inference. Using published galaxy-dark matter bias measurements from the Illustris simulation, we compare these systematic biases with the statistical error budget from a forecasted weak gravitational lensing measurement. If the redshift-dependent galaxy-dark matter bias model is mis-specified, redshift inference can be biased. This can propagate into relative biases in the weak lensing convergence power spectrum on the 10% - 30% level. We, therefore, showcase a methodology to detect these sources of error using Bayesian model selection techniques. Furthermore, we discuss the improvements that can be gained from incorporating prior information from Bayesian template fitting into the model, both in redshift prediction accuracy and in the detection of systematic modeling biases.