Researcher profile

Zarija Lukić

Zarija Lukić contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 17 - Unverified
Verification L1
Unclaimed author
4 works
0 followers
3 topics
4 close collaborators

Published work

4 published item(s)

Preprint, 2022, arXiv

Snowmass2021 Computational Frontier White Paper: Cosmological Simulations and Modeling

Powerful new observational facilities will come online over the next decade, enabling a number of discovery opportunities in the "Cosmic Frontier", which targets understanding of the physics of the early universe, dark matter and dark energy, and cosmological probes of fundamental physics, such as neutrino masses and modifications of Einstein gravity. Synergies between different experiments will be leveraged to present new classes of cosmic probes as well as to minimize systematic biases present in individual surveys. Success of this observational program requires actively pairing it with a well-matched state-of-the-art simulation and modeling effort. Next-generation cosmological modeling will increasingly focus on physically rich simulations able to model outputs of sky surveys spanning multiple wavebands. These simulations will have unprecedented resolution and volume coverage, and must deliver guaranteed high-fidelity results for individual surveys as well as for the cross-correlations across different surveys. The needed advances are as follows: (1) Development of scientifically rich and broadly scoped simulations, which capture the relevant physics and correlations between probes; (2) Accurate translation of simulation results into realistic image or spectral data to be directly compared with observations; (3) Improved emulators and/or data-driven methods serving as surrogates for expensive simulations, constructed from a finite set of full-physics simulations; (4) Detailed and transparent verification and validation programs for both simulations and analysis tools. (Abridged)

Preprint, 2021, arXiv

Estimating Galactic Distances From Images Using Self-supervised Representation Learning

We use a contrastive self-supervised learning framework to estimate distances to galaxies from their photometric images. We incorporate data augmentations from computer vision as well as an application-specific augmentation accounting for galactic dust. We find that the resulting visual representations of galaxy images are semantically useful, allow for fast similarity searches, and can be successfully fine-tuned for the task of redshift estimation. We show that (1) pretraining on a large corpus of unlabeled data followed by fine-tuning on some labels can attain the accuracy of a fully supervised model which requires 2-4x more labeled data, and (2) by fine-tuning our self-supervised representations using all available data labels in the Main Galaxy Sample of the Sloan Digital Sky Survey (SDSS), we outperform the state-of-the-art supervised learning method.

Preprint, 2021, arXiv

Fast, high-fidelity Lyman $α$ forests with convolutional neural networks

Full-physics cosmological simulations are powerful tools for studying the formation and evolution of structure in the universe but require extreme computational resources. Here, we train a convolutional neural network to use a cheaper N-body-only simulation to reconstruct the baryon hydrodynamic variables (density, temperature, and velocity) on scales relevant to the Lyman-$α$ (Ly$α$) forest, using data from Nyx simulations. We show that our method enables rapid estimation of these fields at a resolution of $\sim$20 kpc, and captures the statistics of the Ly$α$ forest with much greater accuracy than existing approximations. Because our model is fully convolutional, we can train on smaller simulation boxes and deploy on much larger ones, enabling substantial computational savings. Furthermore, as our method produces an approximation for the hydrodynamic fields instead of Ly$α$ flux directly, it is not limited to a particular choice of ionizing background or mean transmitted flux.

Preprint, 2021, arXiv

Self-Supervised Representation Learning for Astronomical Images

Sky surveys are the largest data generators in astronomy, making automated tools for extracting meaningful scientific information an absolute necessity. We show that, without the need for labels, self-supervised learning recovers representations of sky survey images that are semantically useful for a variety of scientific tasks. These representations can be directly used as features, or fine-tuned, to outperform supervised methods trained only on labeled data. We apply a contrastive learning framework on multi-band galaxy photometry from the Sloan Digital Sky Survey (SDSS) to learn image representations. We then use them for galaxy morphology classification, and fine-tune them for photometric redshift estimation, using labels from the Galaxy Zoo 2 dataset and SDSS spectroscopy. In both downstream tasks, using the same learned representations, we outperform the supervised state-of-the-art results, and we show that our approach can achieve the accuracy of supervised models while using 2-4 times fewer labels for training.