Researcher profile

Song Huang

Song Huang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2026arXiv

Can AI Dream of Unseen Galaxies? Conditional Diffusion Model for Galaxy Morphology Augmentation

Observational astronomy relies on visual feature identification to detect critical astrophysical phenomena. While machine learning (ML) increasingly automates this process, models often struggle with generalization in large-scale surveys due to the limited representativeness of labeled datasets, whether from simulations or human annotation, a challenge pronounced for rare yet scientifically valuable objects. To address this, we propose a conditional diffusion model to synthesize realistic galaxy images for augmenting ML training data (hereafter GalaxySD). Leveraging the Galaxy Zoo 2 dataset which contains visual feature, galaxy image pairs from volunteer annotation, we demonstrate that GalaxySD generates diverse, high-fidelity galaxy images that closely adhere to the specified morphological feature conditions. Moreover, this model enables generative extrapolation to project well-annotated data into unseen domains and advancing rare object detection. Integrating synthesized images into ML pipelines improves performance in standard morphology classification, boosting completeness and purity by up to 30% across key metrics. For rare object detection, using early-type galaxies with prominent dust lane features (~0.1% in GZ2 dataset) as a test case, our approach doubled the number of detected instances, from 352 to 872, compared to previous studies based on visual inspection. This study highlights the power of generative models to bridge gaps between scarce labeled data and the vast, uncharted parameter space of observational astronomy and sheds insight for future astrophysical foundation model developments. Our project homepage is available at https://galaxysd-webpage.streamlit.app/.

preprint2022arXiv

The In-situ Origins of Dwarf Stellar Outskirts in FIRE-2

Extended, old, and round stellar halos appear to be ubiquitous around high-mass dwarf galaxies ($10^{8.5}<M_\star/M_\odot<10^{9.6}$) in the observed universe. However, it is unlikely that these dwarfs have undergone a sufficient number of minor mergers to form stellar halos that are composed of predominantly accreted stars. Here, we demonstrate that FIRE-2 (Feedback in Realistic Environments) cosmological zoom-in simulations are capable of producing dwarf galaxies with realistic structure, including both a thick disk and round stellar halo. Crucially, these stellar halos are formed in-situ, largely via the outward migration of disk stars. However, there also exists a large population of &#34;non-disky&#34; dwarfs in FIRE-2 that lack a well-defined disk/halo and do not resemble the observed dwarf population. These non-disky dwarfs tend to be either more gas poor or to have burstier recent star formation histories than the disky dwarfs, suggesting that star formation feedback may be preventing disk formation. Both classes of dwarfs underscore the power of a galaxy&#39;s intrinsic shape -- which is a direct quantification of the distribution of the galaxy&#39;s stellar content -- to interrogate the feedback implementation in simulated galaxies.

preprint2021arXiv

Reaching for the Edge I: Probing the Outskirts of Massive Galaxies with HSC, DECaLS, SDSS, and Dragonfly

The outer light (stellar halos) of massive galaxies has recently emerged as a possible low scatter tracer of dark matter halo mass. To test the robustness of outer light measurements across different data sets, we compare the surface brightness profiles of massive galaxies using four independent data sets: the Hyper Suprime-Cam survey (HSC), the Dark Energy Camera Legacy Survey (DECaLS), the Sloan Digital Sky Survey (SDSS), and the Dragonfly Wide Field Survey (Dragonfly). We use customized pipelines for HSC and DECaLS to achieve better sky background subtraction. For galaxies at $z<0.05$, Dragonfly has the best control of systematics, reaching surface brightness levels of $μ_r \sim 30$ mag/arcsec$^{2}$. At $0.19<z<0.50$, HSC can reliably recover surface brightness profiles to $μ_{r} \sim 28.5$ mag/arcsec$^{2}$ reaching $R=100 - 150$ kpc. DECaLS surface brightness profiles show good agreement with HSC but are noisier at large radii. The median profiles of galaxy ensembles in both HSC and DECaLS reach $R > 200$ kpc without significant bias. At $0.19<z<0.50$, DECaLS and HSC measurements of the stellar mass contained within 100 kpc agree within 0.05 dex. Finally, we use weak gravitational lensing to show that measurements of outer light with DECaLS at $0.19<z<0.50$ show a similar promise as HSC as a low scatter proxy of halo mass. The tests and results from this paper represent an important step forward for accurate measurements of the outer light of massive galaxies and demonstrate that outer light measurements from DECam imaging will be a promising method for finding galaxy clusters for DES and DESI.

preprint2021arXiv

The Outer Stellar Mass of Massive Galaxies: A Simple Tracer of Halo Mass with Scatter Comparable to Richness and Reduced Projection Effects

Using the weak gravitational lensing data from the Hyper Suprime-Cam Subaru Strategic Program (HSC survey), we study the potential of different stellar mass estimates in tracing halo mass. We consider galaxies with $\log {M_{\star}/M_{\odot}}>11.5$ at 0.2 < z < 0.5 with carefully measured light profiles and clusters from the redMaPPer and CAMIRA richness-based algorithms. We devise a method (the &#34;TopN&#34; test) to evaluate the scatter in the halo mass-observable relation for different tracers and inter-compare halo mass proxies in four number density bins using stacked galaxy-galaxy lensing profiles. This test reveals three key findings. The stellar mass based on cModel photometry or aperture luminosity within R<30 kpc is a poor proxy of halo mass. In contrast, the stellar mass of the outer envelope is an excellent halo mass proxy. The stellar mass within R=[50,100] kpc, M*[50,100], has performance comparable to the state-of-the-art richness-based cluster finders at $\log{M_{\rm vir}/M_{\odot}}>14.0$ and could be a better halo mass tracer at lower halo masses. Finally, using N-body simulations, we find that the lensing profiles of massive halos selected by M*[50,100] are consistent with the expectation for a sample without projection or mis-centering effects. On the other hand, Richness-selected clusters display an excess at R~1 Mpc in their lensing profiles, which may suggest a more significant impact from selection biases. These results suggest that Mstar-based tracers have distinct advantages in identifying massive halos, which could open up new avenues for cluster cosmology.

preprint2020arXiv

Tracing the Intrinsic Shapes of Dwarf Galaxies out to Four Effective Radii: Clues to Low-Mass Stellar Halo Formation

Though smooth, extended spheroidal stellar outskirts have long been observed around nearby dwarf galaxies, it is unclear whether dwarfs generically host an extended stellar halo. We use imaging from the Hyper Suprime-Cam Subaru Strategic Program (HSC-SSP) to measure the shapes of dwarf galaxies out to four effective radii for a sample of dwarfs at 0.005<z<0.2 and 10^7<M_star/M_sun<10^9.6. We find that dwarfs are slightly triaxial, with a <B/A> >~ 0.75 (where the ellipsoid is characterized by three principle semi-axes constrained by C<=B<=A). At M_star>10^8.5 M_sun, the galaxies grow from thick disk-like near their centers towards the spheroidal extreme at four effective radii. We also see that although blue dwarfs are, on average, characterized by thinner discs than red dwarfs, both blue and red dwarfs grow more spheroidal as a function of radius. This relation also holds true for a comparison between field and satellite dwarfs. This uniform trend towards relatively spheroidal shapes as a function of radius is consistent with an in-situ formation mechanism for stellar outskirts around low-mass galaxies, in agreement with proposed models where star formation feedback produces round stellar outskirts around dwarfs.

preprint2019arXiv

Galaxy-Galaxy Lensing in HSC: Validation Tests and the Impact of Heterogeneous Spectroscopic Training Sets

Although photometric redshifts (photo-z&#39;s) are crucial ingredients for current and upcoming large-scale surveys, the high-quality spectroscopic redshifts currently available to train, validate, and test them are substantially non-representative in both magnitude and color. We investigate the nature and structure of this bias by tracking how objects from a heterogeneous training sample contribute to photo-z predictions as a function of magnitude and color, and illustrate that the underlying redshift distribution at fixed color can evolve strongly as a function of magnitude. We then test the robustness of the galaxy-galaxy lensing signal in 120 deg$^2$ of HSC-SSP DR1 data to spectroscopic completeness and photo-z biases, and find that their impacts are sub-dominant to current statistical uncertainties. Our methodology provides a framework to investigate how spectroscopic incompleteness can impact photo-z-based weak lensing predictions in future surveys such as LSST and WFIRST.

preprint2019arXiv

Physical Correlations of the Scatter between Galaxy Mass, Stellar Content, and Halo Mass

We use the UniverseMachine to analyze the source of scatter between the central galaxy mass, the total stellar mass in the halo, and the dark matter halo mass. We also propose a new halo mass estimator, the cen+N mass: the sum of the stellar mass of the central and the N most massive satellites. We show that, when real space positions are perfectly known, the cen+N mass has scatter competitive with that of richness-based estimators. However, in redshift space, the cen+N mass suffers less from projection effects in the UniverseMachine model. The cen+N mass is therefore a viable low scatter halo mass estimator, and should be considered an important tool to constrain cosmology with upcoming spectroscopic data from DESI. We analyze the scatter in stellar mass at fixed halo mass and show that the total stellar mass in a halo is uncorrelated with secondary halo properties, but that the central stellar mass is a function of both halo mass and halo age. This is because central galaxies in older halos have had more time to grow via accretion. If the UniverseMachine model is correct, accurate galaxy-halo modeling of mass selected samples therefore needs to consider halo age in addition to mass.

preprint2018arXiv

Weak Lensing Reveals a Tight Connection Between Dark Matter Halo Mass and the Distribution of Stellar Mass in Massive Galaxies

Using deep images from the Hyper Suprime-Cam (HSC) survey and taking advantage of its unprecedented weak lensing capabilities, we reveal a remarkably tight connection between the stellar mass distribution of massive central galaxies and their host dark matter halo mass. Massive galaxies with more extended stellar mass distributions tend to live in more massive dark matter haloes. We explain this connection with a phenomenological model that assumes, (1) a tight relation between the halo mass and the total stellar content in the halo, (2) that the fraction of in-situ and ex-situ mass at $r<10$ kpc depends on halo mass. This model provides an excellent description of the stellar mass functions (SMF) of total stellar mass ($M_{\star}^{\rm Max}$) and stellar mass within inner 10 kpc ($M_{\star}^{10}$) and also reproduces the HSC weak lensing signals of massive galaxies with different stellar mass distributions. The best-fit model shows that halo mass varies significantly at fixed total stellar mass (as much as 0.4 dex) with a clear dependence on $M_{\star}^{10}$. Our two-parameter $M_{\star}^{\rm Max}$-$M_{\star}^{10}$ description provides a more accurate picture of the galaxy-halo connection at the high-mass end than the simple stellar-halo mass relation (SHMR) and opens a new window to connect the assembly history of halos with those of central galaxies. The model also predicts that the ex-situ component dominates the mass profiles of galaxies at $r< 10$ kpc for $\log M_{\star} \ge 11.7$). The code used for this paper is available online: https://github.com/dr-guangtou/asap