Researcher profile

Yuetong Zhao

Yuetong Zhao contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 17 - UnverifiedVerification L1Unclaimed author
4works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

4 published item(s)

preprint2023arXiv

Sgap: Towards Efficient Sparse Tensor Algebra Compilation for GPU

Sparse compiler is a promising solution for sparse tensor algebra optimization. In compiler implementation, reduction in sparse-dense hybrid algebra plays a key role in performance. Though GPU provides various reduction semantics that can better utilize the parallel computing and memory bandwidth capacity, the central question is: how to elevate the flexible reduction semantics to sparse compilation theory that assumes serial execution. Specifically, we have to tackle two main challenges: (1) there are wasted parallelism by adopting static synchronization granularity (2) static reduction strategy limits optimization space exploration. We propose Sgap: segment group and atomic parallelism to solve these problems. Atomic parallelism captures the flexible reduction semantics to systematically analyze the optimization space of sparse-dense hybrid algebra on GPU. It is a new optimization technique beyond current compiler-based and open-source runtime libraries. Segment group elevates the flexible reduction semantics to suitable levels of abstraction in the sparse compilation theory. It adopts changeable group size and user-defined reduction strategy to solve challenge (1) and (2), respectively. Finally, we use GPU sparse matrix-matrix multiplication (SpMM) on the TACO compiler as a use case to demonstrate the effectiveness of segment group in reduction semantics elevation. We achieve up to 1.2x speedup over the original TACO's SpMM kernels. We also apply new optimization techniques found by atomic parallelism to an open-source state-of-the-art SpMM library dgSPARSE. We achieve 1.6x - 2.3x speedup on the algorithm tuned with atomic parallelism.

preprint2022arXiv

On Dark Gravitational Wave Standard Sirens as Cosmological Inference and Forecasting the Constraint on Hubble Constant using Binary Black Holes Detected by Deci-hertz Observatory

Gravitational wave (GW) signals from compact binary coalescences can be used as standard sirens to constrain cosmological parameters if their redshift can be measured independently. However, mergers of stellar binary black holes (BBHs) may not have electromagnetic counterparts and thus have no direct redshift measurements. These dark sirens may be still used to statistically constrain cosmological parameters by combining their GW measured luminosity distances and localization with deep redshift surveys of galaxies around it. We investigate this dark siren method in detail by using mock BBH and galaxy samples. We find that the Hubble constant can be constrained well with an accuracy $\lesssim1\%$ with a few tens or more of BBH mergers at redshift up to $1$ if GW observations can provide accurate estimates of their luminosity distance (with relative error of $\lesssim0.01$) and localization ($\lesssim0.1~\rm{deg}^2$), though the constraint may be significantly biased if the luminosity distance and localization errors are larger. We also introduce a simple method to correct this bias and find it is valid when the luminosity distance and localization errors are modestly large. We further generate mock BBH samples, according to current constraints on BBH merger rate and the distributions of BBH properties, and find that the Deci-hertz Observatory (DO) in a half year observation period may detect about one hundred BBHs with signal-to-noise ratio $\varrho\gtrsim30$, relative luminosity distance error $\lesssim0.02$, and localization error $\lesssim0.01\rm{deg}^2$. By applying the dark standard siren method, we find that the Hubble constant can be constrained to the $\sim0.1-1\%$ level using these DO BBHs, an accuracy comparable to the constraints obtained by using electromagnetic observations in the near future, thus it may provide insight into the Hubble tension.

preprint2022arXiv

On Detecting Stellar Binary Black Holes via the LISA-Taiji Network

The detection of gravitational waves (GWs) by ground-based laser interferometer GW observatories (LIGO/Virgo) reveals a population of stellar binary black holes (sBBHs) with (total) masses up to $\sim 150M_\odot$, which are potential sources for space-based GW detectors, such as LISA and Taiji. In this paper, we investigate in details on the possibility of detecting sBBHs by the LISA-Taiji network in future. We adopt the sBBH merger rate density constrained by LIGO/VIRGO observations to randomly generate mock sBBHs samples. Assuming an observation period of $4$ years, we find that the LISA-Taiji network may detect several tens (or at least several) sBBHs with signal-to-noise ratio (SNR) $>8$ (or $>15$), a factor $2-3$ times larger than that by only using LISA or Taiji observations. Among these sBBHs, no more than a few that can merge during the $4$-year observation period. If extending the observation period to $10$ years, then the LISA-Taiji network may detect about one hundred (or twenty) sBBHs with SNR $>8$ (or $>15$), among them about twenty (or at least several) can merge within the observation period. Our results suggest that the LISA-Taiji network may be able to detect at least a handful to twenty or more sBBHs even if assuming a conservative SNR threshold ($15$) for ``detection'', which enables multi-band GW observations by space and ground-based GW detectors. We also further estimate the uncertainties in the parameter estimations of the sBBH systems ``detected'' by the LISA-Taiji network. We find that the relative errors in the luminosity distance measurements and sky localization are mostly in the range of $0.05-0.2$ and $1-100°^2$, respectively, for these sBBHs.

preprint2020arXiv

On two Diophantine inequalities over primes (II)

Let $1<c<\frac{26088036}{12301745},c\not=2$ and $N$ be a sufficiently large real number. In this paper, it is proved that, for almost all $R\in (N,2N]$, the Diophantine inequality \begin{equation*} \big|p_1^c+p_2^c+p_3^c-R\big|<\log^{-1}N \end{equation*} is solvable in primes $p_1,p_2,p_3$. Moreover, we also prove that the following Diophantine inequality \begin{equation*} \big|p_1^c+p_2^c+p_3^c+p_4^c+p_5^c+p_6^c-N\big|<\log^{-1}N \end{equation*} is solvable in prime variables $p_1,p_2,p_3,p_4,p_5,p_6$, which improves the previous result $1<c<\frac{37}{18},c\neq2$.