Researcher profile

Yu Yu

Yu Yu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
15works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

15 published item(s)

preprint2026arXiv

On the (In-)Security of the Shuffling Defense in the Transformer Secure Inference

For Transformer models, cryptographically secure inference ensures that the client learns only the final output, while the server learns nothing about the client's input. However, securely computing nonlinear layers remains a major efficiency bottleneck due to the substantial communication rounds and data transmission required. To address this issue, prior works reveal intermediate activations to the client, allowing nonlinear operations to be computed in plaintext. Although this approach significantly improves efficiency, exposing activations enables adversaries to extract model weights. To mitigate this risk, existing works employ a shuffling defense that reveals only randomly permuted activations to the client. In this work, we show that the shuffling defense is not as robust as previously claimed. We propose an attack that aligns differently shuffled activations to a common permutation and subsequently exploits them to extract model weights. Experiments on Pythia-70m and GPT-2 demonstrate that the proposed attack can align shuffled activations with mean squared errors ranging from $10^{-9}$ to $10^{-6}$. With a query cost of approximately \$1, the adversary can recover model weights with L1-norm differences ranging from $10^{-4}$ to $10^{-2}$ compared to the oracle weights.

preprint2022arXiv

Sensitivity tests of cosmic velocity fields to massive neutrinos

We investigate impacts of massive neutrinos on the cosmic velocity fields, employing high-resolution cosmological $N$-body simulations provided by the information-optimized CUBE code, where cosmic neutrinos are evolved using collisionless hydrodynamics and their perturbations can be accurately resolved. In this study we focus, for the first time, on the analysis of massive-neutrino induced suppression effects in various cosmic velocity field components of velocity magnitude, divergence, vorticity and dispersion. By varying the neutrino mass sum $M_ν$ from 0 -- 0.4 eV, the simulations show that, the power spectra of vorticity -- exclusively sourced by non-linear structure formation that is affected by massive neutrinos significantly -- is very sensitive to the mass sum, which potentially provide novel signatures in detecting massive neutrinos. Furthermore, using the chi-square statistic, we quantitatively test the sensitivity of the density and velocity power spectra to the neutrino mass sum. Indeed, we find that, the vorticity spectrum has the highest sensitivity, and the null hypothesis of massless neutrinos is incompatible with both vorticity and divergence spectra from $M_ν=0.1$ eV at high significance ($p$-value $= 0.03$ and $0.07$, respectively). These results demonstrate clearly the importance of peculiar velocity field measurements, in particular of vorticity and divergence components, in determination of neutrino mass and mass hierarchy.

preprint2022arXiv

Strong Conformity and Assembly Bias: Towards a Physical Understanding of the Galaxy-Halo Connection in SDSS Clusters

Understanding the physical connection between cluster galaxies and massive haloes is key to mitigating systematic uncertainties in next-generation cluster cosmology. We develop a novel method to infer the level of conformity between the stellar mass of the brightest central galaxies~(BCGs) $M_*^{BCG}$ and the satellite richness $λ$, defined as their correlation coefficient $ρ_{cc}$ at fixed halo mass, using the abundance and weak lensing of SDSS clusters as functions of $M_*^{BCG}$ and $λ$. We detect a halo mass-dependent conformity as $ρ_{cc}{=}0.60{+}0.08\ln(M_h/3{\times}10^{14}M_{\odot}/h)$. The strong conformity successfully resolves the "halo mass equality" conundrum discovered in Zu et al. 2021 -- when split by $M_*^{BCG}$ at fixed $λ$, the low and high-$M_*^{BCG}$ clusters have the same average halo mass despite having a $0.34$ dex discrepancy in average $M_*^{BCG}$. On top of the best-fitting conformity model, we develop a cluster assembly bias~(AB) prescription calibrated against the CosmicGrowth simulation, and build a conformity+AB model for the cluster weak lensing measurements. Our model predicts that with a ${\sim}20\%$ lower halo concentration $c$, the low-$M_*^{BCG}$ clusters are ${\sim}10\%$ more biased than the high-$M_*^{BCG}$ systems, in excellent agreement with the observations. We also show that the observed conformity and assembly bias are unlikely due to projection effects. Finally, we build a toy model to argue that while the early-time BCG-halo co-evolution drives the $M_*^{BCG}$-$c$ correlation, the late-time dry merger-induced BCG growth naturally produces the $M_*^{BCG}$-$λ$ conformity despite the well-known anti-correlation between $λ$ and $c$. Our method paves the path towards simultaneously constraining cosmology and cluster formation with future cluster surveys.

preprint2022arXiv

The DESI $N$-body Simulation Project -- II. Suppressing sample variance with fast simulations

Dark Energy Spectroscopic Instrument (DESI) will construct a large and precise three-dimensional map of our Universe. The survey effective volume reaches $\sim20\Gpchcube$. It is a great challenge to prepare high-resolution simulations with a much larger volume for validating the DESI analysis pipelines. \textsc{AbacusSummit} is a suite of high-resolution dark-matter-only simulations designed for this purpose, with $200\Gpchcube$ (10 times DESI volume) for the base cosmology. However, further efforts need to be done to provide a more precise analysis of the data and to cover also other cosmologies. Recently, the CARPool method was proposed to use paired accurate and approximate simulations to achieve high statistical precision with a limited number of high-resolution simulations. Relying on this technique, we propose to use fast quasi-$N$-body solvers combined with accurate simulations to produce accurate summary statistics. This enables us to obtain 100 times smaller variance than the expected DESI statistical variance at the scales we are interested in, e.g. $k < 0.3\hMpc$ for the halo power spectrum. In addition, it can significantly suppress the sample variance of the halo bispectrum. We further generalize the method for other cosmologies with only one realization in \textsc{AbacusSummit} suite to extend the effective volume $\sim 20$ times. In summary, our proposed strategy of combining high-fidelity simulations with fast approximate gravity solvers and a series of variance suppression techniques sets the path for a robust cosmological analysis of galaxy survey data.

preprint2021arXiv

Authentication of Metropolitan Quantum Key Distribution Network with Post-quantum Cryptography

Quantum key distribution (QKD) provides information theoretically secures key exchange requiring authentication of the classic data processing channel via pre-sharing of symmetric private keys. In previous studies, the lattice-based post-quantum digital signature algorithm Aigis-Sig, combined with public-key infrastructure (PKI) was used to achieve high-efficiency quantum security authentication of QKD, and its advantages in simplifying the MAN network structure and new user entry were demonstrated. This experiment further integrates the PQC algorithm into the commercial QKD system, the Jinan field metropolitan QKD network comprised of 14 user nodes and 5 optical switching nodes. The feasibility, effectiveness and stability of the post-quantum cryptography (PQC) algorithm and advantages of replacing trusted relays with optical switching brought by PQC authentication large-scale metropolitan area QKD network were verified. QKD with PQC authentication has potential in quantum-secure communications, specifically in metropolitan QKD networks.

preprint2021arXiv

Numerical investigation of non-Gaussianities in the phase and modulus of density Fourier modes

We numerically investigate non-Gaussianities in the late-time cosmological density field in Fourier space. We explore various statistics, including the two-point and three-point probability distribution function (PDF) of phase and modulus, and two \& three-point correlation function of of phase and modulus. We detect significant non-Gaussianity for certain configurations. We compare the simulation results with the theoretical expansion series of \citet{2007ApJS..170....1M}. We find that the $\mathcal{O}(V^{-1/2})$ order term alone is sufficiently accurate to describe all the measured non-Gaussianities in not only the PDFs, but also the correlations. We also numerically find that the phase-modulus cross-correlation contributes $\sim 50\%$ to the bispectrum, further verifying the accuracy of the $\mathcal{O}(V^{-1/2})$ order prediction. This work demonstrates that non-Gaussianity of the cosmic density field is simpler in Fourier space, and may facilitate the data analysis in the era of precision cosmology.

preprint2021arXiv

The DESI $N$-body Simulation Project I: Testing the Robustness of Simulations for the DESI Dark Time Survey

Analysis of large galaxy surveys requires confidence in the robustness of numerical simulation methods. The simulations are used to construct mock galaxy catalogs to validate data analysis pipelines and identify potential systematics. We compare three $N$-body simulation codes, ABACUS, GADGET, and SWIFT, to investigate the regimes in which their results agree. We run $N$-body simulations at three different mass resolutions, $6.25\times10^{8}$, $2.11\times10^{9}$, and $5.00\times10^{9}~h^{-1}$M$_{\odot}$, matching phases to reduce the noise within the comparisons. We find systematic errors in the halo clustering between different codes are smaller than the DESI statistical error for $s > 20\, h^{-1}$Mpc in the correlation function in redshift space. Through the resolution comparison we find that simulations run with a mass resolution of $2.1\times10^{9}~h^{-1}$M$_{\odot}$ are sufficiently converged for systematic effects in the halo clustering to be smaller than the DESI statistical error at scales larger than $20 \, h^{-1}$Mpc. These findings show that the simulations are robust for extracting cosmological information from large scales which is the key goal of the DESI survey. Comparing matter power spectra, we find the codes agree to within 1% for $k \leq 10~h$Mpc$^{-1}$. We also run a comparison of three initial condition generation codes and find good agreement. In addition, we include a quasi-$N$-body code, FastPM, since we plan use it for certain DESI analyses. The impact of the halo definition and galaxy-halo relation will be presented in a follow up study.

preprint2020arXiv

A Differential Approach for Gaze Estimation

Non-invasive gaze estimation methods usually regress gaze directions directly from a single face or eye image. However, due to important variabilities in eye shapes and inner eye structures amongst individuals, universal models obtain limited accuracies and their output usually exhibit high variance as well as biases which are subject dependent. Therefore, increasing accuracy is usually done through calibration, allowing gaze predictions for a subject to be mapped to his/her actual gaze. In this paper, we introduce a novel image differential method for gaze estimation. We propose to directly train a differential convolutional neural network to predict the gaze differences between two eye input images of the same subject. Then, given a set of subject specific calibration images, we can use the inferred differences to predict the gaze direction of a novel eye sample. The assumption is that by allowing the comparison between two eye images, annoyance factors (alignment, eyelid closing, illumination perturbations) which usually plague single image prediction methods can be much reduced, allowing better prediction altogether. Experiments on 3 public datasets validate our approach which constantly outperforms state-of-the-art methods even when using only one calibration sample or when the latter methods are followed by subject specific gaze adaptation.

preprint2020arXiv

Multimode silicon photonics using on-chip geometrical-optics

On-chip optical interconnect has been widely accepted as a promising technology to realize future large-scale multiprocessors. Mode-division multiplexing (MDM) provides a new degree of freedom for optical interconnects to dramatically increase the link capacity. Present on-chip multimode devices are based on traditional wave-optics. Although large amount of computation and optimization are adopted to support more modes, mode-independent manipulation is still hard to be achieved due to severe mode dispersion. Here, we propose a universal solution to standardize the design of fundamental multimode building blocks, by introducing a geometrical-optics-like concept adopting waveguide width larger than the working wavelength. The proposed solution can tackle a group of modes at the same time with very simple processes, avoiding demultiplexing procedure and ensuring compact footprint. Compare to conventional schemes, it is scalable to larger mode channels without increasing the complexity and whole footprint. As a proof of concept, we demonstrate a set of multimode building blocks including crossing, bend, coupler and switches. Low losses of multimode waveguide crossing and bend are achieved, as well as ultra-low power consumption of the multimode switch is realized since it enables reconfigurable routing for a group of modes simultaneously. Our work promotes the multimode photonics research and makes the MDM technique more practical.

preprint2020arXiv

Neutrino effects on the morphology of cosmic large-scale structure

In this work, we propose a powerful probe of neutrino effects on the large-scale structure (LSS) of the Universe, i.e., Minkowski functionals (MFs). The morphology of LSS can be fully described by four MFs. This tool, with strong statistical power, is robust to various systematics and can comprehensively probe all orders of N-point statistics. By using a pair of high-resolution N-body simulations, for the first time, we comprehensively studied the subtle neutrino effects on the morphology of LSS. For an ideal LSS survey of volume $\sim1.73$ Gpc$^3$/$h^3$, neutrino signals are mainly detected from void regions with a significant level up to $\thicksim 10σ$ and $\thicksim 300σ$ for CDM and total matter density fields, respectively. This demonstrates its enormous potential for much improving the neutrino mass constraint in the data analysis of up-coming ambitious LSS surveys.

preprint2020arXiv

PrivPy: Enabling Scalable and General Privacy-Preserving Machine Learning

We introduce PrivPy, a practical privacy-preserving collaborative computation framework, especially optimized for machine learning tasks. PrivPy provides an easy-to-use and highly compatible Python programming front-end which supports high-level array operations and different secure computation engines to allow for security assumptions and performance trade-offs. With PrivPy, programmers can write modern machine learning algorithms conveniently and efficiently in Python. We also design and implement a new efficient computation engine, with which people can use competing cloud providers to efficiently perform general arithmetics over real numbers. We demonstrate the usability and scalability of PrivPy using common machine learning models (e.g. logistic regression and convolutional neural networks) and real-world datasets (including a 5000-by-1-million matrix).

preprint2020arXiv

Probing Primordial Chirality with Galaxy Spins

Chiral symmetry is maximally violated in weak interactions, and such microscopic asymmetries in the early Universe might leave observable imprints on astrophysical scales without violating the cosmological principle. In this Letter, we propose a helicity measurement to detect primordial chiral violation. We point out that observations of halo-galaxy angular momentum directions (spins), which are frozen in during the galaxy formation process, provide a fossil chiral observable. From the clustering mode of large scale structure of the Universe, we construct a spin mode in Lagrangian space and show in simulations that it is a good probe of halo-galaxy spins. In standard model, a strong symmetric correlation between the left and right helical components of this spin mode and galaxy spins is expected. Measurements of these correlations will be sensitive to chiral breaking, providing a direct test of chiral symmetry breaking in the early Universe.

preprint2020arXiv

The copula of the cosmological matter density field is non-Gaussian

Non-Gaussianity of the cosmological matter density field can be largely reduced by a local Gaussianization transformation (and its approximations such as the logrithmic transformation). Such behavior can be recasted as the Gaussian copula hypothesis, and has been verified to very high accuracy at two-point level. On the other hand, statistically significant non-Gaussianities in the Gaussianized field have been detected in simulations. We point out that, this apparent inconsistency is caused by the very limited degrees of freedom in the copula function, which make it misleading as a diagnosis of residual non-Gaussianity in the Gaussianized field. Using the copula density, we highlight the departure from Gaussianity. We further quantify its impact in the predicted n-point correlation functions. We explore a remedy of the Gaussian copula hypothesis, which alleviates but not completely solves the above problems.

preprint2020arXiv

Unsupervised Representation Learning for Gaze Estimation

Although automatic gaze estimation is very important to a large variety of application areas, it is difficult to train accurate and robust gaze models, in great part due to the difficulty in collecting large and diverse data (annotating 3D gaze is expensive and existing datasets use different setups). To address this issue, our main contribution in this paper is to propose an effective approach to learn a low dimensional gaze representation without gaze annotations, which to the best of our best knowledge, is the first work to do so. The main idea is to rely on a gaze redirection network and use the gaze representation difference of the input and target images (of the redirection network) as the redirection variable. A redirection loss in image domain allows the joint training of both the redirection network and the gaze representation network. In addition, we propose a warping field regularization which not only provides an explicit physical meaning to the gaze representations but also avoids redirection distortions. Promising results on few-shot gaze estimation (competitive results can be achieved with as few as <= 100 calibration samples), cross-dataset gaze estimation, gaze network pretraining, and another task (head pose estimation) demonstrate the validity of our framework.

preprint2019arXiv

Nonlinear Reconstruction of the Velocity Field

We propose a new velocity reconstruction method based on the displacement estimation by recently developed methods. The velocity is first reconstructed by transfer functions in Lagrangian space and then mapped into Eulerian space. High resolution simulations are used to test the performance. We find that the new reconstruction method outperforms the standard velocity reconstruction in the sense of better cross-correlation coefficient, less velocity misalignment and smaller amplitude difference. We conclude that this new method has the potential to improve the large-scale structure sciences involving a velocity reconstruction, such as kinetic Sunyaev-Zel&#39;dovich measurement and supernova cosmology.