Source author record

Seng Pei Liew

Seng Pei Liew appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

hep-ph hep-ex astro-ph.CO Cryptography and Security Machine Learning Databases Computation and Language cond-mat.mes-hall cond-mat.stat-mech quant-ph

Catalog footprint

What is connected

19works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Differentially Private Sampling from Distributions via Wasserstein Projection

In this paper, we study the problem of sampling from a distribution under the constraint of differential privacy (DP). Prior works measure the utility of DP sampling with density ratio-based measures such as KL divergence. However, such formulations suffer from two key limitations: 1) they fail to capture the geometric structure of the support, and 2) they are not applicable when the supports of the distributions differ. To deal with these issues, we develop a novel framework for DP sampling with Wasserstein distance as the utility measure. In this formulation, we propose Wasserstein Projection Mechanism (WPM), a minimax optimal mechanism based on Wasserstein projection. Furthermore, we develop efficient algorithms for computing the proposed mechanisms approximately and provide convergence guarantees.

preprint2026arXiv

Shuffling-Aware Optimization for Private Vector Mean Estimation

We study $d$-dimensional unbiased mean estimation in the single-message shuffle model, where each user sends a single privatized message and the analyzer only observes the shuffled multiset of reports. While minimax-optimal mechanisms are well understood in the local differential privacy setting, the corresponding notion of optimality after shuffling has remained largely unexplored. To address this gap, we introduce the recently proposed shuffle index and use it to formulate the post-shuffling mechanism design problem as an explicit optimization problem. We then establish a minimax lower bound on the achievable mean squared error in terms of the shuffle index, which implies that mechanisms that are optimal under LDP can become suboptimal once shuffling is applied. Finally, we construct an asymptotically minimax optimal mechanism in the high privacy regime, which as a consequence achieves a privacy-utility trade-off nearly identical to that of the central Gaussian mechanism.

preprint2026arXiv

Towards Principled Design of Mixture-of-Experts Language Models under Memory and Inference Constraints

Modern Mixture-of-Experts (MoE) language models are designed based on total parameters (memory footprint) and active parameters (inference cost). However, we find these two factors alone are insufficient to describe an optimal architecture. Through a systematic study, we demonstrate that MoE performance is primarily determined by total parameters ($N_{total}$) and expert sparsity ($s:=n_{exp}/n_{topk}$). Moreover, $n_{exp}$ and $n_{topk}$ do not "cancel out" within the sparsity ratio; instead, a larger total number of experts slightly penalizes performance by forcing a reduction in core model dimensions (depth and width) to meet memory constraints. This motivates a simple principle for MoE design which maximizes $N_{total}$ while minimizing $s$ (maximizing $n_{topk}$) and $n_{exp}$ under the given constraints. Our findings provide a robust framework for resolving architectural ambiguity and guiding MoE design.

preprint2022arXiv

HDPView: Differentially Private Materialized View for Exploring High Dimensional Relational Data

How can we explore the unknown properties of high-dimensional sensitive relational data while preserving privacy? We study how to construct an explorable privacy-preserving materialized view under differential privacy. No existing state-of-the-art methods simultaneously satisfy the following essential properties in data exploration: workload independence, analytical reliability (i.e., providing error bound for each search query), applicability to high-dimensional data, and space efficiency. To solve the above issues, we propose HDPView, which creates a differentially private materialized view by well-designed recursive bisected partitioning on an original data cube, i.e., count tensor. Our method searches for block partitioning to minimize the error for the counting query, in addition to randomizing the convergence, by choosing the effective cutting points in a differentially private way, resulting in a less noisy and compact view. Furthermore, we ensure formal privacy guarantee and analytical reliability by providing the error bound for arbitrary counting queries on the materialized views. HDPView has the following desirable properties: (a) Workload independence, (b) Analytical reliability, (c) Noise resistance on high-dimensional data, (d) Space efficiency. To demonstrate the above properties and the suitability for data exploration, we conduct extensive experiments with eight types of range counting queries on eight real datasets. HDPView outperforms the state-of-the-art methods in these evaluations.

preprint2022arXiv

Measuring Lower Bounds of Local Differential Privacy via Adversary Instantiations in Federated Learning

Local differential privacy (LDP) gives a strong privacy guarantee to be used in a distributed setting like federated learning (FL). LDP mechanisms in FL protect a client's gradient by randomizing it on the client; however, how can we interpret the privacy level given by the randomization? Moreover, what types of attacks can we mitigate in practice? To answer these questions, we introduce an empirical privacy test by measuring the lower bounds of LDP. The privacy test estimates how an adversary predicts if a reported randomized gradient was crafted from a raw gradient $g_1$ or $g_2$. We then instantiate six adversaries in FL under LDP to measure empirical LDP at various attack surfaces, including a worst-case attack that reaches the theoretical upper bound of LDP. The empirical privacy test with the adversary instantiations enables us to interpret LDP more intuitively and discuss relaxation of the privacy parameter until a particular instantiated attack surfaces. We also demonstrate numerical observations of the measured privacy in these adversarial settings, and the worst-case attack is not realistic in FL. In the end, we also discuss the possible relaxation of privacy levels in FL under LDP.

preprint2022arXiv

Network Shuffling: Privacy Amplification via Random Walks

Recently, it is shown that shuffling can amplify the central differential privacy guarantees of data randomized with local differential privacy. Within this setup, a centralized, trusted shuffler is responsible for shuffling by keeping the identities of data anonymous, which subsequently leads to stronger privacy guarantees for systems. However, introducing a centralized entity to the originally local privacy model loses some appeals of not having any centralized entity as in local differential privacy. Moreover, implementing a shuffler in a reliable way is not trivial due to known security issues and/or requirements of advanced hardware or secure computation technology. Motivated by these practical considerations, we rethink the shuffle model to relax the assumption of requiring a centralized, trusted shuffler. We introduce network shuffling, a decentralized mechanism where users exchange data in a random-walk fashion on a network/graph, as an alternative of achieving privacy amplification via anonymity. We analyze the threat model under such a setting, and propose distributed protocols of network shuffling that is straightforward to implement in practice. Furthermore, we show that the privacy amplification rate is similar to other privacy amplification techniques such as uniform shuffling. To our best knowledge, among the recently studied intermediate trust models that leverage privacy amplification techniques, our work is the first that is not relying on any centralized entity to achieve privacy amplification.

preprint2022arXiv

PEARL: Data Synthesis via Private Embeddings and Adversarial Reconstruction Learning

We propose a new framework of synthesizing data using deep generative models in a differentially private manner. Within our framework, sensitive data are sanitized with rigorous privacy guarantees in a one-shot fashion, such that training deep generative models is possible without re-using the original data. Hence, no extra privacy costs or model constraints are incurred, in contrast to popular approaches such as Differentially Private Stochastic Gradient Descent (DP-SGD), which, among other issues, causes degradation in privacy guarantees as the training iteration increases. We demonstrate a realization of our framework by making use of the characteristic function and an adversarial re-weighting objective, which are of independent interest as well. Our proposal has theoretical guarantees of performance, and empirical evaluations on multiple datasets show that our approach outperforms other methods at reasonable levels of privacy.

preprint2022arXiv

Scaling Private Deep Learning with Low-Rank and Sparse Gradients

Applying Differentially Private Stochastic Gradient Descent (DPSGD) to training modern, large-scale neural networks such as transformer-based models is a challenging task, as the magnitude of noise added to the gradients at each iteration scales with model dimension, hindering the learning capability significantly. We propose a unified framework, $\textsf{LSG}$, that fully exploits the low-rank and sparse structure of neural networks to reduce the dimension of gradient updates, and hence alleviate the negative impacts of DPSGD. The gradient updates are first approximated with a pair of low-rank matrices. Then, a novel strategy is utilized to sparsify the gradients, resulting in low-dimensional, less noisy updates that are yet capable of retaining the performance of neural networks. Empirical evaluation on natural language processing and computer vision tasks shows that our method outperforms other state-of-the-art baselines.

preprint2020arXiv

Light Axinos from Freeze-in: production processes, phase space distributions, and Ly-$α$ forest constraints

We consider freeze-in production of 7 keV axino dark matter (DM) in the supersymmetric Dine-Fischler-Srednicki-Zhitnitsky (DFSZ) model in light of the 3.5 keV line excess. The warmness of such 7 keV DM produced from the thermal bath, in general, appears in tension with Ly-$α$ forest data, although a direct comparison is not straightforward. This is because the Ly-$α$ forest constraints are usually reported on the mass of the conventional warm dark matter (WDM), where large entropy production is implicitly assumed to occur in the thermal bath after WDM particles decouple. The phase space distribution of freeze-in axino DM varies depending on production processes and axino DM may alleviate the tension with the tight Ly-$α$ forest constraints. By solving the Boltzmann equation, we first obtain the resultant phase space distribution of axinos produced by 2-body decay, 3-body decay, and 2-to-2 scattering, respectively. The reduced collision term and resultant phase space distribution are useful for studying other freeze-in scenarios as well. We then calculate the resultant linear matter power spectra for such axino DM and directly compare them with the linear matter power spectra for the conventional WDM. In order to demonstrate realistic axino DM production, we consider benchmark points with the Higgsino next-to-lightest supersymmetric particle (NLSP) and wino NLSP. In the case of the Higgsino NLSP, the phase space distribution of axinos is colder than that in the conventional WDM case, so the most stringent Ly-$α$ forest constraint can be evaded with mild entropy production from saxion decay inherent in the supersymmetric DFSZ axion model.

preprint2016arXiv

Models of 750 GeV quarkonium and the LHC excesses

We investigate models involving a vector-like quark X, which forms a 750 GeV bound state and reproduces the observed diphoton signals at the LHC, in connection with other excesses in the LHC data. An exotic hypercharge of -4/3 is required to fit the signal cross section, which indicates that there is additional particle(s) that mediates the decay of X in the full theory. We find that, introducing an SU(2) doublet vector-like quark of mass around 600 GeV in our UV-complete framework can accommodate not only the diphoton but also the on-Z excess (and potentially a slight excess in the monojet events). Our models also include a dark matter candidate. The most useful way to probe the models at the LHC is via monojet searches. The relic dark matter density is largely determined by coannihilation effects, and indirect detection of dark matter annihilation signals is the alternative and complementary probe of our models.

preprint2015arXiv

A closer look at a hint of SUSY at the 8 TeV LHC

A recent CMS analysis has reported the observation of an excess in the invariant mass distribution of the opposite-sign same-flavour lepton pair, which can be interpreted as a kinematic edge due to new physics. Using collider simulation tools, we recast relevant LHC search results reported by ATLAS and CMS collaborations in order to determine constraints on supersymmetric models that could produce the observed features. In particular, we focus on models involving cascade decays of light-flavour squarks and sbottoms. We find no favourable supersymmetry scenario within our exploration that could explain the origin of the excess when other LHC constraints are taken into account.

preprint2015arXiv

Jumping into buckets, or How to decontaminate overlapping fat jets

At the LHC, tagging boosted heavy particle resonances which decay hadronically, such as top quarks and Higgs bosons, can play an essential role in new physics searches. In events with high multiplicity, however, the standard approach to tag boosted resonances by a large-radius fat jet becomes difficult because the resonances are not well-separated from other hard radiation. In this paper, we propose a different approach to tag and reconstruct boosted resonances by using the recently proposed mass-jump jet algorithm. A key feature of the algorithm is the flexible radius of the jets, which results from a terminating veto that prevents the recombination of two hard prongs if their combined jet mass is substantially larger than the masses of the separate prongs. The idea of collecting jets in "buckets" is also used. As an example, we consider the fully hadronic final state of pair-produced vectorlike top partners at the LHC, $pp\to T\bar{T}\to t\bar{t}HH$, and show that the new approach works better than the corresponding generalized $k_T$ jet clustering algorithm. We also show that tagging and kinematic reconstruction of boosted top quarks and Higgs bosons are possible with good quality even in these very busy final states. The vectorlike top partners are kinematically reconstructed, which allows their direct mass measurement.

preprint2015arXiv

Testing ATLAS Diboson Excess with Dark Matter Searches at LHC

The ATLAS collaboration has recently reported a 2.6 sigma excess in the search for a heavy resonance decaying into a pair of weak gauge bosons. Only fully hadronic final states are being looked for in the analysis. If the observed excess really originates from the gauge bosons' decays, other decay modes of the gauge bosons would inevitably leave a trace on other exotic searches. In this paper, we propose the use of the Z boson decay into a pair of neutrinos to test the excess. This decay leads to a very large missing energy and can be probed with conventional dark matter searches at the LHC. We discuss the current constraints from the dark matter searches and the prospects. We find that optimizing these searches may give a very robust probe of the resonance, even with the currently available data of the 8 TeV LHC.

preprint2015arXiv

Z-peaked excess in goldstini scenarios

We study a possible explanation of a 3.0 $σ$ excess recently reported by the ATLAS Collaboration in events with Z-peaked same-flavour opposite-sign lepton pair, jets and large missing transverse momentum in the context of gauge-mediated SUSY breaking with more than one hidden sector, the so-called goldstini scenario. In a certain parameter space, the gluino two-body decay chain $\tilde g\to g\tildeχ^0_{1,2}\to gZ\tilde G'$ becomes dominant, where $\tildeχ^0_{1,2}$ and $\tilde G'$ are the Higgsino-like neutralino and the massive pseudo-goldstino, respectively, and gluino pair production can contribute to the signal. We find that a mass spectrum such as $m_{\tilde g}\sim 1000$ GeV, $m_{\tildeχ^0_{1,2}}\sim 800$ GeV and $m_{\tilde G'}\sim 600$ GeV demonstrates the rate and the distributions of the excess, without conflicting with the stringent constraints from jets plus missing energy analyses and with the CMS constraint on the identical final state.

preprint2014arXiv

Axino dark matter in light of an anomalous X-ray line

Axino as the superpartner of axion that solves the strong CP problem can be a good candidate of dark matter. Inspired by the 3.5 keV X-ray line signal found to be originated from galaxy clusters and Andromeda galaxy, we study axino models with R-parity violations, and point out that axino dark matter with trilinear R-parity violations is an attractive scenario that reproduces the X-ray line. The Peccei-Quinn scale is required to be $f_a \sim{\cal O}(10^{9}-10^{11})$ GeV for trilinear R-parity violating couplings $λ\sim {\cal O} (10^{-3}-10^{-1})$ in order to explain the line signal. Moreover, the right-handed stau is predicted to be light, i.e. $\sim{\cal O}(100)$ GeV, and thus can be looked for at the LHC. Cosmological aspects of the model are also discussed in this study.

preprint2014arXiv

Isospin-Violating Dark Matter with Colored Mediators

In light of positive signals reported by the CDMS-II Si experiment and the recent results of the LUX and SuperCDMS experiments, we study isospin-violating dark matter scenarios assuming that the interaction of the dark matter is mediated by colored particles. We investigate the phenomenology of the model, including collider searches, flavor and CP phenomenology. A minimal possible scenario includes scalar dark matter and new vector-like colored fermions with masses of O(1) TeV as mediators. Such a scenario may be probed at the 14 TeV LHC, while flavor and CP constraints are stringent and severe tuning in the couplings is unavoidable. We also found that, as an explanation of the CDMS-II Si signal, isospin-violating fermionic dark matter models with colored scalar mediators are disfavored by the LHC constraints.

preprint2013arXiv

Axino dark matter with R-parity violation and 130 GeV gamma-ray line

We show that decaying axino dark matter with R-parity violation can explain the observed excess of the 130GeV gamma-ray line from the Galactic center in the Fermi data. The branching fraction of the axino decay into monochromatic photons can be O(1), and constraints from continuum gamma-rays and the anti-proton flux are ameliorated. The Peccei-Quinn scale of $O(10^{13}-10^{14})$\,GeV and the R-parity violation parameter of $O(10^{-12}-10^{-11})$ are cosmologically favored.

preprint2013arXiv

Gamma-ray line from radiative decay of gravitino dark matter

We study radiative decay of gravitino dark matter with trilinear R-parity violations. We show that the branching ratio of the decay of gravitino into monochromatic photon can be large enough to explain the observed gamma-ray line from the Galactic centre in the Fermi-LAT data without producing too much continuum gamma-ray and anti-proton flux. This scenario is realized when the mass of sfermions and the trilinear R-parity violating coupling are $O(1-10)$ TeV and $O(10^{-7}-10^{-6})$ respectively.

preprint2010arXiv

High-energy long-lived resonance of electrons in fractal-like semiconductor heterostructures

A fractal-like alignment of quantum wells is shown to accommodate resonant states with long lifetimes. For the parameters of the semiconductor heterostructure GaAs/Al$_{0.4}$Ga$_{0.6}$As with the well depth 300meV, a resonant state of the energy as high as 44meV with the lifetime as long as 2.8\{mu}s is shown to be achievable.

Seng Pei Liew

What is connected

Connect this record

See the researcher in context

Building this map preview

19 published item(s)

Differentially Private Sampling from Distributions via Wasserstein Projection

Shuffling-Aware Optimization for Private Vector Mean Estimation

Towards Principled Design of Mixture-of-Experts Language Models under Memory and Inference Constraints

HDPView: Differentially Private Materialized View for Exploring High Dimensional Relational Data

Measuring Lower Bounds of Local Differential Privacy via Adversary Instantiations in Federated Learning

Network Shuffling: Privacy Amplification via Random Walks

PEARL: Data Synthesis via Private Embeddings and Adversarial Reconstruction Learning

Scaling Private Deep Learning with Low-Rank and Sparse Gradients

Light Axinos from Freeze-in: production processes, phase space distributions, and Ly-$α$ forest constraints

Models of 750 GeV quarkonium and the LHC excesses

A closer look at a hint of SUSY at the 8 TeV LHC

Jumping into buckets, or How to decontaminate overlapping fat jets

Testing ATLAS Diboson Excess with Dark Matter Searches at LHC

Z-peaked excess in goldstini scenarios

Axino dark matter in light of an anomalous X-ray line

Isospin-Violating Dark Matter with Colored Mediators

Axino dark matter with R-parity violation and 130 GeV gamma-ray line

Gamma-ray line from radiative decay of gravitino dark matter

High-energy long-lived resonance of electrons in fractal-like semiconductor heterostructures