Researcher profile

Leo Liu

Leo Liu contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 17 (Unverified) · Verification L1 · Unclaimed author
4 works
0 followers
6 topics
4 close collaborators

Published work

4 published items

Preprint · 2022 · arXiv

Training Large-Vocabulary Neural Language Models by Private Federated Learning for Resource-Constrained Devices

Federated Learning (FL) is a technique to train models using data distributed across devices. Differential Privacy (DP) provides a formal privacy guarantee for sensitive data. Our goal is to train a large neural network language model (NNLM) on compute-constrained devices while preserving privacy using FL and DP. However, the DP-noise introduced to the model increases as the model size grows, which often prevents convergence. We propose Partial Embedding Updates (PEU), a novel technique to decrease noise by decreasing payload size. Furthermore, we adopt Low Rank Adaptation (LoRA) and Noise Contrastive Estimation (NCE) to reduce the memory demands of large models on compute-constrained devices. This combination of techniques makes it possible to train large-vocabulary language models while preserving accuracy and privacy.
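
The link between payload size and noise can be illustrated with the standard Gaussian mechanism used in differentially private training. This is a minimal sketch under that assumption, not the paper's implementation: each update is clipped to a fixed L2 norm and independent Gaussian noise is added to every coordinate. The signal norm stays bounded by the clip norm while the noise norm grows with the square root of the number of transmitted parameters. The function names and parameter values are hypothetical.

```python
import math
import random


def clip_update(update, clip_norm):
    """Scale the update down so its L2 norm is at most clip_norm."""
    norm = math.sqrt(sum(v * v for v in update))
    scale = min(1.0, clip_norm / norm) if norm > 0 else 1.0
    return [v * scale for v in update]


def add_gaussian_noise(update, clip_norm, noise_multiplier, rng):
    """Add N(0, (noise_multiplier * clip_norm)^2) noise to each coordinate."""
    std = noise_multiplier * clip_norm
    return [v + rng.gauss(0.0, std) for v in update]


def rms_noise_norm(dim, clip_norm, noise_multiplier):
    """Root-mean-square L2 norm of the noise vector for a payload of size dim."""
    return noise_multiplier * clip_norm * math.sqrt(dim)


if __name__ == "__main__":
    rng = random.Random(0)
    clip_norm, noise_multiplier = 1.0, 1.0  # assumed values for illustration
    for dim in (1_000, 10_000, 100_000):
        # A zero update isolates the noise contribution.
        noisy = add_gaussian_noise([0.0] * dim, clip_norm, noise_multiplier, rng)
        measured = math.sqrt(sum(v * v for v in noisy))
        expected = rms_noise_norm(dim, clip_norm, noise_multiplier)
        print(f"dim={dim:>7}  measured noise norm={measured:8.2f}  expected={expected:8.2f}")
```

Under these assumptions, sending a quarter of the parameters halves the noise norm relative to the clipped signal, which is the intuition behind reducing payload size.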

Preprint · 2020 · arXiv

Diffuser-Assisted Infrared Transit Photometry for Four Dynamically Interacting Kepler Systems

We present ground-based infrared transit observations for four dynamically interacting Kepler planets, including Kepler-29b, Kepler-36c, KOI-1783.01, and Kepler-177c, obtained using the Wide-field Infrared Camera on the Hale 200" telescope at Palomar Observatory. By utilizing an engineered diffuser and custom guiding software, we mitigate time-correlated telluric and instrumental noise sources in these observations. We achieve an infrared photometric precision comparable to or better than that of space-based observatories such as the Spitzer Space Telescope, and detect transits with greater than 3σ significance for all planets. For Kepler-177c (J = 13.9) our measurement uncertainties are only 1.2× the photon noise limit and 1.9 times better than the predicted photometric precision for Spitzer IRAC photometry of this same target. We find that a single transit observation obtained 4-5 years after the end of the original Kepler mission can reduce dynamical mass uncertainties by as much as a factor of three for these systems. Additionally, we combine our new observations of KOI-1783.01 with information from the literature to confirm the planetary nature of this system. We discuss the implications of our new mass and radius constraints in the context of known exoplanets with low incident fluxes, and we note that Kepler-177c may be a more massive analog to the currently known super-puffs given its core mass (3.8 ± 0.9 Earth masses) and large gas-to-core ratio (2.8 ± 0.7). Our demonstrated infrared photometric performance opens up new avenues for ground-based observations of transiting exoplanets previously thought to be restricted to space-based investigation.

Preprint · 2020 · arXiv

SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition

Very deep CNNs achieve state-of-the-art results in both computer vision and speech recognition, but are difficult to train. The most popular way to train very deep CNNs is to use shortcut connections (SC) together with batch normalization (BN). Inspired by Self-Normalizing Neural Networks, we propose the self-normalizing deep CNN (SNDCNN) based acoustic model topology, by removing the SC/BN and replacing the typical ReLU activations with scaled exponential linear unit (SELU) in ResNet-50. SELU activations make the network self-normalizing and remove the need for both shortcut connections and batch normalization. Compared to ResNet-50, we can achieve the same or lower (up to 4.5% relative) word error rate (WER) while boosting both training and inference speed by 60%-80%. We also explore other model inference optimization schemes to further reduce latency for production use.
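
For readers unfamiliar with SELU, the activation itself can be sketched in a few lines. This is an illustrative sketch of the activation only, using the standard published constants, and not the SNDCNN acoustic model; the helper `selu_moments` is a hypothetical name introduced here. It checks empirically that standard-normal inputs come out with mean near 0 and variance near 1, which is the self-normalizing property the abstract relies on.

```python
import math
import random

# Standard SELU constants from the self-normalizing networks formulation.
SELU_ALPHA = 1.6732632423543772
SELU_SCALE = 1.0507009873554805


def selu(x):
    """Scaled exponential linear unit applied to a single float."""
    if x > 0:
        return SELU_SCALE * x
    return SELU_SCALE * SELU_ALPHA * (math.exp(x) - 1.0)


def selu_moments(num_samples, seed=0):
    """Return (mean, variance) of SELU outputs for standard-normal inputs."""
    rng = random.Random(seed)
    outputs = [selu(rng.gauss(0.0, 1.0)) for _ in range(num_samples)]
    mean = sum(outputs) / num_samples
    variance = sum((y - mean) ** 2 for y in outputs) / num_samples
    return mean, variance


if __name__ == "__main__":
    mean, variance = selu_moments(200_000)
    print(f"mean={mean:.3f}  variance={variance:.3f}")
```

Because zero mean and unit variance are approximately preserved from layer to layer, activations do not need to be re-centered by batch normalization, which is the motivation for dropping BN in a SELU-based network.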

Preprint · 2020 · arXiv

Voice trigger detection from LVCSR hypothesis lattices using bidirectional lattice recurrent neural networks

We propose a method to reduce false voice triggers of a speech-enabled personal assistant by post-processing the hypothesis lattice of a server-side large-vocabulary continuous speech recognizer (LVCSR) via a neural network. We first discuss how an estimate of the posterior probability of the trigger phrase can be obtained from the hypothesis lattice using known techniques to perform detection, then investigate a statistical model that processes the lattice in a more explicitly data-driven, discriminative manner. We propose using a Bidirectional Lattice Recurrent Neural Network (LatticeRNN) for the task, and show that it can significantly improve detection accuracy over using the 1-best result or the posterior.