Source author record

Leo Liu

Leo Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher | Unclaimed source record

Topics: Computation and Language; Machine Learning; eess.AS; Sound; astro-ph.EP; Cryptography and Security

Catalog footprint

What is connected

4 works

6 topics

4 close collaborators


Preprint, 2022, arXiv

Training Large-Vocabulary Neural Language Models by Private Federated Learning for Resource-Constrained Devices

Federated Learning (FL) is a technique to train models using data distributed across devices. Differential Privacy (DP) provides a formal privacy guarantee for sensitive data. Our goal is to train a large neural network language model (NNLM) on compute-constrained devices while preserving privacy using FL and DP. However, the DP-noise introduced to the model increases as the model size grows, which often prevents convergence. We propose Partial Embedding Updates (PEU), a novel technique to decrease noise by decreasing payload size. Furthermore, we adopt Low Rank Adaptation (LoRA) and Noise Contrastive Estimation (NCE) to reduce the memory demands of large models on compute-constrained devices. This combination of techniques makes it possible to train large-vocabulary language models while preserving accuracy and privacy.
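The link between model size and DP noise can be illustrated with a small sketch. It assumes the standard Gaussian mechanism (each update clipped to an L2 norm bound, then isotropic Gaussian noise added per coordinate), which the abstract does not spell out; the function names and parameter values below are hypothetical. Under that assumption the clipped signal norm stays bounded while the noise norm grows roughly with the square root of the number of transmitted parameters, which is why shrinking the payload helps.

```python
import math
import random


def expected_noise_norm(num_params: int, noise_multiplier: float, clip_norm: float) -> float:
    """Approximate L2 norm of Gaussian noise with per-coordinate std
    noise_multiplier * clip_norm over num_params coordinates."""
    return noise_multiplier * clip_norm * math.sqrt(num_params)


def sampled_noise_norm(num_params: int, noise_multiplier: float,
                       clip_norm: float, seed: int = 0) -> float:
    """Draw one noise vector and return its L2 norm (a sanity check on the formula)."""
    rng = random.Random(seed)
    std = noise_multiplier * clip_norm
    return math.sqrt(sum(rng.gauss(0.0, std) ** 2 for _ in range(num_params)))


if __name__ == "__main__":
    clip_norm = 1.0          # assumed clipping bound on each update
    noise_multiplier = 1.0   # assumed noise multiplier
    for num_params in (10_000, 40_000, 160_000):
        approx = expected_noise_norm(num_params, noise_multiplier, clip_norm)
        drawn = sampled_noise_norm(num_params, noise_multiplier, clip_norm)
        # The clipped update has norm at most clip_norm, so this ratio is a
        # lower bound on noise-to-signal for a single update.
        print(num_params, round(approx, 1), round(drawn, 1))
```

Quadrupling the payload doubles the expected noise norm in this sketch, so sending only part of the embedding table reduces the noise that the aggregated update has to overcome.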

Preprint, 2020, arXiv

Diffuser-Assisted Infrared Transit Photometry for Four Dynamically Interacting Kepler Systems

We present ground-based infrared transit observations for four dynamically interacting Kepler planets, including Kepler-29b, Kepler-36c, KOI-1783.01, and Kepler-177c, obtained using the Wide-field Infrared Camera on the Hale 200-inch telescope at Palomar Observatory. By utilizing an engineered diffuser and custom guiding software, we mitigate time-correlated telluric and instrumental noise sources in these observations. We achieve an infrared photometric precision comparable to or better than that of space-based observatories such as the Spitzer Space Telescope, and detect transits with greater than $3\sigma$ significance for all planets. For Kepler-177c ($J=13.9$) our measurement uncertainties are only $1.2\times$ the photon noise limit and 1.9 times better than the predicted photometric precision for Spitzer IRAC photometry of this same target. We find that a single transit observation obtained 4 to 5 years after the end of the original Kepler mission can reduce dynamical mass uncertainties by as much as a factor of three for these systems. Additionally, we combine our new observations of KOI-1783.01 with information from the literature to confirm the planetary nature of this system. We discuss the implications of our new mass and radius constraints in the context of known exoplanets with low incident fluxes, and we note that Kepler-177c may be a more massive analog to the currently known super-puffs given its core mass ($3.8 \pm 0.9\,M_\oplus$) and large gas-to-core ratio ($2.8 \pm 0.7$). Our demonstrated infrared photometric performance opens up new avenues for ground-based observations of transiting exoplanets previously thought to be restricted to space-based investigation.

Preprint, 2020, arXiv

SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition

Very deep CNNs achieve state-of-the-art results in both computer vision and speech recognition, but are difficult to train. The most popular way to train very deep CNNs is to use shortcut connections (SC) together with batch normalization (BN). Inspired by Self-Normalizing Neural Networks, we propose the self-normalizing deep CNN (SNDCNN) based acoustic model topology, by removing the SC/BN and replacing the typical ReLU activations with scaled exponential linear unit (SELU) in ResNet-50. SELU activations make the network self-normalizing and remove the need for both shortcut connections and batch normalization. Compared to ResNet-50, we can achieve the same or lower (up to 4.5% relative) word error rate (WER) while boosting both training and inference speed by 60%-80%. We also explore other model inference optimization schemes to further reduce latency for production use.
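For readers unfamiliar with SELU, the following is a minimal sketch of the activation and of the self-normalizing property it is named for. It is not the SNDCNN model itself: it uses the constants published with Self-Normalizing Neural Networks and only checks, on simulated standard-normal inputs, that the output mean stays near 0 and the variance near 1. The helper name `selu_moments` and the sample count are illustrative choices.

```python
import math
import random

# Constants from the Self-Normalizing Neural Networks paper.
SELU_ALPHA = 1.6732632423543772
SELU_SCALE = 1.0507009873554805


def selu(x: float) -> float:
    """Scaled exponential linear unit for a single float."""
    if x > 0.0:
        return SELU_SCALE * x
    return SELU_SCALE * SELU_ALPHA * (math.exp(x) - 1.0)


def selu_moments(num_samples: int = 200_000, seed: int = 0):
    """Return (mean, variance) of selu(z) for z drawn from a standard normal."""
    rng = random.Random(seed)
    outputs = [selu(rng.gauss(0.0, 1.0)) for _ in range(num_samples)]
    mean = sum(outputs) / num_samples
    variance = sum((y - mean) ** 2 for y in outputs) / num_samples
    return mean, variance


if __name__ == "__main__":
    mean, variance = selu_moments()
    print(f"mean={mean:.3f} variance={variance:.3f}")
```

Because zero mean and unit variance are preserved by the activation, a stack of such layers can keep activations in a stable range without batch normalization, which is the property the abstract relies on when removing SC and BN.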

Preprint, 2020, arXiv

Voice trigger detection from LVCSR hypothesis lattices using bidirectional lattice recurrent neural networks

We propose a method to reduce false voice triggers of a speech-enabled personal assistant by post-processing the hypothesis lattice of a server-side large-vocabulary continuous speech recognizer (LVCSR) via a neural network. We first discuss how an estimate of the posterior probability of the trigger phrase can be obtained from the hypothesis lattice using known techniques to perform detection, then investigate a statistical model that processes the lattice in a more explicitly data-driven, discriminative manner. We propose using a Bidirectional Lattice Recurrent Neural Network (LatticeRNN) for the task, and show that it can significantly improve detection accuracy over using the 1-best result or the posterior.
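The posterior baseline mentioned in the abstract can be illustrated on a toy lattice. The sketch below is an assumption-laden simplification, not the paper's method: it treats the lattice as a small directed acyclic graph, enumerates every path (real systems would use a forward-backward pass instead), and reports the share of total path probability carried by paths that contain the trigger phrase. The arcs, scores and phrase are made up for illustration.

```python
import math

# Hypothetical toy lattice: arcs are (start_node, end_node, word, log_score).
ARCS = [
    (0, 1, "hey", math.log(0.6)),
    (0, 1, "hay", math.log(0.4)),
    (1, 2, "assistant", math.log(0.7)),
    (1, 2, "assistance", math.log(0.3)),
]


def enumerate_paths(arcs, start, final):
    """Yield (words, log_score) for every path from start to final in a DAG."""
    outgoing = {}
    for arc in arcs:
        outgoing.setdefault(arc[0], []).append(arc)
    stack = [(start, (), 0.0)]
    while stack:
        node, words, score = stack.pop()
        if node == final:
            yield words, score
            continue
        for _, next_node, word, log_score in outgoing.get(node, []):
            stack.append((next_node, words + (word,), score + log_score))


def trigger_posterior(arcs, start, final, trigger):
    """Fraction of total path probability on paths containing the trigger phrase."""
    total = 0.0
    hit = 0.0
    n = len(trigger)
    for words, score in enumerate_paths(arcs, start, final):
        prob = math.exp(score)
        total += prob
        if any(words[i:i + n] == trigger for i in range(len(words) - n + 1)):
            hit += prob
    return hit / total if total > 0.0 else 0.0


if __name__ == "__main__":
    print(trigger_posterior(ARCS, 0, 2, ("hey", "assistant")))
```

A detector built on this quantity thresholds a single number, whereas the LatticeRNN described in the abstract learns directly from the lattice structure.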
