Researcher profile

Nathan Ng

Nathan Ng contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 (Emerging) | Verification L1 | Unclaimed author
6 works
0 followers
5 topics
4 close collaborators

Published work

6 published item(s)

Preprint | 2023 | arXiv

Improving Dialogue Breakdown Detection with Semi-Supervised Learning

Building user trust in dialogue agents requires smooth and consistent dialogue exchanges. However, agents can easily lose conversational context and generate irrelevant utterances. These situations are called dialogue breakdown, where agent utterances prevent users from continuing the conversation. Building systems to detect dialogue breakdown allows agents to recover appropriately or avoid breakdown entirely. In this paper, we investigate the use of semi-supervised learning methods to improve dialogue breakdown detection, including continued pre-training on the Reddit dataset and a manifold-based data augmentation method. We demonstrate the effectiveness of these methods on the Dialogue Breakdown Detection Challenge (DBDC) English shared task. Our submissions to the 2020 DBDC5 shared task place first, beating baselines and other submissions by over 12% accuracy. In ablations on DBDC4 data from 2019, our semi-supervised learning methods improve the performance of a baseline BERT model by 2% accuracy. These methods are applicable generally to any dialogue task and provide a simple way to improve model performance.

Preprint | 2022 | arXiv

If Influence Functions are the Answer, Then What is the Question?

Influence functions efficiently estimate the effect of removing a single training data point on a model's learned parameters. While influence estimates align well with leave-one-out retraining for linear models, recent works have shown this alignment is often poor in neural networks. In this work, we investigate the specific factors that cause this discrepancy by decomposing it into five separate terms. We study the contributions of each term on a variety of architectures and datasets and how they vary with factors such as network width and training time. While practical influence function estimates may be a poor match to leave-one-out retraining for nonlinear networks, we show they are often a good approximation to a different object we term the proximal Bregman response function (PBRF). Since the PBRF can still be used to answer many of the questions motivating influence functions, such as identifying influential or mislabeled examples, our results suggest that current algorithms for influence function estimation give more informative results than previous error analyses would suggest.

Preprint | 2022 | arXiv

Long-time memory effects in a localizable central spin problem

We study the properties of the Nakajima-Zwanzig memory kernel for a qubit immersed in a many-body localized (i.e., disordered and interacting) bath. We argue that the memory kernel decays as a power law in both the localized and ergodic regimes, and show how this can be leveraged to extract $t\to\infty$ populations for the qubit from finite time ($J t \leq 10^2$) data in the thermalizing phase. This allows us to quantify how the long-time values of the populations approach the expected thermalized state as the bath approaches the thermodynamic limit. This approach should provide a good complement to state-of-the-art numerical methods, for which the long-time dynamics with large baths are impossible to simulate in this phase. Additionally, our numerics on finite baths reveal the possibility for unbounded exponential growth in the memory kernel, a phenomenon rooted in the appearance of exceptional points in the projected Liouvillian governing the reduced dynamics. In small systems amenable to exact numerics, we find that these pathologies may have some correlation with delocalization.

Preprint | 2022 | arXiv

The eighth moment of the Riemann zeta function

In this article, we establish an asymptotic formula for the eighth moment of the Riemann zeta function, assuming the Riemann hypothesis and a quaternary additive divisor conjecture. This builds on the work of the first author on the sixth moment of the Riemann zeta function and work of Conrey-Gonek and Ivić. A key input is a sharp bound for a certain shifted moment of the Riemann zeta function, assuming the Riemann hypothesis.

Preprint | 2021 | arXiv

Explicit zero density for the Riemann zeta function

Let $N(σ,T)$ denote the number of nontrivial zeros of the Riemann zeta function with real part greater than $σ$ and imaginary part between $0$ and $T$. We provide explicit upper bounds for $N(σ,T)$, commonly referred to as a zero density result. In 1937, Ingham showed the following asymptotic result: $N(σ,T)=\mathcal{O} ( T^{\frac83(1-σ)} (\log T)^5 )$. Ramaré recently proved an explicit version of this estimate. We discuss a generalization of the method used in these two results which yields an explicit bound of a similar shape while also improving the constants.