Source author record

Alan Nichol

Alan Nichol appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

4works
2topics
4close collaborators

Actions

Connect this record

Log in to claim

Research graph

See the researcher in context

Open full explorer

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

4 published item(s)

preprint2020arXiv

Dialogue Transformers

We introduce a dialogue policy based on a transformer architecture, where the self-attention mechanism operates over the sequence of dialogue turns. Recent work has used hierarchical recurrent neural networks to encode multiple utterances in a dialogue context, but we argue that a pure self-attention mechanism is more suitable. By default, an RNN assumes that every item in a sequence is relevant for producing an encoding of the full sequence, but a single conversation can consist of multiple overlapping discourse segments as speakers interleave multiple topics. A transformer picks which turns to include in its encoding of the current dialogue state, and is naturally suited to selectively ignoring or attending to dialogue history. We compare the performance of the Transformer Embedding Dialogue (TED) policy to an LSTM and to the REDP, which was specifically designed to overcome this limitation of RNNs.

preprint2020arXiv

DIET: Lightweight Language Understanding for Dialogue Systems

Large-scale pre-trained language models have shown impressive results on language understanding benchmarks like GLUE and SuperGLUE, improving considerably over other pre-training methods like distributed representations (GloVe) and purely supervised approaches. We introduce the Dual Intent and Entity Transformer (DIET) architecture, and study the effectiveness of different pre-trained representations on intent and entity prediction, two common dialogue language understanding tasks. DIET advances the state of the art on a complex multi-domain NLU dataset and achieves similarly high performance on other simpler datasets. Surprisingly, we show that there is no clear benefit to using large pre-trained models for this task, and in fact DIET improves upon the current state of the art even in a purely supervised setup without any pre-trained embeddings. Our best performing model outperforms fine-tuning BERT and is about six times faster to train.

preprint2020arXiv

Where is the context? -- A critique of recent dialogue datasets

Recent dialogue datasets like MultiWOZ 2.1 and Taskmaster-1 constitute some of the most challenging tasks for present-day dialogue models and, therefore, are widely used for system evaluation. We identify several issues with the above-mentioned datasets, such as history independence, strong knowledge base dependence, and ambiguous system responses. Finally, we outline key desiderata for future datasets that we believe would be more suitable for the construction of conversational artificial intelligence.

preprint2016arXiv

Relating melting trends and elasticity in simple metals: an empirical potential approach

We demonstrate that the melting points and other thermodynamic quantities of the alkali metals can be calculated based on static crystalline properties. To do this we derive analytic interatomic potentials for the alkali metals fitted precisely to cohesive and vacancy energies, elastic moduli, lattice parameter and crystal stability. These potentials are then used to calculate melting points by simulating the equilibration of solid and liquid samples in thermal contact at ambient pressure. With the exception of lithium, remarkably good agreement is found with experimental values. The instability of the bcc structure in Li and Na at low temperatures is also reproduced, and, unusually, is not due to a soft T1N phonon mode. No forces or finite temperature properties are included in the fit, so this demonstrates a surprisingly high level of intrinsic transferrability in the simple potentials. Currently, there are few potentials available for the alkali metals, so in, addition to demonstrating trends in behaviour, we expect that the potentials will be of broad general use.