Researcher profile

Simon Baker

Simon Baker contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
14 works
0 followers
8 topics
4 close collaborators

Published work

14 published items

preprint, 2022, arXiv

A note on dyadic approximation in Cantor's set

We consider the convergence theory for dyadic approximation in the middle-third Cantor set, $K$, for approximation functions of the form $ψ_τ(n) = n^{-τ}$ ($τ\ge 0$). In particular, we show that for values of $τ$ beyond a certain threshold we have that almost no point in $K$ is dyadically $ψ_τ$-well approximable with respect to the natural probability measure on $K$. This refines a previous result in this direction obtained by the first, third, and fourth named authors (arXiv, 2020).

preprint, 2022, arXiv

Approximating elements of the middle third Cantor set with dyadic rationals

Let $C$ be the middle third Cantor set and $μ$ be the $\frac{\log 2}{\log 3}$-dimensional Hausdorff measure restricted to $C$. In this paper we study approximations of elements of $C$ by dyadic rationals. Our main result implies that for $μ$ almost every $x\in C$ we have $$\#\left\{1\leq n\leq N:\left|x-\frac{p}{2^n}\right| \leq \frac{1}{n^{0.01}\cdot 2^{n}}\textrm{ for some }p\in\mathbb{N}\right\}\sim 2\sum_{n=1}^{N}n^{-0.01}.$$ This improves upon a recent result of Allen, Chow, and Yu which gives a sub-logarithmic improvement over the trivial approximation rate.
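The counting quantity in this abstract can be explored numerically. The sketch below is illustrative only and is not taken from the paper: the function name `dyadic_hits` and the general exponent `tau` (standing in for the 0.01 above) are assumptions, and `p` is allowed to be any integer. It uses the fact that $|x - p/2^n| \le n^{-\tau}/2^n$ for some integer $p$ exactly when $2^n x$ lies within $n^{-\tau}$ of an integer.

```python
from fractions import Fraction


def dyadic_hits(x, N, tau):
    """Count n in 1..N with |x - p/2^n| <= n^(-tau) / 2^n for some integer p.

    x is a Fraction so that 2^n * x is computed exactly; only the threshold
    n^(-tau) is evaluated in floating point.
    """
    hits = 0
    for n in range(1, N + 1):
        scaled = x * 2**n                                   # exact value of 2^n * x
        frac = scaled - (scaled.numerator // scaled.denominator)  # fractional part
        dist = min(frac, 1 - frac)                          # distance to nearest integer
        if float(dist) <= n ** (-tau):
            hits += 1
    return hits


if __name__ == "__main__":
    x = Fraction(1, 3)  # 1/3 belongs to the middle third Cantor set
    for tau in (0.01, 1.0, 2.0):
        print(tau, dyadic_hits(x, 50, tau))
```

For x = 1/3 the number $2^n x$ is always at distance 1/3 from the nearest integer, so with `tau = 2` only n = 1 qualifies. The theorem concerns $μ$-almost every point, so a single rational point like this is a sanity check on the code, not an illustration of typical behaviour.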

preprint, 2021, arXiv

Equidistribution results for self-similar measures

A well known theorem due to Koksma states that for Lebesgue almost every $x>1$ the sequence $(x^n)_{n=1}^{\infty}$ is uniformly distributed modulo one. In this paper we give sufficient conditions for an analogue of this theorem to hold for self-similar measures. Our approach applies more generally to sequences of the form $(f_{n}(x))_{n=1}^{\infty}$ where $(f_n)_{n=1}^{\infty}$ is a sequence of sufficiently smooth real valued functions satisfying a nonlinearity assumption. As a corollary of our main result, we show that if $C$ is equal to the middle third Cantor set and $t\geq 1$, then with respect to the Cantor-Lebesgue measure on $C+t$ the sequence $(x^n)_{n=1}^{\infty}$ is uniformly distributed for almost every $x$.

preprint, 2021, arXiv

Non-Autoregressive Text Generation with Pre-trained Language Models

Non-autoregressive generation (NAG) has recently attracted great attention due to its fast inference speed. However, the generation quality of existing NAG models still lags behind their autoregressive counterparts. In this work, we show that BERT can be employed as the backbone of a NAG model to greatly improve performance. Additionally, we devise mechanisms to alleviate the two common problems of vanilla NAG models: the inflexibility of prefixed output length and the conditional independence of individual token predictions. Lastly, to further increase the speed advantage of the proposed model, we propose a new decoding strategy, ratio-first, for applications where the output lengths can be approximately estimated beforehand. For a comprehensive evaluation, we test the proposed model on three text generation tasks, including text summarization, sentence compression and machine translation. Experimental results show that our model significantly outperforms existing non-autoregressive baselines and achieves competitive performance with many strong autoregressive models. In addition, we also conduct extensive analysis experiments to reveal the effect of each proposed component.

preprint, 2020, arXiv

An analogue of Khintchine's theorem for self-conformal sets

Khintchine's theorem is a classical result from metric number theory which relates the Lebesgue measure of certain limsup sets with the convergence/divergence of naturally occurring volume sums. In this paper we ask whether an analogous result holds for iterated function systems (IFSs). We say that an IFS is approximation regular if we observe Khintchine type behaviour, i.e., if the size of certain limsup sets defined using the IFS is determined by the convergence/divergence of naturally occurring sums. We prove that an IFS is approximation regular if it consists of conformal mappings and satisfies the open set condition. The divergence condition we introduce incorporates the inhomogeneity present within the IFS. We demonstrate via an example that such an approach is essential. We also formulate an analogue of the Duffin-Schaeffer conjecture and show that it holds for a set of full Hausdorff dimension. Combining our results with the mass transference principle of Beresnevich and Velani, we prove a general result that implies the existence of exceptional points within the attractor of our IFS. These points are exceptional in the sense that they are "very well approximated". As a corollary of this result, we obtain a general solution to a problem of Mahler, and prove that there are badly approximable numbers that are very well approximated by quadratic irrationals. The ideas put forward in this paper are introduced in the general setting of IFSs that may contain overlaps. We believe that by viewing IFSs from the perspective of metric number theory, one can gain a greater insight into the extent to which they overlap. The results of this paper should be interpreted as a first step in this investigation.

preprint, 2020, arXiv

Equidistribution results for sequences of polynomials

Let $(f_n)_{n=1}^{\infty}$ be a sequence of polynomials and $α>1$. In this paper we study the distribution of the sequence $(f_n(α))_{n=1}^{\infty}$ modulo one. We give sufficient conditions for a sequence $(f_n)_{n=1}^{\infty}$ to ensure that for Lebesgue almost every $α>1$ the sequence $(f_n(α))_{n=1}^{\infty}$ has Poissonian pair correlations. In particular, this result implies that for Lebesgue almost every $α>1$, for any $k\geq 2$ the sequence $(α^{n^k})_{n=1}^{\infty}$ has Poissonian pair correlations.

preprint, 2020, arXiv

Iterated function systems with super-exponentially close cylinders II

Until recently, it was an important open problem in Fractal Geometry to determine whether there exists an iterated function system acting on $\mathbb{R}$ with no exact overlaps for which cylinders are super-exponentially close at all small scales. Iterated function systems satisfying these properties were shown to exist by the author and by Bárány and Käenmäki. In this paper we prove a general theorem on the existence of such iterated function systems within a parameterised family. This theorem shows that if a parameterised family contains two independent subfamilies, and the set of parameters that cause exact overlaps satisfies some weak topological assumptions, then the original family will contain an iterated function system satisfying the desired properties. We include several explicit examples of parameterised families to which this theorem can be applied.

preprint, 2020, arXiv

Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity

We introduce Multi-SimLex, a large-scale lexical resource and evaluation benchmark covering datasets for 12 typologically diverse languages, including major languages (e.g., Mandarin Chinese, Spanish, Russian) as well as less-resourced ones (e.g., Welsh, Kiswahili). Each language dataset is annotated for the lexical relation of semantic similarity and contains 1,888 semantically aligned concept pairs, providing a representative coverage of word classes (nouns, verbs, adjectives, adverbs), frequency ranks, similarity intervals, lexical fields, and concreteness levels. Additionally, owing to the alignment of concepts across languages, we provide a suite of 66 cross-lingual semantic similarity datasets. Due to its extensive size and language coverage, Multi-SimLex provides entirely novel opportunities for experimental evaluation and analysis. On its monolingual and cross-lingual benchmarks, we evaluate and analyze a wide array of recent state-of-the-art monolingual and cross-lingual representation models, including static and contextualized word embeddings (such as fastText, M-BERT and XLM), externally informed lexical representations, as well as fully unsupervised and (weakly) supervised cross-lingual word embeddings. We also present a step-by-step dataset creation protocol for creating consistent, Multi-SimLex-style resources for additional languages. We make these contributions -- the public release of Multi-SimLex datasets, their creation protocol, strong baseline results, and in-depth analyses which can be helpful in guiding future developments in multilingual lexical semantics and representation learning -- available via a website which will encourage community effort in further expansion of Multi-SimLex to many more languages. Such a large-scale semantic resource could inspire significant further advances in NLP across languages.

preprint, 2020, arXiv

On the pair correlations of powers of real numbers

A classical theorem of Koksma states that for Lebesgue almost every $x>1$ the sequence $(x^n)_{n=1}^{\infty}$ is uniformly distributed modulo one. In the present paper we extend Koksma's theorem to the pair correlation setting. More precisely, we show that for Lebesgue almost every $x>1$ the pair correlations of the fractional parts of $(x^n)_{n=1}^{\infty}$ are asymptotically Poissonian. The proof is based on a martingale approximation method.
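As a rough illustration of the statistic discussed in this abstract, the sketch below computes an empirical pair correlation count for the fractional parts of $x^n$. It assumes the usual normalisation, under which a sequence has Poissonian pair correlations when $R_N(s) \to 2s$ for every $s > 0$; the function names are hypothetical and nothing here reproduces the paper's martingale argument. Exact rational arithmetic is used because floating point loses the fractional part of $x^n$ very quickly.

```python
from fractions import Fraction


def fractional_parts_of_powers(x, N):
    """Return the fractional parts of x^1, ..., x^N for a rational x, exactly."""
    parts = []
    power = Fraction(1)
    for _ in range(N):
        power *= x
        parts.append(power - (power.numerator // power.denominator))
    return parts


def pair_correlation(points, s):
    """R_N(s) = (1/N) * #{(i, j), i != j : ||x_i - x_j|| <= s / N},
    where ||.|| is the distance on the circle R/Z."""
    N = len(points)
    count = 0
    for i in range(N):
        for j in range(N):
            if i != j:
                d = abs(points[i] - points[j])
                d = min(d, 1 - d)      # wrap around the circle
                if d * N <= s:
                    count += 1
    return count / N


if __name__ == "__main__":
    pts = fractional_parts_of_powers(Fraction(3, 2), 200)
    for s in (0.5, 1.0, 2.0):
        print(s, pair_correlation(pts, s), "Poissonian target:", 2 * s)
```

The base 3/2 is chosen only so that the arithmetic is exact. The theorem is an almost-everywhere statement and says nothing about any particular $x$, so the printed values are an experiment rather than a check of the result.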

preprint, 2020, arXiv

Prototype-to-Style: Dialogue Generation with Style-Aware Editing on Retrieval Memory

The ability of a dialog system to express prespecified language style during conversations has a direct, positive impact on its usability and on user satisfaction. We introduce a new prototype-to-style (PS) framework to tackle the challenge of stylistic dialogue generation. The framework uses an Information Retrieval (IR) system and extracts a response prototype from the retrieved response. A stylistic response generator then takes the prototype and the desired language style as model input to obtain a high-quality and stylistic response. To effectively train the proposed model, we propose a new style-aware learning objective as well as a de-noising learning strategy. Results on three benchmark datasets from two languages demonstrate that the proposed approach significantly outperforms existing baselines in both in-domain and cross-domain evaluations.

preprint, 2020, arXiv

Quantitative recurrence properties for self-conformal sets

In this paper we study the quantitative recurrence properties of self-conformal sets $X$ equipped with the map $T:X\to X$ induced by the left shift. In particular, given a function $φ:\mathbb{N}\to(0,\infty),$ we study the metric properties of the set $$R(T,φ)=\left\{x\in X:|T^nx-x|<φ(n)\textrm{ for infinitely many }n\in \mathbb{N}\right\}.$$ Our main result shows that for the natural measure supported on $X$, $R(T,φ)$ has zero measure if a natural volume sum converges, and under the open set condition $R(T,φ)$ has full measure if this volume sum diverges.

preprint, 2020, arXiv

Stylistic Dialogue Generation via Information-Guided Reinforcement Learning Strategy

Stylistic response generation is crucial for building an engaging dialogue system for industrial use. While it has attracted much research interest, existing methods often generate stylistic responses at the cost of the content quality (relevance and fluency). To enable a better balance between the content quality and the style, we introduce a new training strategy, known as Information-Guided Reinforcement Learning (IG-RL). In IG-RL, a training model is encouraged to explore stylistic expressions while being constrained to maintain its content quality. This is achieved by adopting a reinforcement learning strategy with statistical style information guidance for quality-preserving explorations. Experiments on two datasets show that the proposed approach outperforms several strong baselines in terms of the overall response performance.

preprint, 2017, arXiv

Digit frequencies and self-affine sets with non-empty interior

In this paper we study digit frequencies in the setting of expansions in non-integer bases, and self-affine sets with non-empty interior. Within expansions in non-integer bases we show that if $β\in(1,1.787\ldots)$ then every $x\in(0,\frac{1}{β-1})$ has a simply normal $β$-expansion. We also prove that if $β\in(1,\frac{1+\sqrt{5}}{2})$ then every $x\in(0,\frac{1}{β-1})$ has a $β$-expansion for which the digit frequency does not exist, and a $β$-expansion with limiting frequency of zeros $p$, where $p$ is any real number sufficiently close to $1/2$. For a class of planar self-affine sets we show that if the horizontal contraction lies in a certain parameter space and the vertical contractions are sufficiently close to $1,$ then every nontrivial vertical fibre contains an interval. Our approach lends itself to explicit calculation and gives rise to new examples of self-affine sets with non-empty interior. One particular strength of our approach is that it allows for different rates of contraction in the vertical direction.
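To make the notion of a $β$-expansion and its digit frequency concrete, here is a minimal sketch of the standard greedy algorithm with digits in {0, 1}, which writes $x = \sum_{i \ge 1} d_i β^{-i}$ for $β \in (1, 2]$ and $x \in [0, \frac{1}{β-1}]$. The greedy expansion is only one of the many expansions a point may have and is not the construction used in the paper; the function names are hypothetical, and floating point limits how many digits can be trusted.

```python
def greedy_beta_expansion(x, beta, n_digits):
    """First n_digits of the greedy expansion of x in base beta, digits in {0, 1}.

    Assumes 1 < beta <= 2 and 0 <= x <= 1 / (beta - 1).
    """
    digits = []
    for _ in range(n_digits):
        x *= beta
        if x >= 1.0:          # take the digit 1 whenever it is available
            digits.append(1)
            x -= 1.0
        else:
            digits.append(0)
    return digits


def frequency_of_zeros(digits):
    """Proportion of zeros among the digits computed so far."""
    return digits.count(0) / len(digits)


def reconstruct(digits, beta):
    """Partial sum of d_i * beta^(-i) over the given digits."""
    return sum(d * beta ** -(i + 1) for i, d in enumerate(digits))


if __name__ == "__main__":
    beta, x = 1.5, 0.7
    digits = greedy_beta_expansion(x, beta, 60)
    print(digits[:20])
    print("frequency of zeros:", frequency_of_zeros(digits))
    print("reconstruction error:", abs(reconstruct(digits, beta) - x))
```

The empirical frequency printed here describes a finite prefix of one particular expansion; the results in the abstract are about the existence of expansions with prescribed limiting frequencies.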