Source author record

Peter Cholak

Peter Cholak appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.LO Computation and Language Machine Learning

Catalog footprint

What is connected

9works

3topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Overcoming a Theoretical Limitation of Self-Attention

Although transformers are remarkably effective for many tasks, there are some surprisingly easy-looking regular languages that they struggle with. Hahn shows that for languages where acceptance depends on a single input symbol, a transformer's classification decisions become less and less confident (that is, with cross-entropy approaching 1 bit per string) as input strings get longer and longer. We examine this limitation using two languages: PARITY, the language of bit strings with an odd number of 1s, and FIRST, the language of bit strings starting with a 1. We demonstrate three ways of overcoming the limitation suggested by Hahn's lemma. First, we settle an open question by constructing a transformer that recognizes PARITY with perfect accuracy, and similarly for FIRST. Second, we use layer normalization to bring the cross-entropy of both models arbitrarily close to zero. Third, when transformers need to focus on a single position, as for FIRST, we find that they can fail to generalize to longer strings; we offer a simple remedy to this problem that also improves length generalization in machine translation.

preprint2020arXiv

Realizing Computably Enumerable Degrees in Separating Classes

We investigate what collections of c.e.\ Turing degrees can be realised as the collection of elements of a separating $Π^0_1$ class of c.e.\ degree. We show that for every c.e.\ degree $\mathbf{c}$, the collection $\{\mathbf{c}, \mathbf{0}'\}$ can be thus realized. We also rule out several attempts at constructing separating classes realizing a unique c.e.\ degree. For example, we show that there is no \emph{super-maximal} pair: disjoint c.e.\ sets $A$ and $B$ whose separating class is infinite, but every separator of c.e.\ degree is a finite variant of either $A$ or $\overline{B}$.

preprint2016arXiv

Any FIP real computes a 1-generic

We construct a computable sequence of computable reals $\langle X_i\rangle$ such that any real that can compute a subsequence that is maximal with respect to the finite intersection property can also compute a Cohen 1-generic. This is extended to establish the same result with 2IP in place of FIP.

preprint2016arXiv

Density-1-bounding and quasiminimality in the generic degrees

We consider the question "Is every nonzero generic degree a density-1-bounding generic degree?" By previous results \cite{I2} either resolution of this question would answer an open question concerning the structure of the generic degrees: A positive result would prove that there are no minimal generic degrees, and a negative result would prove that there exist minimal pairs in the generic degrees. We consider several techniques for showing that the answer might be positive, and use those techniques to prove that a wide class of assumptions is sufficient to prove density-1-bounding. We also consider a historic difficulty in constructing a potential counterexample: By previous results \cite{I1} any generic degree that is not density-1-bounding must be quasiminimal, so in particular, any construction of a non-density-1-bounding generic degree must use a method that is able to construct a quasiminimal generic degree. However, all previously known examples of quasiminimal sets are also density-1, and so trivially density-1-bounding. We provide several examples of non-density-1 sets that are quasiminimal. Using cofinite and mod-finite reducibility, we extend our results to the uniform coarse degrees, and to the nonuniform generic degrees. We define all of the above terms, and we provide independent motivation for the study of each of them. Combined with a concurrently written paper of Hirschfeldt, Jockusch, Kuyper, and Schupp \cite{HJKS}, this paper provides a characterization of the level of randomness required to ensure quasiminimality in the uniform and nonuniform coarse and generic degrees.

preprint2016arXiv

On Splits of Computably enumerable sets

Our focus will be on the computably enumerable (c.e.) sets and trivial, non-trivial, Friedberg, and non-Friedberg splits of the c.e. sets. Every non-computable set has a non-trivial Friedberg split. Moreover, this theorem is uniform. V. Yu. Shavrukov recently answered the question which c.e. sets have a non-trivial non-Friedberg splitting and we provide a different proof of his result. We end by showing there is no uniform splitting of all c.e. sets such that all non-computable sets are non-trivially split and, in addition, all sets with a non-trivial non-Friedberg split are split accordingly.

preprint2015arXiv

Computably Enumerable Sets that are Automorphic to Low Sets

We work with the structure consisting of all computably enumerable (c.e.) sets ordered by set inclusion. The question we will partially address is which c.e.\ sets are autormorphic to low (or low$_2$ sets. Using work of Miller, we can see that every set with semilow complement is $Δ^0_3$ automorphic to a low set. While it remains open whether every set with semilow complement is effectively automorphic to a low set, we show that there are sets without semilow complement that are effectively automorphic to low sets. We also consider other lowness notions such as having a semilow$_{1.5}$ complement, having the the outer splitting property, and having a semilow$_2$ complement. We show that in every non low \ce degree, there are sets with semilow$_{1.5}$ complements without semilow complements as well as sets with semilow$_2$ complements and the outer splitting property that do not have semilow$_{1.5}$ complements. We also address the question of which sets are automorphic to low$_2$ sets.

preprint2014arXiv

$\mathcal{D}$-maximal sets

Soare proved that the maximal sets form an orbit in $\mathcal{E}$. We consider here $\mathcal{D}$-maximal sets, generalizations of maximal sets introduced by Herrmann and Kummer. Some orbits of $\mathcal{D}$-maximal sets are well understood, e.g., hemimaximal sets, but many are not. The goal of this paper is to define new invariants on computably enumerable sets and to use them to give a complete nontrivial classification of the $\mathcal{D}$-maximal sets. Although these invariants help us to better understand the $\mathcal{D}$-maximal sets, we use them to show that several classes of $\mathcal{D}$-maximal sets break into infinitely many orbits.

preprint2013arXiv

Some recent research directions in the computably enumerable sets

As suggested by the title, this paper is a survey of recent results and questions on the collection of computably enumerable sets under inclusion. This is not a broad survey but one focused on the author's and a few others' current research.

preprint2011arXiv

Reverse mathematics and infinite traceable graphs

This paper falls within the general program of investigating the proof theoretic strength (in terms of reverse mathematics) of combinatorial principals which follow from versions of Ramsey's theorem. We examine two statements in graph theory and one statement in lattice theory proved by Galvin, Rival and Sands \cite{GRS:82} using Ramsey's theorem for 4-tuples. Our main results are that the statements concerning graph theory are equivalent to Ramsey's theorem for 4-tuples over $\RCA$ while the statement concerning lattices is provable in $\RCA$. Revised 12/2010. To appear in Archive for Mathematical Logic

Peter Cholak

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

Overcoming a Theoretical Limitation of Self-Attention

Realizing Computably Enumerable Degrees in Separating Classes

Any FIP real computes a 1-generic

Density-1-bounding and quasiminimality in the generic degrees

On Splits of Computably enumerable sets

Computably Enumerable Sets that are Automorphic to Low Sets

$\mathcal{D}$-maximal sets

Some recent research directions in the computably enumerable sets

Reverse mathematics and infinite traceable graphs