Researcher profile

Ben Krause

Ben Krause contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2026arXiv

The Wiener Wintner and Return Times Theorem Along the Primes

We prove the following Return Times Theorem along the sequence of prime times, the first extension of the Return Times Theorem to arithmetic sequences: For every probability space, $(Ω,ν)$, equipped with a measure-preserving transformation, $T \colon Ω\to Ω$, and every $f \in L^\infty(Ω)$, there exists a set of full probability, $Ω_f \subset Ω$ with $ν(Ω_f) =1$, so that for all $ω\in Ω_f$, for any other probability space $(X,μ)$, equipped with a measure-preserving transformation $S : X \to X$, for any $g \in L^{\infty}(X)$, \begin{align} \frac{1}{N} \sum_{n \leq N} f(T^{p_n} ω) g(S^{p_n} \cdot) \end{align} converges $μ$-almost surely; above, $\{ 2=p_1 < p_2 < \dots \}$ are an enumeration of the primes. The Wiener-Wintner theorem along the primes is an immediate corollary. Our proof lives at the interface of classical Fourier analysis, combinatorial number theory, higher order Fourier analysis, and pointwise ergodic theory, with $U^3$ theory playing an important role; our $U^3$-estimates for \emph{Heath-Brown} models of the von Mangoldt function may be of independent interest.

preprint2022arXiv

Pointwise ergodic theorems for non-conventional bilinear polynomial averages

We establish convergence in norm and pointwise almost everywhere for the non-conventional (in the sense of Furstenberg) bilinear polynomial ergodic averages \[ A_N(f,g)(x) := \frac{1}{N} \sum_{n =1}^N f(T^nx) g(T^{P(n)}x)\] as $N \to \infty$, where $T \colon X \to X$ is a measure-preserving transformation of a $σ$-finite measure space $(X,μ)$, $P(\mathrm{n}) \in \mathbb Z[\mathrm{n}]$ is a polynomial of degree $d \geq 2$, and $f \in L^{p_1}(X), \ g \in L^{p_2}(X)$ for some $p_1,p_2 > 1$ with $\frac{1}{p_1} + \frac{1}{p_2} \leq 1$. We also establish an $r$-variational inequality for these averages (at lacunary scales) in the optimal range $r > 2$. We are also able to &#34;break duality&#34; by handling some ranges of exponents $p_1,p_2$ with $\frac{1}{p_1}+\frac{1}{p_2} > 1$, at the cost of increasing $r$ slightly. This gives an affirmative answer to Problem 11 from Frantzikinakis&#39; open problems survey for the Furstenberg--Weiss averages (with $P(\mathrm{n})=\mathrm{n}^2$), which is a bilinear variant of Question 9 considered by Bergelson in his survey on Ergodic Ramsey Theory from 1996. This also gives a contribution to the Furstenberg-Bergelson-Leibman conjecture. Our methods combine techniques from harmonic analysis with the recent inverse theorems of Peluse and Prendiville in additive combinatorics. At large scales, the harmonic analysis of the adelic integers $\mathbb A_{\mathbb Z}$ also plays a role.

preprint2020arXiv

Averages Along the Primes: Improving and Sparse Bounds

Consider averages along the prime integers $ \mathbb P $ given by \begin{equation*} \mathcal{A}_N f (x) = N ^{-1} \sum_{ p \in \mathbb P \;:\; p\leq N} (\log p) f (x-p). \end{equation*} These averages satisfy a uniform scale-free $ \ell ^{p}$-improving estimate. For all $ 1< p < 2$, there is a constant $ C_p$ so that for all integer $ N$ and functions $ f$ supported on $ [0,N]$, there holds \begin{equation*} N ^{-1/p&#39; }\lVert \mathcal{A}_N f\rVert_{\ell^{p&#39;}} \leq C_p N ^{- 1/p} \lVert f\rVert_{\ell^p}. \end{equation*} The maximal function $ \mathcal{A}^{\ast} f =\sup_{N} \lvert \mathcal{A}_N f \rvert$ satisfies $ (p,p)$ sparse bounds for all $ 1< p < 2$. The latter are the natural variants of the scale-free bounds. As a corollary, $ \mathcal{A}^{\ast} $ is bounded on $ \ell ^{p} (w)$, for all weights $ w$ in the Muckenhoupt $A_p$ class. No prior weighted inequalities for $ \mathcal{A}^{\ast} $ were known.

preprint2020arXiv

On Maximal Functions With Curvature

We exhibit a class of &#34;relatively curved&#34; $\vecγ(t) := (γ_1(t),\dots,γ_n(t))$, so that the pertaining multi-linear maximal function satisfies the sharp range of Hölder exponents, \[ \left\| \sup_{r > 0} \ \frac{1}{r} \int_{0}^r \prod_{i=1}^n |f_i(x-γ_i(t))| \ dt \right\|_{L^p(\mathbb{R})} \leq C \cdot \prod_{i=1}^n \| f_j \|_{L^{p_j}(\mathbb{R})} \] whenever $\frac{1}{p} = \sum_{j=1}^n \frac{1}{p_j}$, where $p_j > 1$ and $p \geq p_{\vecγ}$, where $1 \geq p_{\vecγ} > 1/n$ for certain curves. For instance, $p_{\vecγ} = 1/n^+$ for the case of fractional monomials, \[ \vecγ(t) = (t^{α_1},\dots,t^{α_n}), \; \; \; α_1 < \dots < α_n.\] Two sample applications of our method are as follows: For any measurable $u_1,\dots,u_n : \mathbb{R}^{n} \to \mathbb{R}$, with $u_i$ independent of the $i$th coordinate vector, and any relatively curved $\vecγ$, \[ \lim_{r \to 0} \ \frac{1}{r} \int_0^r F\big(x_1 - u_1(x) \cdot γ_1(t),\dots,x_n - u_n(x) \cdot γ_n(t) \big) \ dt = F(x_1,\dots,x_n), \; \; \; a.e. \] for every $F \in L^p(\mathbb{R}^n), \ p > 1$. Every appropriately normalized set $A \subset [0,1]$ of sufficiently large Hausdorff dimension contains the progression, \[ \{ x, x-γ_1(t),\dots,x - γ_n(t) \} \subset A, \] for some $t \geq c_{\vecγ} > 0$ strictly bounded away from zero, depending on $\vecγ$.

preprint2020arXiv

Taylorized Training: Towards Better Approximation of Neural Network Training at Finite Width

We propose \emph{Taylorized training} as an initiative towards better understanding neural network training at finite width. Taylorized training involves training the $k$-th order Taylor expansion of the neural network at initialization, and is a principled extension of linearized training---a recently proposed theory for understanding the success of deep learning. We experiment with Taylorized training on modern neural network architectures, and show that Taylorized training (1) agrees with full neural network training increasingly better as we increase $k$, and (2) can significantly close the performance gap between linearized and full training. Compared with linearized training, higher-order training works in more realistic settings such as standard parameterization and large (initial) learning rate. We complement our experiments with theoretical results showing that the approximation error of $k$-th order Taylorized models decay exponentially over $k$ in wide neural networks.