Source author record

Thomas Cass

Thomas Cass appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.PR math.CA Machine Learning math.NA Numerical Analysis math.AP math.OC q-fin.MF

Catalog footprint

What is connected

14works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Gradient Regularized Newton Boosting Trees with Global Convergence

Gradient Boosting Decision Trees (GBDTs) dominate tabular machine learning, with modern implementations like XGBoost, LightGBM, and CatBoost being based on Newton boosting: a second-order descent step in the space of decision trees. Despite its empirical success, the global convergence of Newton boosting is poorly understood compared to first-order boosting. In this paper, we introduce Restricted Newton Descent, which studies convex optimization with Newton's method on Hilbert spaces with inexact iterates, based on the concepts of cosine angle and weak gradient edge. Within this framework, we recover Newton boosting with GBDTs and classical finite-dimensional theory as special cases. We first prove that vanilla Newton boosting achieves a linear rate of convergence for smooth, strongly convex losses that satisfy a Hessian-dominance condition. To handle general convex losses with Lipschitz Hessians, we extend a recent gradient regularized Newton scheme to the restricted weak learner setting. This scheme minimally modifies the classical algorithm by introducing an adaptive $\ell_2$-regularization term proportional to the square root of the gradient norm at each iteration. We establish a $\mathcal{O}(\frac{1}{k^2})$ rate for this scheme, thereby obtaining a globally convergent second-order GBDT algorithm with a rate matching that of first-order boosting with Nesterov momentum. In numerical experiments, we show that our scheme converges while vanilla Newton boosting may diverge.

preprint2025arXiv

Generative Modelling of Lévy Area for High Order SDE Simulation

It is well understood that, when numerically simulating SDEs with general noise, achieving a strong convergence rate better than $O(\sqrt{h})$ (where h is the step size) requires the use of certain iterated integrals of Brownian motion, commonly referred to as its "Lévy areas". However, these stochastic integrals are difficult to simulate due to their non-Gaussian nature and for a $d$-dimensional Brownian motion with $d > 2$, no fast almost-exact sampling algorithm is known. In this paper, we propose LévyGAN, a deep-learning-based model for generating approximate samples of Lévy area conditional on a Brownian increment. Due to our "Bridge-flipping" operation, the output samples match all joint and conditional odd moments exactly. Our generator employs a tailored GNN-inspired architecture, which enforces the correct dependency structure between the output distribution and the conditioning variable. Furthermore, we incorporate a mathematically principled characteristic-function based discriminator. Lastly, we introduce a novel training mechanism termed "Chen-training", which circumvents the need for expensive-to-generate training data-sets. This new training procedure is underpinned by our two main theoretical results. For 4-dimensional Brownian motion, we show that LévyGAN exhibits state-of-the-art performance across several metrics which measure both the joint and marginal distributions. We conclude with a numerical experiment on the log-Heston model, a popular SDE in mathematical finance, demonstrating that high-quality synthetic Lévy area can lead to high order weak convergence and variance reduction when using multilevel Monte Carlo (MLMC).

preprint2025arXiv

Numerical Schemes for Signature Kernels

Signature kernels have emerged as a powerful tool within kernel methods for sequential data. In the paper "The Signature Kernel is the solution of a Goursat PDE", the authors identify a kernel trick that demonstrates that, for continuously differentiable paths, the signature kernel satisfies a Goursat problem for a hyperbolic partial differential equation (PDE) in two independent time variables. While finite difference methods have been explored for this PDE, they face limitations in accuracy and stability when handling highly oscillatory inputs. In this work, we introduce two advanced numerical schemes that leverage polynomial representations of boundary conditions through either approximation or interpolation techniques, and rigorously establish the theoretical convergence of the polynomial approximation scheme. Experimental evaluations reveal that our approaches yield improvements of several orders of magnitude in mean absolute percentage error (MAPE) compared to traditional finite difference schemes, without increasing computational complexity. Furthermore, like finite difference methods, our algorithms can be GPU-parallelized to reduce computational complexity from quadratic to linear in the length of the input sequences, thereby improving scalability for high-frequency data. We have implemented these algorithms in a dedicated Python library, which is publicly available at: https://github.com/FrancescoPiatti/polysigkernel.

preprint2022arXiv

Topologies on unparameterised path space

The signature of a path, introduced by K.T. Chen [5] in $1954$, has been extensively studied in recent years. The $2010$ paper [12] of Hambly and Lyons showed that the signature is injective on the space of continuous finite-variation paths up to a general notion of reparameterisation called tree-like equivalence. The signature has been widely used in applications, underpinned by the result [15] that guarantees uniform approximation of a continuous function on a compact set by a linear functional of the signature. We study in detail, and for the first time, the properties of three candidate topologies on the set of unparameterised paths (the tree-like equivalence classes). These are obtained through properties of the signature and are: (1) the product topology, obtained by equipping the tensor algebra with the product topology and requiring $S$ to be an embedding, (2) the quotient topology derived from the 1-variation topology on the underlying path space, and (3) the metric topology associated to $d( [ γ] ,[ σ] ) := \vert\vert γ^*-σ^*\vert\vert_{1}$ using suitable representatives $γ^*$ and $σ^*$ of the equivalence classes. The topologies are ordered by strict inclusion, (1) being the weakest and (3) the strongest. Each is separable and Hausdorff, (1) being both metrisable and $σ$-compact, but not a Baire space and so neither Polish nor locally compact. The quotient topology (2) is not metrisable and the metric $d$ is not complete. An important function on (unparameterised) path space is the (fixed-time) solution map of a controlled differential equation. For a broad class of such equations, we prove measurability of this map for each topology. Under stronger regularity assumptions, we show continuity on explicit compact subsets of the product topology (1). We relate these results to the expected signature model of [15].

preprint2021arXiv

A combinatorial approach to geometric rough paths and their controlled paths

We develop the structure theory for transformations of weakly geometric rough paths of bounded $1 < p$-variation and their controlled paths. Our approach differs from existing approaches as it does not rely on smooth approximations. We derive an explicit combinatorial expression for the rough path lift of a controlled path, and use it to obtain fundamental identities such as the associativity of the rough integral, the adjunction between pushforwards and pullbacks, and a change of variables formula for rough differential equations (RDEs). As applications we define rough paths, rough integration and RDEs on manifolds, extending the results of [CDL15] to the case of arbitrary $p$.

preprint2020arXiv

Non-Geometric Rough Paths on Manifolds

We provide a theory of manifold-valued rough paths of bounded 3 > p-variation, which we do not assume to be geometric. Rough paths are defined in charts, and coordinate-free (but connection-dependent) definitions of the rough integral of cotangent bundle-valued controlled paths, and of RDEs driven by a rough path valued in another manifold, are given. When the path is the realisation of semimartingale we recover the theory of Itô integration and SDEs on manifolds [É89]. We proceed to present the extrinsic counterparts to our local formulae, and show how these extend the work in [CDL15] to the setting of non-geometric rough paths and controlled integrands more general than 1-forms. In the last section we turn to parallel transport and Cartan development: the lack of geometricity leads us to make the choice of a connection on the tangent bundle of the manifold TM, which figures in an Itô correction term in the parallelism RDE; such connection, which is not needed in the geometric/Stratonovich setting, is required to satisfy properties which guarantee well-definedness, linearity, and optionally isometricity of parallel transport. We conclude by providing numerous examples, some accompanied by numerical simulations, which explore the additional subtleties introduced by our change in perspective.

preprint2020arXiv

Option pricing models without probability: a rough paths approach

We describe the pricing and hedging of financial options without the use of probability using rough paths. By encoding the volatility of assets in an enhancement of the price trajectory, we give a pathwise presentation of the replication of European options. The continuity properties of rough-paths allow us to generalise the so-called fundamental theorem of derivative trading, showing that a small misspecification of the model will yield only a small excess profit or loss of the replication strategy. Our hedging strategy is an enhanced version of classical delta hedging where we use volatility swaps to hedge the second order terms arising in rough-path integrals, resulting in improved robustness.

preprint2020arXiv

Rough functional quantization and the support of McKean-Vlasov equations

We prove a representation for the support of McKean Vlasov Equations. To do so, we construct functional quantizations for the law of Brownian motion as a measure over the (non-reflexive) Banach space of Hölder continuous paths. By solving optimal Karhunen Loève expansions and exploiting the compact embedding of Gaussian measures, we obtain a sequence of deterministic finite supported measures that converge to the law of a Brownian motion with explicit rate. We show the approximation sequence is near optimal with very favourable integrability properties and prove these approximations remain true when the paths are enhanced to rough paths. These results are of independent interest. The functional quantization results then yield a novel way to build deterministic, finite supported measures that approximate the law of the McKean Vlasov Equation driven by the Brownian motion which crucially avoid the use of random empirical distributions. These are then used to solve an approximate skeleton process that characterises the support of the McKean Vlasov Equation. We give explicit rates of convergence for the deterministic finite supported measures in rough-path Hölder metrics and determine the size of the particle system required to accurately estimate the law of McKean Vlasov equations with respect to the Hölder norm.

preprint2015arXiv

Smoothness of the density for solutions to Gaussian rough differential equations

We consider stochastic differential equations of the form $dY_t=V(Y_t)\,dX_t+V_0(Y_t)\,dt$ driven by a multi-dimensional Gaussian process. Under the assumption that the vector fields $V_0$ and $V=(V_1,\ldots,V_d)$ satisfy Hörmander's bracket condition, we demonstrate that $Y_t$ admits a smooth density for any $t\in(0,T]$, provided the driving noise satisfies certain nondegeneracy assumptions. Our analysis relies on relies on an interplay of rough path theory, Malliavin calculus and the theory of Gaussian processes. Our result applies to a broad range of examples including fractional Brownian motion with Hurst parameter $H>1/4$, the Ornstein-Uhlenbeck process and the Brownian bridge returning after time $T$.

preprint2015arXiv

Tail estimates for Markovian rough paths

We work in the context of Markovian rough paths associated to a class of uniformly subelliptic Dirichlet forms [25] and prove a better-than-exponential tail estimate for the accummulated local p-variation functional, which has been introduced and studied in [17]. We comment on the significance of these estimates to a range of currently-studied problems, including the recent results of Chevyrev and Lyons in [18].

preprint2014arXiv

The filtering equations revisited

The problem of nonlinear filtering has engendered a surprising number of mathematical techniques for its treatment. A notable example is the change-of--probability-measure method originally introduced by Kallianpur and Striebel to derive the filtering equations and the Bayes-like formula that bears their names. More recent work, however, has generally preferred other methods. In this paper, we reconsider the change-of-measure approach to the derivation of the filtering equations and show that many of the technical conditions present in previous work can be relaxed. The filtering equations are established for general Markov signal processes that can be described by a martingale-problem formulation. Two specific applications are treated.

preprint2013arXiv

Integrability and tail estimates for Gaussian rough differential equations

We derive explicit tail-estimates for the Jacobian of the solution flow for stochastic differential equations driven by Gaussian rough paths. In particular, we deduce that the Jacobian has finite moments of all order for a wide class of Gaussian process including fractional Brownian motion with Hurst parameter H>1/4. We remark on the relevance of such estimates to a number of significant open problems.

preprint2011arXiv

On the error estimate for cubature on Wiener space

It was pointed out in Crisan, Ghazali [2] that the error estimate for the cubature on Wiener space algorithm developed in Lyons, Victoir [11] requires an additional assumption on the drift. In this note we demonstrate that it is straightforward to adopt the analysis of Kusuoka [7] to obtain a general estimate without an additional assumptions on the drift. In the process we slightly sharpen the bounds derived in [7].

preprint2011arXiv

Rough Paths on Manifolds

We develop a fundamental framework for and extend the theory of rough paths to Lipschitz-gamma manifolds.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint