Researcher profile

Tomasz Stebel

Tomasz Stebel contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2026arXiv

Sampling two-dimensional spin systems with transformers

Autoregressive Neural Networks based on dense or convolutional layers have recently been shown to be a viable strategy for generating classical spin systems. Unlike these methods, sampling with transformers is commonly considered to be computationally inefficient. In this work, we propose a novel approach to transformer-based neural samplers in which we generate not a single spin per step but groups of spins. As an additional improvement, we construct a model of approximated probabilities, further improving the efficiency of the algorithm. Despite our approach being computationally heavier than dense networks or CNN-based approaches, we were able to sample larger systems of up to $180 \times 180$ spins in case of the Ising model. The Effective Sample Size of our sampler is $\sim 20$ times larger than that of the previous state-of-the-art neural sampler when trained for the $128 \times 128$ Ising model at critical temperature. Finally, we also test our algorithm on the 2D Edwards-Anderson model, where we train $64\times 64$ spin systems.

preprint2026arXiv

Variational Autoregressive Networks with probability priors

Monte Carlo methods are essential across diverse scientific fields, yet their efficiency is frequently hampered by critical slowing down-a sharp increase in autocorrelation times near phase transitions. Although deep learning approaches, such as neural-network-based samplers, have been proposed to alleviate this issue, they face another serious problem: the difficulty of training the models. This difficulty partially stems from the overly general nature of original machine-learning architectures, which often ignore underlying physical symmetries and force networks to relearn them from scratch. In this paper, we demonstrate that incorporating physical priors into the model significantly enhances performance. Building upon existing strategies that integrate spin-spin interactions, we propose a framework that utilizes a prior probability distribution as a starting point for training. Our results for the Ising model, as well as for the Edwards-Anderson spin glass model, suggest that moving away from `blank slate' models in favor of physics-informed priors reduces the training burden and facilitates the simulation of larger system sizes in discrete spin models.

preprint2023arXiv

Analysis of autocorrelation times in Neural Markov Chain Monte Carlo simulations

We provide a deepened study of autocorrelations in Neural Markov Chain Monte Carlo (NMCMC) simulations, a version of the traditional Metropolis algorithm which employs neural networks to provide independent proposals. We illustrate our ideas using the two-dimensional Ising model. We discuss several estimates of autocorrelation times in the context of NMCMC, some inspired by analytical results derived for the Metropolized Independent Sampler (MIS). We check their reliability by estimating them on a small system where analytical results can also be obtained. Based on the analytical results for MIS we propose a new loss function and study its impact on the autocorelation times. Although, this function's performance is a bit inferior to the traditional Kullback-Leibler divergence, it offers two training algorithms which in some situations may be beneficial. By studying a small, $4 \times 4$, system we gain access to the dynamics of the training process which we visualize using several observables. Furthermore, we quantitatively investigate the impact of imposing global discrete symmetries of the system in the neural network training process on the autocorrelation times. Eventually, we propose a scheme which incorporates partial heat-bath updates which considerably improves the quality of the training. The impact of the above enhancements is discussed for a $16 \times 16$ spin system. The summary of our findings may serve as a guidance to the implementation of Neural Markov Chain Monte Carlo simulations for more complicated models.

preprint2022arXiv

Gradient estimators for normalising flows

Recently a machine learning approach to Monte-Carlo simulations called Neural Markov Chain Monte-Carlo (NMCMC) is gaining traction. In its most popular form it uses neural networks to construct normalizing flows which are then trained to approximate the desired target distribution. In this contribution we present new gradient estimator for Stochastic Gradient Descent algorithm (and the corresponding \texttt{PyTorch} implementation) and show that it leads to better training results for $ϕ^4$ model. For this model our estimator achieves the same precision in approximately half of the time needed in standard approach and ultimately provides better estimates of the free energy. We attribute this effect to the lower variance of the new estimator. In contrary to the standard learning algorithm our approach does not require estimation of the action gradient with respect to the fields, thus has potential of further speeding up the training for models with more complicated actions.

preprint2021arXiv

Prompt photon production in proton collisions as a probe of parton scattering in high energy limit

We study the prompt photon hadroproduction at the LHC with the $k_T$-factorization approach and the $qg^* \to qγ$ and $g^*g^* \to q\bar qγ$ partonic channels, using three unintegrated gluon distributions which depend on gluon transverse momentum. They represent three different theoretical schemes which are usually considered in the $k_T$-factorization approach, known under the acronyms: KMR, CCFM and GBW gluon distributions. We find sensitivity of the calculated prompt photon transverse momentum distribution to the gluon transverse momentum distribution. The predictions obtained with the three approaches are compared to data, that allows to differentiate between them. We also discuss the significance of the two partonic channels, confronted with the expectations which are based on the applicability of the $k_T$-factorization scheme in the high energy approximation.

preprint2020arXiv

Associated top quark pair production with a heavy boson: differential cross sections at NLO+NNLL accuracy

We present theoretical predictions for selected differential cross sections for the process $pp \to t \bar{t} B$ at the LHC, where $B$ can be a Higgs ($H$), a $Z$ or a $W$ boson. The predictions are calculated in the direct QCD framework up to the next-to-next-leading logarithmic (NNLL) accuracy and matched to the complete NLO results including QCD and electroweak effects. Additionally, results for the total cross sections are provided. The calculations deliver a significant improvement of the theoretical predictions, especially for the $t \bar{t} H$ and the $t \bar{t} Z$ production. In these cases, predictions for both the total and differential cross sections are remarkably stable with respect to the central scale choice and carry a substantially reduced scale uncertainty in comparison with the complete NLO predictions.

preprint2020arXiv

Drell-Yan production with the CCFM-K evolution

We discuss the Drell-Yan dilepton production using the transverse momentum dependent parton distributions evolved with the Catani-Ciafaloni-Fiorani-Marchesini-Kwieciński (CCFM-K) equations in the single loop approximation. Such equations are obtained assuming angular ordering of emitted partons (coherence) for $x\sim 1$ and transverse momentum ordering for $x \ll 1$. This evolution scheme also contains the Collins-Soper-Sterman (CSS) soft gluon resummation. We make a comparison with a broad class of data on transverse momentum spectra of low mass Drell-Yan dileptons.

preprint2020arXiv

Sub-femtometer scale color charge correlations in the proton

Color charge correlations in the proton at moderately small $x\sim 0.1$ are extracted from its light-cone wave function. The charge fluctuations are far from Gaussian and they exhibit interesting dependence on impact parameter and on the relative transverse momentum (or distance) of the gluon probes. We provide initial conditions for small-$x$ Balitsky-Kovchegov evolution of the dipole scattering amplitude with impact parameter and $\hat r \cdot \hat b$ dependence, and with non-zero $C$-odd component due to three-gluon exchange. Lastly, we compute the (forward) Weizsaecker-Williams gluon distributions, including the distribution of linearly polarized gluons, up to fourth order in $A^+$. The correction due to the quartic correlator provides a transverse momentum scale, $q > 0.5$ GeV, for nearly maximal polarization.

preprint2020arXiv

Top Precision for Associated Top-Pair Production Processes at the LHC

The studies of the associated production processes of a top-quark pair with a colour-singlet boson, e.g. Higgs, W or Z, are among the highest priorities of the LHC programme. Correspondingly, improvements in precision of theoretical predictions for these processes are of central importance. In this talk, we review our latest results on resummation of soft gluon corrections. The resummation is carried out using the direct QCD Mellin space technique in three-particle invariant mass kinematics. We discuss the impact of the soft gluon corrections on predictions for total cross sections and differential distributions.