Source author record

Rasmus Larsen

Rasmus Larsen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

hep-lat hep-ph nucl-th Artificial Intelligence Machine Learning

Catalog footprint

What is connected

11works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Programmatic Policy Extraction by Iterative Local Search

Reinforcement learning policies are often represented by neural networks, but programmatic policies are preferred in some cases because they are more interpretable, amenable to formal verification, or generalize better. While efficient algorithms for learning neural policies exist, learning programmatic policies is challenging. Combining imitation-projection and dataset aggregation with a local search heuristic, we present a simple and direct approach to extracting a programmatic policy from a pretrained neural policy. After examining our local search heuristic on a programming by example problem, we demonstrate our programmatic policy extraction method on a pendulum swing-up problem. Both when trained using a hand crafted expert policy and a learned neural policy, our method discovers simple and interpretable policies that perform almost as well as the original.

preprint2022arXiv

Reinforcement Learning of Causal Variables Using Mediation Analysis

Many open problems in machine learning are intrinsically related to causality, however, the use of causal analysis in machine learning is still in its early stage. Within a general reinforcement learning setting, we consider the problem of building a general reinforcement learning agent which uses experience to construct a causal graph of the environment, and use this graph to inform its policy. Our approach has three characteristics: First, we learn a simple, coarse-grained causal graph, in which the variables reflect states at many time instances, and the interventions happen at the level of policies, rather than individual actions. Secondly, we use mediation analysis to obtain an optimization target. By minimizing this target, we define the causal variables. Thirdly, our approach relies on estimating conditional expectations rather the familiar expected return from reinforcement learning, and we therefore apply a generalization of Bellman's equations. We show the method can learn a plausible causal graph in a grid-world environment, and the agent obtains an improvement in performance when using the causally informed policy. To our knowledge, this is the first attempt to apply causal analysis in a reinforcement learning setting without strict restrictions on the number of states. We have observed that mediation analysis provides a promising avenue for transforming the problem of causal acquisition into one of cost-function minimization, but importantly one which involves estimating conditional expectations. This is a new challenge, and we think that causal reinforcement learning will involve development methods suited for online estimation of such conditional expectations. Finally, a benefit of our approach is the use of very simple causal models, which are arguably a more natural model of human causal understanding.

preprint2022arXiv

Static quark anti-quark interactions at non-zero temperature from lattice QCD

We present results on the in-medium interactions of static quark anti-quark pairs using realistic 2+1 HISQ flavor lattice QCD. Focus is put on the extraction of spectral information from Wilson line correlators in Coulomb gauge using four complementary methods. Our results indicate that on HISQ lattices, the position of the dominant spectral peak associated with the real-part of the interquark potential remains unaffected by temperature. This is in contrast to prior work in quenched QCD and we present follow up comparisons to newly generated quenched ensembles.

preprint2022arXiv

Static quark anti-quark interactions at non-zero temperature from lattice QCD

We study the interactions of a static quark antiquark pair at non-zero temperature using realistic 2+1 flavor lattice QCD calculations. The study consists of two parts: the first investigates the properties of Wilson line correlators in Coulomb gauge and compares to predictions of hard-thermal loop perturbation theory. As a second step we extract the spectral functions underlying the correlators using four conceptually different methods: spectral function fits, a HTL inspired fit for the correlation function, Padé rational approximation and the Bayesian BR spectral reconstruction. We find that our high statistics Euclidean lattice data are amenable to different hypotheses for the shapes of the spectral function and we compare the implications of each analysis method for the existence and properties of a well defined ground state spectral peak.

preprint2020arXiv

Bethe-Salpeter amplitudes of Upsilons

Based on lattice non-relativistic QCD (NRQCD) studies we present results for Bethe-Salpeter amplitudes for $Υ(1S)$, $Υ(2S)$ and $Υ(3S)$ in vacuum as well as in quark-gluon plasma. Our study is based on 2+1 flavor $48^3 \times 12$ lattices generated using the Highly Improved Staggered Quark (HISQ) action and with a pion mass of $161$ MeV. At zero temperature the Bethe-Salpeter amplitudes follow the expectations based on non-relativistic potential models. At non-zero temperatures, the interpretation of Bethe-Salpeter amplitudes turns out to be more nuanced, but consistent with our previous lattice QCD study of excited Upsilons in quark-gluon plasma.

preprint2020arXiv

Excited bottomonia in quark-gluon plasma from lattice QCD

We present the first lattice QCD study of up to $3S$ and $2P$ bottomonia at non-zero temperatures. Correlation functions of bottomonia were computed using novel bottomonium operators and a variational technique, within the lattice non-relativistic QCD framework. We analyzed the bottomonium correlation functions based on simple physically-motivated spectral functions. We found evidence of sequential in-medium modifications, in accordance with the sizes of the bottomonium states.

preprint2020arXiv

Thermal Broadening of Bottomonia: Lattice Non-Relativistic QCD with Extended Operators

We present lattice non-relativistic QCD calculations of bottomonium correlation functions at temperatures $T \simeq 150-350$ MeV. The correlation functions were computed using extended bottomonium operators, and on background gauge-field configurations for 2+1-flavor QCD having physical kaon and nearly-physical pion masses. We analyzed these correlation functions based on simple theoretically-motivated parameterizations of the corresponding spectral functions. The results of our analyses are compatible with significant in-medium thermal broadening of the ground state S- and P-wave bottomonia.

preprint2016arXiv

Classical interactions of the instanton-dyons with antidyons

Instanton-dyons, also known as instanton-monopoles or instanton-quarks, are topological constituents of the instantons at nonzero temperature and nonzero expectation value of $A_4$. While the interaction between instanton-dyons has been calculated to one-loop order by a number of authors, that for dyon-antidyon pairs remains unknown even at the classical level. In this work we are filling this gap, by solving the gradient flow equation on a 3d lattice. We start with two well separated objects. We find that, after initial rapid relaxation, the configurations follow "streamline" set of configurations, which is basically independent on the initial configurations used. In striking difference to instanton-antiinstanton streamlines, in this case it ends at a quasi-stationary configuration, with an abrupt drop to perturbative fields. We parameterize the action of the streamline configurations, which is to be used in future many-body calculations.

preprint2016arXiv

Instanton-dyon Ensemble with two Dynamical Quarks: the Chiral Symmetry Breaking

This is the second paper of the series aimed at understanding the ensemble of instanton-dyons, now with two flavors of light dynamical quarks. The partition function is appended by the fermionic factor, $(det T)^{N_f}$ and Dirac eigenvalue spectra at small values are derived from the numerical simulation of 64 and 128 dyons. Those spectra show clear chiral symmetry breaking pattern at high dyon density.

preprint2016arXiv

Instanton-dyon Ensembles III: Exotic Quark Flavors

"Exotic quarks" in the title refers to a modification of quark periodicity condition on the thermal circle by introduction of some phases -- known also as "flavor holonomies" -- different quark flavors. These phases provide a valuable tool, to be used for better understanding of deconfinement and chiral restoration phase transitions: by changing them one can dramatically modify both phase transitions. In the language of instanton constituents -- instanton-dyons or monopoles -- it has a very direct explanation: the interplay of flavor and color holonomies can switch topological zero modes between various dyon types. The model we will study in detail, the so called $Z_{N_c}$-symmetric QCD model with equal number of colors and flavors $N_c=N_f=2$ and special arrangement of flavor and color holonomies, ensure "most democratic" setting, in which each quark flavor and each dyon type are in one-to-one correspondence. The usual QCD has the opposite "most exclusive" arrangement: all quarks are antiperiodic and thus all zero modes fall on only one -- twisted or $L$ -- dyon type. As we show by ensemble simulation, deconfinement and chiral restoration phase transitions in these two models are dramatically different. In the usual QCD both are smooth crossovers: but in the case of $Z_2$-symmetric model deconfinement becomes strong first order transition, while chiral symmetry remains broken for all dyon densities studied. These results are in good correspondence with those from recent lattice simulations.

preprint2015arXiv

Interacting Ensemble of the Instanton-dyons and Deconfinement Phase Transition in the SU(2) Gauge Theory

Instanton-dyons, also known as instanton-monopoles or instanton-quarks, are topological constituents of the instantons at nonzero temperature and holonomy. We perform numerical simulations of the ensemble of interacting dyons for the SU(2) pure gauge theory. Unlike previous studies, we focus on back reaction on the holonomy and the issue of confinement. We calculate the free energy as a function of the holonomy and the dyon densities, using standard Metropolis Monte Carlo and integration over parameter methods. We observe that as the temperature decreases and the dyon density grows, its minimum indeed moves from small holonomy to the value corresponding to confinement. We then report various parameters of the self-consistent ensembles as a function of temperature, and investigate the role of inter-particle correlations.

Rasmus Larsen

What is connected

Connect this record

See the researcher in context

Building this map preview

11 published item(s)

Programmatic Policy Extraction by Iterative Local Search

Reinforcement Learning of Causal Variables Using Mediation Analysis

Static quark anti-quark interactions at non-zero temperature from lattice QCD

Static quark anti-quark interactions at non-zero temperature from lattice QCD

Bethe-Salpeter amplitudes of Upsilons

Excited bottomonia in quark-gluon plasma from lattice QCD

Thermal Broadening of Bottomonia: Lattice Non-Relativistic QCD with Extended Operators

Classical interactions of the instanton-dyons with antidyons

Instanton-dyon Ensemble with two Dynamical Quarks: the Chiral Symmetry Breaking

Instanton-dyon Ensembles III: Exotic Quark Flavors

Interacting Ensemble of the Instanton-dyons and Deconfinement Phase Transition in the SU(2) Gauge Theory