Researcher profile

Rasmus Larsen

Rasmus Larsen contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
7 works
0 followers
5 topics
4 close collaborators

Published work

7 published items

Preprint, 2022, arXiv

Programmatic Policy Extraction by Iterative Local Search

Reinforcement learning policies are often represented by neural networks, but programmatic policies are preferred in some cases because they are more interpretable, amenable to formal verification, or generalize better. While efficient algorithms for learning neural policies exist, learning programmatic policies is challenging. Combining imitation-projection and dataset aggregation with a local search heuristic, we present a simple and direct approach to extracting a programmatic policy from a pretrained neural policy. After examining our local search heuristic on a programming-by-example problem, we demonstrate our programmatic policy extraction method on a pendulum swing-up problem. Whether trained using a hand-crafted expert policy or a learned neural policy, our method discovers simple and interpretable policies that perform almost as well as the original.

Preprint, 2022, arXiv

Reinforcement Learning of Causal Variables Using Mediation Analysis

Many open problems in machine learning are intrinsically related to causality; however, the use of causal analysis in machine learning is still in its early stage. Within a general reinforcement learning setting, we consider the problem of building a general reinforcement learning agent which uses experience to construct a causal graph of the environment, and uses this graph to inform its policy. Our approach has three characteristics: First, we learn a simple, coarse-grained causal graph, in which the variables reflect states at many time instances, and the interventions happen at the level of policies, rather than individual actions. Secondly, we use mediation analysis to obtain an optimization target. By minimizing this target, we define the causal variables. Thirdly, our approach relies on estimating conditional expectations rather than the familiar expected return from reinforcement learning, and we therefore apply a generalization of Bellman's equations. We show the method can learn a plausible causal graph in a grid-world environment, and the agent obtains an improvement in performance when using the causally informed policy. To our knowledge, this is the first attempt to apply causal analysis in a reinforcement learning setting without strict restrictions on the number of states. We have observed that mediation analysis provides a promising avenue for transforming the problem of causal acquisition into one of cost-function minimization, but importantly one which involves estimating conditional expectations. This is a new challenge, and we think that causal reinforcement learning will involve developing methods suited for online estimation of such conditional expectations. Finally, a benefit of our approach is the use of very simple causal models, which are arguably a more natural model of human causal understanding.

Preprint, 2022, arXiv

Static quark anti-quark interactions at non-zero temperature from lattice QCD

We study the interactions of a static quark antiquark pair at non-zero temperature using realistic 2+1 flavor lattice QCD calculations. The study consists of two parts: the first investigates the properties of Wilson line correlators in Coulomb gauge and compares to predictions of hard-thermal loop perturbation theory. As a second step we extract the spectral functions underlying the correlators using four conceptually different methods: spectral function fits, a HTL inspired fit for the correlation function, Padé rational approximation and the Bayesian BR spectral reconstruction. We find that our high statistics Euclidean lattice data are amenable to different hypotheses for the shapes of the spectral function and we compare the implications of each analysis method for the existence and properties of a well defined ground state spectral peak.

Preprint, 2022, arXiv

Static quark anti-quark interactions at non-zero temperature from lattice QCD

We present results on the in-medium interactions of static quark anti-quark pairs using realistic 2+1 HISQ flavor lattice QCD. Focus is put on the extraction of spectral information from Wilson line correlators in Coulomb gauge using four complementary methods. Our results indicate that on HISQ lattices, the position of the dominant spectral peak associated with the real-part of the interquark potential remains unaffected by temperature. This is in contrast to prior work in quenched QCD and we present follow up comparisons to newly generated quenched ensembles.

Preprint, 2020, arXiv

Bethe-Salpeter amplitudes of Upsilons

Based on lattice non-relativistic QCD (NRQCD) studies we present results for Bethe-Salpeter amplitudes for $Υ(1S)$, $Υ(2S)$ and $Υ(3S)$ in vacuum as well as in quark-gluon plasma. Our study is based on 2+1 flavor $48^3 \times 12$ lattices generated using the Highly Improved Staggered Quark (HISQ) action and with a pion mass of $161$ MeV. At zero temperature the Bethe-Salpeter amplitudes follow the expectations based on non-relativistic potential models. At non-zero temperatures, the interpretation of Bethe-Salpeter amplitudes turns out to be more nuanced, but consistent with our previous lattice QCD study of excited Upsilons in quark-gluon plasma.

Preprint, 2020, arXiv

Excited bottomonia in quark-gluon plasma from lattice QCD

We present the first lattice QCD study of up to $3S$ and $2P$ bottomonia at non-zero temperatures. Correlation functions of bottomonia were computed using novel bottomonium operators and a variational technique, within the lattice non-relativistic QCD framework. We analyzed the bottomonium correlation functions based on simple physically-motivated spectral functions. We found evidence of sequential in-medium modifications, in accordance with the sizes of the bottomonium states.

Preprint, 2020, arXiv

Thermal Broadening of Bottomonia: Lattice Non-Relativistic QCD with Extended Operators

We present lattice non-relativistic QCD calculations of bottomonium correlation functions at temperatures $T \simeq 150-350$ MeV. The correlation functions were computed using extended bottomonium operators, and on background gauge-field configurations for 2+1-flavor QCD having physical kaon and nearly-physical pion masses. We analyzed these correlation functions based on simple theoretically-motivated parameterizations of the corresponding spectral functions. The results of our analyses are compatible with significant in-medium thermal broadening of the ground state S- and P-wave bottomonia.