Researcher profile

Michael Groom

Michael Groom contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2026arXiv

Quantile-Coupled Flow Matching for Distributional Reinforcement Learning

Unlike standard expected-return Reinforcement Learning (RL), Distributional RL (DRL) models the full return distribution, making it better-suited for uncertainty-aware and risk-sensitive decision-making. Conditional Flow Matching (CFM) critics have recently attracted attention for modelling continuous, multi-modal return distributions. Despite this interest, there remains a substantial metric mismatch: DRL theory relies on the distributional Bellman operator being contractive in the $p$-Wasserstein distance, yet existing CFM critics are trained with arbitrary source-target couplings, so their flow-matching losses are not Wasserstein-aligned surrogates for matching Bellman target return distributions. In this work, we address this mismatch by proposing FlowIQN, a CFM critic that sorts source and Bellman target samples within each mini-batch to approximate the monotone optimal transport coupling, replacing arbitrary pairings with quantile-aligned flow paths. We prove that the loss of our quantile-coupled CFM critic yields a Wasserstein-aligned approximate projection compatible with the foundations of DRL. To our knowledge, FlowIQN is the first flow-matching distributional critic with an explicit Wasserstein-aligned projection guarantee. We further extend FlowIQN with shortcut models for efficient inference. Empirical results show that FlowIQN improves Wasserstein return-distribution accuracy over other CFM critics. It also yields competitive performance on offline RL benchmarks across multiple policy extraction methods, providing a theoretically grounded CFM critic that is readily compatible with DRL pipelines. Code: https://github.com/ori-goals/flowIQN.

preprint2020arXiv

The influence of initial perturbation power spectra on the growth of a turbulent mixing layer induced by Richtmyer-Meshkov instability

This paper investigates the influence of different broadband perturbations on the evolution of a Richtmyer--Meshkov turbulent mixing layer initiated by a Mach 1.84 shock traversing a perturbed interface separating gases with a density ratio of 3:1. Both the bandwidth of modes in the interface perturbation, as well as their relative amplitudes, are varied in a series of carefully designed numerical simulations at grid resolutions up to $3.2\times10^9$ cells. Three different perturbations are considered, characterised by a power spectrum of the form $P(k)\propto k^m$ where $m=-1$, $-2$ and $-3$. The growth of the mixing layer is shown to strongly depend on the initial conditions, with the growth rate exponent $θ$ found to be $0.5$, $0.63$ and $0.75$ for each value of $m$ at the highest grid resolution. The asymptotic values of the molecular mixing fraction $Θ$ are also shown to vary significantly with $m$; at the latest time considered $Θ$ is $0.56$, $0.39$ and $0.20$ respectively. Turbulent kinetic energy (TKE) is also analysed in both the temporal and spectral domains. The temporal decay rate of TKE is found not to match the predicted value of $n=2-3θ$, which is shown to be due to a time-varying {normalised dissipation rate $C_ε$}. In spectral space, the data follow the theoretical scaling of $k^{(m+2)/2}$ at low wavenumbers and tend towards $k^{-3/2}$ and $k^{-5/3}$ scalings at high wavenumbers for the spectra of transverse and normal velocity components respectively. The results represent a significant extension of previous work on the Richtmyer--Meshkov instability evolving from broadband initial perturbations and provide useful benchmarks for future research.