Researcher profile

Matthieu Schaller

Matthieu Schaller contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
19works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

19 published item(s)

preprint2026arXiv

Cosmological back-reaction of baryons on dark matter in the CAMELS simulations

Baryonic processes such as radiative cooling and feedback from massive stars and active galactic nuclei (AGN) directly redistribute baryons in the Universe but also indirectly redistribute dark matter due to changes in the gravitational potential. In this work, we investigate this &#34;back-reaction&#34; of baryons on dark matter using thousands of cosmological hydrodynamic simulations from the Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) project, including parameter variations in the SIMBA, IllustrisTNG, ASTRID, and Swift-EAGLE galaxy formation models. Matching haloes to corresponding N-body (dark matter-only) simulations, we find that virial masses decrease owing to the ejection of baryons by feedback. Relative to N-body simulations, halo profiles show an increased dark matter density in the center (due to radiative cooling) and a decrease in density farther out (due to feedback), with both effects being strongest in SIMBA (> 450% increase at r < 0.01 Rvir). The clustering of dark matter strongly responds to changes in baryonic physics, with dark matter power spectra in some simulations from each model showing as much as 20% suppression or increase in power at k ~ 10 h/Mpc relative to N-body simulations. We find that the dark matter back-reaction depends intrinsically on cosmology (Omega_m and sigma_8) at fixed baryonic physics, and varies strongly with the details of the feedback implementation. These results emphasize the need for marginalizing over uncertainties in baryonic physics to extract cosmological information from weak lensing surveys as well as their potential to constrain feedback models in galaxy evolution.

preprint2026arXiv

Luminosity-Dependent Assembly Bias of Central Galaxies from Weak Lensing and Clustering

Assembly bias, which is the variation in halo clustering at fixed mass driven by formation history, has long been predicted by numerical simulations but remains difficult to confirm observationally. Previous studies have reported evidence for halo assembly bias by dividing samples according to galaxy stellar mass using various methods. In this work, we present observational measurements of halo assembly bias based on the luminosity of spectroscopically confirmed brightest cluster galaxies (BCGs). Using cluster catalogs and shear measurements from the DESI Legacy Imaging Surveys, we employ a mass-dependent halo-bias model to disentangle halo bias from its underlying mass dependence in galaxy-galaxy lensing and clustering measurements. We confirm that brighter BCGs are less strongly clustered on large scales, with a relative bias ratio deviating from unity at the $\sim3σ$ level, suggesting the presence of assembly bias. Similar qualitative trends are also found in the FLAMINGO and MillenniumTNG hydrodynamical simulations, strengthening the connection between galaxy luminosity and halo formation history.

preprint2026arXiv

Modelling the evolution and influence of dust in cosmological simulations that include the cold phase of the interstellar medium

While marginal in mass terms, dust grains play an outsized role in both the physics and observation of the interstellar medium (ISM). However, explicit modelling of this ISM constituent remains uncommon in large cosmological simulations. In this work, we present a model for the life-cycle of dust in the ISM that couples to the forthcoming COLIBRE galaxy formation model, which explicitly simulates the cold ISM. We follow 6 distinct grain types: 3 chemical species, including carbon and two silicate grains, with 2 size bins each. Our dust model accounts for seeding of grains from stellar ejecta, self-consistent element-by-element metal yields and growth by accretion, grain size transfer (shattering and coagulation) and destruction of dust by thermal sputtering in the ISM. We detail the calibration of this model, particularly the use of a clumping factor, to account for unresolved gas clouds in which dust readily evolves. We present a fiducial run in a 25$^3$~cMpc$^3$ cosmological volume that displays good agreement with observations of the cosmic evolution of dust density, as well as the $z=0$ galaxy dust mass function and dust scaling relations. We highlight known tensions between observational datasets of the dust-to-gas ratio as a function of metallicity depending on which metallicity calibrator is used; our model favours higher-normalisation metallicity calibrators, which agree with the observations within 0.1~dex for stellar masses $>10^9 \; {\rm M_\odot}$. We compare the grain size distribution to observations of local galaxies, and find that our simulation suggests a higher concentration of small grains, associated with more diffuse ISM and the warm-neutral medium (WNM), which both play a key role in boosting H$_2$ content. Putting these results and modelling approaches in context, we set the stage for upcoming insights into the dusty ISM of galaxies using the COLIBRE simulations.

preprint2023arXiv

On the anisotropic distribution of clusters in the local Universe

In his 2021 lecture to the Canadian Association of Physicists Congress, P.J.E. Peebles pointed out that the brightest extra-galactic radio sources tend to be aligned with the plane of the de Vaucouleur Local Supercluster up to redshifts of $z=0.02$ ($d_{\rm MW}\approx 85~\rm{Mpc}$). He then asked whether such an alignment of clusters is anomalous in the standard $Λ$CDM framework. In this letter, we employ an alternative, absolute orientation agnostic, measure of the anisotropy based on the inertia tensor axis ratio of these brightest sources and use a large cosmological simulation from the FLAMINGO suite to measure how common such an alignment of structures is. We find that only 3.5% of randomly selected regions display an anisotropy of their clusters more extreme than the one found in the local Universe&#39;s radio data. This sets the region around the Milky Way as a $1.85σ$ outlier. Varying the selection parameters of the objects in the catalogue, we find that the clusters in the local Universe are never more than $2σ$ away from the simulations&#39; prediction for the same selection. We thus conclude that the reported anisotropy, whilst note-worthy, is not in tension with the $Λ$CDM paradigm.

preprint2022arXiv

SIBELIUS-DARK: a galaxy catalogue of the Local Volume from a constrained realisation simulation

We present SIBELIUS-DARK, a constrained realisation simulation of the local volume to a distance of 200~Mpc from the Milky Way. SIBELIUS-DARK is the first study of the \textit{Simulations Beyond The Local Universe} (SIBELIUS) project, which has the goal of embedding a model Local Group-like system within the correct cosmic environment. The simulation is dark-matter-only, with the galaxy population calculated using the semi-analytic model of galaxy formation, GALFORM. We demonstrate that the large-scale structure that emerges from the SIBELIUS constrained initial conditions matches well the observational data. The inferred galaxy population of SIBELIUS-DARK also match well the observational data, both statistically for the whole volume and on an object-by-object basis for the most massive clusters. For example, the $K$-band number counts across the whole sky, and when divided between the northern and southern Galactic hemispheres, are well reproduced by SIBELIUS-DARK. We find that the local volume is somewhat unusual in the wider context of $Λ$CDM: it contains an abnormally high number of supermassive clusters, as well as an overall large-scale underdensity at the level of $\approx 5$\% relative to the cosmic mean. However, whilst rare, the extent of these peculiarities does not significantly challenge the $Λ$CDM model. SIBELIUS-DARK is the most comprehensive constrained realisation simulation of the local volume to date, and with this paper we publicly release the halo and galaxy catalogues at $z=0$, which we hope will be useful to the wider astronomy community.

preprint2022arXiv

Spin-driven jet feedback in idealised simulations of galaxy groups and clusters

We implement a black hole spin evolution and jet feedback model into SWIFT, a smoothed particle hydrodynamics code. The jet power is determined self-consistently assuming Bondi accretion, using a realistic, spin-dependant efficiency. The jets are launched along the spin axis of the black hole, resulting in natural reorientation and precession. We apply the model to idealised simulations of galaxy groups and clusters, finding that jet feedback successfully quenches gas cooling and star formation in all systems. Our group-size halo ($M_\mathrm{200}=10^{13}$ $\mathrm{M}_\odot$) is quenched by a strong jet episode triggered by a cooling flow, and it is kept quenched by a low-power jet fed from hot halo accretion. In more massive systems ($M_\mathrm{200}\geq 10^{14}$ $\mathrm{M}_\odot$), hot halo accretion is insufficient to quench the galaxies, or to keep them quenched after the first cooling episode. These galaxies experience multiple episodes of gas cooling, star formation and jet feedback. In the most massive galaxy cluster that we simulate ($M_\mathrm{200}=10^{15}$ $\mathrm{M}_\odot$), we find peak cold gas masses of $10^{10}$ $\mathrm{M}_\odot$ and peak star formation rates of a few times $100$ $\mathrm{M}_\odot\mathrm{yr}^{-1}$. These values are achieved during strong cooling flows, which also trigger the strongest jets with peak powers of $10^{47}$ $\mathrm{erg}\hspace{0.3mm}\mathrm{s}^{-1}$. These jets subsequently shut off the cooling flows and any associated star formation. Jet-inflated bubbles draw out low-entropy gas that subsequently forms dense cooling filaments in their wakes, as seen in observations.

preprint2022arXiv

The importance of black hole repositioning for galaxy formation simulations

Active galactic nucleus (AGN) feedback from accreting supermassive black holes (SMBHs) is an essential ingredient of galaxy formation simulations. The orbital evolution of SMBHs is affected by dynamical friction that cannot be predicted self-consistently by contemporary simulations of galaxy formation in representative volumes. Instead, such simulations typically use a simple &#34;repositioning&#34; of SMBHs, but the effects of this approach on SMBH and galaxy properties have not yet been investigated systematically. Based on a suite of smoothed particle hydrodynamics simulations with the SWIFT code and a Bondi-Hoyle-Lyttleton subgrid gas accretion model, we investigate the impact of repositioning on SMBH growth and on other baryonic components through AGN feedback. Across at least a factor ~1000 in mass resolution, SMBH repositioning (or an equivalent approach) is a necessary prerequisite for AGN feedback; without it, black hole growth is negligible. Limiting the effective repositioning speed to $\lesssim$ 10 km/s delays the onset of AGN feedback and severely limits its impact on stellar mass growth in the centre of massive galaxies. Repositioning has three direct physical consequences. It promotes SMBH mergers and thus accelerates their initial growth. In addition, it raises the peak density of the ambient gas and reduces the SMBH velocity relative to it, giving a combined boost to the accretion rate that can reach many orders of magnitude. Our results suggest that a more sophisticated and/or better calibrated treatment of SMBH repositioning is a critical step towards more predictive galaxy formation simulations.

preprint2022arXiv

The importance of the way in which supernova energy is distributed around young stellar populations in simulations of galaxies

Supernova (SN) feedback plays a crucial role in simulations of galaxy formation. Because blastwaves from individual SNe occur on scales that remain unresolved in modern cosmological simulations, SN feedback must be implemented as a subgrid model. Differences in the manner in which SN energy is coupled to the local interstellar medium and in which excessive radiative losses are prevented have resulted in a zoo of models used by different groups. However, the importance of the selection of resolution elements around young stellar particles for SN feedback has largely been overlooked. In this work, we examine various selection methods using the smoothed particle hydrodynamics code SWIFT. We run a suite of isolated disk galaxy simulations of a Milky Way-mass galaxy and small cosmological volumes, all with the thermal stochastic SN feedback model used in the EAGLE simulations. We complement the original mass-weighted neighbour selection with a novel algorithm guaranteeing that the SN energy distribution is as close to isotropic as possible. Additionally, we consider algorithms where the energy is injected into the closest, least dense, or most dense neighbour. We show that different neighbour-selection strategies cause significant variations in star formation rates, gas densities, wind mass loading factors, and galaxy morphology. The isotropic method results in more efficient feedback than the conventional mass-weighted selection. We conclude that the manner in which the feedback energy is distributed among the resolution elements surrounding a feedback event is as important as changing the amount of energy by factors of a few.

preprint2022arXiv

The interplay between AGN feedback and precipitation of the intracluster medium in simulations of galaxy groups and clusters

Using high-resolution hydrodynamical simulations of galaxy clusters, we study the interaction between the brightest cluster galaxy, its supermassive black hole (BH) and the intracluster medium (ICM). We create initial conditions for which the ICM is in hydrostatic equilibrium within the gravitational potential from the galaxy and an NFW dark matter halo. Two free parameters associated with the thermodynamic profiles determine the cluster gas fraction and the central temperature, where the latter can be used to create cool-core or non-cool-core systems. Our simulations include radiative cooling, star formation, BH accretion, and stellar and active galactic nucleus (AGN) feedback. Even though the energy of AGN feedback is injected thermally and isotropically, it leads to anisotropic outflows and buoyantly rising bubbles. We find that the BH accretion rate (BHAR) is highly variable and only correlates strongly with the star formation rate (SFR) and the ICM when it is averaged over more than $1~\rm Myr$. We generally find good agreement with the theoretical precipitation framework. In $10^{13}~\rm M_\odot$ haloes, AGN feedback quenches the central galaxy and converts cool-core systems into non-cool-core systems. In contrast, higher-mass, cool-core clusters evolve cyclically. Episodes of high BHAR raise the entropy of the ICM out to the radius where the ratio of the cooling time and the local dynamical time $t_{\rm cool}/t_{\rm dyn} > 10$, thus suppressing condensation and, after a delay, the BHAR. The corresponding reduction in AGN feedback allows the ICM to cool and become unstable to precipitation, thus initiating a new episode of high SFR and BHAR.

preprint2022arXiv

The Milky Way&#39;s plane of satellites: consistent with $Λ$CDM

The &#34;plane of satellites problem&#34; describes the arrangement of the Milky Way&#39;s 11 brightest satellite galaxies in a remarkably thin plane, possibly supported by rotation. This is in apparent contradiction to the standard cosmological model, wherein the Galaxy is surrounded by a dispersion-supported dark matter halo. Here, we show that the reported exceptional anisotropy of the satellite system is strongly contingent on a lopsided radial distribution, which earlier simulations have failed to reproduce, combined with the close but fleeting conjunction of the two most distant satellites, Leo I and Leo II. Using Gaia proper motions, we show that the orbital pole alignment is much more common than previously reported, and reveal the plane of satellites to be transient rather than rotationally supported. Comparing to new simulations, where such short-lived planes are common, we find the Milky Way satellites to be compatible with standard model expectations.

preprint2021arXiv

A high-resolution cosmological simulation of a strong gravitational lens

We present a cosmological hydrodynamical simulation of a 10^13 Msun galaxy group and its environment (out to 10 times the virial radius) carried out using the EAGLE model of galaxy formation. Exploiting a novel technique to increase the resolution of the dark matter calculation independently of that of the gas, the simulation resolves dark matter haloes and subhaloes of mass 5x10^6 Msun . It is therefore useful for studying the abundance and properties of the haloes and subhaloes targeted in strong lensing tests of the cold dark matter model. We estimate the halo and subhalo mass functions and discuss how they are affected both by the inclusion of baryons in the simulation and by the environment. We find that the halo and subhalo mass functions have lower amplitude in the hydrodynamical simulation than in its dark matter only counterpart. This reflects the reduced growth of haloes in the hydrodynamical simulation due to the early loss of gas by reionisation and galactic winds and, additionally, in the case of subhaloes, disruption by enhanced tidal effects within the host halo due to the presence of a massive central galaxy. The distribution of haloes is highly anisotropic reflecting the filamentary character of mass accretion onto the cluster. As a result, there is significant variation in the number of structures with viewing direction. The median number of structures near the centre of the halo, when viewed in projection, is reduced by a factor of two when baryons are included.

preprint2021arXiv

SEAGLE--II: Constraints on feedback models in galaxy formation from massive early type strong lens galaxies

We use nine different galaxy formation scenarios in ten cosmological simulation boxes from the EAGLE suite of ΛCDM hydrodynamical simulations to assess the impact of feedback mechanisms in galaxy formation and compare these to observed strong gravitational lenses. To compare observations with simulations, we create strong lenses with $M_\star$ > $10^{11}$ $M_\odot$ with the appropriate resolution and noise level, and model them with an elliptical power-law mass model to constrain their total mass density slope. We also obtain the mass-size relation of the simulated lens-galaxy sample. We find significant variation in the total mass density slope at the Einstein radius and in the projected stellar mass-size relation, mainly due to different implementations of stellar and AGN feedback. We find that for lens selected galaxies, models with either too weak or too strong stellar and/or AGN feedback fail to explain the distribution of observed mass-density slopes, with the counter-intuitive trend that increasing the feedback steepens the mass density slope around the Einstein radius ($\approx$ 3-10 kpc). Models in which stellar feedback becomes inefficient at high gas densities, or weaker AGN feedback with a higher duty cycle, produce strong lenses with total mass density slopes close to isothermal (i.e. -d log(ρ)/d log(r) $\approx$ 2.0) and slope distributions statistically agreeing with observed strong lens galaxies in SLACS and BELLS. Agreement is only slightly worse with the more heterogeneous SL2S lens galaxy sample. Observations of strong-lens selected galaxies thus appear to favor models with relatively weak feedback in massive galaxies.

preprint2020arXiv

A Hybrid MPI+Threads Approach to Particle Group Finding Using Union-Find

The Friends-of-Friends (FoF) algorithm is a standard technique used in cosmological $N$-body simulations to identify structures. Its goal is to find clusters of particles (called groups) that are separated by at most a cut-off radius. $N$-body simulations typically use most of the memory present on a node, leaving very little free for a FoF algorithm to run on-the-fly. We propose a new method that utilises the common Union-Find data structure and a hybrid MPI+threads approach. The algorithm can also be expressed elegantly in a task-based formalism if such a framework is used in the rest of the application. We have implemented our algorithm in the open-source cosmological code, SWIFT. Our implementation displays excellent strong- and weak-scaling behaviour on realistic problems and compares favourably (speed-up of 18x) over other methods commonly used in the $N$-body community.

preprint2020arXiv

Constraining the inner density slope of massive galaxy clusters

We determine the inner density profiles of massive galaxy clusters (M$_{200}$ > $5 \times 10^{14}$ M$_{\odot}$) in the Cluster-EAGLE (C-EAGLE) hydrodynamic simulations, and investigate whether the dark matter density profiles can be correctly estimated from a combination of mock stellar kinematical and gravitational lensing data. From fitting mock stellar kinematics and lensing data generated from the simulations, we find that the inner density slopes of both the total and the dark matter mass distributions can be inferred reasonably well. We compare the density slopes of C-EAGLE clusters with those derived by Newman et al. for 7 massive galaxy clusters in the local Universe. We find that the asymptotic best-fit inner slopes of &#34;generalized&#34; NFW (gNFW) profiles, $γ_{\rm gNFW}$, of the dark matter haloes of the C-EAGLE clusters are significantly steeper than those inferred by Newman et al. However, the mean mass-weighted dark matter density slopes of the simulated clusters are in good agreement with the Newman et al. estimates. We also find that the estimate of $γ_{\rm gNFW}$ is very sensitive to the constraints from weak lensing measurements in the outer parts of the cluster and a bias can lead to an underestimate of $γ_{\rm gNFW}$.

preprint2020arXiv

Numerical convergence of hydrodynamical simulations of galaxy formation: the abundance and internal structure of galaxies and their cold dark matter haloes

We address the issue of numerical convergence in cosmological smoothed particle hydrodynamics simulations using a suite of runs drawn from the EAGLE project. Our simulations adopt subgrid models that produce realistic galaxy populations at a fiducial mass and force resolution, but systematically vary the latter in order to study their impact on galaxy properties. We provide several analytic criteria that help guide the selection of gravitational softening for hydrodynamical simulations, and present results from runs that both adhere to and deviate from them. Unlike dark matter-only simulations, hydrodynamical simulations exhibit a strong sensitivity to gravitational softening, and care must be taken when selecting numerical parameters. Our results--which focus mainly on star formation histories, galaxy stellar mass functions and sizes--illuminate three main considerations. First, softening imposes a minimum resolved escape speed, $v_ε$, due to the binding energy between gas particles. Runs that adopt such small softening lengths that $v_ε\gt 10\,{\rm km s^{-1}}$ (the sound speed in ionised $\sim 10^4\,{\rm K}$ gas) suffer from reduced effects of photo-heating. Second, feedback from stars or active galactic nuclei may suffer from numerical over-cooling if the gravitational softening length is chosen below a critical value, $ε_{\rm eFB}$. Third, we note that small softening lengths exacerbate the segregation of stars and dark matter particles in halo centres, often leading to the counter-intuitive result that galaxy sizes {\em increase} as softening is reduced. The structure of dark matter haloes in hydrodynamical runs respond to softening in a way that reflects the sensitivity of their galaxy populations to numerical parameters.

preprint2020arXiv

The BUFFALO HST Survey

The Beyond Ultra-deep Frontier Fields and Legacy Observations (BUFFALO) is a 101 orbit + 101 parallel Cycle 25 Hubble Space Telescope Treasury program taking data from 2018-2020. BUFFALO will expand existing coverage of the Hubble Frontier Fields (HFF) in WFC3/IR F105W, F125W, and F160W and ACS/WFC F606W and F814W around each of the six HFF clusters and flanking fields. This additional area has not been observed by HST but is already covered by deep multi-wavelength datasets, including Spitzer and Chandra. As with the original HFF program, BUFFALO is designed to take advantage of gravitational lensing from massive clusters to simultaneously find high-redshift galaxies which would otherwise lie below HST detection limits and model foreground clusters to study properties of dark matter and galaxy assembly. The expanded area will provide a first opportunity to study both cosmic variance at high redshift and galaxy assembly in the outskirts of the large HFF clusters. Five additional orbits are reserved for transient followup. BUFFALO data including mosaics, value-added catalogs and cluster mass distribution models will be released via MAST on a regular basis, as the observations and analysis are completed for the six individual clusters.

preprint2019arXiv

Hydrostatic mass estimates of massive galaxy clusters: a study with varying hydrodynamics flavours and non-thermal pressure support

We use a set of 45 simulated clusters with a wide mass range ($8\times 10^{13} < M_{500}~[$M$_{\odot}]~< 2\times 10^{15}$) to investigate the effect of varying hydrodynamics flavours on cluster mass estimates. The cluster zooms were simulated using the same cosmological models as the BAHAMAS and C-EAGLE projects, leading to differences in both the hydrodynamic solvers and the subgrid physics but still producing clusters which broadly match observations. At the same mass resolution as BAHAMAS, for the most massive clusters ($M_{500} > 10^{15}$ M$_{\odot}$), we find changes in the SPH method produce the greatest differences in the final halo, while the subgrid models dominate at lower mass. By calculating the mass of all of the clusters using different permutations of the pressure, temperature and density profiles, created with either the true simulated data or mock spectroscopic data, we find that the spectroscopic temperature causes a bias in the hydrostatic mass estimates which increases with the mass of the cluster, regardless of the SPH flavour used. For the most massive clusters, the estimated mass of the cluster using spectroscopic density and temperature profiles is found to be as low as 50 per cent of the true mass compared to $\sim$ 90 per cent for low mass clusters. When including a correction for non-thermal pressure, the spectroscopic hydrostatic mass estimates are less biased on average and the mass dependence of the bias is reduced, although the scatter in the measurements does increase.

preprint2019arXiv

Setting the scene for BUFFALO: A study of the matter distribution in the HFF galaxy cluster MACS J0416.1-2403 and its parallel field

In the context of the BUFFALO (Beyond Ultra-deep Frontier Fields And Legacy Observations) survey, we present a new analysis of the merging galaxy cluster MACS\,J0416.1-2403 ($z = 0.397$) and its parallel field using the data collected by the Hubble Frontier Fields (HFF) campaign. In this work, we measure the surface mass density from a weak-lensing analysis, and characterise the overall matter distribution in both the cluster and parallel fields. The surface mass distribution derived for the parallel field shows clumpy overdensities connected by filament-like structures elongated in the direction of the cluster core. We also characterise the X-ray emission of the cluster, and compare it with the lensing mass distribution. We identify five substructures at the $>5σ$ level over the two fields, four of them being in the cluster one. Furthermore, three of them are located close to the edges of the field of view, and border issues can significantly hamper the determination of their physical parameters. Finally, we compare our results with the predicted subhalo distribution of one of the Hydrangea/C-EAGLE simulated cluster. Significant differences are obtained suggesting the simulated cluster is at a more advanced evolutionary state than MACS\,J0416.1-2403. Our results anticipate the upcoming BUFFALO observations that will link the two HFF fields, extending further the \emph{HST} coverage, and thus allowing a better characterisation of the reported substructures.

preprint2016arXiv

SWIFT: Using task-based parallelism, fully asynchronous communication, and graph partition-based domain decomposition for strong scaling on more than 100,000 cores

We present a new open-source cosmological code, called SWIFT, designed to solve the equations of hydrodynamics using a particle-based approach (Smooth Particle Hydrodynamics) on hybrid shared/distributed-memory architectures. SWIFT was designed from the bottom up to provide excellent strong scaling on both commodity clusters (Tier-2 systems) and Top100-supercomputers (Tier-0 systems), without relying on architecture-specific features or specialized accelerator hardware. This performance is due to three main computational approaches: (1) Task-based parallelism for shared-memory parallelism, which provides fine-grained load balancing and thus strong scaling on large numbers of cores. (2) Graph-based domain decomposition, which uses the task graph to decompose the simulation domain such that the work, as opposed to just the data, as is the case with most partitioning schemes, is equally distributed across all nodes. (3) Fully dynamic and asynchronous communication, in which communication is modelled as just another task in the task-based scheme, sending data whenever it is ready and deferring on tasks that rely on data from other nodes until it arrives. In order to use these approaches, the code had to be re-written from scratch, and the algorithms therein adapted to the task-based paradigm. As a result, we can show upwards of 60% parallel efficiency for moderate-sized problems when increasing the number of cores 512-fold, on both x86-based and Power8-based architectures.