Researcher profile

Giorgio Amati

Giorgio Amati contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 19 - Unverified
Verification L1
Unclaimed author
5 works
0 followers
6 topics
4 close collaborators

Published work

5 published items

Preprint, 2022, arXiv

On the accuracy and performance of the lattice Boltzmann method with 64-bit, 32-bit and novel 16-bit number formats

Fluid dynamics simulations with the lattice Boltzmann method (LBM) are very memory-intensive. Alongside a reduction in memory footprint, significant performance benefits can be achieved by using FP32 (single) precision instead of FP64 (double) precision, especially on GPUs. Here, we evaluate the possibility of using even FP16 and Posit16 (half) precision for storing fluid populations, while still carrying out arithmetic operations in FP32. For this, we first show that the commonly occurring number range in the LBM is much smaller than the FP16 number range. Based on this observation, we develop novel 16-bit formats, based on a modified IEEE-754 and on a modified Posit standard, that are specifically tailored to the needs of the LBM. We then carry out an in-depth characterization of LBM accuracy for six test systems of increasing complexity: Poiseuille flow, Taylor-Green vortices, Karman vortex streets, lid-driven cavity, a microcapsule in shear flow (utilizing the immersed-boundary method) and finally the impact of a raindrop (based on a Volume-of-Fluid approach). We find that the difference in accuracy between FP64 and FP32 is negligible in almost all cases, and that for a large number of cases even 16-bit is sufficient. Finally, we provide a detailed performance analysis of all precision levels on a large number of hardware microarchitectures and show that a significant speedup is achieved with mixed FP32/16-bit.
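
The idea of storing populations in 16 bits while doing arithmetic at higher precision can be illustrated with a minimal sketch. It is not taken from the paper's code: it uses standard IEEE-754 FP16 rather than the paper's custom formats, Python's built-in float (FP64) stands in for the FP32 arithmetic, and the names `to_fp16`, `from_fp16`, `roundtrip_error`, the D2Q9-style weight 4/9 and the perturbation size are all assumptions chosen for illustration. The sketch compares storing a population directly with storing only its deviation from a constant offset, which keeps the stored number small and therefore in a range where FP16 has finer spacing.

```python
import struct


def to_fp16(x: float) -> bytes:
    """Round x to IEEE-754 binary16 and return its 2-byte storage form."""
    return struct.pack("<e", x)


def from_fp16(b: bytes) -> float:
    """Decode a 2-byte binary16 value into a Python float for arithmetic."""
    return struct.unpack("<e", b)[0]


def roundtrip_error(value: float, offset: float = 0.0) -> float:
    """Absolute error after storing (value - offset) in FP16 and restoring it."""
    restored = from_fp16(to_fp16(value - offset)) + offset
    return abs(restored - value)


# Hypothetical population: a rest weight of 4/9 plus a small perturbation.
w0 = 4.0 / 9.0
f0 = w0 + 1.0e-4

# Store the population as-is (2 bytes instead of 4 or 8).
plain_error = roundtrip_error(f0)

# Store only the deviation from the constant weight; the weight is added
# back in higher precision after loading.
shifted_error = roundtrip_error(f0, offset=w0)

print(f"plain   FP16 storage error: {plain_error:.3e}")
print(f"shifted FP16 storage error: {shifted_error:.3e}")
```

In this toy case the shifted variant loses far less precision, because FP16 spacing shrinks with the magnitude of the stored value. Whether such a shift is adequate for a given flow depends on the actual range of the populations, which is what the abstract's range analysis addresses.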

Preprint, 2021, arXiv

LBcuda: a high-performance CUDA port of LBsoft for simulation of colloidal systems

We present LBcuda, a GPU-accelerated version of LBsoft, our open-source MPI-based software for the simulation of multi-component colloidal flows. We describe the design principles, the optimization and the resulting performance compared to the CPU version, using both an average-cost GPU and high-end NVIDIA GPU cards (V100 and the latest A100). The results show a substantial acceleration of the fluid solver, reaching up to 200 GLUPS (Giga Lattice Updates Per Second) on a cluster of 512 NVIDIA A100 cards simulating a grid of eight billion lattice points. These results open attractive prospects for the computational design of new materials based on colloidal particles.

Preprint, 2020, arXiv

LBsoft: a parallel open-source software for simulation of colloidal systems

We present LBsoft, an open-source software developed mainly to simulate the hydrodynamics of colloidal systems based on the concurrent coupling between lattice Boltzmann methods for the fluid and discrete particle dynamics for the colloids. Such coupling has been developed before but, to the best of our knowledge, no detailed discussion of the programming issues to be faced in order to attain an efficient implementation on parallel architectures has been presented to date. In this paper, we describe in detail the underlying multi-scale models and their coupling procedure, along with the relevant input variables, to facilitate third-party usage. The code is designed to exploit parallel computing platforms, also taking advantage of the recent AVX-512 instruction set. We focus on LBsoft structure, functionality, parallel implementation, performance and availability, so as to facilitate access to this computational tool for the research community in the field. The capabilities of LBsoft are highlighted for a number of prototypical case studies, such as Pickering emulsions and bicontinuous systems, as well as an original study of the coarsening process in confined bijels under shear.

Preprint, 2020, arXiv

Towards Exascale Lattice Boltzmann computing

We discuss the state of the art of Lattice Boltzmann (LB) computing, with special focus on prospective LB schemes capable of meeting the forthcoming Exascale challenge. After reviewing the basic notions of LB computing, we discuss current techniques to improve the performance of LB codes on parallel machines and illustrate selected leading-edge applications in the Petascale range. Finally, we put forward a few ideas on how to improve the communication/computation overlap in current large-scale LB simulations, as well as possible strategies towards fault-tolerant LB schemes.