Source author record

Giorgio Amati

Giorgio Amati appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

physics.comp-ph physics.flu-dyn Biological Physics comp-gas cond-mat.mes-hall cond-mat.other cond-mat.soft cond-mat.stat-mech Distributed, Parallel, and Cluster Computing nlin.CG

Catalog footprint

What is connected

7works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

On the accuracy and performance of the lattice Boltzmann method with 64-bit, 32-bit and novel 16-bit number formats

Fluid dynamics simulations with the lattice Boltzmann method (LBM) are very memory-intensive. Alongside reduction in memory footprint, significant performance benefits can be achieved by using FP32 (single) precision compared to FP64 (double) precision, especially on GPUs. Here, we evaluate the possibility to use even FP16 and Posit16 (half) precision for storing fluid populations, while still carrying arithmetic operations in FP32. For this, we first show that the commonly occurring number range in the LBM is a lot smaller than the FP16 number range. Based on this observation, we develop novel 16-bit formats - based on a modified IEEE-754 and on a modified Posit standard - that are specifically tailored to the needs of the LBM. We then carry out an in-depth characterization of LBM accuracy for six different test systems with increasing complexity: Poiseuille flow, Taylor-Green vortices, Karman vortex streets, lid-driven cavity, a microcapsule in shear flow (utilizing the immersed-boundary method) and finally the impact of a raindrop (based on a Volume-of-Fluid approach). We find that the difference in accuracy between FP64 and FP32 is negligible in almost all cases, and that for a large number of cases even 16-bit is sufficient. Finally, we provide a detailed performance analysis of all precision levels on a large number of hardware microarchitectures and show that significant speedup is achieved with mixed FP32/16-bit.

preprint2021arXiv

LBcuda: a high-performance CUDA port of LBsoft for simulation of colloidal systems

We present LBcuda, a GPU accelerated version of LBsoft, our open-source MPI-based software for the simulation of multi-component colloidal flows. We describe the design principles, the optimization and the resulting performance as compared to the CPU version, using both an average cost GPU and high-end NVidia GPU cards (V100 and the latest A100). The results show a substantial acceleration for the fluid solver reaching up to 200 GLUPS (Giga Lattice Updates Per Second) on a cluster made of 512 A100 NVIDIA cards simulating a grid of eight billion lattice points. These results open attractive prospects for the computational design of new materials based on colloidal particles.

preprint2020arXiv

LBsoft: a parallel open-source software for simulation of colloidal systems

We present LBsoft, an open-source software developed mainly to simulate the hydro-dynamics of colloidal systems based on the concurrent coupling between lattice Boltzmann methods for the fluid and discrete particle dynamics for the colloids. Such coupling has been developed before, but, to the best of our knowledge, no detailed discussion of the programming issues to be faced in order to attain efficient implementation on parallel architectures, has ever been presented to date. In this paper, we describe in detail the underlying multi-scale models, their coupling procedure, along side with a description of the relevant input variables, to facilitate third-parties usage. The code is designed to exploit parallel computing platforms, taking advantage also of the recent AVX-512 instruction set. We focus on LBsoft structure, functionality, parallel implementation, performance and availability, so as to facilitate the access to this computational tool to the research community in the field. The capabilities of LBsoft are highlighted for a number of prototypical case studies, such as pickering emulsions, bicontinuous systems, as well as an original study of the coarsening process in confined bijels under shear.

preprint2020arXiv

Towards Exascale Design of Soft Mesoscale Materials

We provide a brief survey of our current developments in the simulation-based design of novel families of mesoscale porous materials using computational kinetic theory. Prospective applications on exascale computers are also briefly discussed and commented on, with reference to two specific examples of soft mesoscale materials: microfluid crystals and bi-continuous jels

preprint2020arXiv

Towards Exascale Lattice Boltzmann computing

We discuss the state of art of Lattice Boltzmann (LB) computing, with special focus on prospective LB schemes capable of meeting the forthcoming Exascale challenge. After reviewing the basic notions of LB computing, we discuss current techniques to improve the performance of LB codes on parallel machines and illustrate selected leading-edge applications in the Petascale range. Finally, we put forward a few ideas on how to improve the communication/computation overlap in current large-scale LB simulations, as well as possible strategies towards fault-tolerant LB schemes.

preprint2013arXiv

Stick-Slip Sliding of Water Drops on Chemically Heterogeneous Surfaces

We present a comprehensive study of water drops sliding down chemically heterogeneous surfaces formed by a periodic pattern of alternating hydrophobic and hydrophilic stripes. Drops are found to undergo a stick-slip motion whose average speed is an order of magnitude smaller than that measured on a homogeneous surface having the same static contact angle. This motion is the result of the periodic deformations of the drop interface when crossing the stripes. Numerical simulations confirm this view and are used to elucidate the principles underlying the experimental observations.

preprint1996arXiv

Turbulent channel flow simulations using a coarse-grained extension of the Lattice Boltzmann method

A coarse-grained version of the Lattice Boltzmann (LB) method is developed with the intent of enhancing its geometrical flexibility so as to be able to tackle a wider class of flows of engineering interest. To this purpose, the original uniform LB technique is combined with standard finite-volume techniques based upon a blend of piecewise constant and piecewise linear interpolation schemes. A series of validation tests for the three dimensional channel flow with one-dimensional (cross-channel) statistical behaviour are presented. The main conclusion is that, although the method does indeed mark a significant stride forward with respect to the original uniform LB scheme, better interpolation schemes should be developed before the coarse-grain LB can become fully competitive with modern CFD schemes.

Giorgio Amati

What is connected

Connect this record

See the researcher in context

Building this map preview

7 published item(s)

On the accuracy and performance of the lattice Boltzmann method with 64-bit, 32-bit and novel 16-bit number formats

LBcuda: a high-performance CUDA port of LBsoft for simulation of colloidal systems

LBsoft: a parallel open-source software for simulation of colloidal systems

Towards Exascale Design of Soft Mesoscale Materials

Towards Exascale Lattice Boltzmann computing

Stick-Slip Sliding of Water Drops on Chemically Heterogeneous Surfaces

Turbulent channel flow simulations using a coarse-grained extension of the Lattice Boltzmann method