Researcher profile

Andreas Vogel

Andreas Vogel contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
5topics
3close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2020arXiv

Accelerating Geometric Multigrid Preconditioning with Half-Precision Arithmetic on GPUs

With the hardware support for half-precision arithmetic on NVIDIA V100 GPUs, high-performance computing applications can benefit from lower precision at appropriate spots to speed up the overall execution time. In this paper, we investigate a mixed-precision geometric multigrid method to solve large sparse systems of equations stemming from discretization of elliptic PDEs. While the final solution is always computed with high-precision accuracy, an iterative refinement approach with multigrid preconditioning in lower precision and residuum scaling is employed. We compare the FP64 baseline for Poisson's equation to purely FP16 multigrid preconditioning and to the employment of FP16-FP32-FP64 combinations within a mesh hierarchy. While the iteration count is almost not affected by using lower accuracy, the solver runtime is considerably decreased due to the reduced memory transfer and a speedup of up to 2.5x is gained for the overall solver. We investigate the performance of selected kernels with the hierarchical Roofline model.

preprint2020arXiv

Parallel 3d shape optimization for cellular composites on large distributed-memory clusters

Skin modeling is an ongoing research area that highly benefits from modern parallel algorithms. This article aims at applying shape optimization to compute cell size and arrangement for elastic energy minimization of a cellular composite material model for the upper layer of the human skin. A gradient-penalized shape optimization algorithm is employed and tested on the distributed-memory cluster Hazel Hen, HLRS, Germany. The performance of the algorithm is studied in two benchmark tests. First, cell structures are optimized with respect to purely geometric aspects. The model is then extended such that the composite is optimized to withstand applied deformations. In both settings, the algorithm is investigated in terms of weak and strong scalability. The results for the geometric test reflect Kelvin's conjecture that the optimal space-filling design of cells with minimal surface is given by tetrakaidecahedrons. The PDE-constrained test case is chosen in order to demonstrate the influence of the deformation gradient penalization on fine inter-cellular channels in the composite and its influence on the multigrid convergence. A scaling study is presented for up to 12,288 cores and 3 billion DoFs.