Researcher profile

Ulrich Rüde

Ulrich Rüde contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging · Verification L1 · Unclaimed author
7 works
0 followers
7 topics
4 close collaborators

Published work

7 published items

preprint · 2026 · arXiv

Conjugate Gradient Methods are Not Efficient: Experimental Study of the Locality Limitation

The convergence of the Conjugate Gradient method is subject to a locality limitation which imposes a lower bound on the number of iterations required before a qualitatively accurate approximation can be obtained. This limitation originates from the restricted transport of information in the graph induced by the sparsity pattern of the system matrix. In each iteration, information from the right-hand side can propagate only across directly connected graph nodes. The diameter of this graph therefore determines a minimum number of iterations that is necessary to achieve an acceptable level of accuracy.
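
The one-hop-per-iteration argument can be checked numerically. The following is a minimal sketch, not the experiment from the preprint: it assumes a 1D Laplacian (whose sparsity graph is a simple path), a right-hand side concentrated at node 0, and a zero initial guess. The function names are illustrative.

```python
import numpy as np

def apply_laplacian_1d(v):
    """Apply the tridiagonal matrix tridiag(-1, 2, -1) to v (assumed model problem)."""
    out = 2.0 * v
    out[1:] -= v[:-1]
    out[:-1] -= v[1:]
    return out

def cg_iterates(b, num_iters):
    """Run plain CG from x0 = 0 and return the iterates x_1, ..., x_num_iters."""
    x = np.zeros_like(b)
    r = b.copy()
    p = r.copy()
    rs_old = r @ r
    iterates = []
    for _ in range(num_iters):
        Ap = apply_laplacian_1d(p)
        alpha = rs_old / (p @ Ap)
        x = x + alpha * p
        r = r - alpha * Ap
        rs_new = r @ r
        p = r + (rs_new / rs_old) * p
        rs_old = rs_new
        iterates.append(x.copy())
    return iterates

n = 50
b = np.zeros(n)
b[0] = 1.0  # all right-hand-side information starts at node 0

iterates = cg_iterates(b, 10)
# Farthest node index carrying a nonzero value after each iteration.
reach = [int(np.flatnonzero(x).max()) for x in iterates]
print(reach)
```

Under these assumptions the nonzero front advances by one node per iteration, so a node at graph distance d from the source cannot receive a nonzero value before iterate d + 1.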

preprint · 2022 · arXiv

Effect of Sediment Form and Form Distribution on Porosity: A Simulation Study Based on the Discrete Element Method

Porosity is one of the key properties of dense particle packings like sediment deposits and is influenced by a multitude of grain characteristics such as their size distribution and shape. In the present work, we focus on the form, a specific aspect of the overall shape, of sedimentary grains in order to investigate and quantify its effect on porosity, ultimately deriving novel porosity-prediction models. To this end, we develop a robust and accurate simulation tool based on the discrete element method which we validate against laboratory experiments. Utilizing digital representations of actual sediment from the Rhine river, we first study packings that are composed of particles with a single form. There, the porosity is found to be mainly determined by the inverse equancy, i.e., the ratio of the longest to the smallest form-defining axis. Only for small ratios, additional shape-related properties become relevant, as revealed by a direct comparison to packings of form-equivalent ellipsoids. Since sediment naturally features form mixtures, we extend our simulation tool to study sediment packings with normally-distributed forms. In agreement with our single form studies, the porosity depends primarily on the inverse of the mean equancy. By supplying additional information about a second form factor and the standard deviations, we derive an accurate model for porosity prediction. Due to its simplicity, it can be readily applied to sediment packings for which some measurements of flatness and elongation, the two most common form factors, are available.
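
The two quantities at the center of this abstract are simple to compute. The sketch below is only an illustration: it uses the abstract's definition of inverse equancy (longest over smallest form-defining axis) and the standard definition of porosity as void fraction. It does not reproduce the fitted porosity-prediction model, and the function names are hypothetical.

```python
def inverse_equancy(longest, intermediate, shortest):
    """Ratio of the longest to the shortest form-defining axis (always >= 1)."""
    if not (longest >= intermediate >= shortest > 0):
        raise ValueError("axes must satisfy longest >= intermediate >= shortest > 0")
    return longest / shortest

def porosity(particle_volumes, sample_volume):
    """Void fraction of a packing: 1 - (total solid volume / sample volume)."""
    solid = sum(particle_volumes)
    if not (0 < solid <= sample_volume):
        raise ValueError("solid volume must be positive and fit inside the sample")
    return 1.0 - solid / sample_volume

# Hypothetical grain with axes 4 : 2 : 1 and a toy packing of three grains.
ratio = inverse_equancy(4.0, 2.0, 1.0)
phi = porosity([0.2, 0.2, 0.2], sample_volume=1.0)
```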

preprint · 2021 · arXiv

A massively parallel Eulerian-Lagrangian method for advection-dominated transport in viscous fluids

Motivated by challenges in Earth mantle convection, we present a massively parallel implementation of an Eulerian-Lagrangian method for the advection-diffusion equation in the advection-dominated regime. The advection term is treated by a particle-based characteristics method coupled to a block-structured finite-element framework. Its numerical and computational performance is evaluated in multiple two- and three-dimensional benchmarks, including curved geometries, discontinuous solutions, and pure advection, and it is applied to a coupled non-linear system modeling buoyancy-driven convection in Stokes flow. We demonstrate the parallel performance in strong and weak scaling experiments, scaling up to $147,456$ parallel processes and solving for more than $5.2 \times 10^{10}$ (52 billion) degrees of freedom per time step.
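
The idea behind a characteristics-based treatment of advection can be shown in a few lines. This is a deliberately simplified stand-in, not the method of the preprint: it assumes a 1D periodic grid, a constant velocity, and a grid-based semi-Lagrangian step with linear interpolation instead of the particle-based scheme on a finite-element mesh. All names are illustrative.

```python
import numpy as np

def semi_lagrangian_step(f, x, velocity, dt, length):
    """Advance df/dt + velocity * df/dx = 0 by one step on a periodic grid."""
    # Trace each grid point backward along its characteristic.
    departure = (x - velocity * dt) % length
    # The new value is the old field evaluated at the departure point
    # (periodic linear interpolation).
    return np.interp(departure, x, f, period=length)

n = 64
length = 1.0
x = np.arange(n) * (length / n)
f0 = np.exp(-200.0 * (x - 0.5) ** 2)  # smooth bump as initial condition

# With velocity * dt equal to one grid spacing, the profile moves one cell.
f1 = semi_lagrangian_step(f0, x, velocity=1.0, dt=length / n, length=length)
```

Because the update follows characteristics instead of differencing the advection term, the step size in this sketch is not bounded by a CFL-type stability condition, which is one reason such schemes are attractive in the advection-dominated regime.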

preprint · 2020 · arXiv

An efficient four-way coupled lattice Boltzmann - discrete element method for fully resolved simulations of particle-laden flows

A four-way coupling scheme for the direct numerical simulation of particle-laden flows is developed and analyzed. It employs a novel adaptive multi-relaxation time lattice Boltzmann method to simulate the fluid phase efficiently. The momentum exchange method is used to couple the fluid and the particulate phase. The particle interactions in the normal and tangential directions are accounted for by a discrete element method using linear contact forces. All parameters of the scheme are studied and evaluated in detail, and precise guidelines for their choice are developed. The development is based on several carefully selected calibration and validation tests of increasing physical complexity. It is found that a well-calibrated lubrication model is crucial to obtain the correct trajectories of a sphere colliding with a plane wall in a viscous fluid. To resolve the collision dynamics adequately, the collision time must be stretched appropriately. The complete set of tests establishes a validation pipeline that can be universally applied to other fluid-particle coupling schemes, providing a systematic methodology that can guide future developments.
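
A linear contact force in the normal direction is commonly written as a spring-dashpot law. The sketch below assumes that common form for two spheres; the abstract does not state the exact model, so the formula, the parameter names `stiffness` and `damping`, and the function name are assumptions. Tangential forces, lubrication corrections, and the fluid coupling are omitted.

```python
import numpy as np

def normal_contact_force(pos_i, pos_j, vel_i, vel_j,
                         radius_i, radius_j, stiffness, damping):
    """Force on sphere i from sphere j for a linear spring-dashpot normal contact."""
    delta = pos_i - pos_j
    distance = np.linalg.norm(delta)
    overlap = radius_i + radius_j - distance
    if overlap <= 0.0 or distance == 0.0:
        return np.zeros_like(delta)  # no contact (or degenerate geometry)
    normal = delta / distance  # unit vector pointing from j to i
    # Relative velocity along the normal; negative while the spheres approach.
    v_n = np.dot(vel_i - vel_j, normal)
    # Spring term pushes the spheres apart, dashpot term dissipates energy.
    magnitude = stiffness * overlap - damping * v_n
    return magnitude * normal

zero = np.zeros(3)
# Two unit spheres at rest, overlapping by 0.1 along the x axis.
force_rest = normal_contact_force(np.array([1.9, 0.0, 0.0]), zero, zero, zero,
                                  1.0, 1.0, stiffness=100.0, damping=5.0)
# Two unit spheres that are separated and therefore not in contact.
force_apart = normal_contact_force(np.array([2.5, 0.0, 0.0]), zero, zero, zero,
                                   1.0, 1.0, stiffness=100.0, damping=5.0)
```

In a law of this form, a softer spring lengthens the duration of a collision, which is one way to read the abstract's remark about stretching the collision time.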

preprint · 2020 · arXiv

lbmpy: Automatic code generation for efficient parallel lattice Boltzmann methods

Lattice Boltzmann methods are a popular mesoscopic alternative to macroscopic computational fluid dynamics solvers. Many variants have been developed that vary in complexity, accuracy, and computational cost. Extensions are available to simulate multi-phase, multi-component, turbulent, or non-Newtonian flows. In this work we present lbmpy, a code generation package that supports a wide variety of different methods and provides a generic development environment for new schemes as well. A high-level domain-specific language allows the user to formulate, extend and test various lattice Boltzmann schemes. The method specification is represented in a symbolic intermediate representation. Transformations that operate on this intermediate representation optimize and parallelize the method, yielding highly efficient lattice Boltzmann compute kernels not only for single- and two-relaxation-time schemes but also for multi-relaxation-time, cumulant, and entropically stabilized methods. An integration into the HPC framework waLBerla makes massively parallel, distributed simulations possible, which is demonstrated through scaling experiments on the SuperMUC-NG supercomputing system.
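
For orientation, this is roughly what the simplest kernel in that family computes. The sketch is a hand-written single-relaxation-time (BGK) collision on the D2Q9 lattice in plain NumPy. It does not use lbmpy or its API, and it is not generated code; the function names are illustrative.

```python
import numpy as np

# D2Q9 lattice: nine discrete velocities and their standard weights.
C = np.array([[0, 0], [1, 0], [0, 1], [-1, 0], [0, -1],
              [1, 1], [-1, 1], [-1, -1], [1, -1]], dtype=float)
W = np.array([4 / 9] + [1 / 9] * 4 + [1 / 36] * 4)

def equilibrium(rho, u):
    """Second-order equilibrium; rho has shape (...), u has shape (..., 2)."""
    cu = u @ C.T                                   # shape (..., 9)
    uu = np.sum(u * u, axis=-1, keepdims=True)     # shape (..., 1)
    return W * rho[..., None] * (1.0 + 3.0 * cu + 4.5 * cu**2 - 1.5 * uu)

def bgk_collide(f, omega):
    """Relax populations f (shape (..., 9)) toward equilibrium with rate omega."""
    rho = f.sum(axis=-1)                           # density
    u = (f @ C) / rho[..., None]                   # velocity
    return f + omega * (equilibrium(rho, u) - f)

rng = np.random.default_rng(0)
f = W * (1.0 + 0.05 * rng.random((4, 4, 9)))       # slightly perturbed rest state
f_post = bgk_collide(f, omega=1.6)
```

The collision leaves density and momentum unchanged in every cell, which is a quick sanity check for any hand-written or generated collision kernel.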

preprint · 2020 · arXiv

Stencil scaling for vector-valued PDEs on hybrid grids with applications to generalized Newtonian fluids

Matrix-free finite element implementations for large applications provide an attractive alternative to standard sparse matrix data formats due to the significantly reduced memory consumption. Here, we show that they are also competitive with respect to the run time in the low order case if combined with suitable stencil scaling techniques. We focus on variable coefficient vector-valued partial differential equations as they arise in many physical applications. The presented method is based on scaling constant reference stencils originating from a linear finite element discretization instead of evaluating the bilinear forms on-the-fly. This method assumes the usage of hierarchical hybrid grids, and it may be applied to vector-valued second-order elliptic partial differential equations directly or as a part of more complicated problems. We provide theoretical and experimental performance estimates showing the advantages of this new approach compared to the traditional on-the-fly integration and stored matrix approaches. In our numerical experiments, we consider two specific mathematical models, namely linear elastostatics and incompressible Stokes flow. The final example considers a non-linear shear-thinning generalized Newtonian fluid. For this type of non-linearity, we present an efficient approach to compute a regularized strain rate which is then used to define the node-wise viscosity. Depending on the compute architecture, we could observe maximum speedups of 64% and 122% compared to the on-the-fly integration. The largest considered example involved solving a Stokes problem with 12288 compute cores on the state-of-the-art supercomputer SuperMUC-NG.
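
The abstract does not spell out the scaling rule, so the following sketch assumes one plausible choice for a scalar 1D model problem -(k u')' = f with linear elements: each off-diagonal entry of the constant reference stencil is multiplied by the average of the nodal coefficients at its two end points, and the center entry is chosen so that the row sums to zero. The vector-valued operators and hierarchical hybrid grids of the preprint are not reproduced here, and the function name is hypothetical.

```python
import numpy as np

def scaled_stencils_1d(k_nodes, h):
    """Per-node 3-point stencils [west, center, east] for the interior nodes.

    Assumed rule: off-diagonal = mean of the two nodal coefficients times the
    reference entry -1/h; center = minus the sum of the off-diagonals.
    """
    k = np.asarray(k_nodes, dtype=float)
    ref_off = -1.0 / h                        # reference off-diagonal for k = 1
    west = 0.5 * (k[1:-1] + k[:-2]) * ref_off
    east = 0.5 * (k[1:-1] + k[2:]) * ref_off
    center = -(west + east)                   # enforce zero row sum
    return np.stack([west, center, east], axis=1)

# Constant coefficient: recovers the reference stencil [-1, 2, -1] / h.
constant = scaled_stencils_1d(np.ones(5), h=0.5)
# Linearly varying coefficient sampled at three nodes.
varying = scaled_stencils_1d([1.0, 2.0, 3.0], h=1.0)
```

The appeal of such a rule is that only nodal coefficient values and one constant stencil are needed per node, so no element-wise quadrature is evaluated on the fly and no matrix is stored.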

preprint · 2019 · arXiv

waLBerla: A block-structured high-performance framework for multiphysics simulations

Programming current supercomputers efficiently is a challenging task. Multiple levels of parallelism on the core, on the compute node, and between nodes need to be exploited to make full use of the system. Heterogeneous hardware architectures with accelerators further complicate the development process. waLBerla addresses these challenges by providing the user with highly efficient building blocks for developing simulations on block-structured grids. The block-structured domain partitioning is flexible enough to handle complex geometries, while the structured grid within each block allows for highly efficient implementations of stencil-based algorithms. We present several example applications realized with waLBerla, ranging from lattice Boltzmann methods to rigid particle simulations. Most importantly, these methods can be coupled together, enabling multiphysics simulations. The framework uses meta-programming techniques to generate highly efficient code for CPUs and GPUs from a symbolic method formulation. To ensure software quality and performance portability, a continuous integration toolchain automatically runs an extensive test suite encompassing multiple compilers, hardware architectures, and software configurations.