Source author record

Andreas Pieper

Andreas Pieper appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.mes-hall Distributed, Parallel, and Cluster Computing Computational Engineering, Finance, and Science cond-mat.str-el math.AG math.NA Mathematical Software Numerical Analysis Performance physics.comp-ph

Catalog footprint

What is connected

7works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Constructing all Genus 2 Curves with Supersingular Jacobian

L. Moret-Bailly constructed families $\mathfrak{C}\rightarrow \mathbb{P}^1$ of genus 2 curves with supersingular jacobian. In this paper we first classify the reducible fibers of a Moret-Bailly family using linear algebra over a quaternion algebra. The main result is an algorithm that exploits properties of two reducible fibers to compute a hyperelliptic model for any irreducible fiber of a Moret-Bailly family.

preprint2016arXiv

Electron confinement in graphene with gate-defined quantum dots

We theoretically analyse the possibility to electrostatically confine electrons in circular quantum dot arrays, impressed on contacted graphene nanoribbons by top gates. Utilising exact numerical techniques, we compute the scattering efficiency of a single dot and demonstrate that for small-sized scatterers the cross-sections are dominated by quantum effects, where resonant scattering leads to a series of quasi-bound dot states. Calculating the conductance and the local density of states for quantum dot superlattices we show that the resonant carrier transport through such graphene-based nanostructures can be easily tuned by varying the gate voltage.

preprint2016arXiv

GHOST: Building blocks for high performance sparse linear algebra on heterogeneous systems

While many of the architectural details of future exascale-class high performance computer systems are still a matter of intense research, there appears to be a general consensus that they will be strongly heterogeneous, featuring "standard" as well as "accelerated" resources. Today, such resources are available as multicore processors, graphics processing units (GPUs), and other accelerators such as the Intel Xeon Phi. Any software infrastructure that claims usefulness for such environments must be able to meet their inherent challenges: massive multi-level parallelism, topology, asynchronicity, and abstraction. The "General, Hybrid, and Optimized Sparse Toolkit" (GHOST) is a collection of building blocks that targets algorithms dealing with sparse matrix representations on current and future large-scale systems. It implements the "MPI+X" paradigm, has a pure C interface, and provides hybrid-parallel numerical kernels, intelligent resource management, and truly heterogeneous parallelism for multicore CPUs, Nvidia GPUs, and the Intel Xeon Phi. We describe the details of its design with respect to the challenges posed by modern heterogeneous supercomputers and recent algorithmic developments. Implementation details which are indispensable for achieving high efficiency are pointed out and their necessity is justified by performance measurements or predictions based on performance models. The library code and several applications are available as open source. We also provide instructions on how to make use of GHOST in existing software packages, together with a case study which demonstrates the applicability and performance of GHOST as a component within a larger software stack.

preprint2016arXiv

High-performance implementation of Chebyshev filter diagonalization for interior eigenvalue computations

We study Chebyshev filter diagonalization as a tool for the computation of many interior eigenvalues of very large sparse symmetric matrices. In this technique the subspace projection onto the target space of wanted eigenvectors is approximated with filter polynomials obtained from Chebyshev expansions of window functions. After the discussion of the conceptual foundations of Chebyshev filter diagonalization we analyze the impact of the choice of the damping kernel, search space size, and filter polynomial degree on the computational accuracy and effort, before we describe the necessary steps towards a parallel high-performance implementation. Because Chebyshev filter diagonalization avoids the need for matrix inversion it can deal with matrices and problem sizes that are presently not accessible with rational function methods based on direct or iterative linear solvers. To demonstrate the potential of Chebyshev filter diagonalization for large-scale problems of this kind we include as an example the computation of the $10^2$ innermost eigenpairs of a topological insulator matrix with dimension $10^9$ derived from quantum physics applications.

preprint2016arXiv

Topological insulators in random potentials

We investigate the effects of magnetic and nonmagnetic impurities on the two-dimensional surface states of three-dimensional topological insulators (TIs). Modeling weak and strong TIs using a generic four-band Hamiltonian, which allows for a breaking of inversion and time-reversal symmetries and takes into account random local potentials as well as the Zeeman and orbital effects of external magnetic fields, we compute the local density of states, the single-particle spectral function, and the conductance for a (contacted) slab geometry by numerically exact techniques based on kernel polynomial expansion and Green's function approaches. We show that bulk disorder refills the suface-state Dirac gap induced by a homogeneous magnetic field with states, whereas orbital (Peierls-phase) disorder perserves the gap feature. The former effect is more pronounced in weak TIs than in strong TIs. At moderate randomness, disorder-induced conducting channels appear in the surface layer, promoting diffusive metallicity. Random Zeeman fields rapidly destroy any conducting surface states. Imprinting quantum dots on a TI's surface, we demonstrate that carrier transport can be easily tuned by varying the gate voltage, even to the point where quasi-bound dot states may appear.

preprint2015arXiv

Building a fault tolerant application using the GASPI communication layer

It is commonly agreed that highly parallel software on Exascale computers will suffer from many more runtime failures due to the decreasing trend in the mean time to failures (MTTF). Therefore, it is not surprising that a lot of research is going on in the area of fault tolerance and fault mitigation. Applications should survive a failure and/or be able to recover with minimal cost. MPI is not yet very mature in handling failures, the User-Level Failure Mitigation (ULFM) proposal being currently the most promising approach is still in its prototype phase. In our work we use GASPI, which is a relatively new communication library based on the PGAS model. It provides the missing features to allow the design of fault-tolerant applications. Instead of introducing algorithm-based fault tolerance in its true sense, we demonstrate how we can build on (existing) clever checkpointing and extend applications to allow integrate a low cost fault detection mechanism and, if necessary, recover the application on the fly. The aspects of process management, the restoration of groups and the recovery mechanism is presented in detail. We use a sparse matrix vector multiplication based application to perform the analysis of the overhead introduced by such modifications. Our fault detection mechanism causes no overhead in failure-free cases, whereas in case of failure(s), the failure detection and recovery cost is of reasonably acceptable order and shows good scalability.

preprint2015arXiv

Performance Engineering of the Kernel Polynomial Method on Large-Scale CPU-GPU Systems

The Kernel Polynomial Method (KPM) is a well-established scheme in quantum physics and quantum chemistry to determine the eigenvalue density and spectral properties of large sparse matrices. In this work we demonstrate the high optimization potential and feasibility of peta-scale heterogeneous CPU-GPU implementations of the KPM. At the node level we show that it is possible to decouple the sparse matrix problem posed by KPM from main memory bandwidth both on CPU and GPU. To alleviate the effects of scattered data access we combine loosely coupled outer iterations with tightly coupled block sparse matrix multiple vector operations, which enables pure data streaming. All optimizations are guided by a performance analysis and modelling process that indicates how the computational bottlenecks change with each optimization step. Finally we use the optimized node-level KPM with a hybrid-parallel framework to perform large scale heterogeneous electronic structure calculations for novel topological materials on a petascale-class Cray XC30 system.

Andreas Pieper

What is connected

Connect this record

See the researcher in context

Building this map preview

7 published item(s)

Constructing all Genus 2 Curves with Supersingular Jacobian

Electron confinement in graphene with gate-defined quantum dots

GHOST: Building blocks for high performance sparse linear algebra on heterogeneous systems

High-performance implementation of Chebyshev filter diagonalization for interior eigenvalue computations

Topological insulators in random potentials

Building a fault tolerant application using the GASPI communication layer

Performance Engineering of the Kernel Polynomial Method on Large-Scale CPU-GPU Systems