Researcher profile

Bernd Meyer

Bernd Meyer contributes to research discovery and scholarly infrastructure.

Researcher; affiliation not imported; open to collaborate

Trust snapshot

Quick read

Trust 17 - Unverified; Verification L1; Unclaimed author
4 works
0 followers
7 topics
4 close collaborators

Published work

4 published item(s)

preprint, 2022, arXiv

Pruning vs XNOR-Net: A Comprehensive Study of Deep Learning for Audio Classification on Edge-devices

Deep learning has celebrated resounding successes in many application areas of relevance to the Internet of Things (IoT), such as computer vision and machine listening. These technologies must ultimately be brought directly to the edge to fully harness the power of deep learning for the IoT. The obvious challenge is that deep learning techniques can only be implemented on strictly resource-constrained edge devices if the models are radically downsized. This task relies on different model compression techniques, such as network pruning, quantization, and the recent advancement of XNOR-Net. This study examines the suitability of these techniques for audio classification on microcontrollers. We present an application of XNOR-Net for end-to-end raw audio classification and a comprehensive empirical study comparing this approach with pruning-and-quantization methods. We show that raw audio classification with XNOR yields comparable performance to regular full-precision networks for small numbers of classes while reducing memory requirements 32-fold and computation requirements 58-fold. However, as the number of classes increases significantly, performance degrades, and pruning-and-quantization-based compression becomes the preferred technique: it satisfies the same space constraints but requires approximately 8x more computation. We show that these insights are consistent between raw audio classification and image classification using standard benchmark sets. To the best of our knowledge, this is the first study to apply XNOR to end-to-end audio classification and evaluate it in the context of alternative techniques. All code is publicly available on GitHub.

preprint, 2022, arXiv

Structural reorientation and compaction of porous MoS2 coatings during wear testing

Industrial upscaling frequently results in a different coating microstructure than the laboratory prototypes presented in the literature. Here, we investigate the wear behavior of two physical vapor deposited (PVD) MoS2 coatings: a dense, nanocrystalline MoS2 coating and a porous, prismatic-textured MoS2 coating. Transmission electron microscopy (TEM) investigations before and after wear testing show a crystallographic reorientation towards a basal texture in both samples. A basal texture is usually desirable due to its low-friction properties. This favorable reorientation is associated with a tribological compaction of the porous specimens. Following running-in, sliding under high contact pressure ultimately leads to a wear rate as small as that of an ideal bulk MoS2 single crystal grown by chemical vapor deposition (CVD). This suggests that the imperfections of industrial-grade MoS2 coatings can be remediated by a suitable pretreatment.

preprint, 2020, arXiv

Integrating State of the Art Compute, Communication, and Autotuning Strategies to Multiply the Performance of the Application Programm CPMD for Ab Initio Molecular Dynamics Simulations

We present our recent code modernizations of the ab initio molecular dynamics program CPMD (www.cpmd.org) with a special focus on the ultra-soft pseudopotential (USPP) code path. Following the internal instrumentation of CPMD, all time-critical routines have been revised to maximize the computational throughput and to minimize the communication overhead for optimal performance. Throughout the program, missing hybrid MPI+OpenMP parallelization has been added to optimize scaling. For communication-intensive routines, such as the multiple distributed 3d FFTs of the electronic states and distributed matrix-matrix multiplications related to the $β$-projectors of the pseudopotentials, this MPI+OpenMP parallelization now overlaps computation and communication. The necessary partitioning of the workload is optimized by an auto-tuning algorithm. In addition, the largest global MPI_Allreduce operation has been replaced by highly tuned node-local parallelized operations using MPI shared-memory windows to avoid inter-node communication. A batched algorithm for the multiple 3d FFTs improves the throughput of the MPI_Alltoall communication and, thus, the scalability of the implementation, both for USPP and for the frequently used norm-conserving pseudopotential code path. The enhanced performance and scalability are demonstrated on a mid-sized benchmark system of 256 water molecules and on further water systems ranging from 32 to 2048 molecules.

preprint, 2019, arXiv

Perfect and controllable nesting in the small angle twist bilayer graphene

Parallel ("nested") regions of a Fermi surface (FS) drive instabilities of the electron fluid, for example the spin density wave in elemental chromium. In one-dimensional materials, the FS is trivially fully nested (a single nesting vector connects two "Fermi dots"), while in higher dimensions only a fraction of the FS consists of parallel sheets. We demonstrate that the tiny-angle regime of twist bilayer graphene (TBLG) possesses a phase, accessible by interlayer bias, in which the FS consists entirely of nestable "Fermi lines": the first example of a completely nested FS in a 2d material. This nested phase is found in both the ideal and the relaxed structure of the twist bilayer. We demonstrate excellent agreement with recent STM images of topological states in this material and elucidate the connection between these and the underlying Fermiology. We show that the geometry of the "Fermi lines" network is controllable by the strength of the applied interlayer bias, and thus that TBLG offers unprecedented access to the physics of FS nesting in 2d materials.