Researcher profile

Paul Mullowney

Paul Mullowney contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - Baseline
2works
0followers
4topics
3close collaborators

Actions

Decide how to stay connected

Follow researcher0

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2013arXiv

A GEMM interface and implementation on NVIDIA GPUs for multiple small matrices

We present an interface and an implementation of the General Matrix Multiply (GEMM) routine for multiple small matrices processed simultaneously on NVIDIA graphics processing units (GPUs). We focus on matrix sizes under 16. The implementation can be easily extended to larger sizes. For single precision matrices, our implementation is 30% to 600% faster than the batched cuBLAS implementation distributed in the CUDA Toolkit 5.0 on NVIDIA Tesla K20c. For example, we obtain 104 GFlop/s and 216 GFlop/s when multiplying 100,000 independent matrix pairs of size 10 and 16, respectively. Similar improvement in performance is obtained for other sizes, in single and double precision for real and complex types, and when the number of matrices is smaller. Apart from our implementation, our different function interface also plays an important role in the improved performance. Applications of this software include Finite Element computation on GPUs.

preprint2008arXiv

Chaotic Advection and the Emergence of Tori in the Küppers-Lortz State

Motivated by the roll-switching behavior observed in rotating Rayleigh-Bénard convection, we define a Küppers-Lortz (K-L) state as a volume-preserving flow with periodic roll switching. For an individual roll state, the Lagrangian particle trajectories are periodic. In a system with roll-switching, the particles can exhibit three-dimensional, chaotic motion. We study a simple phenomenological map that models the Lagrangian dynamics in a K-L state. When the roll axes differ by $120^{\circ}$ in the plane of rotation, we show that the phase space is dominated by invariant tori if the ratio of switching time to roll turnover time is small. When this parameter approaches zero these tori limit onto the classical hexagonal convection patterns, and, as it gets large, the dynamics becomes fully chaotic and well-mixed. For intermediate values, there are interlinked toroidal and poloidal structures separated by chaotic regions. We also compute the exit time distributions and show that the unbounded chaotic orbits are normally diffusive. Although the map presumes instantaneous switching between roll states, we show that the qualitative features of the flow persist when the model has smooth, overlapping time-dependence for the roll amplitudes (the Busse-Heikes model).