Aug 8 – 13, 2022
Hörsaalzentrum Poppelsdorf
Europe/Berlin timezone

Avoiding the Jam

Aug 8, 2022, 4:30 PM
CP1-HSZ/0.002 (CP1-HSZ) - HS4 (CP1-HSZ)

CP1-HSZ/0.002 (CP1-HSZ) - HS4


Show room on map
Oral Presentation Software development and Machines Software development and Machines


Mathias Wagner


Bandwidth and latencies are central performance limiters for Lattice QCD. To overcome bandwidth limiters one way is to reduce the number of bits need by e.g., mixed precision solvers. These provide great speedups but increase the relative importance of latency limiters. We discuss techniques that QUDA uses to reduce latencies from GPU-CPU and GPU-network transfers and their impact for strong-scaling HMC simulations, where these matter most.

Primary authors

Mathias Wagner Kate Clark (NVIDIA)

Presentation materials