UPS and electric maintenance is scheduled for Wednesday, November 27th, 2024, 08:30 - 12:00. A downtime of this service might occur for up to 30 minutes.

8–13 Aug 2022
Hörsaalzentrum Poppelsdorf
Europe/Berlin timezone

Avoiding the Jam

8 Aug 2022, 16:30
20m
CP1-HSZ/0.002 (CP1-HSZ) - HS4 (CP1-HSZ)

CP1-HSZ/0.002 (CP1-HSZ) - HS4

CP1-HSZ

50
Show room on map
Oral Presentation Software development and Machines Software development and Machines

Speaker

Mathias Wagner

Description

Bandwidth and latencies are central performance limiters for Lattice QCD. To overcome bandwidth limiters one way is to reduce the number of bits need by e.g., mixed precision solvers. These provide great speedups but increase the relative importance of latency limiters. We discuss techniques that QUDA uses to reduce latencies from GPU-CPU and GPU-network transfers and their impact for strong-scaling HMC simulations, where these matter most.

Primary authors

Mathias Wagner Kate Clark (NVIDIA)

Presentation materials