VSL Publications

Dynamic Barrier Relaxations for Explicit Stencil Computations

Adam Hammouda, Andrew R. Siegel, and Stephen F. Siegel, Dynamic Barrier Relaxations for Explicit Stencil Computations, Technical Report UD-CIS-2013/002, Department of Computer & Information Sciences, University of Delaware, December 11, 2013. Submitted for publication
Next generation HPC computing platforms are likely to be characterized by significant, unpredictable non-uniformities in execution time among compute nodes and cores. These inherent load imbalances are expected to arise from a variety of sources---manufacturing discrepancies, dynamic power management, runtime component failure, OS jitter, software mediated resiliency, and TLB/- cache performance variations, etc. It is well understood that existing algorithms with frequent points of bulk synchronization will perform relatively poorly in the presence of these performance fluctuations. Thus, re-casting classic bulk-synchronous algorithms into more asynchronous, course grained parallelism is a critical area of research for next generation computing. In the present analysis we propose a robust class of parallel algorithms for explicit stencil computations in the presence of such non-uniformities in process execution time. These algorithms are benchmarked using the 2D heat equation as a model problem, and they are tested in the presence of simulated nonuniform computational noise. The performance is compared to a classic bulk synchronous implementation of the model problem.
  1. fault-tolerant_tr_2013.pdf (paper)
  2. resilient_experiments.tgz (experimental archive with source code)
Related Links
	author = {Adam Hammouda and Andrew R. Siegel and Stephen F. Siegel},
	title = {Dynamic Barrier Relaxations for Explicit Stencil Computations},
	institution = {Department of Computer and Information Sciences,
	University of Delaware},
	year = {2013},
	number = {UDEL-CIS-2013/002}

VSL | Publications