
Optimizing transformations of stencil operations for parallel object-oriented scientific frameworks on cache-based architectures


Author(s) : Federico Bassetti, Kei Davis
Publisher : N/A
Publication Date : 1998
ISSN : N/A
Abstract : This paper describes a new technique for optimizing serial and parallel stencil and stencil-like operations for cache-based architectures. The technique takes advantage of the semantic knowledge implicit in stencil-like computations. It is implemented as a source-to-source program transformation; because of its specificity, it could not be expected of a conventional compiler. Empirical results demonstrate a uniform factor-of-two speedup. The experiments clearly show the benefits of the technique to be a consequence, as intended, of the reduction in cache misses. The test codes are based on a 5-point stencil obtained by discretizing the Poisson equation, applied to a two-dimensional uniform grid with the Jacobi method as the iterative solver. Results are presented for 1-D and 2-D tilings on a single processor. For the parallel case, both blocking and non-blocking communication have been tested; however, the parallel case is not discussed here.
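
For readers unfamiliar with the test problem, the following is a minimal sketch of one Jacobi sweep of the 5-point stencil, together with a 1-D tiled variant of the same sweep. It is not the authors' transformation, which the abstract does not specify; it only illustrates the loop structure that tiling introduces. The function names, the `tile` parameter, the list-of-lists grid layout, and the sign convention (the Poisson equation written as ∇²u = f with grid spacing `h`) are all assumptions made for this example.

```python
def jacobi_sweep(u, f, h):
    """One untiled Jacobi sweep of the 5-point stencil (assumes laplacian(u) = f)."""
    n, m = len(u), len(u[0])
    new = [row[:] for row in u]  # boundary values are carried over unchanged
    for i in range(1, n - 1):
        for j in range(1, m - 1):
            new[i][j] = 0.25 * (u[i - 1][j] + u[i + 1][j]
                                + u[i][j - 1] + u[i][j + 1]
                                - h * h * f[i][j])
    return new


def jacobi_sweep_tiled(u, f, h, tile=4):
    """The same sweep with a 1-D tiling of the column index (hypothetical variant).

    Interior columns are visited in strips of width `tile`, so each strip
    touches a narrower band of every row before moving on.
    """
    n, m = len(u), len(u[0])
    new = [row[:] for row in u]
    for j0 in range(1, m - 1, tile):      # start column of each strip
        j1 = min(j0 + tile, m - 1)        # end column (exclusive), clipped to interior
        for i in range(1, n - 1):
            for j in range(j0, j1):
                new[i][j] = 0.25 * (u[i - 1][j] + u[i + 1][j]
                                    + u[i][j - 1] + u[i][j + 1]
                                    - h * h * f[i][j])
    return new
```

Both functions evaluate the same expression at every interior point, so they return identical grids; only the traversal order differs. In pure Python the reordering gives no measurable cache benefit, so the sketch shows the shape of the loops rather than the speedup reported in the abstract.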