More iteration space tiling.

Michael Wolfe
1989
1 reference

Abstract

Subdividing the iteration space of a loop into blocks or tiles with a fixed maximum size has several advantages. Tiles become a natural candidate as the unit of work for parallel task scheduling. Synchronization between processors can be done between tiles, reducing synchronization frequency (at some loss of potential parallelism). The shape and size of a tile can be optimized to take advantage of memory locality for memory hierarchy utilization. Vectorization and register locality naturally fits into the optimization within a tile, while parallelization and cache locality fits into optimization between tiles.

1 repository
1 reference

Code References

â–¶ triton-lang/triton
1 file
â–¶ docs/programming-guide/chapter-2/related-work.rst
1
.. [WOLFE1989] M. Wolfe, "More Iteration Space Tiling", SC 1989
Link copied to clipboard!