Within the ScalES project, DKRZ is investigating ways to reduce load imbalances that prevent applications from scaling to high numbers of processors. One common source of load imbalance is gathering a distributed array, i.e., collecting the subarrays held by each process on a single process (the target).
A naive approach implements only the unavoidable data transfer from all processes to the target. It does not account for the overhead of inserting the subarray data into the global array, which is concentrated entirely at the target.
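The insertion overhead can be illustrated with a small serial simulation (a sketch, not the project's actual MPI implementation; the 2-D array shape, process grid, and block decomposition are illustrative assumptions). In a row-major global array, each process's block occupies many short, noncontiguous row segments, and the naive scheme makes the target write every one of them:

```python
# Serial simulation of the naive gather: every "process" sends its
# block to the target, which scatters each block's rows into the
# row-major global array -- all insertion work lands on one process.

R, C = 4, 6          # global array shape (assumed for illustration)
PR, PC = 2, 3        # process grid
br, bc = R // PR, C // PC

# each "process" (i, j) owns a contiguous local copy of its block;
# element values encode their global row-major position for checking
blocks = {(i, j): [[(i*br + r) * C + (j*bc + c) for c in range(bc)]
                   for r in range(br)]
          for i in range(PR) for j in range(PC)}

def naive_gather(blocks):
    """Target inserts every block row by row (noncontiguous writes)."""
    global_flat = [None] * (R * C)
    for (i, j), block in blocks.items():
        for r in range(br):
            row = i * br + r
            global_flat[row*C + j*bc : row*C + (j+1)*bc] = block[r]
    return global_flat
```

Every one of the PR*PC*br short slice assignments is executed by the target alone, which is exactly the serialized overhead described above.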
Another approach reduces this overhead by assembling intermediate subarrays that are contiguous within the target's array. These can be inserted directly using a collective MPI routine. Since the intermediate subarrays can be assembled in parallel, the overhead is distributed over several processes, whose optimal number depends on the latency of the communication.
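The two-phase scheme can be sketched in the same serial-simulation style (again an illustrative model, not the actual MPI code; shapes and decomposition are assumptions). In phase one, one process per grid row interleaves that row's blocks into a buffer that is contiguous in the row-major global array; in phase two, the target only concatenates these contiguous chunks, analogous to a single collective such as MPI_Gatherv:

```python
# Serial simulation of the two-phase gather over a 2-D block
# decomposition (illustrative shapes and process grid).

R, C = 4, 6          # global array shape
PR, PC = 2, 3        # process grid
br, bc = R // PR, C // PC

# element values encode their global row-major position for checking
blocks = {(i, j): [[(i*br + r) * C + (j*bc + c) for c in range(bc)]
                   for r in range(br)]
          for i in range(PR) for j in range(PC)}

def two_phase_gather(blocks):
    # Phase 1 (done in parallel in the real scheme): one process per
    # grid row assembles an intermediate buffer covering full global
    # rows, which is contiguous in the row-major global array.
    intermediates = []
    for i in range(PR):
        buf = []
        for r in range(br):
            for j in range(PC):
                buf.extend(blocks[(i, j)][r])
        intermediates.append(buf)          # contiguous chunk of length br*C
    # Phase 2: the target merely concatenates contiguous chunks,
    # so its per-element insertion work disappears.
    flat = []
    for buf in intermediates:
        flat.extend(buf)
    return flat
```

The reshuffling cost now sits in phase one, spread over PR assembling processes, while the target performs only PR contiguous copies.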
This results in a two-phase gather that, e.g., for a 192x96x47 double-precision array distributed over 8x4 processes, is one order of magnitude faster than the naive approach (measured on an IBM Power6 p575 running AIX). With higher parallelization, performance can be improved further by adapting the procedure to the usually nonuniform communication properties of the interconnect, e.g., shared-memory communication for the first phase and InfiniBand communication for the second.