The heap is local to only one processor. Therefore, that processor simply removes the top heap element and broadcasts to all of the other processors both the pair involved in the collision and the time step for this simulation loop. An earlier approach to this problem on a 2D system explored the results of randomly spreading the heap across processors. |