Dataflow Coordination of Data-Parallel Tasks via MPI 3.0
|Title||Dataflow Coordination of Data-Parallel Tasks via MPI 3.0|
|Publication Type||Conference Paper|
|Year of Publication||2013|
|Authors||Wozniak, JM, Peterka, T, Armstrong, TG, Dinan, J, Lusk, EL, Wilde, M, Foster, IT|
Scientific applications are often complex collections of many large-scale tasks. Mature tools exist for describing task-parallel workflows consisting of serial tasks, and a variety of tools exist for programming a single data-parallel operation. However, few tools cover the intersection of these two models. In this work, we extend the load balancing library ADLB to support parallel tasks. We demonstrate how applications can easily be composed of parallel tasks using Swift dataflow scripts, which are compiled to ADLB programs with performance comparable to hand-coded equivalents. By combining this framework with data-parallel analysis libraries, we are able to dynamically execute many instances of a parallel data analysis application in support of a parameter exploration workload.