Argonne National Laboratory

Job Coscheduling on Coupled High-End Computing Systems

TitleJob Coscheduling on Coupled High-End Computing Systems
Publication TypeConference Paper
Year of Publication2011
AuthorsTang, W, Desai, NL, Vishwanath, V, Buettner, D, Lan, Z
Conference NameProc. 40th International Conference on Parallel Processing Workshops
Date Published09/2011
Other NumbersANL/MCS-P1909-0611

Supercomputer centers often deploy large-scale computing systems together with an associated data analysis or visualization system. In this paper, we propose a coscheduling mechanism, providing the ability to coordinate execution between jobs on different systems. The mechanism is built on top of a lightweight protocol for coordination between policy domains without manual intervention. We have evaluated this system using real job traces from Intrepid and Eureka, the production Blue Gene/P and data analysis systems, respectively, deployed at Argonne National Laboratory. Our experimental results quantify the costs of coscheduling and demonstrate that coscheduling can be achieved with limited impact on system performance undervarying workloads.