Seminar Details:

LANS Informal Seminar
"Fault Tolerance in MPI and Beyond"

DATE:

TIME: 15:00:00 - 16:00:00
SPEAKER: Wesley Bland, Postdoctoral Appointee, MCS, Argonne National Laboratory
LOCATION: Building 240 Room 1404-1405, Argonne National Laboratory

Description:
As HPC drives on toward exascale and resilience research expands, the need to make these ideas available to applications as API, tools, and libraries becomes more real. These tools will be critical for applications to run at exascale and they must be well understood before that time arrives. This talk will discuss an array of new tools, including: MPI resilience libraries, additions to the future MPI standard for resilience, data resilience, and more. A large focus of this talk will be the proposed chapter for the future MPI Standard related to fault tolerance. This proposed chapter is available at http://goo.gl/I96FJx. Come with questions and comments to participate in a discussion and provide feedback.


 

Please send questions or suggestions to Debojyoti Ghosh: ghosh at mcs dot anl dot gov.