Programming Models & Runtime Systems

 

The Programming Models & Runtime Systems (PMRS) group at Argonne National Laboratory is part of the Extreme Computing group in the Mathematics and Computer Science Division.

The group's research areas include:

  • High-performance MPI implementation
  • Fault tolerance and resilience
  • Heterogeneous computing and memory systems
  • Lightweight communication and threading models

MPICH is the group's flagship project and one of the most widely used implementations of MPI in the world. It forms the basis of partner MPI implementations including Intel MPI, IBM MPI, Cray MPI, Microsoft MPI, and MVAPICH, among others.

Other projects in the group include Data Movement for Heterogeneous Memory (DMEM), which aims to improve the usability of non-DRAM memory regions such as byte-addressable NVRAM, scratchpad memory, compute-capable memory, accelerator memory, and other types of memory. The group is also involved in the Global View Resilience (GVR) project, which addresses memory resilience through multi-versioned memory snapshots and cross-layer sharing of error information, and in the ARGO project, which focuses on designing a new operating system and low-level runtime system for exascale.

Details of the various projects the group is involved in are listed here.

The group is supported by funding from the DOE Office of Advanced Scientific Computing Research (ASCR), the DOE National Nuclear Security Administration (NNSA), and the National Science Foundation (NSF).

News & Events

  • Wesley Bland, a postdoc in the Programming Models and Runtime Systems group, gave a talk at the ExaMPI workshop at SC’14 (4:10 – 4:40): “Simplifying the Recovery Model of User-Level Failure Mitigation” by Wesley Bland, Kenneth Raffenetti, and Pavan Balaji (regular paper). You can download the paper here. More information: http://sc14.supercomputing.org/program/workshops https://www.pdc.kth.se/exampi14
  • The PMRS group will take part in a series of events at SC’14 (http://sc14.supercomputing.org), including talks, posters, BoFs, tutorials, and demos. Download the SC14-mpich-flyer here. Come and meet us at the following events.
    Papers:
    Mon / 04:10pm – 04:40pm / 286-7 / Simplifying the Recovery Model of User-Level Failure Mitigation
    Wed / 10:30am – 11:00am / 393-4-5 / Nonblocking Epochs in MPI One-Sided Communication (Best Paper Finalist)
    Wed ...
  • Antonio J. Peña, a postdoc in the Programming Models and Runtime Systems group, presented his research on the efficient use of explicitly managed memory subsystems at IEEE Cluster 2014. The paper he presented: Antonio J. Peña and Pavan Balaji, “Toward the Efficient Use of Multiple Explicitly Managed Memory Subsystems.” IEEE International Conference on Cluster ...

Publications