Extreme Computing Software
Argonne's Mathematics and Computer Science Division is exploring new approaches to systems software, fault tolerance, and innovative programming models for next-generation computers.
Software for extreme computing includes:
- ADLB (Asynchronous Dynamic Load Balancing) is an MPI-based software library designed to help rapidly build scalable parallel programs. It provides a master/worker system with a put/get API for task descriptions, which allows workers to add work to the system dynamically. The library has been used as an execution engine for complicated applications such as Green's function Monte Carlo and for higher-level "many-task" programming models.
- CIFTS provides coordinated and improved fault tolerance for high-performance computing systems.
- ExM provides support for extreme-scale, many-task applications.
- Falkon is a fast and lightweight task execution framework.
- Globus Toolkit is a collection of software tools that make it easier to build computational grids and grid-based applications. The Globus Toolkit is used by many organizations to build grids that can support their applications. The open-source Globus Toolkit includes tools and libraries for solving problems in the following areas: security, communication, information infrastructure, fault detection, resource management, portability, and data management.
- Jumpshot is a profiling tool that provides log files, communication statistics, and graphical output of the results. An enhanced version handles larger numbers of processes and provides visualization of parallel I/O activities. Jumpshot is distributed with MPICH.
- MPICH is a high-performance, widely portable implementation of the Message Passing Interface standard. It provides an MPI implementation that efficiently supports different computational and communication platforms, including commodity clusters, high-speed networks, and proprietary high-end computing systems. It also enables cutting-edge research in MPI through an easy-to-extend modular framework for other derived implementations.
- Nimbus is an integrated set of tools that delivers the power and versatility of infrastructure clouds to scientific users.
- Swift provides fast, easy parallel scripting on multicores, clusters, clouds, and supercomputers.
- ZeptoOS is a research project studying very efficient and customized Linux kernels for petascale and exascale architectures with 10,000 to 1 million CPUs.
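
The put/get task pool described in the ADLB entry above can be illustrated with a small sketch. This is not ADLB's API: ADLB is an MPI-based library, while the sketch below uses Python threads and a shared in-process queue as a stand-in. The names `run_pool`, `handle`, and `split_sum` are hypothetical and chosen only for illustration. The point it shows is the pattern itself: workers get a task description from a shared pool and may put new tasks back while they run.

```python
import queue
import threading


def run_pool(initial_tasks, handle, num_workers=4):
    """Drain a shared task pool with a fixed set of workers.

    handle(task) must return (result, new_tasks). Any new_tasks are put
    back into the pool, so work can grow while the pool is running.
    """
    pool = queue.Queue()
    results = []
    results_lock = threading.Lock()

    for task in initial_tasks:
        pool.put(task)

    def worker():
        while True:
            task = pool.get()
            if task is None:  # sentinel: no more work will arrive
                pool.task_done()
                return
            result, new_tasks = handle(task)
            # Put child tasks before marking the parent done, so the
            # pool's unfinished count never reaches zero too early.
            for child in new_tasks:
                pool.put(child)
            with results_lock:
                results.append(result)
            pool.task_done()

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for thread in threads:
        thread.start()

    pool.join()  # returns once every task, including added ones, is done
    for _ in threads:
        pool.put(None)
    for thread in threads:
        thread.join()
    return results


def split_sum(task):
    """Hypothetical task: sum the integers in [lo, hi), splitting large ranges."""
    lo, hi = task
    if hi - lo <= 10:
        return sum(range(lo, hi)), []
    mid = (lo + hi) // 2
    return 0, [(lo, mid), (mid, hi)]


if __name__ == "__main__":
    print(sum(run_pool([(0, 1000)], split_sum)))
```

Here a single initial task fans out into many smaller ones, and no worker needs to know in advance how much work exists. A distributed library such as ADLB would additionally have to handle task placement across processes and termination detection over the network, which this single-process sketch sidesteps.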