G. Almasi, C. Archer, J. G. Castanos, J. Gunnels, C. Erway, P. Heidelberger, X. Martorell, J. E. Moreira, K. Pinnow, J. Ratterman, B. Steinmacher-burow, W. Gropp, and B. Toonen, "The Design and Implementation of Message Passing Services for the BlueGene/L Supercomputer," Preprint ANL/MCS-P1183-0604, June 2004.
S. Balay, W. Gropp, L. McInnes, and B. Smith, "Software for the Scalable Solution of PDEs", The CRPC Handbook of Parallel Computing, Morgan-Kaufmann (to appear). Also Preprint ANL/MCS-P834-0700, July 2000. bibtex entry
D. Bonachea, P. Dickens, and R. Thakur, "High-Performance File I/O in Java: Existing Approaches and Bulk I/O Extensions," Preprint ANL/MCS-P840-0800, Argonne National Laboratory, August 2000.
D. Buntinas and W. Gropp, "Understanding the Requirements Imposed by Programming Model Middleware on a Common Communication Subsystem," Technical Memorandum ANL/MCS-TM-284, July 2005.
D. Buntinas, W. Gropp, and G. Mercier, "Data Transfers between Processes in an SMP System: Performance Study and Application to MPI," Preprint ANL/MCS-P1306-1105, November 2005.
D. Buntinas, G. Mercier, and W. Gropp, "Design and Evaluation of Nemesis, a Scalable, Low-Latency, Message-Passing Communication Subsystem," ANL/MCS-TM-292, November 2005/
R. Butler, W. Gropp, and E. Lusk, "A Scalable Process-Management Environment for Parallel Programs," Preprint ANL/MCS-P754-0699, April 2000.
S. Byna, W. Gropp, X.-H. Sun, and R. Thakur, "Improving the Performance of MPI Derived Datatypes by Optimizing Memory-Access Cost," Preprint ANL/MCS-P1045-0403 April 2003.
P. H. Carns, W. B. Ligon III, R. B. Ross, and R. Thakur, "PVFS: A Parallel File System for Linux Clusters," Preprint ANL/MCS-P804-0400, April 2000.
A. Choudhary, M. Kandemir, J. No, G. Memik, X. Shen, W. Liao, H. Nagesh, S. More, V. Taylor, R. Thakur, and R. Stevens, "Data Management for Large-Scale Scientific Computations in High Performance Distributed Systems," to appear in Cluster Computing, 2000. bibtex entry
A. Ching, A. Choudhary, W.-K. Liao, R. Ross, and W. Gropp, "Evaluating Structured I/O Methods for Parallel File Systems," Preprint ANL/MCS-P1125-0204, February 2004.
N. Desai, A. Lusk, R. Bradshaw, and E. Lusk, "MPISH: A Parallel Shell for MPI Programs," Preprint ANL/MCS-P1219-0105, January 2005.
N. Desai, R. Bradshaw, A. Lusk, and E. Lusk, "MPI Cluster System Software," Preprint ANL/MCS-P1160-0504, May 2004.
P. Dickens and R. Thakur, "An Evaluation of Java's I/O Capabilities for High-Performance Computing," Preprint, 2000. bibtex entry
P. Dickens and R. Thakur, "On Implementing High-Performance Collective I/O," Preprint ANL/MCS-P852-1000, September 2000.
P. Dickens and R. Thakur, "An Evaluation of Java's I/O Capabilities for High-Performance Computing," Preprint ANL/MCS-P849 -1000, September 2000.
P. M. Dickens and R. Thakur, "On Implementing High-Performance Collective I/O," ANL/MCS-P852-0700, November 2000.
J. J. Evans, S. Baik, C. S. Hood, and W. Gropp, "Toward Understanding Soft Faults in High-Performance Cluster Networks," Preprint ANL/MCS-P1017-0700, January 2003.
W. Gropp, "Building Library Components That Can Use Any MPI Implementation," Preprint ANL/MCS-P956 -0502, May 2002.
W. D. Gropp, "Runtime Checking of Datatype Signatures in MPI," Recent Advances in PVM and MPI, eds. J. Dongarra, P. Kacsuk, N. Podhorszki, 7th European PVM/MPI Users's Group Meeting, Sept. 10-13, 2000 (to appear). Also Preprint ANL/MCS-P826-0500, May 2000.
W. D. Gropp, D. K. Kaushik, D. E. Keyes, and B. F. Smith, "Latency, Bandwidth, and Concurrent Issue Limitations in High-Performance CFD," Preprint ANL/MCS-P850-1000, October 2000.
W. D. Gropp, D. K. Kaushik, D. E. Keyes, and B. F. Smith, "Understanding the Parallel Scalability of an Implicit Unstructured Mesh CFD Code," Preprint ANL/MCS-P845-0900, September 2000.
W. D. Gropp, D. K. Kaushik, D. E. Keyes, and B. F. Smith, "Performance Modeling and Tuning of an Unstructured Mesh CFD Application," Preprint ANL/MCS-P833-0700, July 2000.
W. Gropp, R. Ross, and N. Miller, "Providing Efficient I/O Redundancy in MPI Environments," Preprint ANL/MCS-P1178-0604, June 2004.
W. Gropp and E. Lusk, "Fault Tolerance in MPI Programs," Preprint ANL/MCS-P1154-0404, April 2004.
W. Jiang, K. Liu, H.-W. Jin, D. K. Panda, D. Buntinas, R. Thakur, and W. Gropp, "Efficient Implementation of MPI-2 Passive One-Sided Communication on InfiniBand Clusters," Preprint ANL/MCS-P1164-0504, May 2004.
N. Karonis, B. de Supinski, I. Foster, W. Gropp, E. Lusk, and J. Bresnahan, "Exploiting Hierarchy in Parallel Computer Networks to Optimize Collective Operation Performance," Fourteenth International Parallel and Distributed Processing Symposium (IPDPS '00), Cancun, Mexico, May 1-5, 2000, 377-384.
R. Latham, R. Ross, and R. Thakur, "The Impact of File Systems on MPI-IO Scalability," Preprint ANL/MCS-P1182-0604, June 2004.
J. Lee, X. Ma, R. Ross, R. Thakur, and M. Winslett," "RFS: Efficient and Flexible Remote File Access for MPI-IO," Preprint ANL/MCS-P1176-0604, June 2004.
J. Lee, R. Ross, S. Atchley, M. Beck, and R. Thakur, "MPI-IO/L: Eficient Remote I/O for MPI-IO via Logistical Networking," Preprint ANL/MCS-P1304-1105, November 2005.
J. Li, W.-K. Liao, A. Choudhary, R. Ross, R. Thakur, W. Gropp, and R. Latham, "Parallel netCDF: A Scientific High-Performance I/O Interface," Preprint ANL/MCS-P1048-0503, Argonne National Laboratory, May 2003.
W. Ligon and R. Ross, "Parallel I/O and the Parallel Virtual File System", Preprint ANL/MCS-P1174-0504, May 2004. Published in Beowulf Cluster Computing with Linux, 2nd ed., edited by W. Gropp, E. Lusk, and T. Sterling, MIT Press, 2003.
J. Liu, W. Jiang, P. Wyckoff, D. K. Panda, D. Ashton, D. Buntinas, W. Gropp, and B. Toonen, "Design and Implementation of MPICH2 over InfiniBand with RDMA Support," Preprint ANL/MCS-P1103-1003 October 2003.
J. No, R. Thakur, and A. Choudhary, "High-Performance Scientific Data Management System," Preprint ANL/MCS-P973-0502, April 2003.
J. No, R. Thakur, D. Kaushik, L. Freitag, and A. Choudhary, "A Scientific Data Management System for Irregular Applications," Preprint ANL/MCS-866-1000, October 2000.
J. No, R. Thakur, A. Choudhary, "Integrating Parallel File I/O and Database Support for High-Performance Scientific Data Management," Preprint ANL/MCS-P798-0300, March 2000. abstract
E. Ong, E. Lusk, and W. Gropp, "Scalable Unix Commands for Parallel Processors: A high-Performance Implementation," ANL/MCS-P885 -0601, June 2001. in Recent Advances in Parallel Virtual Machine and Message Passing Interface, eds. Y. Cotronis and J. Dongarra, Lecture Notes in Computer Science, Vol. 2131, Springer-Verlag, pp. 410-418, Sept. 2001.
R. Ross, R. Latham, W. Gropp, R. Thakur, and B. Toonen, "Implementing MPI-IO Atomic Mode without File System Support," Preprint ANL/MCS-P1235-0305, March 2005.
R. Ross, N. Miller, and W. Gropp, "Implementing Fast and Reusable Datatype Processing," Preprint ANL/MCS-P1068-0703, July 2003.
A. Roy, I. Foster, W. Gropp, N. Karonis, V. Sander, and B. Toonen, "MPICH-GQ: Quality-of-Service for Message Passing Programs," Proc. SC00 (SC2000), Dallas, TX, Nov. 2000.
R. Thakur and W. Gropp, "Parallel I/O," Preprint ANL/MCS-P837-0700, ANL/MCS-P837-0700, July 2000.
R. Thakur and W. Gropp, "Improving the Performance of Collective Operations in MPICH," Preprint ANL/MCS-P1038-0405, April 2003.
R. Thakur, W. Gropp, and E. Lusk, "Optimizing Noncontiguous Accesses in MPI-IO," Parallel Computing 28 (2002) 83-105
R. Thakur, W. Gropp, and B. Toonen, "Minimizing Synchronization Overhead in the Implementation of MPI One-Sided ed Communication," Preprint ANL/MCS-P1158-0504, May 2004.
R. Thakur, W. Gropp, and B. Toonen, "Optimizing the Synchronization Operations in MPI One-Sided Communication," Preprint ANL/MCS-P1232-0205, February 2005.
R. Thakur, F. Rabenseifner, and W. Gropp, "Optimization of Collective Communication Operations in MPICH," Preprint ANL/MCS-P1140-0304 , March 2004.
M. Vilayannur, R. B. Ross, P. H. Carns, R. Thakur, A. Sivasubramaniam, and M. Ka ndemir", "Improving the Performance of the POSIX I/O Interface to PVFS," Preprint ANL/MCS-P1010-1102, November 2002.
J. Wu and P. Wyckoff and D. Panda and R. Ross, "Unifier: Unifying Cache Management and Communication Buffer Management for PVFS over InfinBand," Preprint ANL/MCS-P1122-0204, February 2004.
W. Yu, D. Buntinas, R. L. Graham, and D. K. Panda, "Efficient and Scalable Barrier over Quadrics and Myrinet with a New NIC-based Collective Message-Passing Protocol," Preprint ANL/MCS-P1121-0204, February 2004.
W. Yu, D. Buntinas, and D. K. Panda, "Scalable High-Performance NIC-Based All-to-All Broadcast over Myrinet/GM," Preprint ANL/MCS-P1177-0604, June 2004.