Wesley Bland

Postdoctoral Appointee
Mathematics and Computer Science Division
Argonne National Laboratory

PUBLICATIONS

You can find my Google Scholar page here to see more statistics and anything I might have missed.

Balaji, P., Thakur, R., Bland, W., Raffenetti, K., and Zhao, X. Parallel Programming with MPI, June 2014. Argonne National Lab full day tutorial on MPI. [ bib | http ]

Bland, W., Raffenetti, K., and Balaji, P. Simplifying the recovery model of user-level failure mitigation. In Proceedings of the 2014 Workshop on Exascale MPI (Piscataway, NJ, USA, 2014), ExaMPI '14, IEEE Press, pp. 20-25. [ bib | DOI | http ]

Bland, W. Fault tolerant runtime research @ ANL, Mar 2014. Lawrence Berkeley Laboratory Visit. [ bib | http ]

Bland, W. Proposed fault tolerance for MPI-4, Feb 2014. Lawrence Livermore Laboratory Visit. [ bib | .pdf ]

Yang, C., Bland, W., Mellor-Crummey, J., and Balaji, P. Portable, MPI-interoperable Coarray Fortran. In Proceedings of the 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (New York, NY, USA, 2014), PPoPP '14, ACM, pp. 81-92. [ bib | DOI | http ]

Bland, W., Du, P., Bouteiller, A., Herault, T., Bosilca, G., and Dongarra, J. J. Extending the scope of the checkpoint-on-failure protocol for forward recovery in standard mpi. Concurrency and Computation: Practice and Experience (June 2013). [ bib | DOI ]

Bland, W., Bouteiller, A., Herault, T., Bosilca, G., and Dongarra, J. Post-failure recovery of mpi communication capability: Design and rationale. International Journal of High Performance Computing Applications 27, 3 (2013), 244-254. [ bib | DOI | arXiv | http ]

Bland, W. Fault tolerant runtime research @ ANL, Nov 2013. 10th Joint Laboratory for Petascale Computing Workshop. [ bib | .pdf ]

Bland, W., Bouteiller, A., Herault, T., Hursey, J., Bosilca, G., and Dongarra, J. An evaluation of User-Level Failure Mitigation support in MPI. Computing 95, 12 (2013), 1171-1184. [ bib | DOI | http ]

Bland, W. Toward Message Passing Failure Management. PhD thesis, University of Tennessee, Knoxville, 2013. [ bib | http | .pdf ]

Bland, W. User Level Failure Mitigation in MPI. In Euro-Par 2012: Parallel Processing Workshops, I. Caragiannis, M. Alexander, R. M. Badia, M. Cannataro, A. Costan, M. Danelutto, F. Desprez, B. Krammer, J. Sahuquillo, S. L. Scott, and J. Weidendorfer, Eds., vol. 7640 of Lecture Notes in Computer Science. Springer Berlin Heidelberg, 2013, pp. 499-504. [ bib | DOI | .pdf ]

Bland, W. Enabling Application Resilience with and Without the MPI Standard. In Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (Ccgrid 2012) (Washington, DC, USA, May 2012), CCGRID '12, IEEE Computer Society, pp. 746-751. [ bib | DOI | .pdf ]

Bland, W. User Level Failure Mitigation in MPI, Aug 2012. Resilience Workshop co-located with Euro-Par. [ bib | .pdf ]

Bland, W., Bouteiller, A., Herault, T., Hursey, J., Bosilca, G., and Dongarra, J. J. An Evaluation of User-Level Failure Mitigation Support in MPI. In Recent Advances in the Message Passing Interface, J. L. Traff, S. Benkner, and J. J. Dongarra, Eds., vol. 7490 of Lecture Notes in Computer Science. Springer Berlin Heidelberg, Sep 2012, pp. 193-203. [ bib | DOI | .pdf ]

Bland, W., Du, P., Bouteiller, A., Herault, T., Bosilca, G., and Dongarra, J. A Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in Standard MPI. In Euro-Par 2012 Parallel Processing, C. Kaklamanis, T. Papatheodorou, and P. G. Spirakis, Eds., vol. 7484 of Lecture Notes in Computer Science. Springer Berlin Heidelberg, Aug 2012, pp. 477-488. [ bib | DOI | .pdf ]

Bland, W., Bosilca, G., Bouteiller, A., Herault, T., and Dongarra, J. A proposal for User-Level Failure Mitigation in the MPI-3 Standard. Tech. rep., Tech. rep., Department of Electrical Engineering and Computer Science, University of Tennessee, 2012. [ bib | .pdf ]

Naughton, T., Bland, W., Vallee, G., Engelmann, C., and Scott, S. L. Fault Injection Framework for System Resilience Evaluation: Fake Faults for Finding Future Failures. In Proceedings of the 2009 Workshop on Resiliency in High Performance (New York, NY, USA, 2009), Resilience '09, ACM, pp. 23-28. [ bib | DOI | .pdf ]

Vallee, G., Naughton, T., Ong, H., Tikotekar, A., Engelmann, C., Bland, W., Aderholdt, F., and Scott, S. L. Virtual System Environments. In Systems and Virtualization Management. Standards and New Technologies, L. Boursas, M. Carlson, W. Hommel, M. Sibilla, and K. Wold, Eds., vol. 18 of Communications in Computer and Information Science. Springer Berlin Heidelberg, 2008, pp. 72-83. [ bib | DOI | .pdf ]

Bland, W., Naughton, T., Vallee, G., and Scott, S. Design and Implementation of a Menu Based OSCAR Command Line Interface. In High Performance Computing Systems and Applications, 2007. HPCS 2007. 21st International Symposium on (2007), pp. 25-25. [ bib | DOI | .pdf ]

Vallee, G., Naughton, T., Bland, W., and Scott, S. Automatic Testing Tool for OSCAR Using System-level Virtualization. In High Performance Computing Systems and Applications, 2007. HPCS 2007. 21st International Symposium on (2007), pp. 26-26. [ bib | DOI | .pdf ]