Pavan Balaji
Computer Scientist
Mathematics and Computer Science Division
Argonne National Laboratory



(A detailed list of publications can be found in my CV)

(An autogenerated list of collaborators (past 4 years) can be found here)

Book Chapters

  1. James S. Dinan and Pavan Balaji. Scalable Computing and Communications: Theory and Practice. Chapter on Parallel Programming Models for Scalable Computing. Editors: Samee Ullah Khan, Lizhe Wang, and Albert Y. Zomaya. John Wiley & Sons Publishing, 2012.
  2. Pavan Balaji, Darius T. Buntinas and Dries Kimpe. Scalable Computing and Communications: Theory and Practice. Chapter on Fault Tolerance Techniques for Scalable Computing. Editors: Samee Ullah Khan, Lizhe Wang, and Albert Y. Zomaya. John Wiley & Sons Publishing, 2012.
  3. Pavan Balaji, Wu-chun Feng and Qian Zhu. Scalable Computing and Communications: Theory and Practice. Chapter on Virtualization Techniques for Graphics Processing Units. Editors: Samee Ullah Khan, Lizhe Wang, and Albert Y. Zomaya. John Wiley & Sons Publishing, 2012.
  4. Dhabaleswar K. Panda, Pavan Balaji, Sayantan Sur and Matthew Koop. Attaining High Performance Communication: A Vertical Approach. Chapter on Commodity High Performance Interconnects. Editor: Ada Gavrilovska. CRC Press, 2009.
  5. Wu-chun Feng and Pavan Balaji. Attaining High Performance Communication: A Vertical Approach. Chapter on Ethernet vs. Ethernot. Editor: Ada Gavrilovska. CRC Press, 2009.
  6. Pavan Balaji, P. Sadayappan and Mohammad Kamrul Islam. Market-Oriented Grid and Utility Computing. Chapter on Techniques on Providing Hard Quality of Service Guarantees in Job Scheduling. Editors: Rajkumar Buyya and Kris Bubendorfer. Wiley Publishers, 2008.


Refereed Journal Articles

  1. James S. Dinan, Pavan Balaji, David J. Goodell, Douglas Miller, Marc Snir and Rajeev S. Thakur. Enabling Communication Concurrency Through Flexible MPI Endpoints. International Journal of High Performance Computing Applications (IJHPCA); special issue for the Euro MPI Users’ Group Meeting (Euro MPI). (accepted for publication).
  2. Marc Snir, Robert W. Wisniewski, Jacob A. Abraham, Sarita V. Adve, Saurabh Bagchi, Pavan Balaji, Jim Belak, Pradip Bose, Franck Cappello, Bill Carlson, Andrew A. Chien, Paul Coteus, Nathan A. Debardeleben, Pedro Diniz, Christian Engelmann, Mattan Erez, Saverio Fazzari, Al Geist, Rinku K. Gupta, Fred Johnson, Sriram Krishnamoorthy, Sven Leyffer, Dean Liberty, Subhashish Mitra, Todd Munson, Rob Schreiber, Jon Stearley, and Eric Van Hensbergen. Addressing Failures in Exascale Computing. International Journal of High Performance Computing Applications (IJHPCA). (accepted for publication).
  3. John Jenkins, James S. Dinan, Pavan Balaji, Tom Peterka, Nagiza F. Samatova and Rajeev S. Thakur. Processing MPI Derived Datatypes on Noncontiguous GPU-Resident Data. IEEE Transactions on Parallel and Distributed Systems (TPDS). (accepted for publication).
  4. Torsten Hoefler, James S. Dinan, Darius T. Buntinas, Pavan Balaji, Brian Barrett, Ronald Brightwell, William D. Gropp, Vivek Kale and Rajeev S. Thakur. MPI+MPI: A New, Hybrid Approach to Parallel Programming with MPI Plus Shared Memory. Springer Journal of Computing; special issue for the Euro MPI Users’ Group Meeting (Euro MPI), pp. 1121–1136, Vol. 95, 2013. [pdf]
  5. Hameed Hussain, Nasro Min-Allah, Samee Ullah Khan, Abdul Hameed, Saif Ur-Rehman Malik, Limin Zhang, Nasir Ghani, Joanna Kolodziej, Albert Y. Zomaya, Cheng-Zhong Xu, Pavan Balaji, Abhinav Vishnu, Fredric Pinel, Jonathan E. Pecero, Pascal Bouvry, and Ammar Rayes. A Survey on Resource Allocation in High Performance Distributed Computing Systems. International Parallel Computing (ParCo) Journal, pp. 709–736, Vol. 39, Issue 11, 2013. [pdf]
  6. Jue Hong, Pavan Balaji, Gaojin Wen, Bibo Tu, Junming Yan, Cheng-Zhong Xu, and Shengzhong Feng. Implementation and Evaluation of Container-based Job Management for Fair Resource Sharing. Lecture Notes in Computer Science and General Issues; special issue for the International Supercomputing Conference (ISC), pp. 290–301, Vol. 7905, 2013. (conference: Jun. 16–20, 2013, Leipzig, Germany.) [pdf] [slides]
  7. Abhinav Vishnu, Shuaiwen Song, Andres Marquez, Kevin Barker, Darren Kerbyson, Kirk W. Cameron and Pavan Balaji. Designing Energy Efficient Communication Runtime Systems: A View from PGAS Models. Journal of Supercomputing (JoS), pp. 691–709, Vol. 63, Issue 3, 2013. [pdf]
  8. Giorgio Luigi Valentini, Walter Lassonde, Samee Ullah Khan, Nasro Min-Allah, Sajjad A. Madani, Juan Li, Limin Zhang, Lizhe Wang, Nasir Ghani, Joanna Kolodziej, Hongxiang Li, Albert Y. Zomaya, Cheng-Zhong Xu, Pavan Balaji, Abhinav Vishnu, Fredric Pinel, Jonathan E. Pecero, Dzimitry Kliazovich, and Pascal Bouvry. An Overview of Energy Efficiency Techniques in Cluster Computing Systems. Springer Journal of Cluster Computing; special issue on Green Computing and Communications, pp. 3–15, Vol. 16, Issue 1, 2013. [pdf]
  9. Pavan Balaji, Rinku K. Gupta, Abhinav Vishnu and Peter H. Beckman. Mapping Communication Layouts to Network Hardware Characteristics on Massive-Scale Blue Gene Systems. Springer Journal of Computer Science on Research and Development; special issue for the International Supercomputing Conference (ISC), pp. 247–256, Vol. 26, Issue 3–4, 2011. (conference: Jun. 18–23, 2011, Hamburg, Germany.) [pdf] [slides]
  10. Pavan Balaji, Darius T. Buntinas, David J. Goodell, William D. Gropp, Torsten Hoefler, Sameer Kumar, Ewing L. (Rusty) Lusk, Rajeev S. Thakur and Jesper Larsson Träff. MPI on Millions of Cores. Parallel Processing Letters (PPL) Journal; special issue for the Euro MPI Users’ Group Meeting (Euro MPI), pp. 45–60, Vol. 21, Issue 1, 2011.
  11. Pavan Balaji, Wu-chun Feng, Heshan Lin, Jeremy Archuleta, Satoshi Matsuoka, Andrew Warren, João Carlos Setubal, Ewing L. (Rusty) Lusk, Rajeev S. Thakur, Ian Foster, Daniel S. Katz, Shantenu Jha, Kevin Shinpaugh, Susan Coghlan, and Daniel A. Reed. Global-scale Distributed I/O with ParaMEDIC. Journal of Concurrency and Computation: Practice and Experience (CCPE), pp. 2266–2281, Vol. 22, Issue 16, 2010. [pdf]
  12. Pavan Balaji, Anthony K. Chan, William D. Gropp, Rajeev S. Thakur and Ewing L. (Rusty) Lusk. The Importance of Non-Data-Communication Overheads in MPI. International Journal of High Performance Computing Applications (IJHPCA); special issue for the Euro MPI Users’ Group Meeting (Euro MPI), pp. 5–15, Vol. 24, Issue 1, 2010.
  13. Pavan Balaji, Darius T. Buntinas, David J. Goodell, William D. Gropp and Rajeev S. Thakur. Fine-Grained Multithreading Support for Hybrid Threaded MPI Programming. International Journal of High Performance Computing Applications (IJHPCA); special issue for the Euro MPI Users’ Group Meeting (Euro MPI), pp. 49–57, Vol. 24, Issue 1, 2010.
  14. Jesper Larsson Träff, Andreas Ripke, Christian Siebert, Pavan Balaji, Rajeev S. Thakur and William D. Gropp. A Pipelined Algorithm for Large, Irregular All-gather Problems. International Journal of High Performance Computing Applications (IJHPCA); special issue for the Euro MPI Users’ Group Meeting (Euro MPI), pp. 58–68, Vol. 24, Issue 1, 2010.
  15. Pavan Balaji, Anthony K. Chan, Rajeev S. Thakur, William D. Gropp and Ewing L. (Rusty) Lusk. Toward Message Passing for a Million Processes: Characterizing MPI on a Massive Scale Blue Gene/P. Springer Journal of Computer Science on Research and Development; special issue for the International Supercomputing Conference (ISC), pp. 11–19, Vol. 24, Issue 1, 2009. Best Paper Award at ISC. (conference: June 23–26, 2009, Hamburg, Germany.) [pdf] [slides]
  16. Ping Lai, Pavan Balaji, Rajeev S. Thakur and Dhabaleswar K. Panda. ProOnE: A General Purpose Protocol Onload Engine for Multi- and Many-Core Architectures. Springer Journal of Computer Science on Research and Development; special issue for the International Supercomputing Conference (ISC), pp. 133–142, Vol. 23, Issue 3, 2009. (conference: June 23–26, 2009, Hamburg, Germany.) [pdf] [slides]
  17. Pavan Balaji, Wu-chun Feng and Dhabaleswar K. Panda. Bridging the Ethernet-Ethernot Performance Gap. IEEE Micro Journal; special issue on High-Performance Interconnects, pp. 24–40, Vol. 26, Issue 3, 2006. [pdf]
  18. Hyun-Wook Jin, Pavan Balaji, Chuck Yoo, Jin-Young Choi and Dhabaleswar K. Panda. Exploiting NIC Architectural Support for Enhancing IP based Protocols on High Performance Networks. Journal of Parallel and Distributed Computing (JPDC); special issue on Design and Performance of Networks for Super-, Cluster- and Grid-Computing, pp. 1348–1365, Vol. 65, Issue 11, 2005. [pdf]
  19. Mohammad Kamrul Islam, Pavan Balaji, P. Sadayappan and Dhabaleswar K. Panda. QoPS: A QoS based scheme for Parallel Job Scheduling (extended journal version). IEEE Springer LNCS Journal Series, pp. 252–268, Vol. 2862, 2003. [pdf]


Refereed Conference Publications

  1. Judicael A. Zounmevo, Xin Zhao, Pavan Balaji, William D. Gropp, and Ahmad Afsahi. Nonblocking Epochs in MPI One-Sided Communication. IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC). Best Paper Finalist. Nov. 16–21, 2014, New Orleans, Louisiana.
  2. Zhezhe Chen, James S. Dinan, Zhen Tang, Pavan Balaji, Hua Zhong, Jun Wei, Tao Huang, and Feng Qin. MC-Checker: Detecting Memory Consistency Errors in MPI One-Sided Applications. IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC). Nov. 16–21, 2014, New Orleans, Louisiana.
  3. Antonio J. Peña and Pavan Balaji. Toward the Efficient Use of Multiple Explicitly Managed Memory Subsystems. IEEE International Conference on Cluster Computing (Cluster). Sep. 22–26, 2014, Madrid, Spain.
  4. Junchao Zhang, Bill Long, Kenneth Raffenetti, and Pavan Balaji. Implementing the MPI-3.0 Fortran 2008 Binding. The Euro MPI Users’ Group Conference (Euro MPI/Asia). Sep. 9–12, 2014, Kyoto, Japan.
  5. Min Si, Antonio J. Peña, Pavan Balaji, Masamichi Takagi and Yutaka Ishikawa. MT-MPI: Multithreaded MPI for Many-core Environments. ACM International Conference on Supercomputing (ICS). June 10–13, 2014, Munich, Germany. [pdf] [slides]
  6. Chaoran Yang, Wesley Bland, Pavan Balaji, and John Mellor-Crummey. Portable, MPI-Interoperable Coarray Fortran. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). Feb. 15–19, 2014, Orlando, Florida. [pdf] [slides]
  7. Xin Zhao, Pavan Balaji, William D. Gropp and Rajeev S. Thakur. Optimization Strategies for MPI-Interoperable Active Messages. IEEE International Conference on Scalable Computing and Communications (ScalCom). Best Paper Award. Dec. 21–22, 2013, Chengdu, China. [pdf] [slides]
  8. Lokendra S. Panwar, Ashwin M. Aji, Jiayuan Meng, Pavan Balaji, and Wu-chun Feng. Online Performance Projection for Clusters with Heterogeneous GPUs. IEEE International Conference on Parallel and Distributed Systems (ICPADS). Dec. 15–18, 2013, Seoul, Korea. [pdf] [slides]
  9. Xin Zhao, Pavan Balaji, William D. Gropp, and Rajeev S. Thakur. MPI-Interoperable Generalized Active Messages. IEEE International Conference on Parallel and Distributed Systems (ICPADS). Dec. 15–18, 2013, Seoul, Korea. [pdf] [slides]
  10. Pavan Balaji and Dries Kimpe. On the Reproducibility of MPI Reduction Operations. IEEE International Conference on High Performance Computing and Communications (HPCC). Nov. 13–15, 2013, Zhangjiajie, China. [pdf] [slides]
  11. David Ozog, Jeffrey R. Hammond, James S. Dinan, Pavan Balaji, Sameer Shende, and Allen D. Malony. Inspector-Executor Load Balancing Algorithms for Block-Sparse Tensor Contractions. International Conference on Parallel Processing (ICPP). Oct. 1–4, 2013, Lyon, France. [pdf] [slides]
  12. Md. Ziaul Haque Olive, Qing Yi, James S. Dinan, and Pavan Balaji. Enhancing Performance Portability of MPI Applications Through Annotation-Based Transformations. International Conference on Parallel Processing (ICPP). Oct. 1–4, 2013, Lyon, France. [pdf] [slides]
  13. Antonio J. Peña, Ralf Gunter Correa Carvalho, James S. Dinan, Pavan Balaji, Rajeev S. Thakur and William D. Gropp. Analysis of Topology-Dependent MPI Performance on Gemini Networks. The Euro MPI Users’ Group Conference (EuroMPI). Sep. 15–18, 2013, Madrid, Spain. [pdf] [slides]
  14. James S. Dinan, Pavan Balaji, David J. Goodell, Douglas Miller, Marc Snir and Rajeev S. Thakur. Enabling MPI Interoperability Through Flexible Communication Endpoints. The Euro MPI Users’ Group Conference (EuroMPI). Sep. 15–18, 2013, Madrid, Spain. [pdf] [slides]
  15. Palden Lama, Yan Li, Ashwin M. Aji, Pavan Balaji, James S. Dinan, Shucai Xiao, Yunquan Zhang, Wu-chun Feng, Rajeev S. Thakur and Xiaobo Zhou. pVOCL: Power-Aware Dynamic Placement and Migration in Virtualized GPU Environments. International Conference on Distributed Computing Systems (ICDCS). July 8–11, 2013, Philadelphia, Pennsylvania. [pdf] [slides]
  16. Ashwin M. Aji, Lokendra S. Panwar, Feng Ji, Milind Chabbi, Karthik Murthy, Pavan Balaji, Keith R. Bisset, James S. Dinan, Wu-chun Feng, John Mellor-Crummey, Xiaosong Ma and Rajeev S. Thakur. On the Efficacy of GPU-Integrated MPI for Scientific Applications. ACM International Symposium on High Performance Parallel and Distributed Computing (HPDC). Jun. 17–21, 2013, New York, New York. [pdf] [slides]
  17. Xin Zhao, Darius T. Buntinas, Judicael A. Zounmevo, James S. Dinan, David J. Goodell, Pavan Balaji, Rajeev S. Thakur, Ahmad Afsahi and William D. Gropp. Towards Asynchronous, MPI-Interoperable Active Messages. IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid). May 13–16, 2013, Delft, Netherlands. [pdf] [slides]
  18. Jing Zhang, Heshan Lin, Pavan Balaji and Wu-chun Feng. Optimizing Burrows-Wheeler Transform-Based Sequence Alignment on Multicore Architectures. IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid). May 13–16, 2013, Delft, Netherlands. [pdf]
  19. Jeffrey R. Hammond, James S. Dinan, Pavan Balaji, Ivo Kabadshow, Sreeram Potluri and Vinod Tipparaju. OSPRI: An Optimized One-Sided Communication Runtime for Leadership-Class Machines. International Conference on Partitioned Global Address Space Programming Models (PGAS). Oct. 10–12, 2012, Santa Barbara, California. [pdf] [slides]
  20. John Jenkins, James S. Dinan, Pavan Balaji, Nagiza F. Samatova and Rajeev S. Thakur. Enabling Fast, Noncontiguous GPU Data Movement in Hybrid MPI+GPU Environments. IEEE International Conference on Cluster Computing (Cluster). Sep. 28–30, 2012, Beijing, China. [pdf] [slides]
  21. Torsten Hoefler, James S. Dinan, Darius T. Buntinas, Pavan Balaji, Brian Barrett, Ronald Brightwell, William D. Gropp, Vivek Kale and Rajeev S. Thakur. Leveraging MPI’s One-Sided Communication Interface for Shared-Memory Programming. The Euro MPI Users’ Group Conference (EuroMPI). Sep. 23–26, 2012, Vienna, Austria. [pdf] [slides]
  22. James S. Dinan, David J. Goodell, William D. Gropp, Rajeev S. Thakur, and Pavan Balaji. Efficient Multithreaded Context ID Allocation in MPI. The Euro MPI Users’ Group Conference (EuroMPI). Sep. 23–26, 2012, Vienna, Austria. [pdf] [slides]
  23. Feng Ji, Ashwin M. Aji, James S. Dinan, Darius T. Buntinas, Pavan Balaji, Rajeev S. Thakur, Wu-chun Feng and Xiaosong Ma. DMA-Assisted, Intranode Communication in GPU Accelerated Systems. IEEE International Conference on High Performance Computing and Communications (HPCC). June 25–27, 2012, Liverpool, UK. [pdf] [slides]
  24. Ashwin M. Aji, James S. Dinan, Darius T. Buntinas, Pavan Balaji, Wu-chun Feng, Keith R. Bisset and Rajeev S. Thakur. MPI-ACC: An Integrated and Extensible Approach to Data Movement in Accelerator-Based Systems. IEEE International Conference on High Performance Computing and Communications (HPCC). June 25–27, 2012, Liverpool, UK. [pdf] [slides]
  25. James S. Dinan, Pavan Balaji, Jeffrey R. Hammond, Sriram Krishnamoorthy and Vinod Tipparaju. Supporting the Global Arrays PGAS Model Using MPI One-Sided Communication. IEEE International Parallel and Distributed Processing Symposium (IPDPS). May 21–25, 2012, Shanghai, China. [pdf] [slides]
  26. Shucai Xiao, Pavan Balaji, James S. Dinan, Qian Zhu, Rajeev S. Thakur, Susan Coghlan, Heshan Lin, Gaojin Wen, Jue Hong and Wu-chun Feng. Transparent Accelerator Migration in a Virtualized GPU Environment. IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid). May 13–16, 2012, Ottawa, Canada. [pdf] [slides]
  27. Shucai Xiao, Pavan Balaji, Qian Zhu, Rajeev S. Thakur, Susan Coghlan, Heshan Lin, Gaojin Wen, Jue Hong and Wu-chun Feng. VOCL: An Optimized Environment for Transparent Virtualization of Graphics Processing Units. International Conference on Innovative Parallel Computing (InPar). May 12–14, 2012, San Jose, California. [pdf] [slides]
  28. Rui Wang, Erlin Yao, Pavan Balaji, Darius T. Buntinas, Mingyu Chen and Guangming Tan. Building Algorithmically Nonstop Fault Tolerant MPI Programs. IEEE International Conference on High Performance Computing (HiPC). Dec. 18–21, 2011, Bangalore, India. [pdf] [slides]
  29. Gaojin Wen, Jue Hong, Cheng-Zhong Xu, Pavan Balaji, Shengzhong Feng and Pingchuang Jiang. Energy-aware Hierarchy Scheduling of Applications in Large Scale Data Centers. International Conference on Cloud and Service Computing (CSC). Dec. 12–14, 2011, Hong Kong, China. [pdf] [slides]
  30. James S. Dinan, Sriram Krishnamoorthy, Pavan Balaji, Jeffrey R. Hammond, Manoj Krishnan, Vinod Tipparaju and Abhinav Vishnu. Noncollective Communicator Creation in MPI. The Euro MPI Users’ Group Conference (EuroMPI); special session on Improving MPI User and Developer Interaction (IMUDI). Sep. 18–21, 2011, Santorini, Greece. [pdf] [slides]
  31. Mohammad J. Rashti, Jonathan Green, Pavan Balaji, Ahmad Afsahi and William D. Gropp. Multi-core and Network Aware MPI Topology Functions. The Euro MPI Users’ Group Conference (EuroMPI). Sep. 18–21, 2011, Santorini, Greece. [pdf] [slides]
  32. Ryan E. Grant, Mohammad J. Rashti, Pavan Balaji and Ahmad Afsahi. RDMA Capable iWARP over Datagrams. IEEE International Parallel and Distributed Processing Symposium (IPDPS). May 16–20, 2011, Anchorage, Alaska. [pdf] [slides]
  33. Mohammad J. Rashti, Ryan E. Grant, Pavan Balaji and Ahmad Afsahi. iWARP Redefined: Scalable Connectionless Communication over High-Speed Ethernet. IEEE International Conference on High Performance Computing (HiPC). Dec. 19–22, 2010, Goa, India. [pdf] [slides]
  34. Abhinav Vishnu, Huub Van Dam, Wibe De Jong, Pavan Balaji and Shuaiwen Song. Fault Tolerant Communication Runtime Support for Data Centric Programming Models. IEEE International Conference on High Performance Computing (HiPC). Dec. 19–22, 2010, Goa, India. [pdf] [slides]
  35. Yang Jiao, Heshan Lin, Pavan Balaji and Wu-chun Feng. Power and Performance Characterization of Computational Kernels on the GPU. IEEE/ACM International Conference on Green Computing and Communications (GreenCom). Dec. 18–20, 2010, Hangzhou, China. [pdf] [slides]
  36. Abhinav Vishnu, Shuaiwen Song, Andres Marquez, Kevin Barker, Darren Kerbyson, Kirk W. Cameron and Pavan Balaji. Designing Energy Efficient Communication Runtime Systems for Data Centric Programming Models. IEEE/ACM International Conference on Green Computing and Communications (GreenCom). Dec. 18–20, 2010, Hangzhou, China. [pdf] [slides]
  37. David J. Goodell, Pavan Balaji, Darius T. Buntinas, Gabor Dózsa, William D. Gropp, Sameer Kumar, Bronis R. de Supinski and Rajeev S. Thakur. Minimizing MPI Resource Contention in Multithreaded Multicore Environments. IEEE International Conference on Cluster Computing (Cluster). Sep. 20–24, 2010, Heraklion, Crete, Greece. [pdf] [slides]
  38. Pavan Balaji, Darius T. Buntinas, David J. Goodell, William D. Gropp, Jayesh Krishna, Ewing L. (Rusty) Lusk and Rajeev S. Thakur. PMI: A Scalable Parallel Process-Management Interface for Extreme-Scale Systems. The Euro MPI Users’ Group Conference (Euro MPI). Sep. 12–15, 2010, Stuttgart, Germany. [pdf] [slides]
  39. Gabor Dózsa, Sameer Kumar, Pavan Balaji, Darius T. Buntinas, David J. Goodell, William D. Gropp, Joseph Ratterman and Rajeev S. Thakur. Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems. The Euro MPI Users’ Group Conference (Euro MPI). Sep. 12–15, 2010, Stuttgart, Germany. [pdf] [slides]
  40. Jayesh Krishna, Pavan Balaji, Ewing L. (Rusty) Lusk, Rajeev S. Thakur and Fab Tiller. Implementing MPI on Windows: Comparison with Common Approaches on Unix. The Euro MPI Users’ Group Conference (Euro MPI). Sep. 12–15, 2010, Stuttgart, Germany. [pdf] [slides]
  41. Ryan E. Grant, Pavan Balaji and Ahmad Afsahi. A Study of Hardware Assisted IP over InfiniBand and its Impact on Enterprise Data Center Performance. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). Mar. 28–30, 2010, White Plains, New York. [pdf] [slides]
  42. James S. Dinan, Pavan Balaji, Ewing L. (Rusty) Lusk, P. Sadayappan and Rajeev S. Thakur. Hybrid Parallel Programming with MPI and Unified Parallel C. ACM International Conference on Computing Frontiers (CF). May 17–19, 2010, Bertinoro, Italy. [pdf] [slides]
  43. Pavan Balaji, Harish Naik and Narayan Desai. Understanding Network Saturation Behavior on Large-Scale Blue Gene/P Systems. IEEE International Conference on Parallel and Distributed Systems (ICPADS). Dec. 8–10, 2009, Shenzhen, China. [pdf] [slides]
  44. Ryan E. Grant, Ahmad Afsahi and Pavan Balaji. An Evaluation of ConnectX Virtual Protocol Interconnect for Data Centers. IEEE International Conference on Parallel and Distributed Systems (ICPADS). Dec. 8–10, 2009, Shenzhen, China. [pdf] [slides]
  45. Ajeet Singh, Pavan Balaji and Wu-chun Feng. GePSeA: A General-Purpose Software Acceleration Framework for Lightweight Task Offloading. International Conference on Parallel Processing (ICPP). Sep. 22–25, 2009, Vienna, Austria. [pdf] [slides]
  46. Narayan Desai, Darius T. Buntinas, Daniel Buettner, Pavan Balaji and Anthony K. Chan. Improving Resource Availability by Relaxing Network Allocation Constraints on the Blue Gene/P. International Conference on Parallel Processing (ICPP). Sep. 22–25, 2009, Vienna, Austria. [pdf] [slides]
  47. Pavan Balaji, Darius T. Buntinas, David J. Goodell, William D. Gropp, Sameer Kumar, Ewing L. (Rusty) Lusk, Rajeev S. Thakur and Jesper Larsson Träff. MPI on a Million Processors. The Euro PVM/MPI Users’ Group Conference (Euro PVM/MPI). Outstanding Paper Award. Sep. 7–10, 2009, Espoo, Finland. [pdf] [slides]
  48. Gopalakrishnan Santhanaraman, Pavan Balaji, Karthik Gopalakrishnan, Rajeev S. Thakur, William D. Gropp and Dhabaleswar K. Panda. Natively Supporting True One-sided Communication in MPI on Multi-core Systems with InfiniBand. IEEE International Symposium on Cluster Computing and the Grid (CCGrid). May 18–21, 2009, Shanghai, China. [pdf] [slides]
  49. Pavan Balaji, Sitha Bhagvat, Rajeev S. Thakur and Dhabaleswar K. Panda. Sockets Direct Protocol for Hybrid Network Stacks: A Case Study with iWARP over 10G Ethernet. International Conference on High Performance Computing (HiPC). Dec. 17–20, 2008, Bangalore, India. [pdf] [slides]
  50. Anthony K. Chan, Pavan Balaji, William D. Gropp and Rajeev S. Thakur. Communication Analysis of Parallel 3D FFT for Flat Cartesian Meshes on Large Blue Gene Systems. International Conference on High Performance Computing (HiPC). Dec. 17–20, 2008, Bangalore, India. [pdf] [slides]
  51. Mithlesh Kumar, Vineeta Chaube, Pavan Balaji, Wu-chun Feng and Hyun-Wook Jin. Making a Case for Proactive Flow Control in Optical Circuit-Switched Networks. International Conference on High Performance Computing (HiPC). Dec. 17–20, 2008, Bangalore, India. [pdf] [slides]
  52. Heshan Lin, Pavan Balaji, Ruth Poole, Carlos Sosa, Xiaosong Ma and Wu-chun Feng. Massively Parallel Genomic Sequence Search on the Blue Gene/P Architecture. IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC). Nov. 15–21, 2008, Austin, Texas. [pdf] [slides]
  53. Thomas R. W. Scogland, Ganesh Narayanaswamy, Pavan Balaji and Wu-chun Feng. Asymmetric Interactions in Symmetric Multi-core Systems: Analysis, Enhancements and Evaluation. IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC). Nov. 15–21, 2008, Austin, Texas. [pdf] [slides]
  54. Narayan Desai, Pavan Balaji, P. Sadayappan and Mohammad Kamrul Islam. Are Non-Blocking Networks Really Needed for High-End-Computing Workloads? IEEE International Conference on Cluster Computing (Cluster). Best Paper Award. Sep. 29 – Oct. 1st, 2008, Tsukuba, Japan. [pdf] [slides]
  55. Pavan Balaji, Anthony K. Chan, William D. Gropp, Rajeev S. Thakur and Ewing L. (Rusty) Lusk. Non-Data-Communication Overheads in MPI: Analysis on Blue Gene/P. The Euro PVM/MPI Users’ Group Conference (Euro PVM/MPI). Outstanding Paper Award. Sep. 7–10, 2008, Dublin, Ireland. [pdf] [slides]
  56. Pavan Balaji, Darius T. Buntinas, David J. Goodell, William D. Gropp and Rajeev S. Thakur. Toward Efficient Support for Multithreaded MPI Communication. The Euro PVM/MPI Users’ Group Conference (Euro PVM/MPI). Sep. 7–10, 2008, Dublin, Ireland. [pdf] [slides]
  57. Jesper Larsson Träff, Andreas Ripke, Christian Siebert, Pavan Balaji, Rajeev S. Thakur and William D. Gropp. A Simple, Pipelined Algorithm for Large, Irregular All-gather Problems. The Euro PVM/MPI Users’ Group Conference (Euro PVM/MPI). Sep. 7–10, 2008, Dublin, Ireland. [pdf] [slides]
  58. Ganesh Narayanaswamy, Pavan Balaji and Wu-chun Feng. Impact of Network Sharing in Multi-core Architectures. IEEE International Conference on Computer Communication and Networks (ICCCN). Aug. 3–7, 2008, St. Thomas, U.S. Virgin Islands. [pdf] [slides]
  59. Pavan Balaji, Wu-chun Feng and Heshan Lin. Semantics-based Distributed I/O with the ParaMEDIC Framework. ACM/IEEE International Symposium on High Performance Distributed Computing (HPDC). Jun. 23–27, 2008, Boston, Massachusetts. [pdf] [slides]
  60. Pavan Balaji, Wu-chun Feng, Heshan Lin, Jeremy Archuleta, Satoshi Matsuoka, Andrew Warren, João Carlos Setubal, Ewing L. (Rusty) Lusk, Rajeev S. Thakur, Ian Foster, Daniel S. Katz, Shantenu Jha, Kevin Shinpaugh, Susan Coghlan and Daniel A. Reed. Distributed I/O with ParaMEDIC: Experiences with a Worldwide Supercomputer. International Supercomputing Conference (ISC). Outstanding Paper Award. Jun. 17–20, 2008, Dresden, Germany. [pdf] [slides]
  61. Pavan Balaji, Wu-chun Feng, Jeremy Archuleta, Heshan Lin, Rajkumar Kettimuthu, Rajeev S. Thakur and Xiaosong Ma. Semantics-based Distributed I/O for mpiBLAST. Short paper. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). Feb. 20–23, 2008, Salt Lake City, Utah. [pdf] [poster]
  62. Pavan Balaji, Wu-chun Feng, Sitha Bhagvat, Dhabaleswar K. Panda, Rajeev S. Thakur and William D. Gropp. Analyzing the Impact of Supporting Out-of-Order Communication on In-order Performance with iWARP. IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC). Nov. 10–16, 2007, Reno, Nevada. [pdf] [slides]
  63. Pavan Balaji, Wu-chun Feng, Jeremy Archuleta and Heshan Lin. ParaMEDIC: Parallel Metadata Environment for Distributed I/O and Computing. IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC). Storage Challenge Award. Nov. 10–16, 2007, Reno, Nevada. [pdf] [slides]
  64. Pavan Balaji, Sitha Bhagvat, Dhabaleswar K. Panda, Rajeev S. Thakur and William D. Gropp. Advanced Flow-control Mechanisms for the Sockets Direct Protocol over InfiniBand. IEEE International Conference on Parallel Processing (ICPP). Sep. 10–14, 2007, Xi’an, China. [pdf] [slides]
  65. Mohammad Kamrul Islam, Pavan Balaji, Gerald Sabin and P. Sadayappan. Analyzing and Minimizing the Impact of Opportunity Cost in QoS-aware Job Scheduling. IEEE International Conference on Parallel Processing (ICPP). Sep. 10–14, 2007, Xi’an, China. [pdf] [slides]
  66. Ganesh Narayanaswamy, Pavan Balaji and Wu-chun Feng. An Analysis of 10-Gigabit Ethernet Protocol Stacks in Multicore Environments. IEEE International Symposium on High-Performance Interconnects (HotI). Aug. 22–24, 2007, Palo Alto, California. [pdf] [slides]
  67. Pavan Balaji, Darius T. Buntinas, Satish Balay, Barry F. Smith, Rajeev S. Thakur and William D. Gropp. Nonuniformly Communicating Noncontiguous Data: A Case Study with PETSc and MPI. IEEE International Parallel and Distributed Processing Symposium (IPDPS). Mar. 26–30, 2007, Long Beach, California. [pdf] [slides]
  68. Pavan Balaji, Wu-chun Feng, Qi Gao, Ranjit Noronha, Weikuan Yu and Dhabaleswar K. Panda. Head-to-TOE Comparison for High Performance Sockets over Protocol Offload Engines. IEEE International Conference on Cluster Computing (Cluster). Sep. 26–30, 2005, Boston, Massachusetts. [pdf] [slides]
  69. Wu-chun Feng, Pavan Balaji, Christopher Baron, Laxmi N. Bhuyan and Dhabaleswar K. Panda. Performance Characterization of a 10-Gigabit Ethernet TOE. IEEE International Symposium on High Performance Interconnects (HotI). Aug. 17–19, 2005, Palo Alto, California. [pdf] [slides]
  70. Sundeep Narravula, Pavan Balaji, Karthikeyan Vaidyanathan, Hyun-Wook Jin and Dhabaleswar K. Panda. Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data-Centers over InfiniBand. IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid). May 9–12, 2005, Cardiff, UK. [pdf] [slides]
  71. Pavan Balaji, Karthikeyan Vaidyanathan, Sundeep Narravula, Hyun-Wook Jin and Dhabaleswar K. Panda. On the Provision of Prioritization and Soft QoS in Dynamically Reconfigurable Shared Data-Centers over InfiniBand. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). Mar. 20–22, 2005, Austin, Texas. [pdf] [slides]
  72. Mohammad Kamrul Islam, Pavan Balaji, P. Sadayappan and Dhabaleswar K. Panda. Towards Provision of Quality of Service Guarantees in Job Scheduling. IEEE International Conference on Cluster Computing (Cluster). Sep. 20–23, 2004, San Diego, California. [pdf] [slides]
  73. Pavan Balaji, Sundeep Narravula, Karthikeyan Vaidyanathan, Savitha Krishnamoorthy, Jiesheng Wu and Dhabaleswar K. Panda. Sockets Direct Protocol over InfiniBand in Clusters: Is it Beneficial? IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). Mar. 10–12, 2004, Austin, Texas. [pdf] [slides]
  74. Rohan Kurian, Pavan Balaji and P. Sadayappan. Opportune Job Shredding: An Efficient Approach for Scheduling Parameter Sweep Applications. Los Alamos Computer Science Institute (LACSI) Symposium. Oct. 12–14, 2003, Santa Fe, New Mexico. [pdf] [slides]
  75. Pavan Balaji, Jiesheng Wu, Tahsin Kurc, Ümit V. Çatalyürek, Dhabaleswar K. Panda and Joel Saltz. Impact of High Performance Sockets on Data Intensive Applications. IEEE International Symposium on High Performance Distributed Computing (HPDC). Jun. 22–24, 2003, Seattle, Washington. [pdf] [slides]
  76. Rinku K. Gupta, Pavan Balaji, Jarek Nieplocha and Dhabaleswar K. Panda. Efficient Collective Operations using Remote Memory Operations on VIA-based Clusters. IEEE International Parallel and Distributed Processing Symposium (IPDPS). Apr. 22–26, 2003, Nice, France. [pdf] [slides]
  77. Pavan Balaji, Piyush Shivam, Peter Wyckoff and Dhabaleswar K. Panda. High Performance User-level Sockets over Gigabit Ethernet. IEEE International Conference on Cluster Computing (Cluster). Sep. 23–26, 2002, Chicago, Illinois. [pdf] [slides]


Refereed Workshop Publications

  1. Antonio J. Peña and Pavan Balaji. A Framework for Tracking Memory Accesses in Scientific Applications. International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2); held in conjunction with the International Conference on Parallel Processing (ICPP). Sep. 12th, 2014, Minneapolis, Minnesota.
  2. Ralf Gunter Correa Carvalho, David J. Goodell, James S. Dinan, and Pavan Balaji. Optimizing Charm++ over MPI. Annual Workshop on Charm++ and its Applications. Apr. 15–16, 2013, Urbana-Champaign, Illinois. [pdf] [slides]
  3. Ashwin M. Aji, Pavan Balaji, James S. Dinan, Wu-chun Feng and Rajeev S. Thakur. Synchronization and Ordering Semantics in Hybrid MPI+GPU Programming. Workshop on Accelerators and Hybrid Exascale Systems (AsHES); held in conjunction with the IEEE International Parallel and Distributed Processing Symposium (IPDPS). May 20th, 2013, Boston, Massachusetts. [pdf] [slides]
  4. Rinku K. Gupta, Kamil Iskra, Kazutomo Yoshii, Peter H. Beckman and Pavan Balaji. Introspective Fault Tolerance for Exascale Systems. U.S. Department of Energy Advanced Scientific Computing Research, OS and Runtime Technical Council Workshop. Oct. 4–5, 2012, Washington, DC. [pdf] [slides]
  5. Feng Ji, James S. Dinan, Darius T. Buntinas, Pavan Balaji, Xiaosong Ma and Wu-chun Feng. Optimizing GPU-to-GPU intra-node communication in MPI. Workshop on Accelerators and Hybrid Exascale Systems (AsHES); held in conjunction with the IEEE International Parallel and Distributed Processing Symposium (IPDPS). May 25th, 2012, Shanghai, China. [pdf] [slides]
  6. Jeffrey A. Stuart, Pavan Balaji, and John D. Owens. Extending MPI to Accelerators. Workshop on Architectures and Systems for Big Data (ASBD); held in conjunction with the International Conference on Parallel Architectures and Compilation Techniques (PACT). Oct. 10th, 2011, Galveston Island, Texas. [pdf] [slides]
  7. Abhinav Vishnu, Manoj Krishnan and Pavan Balaji. Dynamic Time-Variant Connection Management for PGAS Models on InfiniBand. Workshop on Communication Architecture for Scalable Systems (CASS); held in conjunction with the IEEE International Parallel and Distributed Processing Symposium (IPDPS). May 16th, 2011, Anchorage, Alaska. [pdf] [slides]
  8. Pavan Balaji, Sitha Bhagvat, Hyun-Wook Jin and Dhabaleswar K. Panda. Asynchronous Zero-copy Communication for Synchronous Sockets in the Sockets Direct Protocol (SDP) over InfiniBand. Workshop on Communication Architecture for Clusters (CAC); held in conjunction with the IEEE International Parallel and Distributed Processing Symposium (IPDPS). Apr. 25th, 2006, Rhodes Island, Greece. [pdf] [slides]
  9. Venkatram Vishwanath, Pavan Balaji, Wu-chun Feng, Jason Leigh and Dhabaleswar K. Panda. A Case for UDP Offload Engines in LambdaGrids. Workshop on Protocols for Fast Long-Distance Networks (PFLDnet). Feb. 2–3, 2006, Nara, Japan. [pdf] [slides]
  10. Pavan Balaji, Hyun-Wook Jin, Karthikeyan Vaidyanathan and Dhabaleswar K. Panda. Supporting iWARP Compatibility and Features for Regular Network Adapters. Workshop on Remote Direct Memory Access (RDMA): Applications, Implementations and Technologies (RAIT); held in conjunction with IEEE International Conference on Cluster Computing (Cluster). Sep. 26th, 2005, Boston, Massachusetts. [pdf] [slides]
  11. Hyun-Wook Jin, Sundeep Narravula, Gregory Brown, Karthikeyan Vaidyanathan, Pavan Balaji and Dhabaleswar K. Panda. Performance Evaluation of RDMA over IP Networks: A Study with the Ammasso Gigabit Ethernet NIC. Workshop on High Performance Interconnects for Distributed Computing (HPI-DC); held in conjunction with IEEE International Symposium on High Performance Distributed Computing (HPDC). Jul. 24th, 2005, Research Triangle Park, North Carolina. [pdf] [slides]
  12. Karthikeyan Vaidyanathan, Pavan Balaji, Hyun-Wook Jin and Dhabaleswar K. Panda. Workload driven analysis of File Systems in Shared Multi-Tier Data-Centers over InfiniBand. Workshop on Computer Architecture Evaluation using Commercial Workloads (CAECW); held in conjunction with IEEE International Symposium on High Performance Computer Architecture (HPCA). Feb. 12th, 2005, San Francisco, California. [pdf] [slides]
  13. Pavan Balaji, Hemal V. Shah and Dhabaleswar K. Panda. Sockets vs. RDMA Interface over 10-Gigabit Networks: An In-depth Analysis of the Memory Traffic Bottleneck. Workshop on Remote Direct Memory Access (RDMA): Applications, Implementations and Technologies (RAIT); held in conjunction with IEEE International Conference on Cluster Computing (Cluster). Sep. 20th, 2004, San Diego, California. [pdf] [slides]
  14. Pavan Balaji, Karthikeyan Vaidyanathan, Sundeep Narravula, Savitha Krishnamoorthy, Hyun-Wook Jin and Dhabaleswar K. Panda. Exploiting Remote Memory Operations to Design Efficient Reconfiguration for Shared Data-Centers over InfiniBand. Workshop on Remote Direct Memory Access (RDMA): Applications, Implementations and Technologies (RAIT); held in conjunction with IEEE International Conference on Cluster Computing (Cluster). Sep. 20th, 2004, San Diego, California. [pdf] [slides]
  15. Sundeep Narravula, Pavan Balaji, Karthikeyan Vaidyanathan, Savitha Krishnamoorthy, Jiesheng Wu and Dhabaleswar K. Panda. Supporting Strong Coherency for Active Caches in Multi-Tier Data-Centers over InfiniBand. Workshop on System Area Networks (SAN); held in conjunction with IEEE International Symposium on High Performance Computer Architecture (HPCA). Feb. 14th, 2004, Madrid, Spain. [pdf] [slides]
  16. Mohammad Kamrul Islam, Pavan Balaji, P. Sadayappan and Dhabaleswar K. Panda. QoPS: A QoS based scheme for Parallel Job Scheduling. Job Scheduling Strategies for Parallel Processing (JSSPP) workshop; held in conjunction with IEEE International Symposium on High Performance Distributed Computing (HPDC). Jun. 24th, 2003, Seattle, Washington. [pdf] [slides]


Refereed Poster Papers

  1. David Ozog, Jeffrey R. Hammond, James S. Dinan, Pavan Balaji, Sameer Shende and Allen D. Malony. Inspector-Executor Load Balancing Algorithms for Block-Sparse Tensor Contractions. ACM International Conference on Supercomputing (ICS). Jun. 10–14, 2013, Eugene, Oregon. [pdf]
  2. Zachary Rubenstein, Hajime Fujita, Guoming Lu, Aiman Fang, Ziming Zheng, Andrew A. Chien, Pavan Balaji, Kamil Iskra, Peter H. Beckman, James S. Dinan, Jeffrey R. Hammond, Robert Schreiber. The Global View Resilience Model. Greater Chicago Area System Research Workshop (GCASR). May 3rd, 2013, Evanston, Illinois. [poster]
  3. Jintao Meng, Bingqiang Wang, Yanjie Wei, Shengzhong Feng, Jiefeng Cheng and Pavan Balaji. SWAP-Assembler: A Scalable De Bruijn Graph Based Assembler for Massive Genome Data. International Conference on Research in Computational Molecular Biology (RECOMB). Apr. 7–10, 2013, Beijing, China. [poster]
  4. James S. Dinan, Pavan Balaji, Jeffrey R. Hammond, Sriram Krishnamoorthy, and Vinod Tipparaju. High-Level, One-Sided Programming Models on MPI: A Case Study with Global Arrays and NWChem. IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC). Nov. 12–18, 2011, Seattle, Washington. [pdf] [poster]
  5. Jeffrey R. Hammond, Sreeram Potluri, Zheng (Cynthia) Gu, Alex Dickson, James S. Dinan, Ivo Kabadshow, Pavan Balaji, and Vinod Tipparaju. Fast One-Sided Communication on Supercomputers and Application to Three Scientific Codes. IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC). Nov. 12–18, 2011, Seattle, Washington. [poster]


Invited Papers

  1. Rajeev S. Thakur, Pavan Balaji, Darius T. Buntinas, David J. Goodell, William D. Gropp, Torsten Hoefler, Sameer Kumar, Ewing L. (Rusty) Lusk and Jesper Larsson Träff. MPI at Exascale. Department of Energy SciDAC workshop. Jul. 11–15, 2010, Chattanooga, Tennessee. [pdf] [slides]
  2. Wu-chun Feng, Pavan Balaji and Ajeet Singh. Network Interface Cards as First-Class Citizens. Workshop on The Influence of I/O on Microprocessor Architecture (IOM); held in conjunction with the IEEE International Symposium on High Performance Computer Architecture (HPCA). Feb. 15th, 2009, Raleigh, North Carolina. [pdf] [slides]
  3. Karthikeyan Vaidyanathan, Sundeep Narravula, Pavan Balaji and Dhabaleswar K. Panda. Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers. Workshop on the National Science Foundation Next Generation Software (NSFNGS) Program; held in conjunction with the IEEE International Parallel and Distributed Processing Symposium (IPDPS). Mar. 26th, 2007, Long Beach, California. [pdf] [slides]
  4. Pavan Balaji, Karthikeyan Vaidyanathan, Sundeep Narravula, Hyun-Wook Jin and Dhabaleswar K. Panda. Designing Next Generation Data-centers with Advanced Communication Protocols and Systems Services. Workshop on the National Science Foundation Next Generation Software (NSFNGS) Program; held in conjunction with the IEEE International Parallel and Distributed Processing Symposium (IPDPS). Apr. 25th, 2006, Rhodes Island, Greece. [pdf] [slides]


Meeting Reports

  1. Sonia R. Sachs, Katherine Yelick, Saman Amarasinghe, Mary Hall, Richard Lethin, Keshav Pingali, Dan Quinlan, Vivek Sarkar, John Shalf, Robert Lucas, Pavan Balaji, Pedro C. Diniz, Alice Koniges, and Marc Snir. Exascale Programming Challenges Workshop Report. The ASCR Programming Models Workshop, July, 2011. [pdf]
  2. Jack A. Gilbert, Folker Meyer, Dion Antonopoulos, Pavan Balaji, Christopher T. Brown, Narayan Desai, Jonathan A. Eisen, Dick Evers, Dawn Field, Wu-chun Feng, Daniel Huson, Janet Jansson, Rob Knight, James Knight, Eugene Kolker, Kostas Konstantindis, Joel Kostka, Nikos Kyrpides, Rachel Mackelprang, Alice McHardy, Christopher Quince, Jeroen Raes, Alexander Sczyrba, Ashley Shade, and Rick Stevens. Meeting Report: The Terabase Metagenomics Workshop and the Vision of an Earth Microbiome Project. Institute of Computing in Science (ICiS) Workshop on the Earth Microbiome Project (EMP), 2010. [pdf]


International Standards

  1. MPI: A Message-Passing Interface Standard, Version 3.0. The Message Passing Interface Forum, Sep. 21st, 2012. [pdf]
  2. MPI: A Message-Passing Interface Standard, Version 2.2. The Message Passing Interface Forum, Sep. 4th, 2009. [pdf]
  3. MPI: A Message-Passing Interface Standard, Version 2.1. The Message Passing Interface Forum, Jun. 23rd, 2008. [pdf]


Technical Reports

  1. Karthikeyan Vaidyanathan, Sitha Bhagvat, Pavan Balaji and Dhabaleswar K. Panda. Understanding the Significance of Network Performance in End Applications: A Case Study with EtherFabric and InfiniBand. Technical Report, OSU-CISRC-2/06-TR19, The Ohio State University. Feb, 2006. [pdf]
  2. Karthikeyan Vaidyanathan, Pavan Balaji, Jiesheng Wu, Hyun-Wook Jin and Dhabaleswar K. Panda. An Architectural Study of Cluster-based Multi-tier Data-Centers. Technical Report, OSU-CISRC-5/04-TR25, The Ohio State University. May, 2004. [pdf]
  3. Savitha Krishnamoorthy, Pavan Balaji, Karthikeyan Vaidyanathan, Hyun-Wook Jin and Dhabaleswar K. Panda. Dynamic Reconfigurability Support for providing Soft Quality of Service Guarantees in Multi-Tier Data-Centers over InfiniBand. Technical Report, OSU-CISRC-2/04-TR10, The Ohio State University. Feb, 2004. [pdf]