PAVAN BALAJI

Photo Bldg. 221, Room: D-251,
Mathematics and Computer Science Division,
Argonne National Laboratory,
9700 South Cass Avenue,
Argonne, Illinois 60439-4844

Tel: 630-252-3017
Fax: 630-252-5986
Email: balaji (at) mcs (dot) anl (dot) gov


I hold a joint appointment as an assistant computer scientist in the Mathematics and Computer Science (MCS) Division at the Argonne National Laboratory (ANL), and as a fellow of the Computation Institute at the University of Chicago. My areas of research interest include Parallel Programming Models, High-speed Networking Systems, Parallel and Distributed Computing, Job Scheduling and Resource Management and the general areas of Cluster and Grid Computing. More specific research projects can be found below. The information on this website is getting outdated quickly, so please see my CV below for more up-to-date information.


More Links:

Research Work | Publications | CV | Personal Useful Links | Other Useful Links



CHAIRMANSHIPS and EDITORSHIPS:



CURRENT RESEARCH PROJECTS:

  1. MPICH2: An Implementation of the Message Passing Interface (MPI)

  2. Sockets Direct Protocol (SDP) and other High Performance Sockets Implementations

  3. Argonne Micro-benchmark Suite

  4. High-speed Networking Stacks (InfiniBand, iWARP, Myrinet)

  5. Cluster-based Commercial Data-centers

  6. Job Scheduling and Resource Management Techniques


PUBLICATIONS

Journal Articles:

  1. P. Balaji, W. Feng and D. K. Panda, Bridging the Ethernet-Ethernot Performance Gap. IEEE Micro Journal Special Issue on High-Performance Interconnects, pp. 24-40, May/June Volume, Issue 3, 2006.

  2. H. -W. Jin, P. Balaji, C. Yoo, J . Y. Choi and D. K. Panda, Exploiting NIC Architectural Support for Enhancing IP based Protocols on High Performance Networks. Special Issue of the Journal of Parallel and Distributed Computing (JPDC) on Design and Performance of Networks for Super-, Cluster- and Grid-Computing, pp. 1348-1365, Vol. 65, Issue 11, 2005.

  3. M. Islam, P. Balaji, P. Sadayappan and D. K. Panda, QoPS: A QoS based scheme for Parallel Job Scheduling (extended journal version). IEEE Springer LNCS Journal Series, pp. 252-268, Vol. 2862, 2003.

Invited Papers:

  1. K. Vaidyanathan, S. Narravula, P. Balaji and D. K. Panda, Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers. In the workshop on the National Science Foundation Next Generation Software (NSFNGS) Program; in conjunction with the IEEE International Parallel and Distributed Processing Symposium (IPDPS), Mar 26th, 2007, Long Beach, California.

  2. P. Balaji, K. Vaidyanathan, S. Narravula, H. -W. Jin, and D. K. Panda, Designing Next Generation Data-centers with Advanced Communication Protocols and Systems Services. In the workshop on the National Science Foundation Next Generation Software (NSFNGS) Program; in conjunction with the IEEE International Parallel and Distributed Processing Symposium (IPDPS), Apr 25th, 2006, Rhodes Island, Greece.

Conference and Workshop Publications:

  1. P. Balaji, S. Bhagvat, R. Thakur and D. K. Panda. Sockets Direct Protocol for Hybrid Network Stacks: A Case Study with iWARP over 10G Ethernet. International Conference on High Performance Computing (HiPC), Dec. 17-20, 2008, Bangalore, India.

  2. A. Chan, P. Balaji, W. Gropp and R. Thakur. Communication Analysis of Parallel 3D FFT for Flat Cartesian Meshes on Large Blue Gene Systems. International Conference on High Performance Computing (HiPC), Dec. 17-20, 2008, Bangalore, India.

  3. M. Kumar, V. Chaube, P. Balaji, W. Feng and H.-W. Jin. Making a Case for Proactive Flow Control in Optical Circuit-Switched Networks. International Conference on High Performance Computing (HiPC), Dec. 17-20, 2008, Bangalore, India.

  4. H. Lin, P. Balaji, R. Poole, C. Sosa, X. Ma and W. Feng. Massively Parallel Genomic Sequence Search on the Blue Gene/P Architecture. IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Nov. 15-21, 2008, Austin, Texas.

  5. T. Scogland, P. Balaji, W. Feng and G. Narayanaswamy. Asymmetric Interactions in Symmetric Multi-core Systems: Analysis, Enhancements and Evaluation. IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Nov. 15-21, 2008, Austin, Texas.

  6. N. Desai, P. Balaji, P. Sadayappan and M. Islam. Are Non-Blocking Networks Really Needed for High-End-Computing Workloads?. IEEE International Conference on Cluster Computing (Cluster), Best Paper Award Sep. 29 - Oct. 1st, 2008, Tsukuba, Japan.

  7. P. Balaji, A. Chan, W. Gropp, R. Thakur and E. Lusk. Non-Data-Communication Overheads in MPI: Analysis on Blue Gene/P. The Euro PVM/MPI Users' Group Conference (EuroPVM/MPI), Outstanding Paper Award, Sep. 7-10, 2008, Dublin, Ireland.

  8. P. Balaji, D. Buntinas, D. Goodell, W. Gropp and R. Thakur. Toward Efficient Support for Multithreaded MPI Communication. The Euro PVM/MPI Users' Group Conference (EuroPVM/MPI), Sep. 7-10, 2008, Dublin, Ireland.

  9. J. L. Traff, A. Ripke, C. Siebert, P. Balaji, R. Thakur and W. Gropp. A Simple, Pipelined Algorithm for Large, Irregular All-gather Problems. The Euro PVM/MPI Users' Group Conference (EuroPVM/MPI), Sep. 7-10, 2008, Dublin, Ireland.

  10. G. Narayanaswamy, P. Balaji and W. Feng. Impact of Network Sharing in Multi-core Architectures. IEEE International Conference on Computer Communication and Networks (ICCCN), Aug. 3-7, 2008, St. Thomas, U.S. Virgin Islands.

  11. P. Balaji, W. Feng, H. Lin, J. Archuleta, S. Matsuoka, A. Warren, J. Setubal, E. Lusk, R. Thakur, I. Foster, D. S. Katz, S. Jha, K. Shinpaugh, S. Coghlan and D. Reed. Distributed I/O with ParaMEDIC: Experiences with a Worldwide Supercomputer. International Supercomputing Conference (ISC), Outstanding Paper Award, Jun. 17-20, 2008, Dresden, Germany.

  12. P. Balaji, W. Feng, and H. Lin. Semantics-based Distributed I/O with the ParaMEDIC Framework. In the ACM/IEEE International Symposium on High Performance Distributed Computing (HPDC), Jun 23-27, 2008, Boston, Massachusetts.

  13. P. Balaji, W. Feng, J. Archuleta, H. Lin, R. Kettimuttu, R. Thakur and X. Ma. Semantics-based Distributed I/O for mpiBLAST (short paper). In the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), Feb 20-23, 2008, Salt Lake City, Utah.

  14. P. Balaji, W. Feng, S. Bhagvat, D. K. Panda, R. Thakur and W. Gropp. Analyzing the Impact of Supporting Out-of-Order Communication on In-order Performance with iWARP. In the IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Nov 10th to 16th, 2007, Reno, Nevada.

  15. P. Balaji, W. Feng, J. Archuleta and H. Lin. ParaMEDIC: Parallel Metadata Environment for Distributed I/O and Computing. In the IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC). Storage Challenge Award Winner, Nov 10th to 16th, 2007, Reno, Nevada.

  16. G. Narayanaswamy, P. Balaji, and W. Feng, An Analysis of 10-Gigabit Ethernet Protocol Stacks in Multicore Environments. In the IEEE International Symposium on High Performance Interconnects (HotI), Aug 22-24, 2007, Palo Alto, California.

  17. P. Balaji, S. Bhagvat, D. K. Panda, R. Thakur, and W. Gropp, Advanced Flow-control Mechanisms for the Sockets Direct Protocol over InfiniBand. In the IEEE International Conference on Parallel Processing (ICPP), Sep 10-14, 2007, XiAn, China.

  18. M. Islam, P. Balaji, G. Sabin and P. Sadayappan, Analyzing and Minimizing the Impact of Opportunity Cost in QoS-aware Job Scheduling. In the IEEE International Conference on Parallel Processing (ICPP), Sep 10-14, 2007, XiAn, China.

  19. P. Balaji, D. Buntinas, S. Balay, B. Smith, R. Thakur and W. Gropp, Nonuniformly Communicating Noncontiguous Data: A Case Study with PETSc and MPI. In the IEEE Parallel and Distributed Processing Symposium (IPDPS), Mar 26-30, 2007, Long Beach, California.

  20. P. Balaji, S. Bhagvat, H. -W. Jin and D. K. Panda, Asynchronous Zero-copy Communication for Synchronous Sockets in the Sockets Direct Protocol (SDP) over InfiniBand. In the workshop on Communication Architecture for Clusters (CAC); in conjunction with the IEEE International Parallel and Distributed Processing Symposium (IPDPS), Apr 25th, 2006, Rhodes Island, Greece.

  21. V. Viswanathan, P. Balaji, W. Feng, J. Leigh, D. K. Panda, A Case for UDP Offload Engines in LambdaGrids. In the workshop on Protocols for Fast Long-Distance Networks (PFLDnet), Feb 2nd and 3rd, 2006, Nara, Japan.

  22. P. Balaji, W. Feng, Q. Gao, R. Noronha, W. Yu and D. K. Panda, Head-to-TOE Evaluation of High Performance Sockets over Protocol Offload Engines. In the proceedings of the IEEE International Conference on Cluster Computing, Sep 27-30, 2005, Boston, Massachusetts.

  23. P. Balaji, H. -W. Jin, K. Vaidyanathan and D. K. Panda, Supporting iWARP Compatibility and Features for Regular Network Adapters. In the proceedings of the workshop on Remote Direct Memory Access (RDMA): Applications, Implementations, and Technologies (RAIT); held in conjunction with the IEEE International Conference on Cluster Computing, Sep 26th, 2005, Boston, Massachusetts.

  24. H. -W. Jin, S. Narravula, G. Brown, K. Vaidyanathan. P. Balaji and D. K. Panda, Performance Evaluation of RDMA over IP: A Case Study with the Ammasso Gigabit Ethernet NIC. In the workshop on High Performance Interconnects for Distributed Computing (HPI-DC); to be held in conjunction with the 14th International Symposium on High Performance Distributed Computing (HPDC-14), Jul 24th, 2005, Research Triangle Park, NC.

  25. W. Feng, P. Balaji, C. Baron, L. N. Bhuyan and D. K. Panda, Performance Characterization of a 10-Gigabit Ethernet TOE. In the proceedings of the IEEE International Symposium on High-Performance Interconnects (HotI), Aug 17-19, 2005, Palo Alto, California.

  26. P. Balaji, S. Narravula, K. Vaidyanathan, H. -W. Jin and D. K. Panda, On the Provision of Prioritization and Soft QoS in Dynamically Reconfigurable Shared Data-Centers over InfiniBand. In the proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Mar 20-22, 2005, Austin, Texas.

  27. K. Vaidyanathan, P. Balaji, H. -W. Jin and D. K. Panda, Workload-driven Analysis of File Systems in Shared Multi-tier Data-Centers over InfiniBand. In the eighth workshop on Computer Architecture Evaluation using Commercial Workloads (CAECW-8); to be held in conjunction with the 11th International Symposium on High Performance Computer Architecture (HPCA-11), Feb 12, 2005, San Francisco, CA.

  28. S. Narravula, P. Balaji, K. Vaidyanathan, H. -W. Jin and D. K. Panda, Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data-Centers over InfiniBand. In the proceedings of the IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid), May 9-12, 2005, Cardiff, UK.

  29. P. Balaji, H. Shah and D. K. Panda, Sockets vs RDMA Interface over 10-Gigabit Networks: An In-depth analysis of the Memory Traffic Bottleneck. In the workshop on Remote Direct Memory Access (RDMA): Applications, Implementations, and Technologies (RAIT); held in conjunction with the IEEE International Conference on Cluster Computing, Sep 20th, 2004, San Diego, California.

  30. P. Balaji, K. Vaidyanathan, S. Narravula, S. Krishnamoorthy, H. -W. Jin and D. K. Panda, Exploiting Remote Memory Operations to Design Efficient Reconfiguration for Shared Data-Centers over InfiniBand. In the workshop on Remote Direct Memory Access (RDMA): Applications, Implementations, and Technologies (RAIT); held in conjunction with the IEEE International Conference on Cluster Computing, Sep 20th, 2004, San Diego, California.

  31. M. Islam, P. Balaji, P. Sadayappan and D. K. Panda, Towards Provision of Quality of Service Guarantees in Job Scheduling. In the proceedings of the IEEE International Conference on Cluster Computing, Sep 20-23, 2004, San Diego, California.

  32. S. Narravula, P. Balaji, K. Vaidyanathan, S. Krishnamoorthy, J. Wu and D. K. Panda, Supporting Strong Coherency for Active Caches in Multi-Tier Data-Centers over InfiniBand. In the workshop on System Area Networks (SAN); held in conjuntion with the IEEE International Symposium on High Performance Computer Architecture (HPCA), Feb 14th, 2004, Madrid, Spain.

  33. P. Balaji, S. Narravula, K. Vaidyanathan, S. Krishnamoorthy, J. Wu and D. K. Panda, Sockets Direct Protocol over InfiniBand in Clusters: Is it Beneficial?. In the proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Mar 10-12, 2004, Austin, Texas.

  34. R. Kurian, P. Balaji, P. Sadayappan, Opportune Job Shredding: An Effective approach for scheduling Parameter Sweep Applications. In the proceedings of the Los Alamos Computer Science Institute Symposium (LACSI), Oct 12-14, 2003, Santa Fe, New Mexico.

  35. M. Islam, P. Balaji, P. Sadayappan and D. K. Panda, QoPS: A QoS based scheme for Parallel Job Scheduling. In the Job Scheduling Strategies for Parallel Processing workshop (JSSPP); held in conjunction with the IEEE International Symposium on High Performance Distributed Computing (HPDC), Jun 24th, 2003, Seattle, WA.

  36. P. Balaji, J. Wu, T. Kurc, U. Catalyurek, D. K. Panda and J. Saltz, Impact of High Performance Sockets on Data Intensive Applications. In the proceedings of the IEEE International Symposium on High Performance Distributed Computing (HPDC), Jun 22-24, 2003, Seattle, WA.

  37. R. Gupta, P. Balaji, J. Nieplocha and D. K. Panda, Efficient Collective Operations using Remote Memory Operations on VIA-Based Clusters. In the proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS), Apr 22-26, 2003, Nice, France.

  38. P. Balaji, P. Shivam, P. Wyckoff and D. K. Panda, High Performance User-Level Sockets over Gigabit Ethernet. In the proceedings of the IEEE International Conference on Cluster Computing, Sept 23-26, 2002, Chicago, IL.


PERSONAL USEFUL LINKS:



OTHER USEFUL LINKS:




Pavan Balaji
Last Updated: Nov. 19, 2008.