Books Edited
-
P. Balaji and A. Vishnu. Programming Models,
Software and Tools for High-End Computing; Special
Issue of the International Journal of High
Perforformance Computing Applications (IJHPCA). To be
published.
-
P. Balaji and A. Vishnu. Programming Models and
Systems Software Support for High-End Computing
Applications; Special Issue of the International Journal
of High Perforformance Computing Applications (IJHPCA),
Volume 24, No. 3, Jul, 2010.
-
W. Feng and P. Balaji. Tools and Environments for
Multicore and Many-core Architectures. Special Issue of
IEEE Computer, Dec, 2009.
Book chapters:
-
D. K. Panda, P. Balaji, S. Sur and
M. Koop. Commodity High Performance
Interconnects; Chapter in the book Attaining
High Performance Communication: A Vertical
Approach. CRC Press, 2009.
-
W. Feng and P. Balaji. Ethernet vs.
Ethernot; Chapter in the book Attaining High
Performance Communication: A Vertical
Approach. CRC Press, 2009.
-
P. Balaji, P. Sadayappan and
M. Islam. Techniques on Providing Hard Quality of
Service Guarantees in Job Scheduling; Chapter in
the
book Market-Oriented
Grid and Utility Computing. Wiley Publishers,
2008.
Magazine Articles
-
P. Balaji. Are Power-Conscious Hardware
Architectures the Next Step in High-End
Computing?. HPC Source, Issue on Power and
Performance: Confronting the Need for Speed,
2010.
Journal Articles:
-
G. L. Valentini, W. Lassonde, S. U. Khan, N. Min-Allah,
S. A. Madani, J. Li, L. Zhang, L. Wang, N. Ghani,
J. Kolodziej, H. Li, A. Y. Zomaya,
C. -Z. Xu, P. Balaji, A. Vishnu, F. Pinel,
J. E. Pecero, D. Kliazovich, and P. Bouvry. An Overview
of Energy Efficiency Techniques in Cluster Computing
Systems. Special edition of the Springer Journal of
Cluster Computing on Green Computing and Communications,
2012.
-
P. Balaji, R. Gupta, A. Vishnu and
P. Beckman; Mapping
Communication Layouts to Network Hardware Characteristics
on Massive-Scale Blue Gene Systems. Special edition of
the Springer Journal of Computer Science on Research and
Development (presented at the International Supercomputing
Conference (ISC)), pp. 247-256, Vol. 26, Issue 3-4,
2011.
-
P. Balaji, D. Buntinas, D. Goodell,W. Gropp,
T. Hoefler, S. Kumar, E. Lusk, R. Thakur and
J. L. Traff; MPI on Millions of Cores. Parallel
Processing Letters (PPL) Journal, pp. 45-60, Vol. 21,
Issue 1, 2011.
-
P. Balaji, W. Feng, H. Lin, J. Archuleta,
S. Matsuoka, A.Warren, J. Setubal, E. Lusk, R. Thakur,
I. Foster, D. S. Katz, S. Jha, K. Shinpaugh, S. Coghlan,
and
D. Reed. Global-scale
Distributed I/O with ParaMEDIC. Accepted for
publication at the International Journal of Concurrency
and Computation: Practice and Experience (CCPE),
pp. 2266-2281, Vol. 22, Issue 16, 2010.
-
P. Balaji, A. Chan, W. Gropp, R. Thakur and
E. Lusk. The Importance of Non-Data-Communication
Overheads in MPI. International Journal of High
Performance Computing Applications (IJHPCA), pp. 5-15,
Vol. 24, Issue 1, 2010.
-
P. Balaji, D. Buntinas, D. Goodell, W. Gropp and
R. Thakur. Fine-Grained Multithreading Support for
Hybrid Threaded MPI Programming. International Journal
of High Performance Computing Applications (IJHPCA),
pp. 49-57, Vol. 24, Issue 1, 2010.
-
J. L. Traff, A. Ripke, C. Siebert, P. Balaji,
R. Thakur, and W. Gropp. A Pipelined Algorithm for
Large, Irregular Allgather Problems. International
Journal of High Performance Computing Applications
(IJHPCA), pp. 58-68, Vol. 24, Issue 1, 2010.
-
P. Balaji, A. Chan, R. Thakur, W. Gropp and
E. Lusk. Toward
Message Passing for a Million Processes: Characterizing
MPI on a Massive Scale Blue Gene/P. Special edition of
the Springer Journal of Computer Science on Research and
Development (presented at the International Supercomputing
Conference (ISC)), pp. 11-19, Vol. 24, Issue 1,
2009. Best Paper Award at
ISC.
-
P. Lai, P. Balaji, R. Thakur and
D. K. Panda. ProOnE:
A General Purpose Protocol Onload Engine for Multi- and
Many-Core Architectures. Special edition of the
Springer Journal of Computer Science on Research and
Development (presented at the International Supercomputing
Conference (ISC)), pp. 133-142, Vol. 23, Issue 3,
2009.
-
P. Balaji, W. Feng and
D. K. Panda, Bridging
the Ethernet-Ethernot Performance Gap. IEEE Micro
Journal Special Issue on High-Performance Interconnects,
pp. 24-40, Volume 26, Issue 3, 2006.
-
H. -W. Jin, P. Balaji, C. Yoo, J . Y. Choi and
D. K. Panda, Exploiting
NIC Architectural Support for Enhancing IP based Protocols
on High Performance Networks. Special Issue of the
Journal of Parallel and Distributed Computing (JPDC) on
Design and Performance of Networks for Super-, Cluster-
and Grid-Computing, pp. 1348-1365, Vol. 65, Issue 11,
2005.
-
M. Islam, P. Balaji, P. Sadayappan and
D. K. Panda,
QoPS: A QoS based scheme for Parallel Job Scheduling
(extended journal version). IEEE Springer LNCS Journal
Series, pp. 252-268, Vol. 2862, 2003.
Invited Papers:
-
R. Thakur, P. Balaji, D. Buntinas, D. Goodell,
W. Gropp, T. Hoefler, S. Kumar, E. Lusk and
J. L. Traff. MPI
at Exascale. Department of Energy SciDAC workshop,
Jul. 11-15th, 2010, Chattanooga, Tennessee.
-
W. Feng, P. Balaji, and
A. Singh, Network
Interface Cards as First Class Citizens. In the
workshop on The Influence of I/O on Microprocessor
Architecture (IOM); in conjunction with the IEEE
International Symposium on High Performance Computer
Architecture (HPCA), Feb. 15th, 2009, Raleigh, North
Carolina.
-
K. Vaidyanathan, S. Narravula, P. Balaji and
D. K. Panda, Designing
Efficient Systems Services and Primitives for
Next-Generation Data-Centers. In the workshop on the
National Science Foundation Next Generation Software
(NSFNGS) Program; in conjunction with the IEEE
International Parallel and Distributed Processing
Symposium (IPDPS), Mar 26th, 2007, Long Beach,
California.
-
P. Balaji, K. Vaidyanathan, S. Narravula,
H. -W. Jin, and
D. K. Panda, Designing
Next Generation Data-centers with Advanced Communication
Protocols and Systems Services. In the workshop on the
National Science Foundation Next Generation Software
(NSFNGS) Program; in conjunction with the IEEE
International Parallel and Distributed Processing
Symposium (IPDPS), Apr 25th, 2006, Rhodes Island,
Greece.
Conference and Workshop publications:
-
R. Wang, E. Yao, P. Balaji, D. Buntinas, M. Chen
and
G. Tan. Building
Algorithmically Nonstop Fault Tolerant MPI
Programs. IEEE International Conference on High
Performance Computing (HiPC). Dec. 18-21, 2011, Bangalore,
India.
-
J. Dinan, S. Krishnamoorthy, P. Balaji, J. Hammond,
M. Krishnan, V. Tipparaju and
A. Vishnu. Noncollective
Communicator Creation in MPI. The Euro MPI Users'
Group Conference (EuroMPI); special session on Improving
MPI User and Developer Interaction (IMUDI). Sep. 18-21,
2011, Santorini, Greece.
-
M. Rashti, J. Green and A. Afsahi P. Balaji and
W. Gropp. Multi-core
and Network Aware MPI Topology Functions. The Euro MPI
Users' Group Conference (EuroMPI). Sep. 18-21, 2011,
Santorini, Greece.
-
A. Vishnu, M. Krishnan
and P. Balaji. Dynamic
Time-Variant Connection Management for PGAS Models on
InfiniBand. Workshop on Communication Architecture for
Scalable Systems (CASS); in conjunction with the IEEE
International Parallel and Distributed Processing
Symposium (IPDPS), May 16th, 2011, Anchorage,
Alaska.
-
R. Grant, M. Rashti, P. Balaji and
A. Afsahi. RDMA
Capable iWARP over Datagrams. IEEE International
Parallel and Distributed Processing Symposium (IPDPS). May
16-20, 2011, Anchorage, Alaska.
-
M. Rashti, R. Grant, P. Balaji and
A. Afsahi. iWARP
Redefined: Scalable Connectionless Communication over
High-Speed Ethernet. IEEE International Conference on
High Performance Computing (HiPC). Dec. 19-22, 2010,
Goa,India.
-
A. Vishnu, H. V. Dam, W. D. Jong, P. Balaji and
S. Song. Fault
Tolerant Communication Runtime Support for Data Centric
Programming Models. IEEE International Conference on
High Performance Computing (HiPC). Dec. 19-22, 2010, Goa,
India.
-
Y. Jiao, H. Lin, P. Balaji and
W. Feng. Power
and Performance Characterization of Computational Kernels
on the GPU. IEEE/ACM International Conference on Green
Computing and Communications (GreenCom). Dec. 18-20,2010,
Hangzhou, China.
-
A. Vishnu, S. Song, A. Marquez, K. Barker, D. Kerbyson,
K. Cameron, P. Balaji. Designing
Energy Efficient Communication Runtime Systems for Data
Centric Programming Models. IEEE/ACM International
Conference on Green Computing and Communications
(GreenCom). Dec. 18-20, 2010, Hangzhou, China.
-
D. Goodell, P. Balaji, D. Buntinas, G. D“ozsa,
W. Gropp, S. Kumar, B. R. de Supinski and
R. Thakur.
Minimizing MPI Resource Contention in Multithreaded
Multicore Environments. IEEE International Conference
on Cluster Computing (Cluster). Sep. 20-24, 2010,
Heraklion, Crete, Greece.
-
P. Balaji, D. Buntinas, D. Goodell, W. Gropp,
J. Krishna, E. Lusk and
R. Thakur. PMI:
A Scalable Parallel Process Management Interface for
Extreme-Scale Systems. The Euro MPI Users' Group
Conference (EuroMPI). Sep. 12-15, 2010, Stuttgart,
Germany.
-
G. Dozsa, S. Kumar, P. Balaji, D. Buntinas,
D. Goodell, W. Gropp, J. Ratterman and
R. Thakur. Enabling
Concurrent Multithreaded MPI Communication on Multicore
Petascale Systems. The Euro MPI Users' Group
Conference (EuroMPI). Sep. 12-15, 2010, Stuttgart,
Germany.
-
J. Krishna, P. Balaji, E. Lusk, R. Thakur and
F. Tiller. Implementing
MPI on Windows: Comparison with Common Approaches on
Unix. The Euro MPI Users' Group Conference
(EuroMPI). Sep. 12-15, 2010, Stuttgart, Germany.
-
J. Dinan, P. Balaji, E. Lusk, P. Sadayappan and
R. Thakur. Hybrid
Parallel Programming with MPI and Unified Parallel
C. ACM International Conference on Computing Frontiers
(CF), May 17-19, 2010, Bertinoro, Italy.
-
R. E. Grant, P. Balaji and
A. Afsahi. A
Study of Hardware Assisted IP over InfiniBand and its
Impact on Enterprise Data Center Performance. IEEE
International Symposium on Performance Analysis of Systems
and Software (ISPASS), Mar. 28-30, 2010, White Plains,
NY.
-
P. Balaji, H. Naik and
N. Desai. Understanding
Network Saturation Behavior on Large-Scale Blue Gene/P
Systems. International Conference on Parallel and
Distributed Systems (ICPADS), Dec. 8-11, 2009, Shenzhen,
China.
-
R. E. Grant, A. Afsahi
and P. Balaji. An
Evaluation of ConnectX Virtual Protocol Interconnect for
Data Centers. International Conference on Parallel and
Distributed Systems (ICPADS), Dec. 8-11, 2009, Shenzhen,
China.
-
A. Singh, P. Balaji and
W. Feng. GePSeA:
A General-Purpose Software Acceleration Framework For
Lightweight Task Offloading. International Conference
on Parallel Processing (ICPP), Sep. 22-25, 2009, Vienna,
Austria.
-
N. Desai, D. Buntinas, D. Buettner, P. Balaji, and
A. Chan. Improving
Resource Availability by Relaxing Network Allocation
Constraints on the Blue Gene/P. International
Conference on Parallel Processing (ICPP), Sep. 22-25,
2009, Vienna, Austria.
-
P. Balaji, D. Buntinas, D. Goodell, W. Gropp,
S. Kumar, E. Lusk, R. Thakur and
J. L. Traff. MPI
on a Million Processors. The Euro PVM/MPI Users' Group
Conference (Euro
PVM/MPI), Outstanding Paper
Award, Sep. 7-10, 2009, Espoo, Finland.
-
G. Santhanaraman, P. Balaji, K. Gopalakrishnan,
R. Thakur, W. Gropp and
D. K. Panda. Natively
Supporting True One-sided Communication in MPI on
Multi-core Systems with InfiniBand. IEEE International
Symposium on Cluster Computing and the Grid (CCGrid), May
18-21, 2009, Shanghai, China.
-
P. Balaji, S. Bhagvat, R. Thakur and
D. K. Panda. Sockets
Direct Protocol for Hybrid Network Stacks: A Case Study
with iWARP over 10G Ethernet. IEEE/ACM International
Conference on High Performance Computing (HiPC),
Dec. 17-20, 2008, Bangalore, India.
-
A. Chan, P. Balaji, W. Gropp and
R. Thakur. Communication
Analysis of Parallel 3D FFT for Flat Cartesian Meshes on
Large Blue Gene Systems. IEEE/ACM International
Conference on High Performance Computing (HiPC),
Dec. 17-20, 2008, Bangalore, India.
-
M. Kumar, V. Chaube, P. Balaji, W. Feng and
H.-W. Jin. Making
a Case for Proactive Flow Control in Optical
Circuit-Switched Networks. IEEE/ACM International
Conference on High Performance Computing (HiPC),
Dec. 17-20, 2008, Bangalore, India.
-
H. Lin, P. Balaji, R. Poole, C. Sosa, X. Ma and
W. Feng. Massively
Parallel Genomic Sequence Search on the Blue Gene/P
Architecture. IEEE/ACM International Conference for
High Performance Computing, Networking, Storage and
Analysis (SC), Nov. 15-21, 2008, Austin, Texas.
-
T. Scogland, P. Balaji, W. Feng and
G. Narayanaswamy. Asymmetric
Interactions in Symmetric Multi-core Systems: Analysis,
Enhancements and Evaluation. IEEE/ACM International
Conference for High Performance Computing, Networking,
Storage and Analysis (SC), Nov. 15-21, 2008, Austin,
Texas.
-
N. Desai, P. Balaji, P. Sadayappan and
M. Islam. Are
Non-Blocking Networks Really Needed for High-End-Computing
Workloads?. IEEE International Conference on Cluster
Computing (Cluster), Best Paper
Award Sep. 29 - Oct. 1st, 2008, Tsukuba,
Japan.
-
P. Balaji, A. Chan, W. Gropp, R. Thakur and
E. Lusk. Non-Data-Communication
Overheads in MPI: Analysis on Blue Gene/P. The Euro
PVM/MPI Users' Group Conference (EuroPVM/MPI),
Outstanding Paper
Award, Sep. 7-10, 2008,
Dublin, Ireland.
-
P. Balaji, D. Buntinas, D. Goodell, W. Gropp and
R. Thakur. Toward
Efficient Support for Multithreaded MPI
Communication. The Euro PVM/MPI Users' Group
Conference (EuroPVM/MPI), Sep. 7-10, 2008, Dublin,
Ireland.
-
J. L. Traff, A. Ripke, C. Siebert, P. Balaji,
R. Thakur and
W. Gropp. A
Simple, Pipelined Algorithm for Large, Irregular
All-gather Problems. The Euro PVM/MPI Users' Group
Conference (EuroPVM/MPI), Sep. 7-10, 2008, Dublin,
Ireland.
-
G. Narayanaswamy, P. Balaji and
W. Feng. Impact
of Network Sharing in Multi-core Architectures. IEEE
International Conference on Computer Communication and
Networks (ICCCN), Aug. 3-7, 2008, St. Thomas, U.S. Virgin
Islands.
-
P. Balaji, W. Feng, H. Lin, J. Archuleta,
S. Matsuoka, A. Warren, J. Setubal, E. Lusk, R. Thakur,
I. Foster, D. S. Katz, S. Jha, K. Shinpaugh, S. Coghlan
and
D. Reed. Distributed
I/O with ParaMEDIC: Experiences with a Worldwide
Supercomputer. International Supercomputing Conference
(ISC),
Outstanding Paper Award, Jun. 17-20,
2008, Dresden, Germany.
-
P. Balaji, W. Feng, and
H. Lin. Semantics-based
Distributed I/O with the ParaMEDIC Framework. In the
ACM/IEEE International Symposium on High Performance
Distributed Computing (HPDC), Jun 23-27, 2008, Boston,
Massachusetts.
-
P. Balaji, W. Feng, J. Archuleta, H. Lin,
R. Kettimuttu, R. Thakur and
X. Ma. Semantics-based
Distributed I/O for mpiBLAST (short paper). In the ACM
SIGPLAN Symposium on Principles and Practice of Parallel
Programming (PPoPP), Feb 20-23, 2008, Salt Lake City,
Utah.
-
P. Balaji, W. Feng, S. Bhagvat, D. K. Panda,
R. Thakur and
W. Gropp. Analyzing
the Impact of Supporting Out-of-Order Communication on
In-order Performance with iWARP. In the IEEE/ACM
International Conference for High Performance Computing,
Networking, Storage and Analysis (SC), Nov 10th to 16th,
2007, Reno, Nevada.
-
P. Balaji, W. Feng, J. Archuleta and
H. Lin. ParaMEDIC:
Parallel Metadata Environment for Distributed I/O and
Computing. In the IEEE/ACM International Conference
for High Performance Computing, Networking, Storage and
Analysis (SC). Storage Challenge
Award Winner, Nov 10th to 16th, 2007, Reno,
Nevada.
-
G. Narayanaswamy, P. Balaji, and
W. Feng, An
Analysis of 10-Gigabit Ethernet Protocol Stacks in
Multicore Environments. In the IEEE International
Symposium on High Performance Interconnects (HotI), Aug
22-24, 2007, Palo Alto, California.
-
P. Balaji, S. Bhagvat, D. K. Panda, R. Thakur, and
W. Gropp, Advanced
Flow-control Mechanisms for the Sockets Direct Protocol
over InfiniBand. In the IEEE International Conference
on Parallel Processing (ICPP), Sep 10-14, 2007, XiAn,
China.
-
M. Islam, P. Balaji, G. Sabin and
P. Sadayappan, Analyzing
and Minimizing the Impact of Opportunity Cost in QoS-aware
Job Scheduling. In the IEEE International Conference
on Parallel Processing (ICPP), Sep 10-14, 2007, XiAn,
China.
-
P. Balaji, D. Buntinas, S. Balay, B. Smith,
R. Thakur and
W. Gropp, Nonuniformly
Communicating Noncontiguous Data: A Case Study with PETSc
and MPI. In the IEEE Parallel and Distributed
Processing Symposium (IPDPS), Mar 26-30, 2007, Long Beach,
California.
-
P. Balaji, S. Bhagvat, H. -W. Jin and
D. K. Panda, Asynchronous
Zero-copy Communication for Synchronous Sockets in the
Sockets Direct Protocol (SDP) over InfiniBand. In the
workshop on Communication Architecture for Clusters (CAC);
in conjunction with the IEEE International Parallel and
Distributed Processing Symposium (IPDPS), Apr 25th, 2006,
Rhodes Island, Greece.
-
V. Viswanath, P. Balaji, W. Feng, J. Leigh,
D. K. Panda, A
Case for UDP Offload Engines in LambdaGrids. In the
workshop on Protocols for Fast Long-Distance Networks
(PFLDnet), Feb 2nd and 3rd, 2006, Nara, Japan.
-
P. Balaji, W. Feng, Q. Gao, R. Noronha, W. Yu and
D. K. Panda,
Head-to-TOE
Evaluation of High Performance Sockets over Protocol
Offload Engines. In the proceedings of the IEEE
International Conference on Cluster Computing, Sep
27-30, 2005, Boston, Massachusetts.
-
P. Balaji, H. -W. Jin, K. Vaidyanathan and
D. K. Panda, Supporting
iWARP Compatibility and Features for Regular Network
Adapters. In the proceedings of the workshop on Remote
Direct Memory Access (RDMA): Applications,
Implementations, and Technologies (RAIT); held in
conjunction with the IEEE International Conference on
Cluster Computing, Sep 26th, 2005, Boston,
Massachusetts.
-
H. -W. Jin, S. Narravula, G. Brown,
K. Vaidyanathan. P. Balaji and
D. K. Panda, Performance
Evaluation of RDMA over IP: A Case Study with the Ammasso
Gigabit Ethernet NIC. In the workshop on High
Performance Interconnects for Distributed Computing
(HPI-DC); to be held in conjunction with the 14th
International Symposium on High Performance Distributed
Computing (HPDC-14), Jul 24th, 2005, Research Triangle
Park, NC.
-
W. Feng, P. Balaji, C. Baron, L. N. Bhuyan and
D. K. Panda, Performance
Characterization of a 10-Gigabit Ethernet TOE. In the
proceedings of the IEEE International Symposium on
High-Performance Interconnects (HotI), Aug 17-19, 2005,
Palo Alto, California.
-
P. Balaji, S. Narravula, K. Vaidyanathan,
H. -W. Jin and
D. K. Panda, On
the Provision of Prioritization and Soft QoS in
Dynamically Reconfigurable Shared Data-Centers over
InfiniBand. In the proceedings of the IEEE
International Symposium on Performance Analysis of Systems
and Software (ISPASS), Mar 20-22, 2005, Austin,
Texas.
-
K. Vaidyanathan, P. Balaji, H. -W. Jin and
D. K. Panda, Workload-driven
Analysis of File Systems in Shared Multi-tier Data-Centers
over InfiniBand. In the eighth workshop on Computer
Architecture Evaluation using Commercial Workloads
(CAECW-8); to be held in conjunction with the 11th
International Symposium on High Performance Computer
Architecture (HPCA-11), Feb 12, 2005, San Francisco,
CA.
-
S. Narravula, P. Balaji, K. Vaidyanathan,
H. -W. Jin and
D. K. Panda, Architecture
for Caching Responses with Multiple Dynamic Dependencies
in Multi-Tier Data-Centers over InfiniBand. In the
proceedings of the IEEE/ACM International Symposium on
Cluster Computing and the Grid (CCGrid), May 9-12, 2005,
Cardiff, UK.
-
P. Balaji, H. Shah and
D. K. Panda, Sockets
vs RDMA Interface over 10-Gigabit Networks: An In-depth
analysis of the Memory Traffic Bottleneck. In the
workshop on Remote Direct Memory Access (RDMA):
Applications, Implementations, and Technologies (RAIT);
held in conjunction with the IEEE International Conference
on Cluster Computing, Sep 20th, 2004, San Diego,
California.
-
P. Balaji, K. Vaidyanathan, S. Narravula,
S. Krishnamoorthy, H. -W. Jin and
D. K. Panda, Exploiting
Remote Memory Operations to Design Efficient
Reconfiguration for Shared Data-Centers over
InfiniBand. In the workshop on Remote Direct Memory
Access (RDMA): Applications, Implementations, and
Technologies (RAIT); held in conjunction with the IEEE
International Conference on Cluster Computing, Sep 20th,
2004, San Diego, California.
-
M. Islam, P. Balaji, P. Sadayappan and
D. K. Panda, Towards
Provision of Quality of Service Guarantees in Job
Scheduling. In the proceedings of the IEEE
International Conference on Cluster Computing, Sep 20-23,
2004, San Diego, California.
-
S. Narravula, P. Balaji, K. Vaidyanathan,
S. Krishnamoorthy, J. Wu and
D. K. Panda, Supporting
Strong Coherency for Active Caches in Multi-Tier
Data-Centers over InfiniBand. In the workshop on
System Area Networks (SAN); held in conjuntion with the
IEEE International Symposium on High Performance Computer
Architecture (HPCA), Feb 14th, 2004, Madrid,
Spain.
-
P. Balaji, S. Narravula, K. Vaidyanathan,
S. Krishnamoorthy, J. Wu and
D. K. Panda, Sockets
Direct Protocol over InfiniBand in Clusters: Is it
Beneficial?. In the proceedings of the IEEE
International Symposium on Performance Analysis of Systems
and Software (ISPASS), Mar 10-12, 2004, Austin,
Texas.
-
R. Kurian, P. Balaji,
P. Sadayappan, Opportune
Job Shredding: An Effective approach for scheduling
Parameter Sweep Applications. In the proceedings of
the Los Alamos Computer Science Institute Symposium
(LACSI), Oct 12-14, 2003, Santa Fe, New Mexico.
-
M. Islam, P. Balaji, P. Sadayappan and
D. K. Panda, QoPS:
A QoS based scheme for Parallel Job Scheduling. In the
Job Scheduling Strategies for Parallel Processing workshop
(JSSPP); held in conjunction with the IEEE International
Symposium on High Performance Distributed Computing
(HPDC), Jun 24th, 2003, Seattle, WA.
-
P. Balaji, J. Wu, T. Kurc, U. Catalyurek,
D. K. Panda and
J. Saltz, Impact
of High Performance Sockets on Data Intensive
Applications. In the proceedings of the IEEE
International Symposium on High Performance Distributed
Computing (HPDC), Jun 22-24, 2003, Seattle, WA.
-
R. Gupta, P. Balaji, J. Nieplocha and
D. K. Panda, Efficient
Collective Operations using Remote Memory Operations on
VIA-Based Clusters. In the proceedings of the IEEE
International Parallel and Distributed Processing
Symposium (IPDPS), Apr 22-26, 2003, Nice, France.
-
P. Balaji, P. Shivam, P. Wyckoff and
D. K. Panda, High
Performance User-Level Sockets over Gigabit
Ethernet. In the proceedings of the IEEE International
Conference on Cluster Computing, Sept 23-26, 2002,
Chicago, IL.