Seminars & Events
Mathematics and Computer Science Division
"Scalable, flexible tools for understanding the HPC environment"
DATE: February 10, 2012
TIME: 10:30 AM - 11:30 AM
SPEAKER: Jim Brandt, Principal Member of Technical Staff, Sandia National Laboratories
LOCATION: Building 240, TCS Conference Center, 1406 & 1407, Argonne National Laboratory
HOST: Narayan Desai
Description:
Understanding how applications behave and how they interact with system software and hardware is becoming increasingly difficult as the complexity of all three components grows. Tools for understanding these behaviors and interactions, in the contexts of both failure and performance, are therefore becoming more important. In the case of failure, early detection and attribution enable a quick response, which can increase the productivity of both platform and user. In the context of performance, understanding how resources are being used can likewise increase productivity through more intelligent resource requests, allocations, and use. This talk will present work being done at Sandia on scalable, lightweight tools for HPC monitoring and analysis of all three components, as well as for feedback to drive application load balancing.
Jim Brandt is a Principal Member of Technical Staff at Sandia National Laboratories. His research interests are strategies and enabling capabilities for intelligent dynamic resource management to improve HPC system and application performance. His work targets both failure and non-failure (e.g., memory contention) scenarios. Jim leads the OVIS project at Sandia for scalable, real-time analysis of very large datasets, targeting the analysis of HPC system data to characterize system health and application resource utilization and to determine and invoke beneficial responses.
