This release includes performance and bug fixes. It also includes a new utility to convert Darshan log files, while also optionally anonymizing them or re-compressing them in bzip2 format.
- new darshan-convert command line utility for converting existing log files, with optional anonymization and optional bzip2 compression
- bzip2 support in command line utilities (but not in the darshan library itself)
- updated log file format that allows for string key/value pairs to be stored in the header
- added ability to set MPI-IO hints when writing darshan log
- at configure time: –with-log-hints
- at run time: DARSHAN_LOGHINTS environment variable
- bug fix contributed by Sandra Schröder: use case-insensitive search for MPI symbols in Fortran wrapper script
- performance bug fix: remove unecessary call to MPI_File_set_size when writing log
- added –with-logpath-by-env configure option to allow absolute log path to be specified via environment variable
A paper featuring Darshan (“Understanding and Improving Computational Science Storage Access through Continuous Characterization“) was awarded Best Paper at the 27th IEEE (MSST 2011) Symposium on Massive Storage Systems and Technologies. The paper outlines a methodology for characterizing a large scale production workload and presents a 2 month study of I/O activity on the Intrepid Blue Gene/P system at Argonne National Laboratory.
This release primarily enhances portability and adds the option to use LD_PRELOAD for instrumentation rather than link time wrappers. This release does not add any new instrumentation or change the log file format.
- additional environment variables to control log location, jobid and alignment parameters
- additional configure tests to improve portability
- bug fixes for darshan-parser –perf calculations
- support for MPI1.x
- support for OpenMPI
- support for PGI and Intel compilers
- new libdarshan.so dynamic library for use with LD_PRELOAD
The Darshan 2.0.0 release is now available for download. From a user’s perspective, the biggest difference is that you no longer have to run darshan-parser before darshan-job-summary.pl if you just want to see the summary report for a job. The darshan-job-summary.pl script operates directly on the binary .gz files now. We also introduced new characterization counters as well as some additional tables in the summary view. Here is the full change log:
- new output file format that is portable across architectures (NOTE: Darshan 1.x output files are incompatible with the tools in this release unless they were generated on a ppc32 architecture (Blue Gene))
- 8 new counters that record the rank of the fastest and slowest process that opened each shared file, along with the number of seconds and number of bytes consumed by those processes. It also reports the variance in both time and amount of data.
- new –with-jobid-env configure argument to support recording job identifiers from different schedulers
- job ID is now recorded within the Darshan log in addition to in the file name
- opens output files directly without using intermediate darshan-parser output
- table showing data usage per file system
- table showing I/O variance in shared files
- fixes for bugs reported by Noah Watkins:
- avoid name collision in hashing function
- divide by zero error in darshan-job-summary.pl
Darshan 2.0.0 is now available on the download
The Darshan project now has a dedicated mailing list and Trac page. You can find more information in the “bug reports” tab at the top of the Darshan web site.
- bug fixes to darshan-gen-cc.pl and darshan-gen-cxx.pl
- basic testing of darshan-gen-cc.pl with mpich/pgi
- update autoconf so that –with-mpi argument overrides mpicc in path
- fix warnings on 64 bit machines
Darshan 1.1.15 is now available on the download page.
- fixed erroneous incompatibility warning when opening old logs in darshan-parser
Darshan 1.1.14 is now available on the download page.
- updated compiler scripts to fix link order in some cases with -lhdf5
Darshan 1.1.13 is now available on the download page.