SciDAC-Data: Enabling Data Driven Modeling of Exascale Computing
|Title||SciDAC-Data: Enabling Data Driven Modeling of Exascale Computing|
|Year of Publication||2017|
|Authors||Mubarak, M, Ding, P, Aliaga, L, Tsaris, A, Norman, A, Lyon, A, Ross, R|
The SciDAC-Data project is a DOE-funded initiative to analyze and exploit two decades of information and analytics that have been collected by the Fermilab data center on the organization, movement, and consumption of high energy physics (HEP) data. The project analyzes the analysis patterns and data organization that have been used by NOvA, MicroBooNE, MINERvA, CDF, D0, and other experiments to develop realistic models of HEP analysis workflows and data processing. The SciDAC-Data project aims to provide both realistic input vectors and corresponding output data that can be used to optimize and validate simulations of HEP analysis. These simulations are designed to address questions of data handling, cache optimization, and workflow structures that are the prerequisites for modern HEP analysis chains to be mapped and optimized to run on the next generation of leadership-class exascale computing facilities.