Improved Cache Performance in Monte Carlo Transport Calculations Using Energy Banding
|Publication Type||Journal Article|
|Year of Publication||2013|
|Authors||Siegel, AR; Smith, K; Felker, KG; Romano, PK; Forget, B; Beckman, PH|
|Journal||Computer Physics Communications|
We present an energy banding algorithm for Monte Carlo (MC) neutral-particle transport simulations that depend on large cross-section lookup tables. In MC codes, read-only cross-section data tables are accessed frequently, exhibit poor locality, and are typically much too large to fit in fast memory. As a result, performance is often limited by long latencies to RAM, or by off-node communication latencies when the data footprint is so large that it must be decomposed across a distributed-memory machine. The proposed energy banding algorithm allows maximal temporal reuse of data, in band sizes that can be chosen flexibly to accommodate different architectural features. The algorithm is general and has a number of benefits compared to the traditional approach. In the present analysis we explore its potential to improve time-to-solution on modern cache-based architectures.
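The banding idea described in the abstract can be illustrated with a small Python sketch. It is not the authors' implementation: the function name `run_banded`, the toy physics (a constant-per-grid-point absorption probability and down-scattering only, with each scatter keeping between 50% and 100% of the energy), and the equal-width split of the grid are all assumptions made for illustration. The point it tries to show is the access pattern: every particle in a band is tracked until it is absorbed or leaves the band, so only that band's slice of the table is touched at any one time.

```python
import bisect
import random


def run_banded(source_energies, energy_grid, p_absorb, n_bands, seed=0):
    """Toy band-by-band particle tracking (hypothetical, illustrative only).

    energy_grid must be sorted ascending and strictly positive;
    p_absorb[i] is the assumed absorption probability at grid point i.
    Returns (absorbed, cut_off, lookups_per_band).
    """
    rng = random.Random(seed)
    n = len(energy_grid)
    size = -(-n // n_bands)      # grid points per band (ceiling division)
    n_actual = -(-n // size)     # number of non-empty bands

    def band_index(e):
        # Grid point at or below e; -1 means e fell below the whole grid.
        i = bisect.bisect_right(energy_grid, e) - 1
        return i // size if i >= 0 else -1

    # Bank every source particle in the band that contains its energy.
    banks = [[] for _ in range(n_actual)]
    for e in source_energies:
        b = band_index(e)
        if b >= 0:
            banks[b].append(e)

    absorbed = cut_off = 0
    lookups = [0] * n_actual

    # Sweep from the highest band down. Assumption: no up-scattering,
    # so a particle can only move to the same band or a lower one.
    for b in reversed(range(n_actual)):
        lo, hi = b * size, min((b + 1) * size, n)
        grid_slice = energy_grid[lo:hi]   # the only table data this band needs
        xs_slice = p_absorb[lo:hi]
        for e in banks[b]:
            while True:
                j = bisect.bisect_right(grid_slice, e) - 1
                lookups[b] += 1
                if rng.random() < xs_slice[j]:
                    absorbed += 1
                    break
                e *= rng.uniform(0.5, 1.0)    # toy down-scatter
                nb = band_index(e)
                if nb == b:
                    continue                  # still inside this band
                if nb < 0:
                    cut_off += 1              # below the grid: stop tracking
                else:
                    banks[nb].append(e)       # defer to a lower band
                break
        banks[b] = []                         # band finished; release its bank

    return absorbed, cut_off, lookups
```

In this sketch the band count is the tuning knob: more bands give a smaller `grid_slice` per sweep, which on real hardware could be sized to a cache level or to the portion of the table held by one node. Because the order of random draws depends on the banding, runs with different `n_bands` agree only statistically, not particle by particle.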