Skip to content

Latest commit

 

History

History
17 lines (8 loc) · 791 Bytes

README.md

File metadata and controls

17 lines (8 loc) · 791 Bytes

nersc-roofline

This repo contains files necessary to generate results here

https://docs.nersc.gov/programming/performance-debugging-tools/roofline/.

and for the following two papers

C. Yang, R. Gayatri, T. Kurth, P. Basu, Z. Ronaghi, A. Adetokunbo, B. Friesen, B. Cook, D. Doerfler, L. Oliker et al., An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability, in 2018 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC). IEEE, 2018, pp. 14-23.

C. Yang, Hierarchical Roofline Analysis: How to Collect Data using Performance Tools on Intel CPUs and NVIDIA GPUs, arXiv.org

The data collection methodology for Roofline analysis on NVIDIA GPUs has been updated here

https://gitlab.com/NERSC/roofline-on-nvidia-gpus