1. 2.1 summit and Sierra: designing ai/hpc supercomputers;Kahle,2019
2. Cycles, cells and platters: an empirical analysis of hardware failures on a million consumer pcs;Nightingale,2011
3. A large-scale study of failures on petascale supercomputers;Liu;J. Comput. Sci. Technol.,2018
4. Analyzing a five-year failure record of a leadership-class supercomputer;Rojas,2019
5. An analysis of resilience techniques for exascale computing platforms;Dauwe,2017