1. 29th TOP500 Supercomputer Sites;Dongarra,1994
2. Anomaly detection and anticipation in high performance computing systems;Borghesi;IEEE Trans. Parallel Distrib. Syst.,2022
3. RUAD: Unsupervised anomaly detection in HPC systems;Molan;Future Gener. Comput. Syst.,2023
4. Predicting faults in high performance computing systems: An in-depth survey of the state-of-the-practice;Jauk,2019
5. Q. Guan, Z. Zhang, S. Fu, Proactive Failure Management by Integrated Unsupervised and Semi-Supervised Learning for Dependable Cloud Systems, in: 2011 Sixth International Conference on Availability, Reliability and Security, 2011, pp. 83–90, http://dx.doi.org/10.1109/ARES.2011.20.