Author:
Almási George,Heidelberger Philip,Archer Charles J.,Martorell Xavier,Erway C. Chris,Moreira José E.,Steinmacher-Burow B.,Zheng Yili
Cited by
44 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. gZCCL: Compression-Accelerated Collective Communication Framework for GPU Clusters;Proceedings of the 38th ACM International Conference on Supercomputing;2024-05-30
2. An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression;2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS);2024-05-27
3. LIBRA: Enabling Workload-Aware Multi-Dimensional Network Topology Optimization for Distributed Training of Large AI Models;2024 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS);2024-05-05
4. Enhancing Collective Communication in MCM Accelerators for Deep Learning Training;2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2024-03-02
5. POSTER: Optimizing Collective Communications with Error-bounded Lossy Compression for GPU Clusters;Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming;2024-02-20