Affiliation:
1. University of Washington
2. University of Wisconsin-Madison
3. The University of Texas at Austin
4. Microsoft Research
Abstract
Since 2004, processor designers have increased core counts to exploit Moore’s Law scaling, rather than focusing on single-core performance. The failure of Dennard scaling, to which the shift to multicore parts is partially a response, may soon limit multicore scaling just as single-core scaling has been curtailed. This paper models multicore scaling limits by combining device scaling, single-core scaling, and multicore scaling to measure the speedup potential for a set of parallel workloads for the next five technology generations. For device scaling, we use both the ITRS projections and a set of more conservative device scaling parameters. To model single-core scaling, we combine measurements from over 150 processors to derive Pareto-optimal frontiers for area/performance and power/performance. Finally, to model multicore scaling, we build a detailed performance model of upper-bound performance and lower-bound core power. The multicore designs we study include single-threaded CPU-like and massively threaded GPU-like multicore chip organizations with symmetric, asymmetric, dynamic, and composed topologies. The study shows that regardless of chip organization and topology, multicore scaling is power limited to a degree not widely appreciated by the computing community. Even at 22 nm (just one year from now), 21% of a fixed-size chip must be powered off, and at 8 nm, this number grows to more than 50%. Through 2024, only 7.9× average speedup is possible across commonly used parallel workloads for the topologies we study, leaving a nearly 24-fold gap from a target of doubled performance per generation.
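The abstract's core argument is that the power budget, not area, caps how many cores can run at once, so a growing share of the die goes dark. A minimal sketch of that reasoning is below, combining an Amdahl's-law speedup bound with a chip power budget. This is an illustrative toy model in the spirit of the paper's methodology, not the authors' actual model; the parameter values (core power, chip TDP, parallel fraction, core counts) are hypothetical placeholders.

```python
# Illustrative sketch (not the paper's model): power-constrained Amdahl's-law
# speedup for a symmetric multicore, showing how a fixed power budget leaves
# part of the chip dark. All numeric values are hypothetical placeholders.

def symmetric_speedup(f_parallel, n_cores_area, core_power, chip_tdp,
                      core_perf, baseline_perf):
    """Upper-bound speedup when the power budget may leave some cores dark."""
    # Cores that fit in the area budget vs. cores the power budget can feed.
    n_powered = min(n_cores_area, int(chip_tdp // core_power))
    dark_fraction = 1.0 - n_powered / n_cores_area

    # Amdahl's law: serial fraction runs on one core, parallel fraction on all
    # powered cores; core_perf is normalized to a baseline single core.
    perf_ratio = core_perf / baseline_perf
    speedup = 1.0 / ((1.0 - f_parallel) / perf_ratio
                     + f_parallel / (n_powered * perf_ratio))
    return speedup, dark_fraction


if __name__ == "__main__":
    # Hypothetical future node: area for 256 cores, 125 W budget, 1 W per core.
    s, dark = symmetric_speedup(f_parallel=0.95, n_cores_area=256,
                                core_power=1.0, chip_tdp=125.0,
                                core_perf=1.0, baseline_perf=1.0)
    print(f"speedup = {s:.1f}x, dark silicon = {dark:.0%}")
```

Even with these generous assumptions, the power budget powers only a fraction of the cores the area affords, and Amdahl's law further limits the benefit of those that remain lit.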
Funder
Division of Computer and Network Systems
Division of Computing and Communication Foundations
Publisher
Association for Computing Machinery (ACM)
Cited by
37 articles.