Research on Convolutional Neural Network Inference Acceleration and Performance Optimization for Edge Intelligence

Published: 2023-12-31 Issue: 1 Volume: 24 Page: 240
ISSN: 1424-8220
Container-title: Sensors
Language: en
Short-container-title: Sensors

Author:

Liang Yong¹², Tan Junwen¹², Xie Zhisong², Chen Zetao², Lin Daoqian², Yang Zhenhao²

Affiliation:

1. Key Laboratory of Advanced Manufacturing and Automation Technology (Guilin University of Technology), Education Department of Guangxi Zhuang Autonomous Region, Guilin 541006, China

2. College of Mechanical and Control Engineering, Guilin University of Technology, Guilin 541006, China

Abstract

In recent years, edge intelligence (EI) has emerged, combining edge computing with AI, specifically deep learning, to run AI algorithms directly on edge devices. In practical applications, EI faces constraints on computational power, power consumption, size, and cost, the primary challenge being the trade-off between computational power and power consumption. This makes traditional computing platforms unsustainable and heterogeneous parallel computing platforms a crucial pathway for implementing EI. In our research, we used the Xilinx Zynq 7000 heterogeneous computing platform, designed with high-level synthesis (HLS), and implemented two accelerators for LeNet-5, one optimized with loop unrolling (UNROLL) and one with pipelining (PIPELINE). The experimental results show that, at a clock speed of 100 MHz, the PIPELINE accelerator consumes 8.09% more power than the UNROLL accelerator but runs 14.972 times faster, giving it the better overall performance. Compared to the CPU, the PIPELINE accelerator reduces power consumption by 91.37% and runs 70.387 times faster; compared to the GPU, it reduces power consumption by 93.35%. This study provides two optimization schemes for edge intelligence applications and demonstrates the impact of different quantization methods on FPGA resource consumption. The results offer a reference hardware acceleration scheme for practical edge intelligence applications.
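
The two optimization techniques named in the abstract can be illustrated with a minimal HLS-style sketch. This is not the authors' code: it assumes the standard LeNet-5 first-layer shape (32×32 input, 5×5 kernel, 28×28 output) and a single channel, and the names `IN_DIM`, `K_DIM`, `OUT_DIM`, `conv_unroll`, and `conv_pipeline` are hypothetical. `#pragma HLS UNROLL` and `#pragma HLS PIPELINE` are Xilinx HLS directives; an ordinary C++ compiler ignores them, so both functions compute the same result in software and differ only in the hardware an HLS tool would generate.

```cpp
// Assumed shapes: standard LeNet-5 first convolution, one channel.
constexpr int IN_DIM = 32;                    // input feature map is 32x32
constexpr int K_DIM = 5;                      // kernel is 5x5
constexpr int OUT_DIM = IN_DIM - K_DIM + 1;   // valid convolution gives 28x28

// Variant A (hypothetical): unroll the 5x5 kernel window so the 25
// multiply-accumulates of one output pixel can be built as parallel hardware.
void conv_unroll(float in[IN_DIM][IN_DIM],
                 float w[K_DIM][K_DIM],
                 float out[OUT_DIM][OUT_DIM]) {
    for (int r = 0; r < OUT_DIM; ++r) {
        for (int c = 0; c < OUT_DIM; ++c) {
            float acc = 0.0f;
            for (int i = 0; i < K_DIM; ++i) {
#pragma HLS UNROLL
                for (int j = 0; j < K_DIM; ++j) {
#pragma HLS UNROLL
                    acc += in[r + i][c + j] * w[i][j];
                }
            }
            out[r][c] = acc;
        }
    }
}

// Variant B (hypothetical): pipeline the output-column loop so a new output
// pixel can start every II clock cycles. HLS tools typically unroll the loops
// nested inside a pipelined loop.
void conv_pipeline(float in[IN_DIM][IN_DIM],
                   float w[K_DIM][K_DIM],
                   float out[OUT_DIM][OUT_DIM]) {
    for (int r = 0; r < OUT_DIM; ++r) {
        for (int c = 0; c < OUT_DIM; ++c) {
#pragma HLS PIPELINE II=1
            float acc = 0.0f;
            for (int i = 0; i < K_DIM; ++i) {
                for (int j = 0; j < K_DIM; ++j) {
                    acc += in[r + i][c + j] * w[i][j];
                }
            }
            out[r][c] = acc;
        }
    }
}
```

Whether either variant actually reaches its target initiation interval depends on memory port limits and array partitioning, which this sketch leaves out.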

Funder

Science and Technology Program of Guangxi, China

Guangxi Education Department of China

Guilin University of Technology

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/24/1/240/pdf

References: 45 articles.

1. Rajabi (2021). A Modified adaptive hysteresis smoothing approach for image denoising based on spatial domain redundancy. Sens. Imaging.

2. Rajabi, M., Golshan, H., and Hasanzadeh, R.P. (2023). Non-local adaptive hysteresis despeckling approach for medical ultrasound images. Biomed. Signal Process. Control, 85.

3. Ghaderzadeh (2022). Automated detection model in classification of B-lymphoblast cells from normal B-lymphoid precursors in blood smear microscopic images based on the majority voting technique. Sci. Program.

4. Yu, G., Wang, T., Guo, G., and Liu, H. (2023). SFHG-YOLO: A Simple Real-Time Small-Object-Detection Method for Estimating Pineapple Yield from Unmanned Aerial Vehicles. Sensors, 23.

5. Slam, W., Li, Y., and Urouvas, N. (2023). Frontier Research on Low-Resource Speech Recognition Technology. Sensors, 23.

Cited by 1 article.

1. Research on Fintech Stock Index Forecasting Methods Based on Big Data Artificial Intelligence;Proceedings of the 2024 Guangdong-Hong Kong-Macao Greater Bay Area International Conference on Digital Economy and Artificial Intelligence;2024-01-19