Circuits and Systems

Volume 8, Issue 6 (June 2017)

ISSN Print: 2153-1285   ISSN Online: 2153-1293

Google-based Impact Factor: 0.48  Citations  

A Multithreaded CGRA for Convolutional Neural Network Processing

HTML  XML Download Download as PDF (Size: 4129KB)  PP. 149-170  
DOI: 10.4236/cs.2017.86010    2,392 Downloads   5,418 Views  Citations

ABSTRACT

Convolutional neural network (CNN) is an essential model to achieve high accuracy in various machine learning applications, such as image recognition and natural language processing. One of the important issues for CNN acceleration with high energy efficiency and processing performance is efficient data reuse by exploiting the inherent data locality. In this paper, we propose a novel CGRA (Coarse Grained Reconfigurable Array) architecture with time-domain multithreading for exploiting input data locality. The multithreading on each processing element enables the input data reusing through multiple computation periods. This paper presents the accelerator design performance analysis of the proposed architecture. We examine the structure of memory subsystems, as well as the architecture of the computing array, to supply required data with minimal performance overhead. We explore efficient architecture design alternatives based on the characteristics of modern CNN configurations. The evaluation results show that the available bandwidth of the external memory can be utilized efficiently when the output plane is wider (in earlier layers of many CNNs) while the input data locality can be utilized maximally when the number of output channel is larger (in later layers).

Share and Cite:

Ando, K. , Takamaeda-Yamazaki, S. , Ikebe, M. , Asai, T. and Motomura, M. (2017) A Multithreaded CGRA for Convolutional Neural Network Processing. Circuits and Systems, 8, 149-170. doi: 10.4236/cs.2017.86010.

Cited by

[1] Improving Energy Efficiency of CGRAs with Low-Overhead Fine-Grained Power Domains
ACM Transactions on …, 2022
[2] Remarn: A Reconfigurable Multi-threaded Multi-core Accelerator for Recurrent Neural Networks
ACM Transactions on …, 2022
[3] Speeding Up of CGRAs by Reshaping and Stochastic FMA
2021 Ninth International …, 2021
[4] A Reconfigurable Multithreaded Accelerator for Recurrent Neural Networks
2020
[5] Online perception with machine learning for automated driving
2020
[6] Deteksi Sinyal P300 Menggunakan Metode Batch Normalization Neural Network= P300 Signal Detection Using Batch Normalization Neural Network
2018
[7] On the Degree-Based Topological Indices of the Tickysim SpiNNaker Model
Axioms, 2018
[8] Stream Processing Dual-Track CGRA for Object Inference
2018

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.