1. Pei Z, Yao X, Zhao W, Yu B. Quantization via Distillation and Contrastive Learning. IEEE Transactions on Neural Networks and Learning Systems 2024; 35:17164-17176. PMID: 37610897. DOI: 10.1109/tnnls.2023.3300309.
Abstract
Quantization is a critical technique employed across various research fields for compressing deep neural networks (DNNs) to facilitate deployment within resource-limited environments. This process necessitates a delicate balance between model size and performance. In this work, we explore knowledge distillation (KD) as a promising approach for improving quantization performance by transferring knowledge from high-precision networks to low-precision counterparts. We specifically investigate feature-level information loss during distillation and emphasize the importance of feature-level network quantization perception. We propose a novel quantization method that combines feature-level distillation and contrastive learning to extract and preserve more valuable information during the quantization process. Furthermore, we utilize the hyperbolic tangent function to estimate gradients with respect to the rounding function, which smoothens the training procedure. Our extensive experimental results demonstrate that the proposed approach achieves competitive model performance with the quantized network compared to its full-precision counterpart, thus validating its efficacy and potential for real-world applications.
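The tanh-based gradient estimator this abstract mentions can be illustrated with a soft rounding function. The exact parameterization is not given in the abstract, so the form below (floor plus a temperature-controlled tanh step) is an assumption for illustration only:

```python
import math

def soft_round(x, k):
    """Differentiable surrogate for round(): floor(x) plus a tanh step.
    As the temperature k grows, soft_round(x, k) -> round(x)."""
    f = math.floor(x)
    return f + 0.5 * (1.0 + math.tanh(k * (x - f - 0.5)))

def soft_round_grad(x, k):
    """d/dx soft_round(x, k): finite everywhere, unlike the true rounding
    function, whose derivative is zero almost everywhere."""
    f = math.floor(x)
    t = math.tanh(k * (x - f - 0.5))
    return 0.5 * k * (1.0 - t * t)
```

As `k` grows, `soft_round` approaches the hard rounding used at inference while its gradient remains nonzero, which is what smooths training relative to a plain straight-through estimator.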
2. Boudardara F, Boussif A, Meyer PJ, Ghazel M. INNAbstract: An INN-Based Abstraction Method for Large-Scale Neural Network Verification. IEEE Transactions on Neural Networks and Learning Systems 2024; 35:18455-18469. PMID: 37792651. DOI: 10.1109/tnnls.2023.3316551.
Abstract
Neural networks (NNs) have witnessed widespread deployment across various domains, including some safety-critical applications. In this regard, the demand for means of verifying such artificial intelligence techniques is increasingly pressing. Nowadays, the development of evaluation approaches for NNs is a hot topic that is attracting considerable interest, and a number of verification methods have been proposed. Yet, a challenging issue for NN verification pertains to scalability when NNs of practical interest have to be evaluated. This work presents INNAbstract, an abstraction method that reduces the size of NNs and thereby improves the scalability of NN verification and reachability analysis methods. This is achieved by merging neurons while ensuring that the obtained model (i.e., the abstract model) overapproximates the original one. INNAbstract supports networks with numerous activation functions. In addition, we propose a heuristic for node selection to build more precise abstract models, in the sense that their outputs are closer to those of the original network. The experimental results illustrate the efficiency of the proposed approach compared to existing relevant abstraction techniques. Furthermore, they demonstrate that INNAbstract can help existing verification tools be applied to larger networks while considering various activation functions.
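INNAbstract builds on interval neural networks (INNs), in which merged neurons carry weight intervals rather than scalars. The sketch below (hypothetical helper names, not the paper's algorithm) shows the core soundness idea: propagating an input box through a layer whose weights are only known up to intervals yields output bounds that enclose the original network's outputs:

```python
def interval_affine(x_lo, x_hi, W_lo, W_hi, b):
    """Propagate the input box [x_lo, x_hi] through y = W x + b, where each
    weight is only known to lie in [W_lo, W_hi]; returns an output box."""
    out_lo, out_hi = [], []
    for row_lo, row_hi, bias in zip(W_lo, W_hi, b):
        acc_lo = acc_hi = bias
        for l, h, wl, wh in zip(x_lo, x_hi, row_lo, row_hi):
            # extrema of w * x over w in [wl, wh], x in [l, h]
            c = (wl * l, wl * h, wh * l, wh * h)
            acc_lo += min(c)
            acc_hi += max(c)
        out_lo.append(acc_lo)
        out_hi.append(acc_hi)
    return out_lo, out_hi

def relu_box(lo, hi):
    """ReLU is monotone, so it maps boxes to boxes."""
    return [max(0.0, v) for v in lo], [max(0.0, v) for v in hi]
```

When two neurons are merged, replacing their scalar weights with the interval spanning both keeps the abstract output box a superset of the original outputs, which is the over-approximation property the verification relies on.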
3. Fabre W, Haroun K, Lorrain V, Lepecq M, Sicard G. From Near-Sensor to In-Sensor: A State-of-the-Art Review of Embedded AI Vision Systems. Sensors (Basel) 2024; 24:5446. PMID: 39205141. PMCID: PMC11360785. DOI: 10.3390/s24165446.
Abstract
In modern cyber-physical systems, the integration of AI into vision pipelines is now a standard practice for applications ranging from autonomous vehicles to mobile devices. Traditional AI integration often relies on cloud-based processing, which faces challenges such as data access bottlenecks, increased latency, and high power consumption. This article reviews embedded AI vision systems, examining the diverse landscape of near-sensor and in-sensor processing architectures that incorporate convolutional neural networks. We begin with a comprehensive analysis of the critical characteristics and metrics that define the performance of AI-integrated vision systems. These include sensor resolution, frame rate, data bandwidth, computational throughput, latency, power efficiency, and overall system scalability. Understanding these metrics provides a foundation for evaluating how different embedded processing architectures impact the entire vision pipeline, from image capture to AI inference. Our analysis delves into near-sensor systems that leverage dedicated hardware accelerators and commercially available components to efficiently process data close to their source, minimizing data transfer overhead and latency. These systems offer a balance between flexibility and performance, allowing for real-time processing in constrained environments. In addition, we explore in-sensor processing solutions that integrate computational capabilities directly into the sensor. This approach addresses the rigorous demand constraints of embedded applications by significantly reducing data movement and power consumption while also enabling in-sensor feature extraction, pre-processing, and CNN inference. By comparing these approaches, we identify trade-offs related to flexibility, power consumption, and computational performance. 
Ultimately, this article provides insights into the evolving landscape of embedded AI vision systems and suggests new research directions for the development of next-generation machine vision systems.
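Several of the metrics surveyed here (resolution, frame rate, bit depth) combine into the raw data rate a sensor pushes toward the processing stage, which is the quantity near-sensor and in-sensor designs try to shrink. A back-of-the-envelope helper, with illustrative numbers:

```python
def raw_data_rate_mbps(width, height, fps, bits_per_pixel):
    """Raw (uncompressed) sensor output in megabits per second."""
    return width * height * fps * bits_per_pixel / 1e6

# Illustrative: a 1920x1080 sensor at 30 fps with 12-bit raw output
rate = raw_data_rate_mbps(1920, 1080, 30, 12)  # ~746.5 Mbit/s
```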
Affiliation(s)
- William Fabre
- Université Paris-Saclay, CEA, List, F-91120 Palaiseau, France; (K.H.); (V.L.); (M.L.); (G.S.)
4. Lin M, Ji R, Li S, Wang Y, Wu Y, Huang F, Ye Q. Network Pruning Using Adaptive Exemplar Filters. IEEE Transactions on Neural Networks and Learning Systems 2022; 33:7357-7366. PMID: 34101606. DOI: 10.1109/tnnls.2021.3084856.
Abstract
Popular network pruning algorithms reduce redundant information by optimizing hand-crafted models, which may cause suboptimal performance and long filter-selection times. We innovatively introduce adaptive exemplar filters to simplify the algorithm design, resulting in an automatic and efficient pruning approach called EPruner. Inspired by the face recognition community, we use the message-passing algorithm Affinity Propagation on the weight matrices to obtain an adaptive number of exemplars, which then act as the preserved filters. EPruner breaks the dependence on the training data in determining the "important" filters and runs on a CPU in seconds, an order of magnitude faster than GPU-based state-of-the-art methods. Moreover, we show that the weights of exemplars provide a better initialization for fine-tuning. On VGGNet-16, EPruner achieves a 76.34% FLOPs reduction by removing 88.80% of parameters, with a 0.06% accuracy improvement on CIFAR-10. On ResNet-152, EPruner achieves a 65.12% FLOPs reduction by removing 64.18% of parameters, with only a 0.71% top-5 accuracy loss on ILSVRC-2012. Our code is available at https://github.com/lmbxmu/EPruner.
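EPruner's central step, running Affinity Propagation on filter similarities so that the returned exemplars become the preserved filters, can be sketched as follows. This is a minimal textbook implementation of Affinity Propagation (Frey and Dueck), not the authors' code, and the toy similarity matrix stands in for filter-weight similarities:

```python
import numpy as np

def affinity_propagation(S, damping=0.9, iters=200):
    """Minimal Affinity Propagation. S is a similarity matrix whose diagonal
    holds each point's 'preference' to become an exemplar; returns the
    indices of the chosen exemplars."""
    n = S.shape[0]
    idx = np.arange(n)
    R = np.zeros((n, n))  # responsibilities
    A = np.zeros((n, n))  # availabilities
    for _ in range(iters):
        # r(i,k) <- s(i,k) - max_{k' != k} [a(i,k') + s(i,k')]
        M = A + S
        best = np.argmax(M, axis=1)
        first = M[idx, best].copy()
        M[idx, best] = -np.inf
        second = M.max(axis=1)
        Rnew = S - first[:, None]
        Rnew[idx, best] = S[idx, best] - second
        R = damping * R + (1 - damping) * Rnew
        # a(i,k) <- min(0, r(k,k) + sum_{i' not in {i,k}} max(0, r(i',k)))
        P = np.maximum(R, 0)
        np.fill_diagonal(P, R.diagonal())
        col = P.sum(axis=0)
        Anew = np.minimum(0.0, col[None, :] - P)
        Anew[idx, idx] = col - R.diagonal()
        A = damping * A + (1 - damping) * Anew
    return np.flatnonzero((A + R).diagonal() > 0)

# Toy stand-in for filter similarities: two tight groups of "filters"
pts = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
S = -((pts[:, None, :] - pts[None, :, :]) ** 2).sum(-1)
np.fill_diagonal(S, np.median(S[~np.eye(len(pts), dtype=bool)]))
exemplars = affinity_propagation(S)
```

The number of exemplars is not fixed in advance; it emerges from the preferences on the diagonal, which is exactly what lets EPruner adapt the number of preserved filters per layer.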
5. Hu SG, Qiao GC, Liu XK, Liu YH, Zhang CM, Zuo Y, Zhou P, Liu YA, Ning N, Yu Q, Liu Y. A Co-Designed Neuromorphic Chip With Compact (17.9K F²) and Weak Neuron Number-Dependent Neuron/Synapse Modules. IEEE Transactions on Biomedical Circuits and Systems 2022; 16:1250-1260. PMID: 36150001. DOI: 10.1109/tbcas.2022.3209073.
Abstract
Many efforts have been made to improve neuron integration efficiency on neuromorphic chips, such as using emerging memory devices and shrinking CMOS technology nodes. However, in a fully connected (FC) neuromorphic core, increasing the number of neurons leads to a quadratic increase in synapse and dendrite costs and a high-slope linear increase in soma costs, resulting in explosive growth of the core hardware cost. We propose a co-designed neuromorphic core (SRCcore) based on quantized spiking neural network (SNN) technology and a compact chip design methodology. The cost of the neuron/synapse module in SRCcore depends only weakly on the neuron number, which effectively relieves the pressure on core area caused by increasing the neuron number. In the proposed BICS chip based on SRCcore, although the neuron/synapse module implements 1∼16 times the neurons and 1∼66 times the synapses of previous works, it only costs an area of 1.79 × 10⁷ F², which is 7.9%∼38.6% of that in previous works. Based on a weight quantization strategy matched to SRCcore, the quantized SNNs achieve 0.05%∼2.19% higher accuracy than previous works, thus supporting the design and application of SRCcore. Finally, a cross-modeling application is demonstrated based on the chip. We hope this work will accelerate the development of cortical-scale neuromorphic systems.
6. Lin M, Cao L, Li S, Ye Q, Tian Y, Liu J, Tian Q, Ji R. Filter Sketch for Network Pruning. IEEE Transactions on Neural Networks and Learning Systems 2022; 33:7091-7100. PMID: 34125685. DOI: 10.1109/tnnls.2021.3084206.
Abstract
We propose a novel network pruning approach based on preserving the information of pretrained network weights (filters). Network pruning with information preserving is formulated as a matrix sketch problem, which is efficiently solved by the off-the-shelf Frequent Directions method. Our approach, referred to as FilterSketch, encodes the second-order information of pretrained weights, which enables the representation capacity of pruned networks to be recovered with a simple fine-tuning procedure. FilterSketch requires neither training from scratch nor data-driven iterative optimization, leading to a reduction of several orders of magnitude in the time cost of pruning optimization. Experiments on CIFAR-10 show that FilterSketch reduces 63.3% of floating-point operations (FLOPs) and prunes 59.9% of network parameters with negligible accuracy cost for ResNet-110. On ILSVRC-2012, it reduces 45.5% of FLOPs and removes 43.0% of parameters with only a 0.69% accuracy drop for ResNet-50. Our code and pruned models can be found at https://github.com/lmbxmu/FilterSketch.
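The sketching primitive FilterSketch relies on, the Frequent Directions method, maintains a small sketch B whose Gram matrix approximates that of the streamed rows. A compact sketch of the algorithm (illustrative, not the authors' implementation), with the standard covariance-error guarantee checked below:

```python
import numpy as np

def frequent_directions(A, ell):
    """Sketch the rows of A (n x d, ell <= d) into B (ell x d) so that
    ||A.T @ A - B.T @ B||_2 <= 2 * ||A||_F**2 / ell."""
    n, d = A.shape
    B = np.zeros((ell, d))
    for row in A:
        B[-1] = row                      # the last row is always zero here
        U, s, Vt = np.linalg.svd(B, full_matrices=False)
        delta = s[ell // 2] ** 2         # shrink by the median energy
        s = np.sqrt(np.maximum(s ** 2 - delta, 0.0))
        B = s[:, None] * Vt              # bottom half collapses to zero rows
    return B

rng = np.random.default_rng(0)
A = rng.standard_normal((40, 8))         # rows stand in for flattened filters
B = frequent_directions(A, ell=6)
err = np.linalg.norm(A.T @ A - B.T @ B, 2)
bound = 2 * np.linalg.norm(A) ** 2 / 6
```

The deterministic error bound is what makes the sketch usable as a stand-in for the full second-order statistics of the filters.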
7. Fei W, Dai W, Li C, Zou J, Xiong H. General Bitwidth Assignment for Efficient Deep Convolutional Neural Network Quantization. IEEE Transactions on Neural Networks and Learning Systems 2022; 33:5253-5267. PMID: 33830929. DOI: 10.1109/tnnls.2021.3069886.
Abstract
Model quantization is essential to deploy deep convolutional neural networks (DCNNs) on resource-constrained devices. In this article, we propose a general bitwidth assignment algorithm based on theoretical analysis for efficient layerwise weight and activation quantization of DCNNs. The proposed algorithm develops a prediction model to explicitly estimate the loss of classification accuracy led by weight quantization with a geometrical approach. Consequently, dynamic programming is adopted to achieve optimal bitwidth assignment on weights based on the estimated error. Furthermore, we optimize bitwidth assignment for activations by considering the signal-to-quantization-noise ratio (SQNR) between weight and activation quantization. The proposed algorithm is general to reveal the tradeoff between classification accuracy and model size for various network architectures. Extensive experiments demonstrate the efficacy of the proposed bitwidth assignment algorithm and the error rate prediction model. Furthermore, the proposed algorithm is shown to be well extended to object detection.
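The layerwise assignment step described above, choosing a bitwidth per layer to minimize a predicted accuracy loss under a model-size budget, is naturally a knapsack-style dynamic program. The error and size tables below are hypothetical placeholders for the paper's prediction model:

```python
def assign_bitwidths(err, sizes, budget):
    """Choose one bitwidth option per layer, minimizing the summed predicted
    accuracy loss subject to an integer model-size budget.

    err[l][b]  : predicted loss if layer l uses bitwidth option b
    sizes[l][b]: size cost (e.g., in KB) of that option
    Returns (total_loss, chosen option per layer), or None if infeasible.
    """
    states = {0: (0.0, [])}          # used size -> (best loss, choices)
    for e_row, s_row in zip(err, sizes):
        nxt = {}
        for used, (tot, picks) in states.items():
            for b, (e, s) in enumerate(zip(e_row, s_row)):
                u = used + s
                if u > budget:
                    continue
                if u not in nxt or tot + e < nxt[u][0]:
                    nxt[u] = (tot + e, picks + [b])
        states = nxt
    return min(states.values(), key=lambda v: v[0]) if states else None

# Hypothetical per-layer tables for options (2, 4, 8 bits)
loss, picks = assign_bitwidths(
    err=[[0.5, 0.2, 0.05], [0.4, 0.1, 0.02]],
    sizes=[[1, 2, 4], [1, 2, 4]],
    budget=6,
)
```

Keying the states on the exact size used so far is valid because future feasibility depends only on the remaining budget, not on which layers consumed it.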
8. Hu Y, Wen G, Luo M, Dai D, Cao W, Yu Z, Hall W. Inner-Imaging Networks: Put Lenses Into Convolutional Structure. IEEE Transactions on Cybernetics 2022; 52:8547-8560. PMID: 34398768. DOI: 10.1109/tcyb.2020.3034605.
Abstract
Despite the tremendous success in computer vision, deep convolutional networks suffer from serious computation costs and redundancies. Although previous works address that by enhancing the diversities of filters, they have not considered the complementarity and the completeness of the internal convolutional structure. To address this problem, we propose a novel inner-imaging (InI) architecture, which allows relationships between channels to meet the above requirement. Specifically, we organize the channel signal points in groups using convolutional kernels to model both the intragroup and intergroup relationships simultaneously. A convolutional filter is a powerful tool for modeling spatial relations and organizing grouped signals, so the proposed methods map the channel signals onto a pseudoimage, like putting a lens into the internal convolution structure. Consequently, not only is the diversity of channels increased but also the complementarity and completeness can be explicitly enhanced. The proposed architecture is lightweight and easy to implement. It provides an efficient self-organization strategy for convolutional networks to improve their performance. Extensive experiments are conducted on multiple benchmark datasets, including CIFAR, SVHN, and ImageNet. Experimental results verify the effectiveness of the InI mechanism with the most popular convolutional networks as the backbones.
9. Verma S, Wang C, Zhu L, Liu W. Attn-HybridNet: Improving Discriminability of Hybrid Features With Attention Fusion. IEEE Transactions on Cybernetics 2022; 52:6567-6578. PMID: 33739927. DOI: 10.1109/tcyb.2021.3060176.
Abstract
The principal component analysis network (PCANet) is an unsupervised deep network, utilizing principal components as convolution filters in its layers. Albeit powerful, the PCANet suffers from two fundamental problems responsible for its performance degradation. First, the principal components transform the data as column vectors (which we call the amalgamated view) and incur a loss of spatial information present in the data. Second, the generalized pooling in the PCANet is unable to incorporate spatial statistics of the natural images, and it also induces redundancy among the features. In this research, we first propose a tensor-factorization-based deep network called the tensor factorization network (TFNet). The TFNet extracts features by preserving the spatial view of the data (which we call the minutiae view). We then propose HybridNet, which simultaneously extracts information with the two views of the data since their integration can improve the performance of classification systems. Finally, to alleviate the feature redundancy among hybrid features, we propose Attn-HybridNet to perform attention-based feature selection and fusion to improve their discriminability. Classification results on multiple real-world datasets using features extracted by our proposed Attn-HybridNet achieve significantly better performance over other popular baseline methods, demonstrating the effectiveness of the proposed techniques.
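The PCANet idea this work starts from, using principal components of image patches as convolution filters, can be sketched in a few lines. This is a simplified single-stage version; the patch size and filter count are illustrative:

```python
import numpy as np

def pca_filters(images, k=7, n_filters=4):
    """PCANet-style filter learning: gather all k x k patches, remove each
    patch's mean, and take the leading principal components as filters."""
    patches = []
    for img in images:
        H, W = img.shape
        for i in range(H - k + 1):
            for j in range(W - k + 1):
                p = img[i:i + k, j:j + k].ravel()
                patches.append(p - p.mean())       # per-patch mean removal
    X = np.stack(patches)                           # (num_patches, k*k)
    cov = X.T @ X / len(X)
    _, vecs = np.linalg.eigh(cov)                   # eigenvalues ascending
    top = vecs[:, ::-1][:, :n_filters]              # leading components
    return top.T.reshape(n_filters, k, k)

rng = np.random.default_rng(1)
filters = pca_filters(rng.standard_normal((2, 12, 12)), k=7, n_filters=4)
```

Because the filters come from an eigendecomposition, they are orthonormal, which is a useful sanity check on any implementation.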
10. Jin X, Xie Y, Wei XS, Zhao BR, Zhang Y, Tan X, Yu Y. A Lightweight Encoder-Decoder Path for Deep Residual Networks. IEEE Transactions on Neural Networks and Learning Systems 2022; 33:866-878. PMID: 33180736. DOI: 10.1109/tnnls.2020.3029613.
Abstract
In this article, we present a novel lightweight path for deep residual neural networks. The proposed method integrates a simple plug-and-play module, i.e., a convolutional encoder-decoder (ED), as an augmented path to the original residual building block. Due to the abstract design and ability of the encoding stage, the decoder part tends to generate feature maps where highly semantically relevant responses are activated, while irrelevant responses are restrained. By a simple elementwise addition operation, the learned representations derived from the identity shortcut and original transformation branch are enhanced by our ED path. Furthermore, we exploit lightweight counterparts by removing a portion of channels in the original transformation branch. Fortunately, our lightweight processing does not cause an obvious performance drop but yields computational savings. By conducting comprehensive experiments on ImageNet, MS-COCO, CUB200-2011, and CIFAR, we demonstrate the consistent accuracy gain obtained by our ED path for various residual architectures, with comparable or even lower model complexity. Concretely, it decreases the top-1 error of ResNet-50 and ResNet-101 by 1.22% and 0.91% on the task of ImageNet classification and increases the mmAP of Faster R-CNN with ResNet-101 by 2.5% on the MS-COCO object detection task. The code is available at https://github.com/Megvii-Nanjing/ED-Net.
12. Xu TB, Liu CL. Deep Neural Network Self-Distillation Exploiting Data Representation Invariance. IEEE Transactions on Neural Networks and Learning Systems 2022; 33:257-269. PMID: 33074828. DOI: 10.1109/tnnls.2020.3027634.
Abstract
To harvest small networks with high accuracies, most existing methods mainly utilize compression techniques such as low-rank decomposition and pruning to compress a trained large model into a small network, or transfer knowledge from a powerful large model (teacher) to a small network (student). Despite their success in generating small models of high performance, the dependence on accompanying assistive models complicates the training process and increases memory and time costs. In this article, we propose an elegant self-distillation (SD) mechanism to obtain high-accuracy models directly without going through an assistive model. Inspired by invariant recognition in the human visual system, different distorted instances of the same input should possess similar high-level data representations. Thus, we can learn data representation invariance between different distorted versions of the same sample. Specifically, in our SD-based learning algorithm, the single network utilizes the maximum mean discrepancy metric to learn global feature consistency and the Kullback-Leibler divergence to constrain posterior class probability consistency across the different distorted branches. Extensive experiments on MNIST, CIFAR-10/100, and ImageNet data sets demonstrate that the proposed method can effectively reduce the generalization error for various network architectures, such as AlexNet, VGGNet, ResNet, Wide ResNet, and DenseNet, and outperform existing model distillation methods with little extra training effort.
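The two consistency terms described here can be sketched directly: a Kullback-Leibler term on class posteriors and a maximum mean discrepancy term on global features across two distorted branches. For brevity the sketch uses a linear-kernel MMD, which is an assumption rather than the paper's kernel choice:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def kl_consistency(logits_a, logits_b, eps=1e-12):
    """Mean KL(p_a || p_b) between class posteriors of two distorted branches."""
    p, q = softmax(logits_a), softmax(logits_b)
    return float(np.mean((p * (np.log(p + eps) - np.log(q + eps))).sum(axis=1)))

def mmd_linear(feat_a, feat_b):
    """Linear-kernel MMD between the branches' global feature batches."""
    d = feat_a.mean(axis=0) - feat_b.mean(axis=0)
    return float(d @ d)

# In training, za/zb would come from two augmentations of the same batch;
# random stand-ins here
rng = np.random.default_rng(2)
za, zb = rng.standard_normal((8, 10)), rng.standard_normal((8, 10))
```

Both terms vanish when the two branches agree exactly, so minimizing them pushes the network toward representation invariance across distortions.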
13. Bao T, Zaidi SAR, Xie S, Yang P, Zhang ZQ. Inter-Subject Domain Adaptation for CNN-Based Wrist Kinematics Estimation Using sEMG. IEEE Transactions on Neural Systems and Rehabilitation Engineering 2021; 29:1068-1078. PMID: 34086574. DOI: 10.1109/tnsre.2021.3086401.
Abstract
Recently, convolutional neural network (CNN) has been widely investigated to decode human intentions using surface Electromyography (sEMG) signals. However, a pre-trained CNN model usually suffers from severe degradation when testing on a new individual, and this is mainly due to domain shift where characteristics of training and testing sEMG data differ substantially. To enhance inter-subject performances of CNN in the wrist kinematics estimation, we propose a novel regression scheme for supervised domain adaptation (SDA), based on which domain shift effects can be effectively reduced. Specifically, a two-stream CNN with shared weights is established to exploit source and target sEMG data simultaneously, such that domain-invariant features can be extracted. To tune CNN weights, both regression losses and a domain discrepancy loss are employed, where the former enable supervised learning and the latter minimizes distribution divergences between two domains. In this study, eight healthy subjects were recruited to perform wrist flexion-extension movements. Experimental results illustrated that the proposed regression SDA outperformed fine-tuning, a state-of-the-art transfer learning method, in both single-single and multiple-single scenarios of kinematics estimation. Unlike fine-tuning which suffers from catastrophic forgetting, regression SDA can maintain much better performances in original domains, which boosts the model reusability among multiple subjects.
14. Wang P, He X, Chen Q, Cheng A, Liu Q, Cheng J. Unsupervised Network Quantization via Fixed-Point Factorization. IEEE Transactions on Neural Networks and Learning Systems 2021; 32:2706-2720. PMID: 32706647. DOI: 10.1109/tnnls.2020.3007749.
Abstract
The deep neural network (DNN) has achieved remarkable performance in a wide range of applications at the cost of huge memory and computational complexity. Fixed-point network quantization emerges as a popular acceleration and compression method but still suffers from huge performance degradation when extremely low-bit quantization is utilized. Moreover, current fixed-point quantization methods rely heavily on supervised retraining using large amounts of the labeled training data, while labeled data are hard to obtain in real-world applications. In this article, we propose an efficient framework, namely, fixed-point factorized network (FFN), to turn all weights into ternary values, i.e., {-1, 0, 1}. We highlight that the proposed FFN framework can achieve negligible degradation even without any supervised retraining on the labeled data. Note that the activations can be easily quantized into an 8-bit format; thus, the resulting networks only have low-bit fixed-point additions that are significantly more efficient than 32-bit floating-point multiply-accumulate operations (MACs). Extensive experiments on large-scale ImageNet classification and object detection on MS COCO show that the proposed FFN can achieve more than 20× compression and remove most of the multiply operations with comparable accuracy. Codes are available on GitHub at https://github.com/wps712/FFN.
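FFN obtains its ternary weights through fixed-point factorization; as a simpler point of comparison, the sketch below ternarizes a weight tensor with a TWN-style magnitude threshold (explicitly not the paper's factorization method) to show what an alpha-scaled {-1, 0, 1} representation looks like:

```python
import numpy as np

def ternarize(W, t=0.7):
    """Map weights to alpha * {-1, 0, 1} with a magnitude threshold
    delta = t * mean|W| (a TWN-style heuristic)."""
    delta = t * np.abs(W).mean()
    T = (np.sign(W) * (np.abs(W) > delta)).astype(np.int8)
    mask = T != 0
    alpha = float(np.abs(W[mask]).mean()) if mask.any() else 0.0
    return alpha, T

rng = np.random.default_rng(3)
W = rng.standard_normal((16, 16))
alpha, T = ternarize(W)
```

With weights restricted to {-1, 0, 1}, multiply-accumulates reduce to additions and subtractions, which is the source of the efficiency gain the abstract describes.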
15. Zhang Y, Cui M, Shen L, Zeng Z. Memristive Quantized Neural Networks: A Novel Approach to Accelerate Deep Learning On-Chip. IEEE Transactions on Cybernetics 2021; 51:1875-1887. PMID: 31059463. DOI: 10.1109/tcyb.2019.2912205.
Abstract
Existing deep neural networks (DNNs) are computationally expensive and memory intensive, which hinders their further deployment in novel nanoscale devices and applications with lower memory resources or strict latency requirements. In this paper, a novel approach to accelerate on-chip learning systems using memristive quantized neural networks (M-QNNs) is presented. A real problem of multilevel memristive synaptic weights due to device-to-device (D2D) and cycle-to-cycle (C2C) variations is considered. Different levels of Gaussian noise are added to the memristive model during each adjustment. Another method of using memristors with binary states to build M-QNNs is presented, which suffers from fewer D2D and C2C variations compared with using multilevel memristors. Furthermore, methods of solving the sneak path issues in the memristive crossbar arrays are proposed. The M-QNN approach is evaluated on two image classification data sets, that is, ten-digit number images and the handwritten images of the Modified National Institute of Standards and Technology (MNIST) data set. In addition, input images with different levels of zero-mean Gaussian noise are tested to verify the robustness of the proposed method. Another highlight of the proposed method is that it can significantly reduce computational time and memory during the process of image recognition.
16. Liu X, Li L, Wang S, Zha ZJ, Huang Q. Local-binarized very deep residual network for visual categorization. Neurocomputing 2021. DOI: 10.1016/j.neucom.2020.11.041.
17. Chen H, Wang Y, Xu C, Xu C, Tao D. Learning Student Networks via Feature Embedding. IEEE Transactions on Neural Networks and Learning Systems 2021; 32:25-35. PMID: 32092018. DOI: 10.1109/tnnls.2020.2970494.
Abstract
Deep convolutional neural networks have been widely used in numerous applications, but their demanding storage and computational resource requirements prevent their applications on mobile devices. Knowledge distillation aims to optimize a portable student network by taking the knowledge from a well-trained heavy teacher network. Traditional teacher-student-based methods used to rely on additional fully connected layers to bridge intermediate layers of teacher and student networks, which brings in a large number of auxiliary parameters. In contrast, this article aims to propagate information from teacher to student without introducing new variables that need to be optimized. We regard the teacher-student paradigm from a new perspective of feature embedding. By introducing the locality preserving loss, the student network is encouraged to generate the low-dimensional features that could inherit intrinsic properties of their corresponding high-dimensional features from the teacher network. The resulting portable network, thus, can naturally maintain the performance as that of the teacher network. Theoretical analysis is provided to justify the lower computation complexity of the proposed method. Experiments on benchmark data sets and well-trained networks suggest that the proposed algorithm is superior to state-of-the-art teacher-student learning methods in terms of computational and storage complexity.
18. RGB Image Prioritization Using Convolutional Neural Network on a Microprocessor for Nanosatellites. Remote Sensing 2020. DOI: 10.3390/rs12233941.
Abstract
Nanosatellites are being widely used in various missions, including remote sensing applications. However, the difficulty lies in mission operation due to downlink speed limitation in nanosatellites. Considering the global cloud fraction of 67%, retrieving clear images through the limited downlink capacity becomes a larger issue. In order to solve this problem, we propose an image prioritization method based on cloud coverage using CNN. The CNN is designed to be lightweight and to be able to prioritize RGB images for nanosatellite application. As previous CNNs are too heavy for onboard processing, new strategies are introduced to lighten the network. The input size is reduced, and patch decomposition is implemented for reduced memory usage. Replication padding is applied on the first block to suppress border ambiguity in the patches. The depth of the network is reduced for small input size adaptation, and the number of kernels is reduced to decrease the total number of parameters. Lastly, a multi-stream architecture is implemented to suppress the network from optimizing on color features. As a result, the number of parameters was reduced down to 0.4%, and the inference time was reduced down to 4.3% of the original network while maintaining approximately 70% precision. We expect that the proposed method will enhance the downlink capability of clear images in nanosatellites by 112%.
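Two of the lightening strategies mentioned, patch decomposition for reduced memory usage and replication padding to suppress border ambiguity, can be sketched together for a grayscale image. The paper applies replication padding inside the first network block; padding each patch here is a simplification:

```python
import numpy as np

def decompose_into_patches(img, patch=4, pad=1):
    """Split a grayscale image into non-overlapping patches and replication-
    pad each patch border (np.pad mode='edge' repeats edge pixels)."""
    H, W = img.shape
    out = []
    for i in range(0, H, patch):
        for j in range(0, W, patch):
            p = img[i:i + patch, j:j + patch]
            out.append(np.pad(p, ((pad, pad), (pad, pad)), mode="edge"))
    return out

img = np.arange(64, dtype=float).reshape(8, 8)
patches = decompose_into_patches(img)
```

Processing small padded patches one at a time bounds peak memory, which is what makes onboard inference feasible on a microprocessor.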
19. Wu K, Guo Y, Zhang C. Compressing Deep Neural Networks With Sparse Matrix Factorization. IEEE Transactions on Neural Networks and Learning Systems 2020; 31:3828-3838. PMID: 31725393. DOI: 10.1109/tnnls.2019.2946636.
Abstract
Modern deep neural networks (DNNs) are usually overparameterized and composed of a large number of learnable parameters. One of a few effective solutions attempts to compress DNN models via learning sparse weights and connections. In this article, we follow this line of research and present an alternative framework of learning sparse DNNs, with the assistance of matrix factorization. We provide an underlying principle for substituting the original parameter matrices with the multiplications of highly sparse ones, which constitutes the theoretical basis of our method. Experimental results demonstrate that our method substantially outperforms the previous state of the art in compressing various DNNs, giving rich empirical evidence in support of its effectiveness. It is also worth mentioning that, unlike many other works that focus on feedforward networks like multi-layer perceptrons and convolutional neural networks only, we also evaluate our method on a series of recurrent networks in practice.
22. Lin S, Ji R, Li Y, Deng C, Li X. Toward Compact ConvNets via Structure-Sparsity Regularized Filter Pruning. IEEE Transactions on Neural Networks and Learning Systems 2020; 31:574-588. PMID: 30990448. DOI: 10.1109/tnnls.2019.2906563.
Abstract
The success of convolutional neural networks (CNNs) in computer vision applications has been accompanied by a significant increase in computation and memory costs, which prohibits their use in resource-limited environments, such as mobile systems or embedded devices. To this end, research on CNN compression has recently emerged. In this paper, we propose a novel filter pruning scheme, termed structured sparsity regularization (SSR), to simultaneously speed up the computation and reduce the memory overhead of CNNs, which can be well supported by various off-the-shelf deep learning libraries. Concretely, the proposed scheme incorporates two different regularizers of structured sparsity into the original objective function of filter pruning, which fully coordinates the global output and local pruning operations to adaptively prune filters. We further propose an alternative updating with Lagrange multipliers (AULM) scheme to efficiently solve its optimization. AULM follows the principle of the alternating direction method of multipliers (ADMM) and alternates between promoting the structured sparsity of CNNs and optimizing the recognition loss, which leads to a very efficient solver (2.5× faster than the most recent work that directly solves the group sparsity-based regularization). Moreover, by imposing the structured sparsity, online inference is extremely memory-light, since the number of filters and the output feature maps are simultaneously reduced. The proposed scheme has been deployed on a variety of state-of-the-art CNN structures, including LeNet, AlexNet, VGGNet, ResNet, and GoogLeNet, over different data sets. Quantitative results demonstrate that the proposed scheme achieves superior performance over the state-of-the-art methods. We further demonstrate the proposed compression scheme for the task of transfer learning, including domain adaptation and object detection, which also shows exciting performance gains over state-of-the-art filter pruning methods.
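As a rough illustration of the structured-sparsity idea behind filter pruning (a minimal sketch, not the paper's AULM/ADMM solver; the function names and `keep_ratio` parameter are hypothetical), one can treat each filter as a group, penalize the sum of per-filter L2 norms, and zero out the filters with the smallest norms:

```python
import numpy as np

def group_sparsity_penalty(filters):
    """Group-lasso-style penalty: sum of L2 norms, one group per filter."""
    # filters: (num_filters, in_channels, k, k)
    return sum(np.linalg.norm(f) for f in filters)

def prune_filters(filters, keep_ratio=0.5):
    """Keep the filters with the largest L2 norms; zero out the rest."""
    norms = np.array([np.linalg.norm(f) for f in filters])
    n_keep = max(1, int(round(keep_ratio * len(filters))))
    keep = np.argsort(norms)[::-1][:n_keep]
    mask = np.zeros(len(filters), dtype=bool)
    mask[keep] = True
    pruned = filters.copy()
    pruned[~mask] = 0.0  # removed filters also remove their output feature maps
    return pruned, mask
```

Zeroed filters can then be dropped entirely, which is why both the filter count and the output feature maps shrink at inference time.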
Collapse
|
23
|
Leyva R, Sanchez V, Li CT. Compact and Low-Complexity Binary Feature Descriptor and Fisher Vectors for Video Analytics. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2019; 28:6169-6184. [PMID: 31251186 DOI: 10.1109/tip.2019.2922826] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
In this paper, we propose a compact and low-complexity binary feature descriptor for video analytics. Our binary descriptor encodes the motion information of a spatio-temporal support region into a low-dimensional binary string. The descriptor is based on a binning strategy and a construction that separately binarizes the horizontal and vertical motion components of the spatio-temporal support region. We pair our descriptor with a novel Fisher Vector (FV) scheme for binary data, which projects a set of binary features into a fixed-length vector so that the similarity between feature sets can be evaluated. We test the effectiveness of our binary feature descriptor with FVs on action recognition, one of the most challenging tasks in computer vision, as well as on gait recognition and animal behavior clustering. Several experiments on the KTH, UCF50, UCF101, CASIA-B, and TIGdog datasets show that the proposed binary feature descriptor outperforms state-of-the-art feature descriptors in terms of computation time as well as memory and storage requirements. When paired with FVs, the proposed feature descriptor attains very competitive performance, outperforming several state-of-the-art feature descriptors and some methods based on convolutional neural networks.
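The core construction, separate binarization of the horizontal and vertical motion components over a binned support region, can be sketched as follows (a simplified illustration under assumed details: the cell grid, the median threshold, and the function name are all hypothetical, not the paper's exact binning scheme):

```python
import numpy as np

def binary_motion_descriptor(u, v, grid=(2, 2)):
    """Encode a support region's motion as a short binary string.
    u, v: 2D arrays of horizontal/vertical motion (e.g. optical flow).
    The region is split into grid cells; each cell's mean motion, for each
    component separately, is thresholded by the region median -> 1 bit."""
    def bits(comp):
        h, w = comp.shape
        gh, gw = grid
        cells = [comp[i*h//gh:(i+1)*h//gh, j*w//gw:(j+1)*w//gw].mean()
                 for i in range(gh) for j in range(gw)]
        thr = np.median(comp)
        return np.array([1 if c > thr else 0 for c in cells], dtype=np.uint8)
    # concatenating the two component bit strings gives 2 * gh * gw bits
    return np.concatenate([bits(u), bits(v)])
```

Because the output is a short bit string rather than a float vector, sets of such descriptors can be compared cheaply, which is where the binary-data FV scheme comes in.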
Collapse
|
24
|
Passalis N, Tefas A. Training Lightweight Deep Convolutional Neural Networks Using Bag-of-Features Pooling. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2019; 30:1705-1715. [PMID: 30369453 DOI: 10.1109/tnnls.2018.2872995] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Convolutional neural networks (CNNs) are predominantly used for several challenging computer vision tasks, achieving state-of-the-art performance. However, CNNs are complex models that require powerful hardware, both for training and for deployment. To this end, a quantization-based pooling method is proposed in this paper. The proposed method is inspired by the bag-of-features model and can be used for learning more lightweight deep neural networks. Trainable radial basis function neurons are used to quantize the activations of the final convolutional layer, reducing the number of parameters in the network and allowing images of various sizes to be classified natively. The proposed method employs differentiable quantization and aggregation layers, leading to an end-to-end trainable CNN architecture. Furthermore, a fast linear variant of the proposed method is introduced and discussed, providing new insight into convolutional neural architectures. The ability of the proposed method to reduce the size of CNNs and outperform other competitive methods is demonstrated using seven data sets and three different learning tasks (classification, regression, and retrieval).
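The pooling idea can be sketched in NumPy (a minimal illustration, assuming Gaussian RBF memberships and average aggregation; the function name, `sigma` parameter, and codebook shapes are hypothetical, not the paper's exact formulation):

```python
import numpy as np

def bof_pooling(features, codewords, sigma=1.0):
    """Quantize a variable-size set of conv activations against learned
    codewords via RBF memberships, then average-pool into a fixed-length
    histogram (one bin per codeword)."""
    # features: (N, D) activation vectors; N varies with the input image size
    # codewords: (K, D) trainable RBF centers
    d2 = ((features[:, None, :] - codewords[None, :, :]) ** 2).sum(-1)
    sim = np.exp(-d2 / (2 * sigma ** 2))                 # (N, K) RBF responses
    memberships = sim / sim.sum(axis=1, keepdims=True)   # soft quantization
    return memberships.mean(axis=0)                      # (K,) histogram
```

The output length depends only on the number of codewords K, not on N, which is what lets the network natively handle images of different sizes; with every step differentiable, the codewords can be trained end to end.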
Collapse
|