1
|
Sun Y, Li P, Xu H, Wang R. Structural prior-driven feature extraction with gradient-momentum combined optimization for convolutional neural network image classification. Neural Netw 2024; 179:106511. [PMID: 39146718 DOI: 10.1016/j.neunet.2024.106511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Revised: 06/12/2024] [Accepted: 07/03/2024] [Indexed: 08/17/2024]
Abstract
Recent image classification efforts have achieved certain success by incorporating prior information such as labels and logical rules to learn discriminative features. However, these methods overlook the variability of features, resulting in feature inconsistency and fluctuations in model parameter updates, which further contribute to decreased image classification accuracy and model instability. To address this issue, this paper proposes a novel method combining structural prior-driven feature extraction with gradient-momentum (SPGM), from the perspectives of consistent feature learning and precise parameter updates, to enhance the accuracy and stability of image classification. Specifically, SPGM leverages a structural prior-driven feature extraction (SPFE) approach to calculate gradients of multi-level features and original images to construct structural information, which is then transformed into prior knowledge to drive the network to learn features consistent with the original images. Additionally, an optimization strategy integrating gradients and momentum (GMO) is introduced, dynamically adjusting the direction and step size of parameter updates based on the angle and norm of the sum of gradients and momentum, enabling precise model parameter updates. Extensive experiments on CIFAR10 and CIFAR100 datasets demonstrate that the SPGM method significantly reduces the top-1 error rate in image classification, enhances the classification performance, and outperforms state-of-the-art methods.
Collapse
Affiliation(s)
- Yunyun Sun
- School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu, China.
| | - Peng Li
- School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu, China; Jiangsu High Technology Research Key Laboratory for Wireless Sensor Networks, Nanjing, 210023, Jiangsu, China.
| | - He Xu
- School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu, China; Jiangsu High Technology Research Key Laboratory for Wireless Sensor Networks, Nanjing, 210023, Jiangsu, China.
| | - Ruchuan Wang
- School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu, China; Jiangsu High Technology Research Key Laboratory for Wireless Sensor Networks, Nanjing, 210023, Jiangsu, China.
| |
Collapse
|
2
|
Tan C, Zhang J, Liu J. Understanding Short-Range Memory Effects in Deep Neural Networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:10576-10590. [PMID: 37027555 DOI: 10.1109/tnnls.2023.3242969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Stochastic gradient descent (SGD) is of fundamental importance in deep learning. Despite its simplicity, elucidating its efficacy remains challenging. Conventionally, the success of SGD is ascribed to the stochastic gradient noise (SGN) incurred in the training process. Based on this consensus, SGD is frequently treated and analyzed as the Euler-Maruyama discretization of stochastic differential equations (SDEs) driven by either Brownian or Lévy stable motion. In this study, we argue that SGN is neither Gaussian nor Lévy stable. Instead, inspired by the short-range correlation emerging in the SGN series, we propose that SGD can be viewed as a discretization of an SDE driven by fractional Brownian motion (FBM). Accordingly, the different convergence behavior of SGD dynamics is well-grounded. Moreover, the first passage time of an SDE driven by FBM is approximately derived. The result suggests a lower escaping rate for a larger Hurst parameter, and thus, SGD stays longer in flat minima. This happens to coincide with the well-known phenomenon that SGD favors flat minima that generalize well. Extensive experiments are conducted to validate our conjecture, and it is demonstrated that short-range memory effects persist across various model architectures, datasets, and training strategies. Our study opens up a new perspective and may contribute to a better understanding of SGD.
Collapse
|
3
|
Jia X, Feng X, Yong H, Meng D. Weight Decay With Tailored Adam on Scale-Invariant Weights for Better Generalization. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:6936-6947. [PMID: 36279330 DOI: 10.1109/tnnls.2022.3213536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Weight decay (WD) is a fundamental and practical regularization technique in improving generalization of current deep learning models. However, it is observed that the WD does not work effectively for an adaptive optimization algorithm (such as Adam), as it works for SGD. Specifically, the solution found by Adam with the WD often generalizes unsatisfactorily. Though efforts have been made to mitigate this issue, the reason for such deficiency is still vague. In this article, we first show that when using the Adam optimizer, the weight norm increases very fast along with the training procedure, which is in contrast to SGD where the weight norm increases relatively slower and tends to converge. The fast increase of weight norm is adverse to WD; in consequence, the Adam optimizer will lose efficacy in finding solution that generalizes well. To resolve this problem, we propose to tailor Adam by introducing a regularization term on the adaptive learning rate, such that it is friendly to WD. Meanwhile, we introduce first moment on the WD to further enhance the regularization effect. We show that the proposed method is able to find solution with small norm and generalizes better than SGD. We test the proposed method on general image classification and fine-grained image classification tasks with different networks. Experimental results on all these cases substantiate the effectiveness of the proposed method in help improving the generalization. Specifically, the proposed method improves the test accuracy of Adam by a large margin and even improves the performance of SGD by 0.84% on CIFAR 10 and 1.03 % on CIFAR 100 with ResNet-50. The code of this article is public available at xxx.
Collapse
|
4
|
Ye S, Peng Q, Sun W, Xu J, Wang Y, You X, Cheung YM. Discriminative Suprasphere Embedding for Fine-Grained Visual Categorization. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:5092-5102. [PMID: 36107889 DOI: 10.1109/tnnls.2022.3202534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Despite the great success of the existing work in fine-grained visual categorization (FGVC), there are still several unsolved challenges, e.g., poor interpretation and vagueness contribution. To circumvent this drawback, motivated by the hypersphere embedding method, we propose a discriminative suprasphere embedding (DSE) framework, which can provide intuitive geometric interpretation and effectively extract discriminative features. Specifically, DSE consists of three modules. The first module is a suprasphere embedding (SE) block, which learns discriminative information by emphasizing weight and phase. The second module is a phase activation map (PAM) used to analyze the contribution of local descriptors to the suprasphere feature representation, which uniformly highlights the object region and exhibits remarkable object localization capability. The last module is a class contribution map (CCM), which quantitatively analyzes the network classification decision and provides insight into the domain knowledge about classified objects. Comprehensive experiments on three benchmark datasets demonstrate the effectiveness of our proposed method in comparison with state-of-the-art methods.
Collapse
|
5
|
Agarwal V, Lohani MC, Bist AS. A novel deep learning technique for medical image analysis using improved optimizer. Health Informatics J 2024; 30:14604582241255584. [PMID: 38755759 DOI: 10.1177/14604582241255584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/18/2024]
Abstract
Application of Convolutional neural network in spectrum of Medical image analysis are providing benchmark outputs which converges the interest of many researchers to explore it in depth. Latest preprocessing technique Real ESRGAN (Enhanced super resolution generative adversarial network) and GFPGAN (Generative facial prior GAN) are proving their efficacy in providing high resolution dataset. Objective: Optimizer plays a vital role in upgrading the functioning of CNN model. Different optimizers like Gradient descent, Stochastic Gradient descent, Adagrad, Adadelta and Adam etc. are used for classification and segmentation of Medical image but they suffer from slow processing due to their large memory requirement. Stochastic Gradient descent suffers from high variance and is computationally expensive. Dead neuron problem also proves to detrimental to the performance of most of the optimizers. A new optimization technique Gradient Centralization is providing the unparalleled result in terms of generalization and execution time. Method: Our paper explores the next factor which is the employment of new optimization technique, Gradient centralization (GC) to our integrated framework (Model with advanced preprocessing technique). Result and conclusion: Integrated Framework of Real ESRGAN and GFPGAN with Gradient centralization provides an optimal solution for deep learning models in terms of Execution time and Loss factor improvement.
Collapse
|
6
|
Ding Y, Li Z, Liu C, Kang Z, Sun M, Sun J, Chen L. Ballistic Coefficient Calculation Based on Optical Angle Measurements of Space Debris. SENSORS (BASEL, SWITZERLAND) 2023; 23:7668. [PMID: 37765725 PMCID: PMC10538076 DOI: 10.3390/s23187668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 09/02/2023] [Accepted: 09/04/2023] [Indexed: 09/29/2023]
Abstract
Atmospheric drag is an important factor affecting orbit determination and prediction of low-orbit space debris. To obtain accurate ballistic coefficients of space debris, we propose a calculation method based on measured optical angles. Angle measurements of space debris with a perigee height below 1400 km acquired from a photoelectric array were used for orbit determination. Perturbation equations of atmospheric drag were used to calculate the semi-major-axis variation. The ballistic coefficients of space debris were estimated and compared with those published by the North American Aerospace Defense Command in terms of orbit prediction error. The 48 h orbit prediction error of the ballistic coefficients obtained from the proposed method is reduced by 18.65% compared with the published error. Hence, our method seems suitable for calculating space debris ballistic coefficients and supporting related practical applications.
Collapse
Affiliation(s)
- Yigao Ding
- Changchun Observatory, National Astronomical Observatories, Chinese Academy of Sciences, Changchun 130117, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhenwei Li
- Changchun Observatory, National Astronomical Observatories, Chinese Academy of Sciences, Changchun 130117, China
- Key Laboratory of Space Object and Debris Observation, Purple Mountain Observatory, Chinese Academy of Sciences, Nanjing 210008, China
| | - Chengzhi Liu
- Changchun Observatory, National Astronomical Observatories, Chinese Academy of Sciences, Changchun 130117, China
- Key Laboratory of Space Object and Debris Observation, Purple Mountain Observatory, Chinese Academy of Sciences, Nanjing 210008, China
| | - Zhe Kang
- Changchun Observatory, National Astronomical Observatories, Chinese Academy of Sciences, Changchun 130117, China
| | - Mingguo Sun
- Changchun Observatory, National Astronomical Observatories, Chinese Academy of Sciences, Changchun 130117, China
| | - Jiannan Sun
- Changchun Observatory, National Astronomical Observatories, Chinese Academy of Sciences, Changchun 130117, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Long Chen
- Changchun Observatory, National Astronomical Observatories, Chinese Academy of Sciences, Changchun 130117, China
| |
Collapse
|
7
|
Ali HS, Ismail AI, El‐Rabaie EM, Abd El‐Samie FE. Deep residual architectures and ensemble learning for efficient brain tumour classification. EXPERT SYSTEMS 2023; 40. [DOI: 10.1111/exsy.13226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 12/12/2022] [Indexed: 09/02/2023]
Abstract
AbstractThe prompt and accurate detection of brain tumours is essential for disease management and life‐saving. This paper introduces an efficient and robust completely automated system for classifying the three prominent types of brain tumour. The aim is to contribute for enhanced classification accuracy with minimum pre‐processing and less inference time. The power of deep networks is thoroughly investigated, with and without transfer learning. Fine‐tuned deep Residual Networks (ResNets) with depth up to 101 are introduced to manage the complex nature of brain images, and to capture their microstructural information. The proposed residual architectures with their in‐depth representations are evaluated and compared to other fine‐tuned networks (AlexNet, GoogLeNet and VGG16). A novel Convolutional Network (ConvNet) built and trained from scratch is also proposed for tumour type classification. Proven models are integrated by combining their decisions using majority voting to obtain the final classification accuracy. Results show that the residual architectures can be optimized efficiently, and a noticeable accuracy can be gained with them. Although ResNet models are deeper than VGG16, they show lower complexity. Results also indicate that building ensemble of models is a successful strategy to enhance the system performance. Each model in the ensemble learns specific patterns with certain filters. This stochastic nature boosts the classification accuracy. The accuracies obtained from ResNet18, ResNet101, and the proposed ConvNet are 98.91%, 97.39% and 95.43%, respectively. The accuracy based on decision fusion for the three networks is 99.57%, which is better than those of all state‐of‐the‐art techniques. The accuracy obtained with ResNet50 is 98.26%, and its fusion with ResNet18 and the designed network yields a 99.35% accuracy, which is also better than those of previous methods, meanwhile achieving minimum detection time requirements. Finally, visual representation of the learned features is provided to understand what the models have learned.
Collapse
Affiliation(s)
- Hanaa S. Ali
- Electronics & Communication Department, Faculty of Engineering Zagazig University Zagazig Egypt
| | - Asmaa I. Ismail
- Department of Electronics and Electrical Communications, Faculty of Electronic Engineering Menoufia University Menouf Egypt
| | - El‐Sayed M. El‐Rabaie
- Department of Electronics and Electrical Communications, Faculty of Electronic Engineering Menoufia University Menouf Egypt
| | - Fathi E. Abd El‐Samie
- Department of Electronics and Electrical Communications, Faculty of Electronic Engineering Menoufia University Menouf Egypt
| |
Collapse
|
8
|
Hong JW, Kim SH, Han GT. Detection of Multiple Respiration Patterns Based on 1D SNN from Continuous Human Breathing Signals and the Range Classification Method for Each Respiration Pattern. SENSORS (BASEL, SWITZERLAND) 2023; 23:s23115275. [PMID: 37300002 DOI: 10.3390/s23115275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 05/27/2023] [Accepted: 05/30/2023] [Indexed: 06/12/2023]
Abstract
Human respiratory information is being used as an important source of biometric information that can enable the analysis of health status in the healthcare domain. The analysis of the frequency or duration of a specific respiration pattern and the classification of respiration patterns in the corresponding section for a certain period of time are important for the utilization of respiratory information in various ways. Existing methods require window slide processing to classify sections for each respiration pattern from the breathing data for a certain time period. In this case, when multiple respiration patterns exist within one window, the recognition rate can be lowered. To solve this problem, a 1D Siamese neural network (SNN)-based human respiration pattern detection model and a merge-and-split algorithm for the classification of multiple respiration patterns in each region for all respiration sections are proposed in this study. When calculating the accuracy based on intersection over union (IOU) for the respiration range classification result for each pattern, the accuracy was found to be improved by approximately 19.3% compared with the existing deep neural network (DNN) and 12.4% compared with a 1D convolutional neural network (CNN). The accuracy of detection based on the simple respiration pattern was approximately 14.5% higher than that of the DNN and 5.3% higher than that of the 1D CNN.
Collapse
Affiliation(s)
- Jin-Woo Hong
- Department of Computer Engineering, Gachon University, Seongnam 13120, Republic of Korea
| | | | - Gi-Tae Han
- Department of Computer Engineering, Gachon University, Seongnam 13120, Republic of Korea
| |
Collapse
|
9
|
Suyun W, Suying Z. Application of big data classification effects based on neural network in video English course and relevant optimization suggestions. Soft comput 2023; 27:7615-7625. [PMID: 37192890 PMCID: PMC10105353 DOI: 10.1007/s00500-023-08123-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/28/2023] [Indexed: 05/18/2023]
Abstract
Due to the improvement of Internet technology and information technology, more and more students hope to learn and consolidate knowledge through video in the classroom. Teachers are more accustomed to using video in the classroom to improve and improve their teaching quality. In the current English class, teachers and students are more accustomed to using video English for teaching. English teaching videos are informative, intuitive and efficient. Through video teaching, we can make the classroom atmosphere more interesting, thus simplifying complex problems. In this context, this paper analyzes how neural networks can improve the application effect of English video courses in the context of big data, optimizes the PDCNO algorithm by using the neural network principle, and then discusses the impact of the optimized PDCNO algorithm on classification and system performance. This improves the accuracy of English video, reduces the execution time of the algorithm and reduces the memory occupation. Compared with ordinary video, the training time required under the same training parameters is shorter, and the convergence speed of the model itself will be faster. From the students' attitude towards video teaching, we can see that students prefer video English teaching, which also reflects the effectiveness of neural network big data in English video teaching. This paper introduces the neural network and big data technology into the video English course to improve the teaching effectiveness.
Collapse
Affiliation(s)
- Wen Suyun
- Department of Public Foreign Languages, Shijiazhuang University of Applied Technology, Shijiazhuang, 050081 Hebei China
| | - Zheng Suying
- Department of International Communication and Culture and Art, Hebei Professional College of Political Science and Law, Shijiazhuang, 050061 Hebei China
| |
Collapse
|
10
|
Abd Rahman NH, Mohamad Zaki MH, Hasikin K, Abd Razak NA, Ibrahim AK, Lai KW. Predicting medical device failure: a promise to reduce healthcare facilities cost through smart healthcare management. PeerJ Comput Sci 2023; 9:e1279. [PMID: 37346641 PMCID: PMC10280478 DOI: 10.7717/peerj-cs.1279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Accepted: 02/15/2023] [Indexed: 06/23/2023]
Abstract
Background The advancement of biomedical research generates myriad healthcare-relevant data, including medical records and medical device maintenance information. The COVID-19 pandemic significantly affects the global mortality rate, creating an enormous demand for medical devices. As information technology has advanced, the concept of intelligent healthcare has steadily gained prominence. Smart healthcare utilises a new generation of information technologies, such as the Internet of Things (loT), big data, cloud computing, and artificial intelligence, to completely transform the traditional medical system. With the intention of presenting the concept of smart healthcare, a predictive model is proposed to predict medical device failure for intelligent management of healthcare services. Methods Present healthcare device management can be improved by proposing a predictive machine learning model that prognosticates the tendency of medical device failures toward smart healthcare. The predictive model is developed based on 8,294 critical medical devices from 44 different types of equipment extracted from 15 healthcare facilities in Malaysia. The model classifies the device into three classes; (i) class 1, where the device is unlikely to fail within the first 3 years of purchase, (ii) class 2, where the device is likely to fail within 3 years from purchase date, and (iii) class 3 where the device is likely to fail more than 3 years after purchase. The goal is to establish a precise maintenance schedule and reduce maintenance and resource costs based on the time to the first failure event. A machine learning and deep learning technique were compared, and the best robust model for smart healthcare was proposed. Results This study compares five algorithms in machine learning and three optimizers in deep learning techniques. The best optimized predictive model is based on ensemble classifier and SGDM optimizer, respectively. An ensemble classifier model produces 77.90%, 87.60%, and 75.39% for accuracy, specificity, and precision compared to 70.30%, 83.71%, and 67.15% for deep learning models. The ensemble classifier model improves to 79.50%, 88.36%, and 77.43% for accuracy, specificity, and precision after significant features are identified. The result concludes although machine learning has better accuracy than deep learning, more training time is required, which is 11.49 min instead of 1 min 5 s when deep learning is applied. The model accuracy shall be improved by introducing unstructured data from maintenance notes and is considered the author's future work because dealing with text data is time-consuming. The proposed model has proven to improve the devices' maintenance strategy with a Malaysian Ringgit (MYR) cost reduction of approximately MYR 326,330.88 per year. Therefore, the maintenance cost would drastically decrease if this smart predictive model is included in the healthcare management system.
Collapse
Affiliation(s)
- Noorul Husna Abd Rahman
- Department of Biomedical Engineering, Universiti Malaya, Lembah Pantai, Wilayah Persekutuan Kuala Lumpur, Malaysia
- Engineering Services Division, Ministry of Health, Putrajaya, Wilayah Persekutuan Putrajaya, Malaysia
| | - Muhammad Hazim Mohamad Zaki
- Department of Biomedical Engineering, Universiti Malaya, Lembah Pantai, Wilayah Persekutuan Kuala Lumpur, Malaysia
| | - Khairunnisa Hasikin
- Department of Biomedical Engineering, Universiti Malaya, Lembah Pantai, Wilayah Persekutuan Kuala Lumpur, Malaysia
- Center of Intelligent Systems for Emerging Technology (CISET), Faculty of Engineering, Universiti Malaya, Lembah Pantai, Wilayah Persekutuan Kuala Lumpur, Malaysia
| | - Nasrul Anuar Abd Razak
- Department of Biomedical Engineering, Universiti Malaya, Lembah Pantai, Wilayah Persekutuan Kuala Lumpur, Malaysia
| | - Ayman Khaleel Ibrahim
- Faculty of Computing and Informatics, Universiti Malaysia Sabah, Kota Kinabalu, Sabah, Malaysia
| | - Khin Wee Lai
- Department of Biomedical Engineering, Universiti Malaya, Lembah Pantai, Wilayah Persekutuan Kuala Lumpur, Malaysia
| |
Collapse
|
11
|
An Improved Adam Optimization Algorithm Combining Adaptive Coefficients and Composite Gradients Based on Randomized Block Coordinate Descent. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2023; 2023:4765891. [PMID: 36660559 PMCID: PMC9845049 DOI: 10.1155/2023/4765891] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 12/28/2022] [Accepted: 01/02/2023] [Indexed: 01/12/2023]
Abstract
An improved Adam optimization algorithm combining adaptive coefficients and composite gradients based on randomized block coordinate descent is proposed to address issues of the Adam algorithm such as slow convergence, the tendency to miss the global optimal solution, and the ineffectiveness of processing high-dimensional vectors. The adaptive coefficient is used to adjust the gradient deviation value and correct the search direction firstly. Then, the predicted gradient is introduced, and the current gradient and the first-order momentum are combined to form a composite gradient to improve the global optimization ability. Finally, the random block coordinate method is used to determine the gradient update mode, which reduces the computational overhead. Simulation experiments on two standard datasets for classification show that the convergence speed and accuracy of the proposed algorithm are higher than those of the six gradient descent methods, and the CPU and memory utilization are significantly reduced. In addition, based on logging data, the BP neural networks optimized by six algorithms, respectively, are used to predict reservoir porosity. Results show that the proposed method has lower system overhead, higher accuracy, and stronger stability, and the absolute error of more than 86% data is within 0.1%, which further verifies its effectiveness.
Collapse
|
12
|
Bhakta S, Nandi U, Si T, Ghosal SK, Changdar C, Pal RK. DiffMoment: an adaptive optimization technique for convolutional neural network. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04382-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
13
|
Ensemble of Networks for Multilabel Classification. SIGNALS 2022. [DOI: 10.3390/signals3040054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Multilabel learning goes beyond standard supervised learning models by associating a sample with more than one class label. Among the many techniques developed in the last decade to handle multilabel learning best approaches are those harnessing the power of ensembles and deep learners. This work proposes merging both methods by combining a set of gated recurrent units, temporal convolutional neural networks, and long short-term memory networks trained with variants of the Adam optimization approach. We examine many Adam variants, each fundamentally based on the difference between present and past gradients, with step size adjusted for each parameter. We also combine Incorporating Multiple Clustering Centers and a bootstrap-aggregated decision trees ensemble, which is shown to further boost classification performance. In addition, we provide an ablation study for assessing the performance improvement that each module of our ensemble produces. Multiple experiments on a large set of datasets representing a wide variety of multilabel tasks demonstrate the robustness of our best ensemble, which is shown to outperform the state-of-the-art.
Collapse
|
14
|
A multivariate adaptive gradient algorithm with reduced tuning efforts. Neural Netw 2022; 152:499-509. [DOI: 10.1016/j.neunet.2022.05.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 03/14/2022] [Accepted: 05/15/2022] [Indexed: 11/20/2022]
|
15
|
An open-set framework for underwater image classification using autoencoders. SN APPLIED SCIENCES 2022. [DOI: 10.1007/s42452-022-05105-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022] Open
Abstract
AbstractIn this paper, we mainly intend to address the underwater image classification problem in an open-set scenario. Image classification algorithms have been mostly provided with a small set of species, while there exist lots of species not available to the algorithms or even unknown to ourselves. Thus, we deal with an open-set problem and extremely high false alarm rate in real scenarios, especially in the case of unseen species. Motivated by these challenges, our proposed scheme aims to prevent the unseen species from going to the classifier section. To this end, we introduce a new framework based on convolutional neural networks (CNNs) that automatically identifies various species of fishes and then classifies them into certain classes using a novel technique. In the proposed method, an autoencoder is employed to distinguish between seen and unseen species. To clarify, the autoencoder is trained to reconstruct the available species with high accuracy and filter out species that are not in our training set. In the following, a classifier based on EfficientNet is trained to classify the samples that are accepted by the autoencoder (AE), i.e. the samples that have small reconstruction error. Our proposed method is evaluated in terms of precision, recall, and accuracy and compared to the state-of-the-art methods utilizing WildFish dataset. Simulation results reveal the supremacy of the proposed method.
Collapse
|
16
|
|
17
|
Cui J, Li X, Zhao H, Wang H, Li B, Li X. Epoch-Evolving Gaussian Process Guided Learning for Classification. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; PP:326-337. [PMID: 35604997 DOI: 10.1109/tnnls.2022.3174207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The conventional mini-batch gradient descent algorithms are usually trapped in the local batch-level distribution information, resulting in the ``zig-zag'' effect in the learning process. To characterize the correlation information between the batch-level distribution and the global data distribution, we propose a novel learning scheme called epoch-evolving Gaussian process guided learning (GPGL) to encode the global data distribution information in a non-parametric way. Upon a set of class-aware anchor samples, our GP model is built to estimate the class distribution for each sample in mini-batch through label propagation from the anchor samples to the batch samples. The class distribution, also named the context label, is provided as a complement for the ground-truth one-hot label. Such a class distribution structure has a smooth property and usually carries a rich body of contextual information that is capable of speeding up the convergence process. With the guidance of the context label and ground-truth label, the GPGL scheme provides a more efficient optimization through updating the model parameters with a triangle consistency loss. Furthermore, our GPGL scheme can be generalized and naturally applied to the current deep models, outperforming the state-of-the-art optimization methods on six benchmark datasets.
Collapse
|
18
|
Munishamaiaha K, Rajagopal G, Venkatesan DK, Arif M, Vicoveanu D, Chiuchisan I, Izdrui D, Geman O. Robust Spatial-Spectral Squeeze-Excitation AdaBound Dense Network (SE-AB-Densenet) for Hyperspectral Image Classification. SENSORS 2022; 22:s22093229. [PMID: 35590921 PMCID: PMC9105163 DOI: 10.3390/s22093229] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 04/11/2022] [Accepted: 04/19/2022] [Indexed: 02/06/2023]
Abstract
Increasing importance in the field of artificial intelligence has led to huge progress in remote sensing. Deep learning approaches have made tremendous progress in hyperspectral image (HSI) classification. However, the complexity in classifying the HSI data using a common convolutional neural network is still a challenge. Further, the network architecture becomes more complex when different spatial-spectral feature information is extracted. Usually, CNN has a large number of trainable parameters, which increases the computational complexity of HSI data. In this paper, an optimized squeeze-excitation AdaBound dense network (SE-AB-DenseNet) is designed to emphasize the significant spatial-spectral features of HSI data. The dense network is combined with the AdaBound and squeeze-excitation modules to give lower computation costs and better classification performance. The AdaBound optimizer gives the proposed model the ability to improve its stability and enhance its classification accuracy by approximately 2%. Additionally, the cutout regularization technique is used for HSI spatial-spectral classification to overcome the problem of overfitting. The experiments were carried out on two commonly used hyperspectral datasets (Indian Pines and Salinas). The experiment results on the datasets show a competitive classification accuracy when compared with state-of-the-art methods with limited training samples. From the SE-AB-DenseNet with the cutout model, the overall accuracies for the Indian Pines and Salinas datasets were observed to be 99.37 and 99.78, respectively.
Collapse
Affiliation(s)
- Kavitha Munishamaiaha
- Department of Electronics and Communication Engineering, Sri Venkateswara College of Engineering, Chennai 602117, India;
| | - Gayathri Rajagopal
- Department of Electronics and Communication Engineering, Sri Venkateswara College of Engineering, Chennai 602117, India;
- Correspondence: (G.R.); (O.G.)
| | - Dhilip Kumar Venkatesan
- Department of Computer Science Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai 600062, India;
| | - Muhammad Arif
- Department of Computer Science and Information Technology, The University of Lahore, Lahore 54590, Pakistan;
| | - Dragos Vicoveanu
- Electrical Engineering and Computer Science Faculty, Stefan cel Mare University, 720229 Suceava, Romania; (D.V.); (I.C.); (D.I.)
| | - Iuliana Chiuchisan
- Electrical Engineering and Computer Science Faculty, Stefan cel Mare University, 720229 Suceava, Romania; (D.V.); (I.C.); (D.I.)
| | - Diana Izdrui
- Electrical Engineering and Computer Science Faculty, Stefan cel Mare University, 720229 Suceava, Romania; (D.V.); (I.C.); (D.I.)
| | - Oana Geman
- Electrical Engineering and Computer Science Faculty, Stefan cel Mare University, 720229 Suceava, Romania; (D.V.); (I.C.); (D.I.)
- Correspondence: (G.R.); (O.G.)
| |
Collapse
|
19
|
Mohapatra R, Saha S, Coello CAC, Bhattacharya A, Dhavala SS, Saha S. AdaSwarm: Augmenting Gradient-Based Optimizers in Deep Learning With Swarm Intelligence. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2022. [DOI: 10.1109/tetci.2021.3083428] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
20
|
Xia W, Zheng L, Fang J, Li F, Zhou Y, Zeng Z, Zhang B, Li Z, Li H, Zhu F. PFmulDL: a novel strategy enabling multi-class and multi-label protein function annotation by integrating diverse deep learning methods. Comput Biol Med 2022; 145:105465. [PMID: 35366467 DOI: 10.1016/j.compbiomed.2022.105465] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 03/22/2022] [Accepted: 03/25/2022] [Indexed: 02/06/2023]
Abstract
Bioinformatic annotation of protein function is essential but extremely sophisticated, which asks for extensive efforts to develop effective prediction method. However, the existing methods tend to amplify the representativeness of the families with large number of proteins by misclassifying the proteins in the families with small number of proteins. That is to say, the ability of the existing methods to annotate proteins in the 'rare classes' remains limited. Herein, a new protein function annotation strategy, PFmulDL, integrating multiple deep learning methods, was thus constructed. First, the recurrent neural network was integrated, for the first time, with the convolutional neural network to facilitate the function annotation. Second, a transfer learning method was introduced to the model construction for further improving the prediction performances. Third, based on the latest data of Gene Ontology, the newly constructed model could annotate the largest number of protein families comparing with the existing methods. Finally, this newly constructed model was found capable of significantly elevating the prediction performance for the 'rare classes' without sacrificing that for the 'major classes'. All in all, due to the emerging requirements on improving the prediction performance for the proteins in 'rare classes', this new strategy would become an essential complement to the existing methods for protein function prediction. All the models and source codes are freely available and open to all users at: https://github.com/idrblab/PFmulDL.
Collapse
Affiliation(s)
- Weiqi Xia
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China
| | - Lingyan Zheng
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China; Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China
| | - Jiebin Fang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China
| | - Fengcheng Li
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China
| | - Ying Zhou
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China
| | - Zhenyu Zeng
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China
| | - Bing Zhang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China
| | - Zhaorong Li
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China
| | - Honglin Li
- School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China.
| | - Feng Zhu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China; Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China.
| |
Collapse
|
21
|
Dong H, Zhang H, Wu F, Qiu J, Zhang J, Wang H. A Rectal CT Tumor Segmentation Method Based on Improved U-Net. INT J PATTERN RECOGN 2022. [DOI: 10.1142/s0218001422500069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Automatic and accurate segmentation of tumor area from rectal CT image plays an extremely key role in the treatment and diagnosis of rectal cancer. This paper proposes the MR-U-Net network model. The improvement is that a pair of encoder and decoder is added longitudinally to the U-shaped structure, which is the network structure of the fifth layer, and a residual module is added horizontally to the encoder and decoder of each layer. This model is used to conduct targeted research on the automatic segmentation method of rectal cancer. [H. Gao et al., Rectal tumor segmentation method based on U-Net improved model, J. Comput. Appl.40(8) (2020) 2392–2397] also improved U-Net and used the same dataset as this paper, but the Dice coefficient of all targets was only 83.15%, and the Dice coefficient of small targets was only 87.17%. This paper evaluates the improved MR-U-Net network model with the three indicators of precision, recall and Dice coefficient, and finds that in comparison to Ref. 4 the precision is 95.13%, 2.29% higher than the former work, recall is 94.28%, higher than the former work by 0.34%, Dice coefficient of all targets is 88.45%, increased by 5.3% compared with the former work, and the small targets Dice coefficient is increased by 1.28%, which is the best optimization state of this paper. Experiments show that for datasets with extremely skewed positive and negative samples, the MR-U-Net network structure after improving the hyperparameters in the optimizer can more accurately segment the rectal CT tumor lesion area.
Collapse
Affiliation(s)
- Haowei Dong
- School of Computer and Information Engineering, Nantong Institute of Technology, Nantong, Jiangsu, P. R. China
| | - Haifei Zhang
- School of Computer and Information Engineering, Nantong Institute of Technology, Nantong, Jiangsu, P. R. China
| | - Fang Wu
- School of Computer and Information Engineering, Nantong Institute of Technology, Nantong, Jiangsu, P. R. China
| | - Jianlin Qiu
- School of Computer and Information Engineering, Nantong Institute of Technology, Nantong, Jiangsu, P. R. China
| | - Jian Zhang
- Soochow College, Soochow University, Suzhou, Jiangsu, P. R. China
- Wenzheng College, Soochow University, Suzhou, Jiangsu, P. R. China
| | - Haoyu Wang
- School of Computer Science and Communication Engineering, Jiangsu University, Zhenjiang, Jiangsu, P. R. China
| |
Collapse
|
22
|
Nanni L, Manfè A, Maguolo G, Lumini A, Brahnam S. High performing ensemble of convolutional neural networks for insect pest image detection. ECOL INFORM 2022. [DOI: 10.1016/j.ecoinf.2021.101515] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
23
|
Ilboudo WEL, Kobayashi T, Sugimoto K. Robust Stochastic Gradient Descent With Student-t Distribution Based First-Order Momentum. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:1324-1337. [PMID: 33326388 DOI: 10.1109/tnnls.2020.3041755] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Remarkable achievements by deep neural networks stand on the development of excellent stochastic gradient descent methods. Deep-learning-based machine learning algorithms, however, have to find patterns between observations and supervised signals, even though they may include some noise that hides the true relationship between them, more or less especially in the robotics domain. To perform well even with such noise, we expect them to be able to detect outliers and discard them when needed. We, therefore, propose a new stochastic gradient optimization method, whose robustness is directly built in the algorithm, using the robust student-t distribution as its core idea. We integrate our method to some of the latest stochastic gradient algorithms, and in particular, Adam, the popular optimizer, is modified through our method. The resultant algorithm, called t-Adam, along with the other stochastic gradient methods integrated with our core idea is shown to effectively outperform Adam and their original versions in terms of robustness against noise on diverse tasks, ranging from regression and classification to reinforcement learning problems.
Collapse
|
24
|
Binvignat M, Pedoia V, Butte AJ, Louati K, Klatzmann D, Berenbaum F, Mariotti-Ferrandiz E, Sellam J. Use of machine learning in osteoarthritis research: a systematic literature review. RMD Open 2022; 8:e001998. [PMID: 35296530 PMCID: PMC8928401 DOI: 10.1136/rmdopen-2021-001998] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 02/16/2022] [Indexed: 11/21/2022] Open
Abstract
OBJECTIVE The aim of this systematic literature review was to provide a comprehensive and exhaustive overview of the use of machine learning (ML) in the clinical care of osteoarthritis (OA). METHODS A systematic literature review was performed in July 2021 using MEDLINE PubMed with key words and MeSH terms. For each selected article, the number of patients, ML algorithms used, type of data analysed, validation methods and data availability were collected. RESULTS From 1148 screened articles, 46 were selected and analysed; most were published after 2017. Twelve articles were related to diagnosis, 7 to prediction, 4 to phenotyping, 12 to severity and 11 to progression. The number of patients included ranged from 18 to 5749. Overall, 35% of the articles described the use of deep learning And 74% imaging analyses. A total of 85% of the articles involved knee OA and 15% hip OA. No study investigated hand OA. Most of the studies involved the same cohort, with data from the OA initiative described in 46% of the articles and the MOST and Cohort Hip and Cohort Knee cohorts in 11% and 7%. Data and source codes were described as publicly available respectively in 54% and 22% of the articles. External validation was provided in only 7% of the articles. CONCLUSION This review proposes an up-to-date overview of ML approaches used in clinical OA research and will help to enhance its application in this field.
Collapse
Affiliation(s)
- Marie Binvignat
- Department of Rheumatology, Hôpital Saint-Antoine, Assistance Publique - Hôpitaux de Paris (AP-HP), Centre de Recherche Saint-Antoine, Inserm UMRS_938, Assistance Publique - Hôpitaux de Paris (AP-HP), Sorbonne Universite, Paris, France
- Bakar Computational Health Science Institute, University of California, San Francisco, California, USA
- Immunology Immunopathology Immunotherapy UMRS_959, Sorbonne Universite, Paris, France
| | - Valentina Pedoia
- Center for Intelligent Imaging (CI2), Department of Radiology and Biomedical Imaging, University of California, San Francisco, California, USA
| | - Atul J Butte
- Bakar Computational Health Science Institute, University of California, San Francisco, California, USA
| | - Karine Louati
- Department of Rheumatology, Hôpital Saint-Antoine, Assistance Publique - Hôpitaux de Paris (AP-HP), Centre de Recherche Saint-Antoine, Inserm UMRS_938, Assistance Publique - Hôpitaux de Paris (AP-HP), Sorbonne Universite, Paris, France
| | - David Klatzmann
- Immunology Immunopathology Immunotherapy UMRS_959, Sorbonne Universite, Paris, France
- Biotherapy (CIC-BTi) and Inflammation Immunopathology-Biotherapy Department (i2B), Hôpital Pitié-Salpêtrière, AP-HP, Paris, France
| | - Francis Berenbaum
- Department of Rheumatology, Hôpital Saint-Antoine, Assistance Publique - Hôpitaux de Paris (AP-HP), Centre de Recherche Saint-Antoine, Inserm UMRS_938, Assistance Publique - Hôpitaux de Paris (AP-HP), Sorbonne Universite, Paris, France
| | | | - Jérémie Sellam
- Department of Rheumatology, Hôpital Saint-Antoine, Assistance Publique - Hôpitaux de Paris (AP-HP), Centre de Recherche Saint-Antoine, Inserm UMRS_938, Assistance Publique - Hôpitaux de Paris (AP-HP), Sorbonne Universite, Paris, France
| |
Collapse
|
25
|
CSA-GAN: Cyclic synthesized attention guided generative adversarial network for face synthesis. APPL INTELL 2022. [DOI: 10.1007/s10489-021-03064-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
26
|
Baghel N, Nangia V, Dutta MK. ALSD-Net: Automatic lung sounds diagnosis network from pulmonary signals. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-06302-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
27
|
Liu Z, Liu F, Chen W, Tao Y, Liu X, Zhang F, Shen J, Guan H, Zhen H, Wang S, Chen Q, Chen Y, Hou X. Automatic Segmentation of Clinical Target Volume and Organs-at-Risk for Breast Conservative Radiotherapy Using a Convolutional Neural Network. Cancer Manag Res 2021; 13:8209-8217. [PMID: 34754241 PMCID: PMC8572021 DOI: 10.2147/cmar.s330249] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Accepted: 10/04/2021] [Indexed: 12/14/2022] Open
Abstract
Objective Delineation of clinical target volume (CTV) and organs at risk (OARs) is important for radiotherapy but is time-consuming. We trained and evaluated a U-ResNet model to provide fast and consistent auto-segmentation. Methods We collected 160 patients’ CT scans with breast cancer who underwent breast-conserving surgery (BCS) and were treated with radiotherapy. CTV and OARs were delineated manually and were used for model training. The dice similarity coefficient (DSC) and 95th percentile Hausdorff distance (95HD) were used to assess the performance of our model. CTV and OARs were randomly selected as ground truth (GT) masks, and artificial intelligence (AI) masks were generated by the proposed model. Two clinicians randomly compared CTV score differences of the contour. The consistency between two clinicians was tested. Time cost for auto-delineation was evaluated. Results The mean DSC values of the proposed method were 0.94, 0.95, 0.94, 0.96, 0.96 and 0.93 for breast CTV, contralateral breast, heart, right lung, left lung and spinal cord, respectively. The mean 95HD values were 4.31mm, 3.59mm, 4.86mm, 3.18mm, 2.79mm and 4.37mm for the above structures, respectively. The average CTV scores for AI and GT were 2.89 versus 2.92 when evaluated by oncologist A (P=0.612), and 2.75 versus 2.83 by oncologist B (P=0.213), with no statistically significant differences. The consistency between two clinicians was poor (kappa=0.282). The time for auto-segmentation of CTV and OARs was 10.03 s. Conclusion Our proposed model (U-ResNet) can improve the efficiency and accuracy of delineation compared with U-Net, performing equally well with the segmentation generated by oncologists.
Collapse
Affiliation(s)
- Zhikai Liu
- Department of Radiation Oncology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, 100730, People's Republic of China
| | - Fangjie Liu
- Department of Radiation Oncology, Sun Yat-sen University Cancer Center, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangzhou, 510060, People's Republic of China
| | - Wanqi Chen
- Department of Radiation Oncology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, 100730, People's Republic of China
| | - Yinjie Tao
- Department of Radiation Oncology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, 100730, People's Republic of China
| | - Xia Liu
- Department of Radiation Oncology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, 100730, People's Republic of China
| | - Fuquan Zhang
- Department of Radiation Oncology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, 100730, People's Republic of China
| | - Jing Shen
- Department of Radiation Oncology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, 100730, People's Republic of China
| | - Hui Guan
- Department of Radiation Oncology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, 100730, People's Republic of China
| | - Hongnan Zhen
- Department of Radiation Oncology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, 100730, People's Republic of China
| | - Shaobin Wang
- MedMind Technology Co., Ltd., Beijing, 100055, People's Republic of China
| | - Qi Chen
- MedMind Technology Co., Ltd., Beijing, 100055, People's Republic of China
| | - Yu Chen
- MedMind Technology Co., Ltd., Beijing, 100055, People's Republic of China
| | - Xiaorong Hou
- Department of Radiation Oncology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, 100730, People's Republic of China
| |
Collapse
|
28
|
Abstract
In the last few years, intensive research has been done to enhance artificial intelligence (AI) using optimization techniques. In this paper, we present an extensive review of artificial neural networks (ANNs) based optimization algorithm techniques with some of the famous optimization techniques, e.g., genetic algorithm (GA), particle swarm optimization (PSO), artificial bee colony (ABC), and backtracking search algorithm (BSA) and some modern developed techniques, e.g., the lightning search algorithm (LSA) and whale optimization algorithm (WOA), and many more. The entire set of such techniques is classified as algorithms based on a population where the initial population is randomly created. Input parameters are initialized within the specified range, and they can provide optimal solutions. This paper emphasizes enhancing the neural network via optimization algorithms by manipulating its tuned parameters or training parameters to obtain the best structure network pattern to dissolve the problems in the best way. This paper includes some results for improving the ANN performance by PSO, GA, ABC, and BSA optimization techniques, respectively, to search for optimal parameters, e.g., the number of neurons in the hidden layers and learning rate. The obtained neural net is used for solving energy management problems in the virtual power plant system.
Collapse
|
29
|
Nanni L, Minchio G, Brahnam S, Sarraggiotto D, Lumini A. Closing the Performance Gap between Siamese Networks for Dissimilarity Image Classification and Convolutional Neural Networks. SENSORS (BASEL, SWITZERLAND) 2021; 21:5809. [PMID: 34502700 PMCID: PMC8433960 DOI: 10.3390/s21175809] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 08/18/2021] [Accepted: 08/25/2021] [Indexed: 11/30/2022]
Abstract
In this paper, we examine two strategies for boosting the performance of ensembles of Siamese networks (SNNs) for image classification using two loss functions (Triplet and Binary Cross Entropy) and two methods for building the dissimilarity spaces (FULLY and DEEPER). With FULLY, the distance between a pattern and a prototype is calculated by comparing two images using the fully connected layer of the Siamese network. With DEEPER, each pattern is described using a deeper layer combined with dimensionality reduction. The basic design of the SNNs takes advantage of supervised k-means clustering for building the dissimilarity spaces that train a set of support vector machines, which are then combined by sum rule for a final decision. The robustness and versatility of this approach are demonstrated on several cross-domain image data sets, including a portrait data set, two bioimage and two animal vocalization data sets. Results show that the strategies employed in this work to increase the performance of dissimilarity image classification using SNN are closing the gap with standalone CNNs. Moreover, when our best system is combined with an ensemble of CNNs, the resulting performance is superior to an ensemble of CNNs, demonstrating that our new strategy is extracting additional information.
Collapse
Affiliation(s)
- Loris Nanni
- Department of Information Engineering (DEI), University of Padova, 35131 Padova, Italy; (G.M.); (D.S.)
| | - Giovanni Minchio
- Department of Information Engineering (DEI), University of Padova, 35131 Padova, Italy; (G.M.); (D.S.)
| | - Sheryl Brahnam
- Department of Information Technology and Cybersecurity, Missouri State University, 901 S, National Street, Springfield, MO 65804, USA;
| | - Davide Sarraggiotto
- Department of Information Engineering (DEI), University of Padova, 35131 Padova, Italy; (G.M.); (D.S.)
| | - Alessandra Lumini
- Department of Computer Science and Engineering (DISI), University of Bologna, Via dell’Università 50, 47521 Cesena, Italy;
| |
Collapse
|
30
|
Hybrid Spatial–Temporal Graph Convolutional Networks for On-Street Parking Availability Prediction. REMOTE SENSING 2021. [DOI: 10.3390/rs13163338] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
With the development of sensors and of the Internet of Things (IoT), smart cities can provide people with a variety of information for a more convenient life. Effective on-street parking availability prediction can improve parking efficiency and, at times, alleviate city congestion. Conventional methods of parking availability prediction often do not consider the spatial–temporal features of parking duration distributions. To this end, we propose a parking space prediction scheme called the hybrid spatial–temporal graph convolution networks (HST-GCNs). We use graph convolutional networks and gated linear units (GLUs) with a 1D convolutional neural network to obtain the spatial features and the temporal features, respectively. Then, we construct a spatial–temporal convolutional block to obtain the instantaneous spatial–temporal correlations. Based on the similarity of the parking duration distributions, we propose an attention mechanism called distAtt to measure the similarity of parking duration distributions. Through the distAtt mechanism, we add the long-term spatial–temporal correlations to our spatial–temporal convolutional block, and thus, we can capture complex hybrid spatial–temporal correlations to achieve a higher accuracy of parking availability prediction. Based on real-world datasets, we compare the proposed scheme with the benchmark models. The experimental results show that the proposed scheme has the best performance in predicting the parking occupancy rate.
Collapse
|
31
|
Kim KS, Choi YS. HyAdamC: A New Adam-Based Hybrid Optimization Algorithm for Convolution Neural Networks. SENSORS (BASEL, SWITZERLAND) 2021; 21:4054. [PMID: 34204695 PMCID: PMC8231656 DOI: 10.3390/s21124054] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Revised: 06/07/2021] [Accepted: 06/09/2021] [Indexed: 11/16/2022]
Abstract
As the performance of devices that conduct large-scale computations has been rapidly improved, various deep learning models have been successfully utilized in various applications. Particularly, convolution neural networks (CNN) have shown remarkable performance in image processing tasks such as image classification and segmentation. Accordingly, more stable and robust optimization methods are required to effectively train them. However, the traditional optimizers used in deep learning still have unsatisfactory training performance for the models with many layers and weights. Accordingly, in this paper, we propose a new Adam-based hybrid optimization method called HyAdamC for training CNNs effectively. HyAdamC uses three new velocity control functions to adjust its search strength carefully in term of initial, short, and long-term velocities. Moreover, HyAdamC utilizes an adaptive coefficient computation method to prevent that a search direction determined by the first momentum is distorted by any outlier gradients. Then, these are combined into one hybrid method. In our experiments, HyAdamC showed not only notable test accuracies but also significantly stable and robust optimization abilities when training various CNN models. Furthermore, we also found that HyAdamC could be applied into not only image classification and image segmentation tasks.
Collapse
Affiliation(s)
- Kyung-Soo Kim
- Center for Computational Social Science, Hanyang University, Seoul 04763, Korea;
| | - Yong-Suk Choi
- Department of Computer Science and Engineering, Hanyang University, Seoul 04763, Korea
| |
Collapse
|
32
|
Gradient-Sensitive Optimization for Convolutional Neural Networks. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2021. [DOI: 10.1155/2021/6671830] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Convolutional neural networks (CNNs) are effective models for image classification and recognition. Gradient descent optimization (GD) is the basic algorithm for CNN model optimization. Since GD appeared, a series of improved algorithms have been derived. Among these algorithms, adaptive moment estimation (Adam) has been widely recognized. However, local changes are ignored in Adam to some extent. In this paper, we introduce an adaptive learning rate factor based on current and recent gradients. According to this factor, we can dynamically adjust the learning rate of each independent parameter to adaptively adjust the global convergence process. We use the factor to adjust the learning rate for each parameter. The convergence of the proposed algorithm is proven by using the regret bound approach of the online learning framework. In the experimental section, comparisons are conducted between the proposed algorithm and other existing algorithms, such as AdaGrad, RMSprop, Adam, diffGrad, and AdaHMG, on test functions and the MNIST dataset. The results show that Adam and RMSprop combined with our algorithm can not only find the global minimum faster in the experiment using the test function but also have a better convergence curve and higher test set accuracy in experiments using datasets. Our algorithm is a supplement to the existing gradient descent algorithms, which can be combined with many other existing gradient descent algorithms to improve the efficiency of iteration, speed up the convergence of the cost function, and improve the final recognition rate.
Collapse
|
33
|
Age-group determination of living individuals using first molar images based on artificial intelligence. Sci Rep 2021; 11:1073. [PMID: 33441753 PMCID: PMC7806774 DOI: 10.1038/s41598-020-80182-8] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Accepted: 12/15/2020] [Indexed: 11/14/2022] Open
Abstract
Dental age estimation of living individuals is difficult and challenging, and there is no consensus method in adults with permanent dentition. Thus, we aimed to provide an accurate and robust artificial intelligence (AI)-based diagnostic system for age-group estimation by incorporating a convolutional neural network (CNN) using dental X-ray image patches of the first molars extracted via panoramic radiography. The data set consisted of four first molar images from the right and left sides of the maxilla and mandible of each of 1586 individuals across all age groups, which were extracted from their panoramic radiographs. The accuracy of the tooth-wise estimation was 89.05 to 90.27%. Performance accuracy was evaluated mainly using a majority voting system and area under curve (AUC) scores. The AUC scores ranged from 0.94 to 0.98 for all age groups, which indicates outstanding capacity. The learned features of CNNs were visualized as a heatmap, and revealed that CNNs focus on differentiated anatomical parameters, including tooth pulp, alveolar bone level, or interdental space, depending on the age and location of the tooth. With this, we provided a deeper understanding of the most informative regions distinguished by age groups. The prediction accuracy and heat map analyses support that this AI-based age-group determination model is plausible and useful.
Collapse
|
34
|
Yabushita H, Goto S, Nakamura S, Oka H, Nakayama M, Goto S. Development of Novel Artificial Intelligence to Detect the Presence of Clinically Meaningful Coronary Atherosclerotic Stenosis in Major Branch from Coronary Angiography Video. J Atheroscler Thromb 2020; 28:835-843. [PMID: 33012741 PMCID: PMC8326176 DOI: 10.5551/jat.59675] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Aim:
The clinically meaningful coronary stenosis is diagnosed by trained interventional cardiologists. Whether artificial intelligence (AI) could detect coronary stenosis from CAG video is unclear.
Methods:
The 199 consecutive patients who underwent coronary arteriography (CAG) with chest pain between December 2018 and May 2019 was enrolled. Each patient underwent CAG with multiple view resulting in total numbers of 1,838 videos. A multi-layer 3-dimensional convolution neural network (CNN) was trained as an AI to detect clinically meaningful coronary artery stenosis diagnosed by the expert interventional cardiologist, using data from 146 patients (resulted in 1,359 videos) randomly selected from the entire dataset (training dataset). This training dataset was further split into 109 patients (989 videos) for derivation and 37 patients (370 videos) for validation. The AI developed in derivation cohort was tuned in validation cohort to make final model.
Results:
The final model was selected as the model with best performance in validation dataset. Then, the predictive accuracy of final model was tested with the remaining 53 patients (479 videos) in test dataset. Our AI model showed a c-statistic of 0.61 in validation dataset and 0.61 in test dataset, respectively.
Conclusion:
An artificial intelligence applied to CAG videos could detect clinically meaningful coronary atherosclerotic stenosis diagnosed by expert cardiologists with modest predictive value. Further studies with improved AI at larger sample size is necessary.
Collapse
Affiliation(s)
- Hiroto Yabushita
- Department of Medicine (Cardiology), Tokai University School of Medicine.,Department of Cardiology, New-Tokyo Hospital
| | - Shinichi Goto
- Department of Medicine (Cardiology), Tokai University School of Medicine
| | | | - Hideki Oka
- Department of Medicine (Cardiology), Tokai University School of Medicine
| | - Masamitsu Nakayama
- Department of Medicine (Cardiology), Tokai University School of Medicine
| | - Shinya Goto
- Department of Medicine (Cardiology), Tokai University School of Medicine
| |
Collapse
|
35
|
Hyperparameter Optimization Method Based on Harmony Search Algorithm to Improve Performance of 1D CNN Human Respiration Pattern Recognition System. SENSORS 2020; 20:s20133697. [PMID: 32630344 PMCID: PMC7374394 DOI: 10.3390/s20133697] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Revised: 06/25/2020] [Accepted: 06/29/2020] [Indexed: 11/17/2022]
Abstract
In this study, we propose a method to find an optimal combination of hyperparameters to improve the accuracy of respiration pattern recognition in a 1D (Dimensional) convolutional neural network (CNN). The proposed method is designed to integrate with a 1D CNN using the harmony search algorithm. In an experiment, we used the depth of the convolutional layer of the 1D CNN, the number and size of kernels in each layer, and the number of neurons in the dense layer as hyperparameters for optimization. The experimental results demonstrate that the proposed method provided a recognition rate for five respiration patterns of approximately 96.7% on average, which is an approximately 2.8% improvement over an existing method. In addition, the number of iterations required to derive the optimal combination of hyperparameters was 2,000,000 in the previous study. In contrast, the proposed method required only 3652 iterations.
Collapse
|