1
|
Zhang Y, Huang H, Shen G. Adaptive CL-BFGS Algorithms for Complex-Valued Neural Networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:6313-6327. [PMID: 34995196 DOI: 10.1109/tnnls.2021.3135553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Complex-valued limited-memory BFGS (CL-BFGS) algorithm is efficient for the training of complex-valued neural networks (CVNNs). As an important parameter, the memory size represents the number of saved vector pairs and would essentially affect the performance of the algorithm. However, the determination of a suitable memory size for the CL-BFGS algorithm remains challenging. To deal with this issue, an adaptive method is proposed in which the memory size is allowed to vary during the iteration process. Basically, at each iteration, with the help of multistep quasi-Newton method, an appropriate memory size is chosen from a variable set {1,2, ... , M} by approximating complex Hessian matrix as close as possible. To reduce the computational complexity and ensure desired performance, the upper bound M is adjustable according to the moving average of memory sizes found in previous iterations. The proposed adaptive CL-BFGS (ACL-BFGS) algorithm can be efficiently applied for the training of CVNNs. Moreover, it is suggested to take multiple memory sizes to construct the search direction, which further improves the performance of the ACL-BFGS algorithm. Experimental results on some benchmark problems including the pattern classification, complex function approximation, and nonlinear channel equalization problems are given to illustrate the advantages of the developed algorithms over some previous ones.
Collapse
|
2
|
Novel global polynomial stability criteria of impulsive complex-valued neural networks with multi-proportional delays. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-06555-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
3
|
Xiao J, Jia Y, Jiang X, Wang S. Circular Complex-Valued GMDH-Type Neural Network for Real-Valued Classification Problems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2020; 31:5285-5299. [PMID: 32078563 DOI: 10.1109/tnnls.2020.2966031] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Recently, applications of complex-valued neural networks (CVNNs) to real-valued classification problems have attracted significant attention. However, most existing CVNNs are black-box models with poor explanation performance. This study extends the real-valued group method of data handling (RGMDH)-type neural network to the complex field and constructs a circular complex-valued group method of data handling (C-CGMDH)-type neural network, which is a white-box model. First, a complex least squares method is proposed for parameter estimation. Second, a new complex-valued symmetric regularity criterion is constructed with a logarithmic function to represent explicitly the magnitude and phase of the actual and predicted complex output to evaluate and select the middle candidate models. Furthermore, the property of this new complex-valued external criterion is proven to be similar to that of the real external criterion. Before training this model, a circular transformation is used to transform the real-valued input features to the complex field. Twenty-five real-valued classification data sets from the UCI Machine Learning Repository are used to conduct the experiments. The results show that both RGMDH and C-CGMDH models can select the most important features from the complete feature space through a self-organizing modeling process. Compared with RGMDH, the C-CGMDH model converges faster and selects fewer features. Furthermore, its classification performance is statistically significantly better than the benchmark complex-valued and real-valued models. Regarding time complexity, the C-CGMDH model is comparable with other models in dealing with the data sets that have few features. Finally, we demonstrate that the GMDH-type neural network can be interpretable.
Collapse
|
4
|
Rajasree R, Columbus CC, Shilaja C. Multiscale-based multimodal image classification of brain tumor using deep learning method. Neural Comput Appl 2020. [DOI: 10.1007/s00521-020-05332-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
5
|
Adaptive complex-valued stepsize based fast learning of complex-valued neural networks. Neural Netw 2020; 124:233-242. [DOI: 10.1016/j.neunet.2020.01.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2019] [Revised: 12/05/2019] [Accepted: 01/14/2020] [Indexed: 11/23/2022]
|
6
|
Fully complex conjugate gradient-based neural networks using Wirtinger calculus framework: Deterministic convergence and its application. Neural Netw 2019; 115:50-64. [DOI: 10.1016/j.neunet.2019.02.011] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Revised: 01/15/2019] [Accepted: 02/28/2019] [Indexed: 11/16/2022]
|
7
|
|
8
|
Ozkan E, Vishnevsky V, Goksel O. Inverse Problem of Ultrasound Beamforming With Sparsity Constraints and Regularization. IEEE TRANSACTIONS ON ULTRASONICS, FERROELECTRICS, AND FREQUENCY CONTROL 2018; 65:356-365. [PMID: 28961111 DOI: 10.1109/tuffc.2017.2757880] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Ultrasound (US) beamforming is the process of reconstructing an image from acquired echo traces on several transducer elements. Typical beamforming approaches, such as delay-and-sum, perform simple projection operations, while techniques using statistical information also exist, e.g., adaptive, phase coherence, delay-multiply-and-sum, and sparse coding approaches. Inspired by the feasibility and success of inverse problem (IP) formulations in several image reconstruction problems, such as computed tomography, we herein devise an IP approach for US beamforming. We define a linear forward model for the synthesis of the beamformed image, and solve its IP thanks to several intuitive and physics-based constraints and regularization terms proposed. These reflect the prior knowledge about the spectra of carrier signal and spatial coherence of modulated signal. These constraints admit effective formulation through sparse representations. Our proposed method was evaluated for plane-wave imaging (PWI) that transmits unfocused waves, enabling high frame rates with large field of view at the expense of much lower image quality with conventional beamforming techniques. Results are evaluated in numerical simulations, as well as tissue-mimicking phantoms and in vivo data provided by PWI challenge in medical US. The best results achieved by our proposed techniques are 0.39-mm full-width at half-maximum for spatial resolution and 16.3-dB contrast-to-noise ratio, using a single plane-wave transmit.
Collapse
|
9
|
Isikli Esener I, Ergin S, Yuksel T. A New Feature Ensemble with a Multistage Classification Scheme for Breast Cancer Diagnosis. JOURNAL OF HEALTHCARE ENGINEERING 2017; 2017:3895164. [PMID: 29065592 PMCID: PMC5494793 DOI: 10.1155/2017/3895164] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2017] [Revised: 03/11/2017] [Accepted: 04/06/2017] [Indexed: 11/21/2022]
Abstract
A new and effective feature ensemble with a multistage classification is proposed to be implemented in a computer-aided diagnosis (CAD) system for breast cancer diagnosis. A publicly available mammogram image dataset collected during the Image Retrieval in Medical Applications (IRMA) project is utilized to verify the suggested feature ensemble and multistage classification. In achieving the CAD system, feature extraction is performed on the mammogram region of interest (ROI) images which are preprocessed by applying a histogram equalization followed by a nonlocal means filtering. The proposed feature ensemble is formed by concatenating the local configuration pattern-based, statistical, and frequency domain features. The classification process of these features is implemented in three cases: a one-stage study, a two-stage study, and a three-stage study. Eight well-known classifiers are used in all cases of this multistage classification scheme. Additionally, the results of the classifiers that provide the top three performances are combined via a majority voting technique to improve the recognition accuracy on both two- and three-stage studies. A maximum of 85.47%, 88.79%, and 93.52% classification accuracies are attained by the one-, two-, and three-stage studies, respectively. The proposed multistage classification scheme is more effective than the single-stage classification for breast cancer diagnosis.
Collapse
Affiliation(s)
- Idil Isikli Esener
- Department of Electrical Electronics Engineering, Bilecik Seyh Edebali University, 11210 Bilecik, Turkey
| | - Semih Ergin
- Department of Electrical Electronics Engineering, Eskisehir Osmangazi University, 26480 Eskisehir, Turkey
| | - Tolga Yuksel
- Department of Electrical Electronics Engineering, Bilecik Seyh Edebali University, 11210 Bilecik, Turkey
| |
Collapse
|
10
|
Zhang H, Mandic DP. Is a Complex-Valued Stepsize Advantageous in Complex-Valued Gradient Learning Algorithms? IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2016; 27:2730-2735. [PMID: 26561486 DOI: 10.1109/tnnls.2015.2494361] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Complex gradient methods have been widely used in learning theory, and typically aim to optimize real-valued functions of complex variables. The stepsize of complex gradient learning methods (CGLMs) is a positive number, and little is known about how a complex stepsize would affect the learning process. To this end, we undertake a comprehensive analysis of CGLMs with a complex stepsize, including the search space, convergence properties, and the dynamics near critical points. Furthermore, several adaptive stepsizes are derived by extending the Barzilai-Borwein method to the complex domain, in order to show that the complex stepsize is superior to the corresponding real one in approximating the information in the Hessian. A numerical example is presented to support the analysis.
Collapse
|
11
|
Liu P, Zeng Z, Wang J. Complete stability of delayed recurrent neural networks with Gaussian activation functions. Neural Netw 2016; 85:21-32. [PMID: 27814464 DOI: 10.1016/j.neunet.2016.09.006] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2016] [Revised: 08/13/2016] [Accepted: 09/20/2016] [Indexed: 11/25/2022]
Abstract
This paper addresses the complete stability of delayed recurrent neural networks with Gaussian activation functions. By means of the geometrical properties of Gaussian function and algebraic properties of nonsingular M-matrix, some sufficient conditions are obtained to ensure that for an n-neuron neural network, there are exactly 3k equilibrium points with 0≤k≤n, among which 2k and 3k-2k equilibrium points are locally exponentially stable and unstable, respectively. Moreover, it concludes that all the states converge to one of the equilibrium points; i.e., the neural networks are completely stable. The derived conditions herein can be easily tested. Finally, a numerical example is given to illustrate the theoretical results.
Collapse
Affiliation(s)
- Peng Liu
- School of Automation, Huazhong University of Science and Technology, Wuhan 430074, China; Key Laboratory of Image Processing and Intelligent Control of Education Ministry of China, Wuhan 430074, China.
| | - Zhigang Zeng
- School of Automation, Huazhong University of Science and Technology, Wuhan 430074, China; Key Laboratory of Image Processing and Intelligent Control of Education Ministry of China, Wuhan 430074, China.
| | - Jun Wang
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, Hong Kong.
| |
Collapse
|
12
|
Liang J, Gong W, Huang T. Multistability of complex-valued neural networks with discontinuous activation functions. Neural Netw 2016; 84:125-142. [PMID: 27718391 DOI: 10.1016/j.neunet.2016.08.008] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2016] [Revised: 06/21/2016] [Accepted: 08/23/2016] [Indexed: 11/29/2022]
Abstract
In this paper, based on the geometrical properties of the discontinuous activation functions and the Brouwer's fixed point theory, the multistability issue is tackled for the complex-valued neural networks with discontinuous activation functions and time-varying delays. To address the network with discontinuous functions, Filippov solution of the system is defined. Through rigorous analysis, several sufficient criteria are obtained to assure the existence of 25n equilibrium points. Among them, 9n points are locally stable and 16n-9n equilibrium points are unstable. Furthermore, to enlarge the attraction basins of the 9n equilibrium points, some mild conditions are imposed. Finally, one numerical example is provided to illustrate the effectiveness of the obtained results.
Collapse
Affiliation(s)
- Jinling Liang
- Department of Mathematics, Southeast University, Nanjing 210096, China.
| | - Weiqiang Gong
- Department of Mathematics, Southeast University, Nanjing 210096, China.
| | | |
Collapse
|
13
|
|
14
|
Sivachitra M, Vijayachitra S. A Metacognitive Fully Complex Valued Functional Link Network for solving real valued classification problems. Appl Soft Comput 2015. [DOI: 10.1016/j.asoc.2015.04.022] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
15
|
Abstract
This letter investigates the characteristics of the complex-valued neuron model with parameters represented by polar coordinates (called polar variable complex-valued neuron). The parameters of the polar variable complex-valued neuron are unidentifiable. The plateau phenomenon can occur during learning of the polar variable complex-valued neuron. Furthermore, computer simulations suggest that a single polar variable complex-valued neuron has the following characteristics in the case of using the steepest gradient-descent method with square error: (1) unidentifiable parameters (singular points) degrade the learning speed and (2) a plateau can occur during learning. When the weight is attracted to the singular point, the learning tends to become stuck. However, computer simulations also show that the steepest gradient-descent method with amplitude-phase error and the complex-valued natural gradient method could reduce the effects of the singular points. The learning dynamics near singular points depends on the error functions and the training algorithms used.
Collapse
Affiliation(s)
- Tohru Nitta
- National Institute of Advanced Industrial Science and Technology, Tsukuba, Ibaraki, 305-8568 Japan
| |
Collapse
|
16
|
Sivachitra M, Savitha R, Suresh S, Vijayachitra S. A Fully Complex-valued Fast Learning Classifier (FC-FLC) for real-valued classification problems. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2014.04.075] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
17
|
D S, E S. A CAD system to analyse mammogram images using fully complex-valued relaxation neural network ensembled classifier. J Med Eng Technol 2014; 38:359-66. [DOI: 10.3109/03091902.2014.942041] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
|
18
|
Subramanian K, Savitha R, Suresh S. A complex-valued neuro-fuzzy inference system and its learning mechanism. Neurocomputing 2014. [DOI: 10.1016/j.neucom.2013.06.009] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
19
|
A meta-cognitive interval type-2 fuzzy inference system and its projection based learning algorithm. EVOLVING SYSTEMS 2013. [DOI: 10.1007/s12530-013-9102-9] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|