1
|
Han L, Wang Y, Yao H, Huang X, Yang H, Zhang Q, Xin X. Perturbation dimensional reduction theory assisted low complexity convolutional neural network equalizer for optical communications. OPTICS EXPRESS 2025; 33:15724-15741. [PMID: 40219479 DOI: 10.1364/oe.550530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2024] [Accepted: 03/24/2025] [Indexed: 04/14/2025]
Abstract
The integration of neural networks with perturbation theory has been recognized as a pivotal factor contributing to the enhancement of transmission efficiency in modern optical communication systems. Research focuses on balancing superior equalization performance against system complexity minimization. This paper presents a novel and efficient data dimensionality reduction compression methodology for developing low-complexity neural network-based equalizers intended to mitigate nonlinear impairments in optical fiber communication systems. Firstly, we propose a dimensionality reduction (DR) method exploiting the spatial symmetry of perturbation coefficients, enabling low-complexity DR feature map construction at the input end. Then, a complex-valued CNN (CvCNN) equalizer is then designed, utilizing single-channel complex-valued feature maps to preserve data characteristics while further reducing complexity. The effectiveness of the proposed equalizer is verified through numerical studies, and further performance comparisons and complexity analyses with other equalizers are conducted in a 20 GBaud PDM 64-QAM coherent optical transmission experimental system. The research and analysis are grounded in a fair performance comparison of Q-factors, accompanied by thorough analyses of time and space complexity under equivalent performance conditions. Finally, The DR feature map achieves an 82.64% reduction in effective feature units. When deployed in CvCNN, the equalizer reduces space and time complexity by 66.90% and 70.55%, respectively.
Collapse
|
2
|
Zhang Z, Lei Z, Zhou M, Hasegawa H, Gao S. Complex-Valued Convolutional Gated Recurrent Neural Network for Ultrasound Beamforming. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:5668-5679. [PMID: 38598398 DOI: 10.1109/tnnls.2024.3384314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/12/2024]
Abstract
Ultrasound detection is a potent tool for the clinical diagnosis of various diseases due to its real-time, convenient, and noninvasive qualities. Yet, existing ultrasound beamforming and related methods face a big challenge to improve both the quality and speed of imaging for the required clinical applications. The most notable characteristic of ultrasound signal data is its spatial and temporal features. Because most signals are complex-valued, directly processing them by using real-valued networks leads to phase distortion and inaccurate output. In this study, for the first time, we propose a complex-valued convolutional gated recurrent (CCGR) neural network to handle ultrasound analytic signals with the aforementioned properties. The complex-valued network operations proposed in this study improve the beamforming accuracy of complex-valued ultrasound signals over traditional real-valued methods. Further, the proposed deep integration of convolution and recurrent neural networks makes a great contribution to extracting rich and informative ultrasound signal features. Our experimental results reveal its outstanding imaging quality over existing state-of-the-art methods. More significantly, its ultrafast processing speed of only 0.07 s per image promises considerable clinical application potential. The code is available at https://github.com/zhangzm0128/CCGR.
Collapse
|
3
|
Shah ZH, Müller M, Hübner W, Ortkrass H, Hammer B, Huser T, Schenck W. Image restoration in frequency space using complex-valued CNNs. Front Artif Intell 2024; 7:1353873. [PMID: 39376505 PMCID: PMC11456741 DOI: 10.3389/frai.2024.1353873] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Accepted: 09/03/2024] [Indexed: 10/09/2024] Open
Abstract
Real-valued convolutional neural networks (RV-CNNs) in the spatial domain have outperformed classical approaches in many image restoration tasks such as image denoising and super-resolution. Fourier analysis of the results produced by these spatial domain models reveals the limitations of these models in properly processing the full frequency spectrum. This lack of complete spectral information can result in missing textural and structural elements. To address this limitation, we explore the potential of complex-valued convolutional neural networks (CV-CNNs) for image restoration tasks. CV-CNNs have shown remarkable performance in tasks such as image classification and segmentation. However, CV-CNNs for image restoration problems in the frequency domain have not been fully investigated to address the aforementioned issues. Here, we propose several novel CV-CNN-based models equipped with complex-valued attention gates for image denoising and super-resolution in the frequency domains. We also show that our CV-CNN-based models outperform their real-valued counterparts for denoising super-resolution structured illumination microscopy (SR-SIM) and conventional image datasets. Furthermore, the experimental results show that our proposed CV-CNN-based models preserve the frequency spectrum better than their real-valued counterparts in the denoising task. Based on these findings, we conclude that CV-CNN-based methods provide a plausible and beneficial deep learning approach for image restoration in the frequency domain.
Collapse
Affiliation(s)
- Zafran Hussain Shah
- Center for Applied Data Science, Faculty of Engineering and Mathematics, Bielefeld University of Applied Sciences and Arts, Bielefeld, Germany
| | - Marcel Müller
- Biomolecular Photonics Group, Faculty of Physics, Bielefeld University, Bielefeld, Germany
| | - Wolfgang Hübner
- Biomolecular Photonics Group, Faculty of Physics, Bielefeld University, Bielefeld, Germany
| | - Henning Ortkrass
- Biomolecular Photonics Group, Faculty of Physics, Bielefeld University, Bielefeld, Germany
| | - Barbara Hammer
- CITEC—Center for Cognitive Interaction Technology, Bielefeld University, Bielefeld, Germany
| | - Thomas Huser
- Biomolecular Photonics Group, Faculty of Physics, Bielefeld University, Bielefeld, Germany
| | - Wolfram Schenck
- Center for Applied Data Science, Faculty of Engineering and Mathematics, Bielefeld University of Applied Sciences and Arts, Bielefeld, Germany
| |
Collapse
|
4
|
Bentaleb A, Sintes C, Conze PH, Rousseau F, Guezou-Philippe A, Hamitouche C. Complex Residual Attention U-Net for Fast Ultrasound Imaging from a Single Plane-Wave Equivalent to Diverging Wave Imaging. SENSORS (BASEL, SWITZERLAND) 2024; 24:5111. [PMID: 39204807 PMCID: PMC11360587 DOI: 10.3390/s24165111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Revised: 07/12/2024] [Accepted: 08/05/2024] [Indexed: 09/04/2024]
Abstract
Plane wave imaging persists as a focal point of research due to its high frame rate and low complexity. However, in spite of these advantages, its performance can be compromised by several factors such as noise, speckle, and artifacts that affect the image quality and resolution. In this paper, we propose an attention-based complex convolutional residual U-Net to reconstruct improved in-phase/quadrature complex data from a single insonification acquisition that matches diverging wave imaging. Our approach introduces an attention mechanism to the complex domain in conjunction with complex convolution to incorporate phase information and improve the image quality matching images obtained using coherent compounding imaging. To validate the effectiveness of this method, we trained our network on a simulated phased array dataset and evaluated it using in vitro and in vivo data. The experimental results show that our approach improved the ultrasound image quality by focusing the network's attention on critical aspects of the complex data to identify and separate different regions of interest from background noise.
Collapse
Affiliation(s)
- Ahmed Bentaleb
- Département Image et Traitement de l’Information, Institue Mines-Télécom (IMT) Atlantique, 29200 Brest, France
| | | | | | | | | | | |
Collapse
|
5
|
Sheng W, Liu C, Xiao J, Sun L, Cai Y, Fu HY, Li Q, Ning Liu G. Complex-valued recurrent neural network equalizer with low complexity for a 120-Gbps 50-km optical PAM-4 IM/DD system. OPTICS EXPRESS 2024; 32:27624-27634. [PMID: 39538595 DOI: 10.1364/oe.529318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Accepted: 07/01/2024] [Indexed: 11/16/2024]
Abstract
This paper introduces a novel complex-valued recurrent neural networks equalizer (RNNE) designed for a 120-Gbps, 50-km optical 4-level pulse-amplitude modulation (PAM-4) intensity modulation and direct detection (IM/DD) system. By mapping adjacent symbols of PAM-4 signals onto the complex domain, the correlation between two adjacent symbols of PAM-4 signals can be preserved. Based on experimental results, the proposed complex-valued RNNE outperforms the traditional real-valued RNNE with a 1.38-dB system power budget gain at the 7% overhead forward error correction BER threshold of 3.8 × 10-3. We believe that complex-valued RNNE has an advantage over real-valued RNNE in processing real-valued signals in IM/DD systems.
Collapse
|
6
|
Wang Z, Guo D, Tu Z, Huang Y, Zhou Y, Wang J, Feng L, Lin D, You Y, Agback T, Orekhov V, Qu X. A Sparse Model-Inspired Deep Thresholding Network for Exponential Signal Reconstruction-Application in Fast Biological Spectroscopy. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:7578-7592. [PMID: 35120010 DOI: 10.1109/tnnls.2022.3144580] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
The nonuniform sampling (NUS) is a powerful approach to enable fast acquisition but requires sophisticated reconstruction algorithms. Faithful reconstruction from partially sampled exponentials is highly expected in general signal processing and many applications. Deep learning (DL) has shown astonishing potential in this field, but many existing problems, such as lack of robustness and explainability, greatly limit its applications. In this work, by combining the merits of the sparse model-based optimization method and data-driven DL, we propose a DL architecture for spectra reconstruction from undersampled data, called MoDern. It follows the iterative reconstruction in solving a sparse model to build the neural network, and we elaborately design a learnable soft-thresholding to adaptively eliminate the spectrum artifacts introduced by undersampling. Extensive results on both synthetic and biological data show that MoDern enables more robust, high-fidelity, and ultrafast reconstruction than the state-of-the-art methods. Remarkably, MoDern has a small number of network parameters and is trained on solely synthetic data while generalizing well to biological data in various scenarios. Furthermore, we extend it to an open-access and easy-to-use cloud computing platform (XCloud-MoDern), contributing a promising strategy for further development of biological applications.
Collapse
|
7
|
Huang Y, Zhao J, Wang Z, Orekhov V, Guo D, Qu X. Exponential Signal Reconstruction With Deep Hankel Matrix Factorization. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:6214-6226. [PMID: 34941531 DOI: 10.1109/tnnls.2021.3134717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Exponential function is a basic form of temporal signals, and how to fast acquire this signal is one of the fundamental problems and frontiers in signal processing. To achieve this goal, partial data may be acquired but result in severe artifacts in its spectrum, which is the Fourier transform of exponentials. Thus, reliable spectrum reconstruction is highly expected in the fast data acquisition in many applications, such as chemistry, biology, and medical imaging. In this work, we propose a deep learning method whose neural network structure is designed by imitating the iterative process in the model-based state-of-the-art exponentials' reconstruction method with the low-rank Hankel matrix factorization. With the experiments on synthetic data and realistic biological magnetic resonance signals, we demonstrate that the new method yields much lower reconstruction errors and preserves the low-intensity signals much better than compared methods.
Collapse
|
8
|
Du H, Riddell RP, Wang X. A hybrid complex-valued neural network framework with applications to electroencephalogram (EEG). Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/30/2023]
|
9
|
Gao S, Zhou M, Wang Z, Sugiyama D, Cheng J, Wang J, Todo Y. Fully Complex-Valued Dendritic Neuron Model. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:2105-2118. [PMID: 34487498 DOI: 10.1109/tnnls.2021.3105901] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
A single dendritic neuron model (DNM) that owns the nonlinear information processing ability of dendrites has been widely used for classification and prediction. Complex-valued neural networks that consist of a number of multiple/deep-layer McCulloch-Pitts neurons have achieved great successes so far since neural computing was utilized for signal processing. Yet no complex value representations appear in single neuron architectures. In this article, we first extend DNM from a real-value domain to a complex-valued one. Performance of complex-valued DNM (CDNM) is evaluated through a complex XOR problem, a non-minimum phase equalization problem, and a real-world wind prediction task. Also, a comparative analysis on a set of elementary transcendental functions as an activation function is implemented and preparatory experiments are carried out for determining hyperparameters. The experimental results indicate that the proposed CDNM significantly outperforms real-valued DNM, complex-valued multi-layer perceptron, and other complex-valued neuron models.
Collapse
|
10
|
Yao X, Yang H, Sheng M. Automatic Modulation Classification for Underwater Acoustic Communication Signals Based on Deep Complex Networks. ENTROPY (BASEL, SWITZERLAND) 2023; 25:318. [PMID: 36832684 PMCID: PMC9955885 DOI: 10.3390/e25020318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 02/03/2023] [Accepted: 02/07/2023] [Indexed: 06/18/2023]
Abstract
Automatic modulation classification (AMC) is an important method for monitoring and identifying any underwater communication interference. Since the underwater acoustic communication scenario is full of multi-path fading and ocean ambient noise (OAN), coupled with the application of modern communication technology, which is usually susceptible to environmental influences, automatic modulation classification (AMC) becomes particularly difficult when it comes to an underwater scenario. Motivated by the deep complex networks (DCN), which have an innate ability to process complex data, we explore DCN for AMC of underwater acoustic communication signals. To integrate the signal processing method with deep learning and overcome the influences of underwater acoustic channels, we propose two complex physical signal processing layers based on DCN. The proposed layers include a deep complex matched filter (DCMF) and deep complex channel equalizer (DCCE), which are designed to remove noise and reduce the influence of multi-path fading for the received signals, respectively. Hierarchical DCN is constructed using the proposed method to achieve better performance of AMC. The influence of the real-world underwater acoustic communication scenario is taken into account; two underwater acoustic multi-path fading channels are conducted using the real-world ocean observation dataset, white Gaussian noise, and real-world OAN are used as the additive noise, respectively. Contrastive experiments show that the AMC based on DCN can achieve better performance than the traditional deep neural network based on real value (the average accuracy of the DCN is 5.3% higher than real-valued DNN). The proposed method based on DCN can effectively reduce the influence of underwater acoustic channels and improve the AMC performance in different underwater acoustic channels. The performance of the proposed method was verified on the real-world dataset. In the underwater acoustic channels, the proposed method outperforms a series of advanced AMC method.
Collapse
|
11
|
Sun D, Xi Y, Yaqot A, Hellbrück H, Wu H. Throughput Maximization Using Deep Complex Networks for Industrial Internet of Things. SENSORS (BASEL, SWITZERLAND) 2023; 23:951. [PMID: 36679748 PMCID: PMC9867188 DOI: 10.3390/s23020951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Revised: 01/11/2023] [Accepted: 01/11/2023] [Indexed: 06/17/2023]
Abstract
The high-density Industrial Internet of Things needs to meet the requirements of high-density device access and massive data transmission, which requires the support of multiple-input multiple-output (MIMO) antenna cognitive systems to keep high throughput. In such a system, spectral efficiency (SE) optimization based on dynamic power allocation is an effective way to enhance the network throughput as the channel quality variations significantly affect the spectral efficiency performance. Deep learning methods have illustrated the ability to efficiently solve the non-convexity of resource allocation problems induced by the channel multi-path and inter-user interference effects. However, current real-valued deep-learning-based power allocation methods have failed to utilize the representational capacity of complex-valued data as they regard the complex-valued channel data as two parts: real and imaginary data. In this paper, we propose a complex-valued power allocation network (AttCVNN) with cross-channel and in-channel attention mechanisms to improve the model performance where the former considers the relationship between cognitive users and the primary user, i.e., inter-network users, while the latter focuses on the relationship among cognitive users, i.e., intra-network users. Comparison experiments indicate that the proposed AttCVNN notably outperforms both the equal power allocation method (EPM) and the real-valued and the complex-valued fully connected network (FNN, CVFNN) and shows a better convergence rate in the training phase than the real-valued convolutional neural network (AttCNN).
Collapse
Affiliation(s)
- Danfeng Sun
- Key Laboratory of Discrete Industrial Internet of Things of Zhejiang Province, Hangzhou Dianzi University, Hangzhou 310018, China
| | - Yanlong Xi
- Key Laboratory of Discrete Industrial Internet of Things of Zhejiang Province, Hangzhou Dianzi University, Hangzhou 310018, China
| | - Abdullah Yaqot
- Department of Electrical Engineering and Computer Science, University of Applied Science Lübeck, 23562 Lübeck, Germany
| | - Horst Hellbrück
- Department of Electrical Engineering and Computer Science, University of Applied Science Lübeck, 23562 Lübeck, Germany
| | - Huifeng Wu
- Key Laboratory of Discrete Industrial Internet of Things of Zhejiang Province, Hangzhou Dianzi University, Hangzhou 310018, China
| |
Collapse
|
12
|
Wang R, Liu Z, Zhang B, Guo G, Doermann D. Few-Shot Learning with Complex-Valued Neural Networks and Dependable Learning. Int J Comput Vis 2022. [DOI: 10.1007/s11263-022-01700-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
13
|
Zhang P, Hui W, Wang B, Zhao D, Song D, Lioma C, Simonsen JG. Complex-valued Neural Network-based Quantum Language Models. ACM T INFORM SYST 2022. [DOI: 10.1145/3505138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Abstract
Language modeling is essential in Natural Language Processing and Information Retrieval related tasks. After the statistical language models, Quantum Language Model (QLM) has been proposed to unify both single words and compound terms in the same probability space without extending term space exponentially. Although QLM achieved good performance in ad hoc retrieval, it still has two major limitations: (1) QLM cannot make use of supervised information, mainly due to the iterative and non-differentiable estimation of the density matrix, which represents both queries and documents in QLM. (2) QLM assumes the exchangeability of words or word dependencies, neglecting the order or position information of words.
This article aims to generalize QLM and make it applicable to more complicated matching tasks (e.g., Question Answering) beyond ad hoc retrieval. We propose a complex-valued neural network-based QLM solution called C-NNQLM to employ an end-to-end approach to build and train density matrices in a light-weight and differentiable manner, and it can therefore make use of external well-trained word vectors and supervised labels. Furthermore, C-NNQLM adopts complex-valued word vectors whose phase vectors can directly encode the order (or position) information of words. Note that complex numbers are also essential in the quantum theory. We show that the real-valued NNQLM (R-NNQLM) is a special case of C-NNQLM.
The experimental results on the QA task show that both R-NNQLM and C-NNQLM achieve much better performance than the vanilla QLM, and C-NNQLM’s performance is on par with state-of-the-art neural network models. We also evaluate the proposed C-NNQLM on text classification and document retrieval tasks. The results on most datasets show that the C-NNQLM can outperform R-NNQLM, which demonstrates the usefulness of the complex representation for words and sentences in C-NNQLM.
Collapse
Affiliation(s)
| | | | | | | | - Dawei Song
- Beijing Institute of Technology, Beijing, China
| | | | | |
Collapse
|
14
|
Özcan B, Kınlı F, Kıraç F. Generalization to unseen viewpoint images of objects via alleviated pose attentive capsule agreement. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07900-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
15
|
Quan Y, Lin P, Xu Y, Nan Y, Ji H. Nonblind Image Deblurring via Deep Learning in Complex Field. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:5387-5400. [PMID: 33852398 DOI: 10.1109/tnnls.2021.3070596] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Nonblind image deblurring is about recovering the latent clear image from a blurry one generated by a known blur kernel, which is an often-seen yet challenging inverse problem in imaging. Its key is how to robustly suppress noise magnification during the inversion process. Recent approaches made a breakthrough by exploiting convolutional neural network (CNN)-based denoising priors in the image domain or the gradient domain, which allows using a CNN for noise suppression. The performance of these approaches is highly dependent on the effectiveness of the denoising CNN in removing magnified noise whose distribution is unknown and varies at different iterations of the deblurring process for different images. In this article, we introduce a CNN-based image prior defined in the Gabor domain. The prior not only utilizes the optimal space-frequency resolution and strong orientation selectivity of the Gabor transform but also enables using complex-valued (CV) representations in intermediate processing for better denoising. A CV CNN is developed to exploit the benefits of the CV representations, with better generalization to handle unknown noises over the real-valued ones. Combining our Gabor-domain CV CNN-based prior with an unrolling scheme, we propose a deep-learning-based approach to nonblind image deblurring. Extensive experiments have demonstrated the superior performance of the proposed approach over the state-of-the-art ones.
Collapse
|
16
|
Mehta D, Chen T, Tang T, Hauenstein JD. The Loss Surface of Deep Linear Networks Viewed Through the Algebraic Geometry Lens. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:5664-5680. [PMID: 33822722 DOI: 10.1109/tpami.2021.3071289] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
By using the viewpoint of modern computational algebraic geometry, we explore properties of the optimization landscapes of deep linear neural network models. After providing clarification on the various definitions of "flat" minima, we show that the geometrically flat minima, which are merely artifacts of residual continuous symmetries of the deep linear networks, can be straightforwardly removed by a generalized L2-regularization. Then, we establish upper bounds on the number of isolated stationary points of these networks with the help of algebraic geometry. Combining these upper bounds with a method in numerical algebraic geometry, we find all stationary points for modest depth and matrix size. We demonstrate that, in the presence of the non-zero regularization, deep linear networks can indeed possess local minima which are not global minima. Finally, we show that even though the number of stationary points increases as the number of neurons (regularization parameters) increases (decreases), higher index saddles are surprisingly rare.
Collapse
|
17
|
Yan T, Yang R, Zheng Z, Lin X, Xiong H, Dai Q. All-optical graph representation learning using integrated diffractive photonic computing units. SCIENCE ADVANCES 2022; 8:eabn7630. [PMID: 35704580 PMCID: PMC9200271 DOI: 10.1126/sciadv.abn7630] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 04/29/2022] [Indexed: 05/28/2023]
Abstract
Photonic neural networks perform brain-inspired computations using photons instead of electrons to achieve substantially improved computing performance. However, existing architectures can only handle data with regular structures but fail to generalize to graph-structured data beyond Euclidean space. Here, we propose the diffractive graph neural network (DGNN), an all-optical graph representation learning architecture based on the diffractive photonic computing units (DPUs) and on-chip optical devices to address this limitation. Specifically, the graph node attributes are encoded into strip optical waveguides, transformed by DPUs, and aggregated by optical couplers to extract their feature representations. DGNN captures complex dependencies among node neighborhoods during the light-speed optical message passing over graph structures. We demonstrate the applications of DGNN for node and graph-level classification tasks with benchmark databases and achieve superior performance. Our work opens up a new direction for designing application-specific integrated photonic circuits for high-efficiency processing large-scale graph data structures using deep learning.
Collapse
Affiliation(s)
- Tao Yan
- Department of Automation, Tsinghua University, Beijing 100084, China
- Institute for Brain and Cognitive Sciences, Tsinghua University, Beijing 100084, China
| | - Rui Yang
- Department of Automation, Tsinghua University, Beijing 100084, China
- Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
- Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Ziyang Zheng
- Department of Automation, Tsinghua University, Beijing 100084, China
- Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
- Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Xing Lin
- Institute for Brain and Cognitive Sciences, Tsinghua University, Beijing 100084, China
- Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
- Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084, China
- Beijing Laboratory of Brain and Cognitive Intelligence, Beijing Municipal Education Commission, Beijing 100084, China
| | - Hongkai Xiong
- Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Qionghai Dai
- Department of Automation, Tsinghua University, Beijing 100084, China
- Institute for Brain and Cognitive Sciences, Tsinghua University, Beijing 100084, China
- Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084, China
- Beijing Laboratory of Brain and Cognitive Intelligence, Beijing Municipal Education Commission, Beijing 100084, China
| |
Collapse
|
18
|
Li X, Fang JA, Huang T. Event-Triggered Exponential Stabilization for State-Based Switched Inertial Complex-Valued Neural Networks With Multiple Delays. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:4585-4595. [PMID: 33237870 DOI: 10.1109/tcyb.2020.3031379] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
This article explores the exponential stabilization issue of a class of state-based switched inertial complex-valued neural networks with multiple delays via event-triggered control. First, the state-based switched inertial complex-valued neural networks with multiple delays are modeled. Second, by separating the real and imaginary parts of complex values, the state-based switched inertial complex-valued neural networks are transformed into two state-based switched inertial real-valued neural networks. Through the variable substitution method, the model of the second-order inertial neural networks is transformed into a model of the first-order neural networks. Third, an event-triggered controller with the transmission sequence is designed to study the exponential stabilization issue of neural networks constructed above. Then, by constructing the Lyapunov functions and based on some inequalities, we obtain sufficient conditions for exponential stabilization of the proposed neural networks. Furthermore, it is proved that the Zeno phenomenon cannot happen under the designed event-triggered controller. Finally, a simulation example is given to illustrate the correctness of the results.
Collapse
|
19
|
Zhang SQ, Gao W, Zhou ZH. Towards understanding theoretical advantages of complex-reaction networks. Neural Netw 2022; 151:80-93. [DOI: 10.1016/j.neunet.2022.03.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 01/21/2022] [Accepted: 03/22/2022] [Indexed: 11/17/2022]
|
20
|
Chakraborty R, Xing Y, Yu SX. SurReal: Complex-Valued Learning as Principled Transformations on a Scaling and Rotation Manifold. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:940-951. [PMID: 33170785 DOI: 10.1109/tnnls.2020.3030565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Complex-valued data are ubiquitous in signal and image processing applications, and complex-valued representations in deep learning have appealing theoretical properties. While these aspects have long been recognized, complex-valued deep learning continues to lag far behind its real-valued counterpart. We propose a principled geometric approach to complex-valued deep learning. Complex-valued data could often be subject to arbitrary complex-valued scaling; as a result, real and imaginary components could covary. Instead of treating complex values as two independent channels of real values, we recognize their underlying geometry: we model the space of complex numbers as a product manifold of nonzero scaling and planar rotations. Arbitrary complex-valued scaling naturally becomes a group of transitive actions on this manifold. We propose to extend the property instead of the form of real-valued functions to the complex domain. We define convolution as the weighted Fréchet mean on the manifold that is equivariant to the group of scaling/rotation actions and define distance transform on the manifold that is invariant to the action group. The manifold perspective also allows us to define nonlinear activation functions, such as tangent ReLU and G -transport, as well as residual connections on the manifold-valued data. We dub our model SurReal, as our experiments on MSTAR and RadioML deliver high performance with only a fractional size of real- and complex-valued baseline models.
Collapse
|
21
|
Lu J, Millioz F, Garcia D, Salles S, Ye D, Friboulet D. Complex Convolutional Neural Networks for Ultrafast Ultrasound Imaging Reconstruction From In-Phase/Quadrature Signal. IEEE TRANSACTIONS ON ULTRASONICS, FERROELECTRICS, AND FREQUENCY CONTROL 2022; 69:592-603. [PMID: 34767508 DOI: 10.1109/tuffc.2021.3127916] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Ultrafast ultrasound imaging remains an active area of interest in the ultrasound community due to its ultrahigh frame rates. Recently, a wide variety of studies based on deep learning have sought to improve ultrafast ultrasound imaging. Most of these approaches have been performed on radio frequency (RF) signals. However, in- phase/quadrature (I/Q) digital beamformers are now widely used as low-cost strategies. In this work, we used complex convolutional neural networks for reconstruction of ultrasound images from I/Q signals. We recently described a convolutional neural network architecture called ID-Net, which exploited an inception layer designed for reconstruction of RF diverging-wave ultrasound images. In the present study, we derive the complex equivalent of this network, i.e., complex-valued inception for diverging-wave network (CID-Net) that operates on I/Q data. We provide experimental evidence that CID-Net provides the same image quality as that obtained from RF-trained convolutional neural networks, i.e., using only three I/Q images, CID-Net produces high-quality images that can compete with those obtained by coherently compounding 31 RF images. Moreover, we show that CID-Net outperforms the straightforward architecture that consists of processing real and imaginary parts of the I/Q signal separately, which thereby indicates the importance of consistently processing the I/Q signals using a network that exploits the complex nature of such signals.
Collapse
|
22
|
Goto S, Natsuaki R, Hirose A. Proposal of higher-order tensor independent component analysis for signal separation in multiple-input multiple-output respiration/heartbeat remote sensing . ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2021; 2021:325-328. [PMID: 34891301 DOI: 10.1109/embc46164.2021.9630656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
This paper proposes higher-order tensor independent component analysis (HOT-ICA). HOT-ICA is a tensor ICA that makes effective use of the relationships among the axes of a separating tensor. We deal with multiple-target signal separation in a multiple-input multiple-output (MIMO) radar system to detect respiration and heartbeat. Numerical physical experiments demonstrate the significance of the HOT-ICA which keeps the tensor structure unchanged to fully utilizes the high-dimensionality of the separation tensor.
Collapse
|
23
|
Xiao YL, Li S, Situ G, You Z. Optical random phase dropout in a diffractive deep neural network. OPTICS LETTERS 2021; 46:5260-5263. [PMID: 34653167 DOI: 10.1364/ol.428761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Accepted: 07/04/2021] [Indexed: 06/13/2023]
Abstract
Unitary learning is a backpropagation (BP) method that serves to update unitary weights in fully connected deep complex-valued neural networks, meeting a prior unitary in an active modulation diffractive deep neural network. However, the square matrix characteristic of unitary weights in each layer results in its learning belonging to a small-sample training, which produces an almost useless network that has a fairly poor generalization capability. To alleviate such a serious over-fitting problem, in this Letter, optical random phase dropout is formulated and designed. The equivalence between unitary forward and diffractive networks deduces a synthetic mask that is seamlessly compounded with a computational modulation and a random sampling comb called dropout. The dropout is filled with random phases in its zero positions that satisfy the Bernoulli distribution, which could slightly deflect parts of transmitted optical rays in each output end to generate statistical inference networks. The enhancement of generalization benefits from the fact that massively parallel full connection with different optical links is involved in the training. The random phase comb introduced into unitary BP is in the form of conjugation, which indicates the significance of optical BP.
Collapse
|
24
|
Yang B, Bao W, Zhang W, Wang H, Song C, Chen Y, Jiang X. Reverse engineering gene regulatory network based on complex-valued ordinary differential equation model. BMC Bioinformatics 2021; 22:448. [PMID: 34544363 PMCID: PMC8451084 DOI: 10.1186/s12859-021-04367-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Accepted: 09/09/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The growing researches of molecular biology reveal that complex life phenomena have the ability to demonstrating various types of interactions in the level of genomics. To establish the interactions between genes or proteins and understand the intrinsic mechanisms of biological systems have become an urgent need and study hotspot. RESULTS In order to forecast gene expression data and identify more accurate gene regulatory network, complex-valued version of ordinary differential equation (CVODE) is proposed in this paper. In order to optimize CVODE model, a complex-valued hybrid evolutionary method based on Grammar-guided genetic programming and complex-valued firefly algorithm is presented. CONCLUSIONS When tested on three real gene expression datasets from E. coli and Human Cell, the experiment results suggest that CVODE model could improve 20-50% prediction accuracy of gene expression data, which could also infer more true-positive regulatory relationships and less false-positive regulations than ordinary differential equation.
Collapse
Affiliation(s)
- Bin Yang
- School of Information Science and Engineering, Zaozhuang University, Zaozhuang, 277160, China
| | - Wenzheng Bao
- School of Information and Electrical Engineering, Xuzhou University of Technology, Xuzhou, 221018, China.
| | - Wei Zhang
- School of Information Science and Engineering, Zaozhuang University, Zaozhuang, 277160, China
| | - Haifeng Wang
- School of Information Science and Engineering, Zaozhuang University, Zaozhuang, 277160, China
| | - Chuandong Song
- School of Information Science and Engineering, Zaozhuang University, Zaozhuang, 277160, China
| | - Yuehui Chen
- School of Information Science and Engineering, University of Jinan, Jinan, 250022, China
| | - Xiuying Jiang
- School of Information Science and Engineering, Zaozhuang University, Zaozhuang, 277160, China
| |
Collapse
|
25
|
Chen H, Niu W, Zhao Y, Zhang J, Chi N, Li Z. Adaptive deep-learning equalizer based on constellation partitioning scheme with reduced computational complexity in UVLC system. OPTICS EXPRESS 2021; 29:21773-21782. [PMID: 34265957 DOI: 10.1364/oe.432351] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Accepted: 06/17/2021] [Indexed: 06/13/2023]
Abstract
Visible light communication (VLC) system has emerged as a promising solution for high-speed underwater data transmission. To tackle with the linear and nonlinear impairments, deep learning inspired equalization is introduced into VLC. Despite their success in accuracy, deep learning approaches often come with high computational budget. In this paper, we propose an adaptive deep-learning equalizer based on complex-valued neural network and constellation partitioning scheme for 64 QAM-CAP modulated underwater VLC (UVLC) system. Inspired by the fact that symbols modulated at different levels experience various extent of nonlinear distortion, we adaptively partition the received symbols in constellation and design compact equalization networks for specific regions to reduce computation consumption. Experiments demonstrate that the partitioned equalizer can achieve the bit error rate below the 7% hard-decision forward error correction (HD-FEC) limit of 3.8 × 10-3 at 2.85 Gbps similar to the standard complex-valued network, yet with 56.1% total computational complexity reduction. This work paves the path for online data processing in high speed UVLC system.
Collapse
|
26
|
Abstract
Recurrent Neural Networks (RNNs) are known for their ability to learn relationships within temporal sequences. Gated Recurrent Unit (GRU) networks have found use in challenging time-dependent applications such as Natural Language Processing (NLP), financial analysis and sensor fusion due to their capability to cope with the vanishing gradient problem. GRUs are also known to be more computationally efficient than their variant, the Long Short-Term Memory neural network (LSTM), due to their less complex structure and as such, are more suitable for applications requiring more efficient management of computational resources. Many of such applications require a stronger mapping of their features to further enhance the prediction accuracy. A novel Quaternion Gated Recurrent Unit (QGRU) is proposed in this paper, which leverages the internal and external dependencies within the quaternion algebra to map correlations within and across multidimensional features. The QGRU can be used to efficiently capture the inter- and intra-dependencies within multidimensional features unlike the GRU, which only captures the dependencies within the sequence. Furthermore, the performance of the proposed method is evaluated on a sensor fusion problem involving navigation in Global Navigation Satellite System (GNSS) deprived environments as well as a human activity recognition problem. The results obtained show that the QGRU produces competitive results with almost 3.7 times fewer parameters compared to the GRU. The QGRU code is available at.
Collapse
|
27
|
Uykan Z. Shadow-Cuts Minimization/Maximization and Complex Hopfield Neural Networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:1096-1109. [PMID: 32310787 DOI: 10.1109/tnnls.2020.2980237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
In this article, we continue our very recent work by extending it to the complex case. Having been inspired by the real Hopfield neural network (HNN) results, our investigations here yield various novel results, some of which are as follows. First, extending the "biased pseudo-cut" concept to the complex HNN (CHNN) case, we introduce a "shadow-cut" that is defined as the sum of intercluster phased edges. Second, while the discrete-time real HNN strictly minimizes the "biased pseudo-cut" in each neuron state change, the CHNN "tends" to minimize the shadow-cut (as the CHNN energy function is minimized). Third, these definitions pose a novel L-phased graph clustering (partitioning) problem in which the sum of the shadow-cuts is minimized (or maximized) for the Hermitian complex and the directed graphs whose edges are (possibly arbitrary positive/negative) complex numbers. Finally, combining the CHNN and the pioneering algorithm GADIA of Babadi and Tarokh and their modified versions, we propose simple indirect algorithms to solve the defined shadow-cuts minimization/maximization problem. The proposed algorithms naturally include the CHNN as well as the GADIA as its special cases. The computer simulations confirm the findings.
Collapse
|
28
|
Chen C, Liu SJ, Wang ML, Zhang ZG, Xiao YL. Phase-shift determination for a 4 × 4 intelligent photonic neural network with compatible learning. APPLIED OPTICS 2021; 60:2100-2108. [PMID: 33690304 DOI: 10.1364/ao.417935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 02/03/2021] [Indexed: 06/12/2023]
Abstract
Intelligent photonic circuits (IPCs) tuned with an appropriate phase-shift vector could enable a photonic intelligent matrix possibly implemented in multiple neural layers for a task-oriented topologies. A photonic Mach-Zehnder Interferometer (MZI) is a fundamental photonic component in IPCs, whose matrix representation could be broadcasted into an arbitrary matrix that is equipped with an optimized phase-shift vector. The initialized MZIs' phases are tentatively probed between analytical elements and a digital weight matrix that is learned from samples with efficient compatible learning for complex-valued neural networks. Nonlinear least squares is utilized to formulate a phase determination system to refine the optimal phase-shift solutions. The robustness of phase determination system for photonic neural networks is discussed in detail. For a preliminary implementation, a basic 4×4 intelligent photonic neural network is utilized to verify the proof of concept on phase-shift determination in IPC through numerical experiments.
Collapse
|
29
|
Guo C, Poletti D. Scheme for automatic differentiation of complex loss functions with applications in quantum physics. Phys Rev E 2021; 103:013309. [PMID: 33601628 DOI: 10.1103/physreve.103.013309] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2020] [Accepted: 01/01/2021] [Indexed: 11/07/2022]
Abstract
The past few years have seen a significant transfer of tools from machine learning to solve quantum physics problems. Automatic differentiation is one standard algorithm used to efficiently compute gradients of loss functions for generic neural networks. In this work we show how to extend automatic differentiation to the case of complex loss function in a way that can be readily implemented in existing frameworks and which is compatible with the common case of real loss functions. We then combine this tool with matrix product states and apply it to compute the ground state and the steady state of a close and an open quantum system. Compared to the traditional density matrix renormalization group algorithm, complex automatic differentiation allows both straightforward GPU accelerations as well as generalizations to different types of tensor and neural networks.
Collapse
Affiliation(s)
- Chu Guo
- Key Laboratory of Low-Dimensional Quantum Structures and Quantum Control of Ministry of Education, Department of Physics and Synergetic Innovation Center for Quantum Effects and Applications, Hunan Normal University, Changsha 410081, China
| | - Dario Poletti
- Science and Math Cluster and Engineering Product Development, Singapore University of Technology and Design, 8 Somapah Road, 487372 Singapore
| |
Collapse
|
30
|
Zhang H, Gu M, Jiang XD, Thompson J, Cai H, Paesani S, Santagati R, Laing A, Zhang Y, Yung MH, Shi YZ, Muhammad FK, Lo GQ, Luo XS, Dong B, Kwong DL, Kwek LC, Liu AQ. An optical neural chip for implementing complex-valued neural network. Nat Commun 2021; 12:457. [PMID: 33469031 PMCID: PMC7815828 DOI: 10.1038/s41467-020-20719-7] [Citation(s) in RCA: 88] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Accepted: 12/14/2020] [Indexed: 01/29/2023] Open
Abstract
Complex-valued neural networks have many advantages over their real-valued counterparts. Conventional digital electronic computing platforms are incapable of executing truly complex-valued representations and operations. In contrast, optical computing platforms that encode information in both phase and magnitude can execute complex arithmetic by optical interference, offering significantly enhanced computational speed and energy efficiency. However, to date, most demonstrations of optical neural networks still only utilize conventional real-valued frameworks that are designed for digital computers, forfeiting many of the advantages of optical computing such as efficient complex-valued operations. In this article, we highlight an optical neural chip (ONC) that implements truly complex-valued neural networks. We benchmark the performance of our complex-valued ONC in four settings: simple Boolean tasks, species classification of an Iris dataset, classifying nonlinear datasets (Circle and Spiral), and handwriting recognition. Strong learning capabilities (i.e., high accuracy, fast convergence and the capability to construct nonlinear decision boundaries) are achieved by our complex-valued ONC compared to its real-valued counterpart.
Collapse
Affiliation(s)
- H Zhang
- Quantum Science and Engineering Centre (QSec), Nanyang Technological University, 50 Nanyang Ave, 639798, Singapore, Singapore
| | - M Gu
- Complexity Institute and School of Physical and Mathematical Sciences, Nanyang Technological University, 50 Nanyang Ave, 639798, Singapore, Singapore.
- Centre for Quantum Technologies, National University of Singapore, Block S15, 3 Science Drive 2, Singapore, 117543, Singapore.
| | - X D Jiang
- Quantum Science and Engineering Centre (QSec), Nanyang Technological University, 50 Nanyang Ave, 639798, Singapore, Singapore.
| | - J Thompson
- Centre for Quantum Technologies, National University of Singapore, Block S15, 3 Science Drive 2, Singapore, 117543, Singapore
| | - H Cai
- Institute of Microelectronics, A*STAR (Agency for Science, Technology and Research), 138634, Singapore, Singapore
| | - S Paesani
- Centre for Quantum Photonics, H. H. Wills Physics Laboratory and Department of Electrical and Electronic Engineering, University of Bristol, Merchant Venturers Building, Woodland Road, Bristol, BS8 1UB, UK
| | - R Santagati
- Centre for Quantum Photonics, H. H. Wills Physics Laboratory and Department of Electrical and Electronic Engineering, University of Bristol, Merchant Venturers Building, Woodland Road, Bristol, BS8 1UB, UK
| | - A Laing
- Centre for Quantum Photonics, H. H. Wills Physics Laboratory and Department of Electrical and Electronic Engineering, University of Bristol, Merchant Venturers Building, Woodland Road, Bristol, BS8 1UB, UK
| | - Y Zhang
- Quantum Science and Engineering Centre (QSec), Nanyang Technological University, 50 Nanyang Ave, 639798, Singapore, Singapore
- School of Mechanical & Aerospace Engineering, Nanyang Technological University, 50 Nanyang Ave, 639798, Singapore, Singapore
| | - M H Yung
- Institute for Quantum Science and Engineering, Department of Physics, Southern University of Science and Technology, Shenzhen, 518055, China
- Shenzhen Key Laboratory of Quantum Science and Engineering, Southern University of Science and Technology, Shenzhen, 518055, China
| | - Y Z Shi
- Quantum Science and Engineering Centre (QSec), Nanyang Technological University, 50 Nanyang Ave, 639798, Singapore, Singapore
| | - F K Muhammad
- Quantum Science and Engineering Centre (QSec), Nanyang Technological University, 50 Nanyang Ave, 639798, Singapore, Singapore
| | - G Q Lo
- Advanced Micro Foundry, 11 Science Park Road, 117685, Singapore, Singapore
| | - X S Luo
- Advanced Micro Foundry, 11 Science Park Road, 117685, Singapore, Singapore
| | - B Dong
- Advanced Micro Foundry, 11 Science Park Road, 117685, Singapore, Singapore
| | - D L Kwong
- Institute of Microelectronics, A*STAR (Agency for Science, Technology and Research), 138634, Singapore, Singapore
| | - L C Kwek
- Quantum Science and Engineering Centre (QSec), Nanyang Technological University, 50 Nanyang Ave, 639798, Singapore, Singapore.
- Centre for Quantum Technologies, National University of Singapore, Block S15, 3 Science Drive 2, Singapore, 117543, Singapore.
- National Institute of Education, 1 Nanyang Walk, 637616, Singapore, Singapore.
| | - A Q Liu
- Quantum Science and Engineering Centre (QSec), Nanyang Technological University, 50 Nanyang Ave, 639798, Singapore, Singapore.
| |
Collapse
|
31
|
Xiao J, Jia Y, Jiang X, Wang S. Circular Complex-Valued GMDH-Type Neural Network for Real-Valued Classification Problems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2020; 31:5285-5299. [PMID: 32078563 DOI: 10.1109/tnnls.2020.2966031] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Recently, applications of complex-valued neural networks (CVNNs) to real-valued classification problems have attracted significant attention. However, most existing CVNNs are black-box models with poor explanation performance. This study extends the real-valued group method of data handling (RGMDH)-type neural network to the complex field and constructs a circular complex-valued group method of data handling (C-CGMDH)-type neural network, which is a white-box model. First, a complex least squares method is proposed for parameter estimation. Second, a new complex-valued symmetric regularity criterion is constructed with a logarithmic function to represent explicitly the magnitude and phase of the actual and predicted complex output to evaluate and select the middle candidate models. Furthermore, the property of this new complex-valued external criterion is proven to be similar to that of the real external criterion. Before training this model, a circular transformation is used to transform the real-valued input features to the complex field. Twenty-five real-valued classification data sets from the UCI Machine Learning Repository are used to conduct the experiments. The results show that both RGMDH and C-CGMDH models can select the most important features from the complete feature space through a self-organizing modeling process. Compared with RGMDH, the C-CGMDH model converges faster and selects fewer features. Furthermore, its classification performance is statistically significantly better than the benchmark complex-valued and real-valued models. Regarding time complexity, the C-CGMDH model is comparable with other models in dealing with the data sets that have few features. Finally, we demonstrate that the GMDH-type neural network can be interpretable.
Collapse
|
32
|
Finite-time synchronization of complex-valued neural networks with finite-time distributed delays. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2019.01.114] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
33
|
Ganesan S, Gnaneswaran N. State-Feedback Filtering for Delayed Discrete-Time Complex-Valued Neural Networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2020; 31:4726-4736. [PMID: 31905153 DOI: 10.1109/tnnls.2019.2957304] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
This article explores a new filtering problem for the class of delayed discrete-time complex-valued neural networks (CVNNs) via state-feedback control design. The novelty of this article comes from the consideration of the newly developed complex-valued reciprocal convex matrix inequality as well as the complex-valued Jensen-based summation inequalities (JSIs). By employing an appropriate Lyapunov-Krasovskii functional (LKF) and by using newly proposed complex-valued inequalities, attention is concentrated on the design of a state-feedback filter such that the associated filtering error system is asymptotically stable with prescribed filter and control gain matrices. The proposed theoretical results are presented in terms of complex-valued linear matrix inequalities (LMIs) that can be solved numerically by using the YALMIP toolbox in MATLAB software. Additionally, one numerical example is given to confirm the validity of the resulting sufficient conditions with the availability of the suitable control and filter design.
Collapse
|
34
|
Zhang H, Zhang Y, Zhu S, Xu D. Deterministic convergence of complex mini-batch gradient learning algorithm for fully complex-valued neural networks. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2020.04.114] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
35
|
Xiao X, Jiang C, Lu H, Jin L, Liu D, Huang H, Pan Y. A parallel computing method based on zeroing neural networks for time-varying complex-valued matrix Moore-Penrose inversion. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.03.043] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
36
|
Adaptive complex-valued stepsize based fast learning of complex-valued neural networks. Neural Netw 2020; 124:233-242. [DOI: 10.1016/j.neunet.2020.01.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2019] [Revised: 12/05/2019] [Accepted: 01/14/2020] [Indexed: 11/23/2022]
|
37
|
An adaptive neuro-complex-fuzzy-inferential modeling mechanism for generating higher-order TSK models. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2019.07.042] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
38
|
|
39
|
Liu Y, Qin Y, Huang J, Huang T, Yang X. Finite-Time Synchronization of Complex-Valued Neural Networks with Multiple Time-Varying Delays and Infinite Distributed Delays. Neural Process Lett 2018. [DOI: 10.1007/s11063-018-9958-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
40
|
Hu S, Nagae S, Hirose A. Millimeter-wave Adaptive Glucose Concentration Estimation with Complex-Valued Neural Networks. IEEE Trans Biomed Eng 2018; 66:2065-2071. [PMID: 30489258 DOI: 10.1109/tbme.2018.2883085] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In this paper, we propose an adaptive glucose concentration estimation system. The system estimates glucose concentration values non-invasively by making full use of transmission magnitude and phase data. Debye relaxation model indicates that, in millimeter wave frequency range, we can acquire both a high sensitivity and a sufficient penetration depth. Based on the physical model, we choose 60-80 GHz frequency band millimeter wave. We build a single output-neuron complex-valued neural network (CVNN) for adaptive concentration estimation. Glucose water solution samples ranging from 0 to 300 mg/dL are measured. Transmission magnitude and phase data with teacher signals are fed to a CVNN for training and validation. The change in the glucose concentration presents a variation of both transmission magnitude and phase. The CVNN learns the relationship between the transmission data and the glucose concentrations. We find that the system shows a good generalization ability to estimate the concentration for unknown samples. It is effective in the estimation of the glucose concentration in the clinically practical range. Non-invasive methods usually suffer from instability in measurement condition. Our proposed method has the adaptability to different measurement conditions through the learning process based on a set of sample transmission magnitude and phase data with corresponding teacher signals.
Collapse
|
41
|
|
42
|
Wu R, Huang H, Qian X, Huang T. A L-BFGS Based Learning Algorithm for Complex-Valued Feedforward Neural Networks. Neural Process Lett 2018. [DOI: 10.1007/s11063-017-9692-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
43
|
|
44
|
Kobayashi M. Singularities of Three-Layered Complex-Valued Neural Networks With Split Activation Function. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018; 29:1900-1907. [PMID: 28422693 DOI: 10.1109/tnnls.2017.2688322] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
There are three important concepts related to learning processes in neural networks: reducibility, nonminimality, and singularity. Although the definitions of these three concepts differ, they are equivalent in real-valued neural networks. This is also true of complex-valued neural networks (CVNNs) with hidden neurons not employing biases. The situation of CVNNs with hidden neurons employing biases, however, is very complicated. Exceptional reducibility was found, and it was shown that reducibility and nonminimality are not the same. Irreducibility consists of minimality and exceptional reducibility. The relationship between minimality and singularity has not yet been established. In this paper, we describe our surprising finding that minimality and singularity are independent. We also provide several examples based on exceptional reducibility.
Collapse
|
45
|
The Global Exponential Stability of the Delayed Complex-Valued Neural Networks with Almost Periodic Coefficients and Discontinuous Activations. Neural Process Lett 2017. [DOI: 10.1007/s11063-017-9736-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
46
|
Exponential stability analysis for delayed complex-valued memristor-based recurrent neural networks. Neural Comput Appl 2017. [DOI: 10.1007/s00521-017-3166-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
47
|
Zhang Z, Liu X, Chen J, Guo R, Zhou S. Further stability analysis for delayed complex-valued recurrent neural networks. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2017.04.013] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
48
|
Adaptive land classification and new class generation by unsupervised double-stage learning in Poincare sphere space for polarimetric synthetic aperture radars. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2016.11.072] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
49
|
Wang XZ, Wei Y, Stanimirović PS. Complex Neural Network Models for Time-Varying Drazin Inverse. Neural Comput 2016; 28:2790-2824. [PMID: 27391685 DOI: 10.1162/neco_a_00866] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Two complex Zhang neural network (ZNN) models for computing the Drazin inverse of arbitrary time-varying complex square matrix are presented. The design of these neural networks is based on corresponding matrix-valued error functions arising from the limit representations of the Drazin inverse. Two types of activation functions, appropriate for handling complex matrices, are exploited to develop each of these networks. Theoretical results of convergence analysis are presented to show the desirable properties of the proposed complex-valued ZNN models. Numerical results further demonstrate the effectiveness of the proposed models.
Collapse
Affiliation(s)
- Xue-Zhong Wang
- School of Mathematical Sciences, Fudan University, Shanghai, 200433, P.R.C.
| | - Yimin Wei
- School of Mathematical Sciences and Key Laboratory of Mathematics for Nonlinear Sciences, Fudan University, Shanghai, 200433, P.R.C.
| | | |
Collapse
|
50
|
A complex-valued neural dynamical optimization approach and its stability analysis. Neural Netw 2015; 61:59-67. [DOI: 10.1016/j.neunet.2014.10.003] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2014] [Revised: 09/04/2014] [Accepted: 10/02/2014] [Indexed: 11/20/2022]
|