1
|
Fu Z, Liu Z, Ping S, Li W, Liu J. TRA-ACGAN: A motor bearing fault diagnosis model based on an auxiliary classifier generative adversarial network and transformer network. ISA TRANSACTIONS 2024; 149:381-393. [PMID: 38604873 DOI: 10.1016/j.isatra.2024.03.033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 03/26/2024] [Accepted: 03/27/2024] [Indexed: 04/13/2024]
Abstract
Motor bearing fault diagnosis is essential to guarantee production efficiency and avoid catastrophic accidents. Deep learning-based methods have been developed and widely used for fault diagnosis, and these methods have proven to be very effective in accurately diagnosing bearing faults. In this paper, study the application of generative adversarial networks (GANs) in motor bearing fault diagnosis to address the practical issue of insufficient fault data in industrial testing. Focus on the auxiliary classifier generative adversarial network (ACGAN), and the data expansion is carried out for small datasets. This paper present a novel transformer network and auxiliary classifier generative adversarial network (TRA-ACGAN) for motor bearing fault diagnosis, where the TRA-ACGAN combines an ACGAN with a transformer network to avoid the traditional iterative and convolutional structures. The attention mechanism is fully utilized to extract more effective features, and the dual-task coupling problem encountered in classical ACGANs is avoided. Experimental results with the CWRU dataset and the PU dataset in the field of motor bearing fault diagnosis demonstrate the suitability and superiority of the TRA-ACGAN.
Collapse
Affiliation(s)
- Zhaoyang Fu
- School of Automation, Northwestern Polytechnical University, Xi'an 710129, China.
| | - Zheng Liu
- Luoyang Power Supply Company, State Grid Corporation of China, Luoyang 471023, China
| | - Shuangrui Ping
- School of Automation, Northwestern Polytechnical University, Xi'an 710129, China
| | - Weilin Li
- School of Automation, Northwestern Polytechnical University, Xi'an 710129, China
| | - Jinglin Liu
- School of Automation, Northwestern Polytechnical University, Xi'an 710129, China
| |
Collapse
|
2
|
Li N, Ma L, Yu G, Xue B, Zhang M, Jin Y. Survey on Evolutionary Deep Learning: Principles, Algorithms, Applications, and Open Issues. ACM COMPUTING SURVEYS 2024; 56:1-34. [DOI: 10.1145/3603704] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Accepted: 05/31/2023] [Indexed: 01/04/2025]
Abstract
Over recent years, there has been a rapid development of deep learning (DL) in both industry and academia fields. However, finding the optimal hyperparameters of a DL model often needs high computational cost and human expertise. To mitigate the above issue, evolutionary computation (EC) as a powerful heuristic search approach has shown significant merits in the automated design of DL models, so-called evolutionary deep learning (EDL). This article aims to analyze EDL from the perspective of automated machine learning (AutoML). Specifically, we first illuminate EDL from DL and EC and regard EDL as an optimization problem. According to the DL pipeline, we systematically introduce EDL methods ranging from data preparation, model generation, to model deployment with a new taxonomy (i.e., what and how to evolve/optimize), and focus on the discussions of solution representation and search paradigm in handling the optimization problem by EC. Finally, key applications, open issues, and potentially promising lines of future research are suggested. This survey has reviewed recent developments of EDL and offers insightful guidelines for the development of EDL.
Collapse
Affiliation(s)
- Nan Li
- Northeastern University, China
| | | | - Guo Yu
- Nanjing Tech University, China
| | - Bing Xue
- Victoria University of Wellington, New Zealand
| | | | | |
Collapse
|
3
|
Quasi-Projective and Mittag-Leffler Synchronization of Discrete-Time Fractional-Order Complex-Valued Fuzzy Neural Networks. Neural Process Lett 2023. [DOI: 10.1007/s11063-023-11153-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
|
4
|
An Intuitionistic Fuzzy Random Vector Functional Link Classifier. Neural Process Lett 2022. [DOI: 10.1007/s11063-022-11043-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
5
|
A survey on dendritic neuron model: Mechanisms, algorithms and practical applications. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2021.08.153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
6
|
Li X, Fang JA, Huang T. Event-Triggered Exponential Stabilization for State-Based Switched Inertial Complex-Valued Neural Networks With Multiple Delays. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:4585-4595. [PMID: 33237870 DOI: 10.1109/tcyb.2020.3031379] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
This article explores the exponential stabilization issue of a class of state-based switched inertial complex-valued neural networks with multiple delays via event-triggered control. First, the state-based switched inertial complex-valued neural networks with multiple delays are modeled. Second, by separating the real and imaginary parts of complex values, the state-based switched inertial complex-valued neural networks are transformed into two state-based switched inertial real-valued neural networks. Through the variable substitution method, the model of the second-order inertial neural networks is transformed into a model of the first-order neural networks. Third, an event-triggered controller with the transmission sequence is designed to study the exponential stabilization issue of neural networks constructed above. Then, by constructing the Lyapunov functions and based on some inequalities, we obtain sufficient conditions for exponential stabilization of the proposed neural networks. Furthermore, it is proved that the Zeno phenomenon cannot happen under the designed event-triggered controller. Finally, a simulation example is given to illustrate the correctness of the results.
Collapse
|
7
|
Cheng H, Wang Z, Wei Z, Ma L, Liu X. On Adaptive Learning Framework for Deep Weighted Sparse Autoencoder: A Multiobjective Evolutionary Algorithm. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:3221-3231. [PMID: 32780708 DOI: 10.1109/tcyb.2020.3009582] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
In this article, an adaptive learning framework is established for a deep weighted sparse autoencoder (AE) by resorting to the multiobjective evolutionary algorithm (MOEA). The weighted sparsity is introduced to facilitate the design of the varying degrees of the sparsity constraints imposed on the hidden units of the AE. The MOEA is exploited to adaptively seek appropriate hyperparameters, where the divide-and-conquer strategy is implemented to enhance the MOEA's performance in the context of deep neural networks. Moreover, a sharing scheme is proposed to further reduce the time complexity of the learning process at the slight expense of the learning precision. It is shown via extensive experiments that the established adaptive learning framework is effective, where different sparse models are utilized to demonstrate the generality of the proposed results. Then, the generality of the proposed framework is examined on the convolutional AE and VGG-16 network. Finally, the developed framework is applied to the blind image quantity assessment that illustrates the applicability of the established algorithms.
Collapse
|
8
|
Liang J, Chen G, Qu B, Yue C, Yu K, Qiao K. Niche-based cooperative co-evolutionary ensemble neural network for classification. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107951] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
9
|
Ji J, Tang Y, Ma L, Li J, Lin Q, Tang Z, Todo Y. Accuracy Versus Simplification in an Approximate Logic Neural Model. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:5194-5207. [PMID: 33156795 DOI: 10.1109/tnnls.2020.3027298] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
An approximate logic neural model (ALNM) is a novel single-neuron model with plastic dendritic morphology. During the training process, the model can eliminate unnecessary synapses and useless branches of dendrites. It will produce a specific dendritic structure for a particular task. The simplified structure of ALNM can be substituted by a logic circuit classifier (LCC) without losing any essential information. The LCC merely consists of the comparator and logic NOT, AND, and OR gates. Thus, it can be easily implemented in hardware. However, the architecture of ALNM affects the learning capacity, generalization capability, computing time and approximation of LCC. Thus, a Pareto-based multiobjective differential evolution (MODE) algorithm is proposed to simultaneously optimize ALNM's topology and weights. MODE can generate a concise and accurate LCC for every specific task from ALNM. To verify the effectiveness of MODE, extensive experiments are performed on eight benchmark classification problems. The statistical results demonstrate that MODE is superior to conventional learning methods, such as the backpropagation algorithm and single-objective evolutionary algorithms. In addition, compared against several commonly used classifiers, both ALNM and LCC are capable of obtaining promising and competitive classification performances on the benchmark problems. Besides, the experimental results also verify that the LCC obtains the faster classification speed than the other classifiers.
Collapse
|
10
|
Li H, Zhang L. A Bilevel Learning Model and Algorithm for Self-Organizing Feed-Forward Neural Networks for Pattern Classification. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:4901-4915. [PMID: 33017295 DOI: 10.1109/tnnls.2020.3026114] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Conventional artificial neural network (ANN) learning algorithms for classification tasks, either derivative-based optimization algorithms or derivative-free optimization algorithms work by training ANN first (or training and validating ANN) and then testing ANN, which are a two-stage and one-pass learning mechanism. Thus, this learning mechanism may not guarantee the generalization ability of a trained ANN. In this article, a novel bilevel learning model is constructed for self-organizing feed-forward neural network (FFNN), in which the training and testing processes are integrated into a unified framework. In this bilevel model, the upper level optimization problem is built for testing error on testing data set and network architecture based on network complexity, whereas the lower level optimization problem is constructed for network weights based on training error on training data set. For the bilevel framework, an interactive learning algorithm is proposed to optimize the architecture and weights of an FFNN with consideration of both training error and testing error. In this interactive learning algorithm, a hybrid binary particle swarm optimization (BPSO) taken as an upper level optimizer is used to self-organize network architecture, whereas the Levenberg-Marquardt (LM) algorithm as a lower level optimizer is utilized to optimize the connection weights of an FFNN. The bilevel learning model and algorithm have been tested on 20 benchmark classification problems. Experimental results demonstrate that the bilevel learning algorithm can significantly produce more compact FFNNs with more excellent generalization ability when compared with conventional learning algorithms.
Collapse
|
11
|
Ünal HT, Başçiftçi F. Evolutionary design of neural network architectures: a review of three decades of research. Artif Intell Rev 2021. [DOI: 10.1007/s10462-021-10049-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
12
|
Singh P, Chaudhury S, Panigrahi BK. Hybrid MPSO-CNN: Multi-level Particle Swarm optimized hyperparameters of Convolutional Neural Network. SWARM AND EVOLUTIONARY COMPUTATION 2021; 63:100863. [DOI: 10.1016/j.swevo.2021.100863] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/19/2023]
|
13
|
|
14
|
Takagi T, Ogiri Y, Kato R, Kodama M, Yamanoi Y, Nishino W, Masakado Y, Watanabe M. Selective motor fascicle transfer and neural-machine interface: case report. J Neurosurg 2020; 132:825-831. [PMID: 30797219 DOI: 10.3171/2018.10.jns181865] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2018] [Accepted: 10/29/2018] [Indexed: 11/06/2022]
Abstract
An amputated nerve transferred to a nearby muscle produces a transcutaneously detectable electromyographic signal corresponding to the transferred nerve; this technique is known as targeted muscle reinnervation (TMR). There are 2 issues to overcome to improve this technique: the caliber and the selectivity of the transferred nerve. It is optimal to select and transfer each motor fascicle to achieve highly developed myoelectric arms with multiple degrees-of-freedom motion. The authors report on a case in which they first identified the remnant stumps of the amputated median and radial nerves and then identified the sensory fascicles using somatosensory evoked potentials. Each median nerve fascicle was transferred to the long head branch of the biceps or the brachialis branch, while the short head branch of the biceps was retained for elbow flexion. Each radial nerve fascicle was transferred to the medial or lateral head branch of the triceps, while the long head branch of the triceps was retained for elbow extension. Electrophysiological and functional tests were conducted in the reinnervated muscles. Functional and electrophysiological improvement was noted, with marked improvement in the identification rate for each digit, forearm, and elbow motion after the selective nerve transfers. The authors note that more selective nerve transfers may be required for the development of prostheses with multiple degrees of freedom.
Collapse
Affiliation(s)
- Takehiko Takagi
- 1Division of Orthopaedic Surgery, Department of Surgical Specialties, National Center for Child Health and Development, Setagaya-ku, Tokyo
- 2Department of Orthopaedic Surgery, Surgical Science, Tokai University School of Medicine, Isehara-shi, Kanagawa
| | - Yosuke Ogiri
- 3Graduate School of Engineering, Yokohama National University, Yokohama-shi, Kanagawa; and
| | - Ryu Kato
- 3Graduate School of Engineering, Yokohama National University, Yokohama-shi, Kanagawa; and
| | - Mitsuhiko Kodama
- 4Department of Rehabilitation Medicine, Tokai University School of Medicine, Isehara-shi, Kanagawa, Japan
| | - Yusuke Yamanoi
- 3Graduate School of Engineering, Yokohama National University, Yokohama-shi, Kanagawa; and
| | - Wataru Nishino
- 3Graduate School of Engineering, Yokohama National University, Yokohama-shi, Kanagawa; and
| | - Yoshihisa Masakado
- 4Department of Rehabilitation Medicine, Tokai University School of Medicine, Isehara-shi, Kanagawa, Japan
| | - Masahiko Watanabe
- 2Department of Orthopaedic Surgery, Surgical Science, Tokai University School of Medicine, Isehara-shi, Kanagawa
| |
Collapse
|
15
|
|
16
|
Han HG, Lu W, Hou Y, Qiao JF. An Adaptive-PSO-Based Self-Organizing RBF Neural Network. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018; 29:104-117. [PMID: 28113788 DOI: 10.1109/tnnls.2016.2616413] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
In this paper, a self-organizing radial basis function (SORBF) neural network is designed to improve both accuracy and parsimony with the aid of adaptive particle swarm optimization (APSO). In the proposed APSO algorithm, to avoid being trapped into local optimal values, a nonlinear regressive function is developed to adjust the inertia weight. Furthermore, the APSO algorithm can optimize both the network size and the parameters of an RBF neural network simultaneously. As a result, the proposed APSO-SORBF neural network can effectively generate a network model with a compact structure and high accuracy. Moreover, the analysis of convergence is given to guarantee the successful application of the APSO-SORBF neural network. Finally, multiple numerical examples are presented to illustrate the effectiveness of the proposed APSO-SORBF neural network. The results demonstrate that the proposed method is more competitive in solving nonlinear problems than some other existing SORBF neural networks.
Collapse
|
17
|
|
18
|
Che Y, Shu H, Liu Y. Exponential mean-square <mml:math altimg="si0001.gif" overflow="scroll" xmlns:xocs="http://www.elsevier.com/xml/xocs/dtd" xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.elsevier.com/xml/ja/dtd" xmlns:ja="http://www.elsevier.com/xml/ja/dtd" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:tb="http://www.elsevier.com/xml/common/table/dtd" xmlns:sb="http://www.elsevier.com/xml/common/struct-bib/dtd" xmlns:ce="http://www.elsevier.com/xml/common/dtd" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:cals="http://www.elsevier.com/xml/common/cals/dtd" xmlns:sa="http://www.elsevier.com/xml/common/struct-aff/dtd"><mml:msub><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mo>∞</mml:mo></mml:mrow></mml:msub></mml:math> filtering for arbitrarily switched neural networks with missing measurements. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2016.02.019] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
19
|
Hybrid evolutionary neuro-fuzzy approach based on mutual adaptation for human gesture recognition. Appl Soft Comput 2016. [DOI: 10.1016/j.asoc.2016.01.047] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
20
|
|
21
|
Kleftogiannis D, Theofilatos K, Likothanassis S, Mavroudi S. YamiPred: A Novel Evolutionary Method for Predicting Pre-miRNAs and Selecting Relevant Features. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015; 12:1183-1192. [PMID: 26451829 DOI: 10.1109/tcbb.2014.2388227] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
MicroRNAs (miRNAs) are small non-coding RNAs, which play a significant role in gene regulation. Predicting miRNA genes is a challenging bioinformatics problem and existing experimental and computational methods fail to deal with it effectively. We developed YamiPred, an embedded classification method that combines the efficiency and robustness of support vector machines (SVM) with genetic algorithms (GA) for feature selection and parameters optimization. YamiPred was tested in a new and realistic human dataset and was compared with state-of-the-art computational intelligence approaches and the prevalent SVM-based tools for miRNA prediction. Experimental results indicate that YamiPred outperforms existing approaches in terms of accuracy and of geometric mean of sensitivity and specificity. The embedded feature selection component selects a compact feature subset that contributes to the performance optimization. Further experimentation with this minimal feature subset has achieved very high classification performance and revealed the minimum number of samples required for developing a robust predictor. YamiPred also confirmed the important role of commonly used features such as entropy and enthalpy, and uncovered the significance of newly introduced features, such as %A-U aggregate nucleotide frequency and positional entropy. The best model trained on human data has successfully predicted pre-miRNAs to other organisms including the category of viruses.
Collapse
|
22
|
Tan SC, Watada J, Ibrahim Z, Khalid M. Evolutionary fuzzy ARTMAP neural networks for classification of semiconductor defects. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2015; 26:933-950. [PMID: 25014967 DOI: 10.1109/tnnls.2014.2329097] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Wafer defect detection using an intelligent system is an approach of quality improvement in semiconductor manufacturing that aims to enhance its process stability, increase production capacity, and improve yields. Occasionally, only few records that indicate defective units are available and they are classified as a minority group in a large database. Such a situation leads to an imbalanced data set problem, wherein it engenders a great challenge to deal with by applying machine-learning techniques for obtaining effective solution. In addition, the database may comprise overlapping samples of different classes. This paper introduces two models of evolutionary fuzzy ARTMAP (FAM) neural networks to deal with the imbalanced data set problems in a semiconductor manufacturing operations. In particular, both the FAM models and hybrid genetic algorithms are integrated in the proposed evolutionary artificial neural networks (EANNs) to classify an imbalanced data set. In addition, one of the proposed EANNs incorporates a facility to learn overlapping samples of different classes from the imbalanced data environment. The classification results of the proposed evolutionary FAM neural networks are presented, compared, and analyzed using several classification metrics. The outcomes positively indicate the effectiveness of the proposed networks in handling classification problems with imbalanced data sets.
Collapse
|
23
|
SAR image segmentation based on quantum-inspired multiobjective evolutionary clustering algorithm. INFORM PROCESS LETT 2014. [DOI: 10.1016/j.ipl.2013.12.010] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
24
|
Lu TC, Yu GR, Juang JC. Quantum-based algorithm for optimizing artificial neural networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2013; 24:1266-1278. [PMID: 24808566 DOI: 10.1109/tnnls.2013.2249089] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
This paper presents a quantum-based algorithm for evolving artificial neural networks (ANNs). The aim is to design an ANN with few connections and high classification performance by simultaneously optimizing the network structure and the connection weights. Unlike most previous studies, the proposed algorithm uses quantum bit representation to codify the network. As a result, the connectivity bits do not indicate the actual links but the probability of the existence of the connections, thus alleviating mapping problems and reducing the risk of throwing away a potential candidate. In addition, in the proposed model, each weight space is decomposed into subspaces in terms of quantum bits. Thus, the algorithm performs a region by region exploration, and evolves gradually to find promising subspaces for further exploitation. This is helpful to provide a set of appropriate weights when evolving the network structure and to alleviate the noisy fitness evaluation problem. The proposed model is tested on four benchmark problems, namely breast cancer and iris, heart, and diabetes problems. The experimental results show that the proposed algorithm can produce compact ANN structures with good generalization ability compared to other algorithms.
Collapse
|
25
|
Han HG, Wang LD, Qiao JF. Efficient self-organizing multilayer neural network for nonlinear system modeling. Neural Netw 2013; 43:22-32. [DOI: 10.1016/j.neunet.2013.01.015] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2012] [Revised: 01/27/2013] [Accepted: 01/27/2013] [Indexed: 11/27/2022]
|
26
|
van den Dries S, Wiering MA. Neural-fitted TD-leaf learning for playing Othello with structured neural networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2012; 23:1701-1713. [PMID: 24808066 DOI: 10.1109/tnnls.2012.2210559] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
This paper describes a methodology for quickly learning to play games at a strong level. The methodology consists of a novel combination of three techniques, and a variety of experiments on the game of Othello demonstrates their usefulness. First, structures or topologies in neural network connectivity patterns are used to decrease the number of learning parameters and to deal more effectively with the structural credit assignment problem, which is to change individual network weights based on the obtained feedback. Furthermore, the structured neural networks are trained with the novel neural-fitted temporal difference (TD) learning algorithm to create a system that can exploit most of the training experiences and enhance learning speed and performance. Finally, we use the neural-fitted TD-leaf algorithm to learn more effectively when look-ahead search is performed by the game-playing program. Our extensive experimental study clearly indicates that the proposed method outperforms linear networks and fully connected neural networks or evaluation functions evolved with evolutionary algorithms.
Collapse
|
27
|
Performance evaluation of multilayer perceptrons for discriminating and quantifying multiple kinds of odors with an electronic nose. Neural Netw 2012; 33:204-15. [PMID: 22717447 DOI: 10.1016/j.neunet.2012.05.009] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2012] [Revised: 05/14/2012] [Accepted: 05/23/2012] [Indexed: 11/21/2022]
Abstract
This paper studies several types and arrangements of perceptron modules to discriminate and quantify multiple odors with an electronic nose. We evaluate the following types of multilayer perceptron. (A) A single multi-output (SMO) perceptron both for discrimination and for quantification. (B) An SMO perceptron for discrimination followed by multiple multi-output (MMO) perceptrons for quantification. (C) An SMO perceptron for discrimination followed by multiple single-output (MSO) perceptrons for quantification. (D) MSO perceptrons for discrimination followed by MSO perceptrons for quantification, called the MSO-MSO perceptron model, under the following conditions: (D1) using a simple one-against-all (OAA) decomposition method; (D2) adopting a simple OAA decomposition method and virtual balance step; and (D3) employing a local OAA decomposition method, virtual balance step and local generalization strategy all together. The experimental results for 12 kinds of volatile organic compounds at 85 concentration levels in the training set and 155 concentration levels in the test set show that the MSO-MSO perceptron model with the D3 learning procedure is the most effective of those tested for discrimination and quantification of many kinds of odors.
Collapse
|
28
|
Alkhasawne MS, Ngah UKB, Tien TL, Isa NABM. Landslide Susceptibility Hazard Mapping Techniques Review. ACTA ACUST UNITED AC 2012. [DOI: 10.3923/jas.2012.802.808] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|