1
|
Peralez-González C, Pérez-Rodríguez J, Durán-Rosal AM. Boosting ridge for the extreme learning machine globally optimised for classification and regression problems. Sci Rep 2023; 13:11809. [PMID: 37479841 PMCID: PMC10362034 DOI: 10.1038/s41598-023-38948-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 07/18/2023] [Indexed: 07/23/2023] Open
Abstract
This paper explores the boosting ridge (BR) framework in the extreme learning machine (ELM) community and presents a novel model that trains the base learners as a global ensemble. In the context of Extreme Learning Machine single-hidden-layer networks, the nodes in the hidden layer are preconfigured before training, and the optimisation is performed on the weights in the output layer. The previous implementation of the BR ensemble with ELM (BRELM) as base learners fix the nodes in the hidden layer for all the ELMs. The ensemble learning method generates different output layer coefficients by reducing the residual error of the ensemble sequentially as more base learners are added to the ensemble. As in other ensemble methodologies, base learners are selected until fulfilling ensemble criteria such as size or performance. This paper proposes a global learning method in the BR framework, where base learners are not added step by step, but all are calculated in a single step looking for ensemble performance. This method considers (i) the configurations of the hidden layer are different for each base learner, (ii) the base learners are optimised all at once, not sequentially, thus avoiding saturation, and (iii) the ensemble methodology does not have the disadvantage of working with strong classifiers. Various regression and classification benchmark datasets have been selected to compare this method with the original BRELM implementation and other state-of-the-art algorithms. Particularly, 71 datasets for classification and 52 for regression, have been considered using different metrics and analysing different characteristics of the datasets, such as the size, the number of classes or the imbalanced nature of them. Statistical tests indicate the superiority of the proposed method in both regression and classification problems in all experimental scenarios.
Collapse
|
2
|
de Oliveira JFL, Silva EG, de Mattos Neto PSG. A Hybrid System Based on Dynamic Selection for Time Series Forecasting. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:3251-3263. [PMID: 33513115 DOI: 10.1109/tnnls.2021.3051384] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Hybrid systems, which combine statistical and machine learning (ML) techniques using residual (error forecasting) modeling, have been highlighted in the literature due to their accuracy and ability to forecast time series with different characteristics. In these architectures, a crucial task is the proper modeling of the residuals since they may present random fluctuations, complex nonlinear patterns, and heteroscedastic behavior. Hence, the selection, specification, and training of one ML model to forecast the residuals are costly and challenging tasks since issues, such as underfitting, overfitting, and misspecification, can lead to a system with low accuracy or even deteriorate the linear forecast of the time series. This article proposes a hybrid system, named dynamic residual forecasting (DReF), that employs a modified dynamic selection (DS) algorithm to decide: the most suitable ML model to forecast a pattern of the residual series and if it is a promising candidate to increase the accuracy of the time series forecast from the linear combination. Thus, the DReF aims to reduce the uncertainty of the ML model selection and avoid the deterioration of the time series forecast. Furthermore, the proposed system searches for the most suitable parameters of the DS algorithm for each data set. In this article, the proposed method uses a pool of five ML models widely adopted in the literature: multilayer perceptron, support vector regression, radial basis function, long short-term memory, and convolutional neural network. An experimental evaluation was conducted using ten well-known time series. The results show that the DReF obtains superior results for the majority of the data sets compared with single and hybrid models of the literature.
Collapse
|
3
|
A stock price prediction method based on meta-learning and variational mode decomposition. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109324] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
4
|
Chen J, Cao F, Gao P. A Fetal ECG Extraction Method Based on ELM Optimized by Improved PSO Algorithm. Crit Rev Biomed Eng 2022; 50:35-47. [PMID: 36374955 DOI: 10.1615/critrevbiomedeng.2022044778] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
The extraction of fetal electrocardiogram (FECG) is of great significance for perinatal fetal monitoring. In order to improve the prediction accuracy of FECG, a FECG extraction method based on extreme learning machine (ELM) optimized by an improved particle swarm optimization (IPSO) was proposed (IPSO-ELM). First, according to the characteristics of the mixed signal on the maternal abdominal wall, and based on the global search ability of IPSO, the initial weight matrix and hidden layer bias of ELM were optimized to match with the mixed signal of the maternal abdominal wall and the network topology. Then, an ELM model was established using the optimal network parameters obtained by IPSO. The nonlinear transformation of the maternal ECG (MECG) signal to the abdominal wall was estimated by the IPSO-ELM network. Finally, the non-linearly transformed MECG signal was mixed with the abdominal wall subtract to obtain clear FECG. The experimental results of clinical ECG signals in DaISy dataset showed that, compared with the traditional normalized minimum mean square error, support vector machine method, and ELM neural network methods, the proposed method can extract clearer FECG signals and improve the signal-to-noise ratio of extracted FECG.
Collapse
Affiliation(s)
- Jiqin Chen
- All-Purpose Quality Education College, Wuchang University of Technology, Wuhan, 430065, China
| | - Fenglin Cao
- All-Purpose Quality Education College, Wuchang University of Technology, Wuhan, 430065, China
| | - Ping Gao
- All-Purpose Quality Education College, Wuchang University of Technology, Wuhan, 430065, China
| |
Collapse
|
5
|
An introduction of a reward-based time-series forecasting model and its application in predicting the dynamic and complicated behavior of the Earth rotation (Delta-T values). Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107920] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
6
|
CDA-LSTM: an evolutionary convolution-based dual-attention LSTM for univariate time series prediction. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-06212-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
7
|
Piccialli F, Giampaolo F, Salvi A, Cuomo S. A robust ensemble technique in forecasting workload of local healthcare departments. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2020.02.138] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
8
|
Zhang S, Chen Y, Zhang W, Feng R. A novel ensemble deep learning model with dynamic error correction and multi-objective ensemble pruning for time series forecasting. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2020.08.053] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
9
|
EDense: a convolutional neural network with ELM-based dense connections. Neural Comput Appl 2020. [DOI: 10.1007/s00521-020-05181-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
10
|
|
11
|
DTW-NN: A novel neural network for time series recognition using dynamic alignment between inputs and weights. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2019.104971] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
12
|
Liu Y, Yang C, Huang K, Gui W. Non-ferrous metals price forecasting based on variational mode decomposition and LSTM network. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2019.105006] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
13
|
Bahrami M, Sajedi H. Image concept detection in imbalanced datasets with ensemble of convolutional neural networks. INTELL DATA ANAL 2019. [DOI: 10.3233/ida-184327] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
14
|
CLR-based deep convolutional spiking neural network with validation based stopping for time series classification. APPL INTELL 2019. [DOI: 10.1007/s10489-019-01552-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
15
|
Fernández JC, Carbonero M, Gutiérrez PA, Hervás-Martínez C. Multi-objective evolutionary optimization using the relationship between F1 and accuracy metrics in classification tasks. APPL INTELL 2019. [DOI: 10.1007/s10489-019-01447-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
16
|
An intelligent hybridization of ARIMA with machine learning models for time series forecasting. Knowl Based Syst 2019. [DOI: 10.1016/j.knosys.2019.03.011] [Citation(s) in RCA: 70] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
|
17
|
Soheily-Khah S, Marteau PF. Sparsification of the alignment path search space in dynamic time warping. Appl Soft Comput 2019. [DOI: 10.1016/j.asoc.2019.03.009] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
18
|
Hybrid generative discriminative approaches based on Multinomial Scaled Dirichlet mixture models. APPL INTELL 2019. [DOI: 10.1007/s10489-019-01437-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
19
|
Zhang L, Yang H, Jiang Z. Imbalanced biomedical data classification using self-adaptive multilayer ELM combined with dynamic GAN. Biomed Eng Online 2018; 17:181. [PMID: 30514298 PMCID: PMC6280414 DOI: 10.1186/s12938-018-0604-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2018] [Accepted: 11/10/2018] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Imbalanced data classification is an inevitable problem in medical intelligent diagnosis. Most of real-world biomedical datasets are usually along with limited samples and high-dimensional feature. This seriously affects the classification performance of the model and causes erroneous guidance for the diagnosis of diseases. Exploring an effective classification method for imbalanced and limited biomedical dataset is a challenging task. METHODS In this paper, we propose a novel multilayer extreme learning machine (ELM) classification model combined with dynamic generative adversarial net (GAN) to tackle limited and imbalanced biomedical data. Firstly, principal component analysis is utilized to remove irrelevant and redundant features. Meanwhile, more meaningful pathological features are extracted. After that, dynamic GAN is designed to generate the realistic-looking minority class samples, thereby balancing the class distribution and avoiding overfitting effectively. Finally, a self-adaptive multilayer ELM is proposed to classify the balanced dataset. The analytic expression for the numbers of hidden layer and node is determined by quantitatively establishing the relationship between the change of imbalance ratio and the hyper-parameters of the model. Reducing interactive parameters adjustment makes the classification model more robust. RESULTS To evaluate the classification performance of the proposed method, numerical experiments are conducted on four real-world biomedical datasets. The proposed method can generate authentic minority class samples and self-adaptively select the optimal parameters of learning model. By comparing with W-ELM, SMOTE-ELM, and H-ELM methods, the quantitative experimental results demonstrate that our method can achieve better classification performance and higher computational efficiency in terms of ROC, AUC, G-mean, and F-measure metrics. CONCLUSIONS Our study provides an effective solution for imbalanced biomedical data classification under the condition of limited samples and high-dimensional feature. The proposed method could offer a theoretical basis for computer-aided diagnosis. It has the potential to be applied in biomedical clinical practice.
Collapse
Affiliation(s)
- Liyuan Zhang
- School of Computer Science and Technology, Medical Imaging Engineering Laboratory, Changchun University of Science and Technology, No.7089, Weixing Road, Changchun, China
| | - Huamin Yang
- School of Computer Science and Technology, Medical Imaging Engineering Laboratory, Changchun University of Science and Technology, No.7089, Weixing Road, Changchun, China.
| | - Zhengang Jiang
- School of Computer Science and Technology, Medical Imaging Engineering Laboratory, Changchun University of Science and Technology, No.7089, Weixing Road, Changchun, China
| |
Collapse
|
20
|
|
21
|
Li Y, Jiang P, She Q, Lin G. Research on air pollutant concentration prediction method based on self-adaptive neuro-fuzzy weighted extreme learning machine. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2018; 241:1115-1127. [PMID: 30029320 DOI: 10.1016/j.envpol.2018.05.072] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/09/2018] [Revised: 05/03/2018] [Accepted: 05/21/2018] [Indexed: 06/08/2023]
Abstract
In order to improve the prediction accuracy and real-time of the air pollutant concentration prediction, this paper proposes self-adaptive neuro-fuzzy weighted extreme learning machine (ANFIS-WELM) based on the weighted extreme learning machine (WELM) and the adaptive neuro-fuzzy inference system (ANFIS) combined air pollutant concentration prediction method. Firstly, Gaussian membership function parameters are selected to fuzzify the input values and calculate the membership degree of each input variable. Secondly, corresponding fuzzy rules are activated, and the firing strength is normalized to calculate the output matrix of hidden nodes. Then, the optimal parameters (C, M), weights are assigned to weighted ELM by using locally weighted linear regression, and the regularized WELM target formula with equality constraint is optimized by the Karush-Kuhn-Tucker (KKT) conditions, the output weight matrix is calculated, and finally the prediction output matrix is calculated. Based on the air pollutant concentration data collected in Datong, Taiwan, the data on the pollutants containing carbon monoxide (CO), nitric oxide (NO), PM2.5 (particulate matter) and PM10, are selected by different historical time series lengths, using genetic algorithm-backpropagation neural network (GA-BPNN), support vector regression (SVR), extreme learning machine (ELM), WELM, ANFIS, regularized extreme learning adaptive neuro-fuzzy inference system (R-ELANFIS) and ANFIS-WELM are built for predict the concentration of each pollutant collected by single monitoring point in single-step time series. The experimental results show that the ANFIS-WELM presented in this paper has better prediction accuracy and real-time performance, realizes the prediction of multi-step time series on the basis of the ANFIS-WELM, and realizes the engineering application of the ANFIS-WELM algorithm package on the self-developed mobile source emissions online monitoring data center software system.
Collapse
Affiliation(s)
- Yongan Li
- College of Automation, Hangzhou Dianzi University, Hangzhou, 310018, China.
| | - Peng Jiang
- College of Automation, Hangzhou Dianzi University, Hangzhou, 310018, China.
| | - Qingshan She
- College of Automation, Hangzhou Dianzi University, Hangzhou, 310018, China.
| | - Guang Lin
- Zhejiang Province Environmental Monitoring Center, China.
| |
Collapse
|
22
|
|
23
|
Li C, Zhu Z. Research and application of a novel hybrid air quality early-warning system: A case study in China. THE SCIENCE OF THE TOTAL ENVIRONMENT 2018; 626:1421-1438. [PMID: 29898549 DOI: 10.1016/j.scitotenv.2018.01.195] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Revised: 01/10/2018] [Accepted: 01/19/2018] [Indexed: 06/08/2023]
Abstract
As one of the most serious meteorological disasters in modern society, air pollution has received extensive attention from both citizens and decision-makers. With the complexity of pollution components and the uncertainty of prediction, it is both critical and challenging to construct an effective and practical early-warning system. In this paper, a novel hybrid air quality early-warning system for pollution contaminant monitoring and analysis was proposed. To improve the efficiency of the system, an advanced attribute selection method based on fuzzy evaluation and rough set theory was developed to select the main pollution contaminants for cities. Moreover, a hybrid model composed of the theory of "decomposition and ensemble", an extreme learning machine and an advanced heuristic algorithm was developed for pollution contaminant prediction; it provides deterministic and interval forecasting for tackling the uncertainty of future air quality. Daily pollution contaminants of six major cities in China were selected as a dataset to evaluate the practicality and effectiveness of the developed air quality early-warning system. The superior experimental performance determined by the values of several error indexes illustrated that the proposed early-warning system was of great effectiveness and efficiency.
Collapse
Affiliation(s)
- Chen Li
- School of Statistics, Dongbei University of Finance and Economics, No. 217, Jianshan Road, Shahekou District, Dalian, Liaoning Province 116025, China
| | - Zhijie Zhu
- School of Statistics, Dongbei University of Finance and Economics, No. 217, Jianshan Road, Shahekou District, Dalian, Liaoning Province 116025, China.
| |
Collapse
|