1
|
Song S, Wang Q, Zou X, Li Z, Ma Z, Jiang D, Fu Y, Liu Q. High-precision prediction of blood glucose concentration utilizing Fourier transform Raman spectroscopy and an ensemble machine learning algorithm. Spectrochim Acta A Mol Biomol Spectrosc 2023; 303:123176. [PMID: 37494812 DOI: 10.1016/j.saa.2023.123176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 07/17/2023] [Accepted: 07/19/2023] [Indexed: 07/28/2023]
Abstract
Raman spectroscopy has gained popularity in analyzing blood glucose levels due to its non-invasive identification and minimal interference from water. However, the challenge lies in how to accurately predict blood glucose concentrations in human blood using Raman spectroscopy. This paper researches a novel integrated machine learning algorithm called Bagging-ABC-ELM. The optimal input weights and biases of extreme learning machine (ELM) model are obtained by artificial bee colony (ABC) algorithm. The bagging algorithm is used to obtain a better the stability of the model and higher performance than ELM algorithm. The results show that the mean value of coefficient of determination is 0.9928, and root mean square error is 0.1928. Compared to other regression models, the Bagging-ABC-ELM model exhibited superior prediction accuracy, robustness, and generalization capability. The Bagging-ABC-ELM model presents a promising alternative for analyzing blood glucose levels in clinical and research settings.
Collapse
Affiliation(s)
- Shuai Song
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Qiaoyun Wang
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China; Hebei Key Laboratory of Micro-Nano Precision Optical Sensing and Measurement Technology, Qinhuangdao 066004, China.
| | - Xin Zou
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Zhigang Li
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Zhenhe Ma
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Daying Jiang
- Zhongyou BSS (Qinhuangdao) Petropipe Company Limited, Qinhuangdao 066004, China
| | - YongQing Fu
- Faculty of Engineering and Environment, Northumbria University, Newcastle upon Tyne NE1 8ST, UK
| | - Qiang Liu
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China; Hebei Key Laboratory of Micro-Nano Precision Optical Sensing and Measurement Technology, Qinhuangdao 066004, China
| |
Collapse
|
2
|
Nahiduzzaman M, Faruq Goni MO, Robiul Islam M, Sayeed A, Shamim Anower M, Ahsan M, Haider J, Kowalski M. Detection of various lung diseases including COVID-19 using extreme learning machine algorithm based on the features extracted from a lightweight CNN architecture. Biocybern Biomed Eng 2023; 43:S0208-5216(23)00037-2. [PMID: 38620111 PMCID: PMC10292668 DOI: 10.1016/j.bbe.2023.06.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2022] [Revised: 04/04/2023] [Accepted: 06/16/2023] [Indexed: 11/09/2023]
Abstract
Around the world, several lung diseases such as pneumonia, cardiomegaly, and tuberculosis (TB) contribute to severe illness, hospitalization or even death, particularly for elderly and medically vulnerable patients. In the last few decades, several new types of lung-related diseases have taken the lives of millions of people, and COVID-19 has taken almost 6.27 million lives. To fight against lung diseases, timely and correct diagnosis with appropriate treatment is crucial in the current COVID-19 pandemic. In this study, an intelligent recognition system for seven lung diseases has been proposed based on machine learning (ML) techniques to aid the medical experts. Chest X-ray (CXR) images of lung diseases were collected from several publicly available databases. A lightweight convolutional neural network (CNN) has been used to extract characteristic features from the raw pixel values of the CXR images. The best feature subset has been identified using the Pearson Correlation Coefficient (PCC). Finally, the extreme learning machine (ELM) has been used to perform the classification task to assist faster learning and reduced computational complexity. The proposed CNN-PCC-ELM model achieved an accuracy of 96.22% with an Area Under Curve (AUC) of 99.48% for eight class classification. The outcomes from the proposed model demonstrated better performance than the existing state-of-the-art (SOTA) models in the case of COVID-19, pneumonia, and tuberculosis detection in both binary and multiclass classifications. For eight class classification, the proposed model achieved precision, recall and fi-score and ROC are 100%, 99%, 100% and 99.99% respectively for COVID-19 detection demonstrating its robustness. Therefore, the proposed model has overshadowed the existing pioneering models to accurately differentiate COVID-19 from the other lung diseases that can assist the medical physicians in treating the patient effectively.
Collapse
Affiliation(s)
- Md Nahiduzzaman
- Department of Electrical & Computer Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh
| | - Md Omaer Faruq Goni
- Department of Electrical & Computer Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh
| | - Md Robiul Islam
- Department of Electrical & Computer Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh
| | - Abu Sayeed
- Department of Computer Science & Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh
| | - Md Shamim Anower
- Department of Electrical & Electronic Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh
| | - Mominul Ahsan
- Department of Computer Science, University of York, Deramore Lane, Heslington, York YO10 5GH, UK
| | - Julfikar Haider
- Department of Engineering, Manchester Metropolitan University, Chester St, Manchester M1 5GD, UK
| | - Marcin Kowalski
- Institute of Optoelectronics, Military University of Technology, Gen. S. Kaliskiego 2, Warsaw, Poland
| |
Collapse
|
3
|
Novykov V, Bilson C, Gepp A, Harris G, Vanstone BJ. Empirical validation of ELM trained neural networks for financial modelling. Neural Comput Appl 2023; 35:1581-1605. [PMID: 36212216 PMCID: PMC9525949 DOI: 10.1007/s00521-022-07792-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Accepted: 09/06/2022] [Indexed: 01/12/2023]
Abstract
The purpose of this work is to compare predictive performance of neural networks trained using the relatively novel technique of training single hidden layer feedforward neural networks (SFNN), called Extreme Learning Machine (ELM), with commonly used backpropagation-trained recurrent neural networks (RNN) as applied to the task of financial market prediction. Evaluated on a set of large capitalisation stocks on the Australian market, specifically the components of the ASX20, ELM-trained SFNNs showed superior performance over RNNs for individual stock price prediction. While this conclusion of efficacy holds generally, long short-term memory (LSTM) RNNs were found to outperform for a small subset of stocks. Subsequent analysis identified several areas of performance deviations which we highlight as potentially fruitful areas for further research and performance improvement.
Collapse
Affiliation(s)
| | | | - Adrian Gepp
- Bond Business School, Bond University, Gold Coast, QLD Australia
| | - Geoff Harris
- Bond Business School, Bond University, Gold Coast, QLD Australia
| | - Bruce James Vanstone
- Bond Business School, Bond University, Gold Coast, QLD Australia ,Bangor Business School, Bangor University, Wales, UK
| |
Collapse
|
4
|
Cheng J, Sun J, Yao K, Xu M, Wang S, Fu L. Development of multi-disturbance bagging Extreme Learning Machine method for cadmium content prediction of rape leaf using hyperspectral imaging technology. Spectrochim Acta A Mol Biomol Spectrosc 2022; 279:121479. [PMID: 35696971 DOI: 10.1016/j.saa.2022.121479] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 05/19/2022] [Accepted: 06/04/2022] [Indexed: 06/15/2023]
Abstract
Exploring the cadmium (Cd) pollution in rape is of great significance to food safety and consumer health. In this study, a rapid, nondestructive and accurate method for the determination of Cd content in rape leaves based on hyperspectral imaging (HSI) technology was proposed. The spectral data of rape leaves under different Cd stress from 431 nm to 962 nm were collected by visible-near infrared HSI spectrometer. In order to improve the robustness and accuracy of the regression model, a machine learning algorithm was proposed, named multi-disturbance bagging Extreme Learning Machine (MdbaggingELM). The prediction models of Cd content in rape leaves based on MdbaggingELM and ELM-based method (ELM and baggingELM) were established and compared. The results showed that the model of the proposed MdbaggingELM method performed significantly in the prediction of Cd content, with Rp2 of 0.9830 and RMSEP of 2.8963 mg/kg. The results confirmed that MdbaggingELM is an efficient regression algorithm, which played a positive role in enhancing the stability and the prediction effect of the model. The combination of MdbaggingELM and HSI technology has great potential in the detection of Cd content in rape leaves.
Collapse
Affiliation(s)
- Jiehong Cheng
- School of Electrical and Information Engineering of Jiangsu University, Zhenjiang 212013, China
| | - Jun Sun
- School of Electrical and Information Engineering of Jiangsu University, Zhenjiang 212013, China
| | - Kunshan Yao
- School of Electrical and Information Engineering of Jiangsu University, Zhenjiang 212013, China
| | - Min Xu
- School of Electrical and Information Engineering of Jiangsu University, Zhenjiang 212013, China
| | - Simin Wang
- School of Electrical and Information Engineering of Jiangsu University, Zhenjiang 212013, China
| | - Lvhui Fu
- School of Electrical and Information Engineering of Jiangsu University, Zhenjiang 212013, China
| |
Collapse
|
5
|
Das S, Sahu TP, Janghel RR, Sahu BK. Effective forecasting of stock market price by using extreme learning machine optimized by PSO-based group oriented crow search algorithm. Neural Comput Appl 2021;:1-37. [PMID: 34413575 DOI: 10.1007/s00521-021-06403-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 07/31/2021] [Indexed: 11/13/2022]
Abstract
Stock index price forecasting is the influential indicator for investors and financial investigators by which decision making capability to achieve maximum benefit with minimum risk can be improved. So, a robust engine with capability to administer useful information is desired to achieve the success. The forecasting effectiveness of stock market is improved in this paper by integrating a modified crow search algorithm (CSA) and extreme learning machine (ELM). The effectiveness of proposed modified CSA entitled as Particle Swarm Optimization (PSO)-based Group oriented CSA (PGCSA) to outperform other existing algorithms is observed by solving 12 benchmark problems. PGCSA algorithm is used to achieve relevant weights and biases of ELM to improve the effectiveness of conventional ELM. The impact of hybrid PGCSA ELM model to predict next day closing price of seven different stock indices is observed by using performance measures, technical indicators and hypothesis test (paired t-test). The seven stock indices are considered by incorporating data during COVID-19 outbreak. This model is tested by comparing with existing techniques proposed in published works. The simulation results provide that PGCSA ELM model can be considered as a suitable tool to predict next day closing price.
Collapse
|
6
|
Ren LR, Gao YL, Liu JX, Zhu R, Kong XZ. L 2,1- Extreme Learning Machine: An Efficient Robust Classifier for Tumor Classification. Comput Biol Chem 2020; 89:107368. [PMID: 32919230 DOI: 10.1016/j.compbiolchem.2020.107368] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Revised: 08/17/2020] [Accepted: 08/25/2020] [Indexed: 12/12/2022]
Abstract
With the development of cancer research, various gene expression datasets containing cancer information show an explosive growth trend. In addition, due to the continuous maturity of single-cell RNA sequencing (scRNA-seq) technology, the protein information and pedigree information of a single cell are also continuously mined. It is a technical problem of how to classify these high-dimensional data correctly. In recent years, Extreme Learning Machine (ELM) has been widely used in the field of supervised learning and unsupervised learning. However, the traditional ELM does not consider the robustness of the method. To improve the robustness of ELM, in this paper, a novel ELM method based on L2,1-norm named L2,1-Extreme Learning Machine (L2,1 -ELM) has been proposed. The method introduces L2,1-norm on loss function to improve the robustness, and minimizes the influence of noise and outliers. Firstly, we evaluate the new method on five UCI datasets. The experiment results prove that our method can achieve competitive results. Next, the novel method is applied to the problem of classification of cancer samples and single-cell RNA sequencing datasets. The experimental results on The Cancer Genome Atlas (TCGA) datasets and scRNA-seq datasets prove that ELM and its variants has great potential in the classification of cancer samples.
Collapse
Affiliation(s)
- Liang-Rui Ren
- School of Information Science and Engineering, Qufu Normal University, Rizhao, China.
| | - Ying-Lian Gao
- Library of Qufu Normal University, Qufu Normal University, Rizhao, China.
| | - Jin-Xing Liu
- School of Information Science and Engineering, Qufu Normal University, Rizhao, China.
| | - Rong Zhu
- School of Information Science and Engineering, Qufu Normal University, Rizhao, China.
| | - Xiang-Zhen Kong
- School of Information Science and Engineering, Qufu Normal University, Rizhao, China.
| |
Collapse
|
7
|
da Silva BLS, Inaba FK, Salles EOT, Ciarelli PM. Fast Deep Stacked Networks based on Extreme Learning Machine applied to regression problems. Neural Netw 2020; 131:14-28. [PMID: 32721826 DOI: 10.1016/j.neunet.2020.07.018] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2019] [Revised: 07/11/2020] [Accepted: 07/13/2020] [Indexed: 10/23/2022]
Abstract
Deep learning techniques are commonly used to process large amounts of data, and good results are obtained in many applications. Those methods, however, can lead to long training times. An alternative to simultaneously tune all parameters of a large network is to stack smaller modules, improving the model efficiency. However, methods such as Deep Stacked Network (DSN) have some problems that increase its training time and memory usage. To deal with these problems, Fast DSN (FDSN) was proposed, where the modules are trained using an Extreme Learning Machine (ELM) variant. Nonetheless, to speed-up the FDSN training, the ELM random feature mapping is shared among the modules, which can impact the network performance if the weights are not properly chosen. In this paper, we focus on the weight initialization of FDSN in order to improve its performance. We also propose FKDSN, a kernel-based variant of FDSN, besides discussing the theoretical complexity of the methods. We evaluate three different initialization approaches on ELM-trained neural networks over 50 public real-world regression datasets. Our experiments show that FDSN when combined with a more complex initialization method achieves similar results to ELM algorithms applied to large SLFNs, besides having a shorter training time and memory usage, implying that it can be suitable to be used on systems with restrict resources, such as Internet of Things devices. FKDSN also obtained similar results and training time to the large SLFNs, requiring less memory.
Collapse
Affiliation(s)
- Bruno Légora Souza da Silva
- Department of Electrical Engineering, Universidade Federal do Espírito Santo, Av. Fernando Ferrari, 514, Vitória, Espírito Santo, Brazil.
| | - Fernando Kentaro Inaba
- Department of Electrical Engineering, Universidade Federal do Espírito Santo, Av. Fernando Ferrari, 514, Vitória, Espírito Santo, Brazil.
| | - Evandro Ottoni Teatini Salles
- Department of Electrical Engineering, Universidade Federal do Espírito Santo, Av. Fernando Ferrari, 514, Vitória, Espírito Santo, Brazil.
| | - Patrick Marques Ciarelli
- Department of Electrical Engineering, Universidade Federal do Espírito Santo, Av. Fernando Ferrari, 514, Vitória, Espírito Santo, Brazil.
| |
Collapse
|
8
|
Diker A, Avci E, Tanyildizi E, Gedikpinar M. A novel ECG signal classification method using DEA-ELM. Med Hypotheses 2019; 136:109515. [PMID: 31855682 DOI: 10.1016/j.mehy.2019.109515] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2019] [Revised: 11/25/2019] [Accepted: 11/30/2019] [Indexed: 01/17/2023]
Abstract
Electrocardiogram (ECG) signals represent the electrical mobility of the human heart. In recent years, computer-aided systems have helped to cardiologists in the detection, classification and diagnosis of ECG. The aim of this paper is to optimize the number hidden neurons of the traditional Extreme Learning Machine (ELM) using Differential Evolution Algorithm (DEA) and contribute to the classification of ECG signals with a higher accuracy rate. In this paper, publicly ECG records in Physionet was utilized. Pan-Tompkins technique (PTT) and Discrete Wavelet Transform (DWT) approaches were implemented to obtain characteristic properties which are PR period, QT period, ST period and QRS wave of ECG signals. Then, ELM was executed to the ECG samples. Lastly, DEA on software ELM was developed for the assign of the number of hidden neurons, which were used in the ELM algorithm. The performance criterions were used in order to compare the performance of the classification exerted. Concordantly, it was realized that the highest classification achievement values were reached to Accuracy 97.5% and values 93 of number of hidden neurons, with the practice improved with the DEA compared to conventional ELM.
Collapse
Affiliation(s)
- Aykut Diker
- Bitlis Eren University, Department of Informatics, TR-13100 Bitlis, Turkey
| | - Engin Avci
- Fırat University, Department of Software Engineering, TR-23100 Elazig, Turkey.
| | - Erkan Tanyildizi
- Fırat University, Department of Software Engineering, TR-23100 Elazig, Turkey.
| | - Mehmet Gedikpinar
- Fırat University, Department of Electric-Electronic Engineering, TR-23100 Elazig, Turkey.
| |
Collapse
|
9
|
Melekoodappattu JG, Subbian PS. A Hybridized ELM for Automatic Micro Calcification Detection in Mammogram Images Based on Multi-Scale Features. J Med Syst 2019; 43:183. [PMID: 31093789 DOI: 10.1007/s10916-019-1316-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2019] [Accepted: 04/25/2019] [Indexed: 01/27/2023]
Abstract
Detection of masses and micro calcifications are a stimulating task for radiologists in digital mammogram images. Radiologists using Computer Aided Detection (CAD) frameworks to find the breast lesion. Micro calcification may be the early sign of breast cancer. There are different kinds of methods used to detect and recognize micro calcification from mammogram images. This paper presents an ELM (Extreme Learning Machine) algorithm for micro calcification detection in digital mammogram images. The interference of mammographic image is removed at the pre-processing stages. A multi-scale features are extracted by a feature generation model. The performance did not improve by all extracted feature, therefore feature selection is performed by nature-inspired optimization algorithm. At last, the hybridized ELM classifier taken the selected optimal features to classify malignant from benign micro calcifications. The proposed work is compared with various classifiers and it shown better performance in training time, sensitivity, specificity and accuracy. The existing approaches considered here are SVM (Support Vector Machine) and NB (Naïve Bayes classifier). The proposed detection system provides 99.04% accuracy which is the better performance than the existing approaches. The optimal selection of feature vectors and the efficient classifier improves the performance of proposed system. Results illustrate the classification performance is better when compared with several other classification approaches.
Collapse
|
10
|
Abstract
BACKGROUND The morbidity of breast cancer has been increased in these years and ranked the first of all female diseases. Computer-aided diagnosis techniques for mammograms can help radiologists find early breast lesions. In mammograms, the degree of malignancy of the tumor is not only related to its morphology and texture features, but also closely related to the density of the tumor. However, in the current research on breast masses detection and diagnosis, people usually use the fusion feature of morphology and texture but neglect density, or only the density feature is considered. Therefore, this paper proposes a method to detect and diagnose the breast mass using fused features with density. METHODS In this paper, we first propose a method based on sub-region clustering to detect the breast mass. The breast region is divided into sub-regions of equal size, and each sub-region is extracted based on local density feature, after that, an Unsupervised ELM (US-ELM) is used for clustering to complete the mass detection. Second, the feature model is constructed based on the mass. This model is composed of the mass region density feature, morphology feature and texture feature. And Genetic Algorithm is used for feature selection, and the optimized feature model is formed. Finally, ELM is used to diagnose benign or malignant mass. RESULTS An experiment on the real dataset of 480 mammograms in Northeast China shows that our proposed method can effectively improve the detection and diagnosis accuracy of breast masses, where we obtained 0.9184 precision in detection of breast masses and 0.911 accuracy in diagnosis of breast masses. CONCLUSIONS We have proposed a mass detection system, which achieves better detection accuracy performance than the existing state-of-art algorithm. We also propose a mass diagnosis system based on the fused features with density, which is more efficient than other feature model and classifier on the same dataset.
Collapse
Affiliation(s)
- Zhiqiong Wang
- Sino-Dutch Biomedical and Information Engineering School, Northeastern University, China
- Neusoft Research of Intelligent Healthcare Technology, Co. Ltd., China
- Acoustics Science and Technology Laboratory, Harbin Engineering University, China
| | - Yukun Huang
- College of Information Science and Engineering, Northeastern University, China
| | - Mo Li
- School of Computer Science and Engineering, Key Laboratory of Big Data Management and Analytics (Liaoning), Northeastern University, China
| | - Hao Zhang
- Department of Breast Surgery, Shengjing Hospital of China Medical University, China
| | - Chen Li
- Sino-Dutch Biomedical and Information Engineering School, Northeastern University, China
| | - Junchang Xin
- School of Computer Science and Engineering, Key Laboratory of Big Data Management and Analytics (Liaoning), Northeastern University, China
| | - Wei Qian
- College of Engineering, University of Texas at El Paso, USA
| |
Collapse
|
11
|
Deng WY, Bai Z, Huang GB, Zheng QH. A Fast SVD-Hidden-nodes based Extreme Learning Machine for Large-Scale Data Analytics. Neural Netw 2015; 77:14-28. [PMID: 26907860 DOI: 10.1016/j.neunet.2015.09.003] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2015] [Revised: 08/03/2015] [Accepted: 09/07/2015] [Indexed: 10/22/2022]
Abstract
Big dimensional data is a growing trend that is emerging in many real world contexts, extending from web mining, gene expression analysis, protein-protein interaction to high-frequency financial data. Nowadays, there is a growing consensus that the increasing dimensionality poses impeding effects on the performances of classifiers, which is termed as the "peaking phenomenon" in the field of machine intelligence. To address the issue, dimensionality reduction is commonly employed as a preprocessing step on the Big dimensional data before building the classifiers. In this paper, we propose an Extreme Learning Machine (ELM) approach for large-scale data analytic. In contrast to existing approaches, we embed hidden nodes that are designed using singular value decomposition (SVD) into the classical ELM. These SVD nodes in the hidden layer are shown to capture the underlying characteristics of the Big dimensional data well, exhibiting excellent generalization performances. The drawback of using SVD on the entire dataset, however, is the high computational complexity involved. To address this, a fast divide and conquer approximation scheme is introduced to maintain computational tractability on high volume data. The resultant algorithm proposed is labeled here as Fast Singular Value Decomposition-Hidden-nodes based Extreme Learning Machine or FSVD-H-ELM in short. In FSVD-H-ELM, instead of identifying the SVD hidden nodes directly from the entire dataset, SVD hidden nodes are derived from multiple random subsets of data sampled from the original dataset. Comprehensive experiments and comparisons are conducted to assess the FSVD-H-ELM against other state-of-the-art algorithms. The results obtained demonstrated the superior generalization performance and efficiency of the FSVD-H-ELM.
Collapse
Affiliation(s)
- Wan-Yu Deng
- School of Computer, Xian University of Posts & Telecommunications, Shaanxi, China; School of Computer Engineering, Nanyang Technological University, Singapore.
| | - Zuo Bai
- School of Electrical & Electronic Engineering, Nanyang Technological University, Singapore
| | - Guang-Bin Huang
- School of Electrical & Electronic Engineering, Nanyang Technological University, Singapore
| | - Qing-Hua Zheng
- Department of Computer Science and Technology, Xi'an Jiaotong University, China
| |
Collapse
|