101
|
Fan J, Pan Z, Wang L, Wang Y. Codebook-softened Product Quantization for High Accuracy Approximate Nearest Neighbor Search. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.08.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
|
102
|
Bui TT, Vu TD, Jang E, Hwang GS, Choi D, Chung H. Feasibility for SERS-based discrimination of gallbladder cancer from polyp by indirect recognition of components in bile. Anal Chim Acta 2022; 1221:340152. [DOI: 10.1016/j.aca.2022.340152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2021] [Revised: 07/04/2022] [Accepted: 07/05/2022] [Indexed: 11/27/2022]
|
103
|
Ramón A, Torres AM, Milara J, Cascón J, Blasco P, Mateo J. eXtreme Gradient Boosting-based method to classify patients with COVID-19. J Investig Med 2022; 70:jim-2021-002278. [PMID: 35850970 DOI: 10.1136/jim-2021-002278] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/15/2022] [Indexed: 01/08/2023]
Abstract
Different demographic, clinical and laboratory variables have been related to the severity and mortality following SARS-CoV-2 infection. Most studies applied traditional statistical methods and in some cases combined with a machine learning (ML) method. This is the first study to date to comparatively analyze five ML methods to select the one that most closely predicts mortality in patients admitted with COVID-19. The aim of this single-center observational study is to classify, based on different types of variables, adult patients with COVID-19 at increased risk of mortality. SARS-CoV-2 infection was defined by a positive reverse transcriptase PCR. A total of 203 patients were admitted between March 15 and June 15, 2020 to a tertiary hospital. Data were extracted from the electronic medical record. Four supervised ML algorithms (k-nearest neighbors (KNN), decision tree (DT), Gaussian naïve Bayes (GNB) and support vector machine (SVM)) were compared with the eXtreme Gradient Boosting (XGB) method proposed to have excellent scalability and high running speed, among other qualities. The results indicate that the XGB method has the best prediction accuracy (92%), high precision (>0.92) and high recall (>0.92). The KNN, SVM and DT approaches present moderate prediction accuracy (>80%), moderate recall (>0.80) and moderate precision (>0.80). The GNB algorithm shows relatively low classification performance. The variables with the greatest weight in predicting mortality were C reactive protein, procalcitonin, glutamyl oxaloacetic transaminase, glutamyl pyruvic transaminase, neutrophils, D-dimer, creatinine, lactic acid, ferritin, days of non-invasive ventilation, septic shock and age. Based on these results, XGB is a solid candidate for correct classification of patients with COVID-19.
Collapse
Affiliation(s)
- Antonio Ramón
- Pharmacy Department, General University Hospital Consortium of Valencia, Valencia, Spain
| | - Ana Maria Torres
- Institute of Technology, Universidad de Castilla-La Mancha, Cuenca, Spain
| | - Javier Milara
- Pharmacy Department, General University Hospital Consortium of Valencia, Valencia, Spain
- Pharmacy Department, University of Valencia, Valencia, Spain
| | - Joaquín Cascón
- Institute of Technology, Universidad de Castilla-La Mancha, Cuenca, Spain
| | - Pilar Blasco
- Pharmacy Department, General University Hospital Consortium of Valencia, Valencia, Spain
| | - Jorge Mateo
- Institute of Technology, Universidad de Castilla-La Mancha, Cuenca, Spain
| |
Collapse
|
104
|
Membership score machine for highly nonlinear classification for small data. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03652-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
105
|
Lei B, Zhang Y, Liu D, Xu Y, Yue G, Cao J, Hu H, Yu S, Yang P, Wang T, Qiu Y, Xiao X, Wang S. Longitudinal study of early mild cognitive impairment via similarity-constrained group learning and self-attention based SBi-LSTM. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
106
|
Chen W, Yang K, Yu Z, Zhang W. Double-kernel based class-specific broad learning system for multiclass imbalance learning. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
|
107
|
|
108
|
Gurazada SG, Gao SC, Burstein F, Buntine P. Predicting Patient Length of Stay in Australian Emergency Departments Using Data Mining. SENSORS 2022; 22:s22134968. [PMID: 35808458 PMCID: PMC9269793 DOI: 10.3390/s22134968] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 06/14/2022] [Accepted: 06/28/2022] [Indexed: 02/01/2023]
Abstract
Length of Stay (LOS) is an important performance metric in Australian Emergency Departments (EDs). Recent evidence suggests that an LOS in excess of 4 h may be associated with increased mortality, but despite this, the average LOS continues to remain greater than 4 h in many EDs. Previous studies have found that Data Mining (DM) can be used to help hospitals to manage this metric and there is continued research into identifying factors that cause delays in ED LOS. Despite this, there is still a lack of specific research into how DM could use these factors to manage ED LOS. This study adds to the emerging literature and offers evidence that it is possible to predict delays in ED LOS to offer Clinical Decision Support (CDS) by using DM. Sixteen potentially relevant factors that impact ED LOS were identified through a literature survey and subsequently used as predictors to create six Data Mining Models (DMMs). An extract based on the Victorian Emergency Minimum Dataset (VEMD) was used to obtain relevant patient details and the DMMs were implemented using the Weka Software. The DMMs implemented in this study were successful in identifying the factors that were most likely to cause ED LOS > 4 h and also identify their correlation. These DMMs can be used by hospitals, not only to identify risk factors in their EDs that could lead to ED LOS > 4 h, but also to monitor these factors over time.
Collapse
Affiliation(s)
- Sai Gayatri Gurazada
- Faculty of Information Technology, Monash University, Clayton, Melbourne, VIC 3800, Australia
| | - Shijia Caddie Gao
- Faculty of Information Technology, Monash University, Clayton, Melbourne, VIC 3800, Australia
| | - Frada Burstein
- Faculty of Information Technology, Monash University, Clayton, Melbourne, VIC 3800, Australia
| | - Paul Buntine
- Eastern Health Clinical School Monash University, Box Hill, Melbourne, VIC 3128, Australia
| |
Collapse
|
109
|
Zhang Z, Yang S, Qin Y, Yang Z, Huang Y, Zhou X. Most relevant point query on road networks. Neural Comput Appl 2022:1-11. [PMID: 35789916 PMCID: PMC9244333 DOI: 10.1007/s00521-022-07485-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Accepted: 05/26/2022] [Indexed: 11/17/2022]
Abstract
Graphs are widespread in many real-life practical applications. One of a graph's fundamental and popular researches is investigating the relations between two given vertices. The relationship between nodes in the graph can be measured by the shortest distance. Moreover, the number of paths is also a popular metric to assess the relationship of different nodes. In many location-based services, users make decisions on the basis of both the two metrics. To address this problem, we propose a new hybrid-metric based on the number of paths with a distance constraint for road networks, which are special graphs. Based on it, a most relevant node query on road networks is identified. To handle this problem, we first propose a Shortest-Distance Constrained DFS, which uses the shortest distance to prune unqualified nodes. To further improve query efficiency, we present Batch Query DFS algorithm, which only needs only one DFS search. Our experiments on four real-life road networks demonstrate the performance of the proposed algorithms.
Collapse
Affiliation(s)
- Zining Zhang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan China
| | - Shenghong Yang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan China
| | - Yunchuan Qin
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan China
| | - Zhibang Yang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan China
| | - Yang Huang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan China
| | - Xu Zhou
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan China
| |
Collapse
|
110
|
Regional Analysis of Dust Day Duration in Central Iran. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12126248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
The duration of dust days (DDD) is one of the most important parameters that may worsen the effects of the presence of dust in the affected areas. Persistent pollution over consecutive dusty days can have particularly negative effects on the human respiratory system. The present analysis was conducted in Central Iran, where the phenomenon of dust is one of the most important problems. In this study, using dust codes recorded at 35 synoptic stations, the homogeneity of DDD across the region was investigated using the L-moments method. Then, characteristics of DDD over the period 1999–2018 were calculated. The results showed that the region is statistically homogeneous. Furthermore, Zabol and Zahdan are the stations worst affected, with the longest durations of 22 and 21 days in 2014. Additionally, the values of DDD with return periods of 5, 10, 25, and 50 years were calculated using fitted statistical distributions and kriging and mapped. Finally, using the K nearest neighbor method the most important factor affecting DDD of the spatial characteristics, including longitude, latitude, elevation, average daily temperature (tm), dew point (td), wind altitude (u), maximum wind speed (ffmax), and direction of the fastest wind (ddmax), was determined. It was found that the southeastern parts of the study area are affected by the longest dust storm duration in all return periods; over longer return periods, long dust storms are also found in the central parts, especially the central desert of Iran. Therefore, these areas should be given priority in fighting and controlling wind erosion. Furthermore, the results showed that the maximum wind speed has the greatest effect on DDD.
Collapse
|
111
|
Intelligent Measurement of Coal Moisture Based on Microwave Spectrum via Distance-Weighted kNN. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12126199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Realizing the rapid measurement of coal moisture content (MC) is of great significance. However, existing measurement methods are time-consuming and damage the original properties of the samples. To address these concerns, a coal MC intelligent measurement system is designed in this study that integrates microwave spectrum analysis and the distance-weighted k-nearest neighbor (DW-kNN) algorithm to realize rapid and non-destructive measurement of coal MC. Specifically, the measurement system is built using portable microwave analysis equipment, which can efficiently collect the microwave signals of coal. To improve the cleanliness of modeling data, an iterative clipping method based on Mahalanobis distance (MD-ICM) is used to detect and eliminate outliers. Based on multiple microwave frequency bands, various machine learning methods are evaluated, and it is found that coal MC measurement using broad frequency signals of 8.05–12.01 GHz yields the best results. Experiments are also carried out on coals from different regions to examine the regional robustness of the proposed method. The results of on-site testing with 27 additional samples show that the method based on the combination of microwave spectrum analysis and DW-kNN has a potential application prospect in the rapid measurement of coal MC.
Collapse
|
112
|
Seeded Classification of Satellite Image Time Series with Lower-Bounded Dynamic Time Warping. REMOTE SENSING 2022. [DOI: 10.3390/rs14122778] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Satellite Image Time Series (SITS) record the continuous temporal behavior of land cover types and thus provide a new perspective for finer-grained land cover classification compared with the usual spectral and spatial information contained in a static image. In addition, SITS data is becoming more accessible in recent years due to newly launched satellites and accumulated historical data. However, the lack of labeled training samples limits the exploration of SITS data, especially with sophisticated methods. Even with a straightforward classifier, such as k-nearest neighbor, the accuracy and efficiency of the SITS similarity measure is also a pending problem. In this paper, we propose SKNN-LB-DTW, a seeded SITS classification method based on lower-bounded Dynamic Time Warping (DTW). The word “seeded” indicates that only a few labeled samples are required, and this is not only because of the lack of labeled samples but also because of our aim to explore the rich information contained in SITS, rather than letting training samples dominate the classification results. We use a combination of cascading lower bounds and early abandoning of DTW as an accurate yet efficient similarity measure for large scale tasks. The experimental results on two real SITS datasets demonstrate the utility of the proposed SKNN-LB-DTW, which could become an effective solution for SITS classification when the amount of unlabeled SITS data far exceeds the labeled data.
Collapse
|
113
|
Combining Disease Mechanism and Machine Learning to Predict Wheat Fusarium Head Blight. REMOTE SENSING 2022. [DOI: 10.3390/rs14122732] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Wheat Fusarium head blight (FHB) can be effectively controlled through prediction. To address the low accuracy and poor stability of model predictions of wheat FHB, a prediction method of wheat FHB that couples a logistic regression mechanism-based model and k-nearest neighbours (KNN) model is proposed in this paper. First, we selected predictive factors, including remote sensing-based and meteorological factors. Then, we quantitatively expressed the factor weights of the disease occurrence and development mechanisms in the disease prediction model by using a logistic model. Subsequently, we integrated the obtained factor weights into the predictive factors and input the predictive factors with weights into the KNN model to predict the incidence of wheat FHB. Finally, the accuracy and generalizability of the models were evaluated. Wheat fields in Changfeng, Dingyuan, Fengyuan, and Feidong counties, Anhui Province, where wheat FHB often occurs, were used as the study area. The incidences of wheat FHB on 29 April and 10 May 2021 were predicted. Compared with a model that did not consider disease mechanism, the accuracy of our model increased by approximately 13%. The overall accuracies of the models for the two dates were 0.88 and 0.92, and the F1 index was 0.86 and 0.94, respectively. The results show that the predictions made with the logistic-KNN model had higher accuracy and better stability than those made with the KNN model, thus achieving remote sensing-based high-precision prediction of wheat FHB.
Collapse
|
114
|
Hu C, Tan Q, Zhang Q, Li Y, Wang F, Zou X, Peng Z. Application of interpretable machine learning for early prediction of prognosis in acute kidney injury. Comput Struct Biotechnol J 2022; 20:2861-2870. [PMID: 35765651 PMCID: PMC9193404 DOI: 10.1016/j.csbj.2022.06.003] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Revised: 05/31/2022] [Accepted: 06/01/2022] [Indexed: 12/05/2022] Open
Abstract
Background This study aimed to develop an algorithm using the explainable artificial intelligence (XAI) approaches for the early prediction of mortality in intensive care unit (ICU) patients with acute kidney injury (AKI). Methods This study gathered clinical data with AKI patients from the Medical Information Mart for Intensive Care IV (MIMIC-IV) in the US between 2008 and 2019. All the data were further randomly divided into a training cohort and a validation cohort. Seven machine learning methods were used to develop the models for assessing in-hospital mortality. The optimal model was selected based on its accuracy and area under the curve (AUC). The SHapley Additive exPlanation (SHAP) values and Local Interpretable Model-Agnostic Explanations (LIME) algorithm were utilized to interpret the optimal model. Results A total of 22,360 patients with AKI were finally enrolled in this study (median age, 69.5 years; female, 42.8%). They were randomly split into a training cohort (16770, 75%) and a validation cohort (5590, 25%). The eXtreme Gradient Boosting (XGBoost) model achieved the best performance with an AUC of 0.890. The SHAP values showed that Glasgow Coma Scale (GCS), blood urea nitrogen, cumulative urine output on Day 1 and age were the top 4 most important variables contributing to the XGBoost model. The LIME algorithm was used to explain the individualized predictions. Conclusions Machine-learning models based on clinical features were developed and validated with great performance for the early prediction of a high risk of death in patients with AKI.
Collapse
Affiliation(s)
- Chang Hu
- Department of Critical Care Medicine, Zhongnan Hospital of Wuhan University, Wuhan, Hubei 430071, China
- Clinical Research Center of Hubei Critical Care Medicine, Wuhan, Hubei 430071, China
| | - Qing Tan
- School of Mathematics and Statistics, Wuhan University, Wuhan, Hubei 430072, China
| | - Qinran Zhang
- School of Mathematics and Statistics, Wuhan University, Wuhan, Hubei 430072, China
| | - Yiming Li
- Department of Critical Care Medicine, Zhongnan Hospital of Wuhan University, Wuhan, Hubei 430071, China
- Clinical Research Center of Hubei Critical Care Medicine, Wuhan, Hubei 430071, China
| | - Fengyun Wang
- Department of Critical Care Medicine, Zhongnan Hospital of Wuhan University, Wuhan, Hubei 430071, China
- Clinical Research Center of Hubei Critical Care Medicine, Wuhan, Hubei 430071, China
| | - Xiufen Zou
- School of Mathematics and Statistics, Wuhan University, Wuhan, Hubei 430072, China
| | - Zhiyong Peng
- Department of Critical Care Medicine, Zhongnan Hospital of Wuhan University, Wuhan, Hubei 430071, China
- Clinical Research Center of Hubei Critical Care Medicine, Wuhan, Hubei 430071, China
| |
Collapse
|
115
|
Zhou W, Liu H, Lou J, Chen X. Locality sensitive hashing with bit selection. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03546-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
116
|
Bonacchi R, Filippi M, Rocca MA. Role of artificial intelligence in MS clinical practice. Neuroimage Clin 2022; 35:103065. [PMID: 35661470 PMCID: PMC9163993 DOI: 10.1016/j.nicl.2022.103065] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 05/04/2022] [Accepted: 05/26/2022] [Indexed: 11/24/2022]
Abstract
Machine learning (ML) and its subset, deep learning (DL), are branches of artificial intelligence (AI) showing promising findings in the medical field, especially when applied to imaging data. Given the substantial role of MRI in the diagnosis and management of patients with multiple sclerosis (MS), this disease is an ideal candidate for the application of AI techniques. In this narrative review, we are going to discuss the potential applications of AI for MS clinical practice, together with their limitations. Among their several advantages, ML algorithms are able to automate repetitive tasks, to analyze more data in less time and to achieve higher accuracy and reproducibility than the human counterpart. To date, these algorithms have been applied to MS diagnosis, prognosis, disease and treatment monitoring. Other fields of application have been improvement of MRI protocols as well as automated lesion and tissue segmentation. However, several challenges remain, including a better understanding of the information selected by AI algorithms, appropriate multicenter and longitudinal validations of results and practical aspects regarding hardware and software integration. Finally, one cannot overemphasize the paramount importance of human supervision, in order to optimize the use and take full advantage of the potential of AI approaches.
Collapse
Affiliation(s)
- Raffaello Bonacchi
- Neuroimaging Research Unit, Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Milan, Italy; Neurology Unit, IRCCS San Raffaele Scientific Institute, Milan, Italy; Vita-Salute San Raffaele University, Milan, Italy
| | - Massimo Filippi
- Neuroimaging Research Unit, Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Milan, Italy; Neurology Unit, IRCCS San Raffaele Scientific Institute, Milan, Italy; Neurorehabilitation Unit, IRCCS San Raffaele Scientific Institute, Milan, Italy; Neurophysiology Service, IRCCS San Raffaele Scientific Institute, Milan, Italy; Vita-Salute San Raffaele University, Milan, Italy
| | - Maria A Rocca
- Neuroimaging Research Unit, Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Milan, Italy; Neurology Unit, IRCCS San Raffaele Scientific Institute, Milan, Italy; Vita-Salute San Raffaele University, Milan, Italy.
| |
Collapse
|
117
|
A Lightweight Pose Sensing Scheme for Contactless Abnormal Gait Behavior Measurement. SENSORS 2022; 22:s22114070. [PMID: 35684689 PMCID: PMC9185243 DOI: 10.3390/s22114070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 05/24/2022] [Accepted: 05/25/2022] [Indexed: 12/10/2022]
Abstract
The recognition of abnormal gait behavior is important in the field of motion assessment and disease diagnosis. Currently, abnormal gait behavior is primarily recognized by pressure and inertial data obtained from wearable sensors. However, the data drift and wearing difficulties for patients have impeded the application of these wearable sensors. Here, we propose a contactless abnormal gait behavior recognition method that captures human pose data using a monocular camera. A lightweight OpenPose (OP) model is generated with Depthwise Separable Convolution to recognize joint points and extract their coordinates during walking in real time. For the walking data errors extracted in the 2D plane, a 3D reconstruction is performed on the walking data, and a total of 11 types of abnormal gait features are extracted by the OP model. Finally, the XGBoost algorithm is used for feature screening. The final experimental results show that the Random Forest (RF) algorithm in combination with 3D features delivers the highest precision (92.13%) for abnormal gait behavior recognition. The proposed scheme overcomes the data drift of inertial sensors and sensor wearing challenges in the elderly while reducing the hardware requirements for model deployment. With excellent real-time and contactless capabilities, the scheme is expected to enjoy a wide range of applications in the field of abnormal gait measurement.
Collapse
|
118
|
Wang M, Wang W, Zhang X, Iu HHC. A New Fault Diagnosis of Rolling Bearing Based on Markov Transition Field and CNN. ENTROPY 2022; 24:e24060751. [PMID: 35741472 PMCID: PMC9221820 DOI: 10.3390/e24060751] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 05/21/2022] [Accepted: 05/24/2022] [Indexed: 01/25/2023]
Abstract
The rolling bearing is a crucial component of the rotating machine, and it is particularly vital to ensure its normal operation. In addition, the selection of different category features will add uncertainty and bias to the classification results. In order to decrease the interference of these factors to fault diagnosis, a new method that automatically learns the features of the data combined with Markov transition field (MTF) and convolutional neural network (CNN) is proposed in this paper, namely MTF-CNN. The MTF contributes to convert the original time series into corresponding figures, and the CNN is used to extract the deep feature information in the figure to complete the fault diagnosis. The effectiveness of the proposed method is verified by two public data sets. The experimental results show that MTF-CNN can classify different types of faults, and the highest accuracy rate can reach 100%. Likewise, the classification accuracy of this method is higher than some existing methods.
Collapse
Affiliation(s)
- Mengjiao Wang
- School of Automation and Electronic Information, Xiangtan University, Xiangtan 411105, China;
- Correspondence:
| | - Wenjie Wang
- School of Automation and Electronic Information, Xiangtan University, Xiangtan 411105, China;
| | - Xinan Zhang
- School of Electrical, Electronic and Computer Engineering, University of Western Australia, Crawley, Perth, WA 6009, Australia; (X.Z.); (H.H.-C.I.)
| | - Herbert Ho-Ching Iu
- School of Electrical, Electronic and Computer Engineering, University of Western Australia, Crawley, Perth, WA 6009, Australia; (X.Z.); (H.H.-C.I.)
| |
Collapse
|
119
|
Organizational Geosocial Network: A Graph Machine Learning Approach Integrating Geographic and Public Policy Information for Studying the Development of Social Organizations in China. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2022. [DOI: 10.3390/ijgi11050318] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
This study aims to give an insight into the development trends and patterns of social organizations (SOs) in China from the perspective of network science integrating geography and public policy information embedded in the network structure. Firstly, we constructed a first-of-its-kind database which encompasses almost all social organizations established in China throughout the past decade. Secondly, we proposed four basic structures to represent the homogeneous and heterogeneous networks between social organizations and related social entities, such as government administrations and community members. Then, we pioneered the application of graph models to the field of organizations and embedded the Organizational Geosocial Network (OGN) into a low-dimensional representation of the social entities and relations while preserving their semantic meaning. Finally, we applied advanced graph deep learning methods, such as graph attention networks (GAT) and graph convolutional networks (GCN), to perform exploratory classification tasks by training models with county-level OGNs dataset and make predictions of which geographic region the county-level OGN belongs to. The experiment proves that different regions possess a variety of development patterns and economic structures where local social organizations are embedded, thus forming differential OGN structures, which can be sensed by graph machine learning algorithms and make relatively accurate predictions. To the best of our knowledge, this is the first application of graph deep learning to the construction and representation learning of geosocial network models of social organizations, which has certain reference significance for research in related fields.
Collapse
|
120
|
Simon CGK, Jhanjhi NZ, Goh WW, Sukumaran S. Applications of Machine Learning in Knowledge Management System: A Comprehensive Review. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT 2022. [DOI: 10.1142/s0219649222500174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
As new generations of technology appear, legacy knowledge management solutions and applications become increasingly out of date, necessitating a paradigm shift. Machine learning presents an opportunity by foregoing rule-based knowledge intensive systems inundating the marketplace. An extensive review was made on the literature pertaining to machine learning which common machine learning algorithms were identified. This study has analysed more than 200 papers extracted from Scopus and IEEE databases. Searches ranged with the bulk of the articles from 2018 to 2021, while some articles ranged from 1959 to 2017. The research gap focusses on implementing machine learning algorithm to knowledge management systems, specifically knowledge management attributes. By investigating and reviewing each algorithm extensively, the usability of each algorithm is identified, with its advantages and disadvantages. From there onwards, these algorithms were mapped for what area of knowledge management it may be beneficial. Based on the findings, it is evidently seen how these algorithms are applicable in knowledge management and how it can enhance knowledge management system further. Based on the findings, the paper aims to bridge the gap between the literature in knowledge management and machine learning. A knowledge management–machine learning framework is conceived based on the review done on each algorithm earlier and to bridge the gap between the two literatures. The framework highlights how machine learning algorithm can play a part in different areas of knowledge management. From the framework, it provides practitioners how and where to implement machine learning in knowledge management.
Collapse
Affiliation(s)
| | - Noor Zaman Jhanjhi
- Taylor’s University, 1, Jalan Taylors 47500 Subang Jaya, Selangor, Malaysia
| | - Wei Wei Goh
- Taylor’s University, 1, Jalan Taylors 47500 Subang Jaya, Selangor, Malaysia
| | | |
Collapse
|
121
|
Liu J, Huang Q, Yang X, Ding C. HPE-GCN: predicting efficacy of tonic formulae via graph convolutional networks integrating traditionally defined herbal properties. Methods 2022; 204:101-109. [PMID: 35597515 DOI: 10.1016/j.ymeth.2022.05.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Revised: 05/04/2022] [Accepted: 05/16/2022] [Indexed: 11/29/2022] Open
Abstract
Chinese herbal formulae are the heritage of traditional Chinese medicine (TCM) in treating diseases through thousands of years. The formula function is not just a simple herbal efficacy addition, but produces complex and nonlinear relationships between different herbs and their overall efficacy, which brings challenges to the formula efficacy analysis. In our study, we proposed a model called HPE-GCN that combines graph convolutional networks (GCN) with TCM-defined herbal properties (TCM-HPs) to predict formulae efficacy. In addition, to process the unstructured natural language in the formula text, we proposed a weighting calculation method related to herb frequency and the number of herbs in a formula called Formula-Herb dependence degree (FHDD), to assess the dependency degree of a formula with its herbs. In our research, 214 classic tonic formulae from ancient TCM books such as Synopsis of the Golden Chamber, Jingyue's Complete Works and the Golden Mirror of Medicin were collected as datasets. The performance of HPE-GCN on multi-classification of tonic formulae reached the best result compared with classic machine learning models, such as support vector machine, naive Bayes, logistic regression, gradient boosting decision tree, and K-nearest neighbors. The evaluated index Macro-Precision, Macro-Recall, Macro-F1 of HPE-GCN on the test set were 87.70%, 84.08% and 83.51% respectively, increased by 7.27%, 7.41% and 7.30% respectively from second best compared models. GCN has the advantage of low-dimensional feature expression for herbs and formulae, and is an effective analysis tool for TCM research. HPE-GCN integrates TCM-HPs and fits the complex nonlinear mapping relationship between TCM-HPs and formulae efficacy, which provides new ideas for related research.
Collapse
Affiliation(s)
- Jiajun Liu
- School of Informatics, Hunan University of Chinese Medicine, Changsha, Hunan, 410208, China
| | - Qunfu Huang
- School of Informatics, Hunan University of Chinese Medicine, Changsha, Hunan, 410208, China
| | - Xiaoyan Yang
- School of Informatics, Hunan University of Chinese Medicine, Changsha, Hunan, 410208, China
| | - Changsong Ding
- School of Informatics, Hunan University of Chinese Medicine, Changsha, Hunan, 410208, China; Big Data Analysis Laboratory of Traditional Chinese Medicine, Hunan University of Chinese Medicine, Changsha, Hunan, 410208, China.
| |
Collapse
|
122
|
Alex DM, Chandy DA, Christinal AH, Singh A, Pushkaran M. A Hybrid Random Forest Classifier for Chronic Kidney Disease Prediction from 2D Ultrasound Kidney Images. INT J PATTERN RECOGN 2022. [DOI: 10.1142/s0218001422560109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Chronic kidney disease (CKD) is one of the causes of mortality in almost all countries across the globe and the notable thing is its asymptomatic nature in the early stages. This disease is characterized by the gradual loss of kidney function in an individual. Frequently chronic kidney disease is diagnosed based on the Estimated Glomerular Filtration Rate (eGFR) determined from blood and urine tests. In order to reduce the risk factors arising due to chronic kidney disease, it is essential to be diagnosed in the earlier stages itself. This work proposes an automated chronic kidney disease detection based on the textural features of the kidney using a hybrid random forest classifier from 2D ultrasound kidney images. The proposed classifier is compared with the other competing machine learning classifiers through experimenting on a dataset of 150 images and gives a better accuracy of [Formula: see text] with [Formula: see text] of recall and precision, thus proving it to be promising in detecting CKD noninvasively in the early stages.
Collapse
Affiliation(s)
- Deepthy Mary Alex
- Department of ECE, Karunya Institute of Technology and Sciences, Karunya Nagar, Coimbatore - 641114, Tamil Nadu, India
| | - D. Abraham Chandy
- Department of ECE, Karunya Institute of Technology and Sciences, Karunya Nagar, Coimbatore - 641114, Tamil Nadu, India
| | - A. Hepzibah Christinal
- Department of Mathematics, Karunya Institute of Technology and Sciences, Karunya Nagar, Coimbatore - 641114, Tamil Nadu, India
| | - Arvinder Singh
- Department of Radiology, Sri Guru Ram Das Institute of Medical Sciences and Research, Sri Amritsar - 143501, Punjab, India
| | - M. Pushkaran
- Radiology Division, Kovai Diagnostic Centre, Coimbatore-641012, Tamil Nadu, India
| |
Collapse
|
123
|
Wang L, Yuan W, Zeng L, Xu J, Mo Y, Zhao X, Peng L. Dementia analysis from functional connectivity network with graph neural networks. Inf Process Manag 2022. [DOI: 10.1016/j.ipm.2022.102901] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
124
|
Zhang X, Xiao H, Gao R, Zhang H, Wang Y. K-nearest neighbors rule combining prototype selection and local feature weighting for classification. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.108451] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
125
|
Hajizadeh R, Aghagolzadeh A, Ezoji M. Mutual neighborhood and modified majority voting based KNN classifier for multi-categories classification. Pattern Anal Appl 2022. [DOI: 10.1007/s10044-022-01069-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
126
|
Zou X, Wang C, Luo M, Ren Q, Liu Y, Zhang S, Bai Y, Meng J, Zhang W, Su SW. Design of Electronic Nose Detection System for Apple Quality Grading Based on Computational Fluid Dynamics Simulation and K-Nearest Neighbor Support Vector Machine. SENSORS 2022; 22:s22082997. [PMID: 35458982 PMCID: PMC9025600 DOI: 10.3390/s22082997] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 04/09/2022] [Accepted: 04/12/2022] [Indexed: 12/04/2022]
Abstract
Apples are one of the most widely planted fruits in the world, with an extremely high annual production. Several issues should be addressed to avoid the damaging of samples during the quality grading process of apples (e.g., the long detection period and the inability to detect the internal quality of apples). In this study, an electronic nose (e-nose) detection system for apple quality grading based on the K-nearest neighbor support vector machine (KNN-SVM) was designed, and the nasal cavity structure of the e-nose was optimized by computational fluid dynamics (CFD) simulation. A KNN-SVM classifier was also proposed to overcome the shortcomings of the traditional SVMs. The performance of the developed device was experimentally verified in the following steps. The apples were divided into three groups according to their external and internal quality. The e-nose data were pre-processed before features extraction, and then Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) were used to reduce the dimension of the datasets. The recognition accuracy of the PCA–KNN-SVM classifier was 96.45%, and the LDA–KNN-SVM classifier achieved 97.78%. Compared with other commonly used classifiers, (traditional KNN, SVM, Decision Tree, and Random Forest), KNN-SVM is more efficient in terms of training time and accuracy of classification. Generally, the apple grading system can be used to evaluate the quality of apples during storage.
Collapse
Affiliation(s)
- Xiuguo Zou
- College of Artificial Intelligence, Nanjing Agricultural University, Nanjing 210031, China; (C.W.); (M.L.); (Q.R.); (Y.L.); (S.Z.)
- Correspondence: (X.Z.); (S.W.S.); Tel.: +86-25-5860-6585 (X.Z.)
| | - Chenyang Wang
- College of Artificial Intelligence, Nanjing Agricultural University, Nanjing 210031, China; (C.W.); (M.L.); (Q.R.); (Y.L.); (S.Z.)
| | - Manman Luo
- College of Artificial Intelligence, Nanjing Agricultural University, Nanjing 210031, China; (C.W.); (M.L.); (Q.R.); (Y.L.); (S.Z.)
| | - Qiaomu Ren
- College of Artificial Intelligence, Nanjing Agricultural University, Nanjing 210031, China; (C.W.); (M.L.); (Q.R.); (Y.L.); (S.Z.)
| | - Yingying Liu
- College of Artificial Intelligence, Nanjing Agricultural University, Nanjing 210031, China; (C.W.); (M.L.); (Q.R.); (Y.L.); (S.Z.)
| | - Shikai Zhang
- College of Artificial Intelligence, Nanjing Agricultural University, Nanjing 210031, China; (C.W.); (M.L.); (Q.R.); (Y.L.); (S.Z.)
| | - Yungang Bai
- College of Engineering, Nanjing Agricultural University, Nanjing 210031, China;
| | - Jiawei Meng
- Department of Mechanical Engineering, University College London, London WC1E 7JE, UK;
| | - Wentian Zhang
- Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney, NSW 2007, Australia;
| | - Steven W. Su
- Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney, NSW 2007, Australia;
- Correspondence: (X.Z.); (S.W.S.); Tel.: +86-25-5860-6585 (X.Z.)
| |
Collapse
|
127
|
Yu Z, He Q, Yang J, Luo M. A Supervised ML Applied Classification Model for Brain Tumors MRI. Front Pharmacol 2022; 13:884495. [PMID: 35462901 PMCID: PMC9024329 DOI: 10.3389/fphar.2022.884495] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2022] [Accepted: 03/28/2022] [Indexed: 12/15/2022] Open
Abstract
Brain Tumor originates from abnormal cells, which is developed uncontrollably. Magnetic resonance imaging (MRI) is developed to generate high-quality images and provide extensive medical research information. The machine learning algorithms can improve the diagnostic value of MRI to obtain automation and accurate classification of MRI. In this research, we propose a supervised machine learning applied training and testing model to classify and analyze the features of brain tumors MRI in the performance of accuracy, precision, sensitivity and F1 score. The result presents that more than 95% accuracy is obtained in this model. It can be used to classify features more accurate than other existing methods.
Collapse
Affiliation(s)
- Zhengyu Yu
- Department of Nephrology, The Second Xiangya Hospital, Central South University, Changsha, China
- Faculty of Engneering and IT, University of Technology Sydney, Sydney, NSW, Australia
| | - Qinghu He
- Department of Rehabilitation Medicine and Health Care, Hunan University of Medicine, Huaihua, China
| | - Jichang Yang
- Department of Rehabilitation Medicine and Health Care, Hunan University of Medicine, Huaihua, China
| | - Min Luo
- Department of Nephrology, The Second Xiangya Hospital, Central South University, Changsha, China
- Department of Rehabilitation Medicine and Health Care, Hunan University of Medicine, Huaihua, China
- *Correspondence: Min Luo,
| |
Collapse
|
128
|
Kirby E, Zenha R, Jamone L. Comparing Single Touch to Dynamic Exploratory Procedures for Robotic Tactile Object Recognition. IEEE Robot Autom Lett 2022. [DOI: 10.1109/lra.2022.3151261] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
129
|
Mora D, Nieto JA, Mateo J, Bikdeli B, Barco S, Trujillo-Santos J, Soler S, Font L, Bosevski M, Monreal M. Machine Learning to Predict Outcomes in Patients with Acute Pulmonary Embolism Who Prematurely Discontinued Anticoagulant Therapy. Thromb Haemost 2022; 122:570-577. [PMID: 34107539 DOI: 10.1055/a-1525-7220] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
BACKGROUND Patients with pulmonary embolism (PE) who prematurely discontinue anticoagulant therapy (<90 days) are at an increased risk for death or recurrences. METHODS We used the data from the RIETE (Registro Informatizado de Pacientes con Enfermedad TromboEmbólica) registry to compare the prognostic ability of five machine-learning (ML) models and logistic regression to identify patients at increased risk for the composite of fatal PE or recurrent venous thromboembolism (VTE) 30 days after discontinuation. ML models included decision tree, k-nearest neighbors algorithm, support vector machine, Ensemble, and neural network [NN]. A "full" model with 70 variables and a "reduced" model with 23 were analyzed. Model performance was assessed by confusion matrix metrics on the testing data for each model and a calibration plot. RESULTS Among 34,447 patients with PE, 1,348 (3.9%) discontinued therapy prematurely. Fifty-one (3.8%) developed fatal PE or sudden death and 24 (1.8%) had nonfatal VTE recurrences within 30 days after discontinuation. ML-NN was the best method for identification of patients experiencing the composite endpoint, predicting the composite outcome with an area under receiver operating characteristic (ROC) curve of 0.96 (95% confidence interval [CI]: 0.95-0.98), using either 70 or 23 variables captured before discontinuation. Similar numbers were obtained for sensitivity, specificity, positive predictive value, negative predictive value, and accuracy. The discrimination of logistic regression was inferior (area under ROC curve, 0.76 [95% CI: 0.70-0.81]). Calibration plots showed similar deviations from the perfect line for ML-NN and logistic regression. CONCLUSION The ML-NN method very well predicted the composite outcome after premature discontinuation of anticoagulation and outperformed traditional logistic regression.
Collapse
Affiliation(s)
- Damián Mora
- Department of Internal Medicine, Hospital Virgen de la Luz, Cuenca, Spain
| | - José A Nieto
- Department of Internal Medicine, Hospital Virgen de la Luz, Cuenca, Spain
| | - Jorge Mateo
- Neurobiological Research Group, Institute of Technology, Universidad de Castilla-La Mancha, Cuenca, Spain
| | - Behnood Bikdeli
- Cardiovascular Medicine Division, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States.,Yale/YNHH Center for Outcomes Research and Evaluation, New Haven, Connecticut, United States.,Cardiovascular Research Foundation (CRF), New York, New York, United States
| | - Stefano Barco
- Clinic of Angiology, University Hospital Zurich, Zurich, Switzerland.,Center for Thrombosis and Hemostasis, University Hospital Mainz, Mainz, Germany
| | - Javier Trujillo-Santos
- Department of Internal Medicine, Hospital General Universitario Santa Lucía, Universidad Católica de Murcia, Murcia, Spain
| | - Silvia Soler
- Department of Internal Medicine, Hospital Olot i Comarcal de la Garrotxa, Gerona, Spain
| | - Llorenç Font
- Department of Haematology, Hospital de Tortosa Verge de la Cinta, Tarragona, Spain
| | - Marijan Bosevski
- Faculty of Medicine, University Cardiology Clinic, Skopje, Republic of Macedonia
| | - Manuel Monreal
- Department of Internal Medicine, Hospital Germans Trias i Pujol, Badalona, Barcelona, Spain.,Department of Medicine, Universidad Católica de Murcia, Murcia, Spain
| | | |
Collapse
|
130
|
Chen H, Huo D, Zhang J. Gas Recognition in E-Nose System: A Review. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2022; 16:169-184. [PMID: 35412988 DOI: 10.1109/tbcas.2022.3166530] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Gas recognition is essential in an electronic nose (E-nose) system, which is responsible for recognizing multivariate responses obtained by gas sensors in various applications. Over the past decades, classical gas recognition approaches such as principal component analysis (PCA) have been widely applied in E-nose systems. In recent years, artificial neural network (ANN) has revolutionized the field of E-nose, especially spiking neural network (SNN). In this paper, we investigate recent gas recognition methods for E-nose, and compare and analyze them in terms of algorithms and hardware implementations. We find each classical gas recognition method has a relatively fixed framework and a few parameters, which makes it easy to be designed and perform well with limited gas samples, but weak in multi-gas recognition under noise. While ANN-based methods obtain better recognition accuracy with flexible architectures and lots of parameters. However, some ANNs are too complex to be implemented in portable E-nose systems, such as deep convolutional neural networks (CNNs). In contrast, SNN-based gas recognition methods achieve satisfying accuracy and recognize more types of gases, and could be implemented with energy-efficient hardware, which makes them a promising candidate in multi-gas identification.
Collapse
|
131
|
Leone A, Rescio G, Manni A, Siciliano P, Caroppo A. Comparative Analysis of Supervised Classifiers for the Evaluation of Sarcopenia Using a sEMG-Based Platform. SENSORS 2022; 22:s22072721. [PMID: 35408335 PMCID: PMC9002980 DOI: 10.3390/s22072721] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Revised: 03/22/2022] [Accepted: 03/31/2022] [Indexed: 12/26/2022]
Abstract
Sarcopenia is a geriatric condition characterized by a loss of strength and muscle mass, with a high impact on health status, functional independence and quality of life in older adults. To reduce the effects of the disease, just the diagnostic is not enough, it is necessary more than recognition. Surface electromyography is becoming increasingly relevant for the prevention and diagnosis of sarcopenia, also due to a wide diffusion of smart and minimally invasive wearable devices suitable for electromyographic monitoring. The purpose of this work is manifold. The first aim is the design and implementation of a hardware/software platform. It is based on the elaboration of surface electromyographic signals extracted from the Gastrocnemius Lateralis and Tibialis Anterior muscles, useful to analyze the strength of the muscles with the purpose of distinguishing three different “confidence” levels of sarcopenia. The second aim is to compare the efficiency of state of the art supervised classifiers in the evaluation of sarcopenia. The experimentation stage was performed on an “augmented” dataset starting from data acquired from 32 patients. The latter were distributed in an unbalanced manner on 3 “confidence” levels of sarcopenia. The obtained results in terms of classification accuracy demonstrated the ability of the proposed platform to distinguish different sarcopenia “confidence” levels, with highest accuracy value given by Support Vector Machine classifier, outperforming the other classifiers by an average of 7.7%.
Collapse
|
132
|
Automated BIM Reconstruction of Full-Scale Complex Tubular Engineering Structures Using Terrestrial Laser Scanning. REMOTE SENSING 2022. [DOI: 10.3390/rs14071659] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Due to the accumulation of manufacturing errors of components and construction errors, there are always deviations between an as-built complex tubular engineering structure (CTES) and its as-designed model. As terrestrial laser scanning (TLS) provides accurate point cloud data (PCD) for scanned objects, it can be used in the building information modeling (BIM) reconstruction of as-built CTESs for life cycle management. However, few studies have focused on the BIM reconstruction of a full-scale CTES from missing and noisy PCD. To this end, this study proposes an automated BIM reconstruction method based on the TLS for a full-scale CTES. In particular, a novel algorithm is proposed to extract the central axis of a tubular structure. An extended axis searching algorithm is applied to segment each component PCD. A slice-based method is used to estimate the geometric parameters of curved tubes. The proposed method is validated through a full-scale CTES, where the maximum error is 0.92 mm.
Collapse
|
133
|
Khan A, Javed A, Malik KM, Raza MA, Ryan J, Saudagar AKJ, Malik H. Toward Realigning Automatic Speaker Verification in the Era of COVID-19. SENSORS (BASEL, SWITZERLAND) 2022; 22:2638. [PMID: 35408252 PMCID: PMC9003118 DOI: 10.3390/s22072638] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 03/01/2022] [Accepted: 03/04/2022] [Indexed: 06/14/2023]
Abstract
The use of face masks has increased dramatically since the COVID-19 pandemic started in order to to curb the spread of the disease. Additionally, breakthrough infections caused by the Delta and Omicron variants have further increased the importance of wearing a face mask, even for vaccinated individuals. However, the use of face masks also induces attenuation in speech signals, and this change may impact speech processing technologies, e.g., automated speaker verification (ASV) and speech to text conversion. In this paper we examine Automatic Speaker Verification (ASV) systems against the speech samples in the presence of three different types of face mask: surgical, cloth, and filtered N95, and analyze the impact on acoustics and other factors. In addition, we explore the effect of different microphones, and distance from the microphone, and the impact of face masks when speakers use ASV systems in real-world scenarios. Our analysis shows a significant deterioration in performance when an ASV system encounters different face masks, microphones, and variable distance between the subject and microphone. To address this problem, this paper proposes a novel framework to overcome performance degradation in these scenarios by realigning the ASV system. The novelty of the proposed ASV framework is as follows: first, we propose a fused feature descriptor by concatenating the novel Ternary Deviated overlapping Patterns (TDoP), Mel Frequency Cepstral Coefficients (MFCC), and Gammatone Cepstral Coefficients (GTCC), which are used by both the ensemble learning-based ASV and anomaly detection system in the proposed ASV architecture. Second, this paper proposes an anomaly detection model for identifying vocal samples produced in the presence of face masks. Next, it presents a Peak Norm (PN) filter to approximate the signal of the speaker without a face mask in order to boost the accuracy of ASV systems. Finally, the features of filtered samples utilizing the PN filter and samples without face masks are passed to the proposed ASV to test for improved accuracy. The proposed ASV system achieved an accuracy of 0.99 and 0.92, respectively, on samples recorded without a face mask and with different face masks. Although the use of face masks affects the ASV system, the PN filtering solution overcomes this deficiency up to 4%. Similarly, when exposed to different microphones and distances, the PN approach enhanced system accuracy by up to 7% and 9%, respectively. The results demonstrate the effectiveness of the presented framework against an in-house prepared, diverse Multi Speaker Face Masks (MSFM) dataset, (IRB No. FY2021-83), consisting of samples of subjects taken with a variety of face masks and microphones, and from different distances.
Collapse
Affiliation(s)
- Awais Khan
- Department of Computer Science and Engineering, Oakland University, Rochester, MI 48309, USA; (A.K.); (M.A.R.); (J.R.)
| | - Ali Javed
- Department of Software Engineering, University of Engineering and Technology, Taxila 47050, Pakistan;
| | - Khalid Mahmood Malik
- Department of Computer Science and Engineering, Oakland University, Rochester, MI 48309, USA; (A.K.); (M.A.R.); (J.R.)
| | - Muhammad Anas Raza
- Department of Computer Science and Engineering, Oakland University, Rochester, MI 48309, USA; (A.K.); (M.A.R.); (J.R.)
| | - James Ryan
- Department of Computer Science and Engineering, Oakland University, Rochester, MI 48309, USA; (A.K.); (M.A.R.); (J.R.)
| | - Abdul Khader Jilani Saudagar
- Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 11432, Saudi Arabia;
| | - Hafiz Malik
- Department of Electrical and Computer Engineering, University of Michigan, Dearborn, MI 48128, USA;
| |
Collapse
|
134
|
Identification of D Modification Sites Using a Random Forest Model Based on Nucleotide Chemical Properties. Int J Mol Sci 2022; 23:ijms23063044. [PMID: 35328461 PMCID: PMC8950657 DOI: 10.3390/ijms23063044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Revised: 02/25/2022] [Accepted: 03/09/2022] [Indexed: 12/03/2022] Open
Abstract
Dihydrouridine (D) is an abundant post-transcriptional modification present in transfer RNA from eukaryotes, bacteria, and archaea. D has contributed to treatments for cancerous diseases. Therefore, the precise detection of D modification sites can enable further understanding of its functional roles. Traditional experimental techniques to identify D are laborious and time-consuming. In addition, there are few computational tools for such analysis. In this study, we utilized eleven sequence-derived feature extraction methods and implemented five popular machine algorithms to identify an optimal model. During data preprocessing, data were partitioned for training and testing. Oversampling was also adopted to reduce the effect of the imbalance between positive and negative samples. The best-performing model was obtained through a combination of random forest and nucleotide chemical property modeling. The optimized model presented high sensitivity and specificity values of 0.9688 and 0.9706 in independent tests, respectively. Our proposed model surpassed published tools in independent tests. Furthermore, a series of validations across several aspects was conducted in order to demonstrate the robustness and reliability of our model.
Collapse
|
135
|
Avellar L, Stefano Filho C, Delgado G, Frizera A, Rocon E, Leal-Junior A. AI-enabled photonic smart garment for movement analysis. Sci Rep 2022; 12:4067. [PMID: 35260746 PMCID: PMC8904460 DOI: 10.1038/s41598-022-08048-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 02/24/2022] [Indexed: 02/04/2023] Open
Abstract
Smart textiles are novel solutions for remote healthcare monitoring which involve non-invasive sensors-integrated clothing. Polymer optical fiber (POF) sensors have attractive features for smart textile technology, and combined with Artificial Intelligence (AI) algorithms increase the potential of intelligent decision-making. This paper presents the development of a fully portable photonic smart garment with 30 multiplexed POF sensors combined with AI algorithms to evaluate the system ability on the activity classification of multiple subjects. Six daily activities are evaluated: standing, sitting, squatting, up-and-down arms, walking and running. A k-nearest neighbors classifier is employed and results from 10 trials of all volunteers presented an accuracy of 94.00 (0.14)%. To achieve an optimal amount of sensors, the principal component analysis is used for one volunteer and results showed an accuracy of 98.14 (0.31)% using 10 sensors, 1.82% lower than using 30 sensors. Cadence and breathing rate were estimated and compared to the data from an inertial measurement unit located on the garment back and the highest error was 2.22%. Shoulder flexion/extension was also evaluated. The proposed approach presented feasibility for activity recognition and movement-related parameters extraction, leading to a system fully optimized, including the number of sensors and wireless communication, for Healthcare 4.0.
Collapse
Affiliation(s)
- Leticia Avellar
- Graduate Program in Electrical Engineering, Federal University of Espírito Santo (UFES), Fernando Ferrari Avenue, Vitória, 29075-910, Brazil.
| | - Carlos Stefano Filho
- Neurophysics Group, "Gleb Wataghin" Institute of Physics, University of Campinas, Campinas, Brazil
| | - Gabriel Delgado
- Centro de Automática y Robótica, Ctra. Campo Real, 28500, Arganda del Rey, Madrid, Spain
| | - Anselmo Frizera
- Graduate Program in Electrical Engineering, Federal University of Espírito Santo (UFES), Fernando Ferrari Avenue, Vitória, 29075-910, Brazil
| | - Eduardo Rocon
- Centro de Automática y Robótica, Ctra. Campo Real, 28500, Arganda del Rey, Madrid, Spain
| | - Arnaldo Leal-Junior
- Graduate Program in Electrical Engineering, Federal University of Espírito Santo (UFES), Fernando Ferrari Avenue, Vitória, 29075-910, Brazil
| |
Collapse
|
136
|
Zhang W, Shen J, Wang Y, Cai K, Zhang Q, Cao M. Blood SSR1: A Possible Biomarker for Early Prediction of Parkinson’s Disease. Front Mol Neurosci 2022; 15:762544. [PMID: 35310885 PMCID: PMC8924528 DOI: 10.3389/fnmol.2022.762544] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Accepted: 01/14/2022] [Indexed: 01/31/2023] Open
Abstract
Parkinson’s disease (PD) is the second most common neurodegenerative disease associated with age. Early diagnosis of PD is key to preventing the loss of dopamine neurons. Peripheral-blood biomarkers have shown their value in recent years because of their easy access and long-term monitoring advantages. However, few peripheral-blood biomarkers have proven useful. This study aims to explore potential peripheral-blood biomarkers for the early diagnosis of PD. Three substantia nigra (SN) transcriptome datasets from the Gene Expression Omnibus (GEO) database were divided into a training cohort and a test cohort. We constructed a protein–protein interaction (PPI) network and a weighted gene co-expression network analysis (WGCNA) network, found their overlapping differentially expressed genes and studied them as the key genes. Analysis of the peripheral-blood transcriptome datasets of PD patients from GEO showed that three key genes were upregulated in PD over healthy participants. Analysis of the relationship between their expression and survival and analysis of their brain expression suggested that these key genes could become biomarkers. Then, animal models were studied to validate the expression of the key genes, and only SSR1 (the signal sequence receptor subunit1) was significantly upregulated in both animal models in peripheral blood. Correlation analysis and logistic regression analysis were used to analyze the correlation between brain dopaminergic neurons and SSR1 expression, and it was found that SSR1 expression was negatively correlated with dopaminergic neuron survival. The upregulation of SSR1 expression in peripheral blood was also found to precede the abnormal behavior of animals. In addition, the application of artificial intelligence technology further showed the value of SSR1 in clinical PD prediction. The three classifiers all showed that SSR1 had high predictability for PD. The classifier with the best prediction accuracy was selected through AUC and MCC to construct a prediction model. In short, this research not only provides potential biomarkers for the early diagnosis of PD but also establishes a possible artificial intelligence model for predicting PD.
Collapse
Affiliation(s)
- Wen Zhang
- Department of Neurology, Affiliated Hospital of Nantong University, Nantong, China
| | - Jiabing Shen
- Department of Neurology, Affiliated Hospital of Nantong University, Nantong, China
| | - Yuhui Wang
- Department of Microelectrics, Peking University, Peking, China
| | - Kefu Cai
- Department of Neurology, Affiliated Hospital of Nantong University, Nantong, China
| | - Qi Zhang
- Key Laboratory of Neuroregeneration of Jiangsu and Ministry of Education, Co-innovation Center of Neuroregeneration, Nantong University, Nantong, China
- *Correspondence: Maohong Cao Qi Zhang
| | - Maohong Cao
- Department of Neurology, Affiliated Hospital of Nantong University, Nantong, China
- *Correspondence: Maohong Cao Qi Zhang
| |
Collapse
|
137
|
Hamid H, Naseer N, Nazeer H, Khan MJ, Khan RA, Shahbaz Khan U. Analyzing Classification Performance of fNIRS-BCI for Gait Rehabilitation Using Deep Neural Networks. SENSORS (BASEL, SWITZERLAND) 2022; 22:1932. [PMID: 35271077 PMCID: PMC8914987 DOI: 10.3390/s22051932] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 02/21/2022] [Accepted: 02/25/2022] [Indexed: 05/11/2023]
Abstract
This research presents a brain-computer interface (BCI) framework for brain signal classification using deep learning (DL) and machine learning (ML) approaches on functional near-infrared spectroscopy (fNIRS) signals. fNIRS signals of motor execution for walking and rest tasks are acquired from the primary motor cortex in the brain's left hemisphere for nine subjects. DL algorithms, including convolutional neural networks (CNNs), long short-term memory (LSTM), and bidirectional LSTM (Bi-LSTM) are used to achieve average classification accuracies of 88.50%, 84.24%, and 85.13%, respectively. For comparison purposes, three conventional ML algorithms, support vector machine (SVM), k-nearest neighbor (k-NN), and linear discriminant analysis (LDA) are also used for classification, resulting in average classification accuracies of 73.91%, 74.24%, and 65.85%, respectively. This study successfully demonstrates that the enhanced performance of fNIRS-BCI can be achieved in terms of classification accuracy using DL approaches compared to conventional ML approaches. Furthermore, the control commands generated by these classifiers can be used to initiate and stop the gait cycle of the lower limb exoskeleton for gait rehabilitation.
Collapse
Affiliation(s)
- Huma Hamid
- Department of Mechatronics and Biomedical Engineering, Air University, Islamabad 44000, Pakistan; (H.H.); (H.N.)
| | - Noman Naseer
- Department of Mechatronics and Biomedical Engineering, Air University, Islamabad 44000, Pakistan; (H.H.); (H.N.)
| | - Hammad Nazeer
- Department of Mechatronics and Biomedical Engineering, Air University, Islamabad 44000, Pakistan; (H.H.); (H.N.)
| | - Muhammad Jawad Khan
- School of Mechanical and Manufacturing Engineering, National University of Science and Technology, Islamabad 44000, Pakistan;
| | - Rayyan Azam Khan
- Department of Mechanical Engineering, University of Saskatchewan, Saskatoon, SK S7N 5A9, Canada;
| | - Umar Shahbaz Khan
- Department of Mechatronics Engineering, National University of Sciences and Technology, Islamabad 44000, Pakistan;
- National Centre of Robotics and Automation (NCRA), Rawalpindi 46000, Pakistan
| |
Collapse
|
138
|
Al Rashid SZ. Collaborative Computing-Based K-Nearest Neighbour Algorithm and Mutual Information to Classify Gene Expressions for Type 2 Diabetes. INTERNATIONAL JOURNAL OF E-COLLABORATION 2022. [DOI: 10.4018/ijec.304044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The classification process is used in gene expression data on venous endothelial cells of umbilical cords in humans to reveal the concepts of regulation of insulin using dynamic gene expression data for two classes, namely, control and exposed to insulin. The mutual information statistical feature selection method is used on all available datasets to select these significant genes. The data reduction results are divided into training and testing, and further supplemented to the KNN classifier for diabetes classification. The results show that the mutual information in KNN reaches the highest ranked 10,000 genes and the test classification accuracy is 100%. Pathway analysis and gene ontology enrichment are used to evaluate the targeted genes. The results clearly exhibit the importance of finding the most informative genes in the database by using the statistical gene selection technique to achieve a reduction in time and cost and increase the efficiency of the classifier. This method exhibits these significant results that can be applied to other data and diseases.
Collapse
|
139
|
A multi-voter multi-commission nearest neighbor classifier. JOURNAL OF KING SAUD UNIVERSITY - COMPUTER AND INFORMATION SCIENCES 2022. [DOI: 10.1016/j.jksuci.2022.01.018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
140
|
Instance-based learning using the Half-Space Proximal Graph. Pattern Recognit Lett 2022. [DOI: 10.1016/j.patrec.2022.01.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
141
|
Balaha HM, El-Gendy EM, Saafan MM. A complete framework for accurate recognition and prognosis of COVID-19 patients based on deep transfer learning and feature classification approach. Artif Intell Rev 2022; 55:5063-5108. [PMID: 35125606 PMCID: PMC8799451 DOI: 10.1007/s10462-021-10127-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The sudden appearance of COVID-19 has put the world in a serious situation. Due to the rapid spread of the virus and the increase in the number of infected patients and deaths, COVID-19 was declared a pandemic. This pandemic has its destructive effect not only on humans but also on the economy. Despite the development and availability of different vaccines for COVID-19, scientists still warn the citizens of new severe waves of the virus, and as a result, fast diagnosis of COVID-19 is a critical issue. Chest imaging proved to be a powerful tool in the early detection of COVID-19. This study introduces an entire framework for the early detection and early prognosis of COVID-19 severity in the diagnosed patients using laboratory test results. It consists of two phases (1) Early Diagnostic Phase (EDP) and (2) Early Prognostic Phase (EPP). In EDP, COVID-19 patients are diagnosed using CT chest images. In the current study, 5, 159 COVID-19 and 10, 376 normal computed tomography (CT) images of Egyptians were used as a dataset to train 7 different convolutional neural networks using transfer learning. Data augmentation normal techniques and generative adversarial networks (GANs), CycleGAN and CCGAN, were used to increase the images in the dataset to avoid overfitting issues. 28 experiments were applied and multiple performance metrics were captured. Classification with no augmentation yielded \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$99.61\%$$\end{document}99.61% accuracy by EfficientNetB7 architecture. By applying CycleGAN and CC-GAN Augmentation, the maximum reported accuracies were \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$99.57\%$$\end{document}99.57% and \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$99.14\%$$\end{document}99.14% by MobileNetV1 and VGG-16 architectures respectively. In EPP, the prognosis of the severity of COVID-19 in patients is early determined using laboratory test results. In this study, 25 different classification techniques were applied and from the different results, the highest accuracies were \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$98.70\%$$\end{document}98.70% and \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$97.40\%$$\end{document}97.40% reported by the Ensemble Bagged Trees and Tree (Fine, Medium, and Coarse) techniques respectively.
Collapse
Affiliation(s)
- Hossam Magdy Balaha
- Computers and Control Systems Department, Faculty of Engineering, Mansoura University, Mansoura, Egypt
| | - Eman M. El-Gendy
- Computers and Control Systems Department, Faculty of Engineering, Mansoura University, Mansoura, Egypt
| | - Mahmoud M. Saafan
- Computers and Control Systems Department, Faculty of Engineering, Mansoura University, Mansoura, Egypt
| |
Collapse
|
142
|
A Time Series Forecasting of Global Horizontal Irradiance on Geographical Data of Najran Saudi Arabia. ENERGIES 2022. [DOI: 10.3390/en15030928] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Environment-friendly and renewable energy resources are the need of each developed and undeveloped country. Solar energy is one of them, thus accurate forecasting of it can be useful for electricity supply companies. This research focuses on analyzing the daily global solar radiation (GSR) data of Najran province located in Saudi Arabia and proposed a model for the prediction of global horizontal irradiance (GHI). The weather data is collected from Najran University. After inspecting the data, I we found the dependent and independent variables for calculating the GHI. A dataset model has been trained by creating tensor of variables belonging to air, wind, peak wind, relative humidity, and barometric pressure. Furthermore, six machine learning algorithms convolutional neural networks (CNN), K-nearest neighbors (KNN), support vector machines (SVM), logistic regression (LR), random forest classifier (RFC), and support vector classifier (SVC) techniques are used on dataset model to predict the GHI. The evaluation metrics determination coefficients (R2), root mean square error (RMSE), relative root mean square error (rRMSE), mean bias error (MBE), mean absolute bias error (MABE), mean absolute percentage error (MAPE), and T-statistic (t-stat) are used for the result verification of proposed models. Finally, the current work reports that all methods examined in this work may be utilized to accurately predict GHI; however, the SVC technique is the most suitable method amongst all techniques by claiming the precise results using the evaluation metrics.
Collapse
|
143
|
A hybrid meta-heuristic-based multi-objective feature selection with adaptive capsule network for automated email spam detection. INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS 2022. [DOI: 10.1007/s41315-021-00217-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
144
|
Al-Ahmad BI, Al-Zoubi AA, Kabir MF, Al-Tawil M, Aljarah I. Swarm intelligence-based model for improving prediction performance of low-expectation teams in educational software engineering projects. PeerJ Comput Sci 2022; 8:e857. [PMID: 35174274 PMCID: PMC8802785 DOI: 10.7717/peerj-cs.857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 12/29/2021] [Indexed: 06/14/2023]
Abstract
Software engineering is one of the most significant areas, which extensively used in educational and industrial fields. Software engineering education plays an essential role in keeping students up to date with software technologies, products, and processes that are commonly applied in the software industry. The software development project is one of the most important parts of the software engineering course, because it covers the practical side of the course. This type of project helps strengthening students' skills to collaborate in a team spirit to work on software projects. Software project involves the composition of software product and process parts. Software product part represents software deliverables at each phase of Software Development Life Cycle (SDLC) while software process part captures team activities and behaviors during SDLC. The low-expectation teams face challenges during different stages of software project. Consequently, predicting performance of such teams is one of the most important tasks for learning process in software engineering education. The early prediction of performance for low-expectation teams would help instructors to address difficulties and challenges related to such teams at earliest possible phases of software project to avoid project failure. Several studies attempted to early predict the performance for low-expectation teams at different phases of SDLC. This study introduces swarm intelligence -based model which essentially aims to improve the prediction performance for low-expectation teams at earliest possible phases of SDLC by implementing Particle Swarm Optimization-K Nearest Neighbours (PSO-KNN), and it attempts to reduce the number of selected software product and process features to reach higher accuracy with identifying less than 40 relevant features. Experiments were conducted on the Software Engineering Team Assessment and Prediction (SETAP) project dataset. The proposed model was compared with the related studies and the state-of-the-art Machine Learning (ML) classifiers: Sequential Minimal Optimization (SMO), Simple Linear Regression (SLR), Naïve Bayes (NB), Multilayer Perceptron (MLP), standard KNN, and J48. The proposed model provides superior results compared to the traditional ML classifiers and state-of-the-art studies in the investigated phases of software product and process development.
Collapse
Affiliation(s)
- Bilal I. Al-Ahmad
- Faculty of Information Technology and Systems, University of Jordan, Aqaba, Aqaba, Jordan
| | - Ala’ A. Al-Zoubi
- School of Science, Technology and Engineering, University of Granada, Granada, Spain, Spain
- King Abdullah II School for Information Technology, University of Jordan, Amman, Ãmmãn, Jordan
| | - Md Faisal Kabir
- Pennsylvania State University - Harrisburg, Middletown, PA, USA
- United International University (UIU), Dhaka, Bangladesh
| | - Marwan Al-Tawil
- King Abdullah II School for Information Technology, University of Jordan, Amman, Ãmmãn, Jordan
| | - Ibrahim Aljarah
- King Abdullah II School for Information Technology, University of Jordan, Amman, Ãmmãn, Jordan
| |
Collapse
|
145
|
Abstract
A novel fast target recognition algorithm is proposed under the dynamic scene moving target recognition. Aiming at the poor matching effect of the traditional Oriented Fast and Rotated Brief (ORB) algorithm on underexposed or overexposed images caused by illumination, the idea of combining adaptive histogram equalization with the ORB algorithm is proposed to get better feature point quality and matching efficiency. First, the template image and each frame of the video stream are processed by grayscale. Second, the template image and the image to be input in the video stream are processed by adaptive histogram equalization. Third, the feature point descriptors of the ORB feature are quantized by the Hamming distance. Finally, the K-nearest-neighbor (KNN) matching algorithm is used to match and screen feature points. According to the matching good feature point logarithm, a reasonable threshold is established and the target is classified. The comparison and verification are carried out by experiments. Experimental results show that the algorithm not only maintains the superiority of ORB itself but also significantly improves the performance of ORB under the conditions of underexposure or overexposure. The matching effect of the image is robust to illumination, and the target to be detected can be accurately identified in real time. The target can be accurately classified in the small sample scene, which can meet the actual production requirements.
Collapse
|
146
|
Chola C, Benifa JVB, Guru DS, Muaad AY, Hanumanthappa J, Al-antari MA, AlSalman H, Gumaei AH. Gender Identification and Classification of Drosophila melanogaster Flies Using Machine Learning Techniques. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:4593330. [PMID: 35069782 PMCID: PMC8776435 DOI: 10.1155/2022/4593330] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/23/2021] [Accepted: 12/10/2021] [Indexed: 01/02/2023]
Abstract
Drosophila melanogaster is an important genetic model organism used extensively in medical and biological studies. About 61% of known human genes have a recognizable match with the genetic code of Drosophila flies, and 50% of fly protein sequences have mammalian analogues. Recently, several investigations have been conducted in Drosophila to study the functions of specific genes exist in the central nervous system, heart, liver, and kidney. The outcomes of the research in Drosophila are also used as a unique tool to study human-related diseases. This article presents a novel automated system to classify the gender of Drosophila flies obtained through microscopic images (ventral view). The proposed system takes an image as input and converts it into grayscale illustration to extract the texture features from the image. Then, machine learning (ML) classifiers such as support vector machines (SVM), Naive Bayes (NB), and K-nearest neighbour (KNN) are used to classify the Drosophila as male or female. The proposed model is evaluated using the real microscopic image dataset, and the results show that the accuracy of the KNN is 90%, which is higher than the accuracy of the SVM classifier.
Collapse
Affiliation(s)
- Channabasava Chola
- Department of Computer Science and Engineering, Indian Institute of Information Technology, Kottayam, India
- Department of Studies in Computer Science, University of Mysore, Karnataka, India
| | - J. V. Bibal Benifa
- Department of Computer Science and Engineering, Indian Institute of Information Technology, Kottayam, India
| | - D. S. Guru
- Department of Studies in Computer Science, University of Mysore, Karnataka, India
| | - Abdullah Y. Muaad
- Department of Studies in Computer Science, University of Mysore, Karnataka, India
- Sana'a Community College, Sana'a 5695, Yemen
| | - J. Hanumanthappa
- Department of Studies in Computer Science, University of Mysore, Karnataka, India
| | - Mugahed A. Al-antari
- Department of Computer Science and Engineering, College of Software, Kyung Hee University, Suwon-si 17104, Republic of Korea
| | - Hussain AlSalman
- Department of Computer Science, College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia
| | - Abdu H. Gumaei
- Computer Science Department, Faculty of Applied Sciences, Taiz University, Taiz 6803, Yemen
| |
Collapse
|
147
|
Singh H, Sharma V, Singh D. Comparative analysis of proficiencies of various textures and geometric features in breast mass classification using k-nearest neighbor. Vis Comput Ind Biomed Art 2022; 5:3. [PMID: 35018506 PMCID: PMC8752652 DOI: 10.1186/s42492-021-00100-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 12/23/2021] [Indexed: 11/10/2022] Open
Abstract
This paper introduces a comparative analysis of the proficiencies of various textures and geometric features in the diagnosis of breast masses on mammograms. An improved machine learning-based framework was developed for this study. The proposed system was tested using 106 full field digital mammography images from the INbreast dataset, containing a total of 115 breast mass lesions. The proficiencies of individual and various combinations of computed textures and geometric features were investigated by evaluating their contributions towards attaining higher classification accuracies. Four state-of-the-art filter-based feature selection algorithms (Relief-F, Pearson correlation coefficient, neighborhood component analysis, and term variance) were employed to select the top 20 most discriminative features. The Relief-F algorithm outperformed other feature selection algorithms in terms of classification results by reporting 85.2% accuracy, 82.0% sensitivity, and 88.0% specificity. A set of nine most discriminative features were then selected, out of the earlier mentioned 20 features obtained using Relief-F, as a result of further simulations. The classification performances of six state-of-the-art machine learning classifiers, namely k-nearest neighbor (k-NN), support vector machine, decision tree, Naive Bayes, random forest, and ensemble tree, were investigated, and the obtained results revealed that the best classification results (accuracy = 90.4%, sensitivity = 92.0%, specificity = 88.0%) were obtained for the k-NN classifier with the number of neighbors having k = 5 and squared inverse distance weight. The key findings include the identification of the nine most discriminative features, that is, FD26 (Fourier Descriptor), Euler number, solidity, mean, FD14, FD13, periodicity, skewness, and contrast out of a pool of 125 texture and geometric features. The proposed results revealed that the selected nine features can be used for the classification of breast masses in mammograms.
Collapse
Affiliation(s)
- Harmandeep Singh
- Department of Computer Science and Engineering, IKG Punjab Technical University, Jalandhar, Punjab, 144603, India.
| | - Vipul Sharma
- Department of Computer Science and Engineering, IKG Punjab Technical University, Jalandhar, Punjab, 144603, India
| | - Damanpreet Singh
- Department of Computer Science and Engineering, Sant Longowal Institute of Engineering and Technology, Sangrur, Punjab, 148106, India
| |
Collapse
|
148
|
Tang F, Lu Z, Lei H, Lai Y, Lu Z, Li Z, Tang Z, Zhang J, He Z. DNA Methylation Data-Based Classification and Identification of Prognostic Signature of Children With Wilms Tumor. Front Cell Dev Biol 2022; 9:683242. [PMID: 35004665 PMCID: PMC8740190 DOI: 10.3389/fcell.2021.683242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 12/02/2021] [Indexed: 11/29/2022] Open
Abstract
Background: As an epigenetic alteration, DNA methylation plays an important role in early Wilms tumorigenesis and is possibly used as marker to improve the diagnosis and classification of tumor heterogeneity. Methods: Methylation data, RNA-sequencing (RNA-seq) data, and corresponding clinical information were downloaded from the Therapeutically Applicable Research to Generate Effective Treatments (TARGET) database. The prognostic values of DNA methylation subtypes in Wilms tumor were identified. Results: Four prognostic subtypes of Wilms tumor patients were identified by consensus cluster analysis performed on 312 independent prognostic CpG sites. Cluster one showed the best prognosis, whereas Cluster two represented the worst prognosis. Unique CpG sites identified in Cluster one that were not identified in other subtypes were assessed to construct a prognostic signature. The prognostic methylation risk score was closely related to prognosis, and the area under the curve (AUC) was 0.802. Furthermore, the risk score based on prognostic signature was identified as an independent prognostic factor for Wilms tumor in univariate and multivariate Cox regression analyses. Finally, the abundance of B cell infiltration was higher in the low-risk group than in the high-risk group, based on the methylation data. Conclusion: Collectively, we divided Wilms tumor cases into four prognostic subtypes, which could efficiently identify high-risk Wilms tumor patients. Prognostic methylation risk scores that were significantly associated with the adverse clinical outcomes were established, and this prognostic signature was able to predict the prognosis of Wilms tumor in children, which may be useful in guiding clinicians in therapeutic decision-making. Further independent studies are needed to validate and advance this hypothesis.
Collapse
Affiliation(s)
- Fucai Tang
- Department of Urology, The Eighth Affiliated Hospital, Sun Yat-sen University, Shenzhen, China
| | - Zeguang Lu
- The Second Clinical College of Guangzhou Medical University, Guangzhou, China
| | - Hanqi Lei
- Department of Urology, The Seventh Affiliated Hospital of Sun Yat-Sen University, Shenzhen, China.,Department of Urology, Sun Yat-sen Memorial Hospital of Sun Yat-sen University, Guangzhou, China
| | - Yongchang Lai
- Department of Urology, The Eighth Affiliated Hospital, Sun Yat-sen University, Shenzhen, China
| | - Zechao Lu
- The First Clinical College of Guangzhou Medical University, Guangzhou, China
| | - Zhibiao Li
- The Third Clinical College of Guangzhou Medical University, Guangzhou, China
| | - Zhicheng Tang
- The Third Clinical College of Guangzhou Medical University, Guangzhou, China
| | - Jiahao Zhang
- The Sixth Clinical College of Guangzhou Medical University, Guangzhou, China
| | - Zhaohui He
- Department of Urology, The Eighth Affiliated Hospital, Sun Yat-sen University, Shenzhen, China
| |
Collapse
|
149
|
He H, Chen G, Chen CYC. Machine learning and graph neural network for finding potential drugs related to multiple myeloma. NEW J CHEM 2022. [DOI: 10.1039/d1nj04935f] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
An innovative voting mechanism for virtual drug screening.
Collapse
Affiliation(s)
- Haohuai He
- School of Intelligent Systems Engineering, Sun Yat-sen University, Shenzhen, 510275, China
| | - Guanxing Chen
- School of Intelligent Systems Engineering, Sun Yat-sen University, Shenzhen, 510275, China
| | - Calvin Yu-Chian Chen
- School of Intelligent Systems Engineering, Sun Yat-sen University, Shenzhen, 510275, China
- Department of Medical Research, China Medical University Hospital, Taichung 40447, Taiwan
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung 41354, Taiwan
| |
Collapse
|
150
|
Wang Y, Pan Z, Dong J. A new two-layer nearest neighbor selection method for kNN classifier. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2021.107604] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|