1
|
Alimisis V, Aletraris C, Eleftheriou NP, Serlis EA, James A, Sotiriadis PP. Low-Power Analog Integrated Architecture of the Voting Classification Algorithm for Diabetes Disease Prediction. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2025; 19:93-107. [PMID: 38954585 DOI: 10.1109/tbcas.2024.3421313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2024]
Abstract
A low-power ( 600nW), fully analog integrated architecture for a voting classification algorithm is introduced. It can effectively handle multiple-input features, maintaining exceptional levels of accuracy and with very low power consumption. The proposed architecture is based on a versatile Voting algorithm that selectively incorporates one of three key classification models: Bayes or Centroid, or, the Learning Vector Quantization model; all of which are implemented using Gaussian-likelihood and Euclidean distance function circuits, as well as a current comparison circuit. To evaluate the proposed architecture, a comprehensive comparison with popular analog classifiers is performed, using real-life diabetes dataset. All model architectures were trained using Python and compared with the software-based classifiers. The circuit implementations were performed using the TSMC nm CMOS process technology and the Cadence IC Suite was utilized for the design, schematic and post-layout simulations. The proposed classifiers achieved sensitivity of and specificity of .
Collapse
|
2
|
van Veen R, Tamboli NRB, Lövdal S, Meles SK, Renken RJ, de Vries GJ, Arnaldi D, Morbelli S, Clavero P, Obeso JA, Oroz MCR, Leenders KL, Villmann T, Biehl M. Subspace corrected relevance learning with application in neuroimaging. Artif Intell Med 2024; 149:102786. [PMID: 38462286 DOI: 10.1016/j.artmed.2024.102786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 01/12/2024] [Accepted: 01/21/2024] [Indexed: 03/12/2024]
Abstract
In machine learning, data often comes from different sources, but combining them can introduce extraneous variation that affects both generalization and interpretability. For example, we investigate the classification of neurodegenerative diseases using FDG-PET data collected from multiple neuroimaging centers. However, data collected at different centers introduces unwanted variation due to differences in scanners, scanning protocols, and processing methods. To address this issue, we propose a two-step approach to limit the influence of center-dependent variation on the classification of healthy controls and early vs. late-stage Parkinson's disease patients. First, we train a Generalized Matrix Learning Vector Quantization (GMLVQ) model on healthy control data to identify a "relevance space" that distinguishes between centers. Second, we use this space to construct a correction matrix that restricts a second GMLVQ system's training on the diagnostic problem. We evaluate the effectiveness of this approach on the real-world multi-center datasets and simulated artificial dataset. Our results demonstrate that the approach produces machine learning systems with reduced bias - being more specific due to eliminating information related to center differences during the training process - and more informative relevance profiles that can be interpreted by medical experts. This method can be adapted to similar problems outside the neuroimaging domain, as long as an appropriate "relevance space" can be identified to construct the correction matrix.
Collapse
Affiliation(s)
- Rick van Veen
- Bernoulli Institute for Mathematics, Computer Science and Artificial Intelligence, University of Groningen, The Netherlands.
| | - Neha Rajendra Bari Tamboli
- Bernoulli Institute for Mathematics, Computer Science and Artificial Intelligence, University of Groningen, The Netherlands.
| | - Sofie Lövdal
- Bernoulli Institute for Mathematics, Computer Science and Artificial Intelligence, University of Groningen, The Netherlands; Department of Nuclear Medicine and Molecular Imaging, University Medical Center Groningen, The Netherlands.
| | - Sanne K Meles
- Department of Neurology, University Medical Center Groningen, The Netherlands.
| | - Remco J Renken
- Department of Biomedical Sciences of Cells & Systems, Cognitive Neuroscience Center, University Medical Center Groningen, The Netherlands.
| | | | - Dario Arnaldi
- Department of Neuroscience, University of Genoa, Italy; IRCCS Ospedale Policlinico San Martino, Genoa, Italy.
| | - Silvia Morbelli
- IRCCS Ospedale Policlinico San Martino, Genoa, Italy; Department of Health Sciences, University of Genoa, Italy.
| | - Pedro Clavero
- Servicio de Neurología, Complejo Hospitalario de Navarra, Pamplona, Spain.
| | - José A Obeso
- Académico de Número Real Academia Nacional de Medicina de España, Spain.
| | - Maria C Rodriguez Oroz
- Neurology Department, Clínica Universidad de Navarra, Spain; Neuroscience Program, Center for Applied Medical Research, Universidad de Navarra, Pamplona, Spain; Navarra Institute for Health Research, Pamplona, Spain
| | - Klaus L Leenders
- Department of Nuclear Medicine and Molecular Imaging, University Medical Center Groningen, The Netherlands.
| | - Thomas Villmann
- Saxon Institute for Computational Intelligence and Machine Learning, University of Applied Sciences Mittweida, Germany.
| | - Michael Biehl
- Bernoulli Institute for Mathematics, Computer Science and Artificial Intelligence, University of Groningen, The Netherlands; SMQB, Inst. of Metabolism and Systems Research, College of Medical and Dental Sciences, Birmingham, United Kingdom.
| |
Collapse
|
3
|
Jiang H, Chen Y, Kong L, Cai G, Jiang H. An LVQ clustering algorithm based on neighborhood granules. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-220092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Learning Vector Quantization (LVQ) is a clustering method with supervised information, simple structures, and powerful functions. LVQ assumes that the data samples are labeled, and the learning process uses labels to assist clustering. However, the LVQ is sensitive to initial values, resulting in a poor clustering effect. To overcome these shortcomings, a granular LVQ clustering algorithm is proposed by adopting the neighborhood granulation technology and the LVQ. Firstly, the neighborhood granulation is carried out on some features of a sample of the data set, then a neighborhood granular vector is formed. Furthermore, the size and operations of neighborhood granular vectors are defined, and the relative and absolute granular distances between granular vectors are proposed. Finally, these granular distances are proved to be metrics, and a granular LVQ clustering algorithm is designed. Some experiments are tested on several UCI data sets, and the results show that the granular LVQ clustering is better than the traditional LVQ clustering under suitable neighborhood parameters and distance measurement.
Collapse
Affiliation(s)
- Hailiang Jiang
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China
| | - Yumin Chen
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China
| | - Liru Kong
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China
| | - Guoqiang Cai
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China
| | - Hongbo Jiang
- College of Economics and Management, Xiamen University of Technology, Xiamen, China
| |
Collapse
|
4
|
de Andrade BM, Margalho LP, Batista DB, Lucena IO, Kamimura BA, Balthazar CF, Brexó RP, Pia AK, Costa RA, Cruz AG, Granato D, Sant’Ana AS, Luna AS, de Gois JS. Chemometric classification of Brazilian artisanal cheeses from different regions according to major and trace elements by ICP-OES. J Food Compost Anal 2022. [DOI: 10.1016/j.jfca.2022.104519] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
5
|
Bilen T, Canberk B, Sharma V, Fahim M, Duong TQ. AI-Driven Aeronautical Ad Hoc Networks for 6G Wireless: Challenges, Opportunities, and the Road Ahead. SENSORS (BASEL, SWITZERLAND) 2022; 22:3731. [PMID: 35632140 PMCID: PMC9147260 DOI: 10.3390/s22103731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/15/2022] [Revised: 05/05/2022] [Accepted: 05/05/2022] [Indexed: 06/15/2023]
Abstract
Aeronautical ad hoc network (AANET) has been considered a promising candidate to complete the vision of "Internet in the sky" by supporting high-speed broadband connections on airplanes for 6G networks. However, the specific characteristics of AANET restrict the applicability of conventional topology and routing management algorithms. Here, these conventional methodologies reduce the packet delivery success of AANET with higher transfer delay. At that point, the artificial intelligence (AI)-driven solutions have been adapted to AANET to provide intelligent frameworks and architectures to cope with the high complexity. The AI-driven AANET can provide intelligent topology formation, sustainability, and routing management decisions in an automated fashion by considering its specific characteristics during the learning operations. More clearly, AI-driven AANETs support intelligent management architectures, overcoming conventional methodologies' drawbacks. Although AI-based management approaches are widely used in terrestrial networks, there is a lack of a comprehensive study that supports AI-driven solutions for AANETs. To this end, this article explores the possible utilization of primary AI methodologies on the road to AI-driven AANET. Specifically, the article addresses unsupervised, supervised, and reinforcement learning as primary AI methodologies to enable intelligent AANET topology formation, sustainability, and routing management. Here, we identify the challenges and opportunities of these primary AI methodologies during the execution of AANET management. Furthermore, we discuss the critical issue of security in AANET before providing open issues.
Collapse
Affiliation(s)
- Tuğçe Bilen
- Department of Computer Engineering, Faculty of Computer and Informatics, Istanbul Technical University, Istanbul 34469, Turkey;
| | - Berk Canberk
- Artificial Intelligence and Data Engineering Department, Faculty of Computer and Informatics, Istanbul Technical University, Istanbul 34469, Turkey;
| | - Vishal Sharma
- School of Electronics, Electrical Engineering and Computer Science, Queen’s University Belfast, Belfast BT7 1NN, UK; (V.S.); (M.F.)
| | - Muhammad Fahim
- School of Electronics, Electrical Engineering and Computer Science, Queen’s University Belfast, Belfast BT7 1NN, UK; (V.S.); (M.F.)
| | - Trung Q. Duong
- School of Electronics, Electrical Engineering and Computer Science, Queen’s University Belfast, Belfast BT7 1NN, UK; (V.S.); (M.F.)
| |
Collapse
|
6
|
Abstract
AbstractWe present a modelling framework for the investigation of supervised learning in non-stationary environments. Specifically, we model two example types of learning systems: prototype-based learning vector quantization (LVQ) for classification and shallow, layered neural networks for regression tasks. We investigate so-called student–teacher scenarios in which the systems are trained from a stream of high-dimensional, labeled data. Properties of the target task are considered to be non-stationary due to drift processes while the training is performed. Different types of concept drift are studied, which affect the density of example inputs only, the target rule itself, or both. By applying methods from statistical physics, we develop a modelling framework for the mathematical analysis of the training dynamics in non-stationary environments. Our results show that standard LVQ algorithms are already suitable for the training in non-stationary environments to a certain extent. However, the application of weight decay as an explicit mechanism of forgetting does not improve the performance under the considered drift processes. Furthermore, we investigate gradient-based training of layered neural networks with sigmoidal activation functions and compare with the use of rectified linear units. Our findings show that the sensitivity to concept drift and the effectiveness of weight decay differs significantly between the two types of activation function.
Collapse
|
7
|
Artelt A, Hammer B. Efficient computation of counterfactual explanations and counterfactual metrics of prototype-based classifiers. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2021.04.129] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
8
|
Hassan BA, Rashid TA, Hamarashid HK. A novel cluster detection of COVID-19 patients and medical disease conditions using improved evolutionary clustering algorithm star. Comput Biol Med 2021; 138:104866. [PMID: 34598065 PMCID: PMC8445768 DOI: 10.1016/j.compbiomed.2021.104866] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Revised: 09/08/2021] [Accepted: 09/08/2021] [Indexed: 12/16/2022]
Abstract
With the increasing number of samples, the manual clustering of COVID-19 and medical disease data samples becomes time-consuming and requires highly skilled labour. Recently, several algorithms have been used for clustering medical datasets deterministically; however, these definitions have not been effective in grouping and analysing medical diseases. The use of evolutionary clustering algorithms may help to effectively cluster these diseases. On this presumption, we improved the current evolutionary clustering algorithm star (ECA*), called iECA*, in three manners: (i) utilising the elbow method to find the correct number of clusters; (ii) cleaning and processing data as part of iECA* to apply it to multivariate and domain-theory datasets; (iii) using iECA* for real-world applications in clustering COVID-19 and medical disease datasets. Experiments were conducted to examine the performance of iECA* against state-of-the-art algorithms using performance and validation measures (validation measures, statistical benchmarking, and performance ranking framework). The results demonstrate three primary findings. First, iECA* was more effective than other algorithms in grouping the chosen medical disease datasets according to the cluster validation criteria. Second, iECA* exhibited the lower execution time and memory consumption for clustering all the datasets, compared to the current clustering methods analysed. Third, an operational framework was proposed to rate the effectiveness of iECA* against other algorithms in the datasets analysed, and the results indicated that iECA* exhibited the best performance in clustering all medical datasets. Further research is required on real-world multi-dimensional data containing complex knowledge fields for experimental verification of iECA* compared to evolutionary algorithms.
Collapse
Affiliation(s)
- Bryar A. Hassan
- Department of Computer Networks, Technical College of Informatics, Sulaimani Polytechnic University, Sulaimani, 46001, Iraq,Kurdistan Institution for Strategic Studies and Scientific Research, Sulaimani 46001, Iraq,Corresponding author. Department of Computer Networks, Technical College of Informatics, Sulaimani Polytechnic University, Sulaimani 46001, Iraq
| | - Tarik A. Rashid
- Computer Science and Engineering Department, University of Kurdistan Hewler, Iraq
| | - Hozan K. Hamarashid
- Information Technology Department, Computer Science Institute, Sulaimani Polytechnic University, Sulaimani 46001, Iraq
| |
Collapse
|
9
|
Hernández-Luquin F, Escalante HJ. Multi-branch deep radial basis function networks for facial emotion recognition. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-06420-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
10
|
Tang F, Feng H, Tino P, Si B, Ji D. Probabilistic learning vector quantization on manifold of symmetric positive definite matrices. Neural Netw 2021; 142:105-118. [PMID: 33984734 DOI: 10.1016/j.neunet.2021.04.024] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2020] [Revised: 04/07/2021] [Accepted: 04/20/2021] [Indexed: 10/21/2022]
Abstract
In this paper, we develop a new classification method for manifold-valued data in the framework of probabilistic learning vector quantization. In many classification scenarios, the data can be naturally represented by symmetric positive definite matrices, which are inherently points that live on a curved Riemannian manifold. Due to the non-Euclidean geometry of Riemannian manifolds, traditional Euclidean machine learning algorithms yield poor results on such data. In this paper, we generalize the probabilistic learning vector quantization algorithm for data points living on the manifold of symmetric positive definite matrices equipped with Riemannian natural metric (affine-invariant metric). By exploiting the induced Riemannian distance, we derive the probabilistic learning Riemannian space quantization algorithm, obtaining the learning rule through Riemannian gradient descent. Empirical investigations on synthetic data, image data , and motor imagery electroencephalogram (EEG) data demonstrate the superior performance of the proposed method.
Collapse
Affiliation(s)
- Fengzhen Tang
- State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, 110016, China; Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang, 110169, China.
| | - Haifeng Feng
- State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, 110016, China; Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang, 110169, China; University of Chinese Academy of Sciences, Beijing, 100049, China.
| | - Peter Tino
- School of computer Science, University of Birmingham, Birmingham, B15 2TT, UK.
| | - Bailu Si
- School of Systems Science, Beijing Normal University, Beijing, 100875, China.
| | - Daxiong Ji
- Institute of Marine Electronics and Intelligent Systems, Ocean College, Zhejiang University, The Key Laboratory of Ocean Observation-Imaging Testbed of Zhejiang Province, The Engineering Research Center of Oceanic Sensing Technology and Equipment, Ministry of Education, Zhoushan, 316021, China.
| |
Collapse
|
11
|
Bamorovat M, Sharifi I, Rashedi E, Shafiian A, Sharifi F, Khosravi A, Tahmouresi A. A novel diagnostic and prognostic approach for unresponsive patients with anthroponotic cutaneous leishmaniasis using artificial neural networks. PLoS One 2021; 16:e0250904. [PMID: 33951081 PMCID: PMC8099060 DOI: 10.1371/journal.pone.0250904] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 04/15/2021] [Indexed: 11/19/2022] Open
Abstract
Cutaneous leishmaniasis (CL) imposes a major health burden throughout the tropical and subtropical regions of the globe. Unresponsive cases are common phenomena occurred upon exposure to the standard drugs. Therefore, rapid detection, prognosis and classification of the disease are crucial for selecting the proper treatment modality. Using machine learning (ML) techniques, this study aimed to detect unresponsive cases of ACL, caused by Leishmania tropica, which will consequently be used for a more effective treatment modality. This study was conducted as a case-control setting. Patients were selected in a major ACL focus from both unresponsive and responsive cases. Nine unique and relevant features of patients with ACL were selected. To categorize the patients, different classifier models such as k-nearest neighbors (KNN), support vector machines (SVM), multilayer perceptron (MLP), learning vector quantization (LVQ) and multipass LVQ were applied and compared for this supervised learning task. Comparison of the receiver operating characteristic graphs (ROC) and confusion plots for the above models represented that MLP was a fairly accurate prediction model to solve this problem. The overall accuracy in terms of sensitivity, specificity and area under ROC curve (AUC) of MLP classifier were 87.8%, 90.3%, 86% and 0.88%, respectively. Moreover, the duration of the skin lesion was the most influential feature in MLP classifier, while gender was the least. The present investigation demonstrated that MLP model could be utilized for rapid detection, accurate prognosis and effective treatment of unresponsive patients with ACL. The results showed that the major feature affecting the responsiveness to treatments is the duration of the lesion. This novel approach is unique and can be beneficial in developing diagnostic, prophylactic and therapeutic measures against the disease. This attempt could be a preliminary step towards the expansion of ML application in future directions.
Collapse
Affiliation(s)
- Mehdi Bamorovat
- Leishmaniasis Research Center, Kerman University of Medical Sciences, Kerman, Iran
| | - Iraj Sharifi
- Leishmaniasis Research Center, Kerman University of Medical Sciences, Kerman, Iran
| | - Esmat Rashedi
- Department of Electrical and Computer Engineering, Graduate University of Advanced Technology, Kerman, Iran
| | - Alireza Shafiian
- Department of Pathobiology, Faculty of Veterinary Medicine, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Fatemeh Sharifi
- Pharmaceutics Research Center, Institute of Neuropharmacology, Kerman University of Medical Sciences, Kerman, Iran
| | - Ahmad Khosravi
- Leishmaniasis Research Center, Kerman University of Medical Sciences, Kerman, Iran
| | | |
Collapse
|
12
|
Manome N, Shinohara S, Takahashi T, Chen Y, Chung UI. Self-incremental learning vector quantization with human cognitive biases. Sci Rep 2021; 11:3910. [PMID: 33594132 PMCID: PMC7887244 DOI: 10.1038/s41598-021-83182-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Accepted: 01/27/2021] [Indexed: 11/30/2022] Open
Abstract
Human beings have adaptively rational cognitive biases for efficiently acquiring concepts from small-sized datasets. With such inductive biases, humans can generalize concepts by learning a small number of samples. By incorporating human cognitive biases into learning vector quantization (LVQ), a prototype-based online machine learning method, we developed self-incremental LVQ (SILVQ) methods that can be easily interpreted. We first describe a method to automatically adjust the learning rate that incorporates human cognitive biases. Second, SILVQ, which self-increases the prototypes based on the method for automatically adjusting the learning rate, is described. The performance levels of the proposed methods are evaluated in experiments employing four real and two artificial datasets. Compared with the original learning vector quantization algorithms, our methods not only effectively remove the need for parameter tuning, but also achieve higher accuracy from learning small numbers of instances. In the cases of larger numbers of instances, SILVQ can still achieve an accuracy that is equal to or better than those of existing representative LVQ algorithms. Furthermore, SILVQ can learn linearly inseparable conceptual structures with the required and sufficient number of prototypes without overfitting.
Collapse
Affiliation(s)
- Nobuhito Manome
- Graduate School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan.
- Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan.
| | - Shuji Shinohara
- Graduate School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan
| | - Tatsuji Takahashi
- Graduate School of Science and Engineering, Tokyo Denki University, Saitama, Japan
| | - Yu Chen
- Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
| | - Ung-Il Chung
- Graduate School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan
- Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
13
|
Whitelam S. Improving the Accuracy of Nearest-Neighbor Classification Using Principled Construction and Stochastic Sampling of Training-Set Centroids. ENTROPY (BASEL, SWITZERLAND) 2021; 23:149. [PMID: 33530507 PMCID: PMC7911166 DOI: 10.3390/e23020149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/03/2021] [Accepted: 01/21/2021] [Indexed: 11/16/2022]
Abstract
A conceptually simple way to classify images is to directly compare test-set data and training-set data. The accuracy of this approach is limited by the method of comparison used, and by the extent to which the training-set data cover configuration space. Here we show that this coverage can be substantially increased using coarse-graining (replacing groups of images by their centroids) and stochastic sampling (using distinct sets of centroids in combination). We use the MNIST and Fashion-MNIST data sets to show that a principled coarse-graining algorithm can convert training images into fewer image centroids without loss of accuracy of classification of test-set images by nearest-neighbor classification. Distinct batches of centroids can be used in combination as a means of stochastically sampling configuration space, and can classify test-set data more accurately than can the unaltered training set. On the MNIST and Fashion-MNIST data sets this approach converts nearest-neighbor classification from a mid-ranking- to an upper-ranking member of the set of classical machine-learning techniques.
Collapse
Affiliation(s)
- Stephen Whitelam
- Molecular Foundry, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| |
Collapse
|
14
|
Tang F, Fan M, Tino P. Generalized Learning Riemannian Space Quantization: A Case Study on Riemannian Manifold of SPD Matrices. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:281-292. [PMID: 32203035 DOI: 10.1109/tnnls.2020.2978514] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Learning vector quantization (LVQ) is a simple and efficient classification method, enjoying great popularity. However, in many classification scenarios, such as electroencephalogram (EEG) classification, the input features are represented by symmetric positive-definite (SPD) matrices that live in a curved manifold rather than vectors that live in the flat Euclidean space. In this article, we propose a new classification method for data points that live in the curved Riemannian manifolds in the framework of LVQ. The proposed method alters generalized LVQ (GLVQ) with the Euclidean distance to the one operating under the appropriate Riemannian metric. We instantiate the proposed method for the Riemannian manifold of SPD matrices equipped with the Riemannian natural metric. Empirical investigations on synthetic data and real-world motor imagery EEG data demonstrate that the performance of the proposed generalized learning Riemannian space quantization can significantly outperform the Euclidean GLVQ, generalized relevance LVQ (GRLVQ), and generalized matrix LVQ (GMLVQ). The proposed method also shows competitive performance to the state-of-the-art methods on the EEG classification of motor imagery tasks.
Collapse
|
15
|
Raab C, Heusinger M, Schleif FM. Reactive Soft Prototype Computing for Concept Drift Streams. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2019.11.111] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
16
|
Targeted Metabolic Profiling of Urine Highlights a Potential Biomarker Panel for the Diagnosis of Alzheimer's Disease and Mild Cognitive Impairment: A Pilot Study. Metabolites 2020; 10:metabo10090357. [PMID: 32878308 PMCID: PMC7569858 DOI: 10.3390/metabo10090357] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Revised: 08/27/2020] [Accepted: 08/31/2020] [Indexed: 12/12/2022] Open
Abstract
The lack of sensitive and specific biomarkers for the early detection of mild cognitive impairment (MCI) and Alzheimer’s disease (AD) is a major hurdle to improving patient management. A targeted, quantitative metabolomics approach using both 1H NMR and mass spectrometry was employed to investigate the performance of urine metabolites as potential biomarkers for MCI and AD. Correlation-based feature selection (CFS) and least absolute shrinkage and selection operator (LASSO) methods were used to develop biomarker panels tested using support vector machine (SVM) and logistic regression models for diagnosis of each disease state. Metabolic changes were investigated to identify which biochemical pathways were perturbed as a direct result of MCI and AD in urine. Using SVM, we developed a model with 94% sensitivity, 78% specificity, and 78% AUC to distinguish healthy controls from AD sufferers. Using logistic regression, we developed a model with 85% sensitivity, 86% specificity, and an AUC of 82% for AD diagnosis as compared to cognitively healthy controls. Further, we identified 11 urinary metabolites that were significantly altered to include glucose, guanidinoacetate, urocanate, hippuric acid, cytosine, 2- and 3-hydroxyisovalerate, 2-ketoisovalerate, tryptophan, trimethylamine N oxide, and malonate in AD patients, which are also capable of diagnosing MCI, with a sensitivity value of 76%, specificity of 75%, and accuracy of 81% as compared to healthy controls. This pilot study suggests that urine metabolomics may be useful for developing a test capable of diagnosing and distinguishing MCI and AD from cognitively healthy controls.
Collapse
|
17
|
Shen YY, Zhang YM, Zhang XY, Liu CL. Online semi-supervised learning with learning vector quantization. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2020.03.025] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
18
|
Comparison of Instance Selection and Construction Methods with Various Classifiers. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10113933] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Instance selection and construction methods were originally designed to improve the performance of the k-nearest neighbors classifier by increasing its speed and improving the classification accuracy. These goals were achieved by eliminating redundant and noisy samples, thus reducing the size of the training set. In this paper, the performance of instance selection methods is investigated in terms of classification accuracy and reduction of training set size. The classification accuracy of the following classifiers is evaluated: decision trees, random forest, Naive Bayes, linear model, support vector machine and k-nearest neighbors. The obtained results indicate that for the most of the classifiers compressing the training set affects prediction performance and only a small group of instance selection methods can be recommended as a general purpose preprocessing step. These are learning vector quantization based algorithms, along with the Drop2 and Drop3. Other methods are less efficient or provide low compression ratio.
Collapse
|
19
|
Chowdhary CL, Acharjya D. Segmentation and Feature Extraction in Medical Imaging: A Systematic Review. ACTA ACUST UNITED AC 2020. [DOI: 10.1016/j.procs.2020.03.179] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
20
|
Weed J, Bach F. Sharp asymptotic and finite-sample rates of convergence of empirical measures in Wasserstein distance. BERNOULLI 2019. [DOI: 10.3150/18-bej1065] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
21
|
Moshkbar-Bakhshayesh K, Mohtashami S. Classification of NPPs transients using change of representation technique: A hybrid of unsupervised MSOM and supervised SVM. PROGRESS IN NUCLEAR ENERGY 2019. [DOI: 10.1016/j.pnucene.2019.103100] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
22
|
Siamidoudaran M, Iscioglu E, Siamidodaran M. Traffic injury severity prediction along with identification of contributory factors using learning vector quantization: a case study of the city of London. SN APPLIED SCIENCES 2019. [DOI: 10.1007/s42452-019-1314-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
23
|
Straat M, Abadi F, Göpfert C, Hammer B, Biehl M. Statistical Mechanics of On-Line Learning Under Concept Drift. ENTROPY 2018; 20:e20100775. [PMID: 33265863 PMCID: PMC7512337 DOI: 10.3390/e20100775] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/04/2018] [Revised: 10/03/2018] [Accepted: 10/08/2018] [Indexed: 12/03/2022]
Abstract
We introduce a modeling framework for the investigation of on-line machine learning processes in non-stationary environments. We exemplify the approach in terms of two specific model situations: In the first, we consider the learning of a classification scheme from clustered data by means of prototype-based Learning Vector Quantization (LVQ). In the second, we study the training of layered neural networks with sigmoidal activations for the purpose of regression. In both cases, the target, i.e., the classification or regression scheme, is considered to change continuously while the system is trained from a stream of labeled data. We extend and apply methods borrowed from statistical physics which have been used frequently for the exact description of training dynamics in stationary environments. Extensions of the approach allow for the computation of typical learning curves in the presence of concept drift in a variety of model situations. First results are presented and discussed for stochastic drift processes in classification and regression problems. They indicate that LVQ is capable of tracking a classification scheme under drift to a non-trivial extent. Furthermore, we show that concept drift can cause the persistence of sub-optimal plateau states in gradient based training of layered neural networks for regression.
Collapse
Affiliation(s)
- Michiel Straat
- Bernoulli Institute for Mathematics, Computer Science and Artificial Intelligence, University of Groningen, Nijenborgh 9, 9747 AG Groningen, The Netherlands
| | - Fthi Abadi
- Bernoulli Institute for Mathematics, Computer Science and Artificial Intelligence, University of Groningen, Nijenborgh 9, 9747 AG Groningen, The Netherlands
| | - Christina Göpfert
- Center of Excellence—Cognitive Interaction Technology (CITEC), Bielefeld University, Inspiration 1, 33619 Bielefeld, Germany
| | - Barbara Hammer
- Center of Excellence—Cognitive Interaction Technology (CITEC), Bielefeld University, Inspiration 1, 33619 Bielefeld, Germany
| | - Michael Biehl
- Bernoulli Institute for Mathematics, Computer Science and Artificial Intelligence, University of Groningen, Nijenborgh 9, 9747 AG Groningen, The Netherlands
- Correspondence: ; Tel.: +31-50-363-3997
| |
Collapse
|
24
|
Abstract
Learning vector quantization (LVQ) network and back-propagation (BP) network are constructed easily making use of MATLAB toolbox on the basis of maintaining the recognition rate. Face images are randomly selected from images set as training data of LVQ network and BP network. LVQ algorithm and BP algorithm are used to train network. The automatic recognition of face orientation is realized when the system obtains convergence network. First, all images are processed by edge detection. Then feature vectors representing position of the eye were extracted from edge detected images. Feature vectors of training set are sent to network to adjust the parameters which ensures the convergence speed and performance of the network. Experimental results show that the constructed LVQ network and BP network can judge face orientation according to feature vectors of input images. Generally, the recognition rate of LVQ network is higher than that of BP network. The LVQ network and BP network are both feasible and effective for face orientation recognition to some extent. The advantage of this work is that the recognition system is efficient and easy to promote. This paper focuses on how to use MATLAB easily to design identification network rather than the complexity of identification system. The future research will focus on the stability and robustness of recognition network.
Collapse
Affiliation(s)
- Suping Li
- School of Mechanical and Electronic Engineering, Chaohu University, Chaohu, Anhui 238000, P. R. China
| | - Zhanfeng Wang
- School of Information Engineering, Chaohu University, Chaohu, Anhui 238000 P. R. China
| | - Jing Wang
- School of Mechanical and Electronic Engineering, Chaohu University, Chaohu, Anhui 238000, P. R. China
| |
Collapse
|
25
|
|
26
|
Can Learning Vector Quantization be an Alternative to SVM and Deep Learning? - Recent Trends and Advanced Variants of Learning Vector Quantization for Classification Learning. JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH 2016. [DOI: 10.1515/jaiscr-2017-0005] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Abstract
Learning vector quantization (LVQ) is one of the most powerful approaches for prototype based classification of vector data, intuitively introduced by Kohonen. The prototype adaptation scheme relies on its attraction and repulsion during the learning providing an easy geometric interpretability of the learning as well as of the classification decision scheme. Although deep learning architectures and support vector classifiers frequently achieve comparable or even better results, LVQ models are smart alternatives with low complexity and computational costs making them attractive for many industrial applications like intelligent sensor systems or advanced driver assistance systems.
Nowadays, the mathematical theory developed for LVQ delivers sufficient justification of the algorithm making it an appealing alternative to other approaches like support vector machines and deep learning techniques.
This review article reports current developments and extensions of LVQ starting from the generalized LVQ (GLVQ), which is known as the most powerful cost function based realization of the original LVQ. The cost function minimized in GLVQ is an soft-approximation of the standard classification error allowing gradient descent learning techniques. The GLVQ variants considered in this contribution, cover many aspects like bordersensitive learning, application of non-Euclidean metrics like kernel distances or divergences, relevance learning as well as optimization of advanced statistical classification quality measures beyond the accuracy including sensitivity and specificity or area under the ROC-curve.
According to these topics, the paper highlights the basic motivation for these variants and extensions together with the mathematical prerequisites and treatments for integration into the standard GLVQ scheme and compares them to other machine learning approaches. For detailed description and mathematical theory behind all, the reader is referred to the respective original articles.
Thus, the intention of the paper is to provide a comprehensive overview of the stateof- the-art serving as a starting point to search for an appropriate LVQ variant in case of a given specific classification problem as well as a reference to recently developed variants and improvements of the basic GLVQ scheme.
Collapse
|
27
|
An online and incremental GRLVQ algorithm for prototype generation based on granular computing. Soft comput 2016. [DOI: 10.1007/s00500-016-2042-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
28
|
Biehl M, Hammer B, Villmann T. Prototype-based models in machine learning. WILEY INTERDISCIPLINARY REVIEWS. COGNITIVE SCIENCE 2016; 7:92-111. [PMID: 26800334 DOI: 10.1002/wcs.1378] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Revised: 10/07/2015] [Accepted: 12/09/2015] [Indexed: 11/08/2022]
Abstract
An overview is given of prototype-based models in machine learning. In this framework, observations, i.e., data, are stored in terms of typical representatives. Together with a suitable measure of similarity, the systems can be employed in the context of unsupervised and supervised analysis of potentially high-dimensional, complex datasets. We discuss basic schemes of competitive vector quantization as well as the so-called neural gas approach and Kohonen's topology-preserving self-organizing map. Supervised learning in prototype systems is exemplified in terms of learning vector quantization. Most frequently, the familiar Euclidean distance serves as a dissimilarity measure. We present extensions of the framework to nonstandard measures and give an introduction to the use of adaptive distances in relevance learning.
Collapse
Affiliation(s)
- Michael Biehl
- Johann Bernoulli Institute for Mathematics and Computer Science, Faculty of Mathematics and Natural Sciences, University of Groningen, Groningen, The Netherlands
| | - Barbara Hammer
- CITEC Centre of Excellence, Faculty of Technology, Bielefeld University, Bielefeld, Germany
| | - Thomas Villmann
- Department of Mathematics, University of Applied Sciences Mittweida, Mittweida, Germany
| |
Collapse
|
29
|
|
30
|
Walter O, Haeb-Umbach R, Mokbel B, Paassen B, Hammer B. Autonomous Learning of Representations. KUNSTLICHE INTELLIGENZ 2015. [DOI: 10.1007/s13218-015-0372-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
31
|
Kaden M, Riedel M, Hermann W, Villmann T. Border-sensitive learning in generalized learning vector quantization: an alternative to support vector machines. Soft comput 2014. [DOI: 10.1007/s00500-014-1496-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|