1
|
Ye F, Bors AG. Lifelong Generative Adversarial Autoencoder. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:14684-14698. [PMID: 37410645 DOI: 10.1109/tnnls.2023.3281091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/08/2023]
Abstract
Lifelong learning describes an ability that enables humans to continually acquire and learn new information without forgetting. This capability, common to humans and animals, has lately been identified as an essential function for an artificial intelligence system aiming to learn continuously from a stream of data during a certain period of time. However, modern neural networks suffer from degenerated performance when learning multiple domains sequentially and fail to recognize past learned tasks after being retrained. This corresponds to catastrophic forgetting and is ultimately induced by replacing the parameters associated with previously learned tasks with new values. One approach in lifelong learning is the generative replay mechanism (GRM) that trains a powerful generator as the generative replay network, implemented by a variational autoencoder (VAE) or a generative adversarial network (GAN). In this article, we study the forgetting behavior of GRM-based learning systems by developing a new theoretical framework in which the forgetting process is expressed as an increase in the model's risk during the training. Although many recent attempts have provided high-quality generative replay samples by using GANs, they are limited to mainly downstream tasks due to the lack of inference. Inspired by the theoretical analysis while aiming to address the drawbacks of existing approaches, we propose the lifelong generative adversarial autoencoder (LGAA). LGAA consists of a generative replay network and three inference models, each addressing the inference of a different type of latent variable. The experimental results show that LGAA learns novel visual concepts without forgetting and can be applied to a wide range of downstream tasks.
Collapse
|
2
|
Guo K, Chen T, Ren S, Li N, Hu M, Kang J. Federated Learning Empowered Real-Time Medical Data Processing Method for Smart Healthcare. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:869-879. [PMID: 35737631 DOI: 10.1109/tcbb.2022.3185395] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Computer-aided diagnosis (CAD) has always been an important research topic for applying artificial intelligence in smart healthcare. Sufficient medical data are one of the most critical factors in CAD research. However, medical data are usually obtained in chronological order and cannot be collected all at once, which poses difficulties for the application of deep learning technology in the medical field. The traditional batch learning method consumes considerable time and space resources for real-time medical data, and the incremental learning method often leads to catastrophic forgetting. To solve these problems, we propose a real-time medical data processing method based on federated learning. We divide the process into the model stage and the exemplar stage. In the model stage, we use the federated learning method to fuse the old and new models to mitigate the catastrophic forgetting problem of the new model. In the exemplar stage, we use the most representative exemplars selected from the old data to help the new model review the old knowledge, which further mitigates the catastrophic forgetting problem of the new model. We use this method to conduct experiments on a simulated medical real-time data stream. The experimental results show that our method can learn a disease diagnosis model from a continuous medical real-time data stream. As the amount of data increases, the performance of the disease diagnosis model continues to improve, and the catastrophic forgetting problem has been effectively mitigated. Compared with the traditional batch learning method, our method can significantly save time and space resources.
Collapse
|
3
|
Huo X, Ong KH, Lau KW, Gole L, Young DM, Tan CL, Zhu X, Zhang C, Zhang Y, Li L, Han H, Lu H, Zhang J, Hou J, Zhao H, Gan H, Yin L, Wang X, Chen X, Lv H, Cao H, Yu X, Shi Y, Huang Z, Marini G, Xu J, Liu B, Chen B, Wang Q, Gui K, Shi W, Sun Y, Chen W, Cao D, Sanders SJ, Lee HK, Hue SSS, Yu W, Tan SY. A comprehensive AI model development framework for consistent Gleason grading. COMMUNICATIONS MEDICINE 2024; 4:84. [PMID: 38724730 PMCID: PMC11082180 DOI: 10.1038/s43856-024-00502-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Accepted: 04/17/2024] [Indexed: 05/12/2024] Open
Abstract
BACKGROUND Artificial Intelligence(AI)-based solutions for Gleason grading hold promise for pathologists, while image quality inconsistency, continuous data integration needs, and limited generalizability hinder their adoption and scalability. METHODS We present a comprehensive digital pathology workflow for AI-assisted Gleason grading. It incorporates A!MagQC (image quality control), A!HistoClouds (cloud-based annotation), Pathologist-AI Interaction (PAI) for continuous model improvement, Trained on Akoya-scanned images only, the model utilizes color augmentation and image appearance migration to address scanner variations. We evaluate it on Whole Slide Images (WSI) from another five scanners and conduct validations with pathologists to assess AI efficacy and PAI. RESULTS Our model achieves an average F1 score of 0.80 on annotations and 0.71 Quadratic Weighted Kappa on WSIs for Akoya-scanned images. Applying our generalization solution increases the average F1 score for Gleason pattern detection from 0.73 to 0.88 on images from other scanners. The model accelerates Gleason scoring time by 43% while maintaining accuracy. Additionally, PAI improve annotation efficiency by 2.5 times and led to further improvements in model performance. CONCLUSIONS This pipeline represents a notable advancement in AI-assisted Gleason grading for improved consistency, accuracy, and efficiency. Unlike previous methods limited by scanner specificity, our model achieves outstanding performance across diverse scanners. This improvement paves the way for its seamless integration into clinical workflows.
Collapse
Affiliation(s)
- Xinmi Huo
- Computational Digital Pathology Lab, Bioinformatics Institute, A*STAR, Singapore, Singapore
| | - Kok Haur Ong
- Computational Digital Pathology Lab, Bioinformatics Institute, A*STAR, Singapore, Singapore
| | - Kah Weng Lau
- Computational & Molecular Pathology Lab, Institute of Molecule and Cell Biology, A*STAR, Singapore, Singapore
- Department of Pathology, National University Hospital, National University Health System, Singapore, Singapore
| | - Laurent Gole
- Computational & Molecular Pathology Lab, Institute of Molecule and Cell Biology, A*STAR, Singapore, Singapore
| | - David M Young
- Computational & Molecular Pathology Lab, Institute of Molecule and Cell Biology, A*STAR, Singapore, Singapore
- Department of Psychiatry and Behavioral Sciences, UCSF Weill Institute for Neurosciences, University of California, San Francisco, USA
| | - Char Loo Tan
- Department of Pathology, National University Hospital, National University Health System, Singapore, Singapore
| | - Xiaohui Zhu
- Department of Pathology, Nanfang Hospital and Basic Medical College, Southern Medical University, Guangzhou, Guangdong Province, China
- Guangdong Province Key Laboratory of Molecular Tumor Pathology, Guangzhou, Guangdong Province, China
| | - Chongchong Zhang
- Department of Pathology, The 910 Hospital of PLA, QuanZhou, Fujian Province, China
| | - Yonghui Zhang
- Department of Pathology, The 910 Hospital of PLA, QuanZhou, Fujian Province, China
| | - Longjie Li
- Computational Digital Pathology Lab, Bioinformatics Institute, A*STAR, Singapore, Singapore
| | - Hao Han
- Computational & Molecular Pathology Lab, Institute of Molecule and Cell Biology, A*STAR, Singapore, Singapore
| | - Haoda Lu
- Computational Digital Pathology Lab, Bioinformatics Institute, A*STAR, Singapore, Singapore
- Institute for AI in Medicine, School of Artificial Intelligence, Nanjing University of Information Science and Technology (NUIST), Nanjing, Jiangsu Province, China
| | - Jing Zhang
- Department of Pathology, Shanghai Changzheng Hospital, Shanghai, China
| | - Jun Hou
- Department of Pathology, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Huanfen Zhao
- Department of Pathology, Hebei General Hospital, Shijiazhuang, Hebei Province, China
| | - Hualei Gan
- Department of Pathology, Fudan University Shanghai Cancer Center, Shanghai, China
| | - Lijuan Yin
- Department of Pathology, Changhai Hospital of Shanghai, Shanghai, China
| | - Xingxing Wang
- Department of Pathology, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Xiaoyue Chen
- Department of Pathology, Hebei General Hospital, Shijiazhuang, Hebei Province, China
| | - Hong Lv
- Department of Pathology, Fudan University Shanghai Cancer Center, Shanghai, China
| | - Haotian Cao
- Department of Pathology, Shanghai Changzheng Hospital, Shanghai, China
| | - Xiaozhen Yu
- Department of Pathology, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Yabin Shi
- Department of Pathology, Hebei General Hospital, Shijiazhuang, Hebei Province, China
| | - Ziling Huang
- Department of Pathology, Fudan University Shanghai Cancer Center, Shanghai, China
| | - Gabriel Marini
- Computational Digital Pathology Lab, Bioinformatics Institute, A*STAR, Singapore, Singapore
| | - Jun Xu
- Institute for AI in Medicine, School of Artificial Intelligence, Nanjing University of Information Science and Technology (NUIST), Nanjing, Jiangsu Province, China
| | - Bingxian Liu
- Ningbo KonFoong Bioinformation Tech Co. Ltd, Ningbo, Zhejiang Province, China
| | - Bingxian Chen
- Ningbo KonFoong Bioinformation Tech Co. Ltd, Ningbo, Zhejiang Province, China
| | - Qiang Wang
- Ningbo KonFoong Bioinformation Tech Co. Ltd, Ningbo, Zhejiang Province, China
| | - Kun Gui
- Ningbo KonFoong Bioinformation Tech Co. Ltd, Ningbo, Zhejiang Province, China
| | - Wenzhao Shi
- Vishuo Biomedical Pte Ltd, Singapore, Singapore
| | - Yingying Sun
- Cancer Center, Department of Pathology, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, Zhejiang Province, China
| | - Wanyuan Chen
- Cancer Center, Department of Pathology, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, Zhejiang Province, China
- Key Laboratory of Endocrine Gland Diseases of Zhejiang Province, Hangzhou, Zhejiang Province, China
- Clinical Research Center for Cancer of Zhejiang Province, Hangzhou, Zhejiang Province, China
| | - Dalong Cao
- Department of Urology, Fudan University Shanghai Cancer Center, Fudan University, Shanghai, China
- Shanghai Genitourinary Cancer Institute, Shanghai, China
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, China
| | - Stephan J Sanders
- Department of Psychiatry and Behavioral Sciences, UCSF Weill Institute for Neurosciences, University of California, San Francisco, USA
- Institute of Developmental and Regenerative Medicine, University of Oxford, Oxford, UK
| | - Hwee Kuan Lee
- Computational Digital Pathology Lab, Bioinformatics Institute, A*STAR, Singapore, Singapore
| | - Susan Swee-Shan Hue
- Computational & Molecular Pathology Lab, Institute of Molecule and Cell Biology, A*STAR, Singapore, Singapore.
- Department of Pathology, National University Hospital, National University Health System, Singapore, Singapore.
| | - Weimiao Yu
- Computational Digital Pathology Lab, Bioinformatics Institute, A*STAR, Singapore, Singapore.
- Computational & Molecular Pathology Lab, Institute of Molecule and Cell Biology, A*STAR, Singapore, Singapore.
- Institute for AI in Medicine, School of Artificial Intelligence, Nanjing University of Information Science and Technology (NUIST), Nanjing, Jiangsu Province, China.
| | - Soo Yong Tan
- Department of Pathology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
| |
Collapse
|
4
|
Wang Z, Chen C, Dong D. A Dirichlet Process Mixture of Robust Task Models for Scalable Lifelong Reinforcement Learning. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:7509-7520. [PMID: 35580095 DOI: 10.1109/tcyb.2022.3170485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
While reinforcement learning (RL) algorithms are achieving state-of-the-art performance in various challenging tasks, they can easily encounter catastrophic forgetting or interference when faced with lifelong streaming information. In this article, we propose a scalable lifelong RL method that dynamically expands the network capacity to accommodate new knowledge while preventing past memories from being perturbed. We use a Dirichlet process mixture to model the nonstationary task distribution, which captures task relatedness by estimating the likelihood of task-to-cluster assignments and clusters the task models in a latent space. We formulate the prior distribution of the mixture as a Chinese restaurant process (CRP) that instantiates new mixture components as needed. The update and expansion of the mixture are governed by the Bayesian nonparametric framework with an expectation maximization (EM) procedure, which dynamically adapts the model complexity without explicit task boundaries or heuristics. Moreover, we use the domain randomization technique to train robust prior parameters for the initialization of each task model in the mixture; thus, the resulting model can better generalize and adapt to unseen tasks. With extensive experiments conducted on robot navigation and locomotion domains, we show that our method successfully facilitates scalable lifelong RL and outperforms relevant existing methods.
Collapse
|
5
|
Liu Y, Hong X, Tao X, Dong S, Shi J, Gong Y. Model Behavior Preserving for Class-Incremental Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:7529-7540. [PMID: 35120008 DOI: 10.1109/tnnls.2022.3144183] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Deep models have shown to be vulnerable to catastrophic forgetting, a phenomenon that the recognition performance on old data degrades when a pre-trained model is fine-tuned on new data. Knowledge distillation (KD) is a popular incremental approach to alleviate catastrophic forgetting. However, it usually fixes the absolute values of neural responses for isolated historical instances, without considering the intrinsic structure of the responses by a convolutional neural network (CNN) model. To overcome this limitation, we recognize the importance of the global property of the whole instance set and treat it as a behavior characteristic of a CNN model relevant to model incremental learning. On this basis: 1) we design an instance neighborhood-preserving (INP) loss to maintain the order of pair-wise instance similarities of the old model in the feature space; 2) we devise a label priority-preserving (LPP) loss to preserve the label ranking lists within instance-wise label probability vectors in the output space; and 3) we introduce an efficient derivable ranking algorithm for calculating the two loss functions. Extensive experiments conducted on CIFAR100 and ImageNet show that our approach achieves the state-of-the-art performance.
Collapse
|
6
|
Zhang W, Gu X. Few Shot Class Incremental Learning via Efficient Prototype Replay and Calibration. ENTROPY (BASEL, SWITZERLAND) 2023; 25:e25050776. [PMID: 37238532 DOI: 10.3390/e25050776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 05/06/2023] [Accepted: 05/08/2023] [Indexed: 05/28/2023]
Abstract
Few shot class incremental learning (FSCIL) is an extremely challenging but valuable problem in real-world applications. When faced with novel few shot tasks in each incremental stage, it should take into account both catastrophic forgetting of old knowledge and overfitting of new categories with limited training data. In this paper, we propose an efficient prototype replay and calibration (EPRC) method with three stages to improve classification performance. We first perform effective pre-training with rotation and mix-up augmentations in order to obtain a strong backbone. Then a series of pseudo few shot tasks are sampled to perform meta-training, which enhances the generalization ability of both the feature extractor and projection layer and then helps mitigate the over-fitting problem of few shot learning. Furthermore, an even nonlinear transformation function is incorporated into the similarity computation to implicitly calibrate the generated prototypes of different categories and alleviate correlations among them. Finally, we replay the stored prototypes to relieve catastrophic forgetting and rectify prototypes to be more discriminative in the incremental-training stage via an explicit regularization within the loss function. The experimental results on CIFAR-100 and miniImageNet demonstrate that our EPRC significantly boosts the classification performance compared with existing mainstream FSCIL methods.
Collapse
Affiliation(s)
- Wei Zhang
- Department of Electronic Engineering, Fudan University, Shanghai 200438, China
| | - Xiaodong Gu
- Department of Electronic Engineering, Fudan University, Shanghai 200438, China
| |
Collapse
|
7
|
Ye F, Bors AG. Dynamic Self-Supervised Teacher-Student Network Learning. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2023; 45:5731-5748. [PMID: 36355745 DOI: 10.1109/tpami.2022.3220928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Lifelong learning (LLL) represents the ability of an artificial intelligence system to learn successively a sequence of different databases. In this paper we introduce the Dynamic Self-Supervised Teacher-Student Network (D-TS), representing a more general LLL framework, where the Teacher is implemented as a dynamically expanding mixture model which automatically increases its capacity to deal with a growing number of tasks. We propose the Knowledge Discrepancy Score (KDS) criterion for measuring the relevance of the incoming information characterizing a new task when compared to the existing knowledge accumulated by the Teacher module from its previous training. The KDS ensures a light Teacher architecture while also enabling to reuse the learned knowledge whenever appropriate, accelerating the learning of given tasks. The Student module is implemented as a lightweight probabilistic generative model. We introduce a novel self-supervised learning procedure for the Student that allows to capture cross-domain latent representations from the entire knowledge accumulated by the Teacher as well as from novel data. We perform several experiments which show that D-TS can achieve the state of the art results in LLL while requiring fewer parameters than other methods.
Collapse
|
8
|
Küllahcı K, Altunkaynak A. Enhanced rainfall prediction performance via hybrid empirical-singular-wavelet-fuzzy approaches. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:58090-58108. [PMID: 36976466 DOI: 10.1007/s11356-023-26598-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 03/17/2023] [Indexed: 05/10/2023]
Abstract
Rainfall is a vital process in the hydrological cycle of the globe. Accessing reliable and accurate rainfall data is crucial for water resources operation, flood control, drought warning, irrigation, and drainage. In the present study, the main objective is to develop a predictive model to enhance daily rainfall prediction accuracy with an extended time horizon. In the literature, various methods for the prediction of daily rainfall data for short lead times are presented. However, due to the complex and random nature of rainfall, in general, they yield inaccurate prediction results. Generically, rainfall predictive models require many physical meteorological variables and consist of challenging mathematical processes that require high computational power. Furthermore, due to the nonlinear and chaotic nature of rainfall, observed raw data typically has to be decomposed into its trend cycle, seasonality, and stochastic components before being fed into the predictive model. The present study proposes a novel singular spectrum analysis (SSA)-based approach for decomposing observed raw data into its hierarchically energetic pertinent features. To this end, in addition to the stand-alone fuzzy logic model, preprocessing methods SSA, empirical mode decomposition (EMD), and commonly used discrete wavelet transform (DWT) are incorporated into the fuzzy models which are named as hybrid SSA-fuzzy, EMD-fuzzy, W-fuzzy models, respectively. In this study, fuzzy, hybrid SSA-fuzzy, EMD-fuzzy, and W-fuzzy models are developed to enhance the daily rainfall prediction accuracy and improve the prediction time span up to 3 days via three (3) stations' data in Turkey. The proposed SSA-fuzzy model is compared with fuzzy, hybrid EMD-fuzzy, and widely used hybrid W-fuzzy models in predicting daily rainfall in three distinctive locations up to a 3-day time horizon. Improved accuracy in predicting daily rainfall is provided by the SSA-fuzzy, W-fuzzy, and EMD-fuzzy models compared to the stand-alone fuzzy model based on mean square error (MSE) and the Nash-Sutcliffe coefficient of efficiency (CE) model assessment metrics. Specifically, the advocated SSA-fuzzy model is found to be superior in accuracy to hybrid EMD-fuzzy and W-fuzzy models in predicting daily rainfall for all time spans. The results reveal that, with its easy-to-use features, the advocated SSA-fuzzy modeling tool in this study is a promising principled method for its possible future implementations not only in hydrological studies but in water resources and hydraulics engineering and all scientific disciplines where future state space prediction of a vague nature and stochastic dynamical system is important.
Collapse
Affiliation(s)
- Kübra Küllahcı
- Department of Civil Engineering Hydraulics and Water Resources Division, Istanbul Technical University, Maslak, 34469, Istanbul, Turkey.
| | - Abdüsselam Altunkaynak
- Department of Civil Engineering Hydraulics and Water Resources Division, Istanbul Technical University, Maslak, 34469, Istanbul, Turkey
| |
Collapse
|
9
|
García-Salinas JS, Torres-García AA, Reyes-Garćia CA, Villaseñor-Pineda L. Intra-subject class-incremental deep learning approach for EEG-based imagined speech recognition. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
|
10
|
Xu X, Wang Z, Fu Z, Guo W, Chi Z, Li D. Flexible few-shot class-incremental learning with prototype container. Neural Comput Appl 2023. [DOI: 10.1007/s00521-023-08272-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
|
11
|
Ye F, Bors AG. Lifelong Mixture of Variational Autoencoders. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:461-474. [PMID: 34370670 DOI: 10.1109/tnnls.2021.3096457] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
In this article, we propose an end-to-end lifelong learning mixture of experts. Each expert is implemented by a variational autoencoder (VAE). The experts in the mixture system are jointly trained by maximizing a mixture of individual component evidence lower bounds (MELBO) on the log-likelihood of the given training samples. The mixing coefficients in the mixture model control the contributions of each expert in the global representation. These are sampled from a Dirichlet distribution whose parameters are determined through nonparametric estimation during lifelong learning. The model can learn new tasks fast when these are similar to those previously learned. The proposed lifelong mixture of VAE (L-MVAE) expands its architecture with new components when learning a completely new task. After the training, our model can automatically determine the relevant expert to be used when fed with new data samples. This mechanism benefits both the memory efficiency and the required computational cost as only one expert is used during the inference. The L-MVAE inference model is able to perform interpolations in the joint latent space across the data domains associated with different tasks and is shown to be efficient for disentangled learning representation.
Collapse
|
12
|
Gu X, Han J, Shen Q, Angelov PP. Autonomous learning for fuzzy systems: a review. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10355-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
AbstractAs one of the three pillars in computational intelligence, fuzzy systems are a powerful mathematical tool widely used for modelling nonlinear problems with uncertainties. Fuzzy systems take the form of linguistic IF-THEN fuzzy rules that are easy to understand for human. In this sense, fuzzy inference mechanisms have been developed to mimic human reasoning and decision-making. From a data analytic perspective, fuzzy systems provide an effective solution to build precise predictive models from imprecise data with great transparency and interpretability, thus facilitating a wide range of real-world applications. This paper presents a systematic review of modern methods for autonomously learning fuzzy systems from data, with an emphasis on the structure and parameter learning schemes of mainstream evolving, evolutionary, reinforcement learning-based fuzzy systems. The main purpose of this paper is to introduce the underlying concepts, underpinning methodologies, as well as outstanding performances of the state-of-the-art methods. It serves as a one-stop guide for readers learning the representative methodologies and foundations of fuzzy systems or who desire to apply fuzzy-based autonomous learning in other scientific disciplines and applied fields.
Collapse
|
13
|
Döllinger M, Schraut T, Henrich LA, Chhetri D, Echternach M, Johnson AM, Kunduk M, Maryn Y, Patel RR, Samlan R, Semmler M, Schützenberger A. Re-Training of Convolutional Neural Networks for Glottis Segmentation in Endoscopic High-Speed Videos. APPLIED SCIENCES (BASEL, SWITZERLAND) 2022; 12:9791. [PMID: 37583544 PMCID: PMC10427138 DOI: 10.3390/app12199791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 08/17/2023]
Abstract
Endoscopic high-speed video (HSV) systems for visualization and assessment of vocal fold dynamics in the larynx are diverse and technically advancing. To consider resulting "concepts shifts" for neural network (NN)-based image processing, re-training of already trained and used NNs is necessary to allow for sufficiently accurate image processing for new recording modalities. We propose and discuss several re-training approaches for convolutional neural networks (CNN) being used for HSV image segmentation. Our baseline CNN was trained on the BAGLS data set (58,750 images). The new BAGLS-RT data set consists of additional 21,050 images from previously unused HSV systems, light sources, and different spatial resolutions. Results showed that increasing data diversity by means of preprocessing already improves the segmentation accuracy (mIoU + 6.35%). Subsequent re-training further increases segmentation performance (mIoU + 2.81%). For re-training, finetuning with dynamic knowledge distillation showed the most promising results. Data variety for training and additional re-training is a helpful tool to boost HSV image segmentation quality. However, when performing re-training, the phenomenon of catastrophic forgetting should be kept in mind, i.e., adaption to new data while forgetting already learned knowledge.
Collapse
Affiliation(s)
- Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhino-laryngology Head & Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Tobias Schraut
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhino-laryngology Head & Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Lea A. Henrich
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhino-laryngology Head & Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Dinesh Chhetri
- Department of Head and Neck Surgery, David Geffen School of Medicine at the University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), 80331 Munich, Germany
| | - Aaron M. Johnson
- NYU Voice Center, Department of Otolaryngology–Head and Neck Surgery, New York University, Grossman School of Medicine, New York, NY 10001, USA
| | - Melda Kunduk
- Department of Communication Sciences and Disorders, Louisiana State University, Baton Rouge, LA 70801, USA
| | - Youri Maryn
- Department of Speech, Language and Hearing Sciences, University of Ghent, 9000 Ghent, Belgium
| | - Rita R. Patel
- Department of Speech, Language and Hearing Sciences, Indiana University, Bloomington, IA 47401, USA
| | - Robin Samlan
- Department of Speech, Language, & Hearing Sciences, University of Arizona, Tucson, AZ 85641, USA
| | - Marion Semmler
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhino-laryngology Head & Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Anne Schützenberger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhino-laryngology Head & Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, 91054 Erlangen, Germany
| |
Collapse
|
14
|
Ye F, Bors AG. Lifelong Teacher-Student Network Learning. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:6280-6296. [PMID: 34170822 DOI: 10.1109/tpami.2021.3092677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
A unique cognitive capability of humans consists in their ability to acquire new knowledge and skills from a sequence of experiences. Meanwhile, artificial intelligence systems are good at learning only the last given task without being able to remember the databases learnt in the past. We propose a novel lifelong learning methodology by employing a Teacher-Student network framework. While the Student module is trained with a new given database, the Teacher module would remind the Student about the information learnt in the past. The Teacher, implemented by a Generative Adversarial Network (GAN), is trained to preserve and replay past knowledge corresponding to the probabilistic representations of previously learnt databases. Meanwhile, the Student module is implemented by a Variational Autoencoder (VAE) which infers its latent variable representation from both the output of the Teacher module as well as from the newly available database. Moreover, the Student module is trained to capture both continuous and discrete underlying data representations across different domains. The proposed lifelong learning framework is applied in supervised, semi-supervised and unsupervised training.
Collapse
|
15
|
Adaimi R, Thomaz E. Lifelong Adaptive Machine Learning for Sensor-Based Human Activity Recognition Using Prototypical Networks. SENSORS (BASEL, SWITZERLAND) 2022; 22:6881. [PMID: 36146230 PMCID: PMC9504213 DOI: 10.3390/s22186881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Revised: 08/30/2022] [Accepted: 09/08/2022] [Indexed: 06/16/2023]
Abstract
Continual learning (CL), also known as lifelong learning, is an emerging research topic that has been attracting increasing interest in the field of machine learning. With human activity recognition (HAR) playing a key role in enabling numerous real-world applications, an essential step towards the long-term deployment of such systems is to extend the activity model to dynamically adapt to changes in people's everyday behavior. Current research in CL applied to the HAR domain is still under-explored with researchers exploring existing methods developed for computer vision in HAR. Moreover, analysis has so far focused on task-incremental or class-incremental learning paradigms where task boundaries are known. This impedes the applicability of such methods for real-world systems. To push this field forward, we build on recent advances in the area of continual learning and design a lifelong adaptive learning framework using Prototypical Networks, LAPNet-HAR, that processes sensor-based data streams in a task-free data-incremental fashion and mitigates catastrophic forgetting using experience replay and continual prototype adaptation. Online learning is further facilitated using contrastive loss to enforce inter-class separation. LAPNet-HAR is evaluated on five publicly available activity datasets in terms of its ability to acquire new information while preserving previous knowledge. Our extensive empirical results demonstrate the effectiveness of LAPNet-HAR in task-free CL and uncover useful insights for future challenges.
Collapse
Affiliation(s)
- Rebecca Adaimi
- Department of Electrical and Computer Engineering, University of Texas at Austin, Austin, TX 78712, USA
| | | |
Collapse
|
16
|
Liu S, Wang B, Li H, Chen C, Wang Z. Continual portfolio selection in dynamic environments via incremental reinforcement learning. INT J MACH LEARN CYB 2022. [DOI: 10.1007/s13042-022-01639-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
17
|
Komorniczak J, Zyblewski P, Ksieniewicz P. Statistical Drift Detection Ensemble for batch processing of data streams. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109380] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
18
|
Reminding the incremental language model via data-free self-distillation. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03678-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
19
|
Zhao Q, Si J, Sun J. Online Reinforcement Learning Control by Direct Heuristic Dynamic Programming: From Time-Driven to Event-Driven. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:4139-4144. [PMID: 33534714 DOI: 10.1109/tnnls.2021.3053037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
In this work, time-driven learning refers to the machine learning method that updates parameters in a prediction model continuously as new data arrives. Among existing approximate dynamic programming (ADP) and reinforcement learning (RL) algorithms, the direct heuristic dynamic programming (dHDP) has been shown an effective tool as demonstrated in solving several complex learning control problems. It continuously updates the control policy and the critic as system states continuously evolve. It is therefore desirable to prevent the time-driven dHDP from updating due to insignificant system event such as noise. Toward this goal, we propose a new event-driven dHDP. By constructing a Lyapunov function candidate, we prove the uniformly ultimately boundedness (UUB) of the system states and the weights in the critic and the control policy networks. Consequently, we show the approximate control and cost-to-go function approaching Bellman optimality within a finite bound. We also illustrate how the event-driven dHDP algorithm works in comparison to the original time-driven dHDP.
Collapse
|
20
|
Bayram F, Ahmed BS, Kassler A. From concept drift to model degradation: An overview on performance-aware drift detectors. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.108632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
21
|
Klikowski J, Woźniak M. Deterministic Sampling Classifier with weighted Bagging for drifted imbalanced data stream classification. Appl Soft Comput 2022. [DOI: 10.1016/j.asoc.2022.108855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
22
|
Agarwal S, Rattani A, Chowdary CR. A-iLearn: An adaptive incremental learning model for spoof fingerprint detection. MACHINE LEARNING WITH APPLICATIONS 2022. [DOI: 10.1016/j.mlwa.2021.100210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022] Open
|
23
|
Wu Z, Gao P, Cui L, Chen J. An Incremental Learning Method Based on Dynamic Ensemble RVM for Intrusion Detection. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT 2022. [DOI: 10.1109/tnsm.2021.3102388] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
24
|
Wu Y, Yang W, Yuan C, Pan J, Chen P. Incremental learning for detection in X-ray luggage perspective images. APPLIED OPTICS 2022; 61:C179-C191. [PMID: 35201051 DOI: 10.1364/ao.446060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Accepted: 01/13/2022] [Indexed: 06/14/2023]
Abstract
Convolutional neural networks have achieved remarkable results in the detection of X-ray luggage contraband. However, with an increase in contraband classes and substantial artificial transformation, the offline network training method has been unable to accurately detect the rapidly growing new classes of contraband. The current model cannot incrementally learn the newly appearing classes in real time without retraining the model. When the quantity of different types of contraband is not evenly distributed in the real-time detection process, the convolution neural network that is optimized by the gradient descent method will produce catastrophic forgetting, which means learning new knowledge and forgetting old knowledge, and the detection effect on the old classes will suddenly decline. To overcome this problem, this paper proposes an incremental learning method for online continuous learning of models and incrementally learns and detects new classes in the absence of old classes in the new classes. First, we perform parameter compression on the original network by distillation to ensure stable identification of the old classes. Second, the area proposal subnetwork and object detection subnetwork are incrementally learned to obtain the recognition ability of the new classes. In addition, this paper designs a new loss function, which causes the network to avoid catastrophic forgetting and stably detect the object of the new contraband classes. To reliably verify the model, this paper produces a multi-angle dataset for security perspective images. A total of 10 classes of contraband are tested, and the interference between two object detections is analyzed by model parameters. The experimental results show that the model can stably perform new contraband object learning even when there is an uneven distribution of data types.
Collapse
|
25
|
Incremental and accurate computation of machine learning models with smart data summarization. J Intell Inf Syst 2022. [DOI: 10.1007/s10844-021-00690-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
26
|
Evolved fuzzy min-max neural network for new-labeled data classification. APPL INTELL 2022. [DOI: 10.1007/s10489-021-02259-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
27
|
M. S. AR, C. R. N, B. R. S, Lahza H, Lahza HFM. A survey on detecting healthcare concept drift in AI/ML models from a finance perspective. Front Artif Intell 2022; 5:955314. [PMID: 37139355 PMCID: PMC10150933 DOI: 10.3389/frai.2022.955314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2022] [Accepted: 10/31/2022] [Indexed: 05/05/2023] Open
Abstract
Data is incredibly significant in today's digital age because data represents facts and numbers from our regular life transactions. Data is no longer arriving in a static form; it is now arriving in a streaming fashion. Data streams are the arrival of limitless, continuous, and rapid data. The healthcare industry is a major generator of data streams. Processing data streams is extremely complex due to factors such as volume, pace, and variety. Data stream classification is difficult owing to idea drift. Concept drift occurs in supervised learning when the statistical properties of the target variable that the model predicts change unexpectedly. We focused on solving various forms of concept drift problems in healthcare data streams in this research, and we outlined the existing statistical and machine learning methodologies for dealing with concept drift. It also emphasizes the use of deep learning algorithms for concept drift detection and describes the various healthcare datasets utilized for concept drift detection in data stream categorization.
Collapse
Affiliation(s)
- Abdul Razak M. S.
- Department of Computer Science and Engineering, Bapuji Institute of Engineering and Technology, Davangere, India
- *Correspondence: Abdul Razak M. S.
| | - Nirmala C. R.
- Department of Computer Science and Engineering, Bapuji Institute of Engineering and Technology, Davangere, India
| | - Sreenivasa B. R.
- Department of Information Science and Engineering, Bapuji Institute of Engineering and Technology, Davangere, India
| | - Husam Lahza
- Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Hassan Fareed M. Lahza
- Department of Information Systems, College of Computers and Information Systems, Umm Al-Qura University, Makkah, Saudi Arabia
| |
Collapse
|
28
|
Siegert I, Weißkirchen N, Krüger J, Akhtiamov O, Wendemuth A. Admitting the addressee detection faultiness of voice assistants to improve the activation performance using a continuous learning framework. COGN SYST RES 2021. [DOI: 10.1016/j.cogsys.2021.07.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
29
|
Samarasinghe D, Barlow M, Lakshika E, Kasmarik K. Exploiting abstractions for grammar‐based learning of complex multi‐agent behaviours. INT J INTELL SYST 2021. [DOI: 10.1002/int.22550] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Dilini Samarasinghe
- School of Engineering and IT University of New South Wales Canberra Australian Capital Territory Australia
| | - Michael Barlow
- School of Engineering and IT University of New South Wales Canberra Australian Capital Territory Australia
| | - Erandi Lakshika
- School of Engineering and IT University of New South Wales Canberra Australian Capital Territory Australia
| | - Kathryn Kasmarik
- School of Engineering and IT University of New South Wales Canberra Australian Capital Territory Australia
| |
Collapse
|
30
|
Multi-style learning for adaptation of perception intelligence in home service robots. Pattern Recognit Lett 2021. [DOI: 10.1016/j.patrec.2021.08.026] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
31
|
Angarita-Zapata JS, Alonso-Vicario A, Masegosa AD, Legarda J. A Taxonomy of Food Supply Chain Problems from a Computational Intelligence Perspective. SENSORS (BASEL, SWITZERLAND) 2021; 21:6910. [PMID: 34696123 PMCID: PMC8537557 DOI: 10.3390/s21206910] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Revised: 10/14/2021] [Accepted: 10/15/2021] [Indexed: 11/16/2022]
Abstract
In the last few years, the Internet of Things, and other enabling technologies, have been progressively used for digitizing Food Supply Chains (FSC). These and other digitalization-enabling technologies are generating a massive amount of data with enormous potential to manage supply chains more efficiently and sustainably. Nevertheless, the intricate patterns and complexity embedded in large volumes of data present a challenge for systematic human expert analysis. In such a data-driven context, Computational Intelligence (CI) has achieved significant momentum to analyze, mine, and extract the underlying data information, or solve complex optimization problems, striking a balance between productive efficiency and sustainability of food supply systems. Although some recent studies have sorted the CI literature in this field, they are mainly oriented towards a single family of CI methods (a group of methods that share common characteristics) and review their application in specific FSC stages. As such, there is a gap in identifying and classifying FSC problems from a broader perspective, encompassing the various families of CI methods that can be applied in different stages (from production to retailing) and identifying the problems that arise in these stages from a CI perspective. This paper presents a new and comprehensive taxonomy of FSC problems (associated with agriculture, fish farming, and livestock) from a CI approach; that is, it defines FSC problems (from production to retail) and categorizes them based on how they can be modeled from a CI point of view. Furthermore, we review the CI approaches that are more commonly used in each stage of the FSC and in their corresponding categories of problems. We also introduce a set of guidelines to help FSC researchers and practitioners to decide on suitable families of methods when addressing any particular problems they might encounter. Finally, based on the proposed taxonomy, we identify and discuss challenges and research opportunities that the community should explore to enhance the contributions that CI can bring to the digitization of the FSC.
Collapse
Affiliation(s)
- Juan S. Angarita-Zapata
- Deusto Institute of Technology (DeustoTech), Faculty of Engineering, University of Deusto, 48007 Bilbao, Spain; (A.A.-V.); (A.D.M.); (J.L.)
| | - Ainhoa Alonso-Vicario
- Deusto Institute of Technology (DeustoTech), Faculty of Engineering, University of Deusto, 48007 Bilbao, Spain; (A.A.-V.); (A.D.M.); (J.L.)
| | - Antonio D. Masegosa
- Deusto Institute of Technology (DeustoTech), Faculty of Engineering, University of Deusto, 48007 Bilbao, Spain; (A.A.-V.); (A.D.M.); (J.L.)
- Ikerbasque, Basque Foundation for Science, 48009 Bilbao, Spain
| | - Jon Legarda
- Deusto Institute of Technology (DeustoTech), Faculty of Engineering, University of Deusto, 48007 Bilbao, Spain; (A.A.-V.); (A.D.M.); (J.L.)
| |
Collapse
|
32
|
SSIT: a sample selection-based incremental model training method for image recognition. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-06515-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
33
|
|
34
|
Zhang X, Han M, Wu H, Li M, Chen Z. An overview of complex data stream ensemble classification. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2021. [DOI: 10.3233/jifs-211100] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
With the rapid development of information technology, data streams in various fields are showing the characteristics of rapid arrival, complex structure and timely processing. Complex types of data streams make the classification performance worse. However, ensemble classification has become one of the main methods of processing data streams. Ensemble classification performance is better than traditional single classifiers. This article introduces the ensemble classification algorithms of complex data streams for the first time. Then overview analyzes the advantages and disadvantages of these algorithms for steady-state, concept drift, imbalanced, multi-label and multi-instance data streams. At the same time, the application fields of data streams are also introduced which summarizes the ensemble algorithms processing text, graph and big data streams. Moreover, it comprehensively summarizes the verification technology, evaluation indicators and open source platforms of complex data streams mining algorithms. Finally, the challenges and future research directions of ensemble learning algorithms dealing with uncertain, multi-type, delayed, multi-type concept drift data streams are given.
Collapse
Affiliation(s)
- Xilong Zhang
- School of Computer Science and Engineering, North Minzu University, Yinchuan, China
| | - Meng Han
- School of Computer Science and Engineering, North Minzu University, Yinchuan, China
| | - Hongxin Wu
- School of Computer Science and Engineering, North Minzu University, Yinchuan, China
| | - Muhang Li
- School of Computer Science and Engineering, North Minzu University, Yinchuan, China
| | - Zhiqiang Chen
- School of Computer Science and Engineering, North Minzu University, Yinchuan, China
| |
Collapse
|
35
|
Li H, Barnaghi P, Enshaeifar S, Ganz F. Continual Learning Using Bayesian Neural Networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:4243-4252. [PMID: 32866104 DOI: 10.1109/tnnls.2020.3017292] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Continual learning models allow them to learn and adapt to new changes and tasks over time. However, in continual and sequential learning scenarios, in which the models are trained using different data with various distributions, neural networks (NNs) tend to forget the previously learned knowledge. This phenomenon is often referred to as catastrophic forgetting. The catastrophic forgetting is an inevitable problem in continual learning models for dynamic environments. To address this issue, we propose a method, called continual Bayesian learning networks (CBLNs), which enables the networks to allocate additional resources to adapt to new tasks without forgetting the previously learned tasks. Using a Bayesian NN, CBLN maintains a mixture of Gaussian posterior distributions that are associated with different tasks. The proposed method tries to optimize the number of resources that are needed to learn each task and avoids an exponential increase in the number of resources that are involved in learning multiple tasks. The proposed method does not need to access the past training data and can choose suitable weights to classify the data points during the test time automatically based on an uncertainty criterion. We have evaluated the method on the MNIST and UCR time-series data sets. The evaluation results show that the method can address the catastrophic forgetting problem at a promising rate compared to the state-of-the-art models.
Collapse
|
36
|
Lughofer E, Pratama M. Online sequential ensembling of predictive fuzzy systems. EVOLVING SYSTEMS 2021. [DOI: 10.1007/s12530-021-09398-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
AbstractEvolving fuzzy systems (EFS) have enjoyed a wide attraction in the community to handle learning from data streams in an incremental, single-pass and transparent manner. The main concentration so far lied in the development of approaches for single EFS models, basically used for prediction purposes. Forgetting mechanisms have been used to increase their flexibility, especially for the purpose to adapt quickly to changing situations such as drifting data distributions. These require forgetting factors steering the degree of timely out-weighing older learned concepts, whose adequate setting in advance or in adaptive fashion is not an easy and not a fully resolved task. In this paper, we propose a new concept of learning fuzzy systems from data streams, which we call online sequential ensembling of fuzzy systems (OS-FS). It is able to model the recent dependencies in streams on a chunk-wise basis: for each new incoming chunk, a new fuzzy model is trained from scratch and added to the ensemble (of fuzzy systems trained before). This induces (i) maximal flexibility in terms of being able to apply variable chunk sizes according to the actual system delay in receiving target values and (ii) fast reaction possibilities in the case of arising drifts. The latter are realized with specific prediction techniques on new data chunks based on the sequential ensemble members trained so far over time. We propose four different prediction variants including various weighting concepts in order to put higher weights on the members with higher inference certainty during the amalgamation of predictions of single members to a final prediction. In this sense, older members, which keep in mind knowledge about past states, may get dynamically reactivated in the case of cyclic drifts, which induce dynamic changes in the process behavior which are re-occurring from time to time later. Furthermore, we integrate a concept for properly resolving possible contradictions among members with similar inference certainties. The reaction onto drifts is thus autonomously handled on demand and on the fly during the prediction stage (and not during model adaptation/evolution stage as conventionally done in single EFS models), which yields enormous flexibility. Finally, in order to cope with large-scale and (theoretically) infinite data streams within a reasonable amount of prediction time, we demonstrate two concepts for pruning past ensemble members, one based on atypical high error trends of single members and one based on the non-diversity of ensemble members. The results based on two data streams showed significantly improved performance compared to single EFS models in terms of a better convergence of the accumulated chunk-wise ahead prediction error trends, especially in the case of regular and cyclic drifts. Moreover, the more advanced prediction schemes could significantly outperform standard averaging over all members’ outputs. Furthermore, resolving contradictory outputs among members helped to improve the performance of the sequential ensemble further. Results on a wider range of data streams from different application scenarios showed (i) improved error trend lines over single EFS models, as well as over related AI methods OS-ELM and MLPs neural networks retrained on data chunks, and (ii) slightly worse trend lines than on-line bagged EFS (as specific EFS ensembles), but with around 100 times faster processing times (achieving low processing times way below requiring milli-seconds for single samples updates).
Collapse
|
37
|
Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2021; 2021:8813806. [PMID: 34381499 PMCID: PMC8352686 DOI: 10.1155/2021/8813806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Revised: 07/04/2021] [Accepted: 07/21/2021] [Indexed: 11/17/2022]
Abstract
Class imbalance and concept drift are two primary principles that exist concurrently in data stream classification. Although the two issues have drawn enough attention separately, the joint treatment largely remains unexplored. Moreover, the class imbalance issue is further complicated if data streams with concept drift. A novel Cost-Sensitive based Data Stream (CSDS) classification is introduced to overcome the two issues simultaneously. The CSDS considers cost information during the procedures of data preprocessing and classification. During the data preprocessing, a cost-sensitive learning strategy is introduced into the ReliefF algorithm for alleviating the class imbalance at the data level. In the classification process, a cost-sensitive weighting schema is devised to enhance the overall performance of the ensemble. Besides, a change detection mechanism is embedded in our algorithm, which guarantees that an ensemble can capture and react to drift promptly. Experimental results validate that our method can obtain better classification results under different imbalanced concept drifting data stream scenarios.
Collapse
|
38
|
Qiang N, Dong Q, Liang H, Ge B, Zhang S, Sun Y, Zhang C, Zhang W, Gao J, Liu T. Modeling and augmenting of fMRI data using deep recurrent variational auto-encoder. J Neural Eng 2021; 18. [PMID: 34229310 DOI: 10.1088/1741-2552/ac1179] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 07/06/2021] [Indexed: 11/11/2022]
Abstract
Objective. Recently, deep learning models have been successfully applied in functional magnetic resonance imaging (fMRI) modeling and associated applications. However, there still exist at least two challenges. Firstly, due to the lack of sufficient data, deep learning models tend to suffer from overfitting in the training process. Secondly, it is still challenging to model the temporal dynamics from fMRI, due to that the brain state is continuously changing over scan time. In addition, existing methods rarely studied and applied fMRI data augmentation.Approach. In this work, we construct a deep recurrent variational auto-encoder (DRVAE) that combined variational auto-encoder and recurrent neural network, aiming to address all of the above mentioned challenges. The encoder of DRVAE can extract more generalized temporal features from assumed Gaussian distribution of input data, and the decoder of DRVAE can generate new data to increase training samples and thus partially relieve the overfitting issue. The recurrent layers in DRVAE are designed to effectively model the temporal dynamics of functional brain activities. LASSO (least absolute shrinkage and selection operator) regression is applied on the temporal features and input fMRI data to estimate the corresponding spatial networks.Main results. Extensive experimental results on seven tasks from HCP dataset showed that the DRVAE and LASSO framework can learn meaningful temporal patterns and spatial networks from both real data and generated data. The results on group-wise data and single subject suggest that the brain activities may follow certain distribution. Moreover, we applied DRVAE on four resting state fMRI datasets from ADHD-200 for data augmentation, and the results showed that the classification performances on augmented datasets have been considerably improved.Significance. The proposed method can not only derive meaningful temporal features and spatial networks from fMRI, but also generate high-quality new data for fMRI data augmentation and associated applications.
Collapse
Affiliation(s)
- Ning Qiang
- School of Physics and Information Technology, Shaanxi Normal University, Xi'an, People's Republic of China.,Center for Brain and Brain-Inspired Computing Research, Department of Computer Science, Northwestern Polytechnical University, Xi'an, People's Republic of China
| | - Qinglin Dong
- Advanced Medical Computing and Analysis, Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, United States of America
| | - Hongtao Liang
- School of Physics and Information Technology, Shaanxi Normal University, Xi'an, People's Republic of China
| | - Bao Ge
- School of Physics and Information Technology, Shaanxi Normal University, Xi'an, People's Republic of China.,Center for Brain and Brain-Inspired Computing Research, Department of Computer Science, Northwestern Polytechnical University, Xi'an, People's Republic of China
| | - Shu Zhang
- Center for Brain and Brain-Inspired Computing Research, Department of Computer Science, Northwestern Polytechnical University, Xi'an, People's Republic of China
| | - Yifei Sun
- School of Physics and Information Technology, Shaanxi Normal University, Xi'an, People's Republic of China
| | - Cheng Zhang
- School of Electronics Engineering and Computer Science, Peking University, Beijing, People's Republic of China
| | - Wei Zhang
- Department of Radiology and Biomedical Imaging, University of California, San Francisco, CA, United States of America
| | - Jie Gao
- School of Physics and Information Technology, Shaanxi Normal University, Xi'an, People's Republic of China
| | - Tianming Liu
- Cortical Architecture Imaging and Discovery Lab, Department of Computer Science and Bioimaging Research Center, The University of Georgia, Athens, GA, United States of America
| |
Collapse
|
39
|
Elmi J, Eftekhari M. Multi-Layer Selector(MLS): Dynamic selection based on filtering some competence measures. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
40
|
Liu A, Lu J, Zhang G. Concept Drift Detection via Equal Intensity k-Means Space Partitioning. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:3198-3211. [PMID: 32324590 DOI: 10.1109/tcyb.2020.2983962] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
The data stream poses additional challenges to statistical classification tasks because distributions of the training and target samples may differ as time passes. Such a distribution change in streaming data is called concept drift. Numerous histogram-based distribution change detection methods have been proposed to detect drift. Most histograms are developed on the grid-based or tree-based space partitioning algorithms which makes the space partitions arbitrary, unexplainable, and may cause drift blind spots. There is a need to improve the drift detection accuracy for the histogram-based methods with the unsupervised setting. To address this problem, we propose a cluster-based histogram, called equal intensity k -means space partitioning (EI-kMeans). In addition, a heuristic method to improve the sensitivity of drift detection is introduced. The fundamental idea of improving the sensitivity is to minimize the risk of creating partitions in distribution offset regions. Pearson's chi-square test is used as the statistical hypothesis test so that the test statistics remain independent of the sample distribution. The number of bins and their shapes, which strongly influence the ability to detect drift, are determined dynamically from the sample based on an asymptotic constraint in the chi-square test. Accordingly, three algorithms are developed to implement concept drift detection, including a greedy centroids initialization algorithm, a cluster amplify-shrink algorithm, and a drift detection algorithm. For drift adaptation, we recommend retraining the learner if a drift is detected. The results of experiments on the synthetic and real-world datasets demonstrate the advantages of EI-kMeans and show its efficacy in detecting concept drift.
Collapse
|
41
|
Choudhary S, Herdt D, Spoor E, García Molina JF, Nachtmann M, Rädle M. Incremental Learning in Modelling Process Analysis Technology (PAT)-An Important Tool in the Measuring and Control Circuit on the Way to the Smart Factory. SENSORS 2021; 21:s21093144. [PMID: 34062767 PMCID: PMC8124399 DOI: 10.3390/s21093144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Revised: 04/15/2021] [Accepted: 04/28/2021] [Indexed: 12/01/2022]
Abstract
To meet the demands of the chemical and pharmaceutical process industry for a combination of high measurement accuracy, product selectivity, and low cost of ownership, the existing measurement and evaluation methods have to be further developed. This paper demonstrates the attempt to combine future Raman photometers with promising evaluation methods. As part of the investigations presented here, a new and easy-to-use evaluation method based on a self-learning algorithm is presented. This method can be applied to various measurement methods and is carried out here using an example of a Raman spectrometer system and an alcohol-water mixture as demonstration fluid. The spectra’s chosen bands can be later transformed to low priced and even more robust Raman photometers. The evaluation method gives more precise results than the evaluation through classical methods like one primarily used in the software package Unscrambler. This technique increases the accuracy of detection and proves the concept of Raman process monitoring for determining concentrations. In the example of alcohol/water, the computation time is less, and it can be applied to continuous column monitoring.
Collapse
Affiliation(s)
- Shivani Choudhary
- Center for Mass Spectrometry and Optical Spectroscopy, Mannheim University of Applied Sciences, Paul-Wittsack-Straße 10, 68163 Mannheim, Germany; (E.S.); (M.N.); (M.R.)
- Correspondence: (S.C.); (D.H.)
| | - Deborah Herdt
- Center for Mass Spectrometry and Optical Spectroscopy, Mannheim University of Applied Sciences, Paul-Wittsack-Straße 10, 68163 Mannheim, Germany; (E.S.); (M.N.); (M.R.)
- Correspondence: (S.C.); (D.H.)
| | - Erik Spoor
- Center for Mass Spectrometry and Optical Spectroscopy, Mannheim University of Applied Sciences, Paul-Wittsack-Straße 10, 68163 Mannheim, Germany; (E.S.); (M.N.); (M.R.)
| | - José Fernando García Molina
- Institute of Process Control and Innovative Energy Conversion, Mannheim University of Applied Sciences, 68163 Mannheim, Germany;
| | - Marcel Nachtmann
- Center for Mass Spectrometry and Optical Spectroscopy, Mannheim University of Applied Sciences, Paul-Wittsack-Straße 10, 68163 Mannheim, Germany; (E.S.); (M.N.); (M.R.)
| | - Matthias Rädle
- Center for Mass Spectrometry and Optical Spectroscopy, Mannheim University of Applied Sciences, Paul-Wittsack-Straße 10, 68163 Mannheim, Germany; (E.S.); (M.N.); (M.R.)
| |
Collapse
|
42
|
Sarnovsky M, Kolarik M. Classification of the drifting data streams using heterogeneous diversified dynamic class-weighted ensemble. PeerJ Comput Sci 2021; 7:e459. [PMID: 33834113 PMCID: PMC8022634 DOI: 10.7717/peerj-cs.459] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 03/05/2021] [Indexed: 06/12/2023]
Abstract
Data streams can be defined as the continuous stream of data coming from different sources and in different forms. Streams are often very dynamic, and its underlying structure usually changes over time, which may result to a phenomenon called concept drift. When solving predictive problems using the streaming data, traditional machine learning models trained on historical data may become invalid when such changes occur. Adaptive models equipped with mechanisms to reflect the changes in the data proved to be suitable to handle drifting streams. Adaptive ensemble models represent a popular group of these methods used in classification of drifting data streams. In this paper, we present the heterogeneous adaptive ensemble model for the data streams classification, which utilizes the dynamic class weighting scheme and a mechanism to maintain the diversity of the ensemble members. Our main objective was to design a model consisting of a heterogeneous group of base learners (Naive Bayes, k-NN, Decision trees), with adaptive mechanism which besides the performance of the members also takes into an account the diversity of the ensemble. The model was experimentally evaluated on both real-world and synthetic datasets. We compared the presented model with other existing adaptive ensemble methods, both from the perspective of predictive performance and computational resource requirements.
Collapse
|
43
|
Efficiently Updating ECG-Based Biometric Authentication Based on Incremental Learning. SENSORS 2021; 21:s21051568. [PMID: 33668148 PMCID: PMC7956719 DOI: 10.3390/s21051568] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2020] [Revised: 02/05/2021] [Accepted: 02/12/2021] [Indexed: 11/17/2022]
Abstract
Recently, the interest in biometric authentication based on electrocardiograms (ECGs) has increased. Nevertheless, the ECG signal of a person may vary according to factors such as the emotional or physical state, thus hindering authentication. We propose an adaptive ECG-based authentication method that performs incremental learning to identify ECG signals from a subject under a variety of measurement conditions. An incremental support vector machine (SVM) is adopted for authentication implementing incremental learning. We collected ECG signals from 11 subjects during 10 min over six days and used the data from days 1 to 5 for incremental learning, and those from day 6 for testing. The authentication results show that the proposed system consistently reduces the false acceptance rate from 6.49% to 4.39% and increases the true acceptance rate from 61.32% to 87.61% per single ECG wave after incremental learning using data from the five days. In addition, the authentication results tested using data obtained a day after the latest training show the false acceptance rate being within reliable range (3.5–5.33%) and improvement of the true acceptance rate (70.05–87.61%) over five days.
Collapse
|
44
|
Zhang J, Tang Z, Xie Y, Ai M, Zhang G, Gui W. Data-driven adaptive modeling method for industrial processes and its application in flotation reagent control. ISA TRANSACTIONS 2021; 108:305-316. [PMID: 32861477 DOI: 10.1016/j.isatra.2020.08.024] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Revised: 08/03/2020] [Accepted: 08/14/2020] [Indexed: 06/11/2023]
Abstract
In real industrial processes, new process "excitation" patterns that largely deviate from previously collected training data will appear due to disturbances caused by process inputs. To reduce model mismatch, it is important for a data-driven process model to adapt to new process "excitation" patterns. Although efforts have been devoted to developing adaptive process models to deal with this problem, few studies have attempted to develop an adaptive process model that can incrementally learn new process "excitation" patterns without performance degradation on old patterns. In this study, efforts are devoted to enabling data-driven process models with incremental learning ability. First, a novel incremental learning method is proposed for process model updating. Second, an adaptive neural network process model is developed based on the novel incremental learning method. Third, a nonlinear model predictive control based on the adaptive process model is implemented and applied for flotation reagent control. Experiments based on historical data provide evidence that the newly developed adaptive process model can accommodate new process "excitation" patterns and preserve its performance on old patterns. Furthermore, industry experiments carried out in a real-world lead-zinc froth flotation plant provide industrial evidence and show that the newly designed controller is promising for practical flotation reagent control.
Collapse
Affiliation(s)
- Jin Zhang
- School of Automation, Central South University, Changsha 410083, China.
| | - Zhaohui Tang
- School of Automation, Central South University, Changsha 410083, China.
| | - Yongfang Xie
- School of Automation, Central South University, Changsha 410083, China.
| | - Mingxi Ai
- School of Automation, Central South University, Changsha 410083, China.
| | - Guoyong Zhang
- School of Computer Science and Engineering, Central South University, Changsha 410083, China.
| | - Weihua Gui
- School of Automation, Central South University, Changsha 410083, China.
| |
Collapse
|
45
|
Tabassum A, Erbad A, Mohamed A, Guizani M. Privacy-Preserving Distributed IDS Using Incremental Learning for IoT Health Systems. IEEE ACCESS 2021; 9:14271-14283. [DOI: 10.1109/access.2021.3051530] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2025]
|
46
|
Gepperth A. Incremental learning with a homeostatic self-organizing neural model. Neural Comput Appl 2020. [DOI: 10.1007/s00521-019-04112-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
47
|
Wei X, Liu S, Xiang Y, Duan Z, Zhao C, Lu Y. Incremental learning based multi-domain adaptation for object detection. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2020.106420] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
48
|
|
49
|
Luo Y, Yin L, Bai W, Mao K. An Appraisal of Incremental Learning Methods. ENTROPY (BASEL, SWITZERLAND) 2020; 22:E1190. [PMID: 33286958 PMCID: PMC7712976 DOI: 10.3390/e22111190] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 10/16/2020] [Accepted: 10/19/2020] [Indexed: 11/24/2022]
Abstract
As a special case of machine learning, incremental learning can acquire useful knowledge from incoming data continuously while it does not need to access the original data. It is expected to have the ability of memorization and it is regarded as one of the ultimate goals of artificial intelligence technology. However, incremental learning remains a long term challenge. Modern deep neural network models achieve outstanding performance on stationary data distributions with batch training. This restriction leads to catastrophic forgetting for incremental learning scenarios since the distribution of incoming data is unknown and has a highly different probability from the old data. Therefore, a model must be both plastic to acquire new knowledge and stable to consolidate existing knowledge. This review aims to draw a systematic review of the state of the art of incremental learning methods. Published reports are selected from Web of Science, IEEEXplore, and DBLP databases up to May 2020. Each paper is reviewed according to the types: architectural strategy, regularization strategy and rehearsal and pseudo-rehearsal strategy. We compare and discuss different methods. Moreover, the development trend and research focus are given. It is concluded that incremental learning is still a hot research area and will be for a long period. More attention should be paid to the exploration of both biological systems and computational models.
Collapse
Affiliation(s)
| | | | | | - Keming Mao
- College of Software, Northeastern University, Shenyang 110004, China; (Y.L.); (L.Y.); (W.B.)
| |
Collapse
|
50
|
Li L, Wang Y, Hsu CY, Li Y, Lin KY. L-measure evaluation metric for fake information detection models with binary class imbalance. ENTERP INF SYST-UK 2020. [DOI: 10.1080/17517575.2020.1825821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Li Li
- College of Electronics and Information Engineering, Tongji University, Shanghai, China
- Shanghai Institute of Intelligent Science and Technology, Tongji University, Shanghai, China
| | - Yong Wang
- College of Electronics and Information Engineering, Tongji University, Shanghai, China
| | - Chia-Yu Hsu
- Industrial Engineering of Management, National Taipei University of Technology, Taipei, China
| | - Yibin Li
- Department of Economics and Finance, Tongji University, Shanghai, China
| | - Kuo-Yi Lin
- College of Electronics and Information Engineering, Tongji University, Shanghai, China
- Shanghai Institute of Intelligent Science and Technology, Tongji University, Shanghai, China
| |
Collapse
|