1
|
Arnia F, Saddami K, Roslidar R, Muharar R, Munadi K. Towards accurate Diabetic Foot Ulcer image classification: Leveraging CNN pre-trained features and extreme learning machine. SMART HEALTH 2024; 33:100502. [DOI: 10.1016/j.smhl.2024.100502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2025]
|
2
|
Liu K, Wang R, Song X, Deng X, Zhu Q. Exploration of MPSO-Two-Stage Classification Optimization Model for Scene Images with Low Quality and Complex Semantics. SENSORS (BASEL, SWITZERLAND) 2024; 24:3983. [PMID: 38931766 PMCID: PMC11207861 DOI: 10.3390/s24123983] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 05/30/2024] [Accepted: 06/17/2024] [Indexed: 06/28/2024]
Abstract
Currently, complex scene classification strategies are limited to high-definition image scene sets, and low-quality scene sets are overlooked. Although a few studies have focused on artificially noisy images or specific image sets, none have involved actual low-resolution scene images. Therefore, designing classification models around practicality is of paramount importance. To solve the above problems, this paper proposes a two-stage classification optimization algorithm model based on MPSO, thus achieving high-precision classification of low-quality scene images. Firstly, to verify the rationality of the proposed model, three groups of internationally recognized scene datasets were used to conduct comparative experiments with the proposed model and 21 existing methods. It was found that the proposed model performs better, especially in the 15-scene dataset, with 1.54% higher accuracy than the best existing method ResNet-ELM. Secondly, to prove the necessity of the pre-reconstruction stage of the proposed model, the same classification architecture was used to conduct comparative experiments between the proposed reconstruction method and six existing preprocessing methods on the seven self-built low-quality news scene frames. The results show that the proposed model has a higher improvement rate for outdoor scenes. Finally, to test the application potential of the proposed model in outdoor environments, an adaptive test experiment was conducted on the two self-built scene sets affected by lighting and weather. The results indicate that the proposed model is suitable for weather-affected scene classification, with an average accuracy improvement of 1.42%.
Collapse
Affiliation(s)
- Kexin Liu
- Department of Information Engineering, Engineering University of PAP, Xi’an 710086, China; (R.W.); (X.S.); (X.D.); (Q.Z.)
| | | | | | | | | |
Collapse
|
3
|
Panagos II, Giotis AP, Sofianopoulos S, Nikou C. A New Benchmark for Consumer Visual Tracking and Apparent Demographic Estimation from RGB and Thermal Images. SENSORS (BASEL, SWITZERLAND) 2023; 23:9510. [PMID: 38067883 PMCID: PMC10708599 DOI: 10.3390/s23239510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 11/20/2023] [Accepted: 11/23/2023] [Indexed: 12/18/2023]
Abstract
Visual tracking and attribute estimation related to age or gender information of multiple person entities in a scene are mature research topics with the advent of deep learning techniques. However, when it comes to indoor images such as video sequences of retail consumers, data are not always adequate or accurate enough to essentially train effective models for consumer detection and tracking under various adverse factors. This in turn affects the quality of recognizing age or gender for those detected instances. In this work, we introduce two novel datasets: Consumers comprises 145 video sequences compliant to personal information regulations as far as facial images are concerned and BID is a set of cropped body images from each sequence that can be used for numerous computer vision tasks. We also propose an end-to-end framework which comprises CNNs as object detectors, LSTMs for motion forecasting of the tracklet association component in a sequence, along with a multi-attribute classification model for apparent demographic estimation of the detected outputs, aiming to capture useful metadata of consumer product preferences. Obtained results on tracking and age/gender prediction are promising with respect to reference systems while they indicate the proposed model's potential for practical consumer metadata extraction.
Collapse
Affiliation(s)
- Iason-Ioannis Panagos
- Department of Computer Science and Engineering (CSE), University of Ioannina, 45110 Ioannina, Greece; (I.-I.P.); (C.N.)
| | - Angelos P. Giotis
- Department of Computer Science and Engineering (CSE), University of Ioannina, 45110 Ioannina, Greece; (I.-I.P.); (C.N.)
| | - Sokratis Sofianopoulos
- Institute for Language and Speech Processing (ILSP), Athena Research and Innovation Center, 15125 Athens, Greece;
| | - Christophoros Nikou
- Department of Computer Science and Engineering (CSE), University of Ioannina, 45110 Ioannina, Greece; (I.-I.P.); (C.N.)
| |
Collapse
|
4
|
Leung TM, Chan KL. Gender Recognition Based on Gradual and Ensemble Learning from Multi-View Gait Energy Images and Poses. SENSORS (BASEL, SWITZERLAND) 2023; 23:8961. [PMID: 37960659 PMCID: PMC10648426 DOI: 10.3390/s23218961] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Revised: 10/31/2023] [Accepted: 11/02/2023] [Indexed: 11/15/2023]
Abstract
Image-based gender classification is very useful in many applications, such as intelligent surveillance, micromarketing, etc. One common approach is to adopt a machine learning algorithm to recognize the gender class of the captured subject based on spatio-temporal gait features extracted from the image. The image input can be generated from the video of the walking cycle, e.g., gait energy image (GEI). Recognition accuracy depends on the similarity of intra-class GEIs, as well as the dissimilarity of inter-class GEIs. However, we observe that, at some viewing angles, the GEIs of both gender classes are very similar. Moreover, the GEI does not exhibit a clear appearance of posture. We postulate that distinctive postures of the walking cycle can provide additional and valuable information for gender classification. This paper proposes a gender classification framework that exploits multiple inputs of the GEI and the characteristic poses of the walking cycle. The proposed framework is a cascade network that is capable of gradually learning the gait features from images acquired in multiple views. The cascade network contains a feature extractor and gender classifier. The multi-stream feature extractor network is trained to extract features from the multiple input images. Features are then fed to the classifier network, which is trained with ensemble learning. We evaluate and compare the performance of our proposed framework with state-of-the-art gait-based gender classification methods on benchmark datasets. The proposed framework outperforms other methods that only utilize a single input of the GEI or pose.
Collapse
Affiliation(s)
| | - Kwok-Leung Chan
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong, China;
| |
Collapse
|
5
|
Li J, Luo X, Ma H, Zhao W. A Hybrid Deep Transfer Learning Model With Kernel Metric for COVID-19 Pneumonia Classification Using Chest CT Images. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:2506-2517. [PMID: 36279353 DOI: 10.1109/tcbb.2022.3216661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Coronavirus disease-2019 (COVID-19) as a new pneumonia which is extremely infectious, the classification of this coronavirus is essential to effectively control the development of the epidemic. Pathological changes in the chest computed tomography (CT) scans are often used as one of the diagnostic criteria of COVID-19. Meanwhile, deep learning-based transfer learning is currently an effective strategy for computer-aided diagnosis (CAD). To further improve the performance of deep transfer learning model used for COVID-19 classification with CT images, in this article, we propose a hybrid model combined with a semi-supervised domain adaption model and extreme learning machine (ELM) classifier, and the application of a novel multikernel correntropy induced loss function in transfer learning is also presented. The proposed model is evaluated on open-source datasets. The experimental results are compared to some baseline models to verify the effectiveness, while adopting accuracy, precision, recall, F1 score and area under curve (AUC) as the evaluation metrics. Experimental results show that the proposed method improves the performance of original model and is more suitable for CT images analysis.
Collapse
|
6
|
Li J, Li Y, Du M. Comparative study of EEG motor imagery classification based on DSCNN and ELM. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/26/2023]
|
7
|
Maayah M, Abunada A, Al-Janahi K, Ahmed ME, Qadir J. LimitAccess: on-device TinyML based robust speech recognition and age classification. DISCOVER ARTIFICIAL INTELLIGENCE 2023. [DOI: 10.1007/s44163-023-00051-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
Abstract
AbstractAutomakers from Honda to Lamborghini are incorporating voice interaction technology into their vehicles to improve the user experience and offer value-added services. Speech recognition systems are a key component of smart cars, enhancing convenience and safety for drivers and passengers. In the future, safety-critical features may rely on speech recognition, but this raises concerns about children accessing such services. To address this issue, the LimitAccess system is proposed, which uses TinyML for age classification and helps parents limit children’s access to critical speech recognition services. This study employs a lite convolutional neural network (CNN) model for two different reasons: First, CNN showed superior accuracy compared to other audio classification models for age classification problems. Second, the lite model will be integrated into a microcontroller to meet its limited resource requirements. To train and evaluate our model, we created a dataset that included child and adult voices of the keyword “open”. The system approach categorizes voices into age groups (child, adult) and then utilizes that categorization to grant access to a car. The robustness of the model was enhanced by adding a new class (recordings) to the dataset, which enabled our system to detect replay and synthetic voice attacks. If an adult voice is detected, access to start the car will be granted. However, if a child’s voice or a recording is detected, the system will display a warning message that educates the child about the dangers and consequences of the improper use of a car. Arduino Nano 33 BLE sensing was our embedded device of choice for integrating our trained, optimized model. Our system achieved an overall F1 score of 87.7% and 85.89% accuracy. LimitAccess detected replay and synthetic voice attacks with an 88% F1 score.
Collapse
|
8
|
Podder T, Bhattacharya D, Majumdar A. Real time facial expression and gender recognition with feature integrated CNN. THE IMAGING SCIENCE JOURNAL 2023. [DOI: 10.1080/13682199.2022.2157956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Affiliation(s)
- Tanusree Podder
- Department of Computer Science and Engineering, National Institute of Technology Agartala, Agartala, Tripura, India
| | - Diptendu Bhattacharya
- Department of Computer Science and Engineering, National Institute of Technology Agartala, Agartala, Tripura, India
| | - Abhishek Majumdar
- Department of Computer Science and Engineering, Techno India University, Kolkata, West Bengal, India
| |
Collapse
|
9
|
Oladipo O, Omidiora EO, Osamor VC. A novel genetic-artificial neural network based age estimation system. Sci Rep 2022; 12:19290. [PMID: 36369517 PMCID: PMC9652376 DOI: 10.1038/s41598-022-23242-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Accepted: 10/27/2022] [Indexed: 11/13/2022] Open
Abstract
Age estimation is the ability to predict the age of an individual based on facial clues. This could be put to practical use in underage voting detection, underage driving detection, and overage sportsmen detection. To date, no popular automatic age estimation system has been developed to target black faces. This study developed a novel age estimation system from the combination of a genetic algorithm and a back propagation (BP)-trained artificial neural network (ANN) and using the local binary pattern feature extraction technique (LBGANN) targeted at black faces. The system was trained with a predominantly black face database, and the result was compared against that of a standard ANN system (LBANN). The results showed that the developed system LBGANN outperformed the LBANN in terms of the correct classification rate.
Collapse
Affiliation(s)
- Oluwasegun Oladipo
- grid.411932.c0000 0004 1794 8359Department of Computer and Information Sciences, Covenant University, Ota, Ogun State Nigeria
| | - Elijah Olusayo Omidiora
- grid.411270.10000 0000 9777 3851Department of Computer Science and Engineering, Ladoke Akintola University of Technology (LAUTECH), Ogbomoso, Oyo State Nigeria
| | - Victor Chukwudi Osamor
- grid.411932.c0000 0004 1794 8359Department of Computer and Information Sciences, Covenant University, Ota, Ogun State Nigeria
| |
Collapse
|
10
|
Agbo-Ajala O, Viriri S, Oloko-Oba M, Ekundayo O, Heymann R. Apparent age prediction from faces: A survey of modern approaches. Front Big Data 2022; 5:1025806. [PMID: 36387012 PMCID: PMC9644213 DOI: 10.3389/fdata.2022.1025806] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 09/28/2022] [Indexed: 08/29/2023] Open
Abstract
Apparent age estimation via human face image has attracted increased attention due to its numerous real-world applications. Predicting the apparent age has been quite difficult for machines and humans. However, researchers have focused on machine estimation of "age as perceived" to a high level of accuracy. To further improve the performance of apparent age estimation from the facial image, researchers continue to examine different methods to enhance its results further. This paper presents a critical review of the modern approaches and techniques for the apparent age estimation task. We also present a comparative analysis of the performance of some of those approaches on the apparent facial aging benchmark. The study also highlights the strengths and weaknesses of each approach used for apparent age estimation to guide in choosing the appropriate algorithms for future work in the field. The work focuses on the most popular algorithms and those that appear to have been the most successful for apparent age estimation to improve on the existing state-of-the-art results. We based our evaluations on three facial aging datasets, including looking at people (LAP)-2015, LAP-2016, and APPA-REAL, the most popular and publicly available datasets benchmark for apparent age estimation.
Collapse
Affiliation(s)
- Olatunbosun Agbo-Ajala
- Computer Science Discipline, School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Durban, South Africa
| | - Serestina Viriri
- Computer Science Discipline, School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Durban, South Africa
- Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg, South Africa
| | - Mustapha Oloko-Oba
- Computer Science Discipline, School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Durban, South Africa
| | - Olufisayo Ekundayo
- Computer Science Discipline, School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Durban, South Africa
| | - Reolyn Heymann
- Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg, South Africa
| |
Collapse
|
11
|
Neggaz I, Neggaz N, Fizazi H. Boosting Archimedes optimization algorithm using trigonometric operators based on feature selection for facial analysis. Neural Comput Appl 2022; 35:3903-3923. [PMID: 36267472 PMCID: PMC9569187 DOI: 10.1007/s00521-022-07925-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2021] [Accepted: 10/03/2022] [Indexed: 01/31/2023]
Abstract
Due to technical advancements and the proliferation of mobile applications, facial analysis (FA) of humans has recently become an important area for computer vision research. FA investigates a variety of difficulties, including gender recognition, facial expression recognition, age and race recognition, with the goal of automatically comprehending social interactions. Due to the dimensional challenge posed by pre-trained CNN networks, the scientific community has developed numerous techniques inspired by biology, swarm intelligence theory, physics, and mathematical rules. This article presents a gender recognition system based on scAOA, that is a modified version of the Archimedes optimization algorithm (AOA). The latest variant (scAOA) enhances the exploitation stage by using trigonometric operators inspired by the sine cosine algorithm (SCA) in order to prevent local optima and to accelerate the convergence. The main purpose of this paper is to apply scAOA to select the relevant deep features provided by two pretrained models of CNN (AlexNet & ResNet) to recognize the gender of a human person categorized into two classes (men and women). Two datasets are used to evaluate the proposed approach (scAOA): the Brazilian FEI dataset and the Georgia Tech Face dataset (GT). In terms of accuracy, Fscore and statistical test, the comparison analysis demonstrates that scAOA outperforms other modern and competitive optimizers such as AOA, SCA, Ant lion optimizer (ALO), Salp swarm algorithm (SSA), Grey wolf optimizer (GWO), Simple genetic algorithm (SGA), Grasshopper optimization algorithm (GOA) and Particle swarm optimizer (PSO).
Collapse
Affiliation(s)
- Imène Neggaz
- Université des Sciences et de la Technologie d’Oran Mohamed Boudiaf (USTO-MB), BP 1505, EL M’naouer, 3100 Oran, Algeria
| | - Nabil Neggaz
- Université des Sciences et de la Technologie d’Oran Mohamed Boudiaf (USTO-MB), BP 1505, EL M’naouer, 3100 Oran, Algeria
| | - Hadria Fizazi
- Université des Sciences et de la Technologie d’Oran Mohamed Boudiaf (USTO-MB), BP 1505, EL M’naouer, 3100 Oran, Algeria
| |
Collapse
|
12
|
Gupta SK, Nain N. Review: Single attribute and multi attribute facial gender and age estimation. MULTIMEDIA TOOLS AND APPLICATIONS 2022; 82:1289-1311. [PMID: 35729932 PMCID: PMC9200214 DOI: 10.1007/s11042-022-12678-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 03/05/2021] [Accepted: 02/21/2022] [Indexed: 06/15/2023]
Abstract
Facial age and gender recognition have vital applications as consumer profile prediction, social media advertisement, human-computer interaction, image retrieval system, demographic profiling, customized advertisement systems, security and surveillance. This paper presents a study on Single Attribute (Attribute: either Gender or Age) and Multi-Attribute (both Gender and Age) prediction model. We present a review for facial age estimation and gender classification methods based on conventional as well as deep learning approaches developed so far with analysis of their pros, cons and insights for future research. Moreover, this study also enlists the databases used for benchmarking results with their properties for both constrained and unconstrained environment.
Collapse
Affiliation(s)
- Sandeep Kumar Gupta
- Department of Computer Science & Engineering, Malaviya National Institute of Technology, Jaipur, 302017 Rajasthan India
| | - Neeta Nain
- Department of Computer Science & Engineering, Malaviya National Institute of Technology, Jaipur, 302017 Rajasthan India
| |
Collapse
|
13
|
Badr M, Sarhan A, Elbasiony R. ICRL: Using landmark ratios with cascade model for an accurate age estimation system using deep neural networks. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-211267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Over the past decade, the computer vision community has given increased attention to the development of age estimation systems. Several approaches to more accurate and robust facial age estimation have been introduced. Apparent age datasets are typically collected from uncontrolled environments, leading to a number of challenges. In this paper, a cascade model system, which we called the ‘Integrated Classification and Regression with Landmark Ratios (ICRL), is introduced. Our system uses a classification model in order to learn the age label distribution, then uses this knowledge as an auxiliary input to a regression model. ICRL is based on context facial information and label distribution analysis. Facial context information is introduced through the extraction of precise facial landmark ratios. Extracted landmark ratios allow the system to distinguish each age label. The ICRL system uses a classification model to train the CNN network to learn the in-between relation of age labels. ICRL sufficiently models the aging process in the form of ordered and continuous imagery. The ICRL system minimizes the number of parameters needed as well as overall computational costs whilst maintaining robust and accurate results. Despite its simplicity, our system has outperformed other state-of-the-art approaches when applied onto the MORPH II, CLAP2015, AFAD and UTKFace datasets. ICRL achieved an overall superior predictive performance, reaching 99.67% with MORPH II, 99.51% with AFAD, 96.52 with CLAP2015, and 96.28% with UTKFace.
Collapse
Affiliation(s)
- Marwa Badr
- Department of Computers and Control Engineering, Faculty of Engineering, Tanta University, Tanta, Egypt
| | - Amany Sarhan
- Department of Computers and Control Engineering, Faculty of Engineering, Tanta University, Tanta, Egypt
| | - Reda Elbasiony
- Department of Computers and Control Engineering, Faculty of Engineering, Tanta University, Tanta, Egypt
| |
Collapse
|
14
|
A Hierarchical Approach toward Prediction of Human Biological Age from Masked Facial Image Leveraging Deep Learning Techniques. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12115306] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The lifestyle of humans has changed noticeably since the contagious COVID-19 disease struck globally. People should wear a face mask as a protective measure to curb the spread of the contagious disease. Consequently, real-world applications (i.e., electronic customer relationship management) dealing with human ages extracted from face images must migrate to a robust system proficient to estimate the age of a person wearing a face mask. In this paper, we proposed a hierarchical age estimation model from masked facial images in a group-to-specific manner rather than a single regression model because age progression across different age groups is quite dissimilar. Our intention was to squeeze the feature space among limited age classes so that the model could fairly discern age. We generated a synthetic masked face image dataset over the IMDB-WIKI face image dataset to train and validate our proposed model due to the absence of a benchmark masked face image dataset with real age annotations. We somewhat mitigated the data sparsity problem of the large public IMDB-WIKI dataset using off-the-shelf down-sampling and up-sampling techniques as required. The age estimation task was fully modeled like a deep classification problem, and expected ages were formulated from SoftMax probabilities. We performed a classification task by deploying multiple low-memory and higher-accuracy-based convolutional neural networks (CNNs). Our proposed hierarchical framework demonstrated marginal improvement in terms of mean absolute error (MAE) compared to the one-off model approach for masked face real age estimation. Moreover, this research is perhaps the maiden attempt to estimate the real age of a person from his/her masked face image.
Collapse
|
15
|
Face Image Analysis Using Machine Learning: A Survey on Recent Trends and Applications. ELECTRONICS 2022. [DOI: 10.3390/electronics11081210] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Human face image analysis using machine learning is an important element in computer vision. The human face image conveys information such as age, gender, identity, emotion, race, and attractiveness to both human and computer systems. Over the last ten years, face analysis methods using machine learning have received immense attention due to their diverse applications in various tasks. Although several methods have been reported in the last ten years, face image analysis still represents a complicated challenge, particularly for images obtained from ’in the wild’ conditions. This survey paper presents a comprehensive review focusing on methods in both controlled and uncontrolled conditions. Our work illustrates both merits and demerits of each method previously proposed, starting from seminal works on face image analysis and ending with the latest ideas exploiting deep learning frameworks. We show a comparison of the performance of the previous methods on standard datasets and also present some promising future directions on the topic.
Collapse
|
16
|
Comparative analysis of features extraction techniques for black face age estimation. AI & SOCIETY 2022. [DOI: 10.1007/s00146-022-01407-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
17
|
Neggaz I, Fizazi H. An Intelligent handcrafted feature selection using Archimedes optimization algorithm for facial analysis. Soft comput 2022; 26:10435-10464. [PMID: 35250374 PMCID: PMC8889074 DOI: 10.1007/s00500-022-06886-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/04/2022] [Indexed: 11/04/2022]
Abstract
Human facial analysis (HFA) has recently become an attractive topic for computer vision research due to technological progress and mobile applications. HFA explores several issues as gender recognition (GR), facial expression, age, and race recognition for automatically understanding social life. This study explores HFA from the angle of recognizing a person's gender from their face. Several hard challenges are provoked, such as illumination, occlusion, facial emotions, quality, and angle of capture by cameras, making gender recognition more difficult for machines. The Archimedes optimization algorithm (AOA) was recently designed as a metaheuristic-based population optimization method, inspired by the Archimedes theory's physical notion. Compared to other swarm algorithms in the realm of optimization, this method promotes a good balance between exploration and exploitation. The convergence area is increased By incorporating extra data into the solution, such as volume and density. Because of the preceding benefits of AOA and the fact that it has not been used to choose the best area of the face, we propose utilizing a wrapper feature selection technique, which is a real motivation in the field of computer vision and machine learning. The paper's primary purpose is to automatically determine the optimal face area using AOA to recognize the gender of a human person categorized by two classes (Men and women). In this paper, the facial image is divided into several subregions (blocks), where each area provides a vector of characteristics using one method from handcrafted techniques as the local binary pattern (LBP), histogram-oriented gradient (HOG), or gray-level co-occurrence matrix (GLCM). Two experiments assess the proposed method (AOA): The first employs two benchmarking datasets: the Georgia Tech Face dataset (GT) and the Brazilian FEI dataset. The second experiment represents a more challenging large dataset that uses Gallagher's uncontrolled dataset. The experimental results show the good performance of AOA compared to other recent and competitive optimizers for all datasets. In terms of accuracy, the AOA-based LBP outperforms the state-of-the-art deep convolutional neural network (CNN) with 96.08% for the Gallagher's dataset.
Collapse
Affiliation(s)
- Imène Neggaz
- Laboratoire Signal Image PArole (SIMPA), Département d’informatique, Faculté des Mathématiques et Informatique, Université des Sciences et de la Technologie d’Oran Mohamed Boudiaf, USTO-MB, EL M’naouer, BP 1505, 31000 Oran, Algérie
| | - Hadria Fizazi
- Laboratoire Signal Image PArole (SIMPA), Département d’informatique, Faculté des Mathématiques et Informatique, Université des Sciences et de la Technologie d’Oran Mohamed Boudiaf, USTO-MB, EL M’naouer, BP 1505, 31000 Oran, Algérie
| |
Collapse
|
18
|
Serraoui I, Laiadi O, Ouamane A, Dornaika F, Taleb-Ahmed A. Knowledge-based tensor subspace analysis system for kinship verification. Neural Netw 2022; 151:222-237. [DOI: 10.1016/j.neunet.2022.03.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Revised: 03/08/2022] [Accepted: 03/14/2022] [Indexed: 10/18/2022]
|
19
|
Automatic Ethnicity Classification from Middle Part of the Face Using Convolutional Neural Networks. INFORMATICS 2022. [DOI: 10.3390/informatics9010018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
In the field of face biometrics, finding the identity of a person in an image is most researched, but there are other, soft biometric information that are equally as important, such as age, gender, ethnicity or emotion. Nowadays, ethnicity classification has a wide application area and is a prolific area of research. This paper gives an overview of recent advances in ethnicity classification with focus on convolutional neural networks (CNNs) and proposes a new ethnicity classification method using only the middle part of the face and CNN. The paper also compares the differences in results of CNN with and without plotted landmarks. The proposed model was tested using holdout testing method on UTKFace dataset and FairFace dataset. The accuracy of the model was 80.34% for classification into five classes and 61.74% for classification into seven classes, which is slightly better than state-of-the-art, but it is also important to note that results in this paper are obtained by using only the middle part of the face which reduces the time and resources necessary.
Collapse
|
20
|
Harris EJ, Khoo IH, Demircan E. A Survey of Human Gait-Based Artificial Intelligence Applications. Front Robot AI 2022; 8:749274. [PMID: 35047564 PMCID: PMC8762057 DOI: 10.3389/frobt.2021.749274] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Accepted: 11/01/2021] [Indexed: 12/17/2022] Open
Abstract
We performed an electronic database search of published works from 2012 to mid-2021 that focus on human gait studies and apply machine learning techniques. We identified six key applications of machine learning using gait data: 1) Gait analysis where analyzing techniques and certain biomechanical analysis factors are improved by utilizing artificial intelligence algorithms, 2) Health and Wellness, with applications in gait monitoring for abnormal gait detection, recognition of human activities, fall detection and sports performance, 3) Human Pose Tracking using one-person or multi-person tracking and localization systems such as OpenPose, Simultaneous Localization and Mapping (SLAM), etc., 4) Gait-based biometrics with applications in person identification, authentication, and re-identification as well as gender and age recognition 5) “Smart gait” applications ranging from smart socks, shoes, and other wearables to smart homes and smart retail stores that incorporate continuous monitoring and control systems and 6) Animation that reconstructs human motion utilizing gait data, simulation and machine learning techniques. Our goal is to provide a single broad-based survey of the applications of machine learning technology in gait analysis and identify future areas of potential study and growth. We discuss the machine learning techniques that have been used with a focus on the tasks they perform, the problems they attempt to solve, and the trade-offs they navigate.
Collapse
Affiliation(s)
- Elsa J Harris
- Human Performance and Robotics Laboratory, Department of Mechanical and Aerospace Engineering, California State University Long Beach, Long Beach, CA, United States
| | - I-Hung Khoo
- Department of Electrical Engineering, California State University Long Beach, Long Beach, CA, United States.,Department of Biomedical Engineering, California State University Long Beach, Long Beach, CA, United States
| | - Emel Demircan
- Human Performance and Robotics Laboratory, Department of Mechanical and Aerospace Engineering, California State University Long Beach, Long Beach, CA, United States.,Department of Biomedical Engineering, California State University Long Beach, Long Beach, CA, United States
| |
Collapse
|
21
|
A hybrid deep learning and ensemble learning mechanism for damaged power line detection in smart grids. Soft comput 2021. [DOI: 10.1007/s00500-021-06482-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
22
|
Potential Fault Diagnosis Method and Classification Accuracy Detection of IGBT Device Based on Improved Single Hidden Layer Feedforward Neural Network. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2021; 2021:6036118. [PMID: 34630549 PMCID: PMC8500739 DOI: 10.1155/2021/6036118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Revised: 08/19/2021] [Accepted: 08/20/2021] [Indexed: 11/29/2022]
Abstract
Insulated Gate Bipolar Transistor (IGBT) is a high-power switch in the field of power electronics. Its reliability is closely related to system stability. Once failure occurs, it may cause irreparable loss. Therefore, potential fault diagnosis methods of IGBT devices are studied in this paper, and their classification accuracy is tested. Due to the shortcomings of incomplete data application in the traditional method of characterizing the device state based on point frequency parameters, a fault diagnosis method based on full frequency threshold screening was proposed in this paper, and its classification accuracy was detected by the Extreme Learning Machine (ELM) algorithm. The randomly generated input layer weight ω and hidden layer deviation lead to the change of output layer weight β and then affect the overall output result. In view of the errors and instability caused by this randomness, an improved Finite Impulse Response Filter ELM (FIR-ELM) training algorithm is proposed. The filtering technique is used to initialize the input weights of the Single Hidden Layer Feedforward Neural Network (SLFN). The hidden layer of SLFN is used as the preprocessor to achieve the minimum output error. To reduce the structural risk and empirical risk of SLFN, the simulation results of 300 groups of spectral data show that the improved FIR-ELM algorithm significantly improves the training accuracy and has good robustness compared with the traditional extreme learning machine algorithm.
Collapse
|
23
|
Liang S, Liu H, Yang F, Qin C, Feng Y. Classification of Benign and Malignant Pulmonary Nodules Using a Regularized Extreme Learning Machine. JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS 2021. [DOI: 10.1166/jmihi.2021.3448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
An L1/L2-norm-bound extreme learning machine classification algorithm is proposed to improve the accuracy of distinguishing between benign and malignant pulmonary nodules. In this algorithm, features extracted from the segmented lung nodule using the histogram of oriented gradients
method are used as inputs. L1-norm can promote sparsity in the weights of the output layer, and L2-norm can smooth output weights. The combination of the L1 norm and L2 norm can simplify the complexity of the network and prevent overfitting to improve classification accuracy. For each newly
tested lung nodule, the algorithm outputs a class label of either benign or malignant. The accuracy, sensitivity, and specificity reached 94.12%, 93%, and 95% respectively over the lung image database consortium and image database resource initiative dataset. Compared with other algorithms,
the average values of the three metrics increased by 6.5%, 7.94%, and 4.32%, respectively. An accuracy score of 95.83% can be achieved over a set of 120 urinary sediment images. Therefore, this algorithm has a good classification effect of pulmonary nodules.
Collapse
Affiliation(s)
- ShuFen Liang
- Faculty of Intelligent Manufacturing, Wuyi University, 529020, China
| | - HuiLin Liu
- Faculty of Intelligent Manufacturing, Wuyi University, 529020, China
| | - FangChen Yang
- Faculty of Intelligent Manufacturing, Wuyi University, 529020, China
| | - Chuanbo Qin
- Faculty of Intelligent Manufacturing, Wuyi University, 529020, China
| | - Yue Feng
- Faculty of Intelligent Manufacturing, Wuyi University, 529020, China
| |
Collapse
|
24
|
Tian Q, Cao M, Sun H, Qi L, Mao J, Cao Y, Tang J. Facial age estimation with bilateral relationships exploitation. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2020.07.149] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
25
|
Deep Learning Based Real Age and Gender Estimation from Unconstrained Face Image towards Smart Store Customer Relationship Management. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11104549] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The COVID-19 pandemic markedly changed the human shopping nature, necessitating a contactless shopping system to curb the spread of the contagious disease efficiently. Consequently, a customer opts for a store where it is possible to avoid physical contacts and shorten the shopping process with extended services such as personalized product recommendations. Automatic age and gender estimation of a customer in a smart store strongly benefit the consumer by providing personalized advertisement and product recommendation; similarly, it aids the smart store proprietor to promote sales and develop an inventory perpetually for the future retail. In our paper, we propose a deep learning-founded enterprise solution for smart store customer relationship management (CRM), which allows us to predict the age and gender from a customer’s face image taken in an unconstrained environment to facilitate the smart store’s extended services, as it is expected for a modern venture. For the age estimation problem, we mitigate the data sparsity problem of the large public IMDB-WIKI dataset by image enhancement from another dataset and perform data augmentation as required. We handle our classification tasks utilizing an empirically leading pre-trained convolutional neural network (CNN), the VGG-16 network, and incorporate batch normalization. Especially, the age estimation task is posed as a deep classification problem followed by a multinomial logistic regression first-moment refinement. We validate our system for two standard benchmarks, one for each task, and demonstrate state-of-the-art performance for both real age and gender estimation.
Collapse
|
26
|
Integrated Multi-Model Face Shape and Eye Attributes Identification for Hair Style and Eyelashes Recommendation. COMPUTATION 2021. [DOI: 10.3390/computation9050054] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Identifying human face shape and eye attributes is the first and most vital process before applying for the right hairstyle and eyelashes extension. The aim of this research work includes the development of a decision support program to constitute an aid system that analyses eye and face features automatically based on the image taken from a user. The system suggests a suitable recommendation of eyelashes type and hairstyle based on the automatic reported users’ eye and face features. To achieve the aim, we develop a multi-model system comprising three separate models; each model targeted a different task, including; face shape classification, eye attribute identification and gender detection model. Face shape classification system has been designed based on the development of a hybrid framework of handcrafting and learned feature. Eye attributes have been identified by exploiting the geometrical eye measurements using the detected eye landmarks. Gender identification system has been realised and designed by implementing a deep learning-based approach. The outputs of three developed models are merged to design a decision support system for haircut and eyelash extension recommendation. The obtained detection results demonstrate that the proposed method effectively identifies the face shape and eye attributes. Developing such computer-aided systems is suitable and beneficial for the user and would be beneficial to the beauty industrial.
Collapse
|
27
|
A hybrid approach of neural networks for age and gender classification through decision fusion. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2021.102459] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
28
|
Coming to Grips with Age Prediction on Imbalanced Multimodal Community Question Answering Data. INFORMATION 2021. [DOI: 10.3390/info12020048] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
For almost every online service, it is fundamental to understand patterns, differences and trends revealed by age demographic analysis—for example, take the discovery of malicious activity, including identity theft, violation of community guidelines and fake profiles. In the particular case of platforms such as Facebook, Twitter and Yahoo! Answers, user demographics have impacts on their revenues and user experience; demographics assist in ensuring that the needs of each cohort are fulfilled via personalizing and contextualizing content. Despite the fact that technology has been made more accessible, thereby becoming evermore prevalent in both personal and professional lives alike, older people continue to trail Gen Z and Millennials in its adoption. This trailing brings about an under-representation that has a harmful influence on the demographic analysis and on supervised machine learning models. To that end, this paper pioneers attempts at examining this and other major challenges facing three distinct modalities when dealing with community question answering (cQA) platforms (i.e., texts, images and metadata). As for textual inputs, we propose an age-batched greedy curriculum learning (AGCL) approach to lessen the effects of their inherent class imbalances. When built on top of FastText shallow neural networks, AGCL achieved an increase of ca. 4% in macro-F1-score with respect to baseline systems (i.e., off-the-shelf deep neural networks). With regard to metadata, our experiments show that random forest classifiers significantly improve their performance when individuals close to generational borders are excluded (up to 20% more accuracy); and by experimenting with neural network-based visual classifiers, we discovered that images are the most challenging modality for age prediction. In fact, it is hard for a visual inspection to connect profile pictures with age cohorts, and there are considerable differences in their group distributions with respect to meta-data and textual inputs. All in all, we envisage that our findings will be highly relevant as guidelines for constructing assorted multimodal supervised models for automatic age recognition across cQA platforms.
Collapse
|
29
|
Agbo-Ajala O, Viriri S. Deep learning approach for facial age classification: a survey of the state-of-the-art. Artif Intell Rev 2021; 54:179-213. [DOI: 10.1007/s10462-020-09855-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
30
|
Zhu X, Song B, Shi F, Chen Y, Hu R, Gan J, Zhang W, Li M, Wang L, Gao Y, Shan F, Shen D. Joint prediction and time estimation of COVID-19 developing severe symptoms using chest CT scan. Med Image Anal 2021; 67:101824. [PMID: 33091741 PMCID: PMC7547024 DOI: 10.1016/j.media.2020.101824] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2020] [Revised: 08/23/2020] [Accepted: 09/25/2020] [Indexed: 02/08/2023]
Abstract
With the rapidly worldwide spread of Coronavirus disease (COVID-19), it is of great importance to conduct early diagnosis of COVID-19 and predict the conversion time that patients possibly convert to the severe stage, for designing effective treatment plans and reducing the clinicians' workloads. In this study, we propose a joint classification and regression method to determine whether the patient would develop severe symptoms in the later time formulated as a classification task, and if yes, the conversion time will be predicted formulated as a classification task. To do this, the proposed method takes into account 1) the weight for each sample to reduce the outliers' influence and explore the problem of imbalance classification, and 2) the weight for each feature via a sparsity regularization term to remove the redundant features of the high-dimensional data and learn the shared information across two tasks, i.e., the classification and the regression. To our knowledge, this study is the first work to jointly predict the disease progression and the conversion time, which could help clinicians to deal with the potential severe cases in time or even save the patients' lives. Experimental analysis was conducted on a real data set from two hospitals with 408 chest computed tomography (CT) scans. Results show that our method achieves the best classification (e.g., 85.91% of accuracy) and regression (e.g., 0.462 of the correlation coefficient) performance, compared to all comparison methods. Moreover, our proposed method yields 76.97% of accuracy for predicting the severe cases, 0.524 of the correlation coefficient, and 0.55 days difference for the conversion time.
Collapse
Affiliation(s)
- Xiaofeng Zhu
- Center for Future Media and school of computer science and technology, University of Electronic Science and Technology of China, Chengdu 611731, China.
| | - Bin Song
- Department of Radiology, Sichuan University West China Hospital, Chengdu 610041, China.
| | - Feng Shi
- Department of Research and Development, Shanghai United Imaging Intelligence Co., Ltd., Shanghai 200232, China
| | - Yanbo Chen
- Department of Research and Development, Shanghai United Imaging Intelligence Co., Ltd., Shanghai 200232, China
| | - Rongyao Hu
- Center for Future Media and school of computer science and technology, University of Electronic Science and Technology of China, Chengdu 611731, China; School of Natural and Computational Sciences, Massey University Auckland, Auckland 0745, New Zealand
| | - Jiangzhang Gan
- Center for Future Media and school of computer science and technology, University of Electronic Science and Technology of China, Chengdu 611731, China; School of Natural and Computational Sciences, Massey University Auckland, Auckland 0745, New Zealand
| | - Wenhai Zhang
- Department of Research and Development, Shanghai United Imaging Intelligence Co., Ltd., Shanghai 200232, China
| | - Man Li
- Department of Research and Development, Shanghai United Imaging Intelligence Co., Ltd., Shanghai 200232, China
| | - Liye Wang
- Department of Research and Development, Shanghai United Imaging Intelligence Co., Ltd., Shanghai 200232, China
| | - Yaozong Gao
- Department of Research and Development, Shanghai United Imaging Intelligence Co., Ltd., Shanghai 200232, China
| | - Fei Shan
- Department of Radiology, Shanghai Public Health Clinical Center, Fudan University, Shanghai 201508, China.
| | - Dinggang Shen
- Department of Research and Development, Shanghai United Imaging Intelligence Co., Ltd., Shanghai 200232, China; School of Biomedical Engineering, ShanghaiTech University, Shanghai 201210, China; Department of Brain and Cognitive Engineering, Korea University, Seoul 02841, Republic of Korea.
| |
Collapse
|
31
|
Face Gender Recognition in the Wild: An Extensive Performance Comparison of Deep-Learned, Hand-Crafted, and Fused Features with Deep and Traditional Models. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app11010089] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Face gender recognition has many useful applications in human–robot interactions as it can improve the overall user experience. Support vector machines (SVM) and convolutional neural networks (CNNs) have been used successfully in this domain. Researchers have shown an increased interest in comparing and combining different feature extraction paradigms, including deep-learned features, hand-crafted features, and the fusion of both features. Related research in face gender recognition has been mostly restricted to limited comparisons of the deep-learned and fused features with the CNN model or only deep-learned features with the CNN and SVM models. In this work, we perform a comprehensive comparative study to analyze the classification performance of two widely used learning models (i.e., CNN and SVM), when they are combined with seven features that include hand-crafted, deep-learned, and fused features. The experiments were performed using two challenging unconstrained datasets, namely, Adience and Labeled Faces in the Wild. Further, we used T-tests to assess the statistical significance of the differences in performances with respect to the accuracy, f-score, and area under the curve. Our results proved that SVMs showed best performance with fused features, whereas CNN showed the best performance with deep-learned features. CNN outperformed SVM significantly at p < 0.05.
Collapse
|
32
|
Deep Wide Spatial-Temporal Based Transformer Networks Modeling for the Next Destination According to the Taxi Driver Behavior Prediction. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app11010017] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
This paper uses a neural network approach transformer of taxi driver behavior to predict the next destination with geographical factors. The problem of predicting the next destination is a well-studied application of human mobility, for reducing traffic congestion and optimizing the electronic dispatching system’s performance. According to the Intelligent Transport System (ITS), this kind of task is usually modeled as a multi-class problem. We propose the novel model Deep Wide Spatial-Temporal-Based Transformer Networks (DWSTTNs). In our approach, the encoder and decoder are the transformer’s primary units; with the help of Location-Based Social Networks (LBSN), we encode the geographical information based on visited semantic locations. In particular, we trained our model for the exact longitude and latitude coordinates to predict the next destination. The benefit in the real world of this kind of research is to reduce the customer waiting time for a ride and driver waiting time to pick up a customer. Taxi companies can also optimize their management to improve their company’s service, while urban transport planner can use this information to better plan the urban traffic. We conducted extensive experiments on two real-word datasets, Porto and Manhattan, and the performance was improved compared to the previous models.
Collapse
|
33
|
A Hybrid Deep Learning Model and Comparison for Wind Power Forecasting Considering Temporal-Spatial Feature Extraction. SUSTAINABILITY 2020. [DOI: 10.3390/su12229490] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
The inherent intermittency and uncertainty of wind power have brought challenges in accurate wind power output forecasting, which also cause tricky problems in the integration of wind power to the grid. In this paper, a hybrid deep learning model bidirectional long short term memory-convolutional neural network (BiLSTM-CNN) is proposed for short-term wind power forecasting. First, the grey correlation analysis is utilized to select the inputs for forecasting model; Then, the proposed hybrid model extracts multi-dimension features of inputs to predict the wind power from the temporal-spatial perspective, where the Bi-LSTM model is utilized to mine the bidirectional temporal characteristics while the convolution and pooling operations of CNN are utilized to extract the spatial characteristics from multiple input time series. Lastly, a case study is conducted to verify the superiority of the proposed model. Other deep learning models (Bi-LSTM, LSTM, CNN, LSTM-CNN, CNN-BiLSTM, CNN-LSTM) are also simulated to conduct comparison from three aspects. The results show that the BiLSTM-CNN model has the best accuracy with the lowest RMSE of 2.5492, MSE of 6.4984, MAE of 1.7344 and highest R2 of 0.9929. CNN has the fastest speed with an average computational time of 0.0741s. The hybrid model that mines the spatial feature based on the extracted temporal feature has a better performance than the model mines the temporal feature based on the extracted spatial feature.
Collapse
|
34
|
Image-based insect species and gender classification by trained supervised machine learning algorithms. ECOL INFORM 2020. [DOI: 10.1016/j.ecoinf.2020.101135] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
35
|
Qing Y, Zeng Y, Li Y, Huang GB. Deep and wide feature based extreme learning machine for image classification. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2020.06.110] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
|
36
|
Wei B, Hao K, Gao L, Tang XS, Zhao Y. A biologically inspired visual integrated model for image classification. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2020.04.081] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
37
|
Huang X, Lei Q, Xie T, Zhang Y, Hu Z, Zhou Q. Deep Transfer Convolutional Neural Network and Extreme Learning Machine for lung nodule diagnosis on CT images. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2020.106230] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
38
|
Vargas VM, Gutiérrez PA, Hervás-Martínez C. Cumulative link models for deep ordinal classification. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2020.03.034] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
39
|
Using Convolutional Neural Networks Based on a Taguchi Method for Face Gender Recognition. ELECTRONICS 2020. [DOI: 10.3390/electronics9081227] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
In general, a convolutional neural network (CNN) consists of one or more convolutional layers, pooling layers, and fully connected layers. Most designers adopt a trial-and-error method to select CNN parameters. In this study, an AlexNet network with optimized parameters is proposed for face image recognition. A Taguchi method is used for selecting preliminary factors and experiments are performed through orthogonal table design. The proposed method filters out factors that are significantly affected. Finally, experimental results show that the proposed Taguchi-based AlexNet network obtains 87.056% and 98.72% average accuracy of image gender recognition in the CIA and MORPH databases, respectively. In addition, the average accuracy of the proposed Taguchi-based AlexNet network is 1.576% and 3.47% higher than that of the original AlexNet network in CIA and MORPH databases, respectively.
Collapse
|
40
|
Abstract
This paper describes the investigation results about the usage of shallow (limited by few layers only) convolutional neural networks (CNNs) to solve the video-based gender classification problem. Different architectures of shallow CNN are proposed, trained and tested using balanced and unbalanced static image datasets. The influence of diverse voting over confidences methods, applied for frame-by-frame gender classification of the video stream, is investigated for possible enhancement of the classification accuracy. The possibility of the grouping of shallow networks into ensembles is investigated; it has been shown that the accuracy may be more improved with the further voting of separate shallow CNN classification results inside an ensemble over a single frame or different ones.
Collapse
Affiliation(s)
- Oleksii Gorokhovatskyi
- Department of Informatics and Computer Engineering, Simon Kuznets Kharkiv National University of Economics, Nauky Avenue Science 9-A, 61166 Kharkov, Ukraine
| | - Olena Peredrii
- Department of Informatics and Computer Engineering, Simon Kuznets Kharkiv National University of Economics, Nauky Avenue Science 9-A, 61166 Kharkov, Ukraine
| |
Collapse
|
41
|
Risnandar, Prakasa E, Erwin IM, Gojali EA, Herlan, Lestari P. Deep salient wood image-based quality assessment. SN APPLIED SCIENCES 2020. [DOI: 10.1007/s42452-020-2671-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
|
42
|
Culquicondor A, Baldassin A, Castelo-Fernández C, de Carvalho JP, Papa JP. An efficient parallel implementation for training supervised optimum-path forest classifiers. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2018.10.115] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
43
|
J-LDFR: joint low-level and deep neural network feature representations for pedestrian gender classification. Neural Comput Appl 2020. [DOI: 10.1007/s00521-020-05015-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
44
|
Deeply Learned Classifiers for Age and Gender Predictions of Unfiltered Faces. ScientificWorldJournal 2020; 2020:1289408. [PMID: 32395084 PMCID: PMC7201854 DOI: 10.1155/2020/1289408] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Accepted: 03/11/2020] [Indexed: 11/17/2022] Open
Abstract
Age and gender predictions of unfiltered faces classify unconstrained real-world facial images into predefined age and gender. Significant improvements have been made in this research area due to its usefulness in intelligent real-world applications. However, the traditional methods on the unfiltered benchmarks show their incompetency to handle large degrees of variations in those unconstrained images. More recently, Convolutional Neural Networks (CNNs) based methods have been extensively used for the classification task due to their excellent performance in facial analysis. In this work, we propose a novel end-to-end CNN approach, to achieve robust age group and gender classification of unfiltered real-world faces. The two-level CNN architecture includes feature extraction and classification itself. The feature extraction extracts feature corresponding to age and gender, while the classification classifies the face images to the correct age group and gender. Particularly, we address the large variations in the unfiltered real-world faces with a robust image preprocessing algorithm that prepares and processes those faces before being fed into the CNN model. Technically, our network is pretrained on an IMDb-WIKI with noisy labels and then fine-tuned on MORPH-II and finally on the training set of the OIU-Adience (original) dataset. The experimental results, when analyzed for classification accuracy on the same OIU-Adience benchmark, show that our model obtains the state-of-the-art performance in both age group and gender classification. It improves over the best-reported results by 16.6% (exact accuracy) and 3.2% (one-off accuracy) for age group classification and also there is an improvement of 3.0% (exact accuracy) for gender classification.
Collapse
|
45
|
Hasan AM, AL-Jawad MM, Jalab HA, Shaiba H, Ibrahim RW, AL-Shamasneh AR. Classification of Covid-19 Coronavirus, Pneumonia and Healthy Lungs in CT Scans Using Q-Deformed Entropy and Deep Learning Features. ENTROPY (BASEL, SWITZERLAND) 2020; 22:E517. [PMID: 33286289 PMCID: PMC7517011 DOI: 10.3390/e22050517] [Citation(s) in RCA: 59] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/04/2020] [Revised: 04/28/2020] [Accepted: 04/28/2020] [Indexed: 12/24/2022]
Abstract
Many health systems over the world have collapsed due to limited capacity and a dramatic increase of suspected COVID-19 cases. What has emerged is the need for finding an efficient, quick and accurate method to mitigate the overloading of radiologists' efforts to diagnose the suspected cases. This study presents the combination of deep learning of extracted features with the Q-deformed entropy handcrafted features for discriminating between COVID-19 coronavirus, pneumonia and healthy computed tomography (CT) lung scans. In this study, pre-processing is used to reduce the effect of intensity variations between CT slices. Then histogram thresholding is used to isolate the background of the CT lung scan. Each CT lung scan undergoes a feature extraction which involves deep learning and a Q-deformed entropy algorithm. The obtained features are classified using a long short-term memory (LSTM) neural network classifier. Subsequently, combining all extracted features significantly improves the performance of the LSTM network to precisely discriminate between COVID-19, pneumonia and healthy cases. The maximum achieved accuracy for classifying the collected dataset comprising 321 patients is 99.68%.
Collapse
Affiliation(s)
- Ali M. Hasan
- College of Medicine, Al-Nahrain University, Baghdad 10001, Iraq;
| | | | - Hamid A. Jalab
- Faculty of Computer Science & Information Technology, University of Malaya, Kuala Lumpur 50603, Malaysia;
| | - Hadil Shaiba
- Department of Computer Science, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh 84428, Saudi Arabia;
| | - Rabha W. Ibrahim
- Informetrics Research Group, Ton Duc Thang University, Ho Chi Minh City 758307, Vietnam;
- Faculty of Mathematics and Statistics, Ton Duc Thang University, Ho Chi Minh City 758307, Vietnam
| | - Ala’a R. AL-Shamasneh
- Faculty of Computer Science & Information Technology, University of Malaya, Kuala Lumpur 50603, Malaysia;
| |
Collapse
|
46
|
Human Gender Classification Using Transfer Learning via Pareto Frontier CNN Networks. INVENTIONS 2020. [DOI: 10.3390/inventions5020016] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Human gender is deemed as a prime demographic trait due to its various usage in the practical domain. Human gender classification in an unconstrained environment is a sophisticated task due to large variations in the image scenarios. Due to the multifariousness of internet images, the classification accuracy suffers from traditional machine learning methods. The aim of this research is to streamline the gender classification process using the transfer learning concept. This research proposes a framework that performs automatic gender classification in unconstrained internet images deploying Pareto frontier deep learning networks; GoogleNet, SqueezeNet, and ResNet50. We analyze the experiment with three different Pareto frontier Convolutional Neural Network (CNN) models pre-trained on ImageNet. The massive experiments demonstrate that the performance of the Pareto frontier CNN networks is remarkable in the unconstrained internet image dataset as well as in the frontal images that pave the way to developing an automatic gender classification system.
Collapse
|
47
|
Khan K, Attique M, Khan RU, Syed I, Chung TS. A Multi-Task Framework for Facial Attributes Classification through End-to-End Face Parsing and Deep Convolutional Neural Networks. SENSORS (BASEL, SWITZERLAND) 2020; 20:E328. [PMID: 31935996 PMCID: PMC7014093 DOI: 10.3390/s20020328] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Revised: 12/29/2019] [Accepted: 12/30/2019] [Indexed: 11/17/2022]
Abstract
Human face image analysis is an active research area within computer vision. In this paper we propose a framework for face image analysis, addressing three challenging problems of race, age, and gender recognition through face parsing. We manually labeled face images for training an end-to-end face parsing model through Deep Convolutional Neural Networks. The deep learning-based segmentation model parses a face image into seven dense classes. We use the probabilistic classification method and created probability maps for each face class. The probability maps are used as feature descriptors. We trained another Convolutional Neural Network model by extracting features from probability maps of the corresponding class for each demographic task (race, age, and gender). We perform extensive experiments on state-of-the-art datasets and obtained much better results as compared to previous results.
Collapse
Affiliation(s)
- Khalil Khan
- Department of Electrical Engineering, University of Azad Jammu and Kashmir, Muzaffarabad 13100, Pakistan
- Intelligent Analytics Group (IAG), College of Computer, Qassim University, Al-Mulida 51431, Saudi Arabia
| | | | - Rehan Ullah Khan
- Department of Information Technology, College of Computer, Qassim University, Al-Mulida 51431, Saudi Arabia;
- Intelligent Analytics Group (IAG), College of Computer, Qassim University, Al-Mulida 51431, Saudi Arabia
| | - Ikram Syed
- Department of Computer Science, The Superior College, Lahore 54000, Pakistan;
| | - Tae-Sun Chung
- Department of Computer Engineering, Ajou University, Ajou 16499, Korea;
| |
Collapse
|
48
|
Agbo-Ajala O, Viriri S. A Lightweight Convolutional Neural Network for Real and Apparent Age Estimation in Unconstrained Face Images. IEEE ACCESS 2020; 8:162800-162808. [DOI: 10.1109/access.2020.3022039] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/29/2023]
|
49
|
Akl A, El-Henawy I, Salah A, Li K. Optimizing deep neural networks hyperparameter positions and values. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2019. [DOI: 10.3233/jifs-190033] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Ahmed Akl
- Computer Science Department, College of Computers and Informatics, Zagazig University, Zagazig, Egypt
| | - Ibrahim El-Henawy
- Computer Science Department, College of Computers and Informatics, Zagazig University, Zagazig, Egypt
| | - Ahmad Salah
- Computer Science Department, College of Computers and Informatics, Zagazig University, Zagazig, Egypt
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Kenli Li
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| |
Collapse
|
50
|
Magnetic flux leakage image classification method for pipeline weld based on optimized convolution kernel. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2019.07.083] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|