1
|
Pedestrian gender classification on imbalanced and small sample datasets using deep and traditional features. Neural Comput Appl 2023. [DOI: 10.1007/s00521-023-08331-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2023]
|
2
|
Parashar A, Parashar A, Ding W, Shekhawat RS, Rida I. Deep learning pipelines for recognition of gait biometrics with covariates: a comprehensive review. Artif Intell Rev 2023. [DOI: 10.1007/s10462-022-10365-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
|
3
|
Akbari A, Awais M, Feng ZH, Farooq A, Kittler J. Distribution Cognisant Loss for Cross-Database Facial Age Estimation With Sensitivity Analysis. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:1869-1887. [PMID: 33026982 DOI: 10.1109/tpami.2020.3029486] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Existing facial age estimation studies have mostly focused on intra-database protocols that assume training and test images are captured under similar conditions. This is rarely valid in practical applications, where we typically encounter training and test sets with different characteristics. In this article, we deal with such situations, namely subjective-exclusive cross-database age estimation. We formulate the age estimation problem as the distribution learning framework, where the age labels are encoded as a probability distribution. To improve the cross-database age estimation performance, we propose a new loss function which provides a more robust measure of the difference between ground-truth and predicted distributions. The desirable properties of the proposed loss function are theoretically analysed and compared with the state-of-the-art approaches. In addition, we compile a new balanced large-scale age estimation database. Last, we introduce a novel evaluation protocol, called subject-exclusive cross-database age estimation protocol, which provides meaningful information of a method in terms of the generalisation capability. The experimental results demonstrate that the proposed approach outperforms the state-of-the-art age estimation methods under both intra-database and subject-exclusive cross-database evaluation protocols. In addition, in this article, we provide a comparative sensitivity analysis of various algorithms to identify trends and issues inherent to their performance. This analysis introduces some open problems to the community which might be considered when designing a robust age estimation system.
Collapse
|
4
|
Lahens NF, Rahman M, Cohen JB, Cohen DL, Chen J, Weir MR, Feldman HI, Grant GR, Townsend RR, Skarke C, Study Investigators* ATCRIC. Time-specific associations of wearable sensor-based cardiovascular and behavioral readouts with disease phenotypes in the outpatient setting of the Chronic Renal Insufficiency Cohort. Digit Health 2022; 8:20552076221107903. [PMID: 35746950 PMCID: PMC9210076 DOI: 10.1177/20552076221107903] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Accepted: 05/30/2022] [Indexed: 11/15/2022] Open
Abstract
Patients with chronic kidney disease are at risk of developing cardiovascular disease. To facilitate out-of-clinic evaluation, we piloted wearable device-based analysis of heart rate variability and behavioral readouts in patients with chronic kidney disease from the Chronic Renal Insufficiency Cohort and controls (n = 49). Time-specific partitioning of heart rate variability readouts confirm higher parasympathetic nervous activity during the night (mean RR at night 14.4 ± 1.9 ms vs. 12.8 ± 2.1 ms during active hours; n = 47, analysis of variance (ANOVA) q = 0.001). The α2 long-term fluctuations in the detrended fluctuation analysis, a parameter predictive of cardiovascular mortality, significantly differentiated between diabetic and nondiabetic patients (prominent at night with 0.58 ± 0.2 vs. 0.45 ± 0.12, respectively, adj. p = 0.004). Both diabetic and nondiabetic chronic kidney disease patients showed loss of rhythmic organization compared to controls, with diabetic chronic kidney disease patients exhibiting deconsolidation of peak phases between their activity and standard deviation of interbeat intervals rhythms (mean phase difference chronic kidney disease 8.3 h, chronic kidney disease/type 2 diabetes mellitus 4 h, controls 6.8 h). This work provides a roadmap toward deriving actionable clinical insights from the data collected by wearable devices outside of highly controlled clinical environments.
Collapse
Affiliation(s)
- Nicholas F. Lahens
- Institute for Translational Medicine and Therapeutics (ITMAT), University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
- Department of Systems Pharmacology and Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, Philadelphia,
PA, USA
| | - Mahboob Rahman
- University Hospitals Cleveland Medical Center, Case Western Reserve University, Cleveland, OH, USA
| | - Jordana B. Cohen
- Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
- Department of Biostatistics and Epidemiology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Debbie L. Cohen
- Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Jing Chen
- Department of Medicine, Tulane University School of Medicine, New Orleans, LA, USA
| | - Matthew R. Weir
- Division of Nephrology, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Harold I. Feldman
- Department of Biostatistics and Epidemiology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Gregory R. Grant
- Institute for Translational Medicine and Therapeutics (ITMAT), University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
- Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Raymond R. Townsend
- Institute for Translational Medicine and Therapeutics (ITMAT), University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
- Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Carsten Skarke
- Institute for Translational Medicine and Therapeutics (ITMAT), University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
- Department of Systems Pharmacology and Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, Philadelphia,
PA, USA
- Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | | |
Collapse
|
5
|
Gender Classification Using Proposed CNN-Based Model and Ant Colony Optimization. MATHEMATICS 2021. [DOI: 10.3390/math9192499] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Pedestrian gender classification is one of the key assignments of pedestrian study, and it finds practical applications in content-based image retrieval, population statistics, human–computer interaction, health care, multimedia retrieval systems, demographic collection, and visual surveillance. In this research work, gender classification was carried out using a deep learning approach. A new 64-layer architecture named 4-BSMAB derived from deep AlexNet is proposed. The proposed model was trained on CIFAR-100 dataset utilizing SoftMax classifier. Then, features were obtained from applied datasets with this pre-trained model. The obtained feature set was optimized with ant colony system (ACS) optimization technique. Various classifiers of SVM and KNN were used to perform gender classification utilizing the optimized feature set. Comprehensive experimentation was performed on gender classification datasets, and proposed model produced better results than the existing methods. The suggested model attained highest accuracy, i.e., 85.4%, and 92% AUC on MIT dataset, and best classification results, i.e., 93% accuracy and 96% AUC, on PKU-Reid dataset. The outcomes of extensive experiments carried out on existing standard pedestrian datasets demonstrate that the proposed framework outperformed existing pedestrian gender classification methods, and acceptable results prove the proposed model as a robust model.
Collapse
|
6
|
Fei L, Zhang B, Tian C, Teng S, Wen J. Jointly learning multi-instance hand-based biometric descriptor. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.01.086] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
7
|
Alonso‐Fernandez F, Hernandez‐Diaz K, Ramis S, Perales FJ, Bigun J. Facial masks and soft‐biometrics: Leveraging face recognition CNNs for age and gender prediction on mobile ocular images. IET BIOMETRICS 2021. [DOI: 10.1049/bme2.12046] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Affiliation(s)
| | | | - Silvia Ramis
- Computer Graphics and Vision and AI Group University of Balearic Islands Spain
| | | | - Josef Bigun
- School of Information Technology Halmstad University Sweden
| |
Collapse
|
8
|
Wang P, Zhou Y, Li Z, Zhang D. GBCI: Adaptive Frequency Band Learning for Gender Recognition in Brain-Computer Interfaces. ARTIF INTELL 2021. [DOI: 10.1007/978-3-030-93046-2_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
9
|
Mirjalili V, Raschka S, Ross A. PrivacyNet: Semi-Adversarial Networks for Multi-attribute Face Privacy. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2020; PP:9400-9412. [PMID: 32956058 DOI: 10.1109/tip.2020.3024026] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Recent research has established the possibility of deducing soft-biometric attributes such as age, gender and race from an individual's face image with high accuracy. However, this raises privacy concerns, especially when face images collected for biometric recognition purposes are used for attribute analysis without the person's consent. To address this problem, we develop a technique for imparting soft biometric privacy to face images via an image perturbation methodology. The image perturbation is undertaken using a GAN-based Semi-Adversarial Network (SAN) - referred to as PrivacyNet - that modifies an input face image such that it can be used by a face matcher for matching purposes but cannot be reliably used by an attribute classifier. Further, PrivacyNet allows a person to choose specific attributes that have to be obfuscated in the input face images (e.g., age and race), while allowing for other types of attributes to be extracted (e.g., gender). Extensive experiments using multiple face matchers, multiple age/gender/race classifiers, and multiple face datasets demonstrate the generalizability of the proposed multi-attribute privacy enhancing method across multiple face and attribute classifiers.
Collapse
|
10
|
Carletti V, Greco A, Percannella G, Vento M. Age from Faces in the Deep Learning Revolution. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2020; 42:2113-2132. [PMID: 30990174 DOI: 10.1109/tpami.2019.2910522] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Face analysis includes a variety of specific problems as face detection, person identification, gender and ethnicity recognition, just to name the most common ones; in the last two decades, significant research efforts have been devoted to the challenging task of age estimation from faces, as witnessed by the high number of published papers. The explosion of the deep learning paradigm, that is determining a spectacular increasing of the performance, is in the public eye; consequently, the number of approaches based on deep learning is impressively growing and this also happened for age estimation. The exciting results obtained have been recently surveyed on almost all the specific face analysis problems; the only exception stands for age estimation, whose last survey dates back to 2010 and does not include any deep learning based approach to the problem. This paper provides an analysis of the deep methods proposed in the last six years; these are analysed from different points of view: the network architecture together with the learning procedure, the used datasets, data preprocessing and augmentation, and the exploitation of additional data coming from gender, race and face expression. The review is completed by discussing the results obtained on public datasets, so as the impact of different aspects on system performance, together with still open issues.
Collapse
|
11
|
|
12
|
Belo D, Bento N, Silva H, Fred A, Gamboa H. ECG Biometrics Using Deep Learning and Relative Score Threshold Classification. SENSORS 2020; 20:s20154078. [PMID: 32707861 PMCID: PMC7435887 DOI: 10.3390/s20154078] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Revised: 07/19/2020] [Accepted: 07/20/2020] [Indexed: 11/25/2022]
Abstract
The field of biometrics is a pattern recognition problem, where the individual traits are coded, registered, and compared with other database records. Due to the difficulties in reproducing Electrocardiograms (ECG), their usage has been emerging in the biometric field for more secure applications. Inspired by the high performance shown by Deep Neural Networks (DNN) and to mitigate the intra-variability challenges displayed by the ECG of each individual, this work proposes two architectures to improve current results in both identification (finding the registered person from a sample) and authentication (prove that the person is whom it claims) processes: Temporal Convolutional Neural Network (TCNN) and Recurrent Neural Network (RNN). Each architecture produces a similarity score, based on the prediction error of the former and the logits given by the last, and fed to the same classifier, the Relative Score Threshold Classifier (RSTC).The robustness and applicability of these architectures were trained and tested on public databases used by literature in this context: Fantasia, MIT-BIH, and CYBHi databases. Results show that overall the TCNN outperforms the RNN achieving almost 100%, 96%, and 90% accuracy, respectively, for identification and 0.0%, 0.1%, and 2.2% equal error rate (EER) for authentication processes. When comparing to previous work, both architectures reached results beyond the state-of-the-art. Nevertheless, the improvement of these techniques, such as enriching training with extra varied data and transfer learning, may provide more robust systems with a reduced time required for validation.
Collapse
Affiliation(s)
- David Belo
- LIBPhys, Physics Department, Faculty of Sciences and Technology, Nova University of Lisbon, 2825-149 Caparica, Portugal; (N.B.); (H.G.)
- Correspondence:
| | - Nuno Bento
- LIBPhys, Physics Department, Faculty of Sciences and Technology, Nova University of Lisbon, 2825-149 Caparica, Portugal; (N.B.); (H.G.)
| | - Hugo Silva
- Instituto de Telecomunicacoes, Instituto Superior Tecnico (IST), Technical University of Lisbon, 1049-001 Lisboa, Portugal; (H.S.); (A.F.)
| | - Ana Fred
- Instituto de Telecomunicacoes, Instituto Superior Tecnico (IST), Technical University of Lisbon, 1049-001 Lisboa, Portugal; (H.S.); (A.F.)
| | - Hugo Gamboa
- LIBPhys, Physics Department, Faculty of Sciences and Technology, Nova University of Lisbon, 2825-149 Caparica, Portugal; (N.B.); (H.G.)
| |
Collapse
|
13
|
J-LDFR: joint low-level and deep neural network feature representations for pedestrian gender classification. Neural Comput Appl 2020. [DOI: 10.1007/s00521-020-05015-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
14
|
Kempfert KC, Wang Y, Chen C, Wong SW. A comparison study on nonlinear dimension reduction methods with kernel variations: Visualization, optimization and classification. INTELL DATA ANAL 2020. [DOI: 10.3233/ida-194486] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
| | - Yishi Wang
- University of North Carolina Wilmington, Wilmington, NC, USA
| | - Cuixian Chen
- University of North Carolina Wilmington, Wilmington, NC, USA
| | | |
Collapse
|
15
|
Martinho-Corbishley D, Nixon MS, Carter JN. Super-Fine Attributes with Crowd Prototyping. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2019; 41:1486-1500. [PMID: 29994759 DOI: 10.1109/tpami.2018.2836900] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Recognising human attributes from surveillance footage is widely studied for attribute-based re-identification. However, most works assume coarse, expertly-defined categories, ineffective in describing challenging images. Such brittle representations are limited in descriminitive power and hamper the efficacy of learnt estimators. We aim to discover more relevant and precise subject descriptions, improving image retrieval and closing the semantic gap. Inspired by fine-grained and relative attributes, we introduce super-fine attributes, which now describe multiple, integral concepts of a single trait as multi-dimensional perceptual coordinates. Crowd prototyping facilitates efficient crowdsourcing of super-fine labels by pre-discovering salient perceptual concepts for prototype matching. We re-annotate gender, age and ethnicity traits from PETA, a highly diverse (19K instances, 8.7K identities) amalgamation of 10 re-id datasets including VIPER, CUHK and TownCentre. Employing joint attribute regression with the ResNet-152 CNN, we demonstrate substantially improved ranked retrieval performance with super-fine attributes in comparison to conventional binary labels, reporting up to a 11.2 and 14.8 percent mAP improvement for gender and age, further surpassed by ethnicity. We also find our 3 super-fine traits to outperform 35 binary attributes by 6.5 percent mAP for subject retrieval in a challenging zero-shot identification scenario.
Collapse
|
16
|
Zhan S, Wu J, Han N, Wen J, Fang X. Unsupervised feature extraction by low-rank and sparsity preserving embedding. Neural Netw 2019; 109:56-66. [DOI: 10.1016/j.neunet.2018.10.001] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2018] [Revised: 09/18/2018] [Accepted: 10/05/2018] [Indexed: 11/24/2022]
|