1
|
Hu Z, Luo Z, Wang Y, Zhou Q, Liu S, Wang Q. Texture Feature Extraction from 1H NMR Spectra for the Geographical Origin Traceability of Chinese Yam. Foods 2023; 12:2476. [PMID: 37444214 DOI: 10.3390/foods12132476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 06/20/2023] [Accepted: 06/20/2023] [Indexed: 07/15/2023] Open
Abstract
Adulteration is widespread in the herbal and food industry and seriously restricts traditional Chinese medicine development. Accurate identification of geo-authentic herbs ensures drug safety and effectiveness. In this study, 1H NMR combined intelligent "rotation-invariant uniform local binary pattern" identification was implemented for the geographical origin confirmation of geo-authentic Chinese yam (grown in Jiaozuo, Henan province) from Chinese yams grown in other locations. Our results showed that the texture feature of 1H NMR image extracted with rotation-invariant uniform local binary pattern for identification is far superior compared to the original NMR data. Furthermore, data preprocessing is necessary. Moreover, the model combining a feature extraction algorithm and support vector machine (SVM) classifier demonstrated good robustness. This approach is advantageous, as it is accurate, rapid, simple, and inexpensive. It is also suitable for the geographical origin traceability of other geographical indication agricultural products.
Collapse
Affiliation(s)
- Zhongyi Hu
- College of Computer Science and Artifical Intelligence, Wenzhou University, Wenzhou 325035, China
- Intelligent Information Systems Institute, Wenzhou University, Wenzhou 325035, China
| | - Zhenzhen Luo
- Zhenhai District Finance Bureau, Ningbo 315202, China
| | - Yanli Wang
- National Health Commission Key Laboratory of Birth Defect Prevention, Henan Institute of Reproductive Health Science and Technology, Zhengzhou 450002, China
| | - Qiuju Zhou
- College of Chemistry and Chemical Engineering, Xinyang Normal University, Xinyang 464000, China
| | - Shuangyan Liu
- High & New Technology Research Center, Henan Academy of Sciences, Zhengzhou 450002, China
| | - Qiang Wang
- High & New Technology Research Center, Henan Academy of Sciences, Zhengzhou 450002, China
- School of Medicine, Huanghe Science and Technology College, Zhengzhou 450063, China
| |
Collapse
|
2
|
Azad AK, Ahmed I, Ahmed MU. In Search of an Efficient and Reliable Deep Learning Model for Identification of COVID-19 Infection from Chest X-ray Images. Diagnostics (Basel) 2023; 13:diagnostics13030574. [PMID: 36766679 PMCID: PMC9914163 DOI: 10.3390/diagnostics13030574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 12/08/2022] [Accepted: 01/17/2023] [Indexed: 02/08/2023] Open
Abstract
The virus responsible for COVID-19 is mutating day by day with more infectious characteristics. With the limited healthcare resources and overburdened medical practitioners, it is almost impossible to contain this virus. The automatic identification of this viral infection from chest X-ray (CXR) images is now more demanding as it is a cheaper and less time-consuming diagnosis option. To that cause, we have applied deep learning (DL) approaches for four-class classification of CXR images comprising COVID-19, normal, lung opacity, and viral pneumonia. At first, we extracted features of CXR images by applying a local binary pattern (LBP) and pre-trained convolutional neural network (CNN). Afterwards, we utilized a pattern recognition network (PRN), support vector machine (SVM), decision tree (DT), random forest (RF), and k-nearest neighbors (KNN) classifiers on the extracted features to classify aforementioned four-class CXR images. The performances of the proposed methods have been analyzed rigorously in terms of classification performance and classification speed. Among different methods applied to the four-class test images, the best method achieved classification performances with 97.41% accuracy, 94.94% precision, 94.81% recall, 98.27% specificity, and 94.86% F1 score. The results indicate that the proposed method can offer an efficient and reliable framework for COVID-19 detection from CXR images, which could be immensely conducive to the effective diagnosis of COVID-19-infected patients.
Collapse
|
3
|
Manic KS, Rajinikanth V, Al-Bimani AS, Taniar D, Kadry S. Framework to Detect Schizophrenia in Brain MRI Slices with Mayfly Algorithm-Selected Deep and Handcrafted Features. Sensors (Basel) 2022; 23:280. [PMID: 36616876 PMCID: PMC9823879 DOI: 10.3390/s23010280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 12/21/2022] [Accepted: 12/24/2022] [Indexed: 06/17/2023]
Abstract
Brain abnormality causes severe human problems, and thorough screening is necessary to identify the disease. In clinics, bio-image-supported brain abnormality screening is employed mainly because of its investigative accuracy compared with bio-signal (EEG)-based practice. This research aims to develop a reliable disease screening framework for the automatic identification of schizophrenia (SCZ) conditions from brain MRI slices. This scheme consists following phases: (i) MRI slices collection and pre-processing, (ii) implementation of VGG16 to extract deep features (DF), (iii) collection of handcrafted features (HF), (iv) mayfly algorithm-supported optimal feature selection, (v) serial feature concatenation, and (vi) binary classifier execution and validation. The performance of the proposed scheme was independently tested with DF, HF, and concatenated features (DF+HF), and the achieved outcome of this study verifies that the schizophrenia screening accuracy with DF+HF is superior compared with other methods. During this work, 40 patients’ brain MRI images (20 controlled and 20 SCZ class) were considered for the investigation, and the following accuracies were achieved: DF provided >91%, HF obtained >85%, and DF+HF achieved >95%. Therefore, this framework is clinically significant, and in the future, it can be used to inspect actual patients’ brain MRI slices.
Collapse
Affiliation(s)
- K. Suresh Manic
- National University of Science and Technology, Muscat P.O. Box 112, Oman
| | - Venkatesan Rajinikanth
- Department of Computer Science and Engineering, Division of Research and Innovation, Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Chennai 602105, India
| | - Ali Saud Al-Bimani
- National University of Science and Technology, Muscat P.O. Box 112, Oman
| | - David Taniar
- Faculty of Information Technology, Monash University, Wellington Rd, Clayton, VIC 3800, Australia
| | - Seifedine Kadry
- Department of Applied Data Science, Noroff University College, 4612 Kristiansand, Norway
- Artificial Intelligence Research Center (AIRC), Ajman University, Ajman P.O. Box 346, United Arab Emirates
- Department of Electrical and Computer Engineering, Lebanese American University, Byblos P.O. Box 36, Lebanon
| |
Collapse
|
4
|
Mirjalili F, Hardeberg JY. On the Quantification of Visual Texture Complexity. J Imaging 2022; 8:248. [PMID: 36135413 DOI: 10.3390/jimaging8090248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 08/26/2022] [Accepted: 09/08/2022] [Indexed: 11/20/2022] Open
Abstract
Complexity is one of the major attributes of the visual perception of texture. However, very little is known about how humans visually interpret texture complexity. A psychophysical experiment was conducted to visually quantify the seven texture attributes of a series of textile fabrics: complexity, color variation, randomness, strongness, regularity, repetitiveness, and homogeneity. It was found that the observers could discriminate between the textures with low and high complexity using some high-level visual cues such as randomness, color variation, strongness, etc. The results of principal component analysis (PCA) on the visual scores of the above attributes suggest that complexity and homogeneity could be essentially the underlying attributes of the same visual texture dimension, with complexity at the negative extreme and homogeneity at the positive extreme of this dimension. We chose to call this dimension visual texture complexity. Several texture measures including the first-order image statistics, co-occurrence matrix, local binary pattern, and Gabor features were computed for images of the textiles in sRGB, and four luminance-chrominance color spaces (i.e., HSV, YCbCr, Ohta’s I1I2I3, and CIELAB). The relationships between the visually quantified texture complexity of the textiles and the corresponding texture measures of the images were investigated. Analyzing the relationships showed that simple standard deviation of the image luminance channel had a strong correlation with the corresponding visual ratings of texture complexity in all five color spaces. Standard deviation of the energy of the image after convolving with an appropriate Gabor filter and entropy of the co-occurrence matrix, both computed for the image luminance channel, also showed high correlations with the visual data. In this comparison, sRGB, YCbCr, and HSV always outperformed the I1I2I3 and CIELAB color spaces. The highest correlations between the visual data and the corresponding image texture features in the luminance-chrominance color spaces were always obtained for the luminance channel of the images, and one of the two chrominance channels always performed better than the other. This result indicates that the arrangement of the image texture elements that impacts the observer’s perception of visual texture complexity cannot be represented properly by the chrominance channels. This must be carefully considered when choosing an image channel to quantify the visual texture complexity. Additionally, the good performance of the luminance channel in the five studied color spaces proves that variations in the luminance of the texture, or as one could call the luminance contrast, plays a crucial role in creating visual texture complexity.
Collapse
|
5
|
Al Saidi I, Rziza M, Debayle J. A New LBP Variant: Corner Rhombus Shape LBP (CRSLBP). J Imaging 2022; 8:jimaging8070200. [PMID: 35877644 PMCID: PMC9324107 DOI: 10.3390/jimaging8070200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2022] [Revised: 06/27/2022] [Accepted: 07/11/2022] [Indexed: 12/10/2022] Open
Abstract
The local binary model is a straightforward, dependable, and effective method for extracting relevant local information from images. However, because it only uses sign information in the local region, the local binary pattern (LBP) is ineffective at capturing discriminating characteristics. Furthermore, most LBP variants select a region with one specific center pixel to fill all neighborhoods. In this paper, a new variant of a LBP is proposed for texture classification, known as corner rhombus-shape LBP (CRSLBP). In the CRSLBP approach, we first use three methods to threshold the pixel's neighbors and center to obtain four center pixels by using sign and magnitude information with respect to a chosen region of an even block. This helps determine not just the relationship between neighbors and the pixel center but also between the center and the neighbor pixels of neighborhood center pixels. We evaluated the performance of our descriptors using four challenging texture databases: Outex (TC10,TC12), Brodatz, KTH-TIPSb2, and UMD. Various extensive experiments were performed that demonstrated the effectiveness and robustness of our descriptor in comparison with the available state of the art (SOTA).
Collapse
Affiliation(s)
- Ibtissam Al Saidi
- LRIT Laboratory, Rabat IT Center, Faculty of Sciences, Mohammed V University in Rabat, Rabat B.P. 1014, Morocco;
- Correspondence:
| | - Mohammed Rziza
- LRIT Laboratory, Rabat IT Center, Faculty of Sciences, Mohammed V University in Rabat, Rabat B.P. 1014, Morocco;
| | - Johan Debayle
- Mines Saint-Etienne, French National Center for Scientific Research, Joint Research Unit 5307 Laboratory Georges Friedel, Centre SPIN 158 Cours Fauriel, CEDEX 2, 42023 Saint-Etienne, France;
| |
Collapse
|
6
|
Dong X, Fu L, Liu Q. No-reference image quality assessment for confocal endoscopy images with perceptual local descriptor. J Biomed Opt 2022; 27:056503. [PMID: 35585672 PMCID: PMC9116465 DOI: 10.1117/1.jbo.27.5.056503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Accepted: 04/29/2022] [Indexed: 06/15/2023]
Abstract
SIGNIFICANCE Confocal endoscopy images often suffer distortions, resulting in image quality degradation and information loss, increasing the difficulty of diagnosis and even leading to misdiagnosis. It is important to assess image quality and filter images with low diagnostic value before diagnosis. AIM We propose a no-reference image quality assessment (IQA) method for confocal endoscopy images based on Weber's law and local descriptors. The proposed method can detect the severity of image degradation by capturing the perceptual structure of an image. APPROACH We created a new dataset of 642 confocal endoscopy images to validate the performance of the proposed method. We then conducted extensive experiments to compare the accuracy and speed of the proposed method with other state-of-the-art IQA methods. RESULTS Experimental results demonstrate that the proposed method achieved an SROCC of 0.85 and outperformed other IQA methods. CONCLUSIONS Given its high consistency in subjective quality assessment, the proposed method can screen high-quality images in practical applications and contribute to diagnosis.
Collapse
Affiliation(s)
- Xiangjiang Dong
- Huazhong University of Science and Technology, Wuhan National Laboratory for Optoelectronics, Wuhan, China
| | - Ling Fu
- Huazhong University of Science and Technology, Wuhan National Laboratory for Optoelectronics, Wuhan, China
- Hainan University, School of Biomedical Engineering, Key Laboratory of Biomedical Engineering of Hainan Province, Hainan, China
| | - Qian Liu
- Hainan University, School of Biomedical Engineering, Key Laboratory of Biomedical Engineering of Hainan Province, Hainan, China
| |
Collapse
|
7
|
Chen K, Wang Q, Ma Y. Cervical optical coherence tomography image classification based on contrastive self-supervised texture learning. Med Phys 2022; 49:3638-3653. [PMID: 35342956 DOI: 10.1002/mp.15630] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 02/26/2022] [Accepted: 03/16/2022] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Cervical cancer seriously affects the health of the female reproductive system. Optical coherence tomography (OCT) emerged as a non-invasive, high-resolution imaging technology for cervical disease detection. However, OCT image annotation is knowledge-intensive and time-consuming, which impedes the training process of deep-learning-based classification models. PURPOSE This study aims to develop a computer-aided diagnosis (CADx) approach to classifying in-vivo cervical OCT images based on self-supervised learning. METHODS In addition to high-level semantic features extracted by a convolutional neural network (CNN), the proposed CADx approach designs a contrastive texture learning (CTL) strategy to leverage unlabeled cervical OCT images' texture features. We conducted ten-fold cross-validation on the OCT image dataset from a multi-center clinical study on 733 patients from China. RESULTS In a binary classification task for detecting high-risk diseases, including high-grade squamous intraepithelial lesion and cervical cancer, our method achieved an area-under-the-curve value of 0.9798 ± 0.0157 with a sensitivity of 91.17 ± 4.99% and a specificity of 93.96 ± 4.72% for OCT image patches; also, it outperformed two out of four medical experts on the test set. Furthermore, our method achieved a 91.53% sensitivity and 97.37% specificity on an external validation dataset containing 287 3D OCT volumes from 118 Chinese patients in a new hospital using a cross-shaped threshold voting strategy. CONCLUSIONS The proposed contrastive-learning-based CADx method outperformed the end-to-end CNN models and provided better interpretability based on texture features, which holds great potential to be used in the clinical protocol of "see-and-treat." This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Kaiyi Chen
- School of Computer Science, Wuhan University, Wuhan, 430072, China
| | - Qingbin Wang
- School of Computer Science, Wuhan University, Wuhan, 430072, China
| | - Yutao Ma
- School of Computer Science, Wuhan University, Wuhan, 430072, China
| |
Collapse
|
8
|
Murguia-Cozar A, Macedo-Cruz A, Fernandez-Reynoso DS, Salgado Transito JA. Recognition of Maize Phenology in Sentinel Images with Machine Learning. Sensors (Basel) 2021; 22:94. [PMID: 35009637 PMCID: PMC8747376 DOI: 10.3390/s22010094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Revised: 12/01/2021] [Accepted: 12/07/2021] [Indexed: 06/14/2023]
Abstract
The scarcity of water for agricultural use is a serious problem that has increased due to intense droughts, poor management, and deficiencies in the distribution and application of the resource. The monitoring of crops through satellite image processing and the application of machine learning algorithms are technological strategies with which developed countries tend to implement better public policies regarding the efficient use of water. The purpose of this research was to determine the main indicators and characteristics that allow us to discriminate the phenological stages of maize crops (Zea mays L.) in Sentinel 2 satellite images through supervised classification models. The training data were obtained by monitoring cultivated plots during an agricultural cycle. Indicators and characteristics were extracted from 41 Sentinel 2 images acquired during the monitoring dates. With these images, indicators of texture, vegetation, and colour were calculated to train three supervised classifiers: linear discriminant (LD), support vector machine (SVM), and k-nearest neighbours (kNN) models. It was found that 45 of the 86 characteristics extracted contributed to maximizing the accuracy by stage of development and the overall accuracy of the trained classification models. The characteristics of the Moran's I local indicator of spatial association (LISA) improved the accuracy of the classifiers when applied to the L*a*b* colour model and to the near-infrared (NIR) band. The local binary pattern (LBP) increased the accuracy of the classification when applied to the red, green, blue (RGB) and NIR bands. The colour ratios, leaf area index (LAI), RGB colour model, L*a*b* colour space, LISA, and LBP extracted the most important intrinsic characteristics of maize crops with regard to classifying the phenological stages of the maize cultivation. The quadratic SVM model was the best classifier of maize crop phenology, with an overall accuracy of 82.3%.
Collapse
Affiliation(s)
- Alvaro Murguia-Cozar
- Colegio de Postgraduados, Campus Montecillo, Carretera Federal Mexico-Texcoco, km. 36.5, Montecillo, Texcoco 56230, State of Mexico, Mexico; (A.M.-C.); (D.S.F.-R.)
| | - Antonia Macedo-Cruz
- Colegio de Postgraduados, Campus Montecillo, Carretera Federal Mexico-Texcoco, km. 36.5, Montecillo, Texcoco 56230, State of Mexico, Mexico; (A.M.-C.); (D.S.F.-R.)
| | - Demetrio Salvador Fernandez-Reynoso
- Colegio de Postgraduados, Campus Montecillo, Carretera Federal Mexico-Texcoco, km. 36.5, Montecillo, Texcoco 56230, State of Mexico, Mexico; (A.M.-C.); (D.S.F.-R.)
| | - Jorge Arturo Salgado Transito
- Colegio Mexicano de Especialistas en Recursos Naturales AC, De las Flores no. 8 s/n, San Luis Huexotla, Texcoco 56220, State of Mexico, Mexico;
| |
Collapse
|
9
|
Sabanci K. Detection of sunn pest-damaged wheat grains using artificial bee colony optimization-based artificial intelligence techniques. J Sci Food Agric 2020; 100:817-824. [PMID: 31646637 DOI: 10.1002/jsfa.10093] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Revised: 10/13/2019] [Accepted: 10/17/2019] [Indexed: 06/10/2023]
Abstract
BACKGROUND In this study, artificial intelligence models that identify sunn pest-damaged wheat grains (SDG) and healthy wheat grains (HWG) are presented. Svevo durum wheat cultivated in Konya province, Turkey is used for the process, with 150 HWG and 150 SDG being used for classification. Thanks to the constructed imaging setup, photos of the 300 wheat grains are obtained. Seventeen visual features of each wheat grain are extracted by image-processing techniques and evaluated in three different groups of dimension, texture and pattern as visual parameters. Artificial bee colony (ABC) optimization-based artificial neural network (ANN) and extreme learning machine (ELM) algorithms are implemented to classify the damaged wheat grains. RESULTS A correlation-based feature selection (CFS) technique is also utilized to find the most effective among the 17 features. In the classification process using five selected features, the mean absolute error (MAE) and root mean square error (RMSE) values for ABC-based ANN are calculated as 0.00174 and 0.00433 respectively. The proposed technique is integrated into graphical user interface (GUI) software to construct an effective detection system for practical use. CONCLUSION The results indicate that, thanks to the modified ANN algorithm and implemented CFS algorithm, the detection accuracy of damaged wheat grains is considerably increased. © 2019 Society of Chemical Industry.
Collapse
Affiliation(s)
- Kadir Sabanci
- Department of Electrical Electronics Engineering, Engineering Faculty, Karamanoglu Mehmetbey University, Karaman, Turkey
| |
Collapse
|
10
|
Shanir PPM, Khan KA, Khan YU, Farooq O, Adeli H. Automatic Seizure Detection Based on Morphological Features Using One-Dimensional Local Binary Pattern on Long-Term EEG. Clin EEG Neurosci 2018; 49:351-362. [PMID: 29214865 DOI: 10.1177/1550059417744890] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Epileptic neurological disorder of the brain is widely diagnosed using the electroencephalography (EEG) technique. EEG signals are nonstationary in nature and show abnormal neural activity during the ictal period. Seizures can be identified by analyzing and obtaining features of EEG signal that can detect these abnormal activities. The present work proposes a novel morphological feature extraction technique based on the local binary pattern (LBP) operator. LBP provides a unique decimal value to a sample point by weighing the binary outcomes after thresholding the neighboring samples with the present sample point. These LBP values assist in capturing the rising and falling edges of the EEG signal, thus providing a morphologically featured discriminating pattern for epilepsy detection. In the present work, the variability in the LBP values is measured by calculating the sum of absolute difference of the consecutive LBP values. Interquartile range is calculated over the preprocessed EEG signal to provide dispersion measure in the signal. For classification purpose, K-nearest neighbor classifier is used, and the performance is evaluated on 896.9 hours of data from CHB-MIT continuous EEG database. Mean accuracy of 99.7% and mean specificity of 99.8% is obtained with average false detection rate of 0.47/h and sensitivity of 99.2% for 136 seizures.
Collapse
Affiliation(s)
- P P Muhammed Shanir
- 1 Department of Electrical and Electronics Engineering, Thangal Kunju Musaliar College of Engineering, Kollam, Kerala, India.,2 Department of Electrical Engineering, Zakir Husain College of Engineering and Technology, Aligarh Muslim University, Aligarh, Uttar Pradesh, India
| | - Kashif Ahmad Khan
- 3 School of Electrical and Electronics Engineering, Lovely Professional University, Phagwara, Punjab, India
| | - Yusuf Uzzaman Khan
- 2 Department of Electrical Engineering, Zakir Husain College of Engineering and Technology, Aligarh Muslim University, Aligarh, Uttar Pradesh, India
| | - Omar Farooq
- 4 Department of Electronics Engineering, Zakir Husain College of Engineering and Technology, AMU Aligarh, Aligarh, Uttar Pradesh, India
| | - Hojjat Adeli
- 5 College of Engineering, The Ohio State University, Columbus, OH, USA
| |
Collapse
|
11
|
Ma C, Trung NT, Uchiyama H, Nagahara H, Shimada A, Taniguchi RI. Adapting Local Features for Face Detection in Thermal Image. Sensors (Basel) 2017; 17:E2741. [PMID: 29186923 DOI: 10.3390/s17122741] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2017] [Revised: 11/20/2017] [Accepted: 11/23/2017] [Indexed: 11/17/2022]
Abstract
A thermal camera captures the temperature distribution of a scene as a thermal image. In thermal images, facial appearances of different people under different lighting conditions are similar. This is because facial temperature distribution is generally constant and not affected by lighting condition. This similarity in face appearances is advantageous for face detection. To detect faces in thermal images, cascade classifiers with Haar-like features are generally used. However, there are few studies exploring the local features for face detection in thermal images. In this paper, we introduce two approaches relying on local features for face detection in thermal images. First, we create new feature types by extending Multi-Block LBP. We consider a margin around the reference and the generally constant distribution of facial temperature. In this way, we make the features more robust to image noise and more effective for face detection in thermal images. Second, we propose an AdaBoost-based training method to get cascade classifiers with multiple types of local features. These feature types have different advantages. In this way we enhance the description power of local features. We did a hold-out validation experiment and a field experiment. In the hold-out validation experiment, we captured a dataset from 20 participants, comprising 14 males and 6 females. For each participant, we captured 420 images with 10 variations in camera distance, 21 poses, and 2 appearances (participant with/without glasses). We compared the performance of cascade classifiers trained by different sets of the features. The experiment results showed that the proposed approaches effectively improve the performance of face detection in thermal images. In the field experiment, we compared the face detection performance in realistic scenes using thermal and RGB images, and gave discussion based on the results.
Collapse
|
12
|
Saeed A, Al-Hamadi A, Ghoneim A. Head Pose Estimation on Top of Haar-Like Face Detection: A Study Using the Kinect Sensor. Sensors (Basel) 2015; 15:20945-66. [PMID: 26343651 DOI: 10.3390/s150920945] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2015] [Revised: 08/04/2015] [Accepted: 08/06/2015] [Indexed: 11/29/2022]
Abstract
Head pose estimation is a crucial initial task for human face analysis, which is employed in several computer vision systems, such as: facial expression recognition, head gesture recognition, yawn detection, etc. In this work, we propose a frame-based approach to estimate the head pose on top of the Viola and Jones (VJ) Haar-like face detector. Several appearance and depth-based feature types are employed for the pose estimation, where comparisons between them in terms of accuracy and speed are presented. It is clearly shown through this work that using the depth data, we improve the accuracy of the head pose estimation. Additionally, we can spot positive detections, faces in profile views detected by the frontal model, that are wrongly cropped due to background disturbances. We introduce a new depth-based feature descriptor that provides competitive estimation results with a lower computation time. Evaluation on a benchmark Kinect database shows that the histogram of oriented gradients and the developed depth-based features are more distinctive for the head pose estimation, where they compare favorably to the current state-of-the-art approaches. Using a concatenation of the aforementioned feature types, we achieved a head pose estimation with average errors not exceeding 5.1∘,4.6∘,4.2∘ for pitch, yaw and roll angles, respectively.
Collapse
|
13
|
Abstract
Local binary pattern (LBP) is a simple gray scale descriptor to characterize the local distribution of the grey levels in an image. Multi-resolution LBP and/or combinations of the LBPs have shown to be effective in texture image analysis. However, it is unclear what resolutions or combinations to choose for texture analysis. Examining all the possible cases is impractical and intractable due to the exponential growth in a feature space. This limits the accuracy and time- and space-efficiency of LBP. Here, we propose a data mining approach for LBP, which efficiently explores a high-dimensional feature space and finds a relatively smaller number of discriminative features. The features can be any combinations of LBPs. These may not be achievable with conventional approaches. Hence, our approach not only fully utilizes the capability of LBP but also maintains the low computational complexity. We incorporated three different descriptors (LBP, local contrast measure, and local directional derivative measure) with three spatial resolutions and evaluated our approach using two comprehensive texture databases. The results demonstrated the effectiveness and robustness of our approach to different experimental designs and texture images.
Collapse
Affiliation(s)
- Jin Tae Kwak
- Center for Interventional Oncology, National Institutes of Health Clinical Center, Bethesda MD 20892, USA
| | - Sheng Xu
- Center for Interventional Oncology, National Institutes of Health Clinical Center, Bethesda MD 20892, USA
| | - Bradford J. Wood
- Center for Interventional Oncology, National Institutes of Health Clinical Center, Bethesda MD 20892, USA
| |
Collapse
|
14
|
Salehi L, Azmi R. A novel method based on learning automata for automatic lesion detection in breast magnetic resonance imaging. J Med Signals Sens 2014; 4:202-10. [PMID: 25298929 PMCID: PMC4187355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2013] [Accepted: 04/16/2014] [Indexed: 11/26/2022]
Abstract
Breast cancer continues to be a significant public health problem in the world. Early detection is the key for improving breast cancer prognosis. In this way, magnetic resonance imaging (MRI) is emerging as a powerful tool for the detection of breast cancer. Breast MRI presently has two major challenges. First, its specificity is relatively poor, and it detects many false positives (FPs). Second, the method involves acquiring several high-resolution image volumes before, during, and after the injection of a contrast agent. The large volume of data makes the task of interpretation by the radiologist both complex and time-consuming. These challenges have led to the development of the computer-aided detection systems to improve the efficiency and accuracy of the interpretation process. Detection of suspicious regions of interests (ROIs) is a critical preprocessing step in dynamic contrast-enhanced (DCE)-MRI data evaluation. In this regard, this paper introduces a new automatic method to detect the suspicious ROIs for breast DCE-MRI based on region growing. The results indicate that the proposed method is thoroughly able to identify suspicious regions (accuracy of 75.39 ± 3.37 on PIDER breast MRI dataset). Furthermore, the FP per image in this method is averagely 7.92, which shows considerable improvement comparing to other methods like ROI hunter.
Collapse
Affiliation(s)
- Leila Salehi
- Department of Computer Engineering, Alzahra University, Tehran, Iran,Address for correspondence: Leila Salehi, Department of Computer Engineering, Alzahra University, Tehran, Iran. E-mail:
| | - Reza Azmi
- Department of Computer Engineering, Alzahra University, Tehran, Iran
| |
Collapse
|
15
|
Acharya UR, Sree SV, Mookiah MRK, Saba L, Gao H, Mallarini G, Suri JS. Computed tomography carotid wall plaque characterization using a combination of discrete wavelet transform and texture features: A pilot study. Proc Inst Mech Eng H 2013; 227:643-54. [PMID: 23636747 DOI: 10.1177/0954411913480622] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
In 30% of stroke victims, the cause of stroke has been found to be the stenosis caused by plaques in the carotid artery. Early detection of plaque and subsequent classification of the same into symptomatic and asymptomatic can help the clinicians to choose only those patients who are at a higher risk of stroke for risky surgeries and stenosis treatments. Therefore, in this work, we have proposed a non-invasive computer-aided diagnostic technique to classify the detected plaque into the two classes. Computed tomography (CT) images of the carotid artery images were used to extract Local Binary Pattern (LBP) features and wavelet energy features. Significant features were then used to train and test several supervised learning algorithm based classifiers. The Support Vector Machine (SVM) classifier with various kernel configurations was evaluated using LBP and wavelet features. The SVM classifier presented the highest accuracy of 88%, sensitivity of 90.2%, and specificity of 86.5% for radial basis function (RBF) kernel function. The CT images of the carotid artery provide unique 3D images of the artery and plaque that could be used for calculating percentage of stenosis. Our proposed technique enables automatic classification of plaque into asymptomatic and symptomatic with high accuracy, and hence, it can be used for deciding the course of treatment. We have also proposed a single-valued integrated index (Atheromatic Index) using the significant features which can provide a more objective and faster prediction of the class.
Collapse
Affiliation(s)
- U R Acharya
- Department of Electronics and Computer Engineering, Ngee Ann Polytechnic, Singapore, Singapore.
| | | | | | | | | | | | | |
Collapse
|
16
|
Yang G, Xi X, Yin Y. Finger vein recognition based on a personalized best bit map. Sensors (Basel) 2012; 12:1738-57. [PMID: 22438735 PMCID: PMC3304137 DOI: 10.3390/s120201738] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/24/2011] [Revised: 02/02/2012] [Accepted: 02/03/2012] [Indexed: 11/16/2022]
Abstract
Finger vein patterns have recently been recognized as an effective biometric identifier. In this paper, we propose a finger vein recognition method based on a personalized best bit map (PBBM). Our method is rooted in a local binary pattern based method and then inclined to use the best bits only for matching. We first present the concept of PBBM and the generating algorithm. Then we propose the finger vein recognition framework, which consists of preprocessing, feature extraction, and matching. Finally, we design extensive experiments to evaluate the effectiveness of our proposal. Experimental results show that PBBM achieves not only better performance, but also high robustness and reliability. In addition, PBBM can be used as a general framework for binary pattern based recognition.
Collapse
Affiliation(s)
- Gongping Yang
- School of Computer Science and Technology, Shandong University, Jinan 250101, China.
| | | | | |
Collapse
|
17
|
Gavilán M, Balcones D, Marcos O, Llorca DF, Sotelo MA, Parra I, Ocaña M, Aliseda P, Yarza P, Amírola A. Adaptive road crack detection system by pavement classification. Sensors (Basel) 2011; 11:9628-57. [PMID: 22163717 DOI: 10.3390/s111009628] [Citation(s) in RCA: 180] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/27/2011] [Revised: 09/27/2011] [Accepted: 10/09/2011] [Indexed: 11/30/2022]
Abstract
This paper presents a road distress detection system involving the phases needed to properly deal with fully automatic road distress assessment. A vehicle equipped with line scan cameras, laser illumination and acquisition HW-SW is used to storage the digital images that will be further processed to identify road cracks. Pre-processing is firstly carried out to both smooth the texture and enhance the linear features. Non-crack features detection is then applied to mask areas of the images with joints, sealed cracks and white painting, that usually generate false positive cracking. A seed-based approach is proposed to deal with road crack detection, combining Multiple Directional Non-Minimum Suppression (MDNMS) with a symmetry check. Seeds are linked by computing the paths with the lowest cost that meet the symmetry restrictions. The whole detection process involves the use of several parameters. A correct setting becomes essential to get optimal results without manual intervention. A fully automatic approach by means of a linear SVM-based classifier ensemble able to distinguish between up to 10 different types of pavement that appear in the Spanish roads is proposed. The optimal feature vector includes different texture-based features. The parameters are then tuned depending on the output provided by the classifier. Regarding non-crack features detection, results show that the introduction of such module reduces the impact of false positives due to non-crack features up to a factor of 2. In addition, the observed performance of the crack detection system is significantly boosted by adapting the parameters to the type of pavement.
Collapse
|
18
|
Rosdi BA, Shing CW, Suandi SA. Finger vein recognition using local line binary pattern. Sensors (Basel) 2011; 11:11357-71. [PMID: 22247670 DOI: 10.3390/s111211357] [Citation(s) in RCA: 178] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/01/2011] [Revised: 11/28/2011] [Accepted: 11/29/2011] [Indexed: 11/17/2022]
Abstract
In this paper, a personal verification method using finger vein is presented. Finger vein can be considered more secured compared to other hands based biometric traits such as fingerprint and palm print because the features are inside the human body. In the proposed method, a new texture descriptor called local line binary pattern (LLBP) is utilized as feature extraction technique. The neighbourhood shape in LLBP is a straight line, unlike in local binary pattern (LBP) which is a square shape. Experimental results show that the proposed method using LLBP has better performance than the previous methods using LBP and local derivative pattern (LDP).
Collapse
|
19
|
Lee EC, Jung H, Kim D. New finger biometric method using near infrared imaging. Sensors (Basel) 2011; 11:2319-33. [PMID: 22163741 PMCID: PMC3231585 DOI: 10.3390/s110302319] [Citation(s) in RCA: 165] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2011] [Revised: 01/21/2011] [Accepted: 02/10/2011] [Indexed: 11/16/2022]
Abstract
In this paper, we propose a new finger biometric method. Infrared finger images are first captured, and then feature extraction is performed using a modified Gaussian high-pass filter through binarization, local binary pattern (LBP), and local derivative pattern (LDP) methods. Infrared finger images include the multimodal features of finger veins and finger geometries. Instead of extracting each feature using different methods, the modified Gaussian high-pass filter is fully convolved. Therefore, the extracted binary patterns of finger images include the multimodal features of veins and finger geometries. Experimental results show that the proposed method has an error rate of 0.13%.
Collapse
Affiliation(s)
- Eui Chul Lee
- Division of Fusion and Convergence of Mathematical Sciences, National Institute for Mathematical Sciences/463-1, Jeonmin-Dong, KT Daeduk Research Center, Yuseong-gu, Daejeon 305-390, Korea; E-Mail:
- Author to whom correspondence should be addressed; E-Mail:
| | - Hyunwoo Jung
- Department of Math and Computer Science, Korea Science Academy of KAIST, Pusan, Korea; E-Mail:
| | - Daeyeoul Kim
- Division of Fusion and Convergence of Mathematical Sciences, National Institute for Mathematical Sciences/463-1, Jeonmin-Dong, KT Daeduk Research Center, Yuseong-gu, Daejeon 305-390, Korea; E-Mail:
| |
Collapse
|