1
|
Classification and distribution of freshwater microplastics along the Italian Po river by hyperspectral imaging. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:48588-48606. [PMID: 35195863 PMCID: PMC9252960 DOI: 10.1007/s11356-022-18501-x] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2021] [Accepted: 12/31/2021] [Indexed: 06/13/2023]
Abstract
In this work, freshwater microplastic samples collected from four different stations along the Italian Po river were characterized in terms of abundance, distribution, category, morphological and morphometrical features, and polymer type. The correlation between microplastic category and polymer type was also evaluated. Polymer identification was carried out developing and implementing a new and effective hierarchical classification logic applied to hyperspectral images acquired in the short-wave infrared range (SWIR: 1000-2500 nm). Results showed that concentration of microplastics ranged from 1.89 to 8.22 particles/m3, the most abundant category was fragment, followed by foam, granule, pellet, and filament and the most diffused polymers were expanded polystyrene followed by polyethylene, polypropylene, polystyrene, polyamide, polyethylene terephthalate and polyvinyl chloride, with some differences in polymer distribution among stations. The application of hyperspectral imaging (HSI) as a rapid and non-destructive method to classify freshwater microplastics for environmental monitoring represents a completely innovative approach in this field.
Collapse
|
2
|
VirusTaxo: Taxonomic classification of viruses from the genome sequence using k-mer enrichment. Genomics 2022; 114:110414. [PMID: 35718090 DOI: 10.1016/j.ygeno.2022.110414] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Revised: 04/27/2022] [Accepted: 06/13/2022] [Indexed: 11/04/2022]
Abstract
Classification of viruses into their taxonomic ranks (e.g., order, family, and genus) provides a framework to organize an abundant population of viruses. Next-generation metagenomic sequencing technologies lead to a rapid increase in generating sequencing data of viruses which require bioinformatics tools to analyze the taxonomy. Many metagenomic taxonomy classifiers have been developed to study microbiomes, but it is particularly challenging to assign the taxonomy of diverse virus sequences and there is a growing need for dedicated methods to be developed that are optimized to classify virus sequences into their taxa. For taxonomic classification of viruses from metagenomic sequences, we developed VirusTaxo using diverse (e.g., 402 DNA and 280 RNA) genera of viruses. VirusTaxo has an average accuracy of 93% at genus level prediction in DNA and RNA viruses. VirusTaxo outperformed existing taxonomic classifiers of viruses where it assigned taxonomy of a larger fraction of metagenomic contigs compared to other methods. Benchmarking of VirusTaxo on a collection of SARS-CoV-2 sequencing libraries and metavirome datasets suggests that VirusTaxo can characterize virus taxonomy from highly diverse contigs and provide a reliable decision on the taxonomy of viruses.
Collapse
|
3
|
Bacterial species identification using MALDI-TOF mass spectrometry and machine learning techniques: A large-scale benchmarking study. Comput Struct Biotechnol J 2021; 19:6157-6168. [PMID: 34938408 PMCID: PMC8649224 DOI: 10.1016/j.csbj.2021.11.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Revised: 11/03/2021] [Accepted: 11/03/2021] [Indexed: 11/17/2022] Open
Abstract
Today machine learning methods are commonly deployed for bacterial species identification using MALDI-TOF mass spectrometry data. However, most of the studies reported in literature only consider very traditional machine learning methods on small datasets that contain a limited number of species. In this paper we present benchmarking results on an unprecedented scale for a wide range of machine learning methods, using datasets that contain almost 100,000 spectra and more than 1000 different species. The size and the diversity of the data allow to compare three important identification scenarios that are often not distinguished in literature, i.e., identification for novel biological replicates, novel strains and novel species that are not present in the training data. The results demonstrate that in all three scenarios acceptable identification rates are obtained, but the numbers are typically lower than those reported in studies with a more limited analysis. Using hierarchical classification methods, we also demonstrate that taxonomic information is in general not well preserved in MALDI-TOF mass spectrometry data. For the novel species scenario, we apply for the first time neural networks with Monte Carlo dropout, which have shown to be successful in other domains, such as computer vision, for the detection of novel species.
Collapse
|
4
|
Monitoring the level of hypnosis using a hierarchical SVM system. J Clin Monit Comput 2020; 34:331-338. [PMID: 30982945 DOI: 10.1007/s10877-019-00311-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2018] [Accepted: 04/04/2019] [Indexed: 10/27/2022]
Abstract
Monitoring level of hypnosis is a major ongoing challenge for anesthetists to reduce anesthetic drug consumption, avoiding intraoperative awareness and prolonged recovery. This paper proposes a novel automated method for accurate assessing of the level of hypnosis with sevoflurane in 17 patients using the electroencephalogram signal. In this method, a set of distinctive features and a hierarchical classification structure based on support vector machine (SVM) methods, is proposed to discriminate the four levels of anesthesia (awake, light, general and deep states). The first stage of the hierarchical SVM structure identifies the awake state by extracting Shannon Permutation Entropy, Detrended Fluctuation Analysis and frequency features. Then deep state is identified by extracting the sample entropy feature; and finally light and general states are identified by extracting the three mentioned features of the first step. The accuracy of the proposed method of analyzing the brain activity during anesthesia is 94.11%; which was better than previous studies and also a commercial monitoring system (Response Entropy Index).
Collapse
|
5
|
Chronic Kidney Disease stratification using office visit records: Handling data imbalance via hierarchical meta-classification. BMC Med Inform Decis Mak 2018; 18:125. [PMID: 30537962 PMCID: PMC6290512 DOI: 10.1186/s12911-018-0675-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Background Chronic Kidney Disease (CKD) is one of several conditions that affect a growing percentage of the US population; the disease is accompanied by multiple co-morbidities, and is hard to diagnose in-and-of itself. In its advanced forms it carries severe outcomes and can lead to death. It is thus important to detect the disease as early as possible, which can help devise effective intervention and treatment plan. Here we investigate ways to utilize information available in electronic health records (EHRs) from regular office visits of more than 13,000 patients, in order to distinguish among several stages of the disease. While clinical data stored in EHRs provide valuable information for risk-stratification, one of the major challenges in using them arises from data imbalance. That is, records associated with a more severe condition are typically under-represented compared to those associated with a milder manifestation of the disease. To address imbalance, we propose and develop a sampling-based ensemble approach, hierarchical meta-classification, aiming to stratify CKD patients into severity stages, using simple quantitative non-text features gathered from standard office visit records. Methods The proposed hierarchical meta-classification method frames the multiclass classification task as a hierarchy of two subtasks. The first is binary classification, separating records associated with the majority class from those associated with all minority classes combined, using meta-classification. The second subtask separates the records assigned to the combined minority classes into the individual constituent classes. Results The proposed method identifies a significant proportion of patients suffering from the more advanced stages of the condition, while also correctly identifying most of the less severe cases, maintaining high sensitivity, specificity and F-measure (≥ 93%). Our results show that the high level of performance attained by our method is preserved even when the size of the training set is significantly reduced, demonstrating the stability and generalizability of our approach. Conclusion We present a new approach to perform classification while addressing data imbalance, which is inherent in the biomedical domain. Our model effectively identifies severity stages of CKD patients, using information readily available in office visit records within the realistic context of high data imbalance.
Collapse
|
6
|
Hierarchical classification of land use types using multiple vegetation indices to measure the effects of urbanization. ENVIRONMENTAL MONITORING AND ASSESSMENT 2018; 190:342. [PMID: 29752655 DOI: 10.1007/s10661-018-6714-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2017] [Accepted: 05/02/2018] [Indexed: 06/08/2023]
Abstract
Detecting fine-scale spatiotemporal land use changes is a prerequisite for understanding and predicting the effects of urbanization and its related human impacts on the ecosystem. Land use changes are frequently examined using vegetation indices (VIs), although the validation of these indices has not been conducted at a high resolution. Therefore, a hierarchical classification was constructed to obtain accurate land use types at a fine scale. The characteristics of four popular VIs were investigated prior to examining the hierarchical classification by using Purbachal New Town, Bangladesh, which exhibits ongoing urbanization. These four VIs are the normalized difference VI (NDVI), green-red VI (GRVI), enhanced VI (EVI), and two-band EVI (EVI2). The reflectance data were obtained by the IKONOS (0.8-m resolution) and WorldView-2 sensor (0.5-m resolution) in 2001 and 2015, respectively. The hierarchical classification of land use types was constructed using a decision tree (DT) utilizing all four of the examined VIs. The accuracy of the classification was evaluated using ground truth data with multiple comparisons and kappa (κ) coefficients. The DT showed overall accuracies of 96.1 and 97.8% in 2001 and 2015, respectively, while the accuracies of the VIs were less than 91.2%. These results indicate that each VI exhibits unique advantages. In addition, the DT was the best classifier of land use types, particularly for native ecosystems represented by Shorea forests and homestead vegetation, at the fine scale. Since the conservation of these native ecosystems is of prime importance, DTs based on hierarchical classifications should be used more widely.
Collapse
|
7
|
An ensemble learning system for a 4-way classification of Alzheimer's disease and mild cognitive impairment. J Neurosci Methods 2018; 302:75-81. [PMID: 29578038 DOI: 10.1016/j.jneumeth.2018.03.008] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2017] [Revised: 03/20/2018] [Accepted: 03/20/2018] [Indexed: 01/23/2023]
Abstract
Discriminating Alzheimer's disease (AD) from its prodromal form, mild cognitive impairment (MCI), is a significant clinical problem that may facilitate early diagnosis and intervention, in which a more challenging issue is to classify MCI subtypes, i.e., those who eventually convert to AD (cMCI) versus those who do not (MCI). To solve this difficult 4-way classification problem (AD, MCI, cMCI and healthy controls), a competition was hosted by Kaggle to invite the scientific community to apply their machine learning approaches on pre-processed sets of T1-weighted magnetic resonance images (MRI) data and the demographic information from the international Alzheimer's disease neuroimaging initiative (ADNI) database. This paper summarizes our competition results. We first proposed a hierarchical process by turning the 4-way classification into five binary classification problems. A new feature selection technology based on relative importance was also proposed, aiming to identify a more informative and concise subset from 426 sMRI morphometric and 3 demographic features, to ensure each binary classifier to achieve its highest accuracy. As a result, about 2% of the original features were selected to build a new feature space, which can achieve the final four-way classification with a 54.38% accuracy on testing data through hierarchical grouping, higher than several alternative methods in comparison. More importantly, the selected discriminative features such as hippocampal volume, parahippocampal surface area, and medial orbitofrontal thickness, etc. as well as the MMSE score, are reasonable and consistent with those reported in AD/MCI deficits. In summary, the proposed method provides a new framework for multi-way classification using hierarchical grouping and precise feature selection.
Collapse
|
8
|
A hierarchical classifier based on human blood plasma fluorescence for non-invasive colorectal cancer screening. Artif Intell Med 2017; 82:1-10. [PMID: 28939302 DOI: 10.1016/j.artmed.2017.09.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Revised: 08/15/2017] [Accepted: 09/12/2017] [Indexed: 01/06/2023]
Abstract
Colorectal cancer (CRC) a leading cause of death by cancer, and screening programs for its early identification are at the heart of the increasing survival rates. To motivate population participation, non-invasive, accurate, scalable and cost-effective diagnosis methods are required. Blood fluorescence spectroscopy provides rich information that can be used for cancer identification. The main challenges in analyzing blood fluorescence data for CRC classification are related to its high dimensionality and inherent variability, especially when analyzing a small number of samples. In this paper, we present a hierarchical classification method based on plasma fluorescence to identify not only CRC, but also adenomas and other non-malignant colorectal findings that may require further medical investigation. A feature selection algorithm is proposed to deal with the high dimensionality and select discriminant fluorescence wavelengths. These are used to train a binary support vector machine (SVM) in the first level to identify the CRC samples. The remaining samples are then presented to a one-class SVM trained on healthy subjects to detect deviant samples, and thus non-malignant findings. This hierarchical design, together with the one class-SVM, aims to reduce the effects of small samples and high variability. Using a dataset analyzed in previous studies comprised of 12,341 wavelengths, we achieved much superior results. Sensitivity and specificity are 0.87 and 0.95 for CRC detection, and 0.60 and 0.79 for non-malignant findings, respectively. Compared to related work, the proposed method presented a better accuracy, required fewer features, and provides a unified approach that expands CRC detection to non-malignant findings.
Collapse
|
9
|
Leveraging machine learning-based approaches to assess human papillomavirus vaccination sentiment trends with Twitter data. BMC Med Inform Decis Mak 2017; 17:69. [PMID: 28699569 PMCID: PMC5506590 DOI: 10.1186/s12911-017-0469-6] [Citation(s) in RCA: 68] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background As one of the serious public health issues, vaccination refusal has been attracting more and more attention, especially for newly approved human papillomavirus (HPV) vaccines. Understanding public opinion towards HPV vaccines, especially concerns on social media, is of significant importance for HPV vaccination promotion. Methods In this study, we leveraged a hierarchical machine learning based sentiment analysis system to extract public opinions towards HPV vaccines from Twitter. English tweets containing HPV vaccines-related keywords were collected from November 2, 2015 to March 28, 2016. Manual annotation was done to evaluate the performance of the system on the unannotated tweets corpus. Followed time series analysis was applied to this corpus to track the trends of machine-deduced sentiments and their associations with different days of the week. Results The evaluation of the unannotated tweets corpus showed that the micro-averaging F scores have reached 0.786. The learning system deduced the sentiment labels for 184,214 tweets in the collected unannotated tweets corpus. Time series analysis identified a coincidence between mainstream outcome and Twitter contents. A weak trend was found for “Negative” tweets that decreased firstly and began to increase later; an opposite trend was identified for “Positive” tweets. Tweets that contain the worries on efficacy for HPV vaccines showed a relative significant decreasing trend. Strong associations were found between some sentiments (“Positive”, “Negative”, “Negative-Safety” and “Negative-Others”) with different days of the week. Conclusions Our efforts on sentiment analysis for newly approved HPV vaccines provide us an automatic and instant way to extract public opinion and understand the concerns on Twitter. Our approaches can provide a feedback to public health professionals to monitor online public response, examine the effectiveness of their HPV vaccination promotion strategies and adjust their promotion plans. Electronic supplementary material The online version of this article (doi:10.1186/s12911-017-0469-6) contains supplementary material, which is available to authorized users.
Collapse
|
10
|
Optimization on machine learning based approaches for sentiment analysis on HPV vaccines related tweets. J Biomed Semantics 2017; 8:9. [PMID: 28253919 PMCID: PMC5335787 DOI: 10.1186/s13326-017-0120-6] [Citation(s) in RCA: 64] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2016] [Accepted: 02/07/2017] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND Analysing public opinions on HPV vaccines on social media using machine learning based approaches will help us understand the reasons behind the low vaccine coverage and come up with corresponding strategies to improve vaccine uptake. OBJECTIVE To propose a machine learning system that is able to extract comprehensive public sentiment on HPV vaccines on Twitter with satisfying performance. METHOD We collected and manually annotated 6,000 HPV vaccines related tweets as a gold standard. SVM model was chosen and a hierarchical classification method was proposed and evaluated. Additional feature sets evaluation and model parameters optimization was done to maximize the machine learning model performance. RESULTS A hierarchical classification scheme that contains 10 categories was built to access public opinions toward HPV vaccines comprehensively. A 6,000 annotated tweets gold corpus with Kappa annotation agreement at 0.851 was created and made public available. The hierarchical classification model with optimized feature sets and model parameters has increased the micro-averaging and macro-averaging F score from 0.6732 and 0.3967 to 0.7442 and 0.5883 respectively, compared with baseline model. CONCLUSIONS Our work provides a systematical way to improve the machine learning model performance on the highly unbalanced HPV vaccines related tweets corpus. Our system can be further applied on a large tweets corpus to extract large-scale public opinion towards HPV vaccines.
Collapse
|
11
|
Hierarchical Classification and System Combination for Automatically Identifying Physiological and Neuromuscular Laryngeal Pathologies. J Voice 2016; 31:384.e9-384.e14. [PMID: 27743845 DOI: 10.1016/j.jvoice.2016.09.003] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2016] [Revised: 09/08/2016] [Accepted: 09/08/2016] [Indexed: 11/26/2022]
Abstract
OBJECTIVES Speech signal processing techniques have provided several contributions to pathologic voice identification, in which healthy and unhealthy voice samples are evaluated. A less common approach is to identify laryngeal pathologies, for which the use of a noninvasive method for pathologic voice identification is an important step forward for preliminary diagnosis. In this study, a hierarchical classifier and a combination of systems are used to improve the accuracy of a three-class identification system (healthy, physiological larynx pathologies, and neuromuscular larynx pathologies). METHOD Three main subject classes were considered: subjects with physiological larynx pathologies (vocal fold nodules and edemas: 59 samples), subjects with neuromuscular larynx pathologies (unilateral vocal fold paralysis: 59 samples), and healthy subjects (36 samples). The variables used in this study were a speech task (sustained vowel /a/ or continuous reading speech), features with or without perceptual information, and features with or without direct information about formants evaluated using single classifiers. A hierarchical classification system was designed based on this information. RESULTS The resulting system combines an analysis of continuous speech by way of the commonly used sustained vowel /a/ to obtain spectral and perceptual speech features. It achieved an accuracy of 84.4%, which represents an improvement of approximately 9% compared with the stand-alone approach. For pathologic voice identification, the accuracy obtained was 98.7%, and the identification accuracy for the two pathology classes was 81.3%. CONCLUSIONS Hierarchical classification and system combination create significant benefits and introduce a modular approach to the classification of larynx pathologies.
Collapse
|
12
|
Genomics dataset on unclassified published organism (patent US 7547531). Data Brief 2016; 9:602-605. [PMID: 27766287 PMCID: PMC5066183 DOI: 10.1016/j.dib.2016.09.046] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Revised: 07/28/2016] [Accepted: 09/28/2016] [Indexed: 11/13/2022] Open
Abstract
Nucleotide (DNA) sequence analysis provides important clues regarding the characteristics and taxonomic position of an organism. With the intention that, DNA sequence analysis is very crucial to learn about hierarchical classification of that particular organism. This dataset (patent US 7547531) is chosen to simplify all the complex raw data buried in undisclosed DNA sequences which help to open doors for new collaborations. In this data, a total of 48 unidentified DNA sequences from patent US 7547531 were selected and their complete sequences were retrieved from NCBI BioSample database. Quick response (QR) code of those DNA sequences was constructed by DNA BarID tool. QR code is useful for the identification and comparison of isolates with other organisms. AT/GC content of the DNA sequences was determined using ENDMEMO GC Content Calculator, which indicates their stability at different temperature. The highest GC content was observed in GP445188 (62.5%) which was followed by GP445198 (61.8%) and GP445189 (59.44%), while lowest was in GP445178 (24.39%). In addition, New England BioLabs (NEB) database was used to identify cleavage code indicating the 5, 3 and blunt end and enzyme code indicating the methylation site of the DNA sequences was also shown. These data will be helpful for the construction of the organisms’ hierarchical classification, determination of their phylogenetic and taxonomic position and revelation of their molecular characteristics.
Collapse
|
13
|
Hierarchical classification strategy for Phenotype extraction from epidermal growth factor receptor endocytosis screening. BMC Bioinformatics 2016; 17:196. [PMID: 27142862 PMCID: PMC4855371 DOI: 10.1186/s12859-016-1053-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2015] [Accepted: 04/13/2016] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND Endocytosis is regarded as a mechanism of attenuating the epidermal growth factor receptor (EGFR) signaling and of receptor degradation. There is increasing evidence becoming available showing that breast cancer progression is associated with a defect in EGFR endocytosis. In order to find related Ribonucleic acid (RNA) regulators in this process, high-throughput imaging with fluorescent markers is used to visualize the complex EGFR endocytosis process. Subsequently a dedicated automatic image and data analysis system is developed and applied to extract the phenotype measurement and distinguish different developmental episodes from a huge amount of images acquired through high-throughput imaging. For the image analysis, a phenotype measurement quantifies the important image information into distinct features or measurements. Therefore, the manner in which prominent measurements are chosen to represent the dynamics of the EGFR process becomes a crucial step for the identification of the phenotype. In the subsequent data analysis, classification is used to categorize each observation by making use of all prominent measurements obtained from image analysis. Therefore, a better construction for a classification strategy will support to raise the performance level in our image and data analysis system. RESULTS In this paper, we illustrate an integrated analysis method for EGFR signalling through image analysis of microscopy images. Sophisticated wavelet-based texture measurements are used to obtain a good description of the characteristic stages in the EGFR signalling. A hierarchical classification strategy is designed to improve the recognition of phenotypic episodes of EGFR during endocytosis. Different strategies for normalization, feature selection and classification are evaluated. CONCLUSIONS The results of performance assessment clearly demonstrate that our hierarchical classification scheme combined with a selected set of features provides a notable improvement in the temporal analysis of EGFR endocytosis. Moreover, it is shown that the addition of the wavelet-based texture features contributes to this improvement. Our workflow can be applied to drug discovery to analyze defected EGFR endocytosis processes.
Collapse
|
14
|
OBIA based hierarchical image classification for industrial lake water. THE SCIENCE OF THE TOTAL ENVIRONMENT 2014; 487:565-573. [PMID: 24813772 DOI: 10.1016/j.scitotenv.2014.04.048] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2014] [Revised: 04/09/2014] [Accepted: 04/13/2014] [Indexed: 06/03/2023]
Abstract
Water management is very important in water mining regions for the sustainability of the natural environment and for industrial activities. This study focused on Acigol Lake, which is an important wetland for sodium sulphate (Na2SO4) production, a significant natural protection area and habitat for local bird species and endemic species of this saline environment, and a stopover for migrating flamingos. By a hierarchical classification method, ponds representing the industrial part were classified according to in-situ measured Baumé values, and lake water representing the natural part was classified according to in-situ measurements of water depth. The latter is directly related to the water level, which should not exceed a critical level determined by the regulatory authorities. The resulting data, produced at an accuracy of around 80%, illustrates the status in two main regions for a single date. The output of the analysis may be meaningful for firms and environmental researchers, and authorizations can provide a good perspective for decision making for sustainable resource management in the region which has uncommon and specific ecological characteristics.
Collapse
|
15
|
Segmentation of abdominal organs from CT using a multi-level, hierarchical neural network strategy. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2014; 113:830-852. [PMID: 24480371 DOI: 10.1016/j.cmpb.2013.12.008] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2013] [Revised: 11/09/2013] [Accepted: 12/17/2013] [Indexed: 06/03/2023]
Abstract
Precise measurements on abdominal organs are vital prior to the important clinical procedures. Such measurements require accurate segmentation of these organs, which is a very challenging task due to countless anatomical variations and technical difficulties. Although, several features with various classifiers have been designed to overcome these challenges, abdominal organ segmentation via classification is still an emerging field in order to reach desired precision. Recent studies on multiple feature-classifier combinations show that hierarchical systems outperform composite feature-single classifier models. In this study, how hierarchical formations can translate to improved accuracy, when large size feature spaces are involved, is explored for the problem of abdominal organ segmentation. As a result, a semi-automatic, slice-by-slice segmentation method is developed using a novel multi-level and hierarchical neural network (MHNN). MHNN is designed to collect complementary information about organs at each level of the hierarchy via different feature-classifier combinations. Moreover, each level of MHNN receives residual data from the previous level. The residual data is constructed to preserve zero false positive error until the last level of the hierarchy, where only most challenging samples remain. The algorithm mimics analysis behaviour of a radiologist by using the slice-by-slice iteration, which is supported with adjacent slice similarity features. This enables adaptive determination of system parameters and turns into the advantage of online training, which is done in parallel to the segmentation process. Proposed design can perform robust and accurate segmentation of abdominal organs as validated by using diverse data sets with various challenges.
Collapse
|
16
|
Medical X-ray Image Hierarchical Classification Using a Merging and Splitting Scheme in Feature Space. JOURNAL OF MEDICAL SIGNALS AND SENSORS 2013; 3:150-63. [PMID: 24672763 PMCID: PMC3959005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/24/2013] [Accepted: 06/13/2013] [Indexed: 10/25/2022]
Abstract
Due to the daily mass production and the widespread variation of medical X-ray images, it is necessary to classify these for searching and retrieving proposes, especially for content-based medical image retrieval systems. In this paper, a medical X-ray image hierarchical classification structure based on a novel merging and splitting scheme and using shape and texture features is proposed. In the first level of the proposed structure, to improve the classification performance, similar classes with regard to shape contents are grouped based on merging measures and shape features into the general overlapped classes. In the next levels of this structure, the overlapped classes split in smaller classes based on the classification performance of combination of shape and texture features or texture features only. Ultimately, in the last levels, this procedure is also continued forming all the classes, separately. Moreover, to optimize the feature vector in the proposed structure, we use orthogonal forward selection algorithm according to Mahalanobis class separability measure as a feature selection and reduction algorithm. In other words, according to the complexity and inter-class distance of each class, a sub-space of the feature space is selected in each level and then a supervised merging and splitting scheme is applied to form the hierarchical classification. The proposed structure is evaluated on a database consisting of 2158 medical X-ray images of 18 classes (IMAGECLEF 2005 database) and accuracy rate of 93.6% in the last level of the hierarchical structure for an 18-class classification problem is obtained.
Collapse
|