1
|
Dakal TC, Xu C, Kumar A. Advanced computational tools, artificial intelligence and machine-learning approaches in gut microbiota and biomarker identification. FRONTIERS IN MEDICAL TECHNOLOGY 2025; 6:1434799. [PMID: 40303946 PMCID: PMC12037385 DOI: 10.3389/fmedt.2024.1434799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2024] [Accepted: 10/16/2024] [Indexed: 05/02/2025] Open
Abstract
The microbiome of the gut is a complex ecosystem that contains a wide variety of microbial species and functional capabilities. The microbiome has a significant impact on health and disease by affecting endocrinology, physiology, and neurology. It can change the progression of certain diseases and enhance treatment responses and tolerance. The gut microbiota plays a pivotal role in human health, influencing a wide range of physiological processes. Recent advances in computational tools and artificial intelligence (AI) have revolutionized the study of gut microbiota, enabling the identification of biomarkers that are critical for diagnosing and treating various diseases. This review hunts through the cutting-edge computational methodologies that integrate multi-omics data-such as metagenomics, metaproteomics, and metabolomics-providing a comprehensive understanding of the gut microbiome's composition and function. Additionally, machine learning (ML) approaches, including deep learning and network-based methods, are explored for their ability to uncover complex patterns within microbiome data, offering unprecedented insights into microbial interactions and their link to host health. By highlighting the synergy between traditional bioinformatics tools and advanced AI techniques, this review underscores the potential of these approaches in enhancing biomarker discovery and developing personalized therapeutic strategies. The convergence of computational advancements and microbiome research marks a significant step forward in precision medicine, paving the way for novel diagnostics and treatments tailored to individual microbiome profiles. Investigators have the ability to discover connections between the composition of microorganisms, the expression of genes, and the profiles of metabolites. Individual reactions to medicines that target gut microbes can be predicted by models driven by artificial intelligence. It is possible to obtain personalized and precision medicine by first gaining an understanding of the impact that the gut microbiota has on the development of disease. The application of machine learning allows for the customization of treatments to the specific microbial environment of an individual.
Collapse
Affiliation(s)
- Tikam Chand Dakal
- Genome and Computational Biology Lab, Department of Biotechnology, Mohanlal Sukhadia University, Udaipur, India
| | - Caiming Xu
- Beckman Research Institute of City of Hope, Monrovia, CA, United States
- Department of General Surgery, The First Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Abhishek Kumar
- Manipal Academy of Higher Education (MAHE), Manipal, India
- Institute of Bioinformatics, International Technology Park, Bangalore, India
| |
Collapse
|
2
|
Bakir-Gungor B, Temiz M, Canakcimaksutoglu B, Yousef M. Prediction of colorectal cancer based on taxonomic levels of microorganisms and discovery of taxonomic biomarkers using the Grouping-Scoring-Modeling (G-S-M) approach. Comput Biol Med 2025; 187:109813. [PMID: 39929003 DOI: 10.1016/j.compbiomed.2025.109813] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2024] [Revised: 01/09/2025] [Accepted: 02/05/2025] [Indexed: 02/12/2025]
Abstract
Colorectal cancer (CRC) is one of the most prevalent forms of cancer globally. The human gut microbiome plays an important role in the development of CRC and serves as a biomarker for early detection and treatment. This research effort focuses on the identification of potential taxonomic biomarkers of CRC using a grouping-based feature selection method. Additionally, this study investigates the effect of incorporating biological domain knowledge into the feature selection process while identifying CRC-associated microorganisms. Conventional feature selection techniques often fail to leverage existing biological knowledge during metagenomic data analysis. To address this gap, we propose taxonomy-based Grouping Scoring Modeling (G-S-M) method that integrates biological domain knowledge into feature grouping and selection. In this study, using metagenomic data related to CRC, classification is performed at three taxonomic levels (genus, family and order). The MetaPhlAn tool is employed to determine the relative abundance values of species in each sample. Comparative performance analyses involve six feature selection methods and four classification algorithms. When experimented on two CRC associated metagenomics datasets, the highest performance metric, yielding an AUC of 0.90, is observed at the genus taxonomic level. At this level, 7 out of top 10 groups (Parvimonas, Peptostreptococcus, Fusobacterium, Gemella, Streptococcus, Porphyromonas and Solobacterium) were commonly identified for both datasets. Moreover, the identified microorganisms at genus, family, and order levels are thoroughly discussed via refering to CRC-related metagenomic literature. This study not only contributes to our understanding of CRC development, but also highlights the applicability of taxonomy-based G-S-M method in tackling various diseases.
Collapse
Affiliation(s)
- Burcu Bakir-Gungor
- Department of Computer Engineering, Faculty of Engineering, Abdullah Gul University, Kayseri, 38080, Turkey
| | - Mustafa Temiz
- Department of Electrical and Computer Engineering, Faculty of Engineering, Abdullah Gul University, Kayseri, 38080, Turkey.
| | - Beyza Canakcimaksutoglu
- Department of Bioengineering, Faculty of Life and Natural Science, Abdullah Gul University, Kayseri, 38080, Turkey
| | - Malik Yousef
- Department of Information Systems, Zefat Academic College, Zefat, 13206, Israel; Galilee Digital Health Research Center (GDH), Zefat Academic College, Israel
| |
Collapse
|
3
|
Jiang Y, Liao D, Zhu Q, Lu YY. PhyloMix: enhancing microbiome-trait association prediction through phylogeny-mixing augmentation. Bioinformatics 2025; 41:btaf014. [PMID: 39799515 PMCID: PMC11849959 DOI: 10.1093/bioinformatics/btaf014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2024] [Revised: 12/14/2024] [Accepted: 01/09/2025] [Indexed: 01/15/2025] Open
Abstract
MOTIVATION Understanding the associations between traits and microbial composition is a fundamental objective in microbiome research. Recently, researchers have turned to machine learning (ML) models to achieve this goal with promising results. However, the effectiveness of advanced ML models is often limited by the unique characteristics of microbiome data, which are typically high-dimensional, compositional, and imbalanced. These characteristics can hinder the models' ability to fully explore the relationships among taxa in predictive analyses. To address this challenge, data augmentation has become crucial. It involves generating synthetic samples with artificial labels based on existing data and incorporating these samples into the training set to improve ML model performance. RESULTS Here, we propose PhyloMix, a novel data augmentation method specifically designed for microbiome data to enhance predictive analyses. PhyloMix leverages the phylogenetic relationships among microbiome taxa as an informative prior to guide the generation of synthetic microbial samples. Leveraging phylogeny, PhyloMix creates new samples by removing a subtree from one sample and combining it with the corresponding subtree from another sample. Notably, PhyloMix is designed to address the compositional nature of microbiome data, effectively handling both raw counts and relative abundances. This approach introduces sufficient diversity into the augmented samples, leading to improved predictive performance. We empirically evaluated PhyloMix on six real microbiome datasets across five commonly used ML models. PhyloMix significantly outperforms distinct baseline methods including sample-mixing-based data augmentation techniques like vanilla mixup and compositional cutmix, as well as the phylogeny-based method TADA. We also demonstrated the wide applicability of PhyloMix in both supervised learning and contrastive representation learning. AVAILABILITY AND IMPLEMENTATION The Apache-licensed source code is available at (https://github.com/batmen-lab/phylomix).
Collapse
Affiliation(s)
- Yifan Jiang
- Cheriton School of Computer Science, University of Waterloo, Waterloo, ON, N2L 3G1, Canada
| | - Disen Liao
- Cheriton School of Computer Science, University of Waterloo, Waterloo, ON, N2L 3G1, Canada
| | - Qiyun Zhu
- School of Life Sciences, Arizona State University, Tempe, AZ, 85281, United States
| | - Yang Young Lu
- Cheriton School of Computer Science, University of Waterloo, Waterloo, ON, N2L 3G1, Canada
| |
Collapse
|
4
|
Su Q, Lau RI, Liu Q, Li MKT, Yan Mak JW, Lu W, Lau ISF, Lau LHS, Yeung GTY, Cheung CP, Tang W, Liu C, Ching JYL, Cheong PK, Chan FKL, Ng SC. The gut microbiome associates with phenotypic manifestations of post-acute COVID-19 syndrome. Cell Host Microbe 2024; 32:651-660.e4. [PMID: 38657605 DOI: 10.1016/j.chom.2024.04.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 02/28/2024] [Accepted: 04/01/2024] [Indexed: 04/26/2024]
Abstract
The mechanisms underlying the many phenotypic manifestations of post-acute COVID-19 syndrome (PACS) are poorly understood. Herein, we characterized the gut microbiome in heterogeneous cohorts of subjects with PACS and developed a multi-label machine learning model for using the microbiome to predict specific symptoms. Our processed data covered 585 bacterial species and 500 microbial pathways, explaining 12.7% of the inter-individual variability in PACS. Three gut-microbiome-based enterotypes were identified in subjects with PACS and associated with different phenotypic manifestations. The trained model showed an accuracy of 0.89 in predicting individual symptoms of PACS in the test set and maintained a sensitivity of 86% and a specificity of 82% in predicting upcoming symptoms in an independent longitudinal cohort of subjects before they developed PACS. This study demonstrates that the gut microbiome is associated with phenotypic manifestations of PACS, which has potential clinical utility for the prediction and diagnosis of PACS.
Collapse
Affiliation(s)
- Qi Su
- Microbiota I-Center (MagIC), Hong Kong SAR, China; Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Raphaela I Lau
- Microbiota I-Center (MagIC), Hong Kong SAR, China; Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Qin Liu
- Microbiota I-Center (MagIC), Hong Kong SAR, China
| | - Moses K T Li
- Microbiota I-Center (MagIC), Hong Kong SAR, China
| | - Joyce Wing Yan Mak
- Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Wenqi Lu
- Microbiota I-Center (MagIC), Hong Kong SAR, China; Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Ivan S F Lau
- Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Louis H S Lau
- Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Giann T Y Yeung
- Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Chun Pan Cheung
- Microbiota I-Center (MagIC), Hong Kong SAR, China; Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Whitney Tang
- Microbiota I-Center (MagIC), Hong Kong SAR, China; Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Chengyu Liu
- Microbiota I-Center (MagIC), Hong Kong SAR, China; Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Jessica Y L Ching
- Microbiota I-Center (MagIC), Hong Kong SAR, China; Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Pui Kuan Cheong
- Microbiota I-Center (MagIC), Hong Kong SAR, China; Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Francis K L Chan
- Microbiota I-Center (MagIC), Hong Kong SAR, China; Centre for Gut Microbiota Research, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Siew C Ng
- Microbiota I-Center (MagIC), Hong Kong SAR, China; Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong SAR, China; Li Ka Shing Institute of Health Sciences, State Key Laboratory of Digestive Disease, Institute of Digestive Disease, The Chinese University of Hong Kong, Hong Kong SAR, China.
| |
Collapse
|
5
|
Ibrahimi E, Lopes MB, Dhamo X, Simeon A, Shigdel R, Hron K, Stres B, D’Elia D, Berland M, Marcos-Zambrano LJ. Overview of data preprocessing for machine learning applications in human microbiome research. Front Microbiol 2023; 14:1250909. [PMID: 37869650 PMCID: PMC10588656 DOI: 10.3389/fmicb.2023.1250909] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 09/22/2023] [Indexed: 10/24/2023] Open
Abstract
Although metagenomic sequencing is now the preferred technique to study microbiome-host interactions, analyzing and interpreting microbiome sequencing data presents challenges primarily attributed to the statistical specificities of the data (e.g., sparse, over-dispersed, compositional, inter-variable dependency). This mini review explores preprocessing and transformation methods applied in recent human microbiome studies to address microbiome data analysis challenges. Our results indicate a limited adoption of transformation methods targeting the statistical characteristics of microbiome sequencing data. Instead, there is a prevalent usage of relative and normalization-based transformations that do not specifically account for the specific attributes of microbiome data. The information on preprocessing and transformations applied to the data before analysis was incomplete or missing in many publications, leading to reproducibility concerns, comparability issues, and questionable results. We hope this mini review will provide researchers and newcomers to the field of human microbiome research with an up-to-date point of reference for various data transformation tools and assist them in choosing the most suitable transformation method based on their research questions, objectives, and data characteristics.
Collapse
Affiliation(s)
- Eliana Ibrahimi
- Department of Biology, Faculty of Natural Sciences, University of Tirana, Tirana, Albania
| | - Marta B. Lopes
- Department of Mathematics, Center for Mathematics and Applications (NOVA Math), NOVA School of Science and Technology, Caparica, Portugal
- UNIDEMI, Department of Mechanical and Industrial Engineering, NOVA School of Science and Technology, Caparica, Portugal
| | - Xhilda Dhamo
- Department of Applied Mathematics, Faculty of Natural Sciences, University of Tirana, Tirana, Albania
| | - Andrea Simeon
- BioSense Institute, University of Novi Sad, Novi Sad, Serbia
| | - Rajesh Shigdel
- Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University Olomouc, Olomouc, Czechia
| | - Blaž Stres
- Department of Catalysis and Chemical Reaction Engineering, National Institute of Chemistry, Ljubljana, Slovenia
- Faculty of Civil and Geodetic Engineering, Institute of Sanitary Engineering, Ljubljana, Slovenia
- Department of Automation, Biocybernetics and Robotics, Jožef Stefan Institute, Ljubljana, Slovenia
- Department of Animal Science, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Domenica D’Elia
- Department of Biomedical Sciences, National Research Council, Institute for Biomedical Technologies, Bari, Italy
| | - Magali Berland
- INRAE, MetaGenoPolis, Université Paris-Saclay, Jouy-en-Josas, France
| | - Laura Judith Marcos-Zambrano
- Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, Madrid, Spain
| |
Collapse
|
6
|
Zhang W, Fan X, Shi H, Li J, Zhang M, Zhao J, Su X. Comprehensive Assessment of 16S rRNA Gene Amplicon Sequencing for Microbiome Profiling across Multiple Habitats. Microbiol Spectr 2023; 11:e0056323. [PMID: 37102867 PMCID: PMC10269731 DOI: 10.1128/spectrum.00563-23] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Accepted: 04/10/2023] [Indexed: 04/28/2023] Open
Abstract
The 16S rRNA gene works as a rapid and effective marker for the identification of microorganisms in complex communities; hence, a huge number of microbiomes have been surveyed by 16S amplicon-based sequencing. The resolution of the 16S rRNA gene is always considered only at the genus level; however, it has not been verified on a wide range of microbes yet. To fully explore the ability and potential of the 16S rRNA gene in microbial profiling, here, we propose Qscore, a comprehensive method to evaluate the performance of amplicons by integrating the amplification rate, multitier taxonomic annotation, sequence type, and length. Our in silico assessment by a "global view" of 35,889 microbe species across multiple reference databases summarizes the optimal sequencing strategy for 16S short reads. On the other hand, since microbes are unevenly distributed according to their habitats, we also provide the recommended configuration for 16 typical ecosystems based on the Qscores of 157,390 microbiomes in the Microbiome Search Engine (MSE). Detailed data simulation further proves that the 16S amplicons produced with Qscore-suggested parameters exhibit high precision in microbiome profiling, which is close to that of shotgun metagenomes under CAMI metrics. Therefore, by reconsidering the precision of 16S-based microbiome profiling, our work not only enables the high-quality reusability of massive sequence legacy that has already been produced but is also significant for guiding microbiome studies in the future. We have implemented the Qscore as an online service at http://qscore.single-cell.cn to parse the recommended sequencing strategy for specific habitats or expected microbial structures. IMPORTANCE 16S rRNA has long been used as a biomarker to identify distinct microbes from complex communities. However, due to the influence of the amplification region, sequencing type, sequence processing, and reference database, the accuracy of 16S rRNA has not been fully verified on a global range. More importantly, the microbial composition of different habitats varies greatly, and it is necessary to adopt different strategies according to the corresponding target microbes to achieve optimal analytical performance. Here, we developed Qscore, which evaluates the comprehensive performance of 16S amplicons from multiple perspectives, thus providing the best sequencing strategies for common ecological environments by using big data.
Collapse
Affiliation(s)
- Wenke Zhang
- College of Computer Science and Technology, Qingdao University, Qingdao, China
| | - Xiaoqian Fan
- Shouguang Hospital of Traditional Chinese Medicine, Weifang, China
| | - Haobo Shi
- College of Computer Science and Technology, Qingdao University, Qingdao, China
| | - Jian Li
- College of Computer Science and Technology, Qingdao University, Qingdao, China
| | - Mingqian Zhang
- College of Computer Science and Technology, Qingdao University, Qingdao, China
| | - Jin Zhao
- College of Computer Science and Technology, Qingdao University, Qingdao, China
| | - Xiaoquan Su
- College of Computer Science and Technology, Qingdao University, Qingdao, China
| |
Collapse
|
7
|
An Improved Diagnostic of the Mycobacterium tuberculosis Drug Resistance Status by Applying a Decision Tree to Probabilities Assigned by the CatBoost Multiclassifier of Matrix Metalloproteinases Biomarkers. Diagnostics (Basel) 2022; 12:diagnostics12112847. [PMID: 36428907 PMCID: PMC9688965 DOI: 10.3390/diagnostics12112847] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 11/10/2022] [Accepted: 11/14/2022] [Indexed: 11/19/2022] Open
Abstract
In this work, we discuss an opportunity to use a set of the matrix metalloproteinases MMP-1, MMP-8, and MMP-9 and the tissue inhibitor TIMP, the concentrations of which can be easily obtained via a blood test from patients suffering from tuberculosis, as the biomarker for a fast diagnosis of the drug resistance status of Mycobacterium tuberculosis. The diagnostic approach is based on machine learning with the CatBoost system, which has been supplied with additional postprocessing. The latter refers not only to the simple probabilities of ML-predicted outcomes but also to the decision tree-like procedure, which takes into account the presence of strict zeros in the primary set of probabilities. It is demonstrated that this procedure significantly elevates the accuracy of distinguishing between sensitive, multi-, and extremely drug-resistant strains.
Collapse
|
8
|
Acharjee A, Singh U, Choudhury SP, Gkoutos GV. The diagnostic potential and barriers of microbiome based therapeutics. Diagnosis (Berl) 2022; 9:411-420. [PMID: 36000189 DOI: 10.1515/dx-2022-0052] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2022] [Accepted: 08/03/2022] [Indexed: 02/07/2023]
Abstract
High throughput technological innovations in the past decade have accelerated research into the trillions of commensal microbes in the gut. The 'omics' technologies used for microbiome analysis are constantly evolving, and large-scale datasets are being produced. Despite of the fact that much of the research is still in its early stages, specific microbial signatures have been associated with the promotion of cancer, as well as other diseases such as inflammatory bowel disease, neurogenerative diareses etc. It has been also reported that the diversity of the gut microbiome influences the safety and efficacy of medicines. The availability and declining sequencing costs has rendered the employment of RNA-based diagnostics more common in the microbiome field necessitating improved data-analytical techniques so as to fully exploit all the resulting rich biological datasets, while accounting for their unique characteristics, such as their compositional nature as well their heterogeneity and sparsity. As a result, the gut microbiome is increasingly being demonstrating as an important component of personalised medicine since it not only plays a role in inter-individual variability in health and disease, but it also represents a potentially modifiable entity or feature that may be addressed by treatments in a personalised way. In this context, machine learning and artificial intelligence-based methods may be able to unveil new insights into biomedical analyses through the generation of models that may be used to predict category labels, and continuous values. Furthermore, diagnostic aspects will add value in the identification of the non invasive markers in the critical diseases like cancer.
Collapse
Affiliation(s)
- Animesh Acharjee
- Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham, UK.,Institute of Translational Medicine, University of Birmingham, Birmingham, UK.,NIHR Surgical Reconstruction and Microbiology Research Centre, University Hospital Birmingham, Birmingham, UK.,MRC Health Data Research UK (HDR UK), Birmingham, UK
| | - Utpreksha Singh
- Department of Health and Life Sciences, Coventry University, Coventry, UK
| | | | - Georgios V Gkoutos
- Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham, UK.,Institute of Translational Medicine, University of Birmingham, Birmingham, UK.,NIHR Surgical Reconstruction and Microbiology Research Centre, University Hospital Birmingham, Birmingham, UK.,MRC Health Data Research UK (HDR UK), Birmingham, UK.,NIHR Experimental Cancer Medicine Centre, Birmingham, UK
| |
Collapse
|
9
|
Chen Y, Su X. Search-based health status detection and disease classification using species-level profiles of metagenomes. MEDICINE IN MICROECOLOGY 2022. [DOI: 10.1016/j.medmic.2021.100048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022] Open
|
10
|
Chen Y, Li J, Zhang Y, Zhang M, Sun Z, Jing G, Huang S, Su X. Parallel-Meta Suite: Interactive and rapid microbiome data analysis on multiple platforms. IMETA 2022; 1:e1. [PMID: 38867729 PMCID: PMC10989749 DOI: 10.1002/imt2.1] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 12/13/2021] [Accepted: 12/17/2021] [Indexed: 06/14/2024]
Abstract
Massive microbiome sequencing data has been generated, which elucidates associations between microbes and their environmental phenotypes such as host health or ecosystem status. Outstanding bioinformatic tools are the basis to decipher the biological information hidden under microbiome data. However, most approaches placed difficulties on the accessibility to nonprofessional users. On the other side, the computing throughput has become a significant bottleneck of many analytical pipelines in processing large-scale datasets. In this study, we introduce Parallel-Meta Suite (PMS), an interactive software package for fast and comprehensive microbiome data analysis, visualization, and interpretation. It covers a wide array of functions for data preprocessing, statistics, visualization by state-of-the-art algorithms in a user-friendly graphical interface, which is accessible to diverse users. To meet the rapidly increasing computational demands, the entire procedure of PMS has been optimized by a parallel computing scheme, enabling the rapid processing of thousands of samples. PMS is compatible with multiple platforms, and an installer has been integrated for full-automatic installation.
Collapse
Affiliation(s)
- Yuzhu Chen
- College of Computer Science and TechnologyQingdao UniversityQingdaoShandongChina
| | - Jian Li
- College of Computer Science and TechnologyQingdao UniversityQingdaoShandongChina
| | - Yufeng Zhang
- College of Computer Science and TechnologyQingdao UniversityQingdaoShandongChina
| | - Mingqian Zhang
- College of Computer Science and TechnologyQingdao UniversityQingdaoShandongChina
| | - Zheng Sun
- Single‐Cell Center, Qingdao Institute of BioEnergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoShandongChina
| | - Gongchao Jing
- Single‐Cell Center, Qingdao Institute of BioEnergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoShandongChina
| | - Shi Huang
- Faculty of DentistryThe University of Hong KongHong KongHong Kong SARChina
| | - Xiaoquan Su
- College of Computer Science and TechnologyQingdao UniversityQingdaoShandongChina
- Single‐Cell Center, Qingdao Institute of BioEnergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoShandongChina
| |
Collapse
|