Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Fritz A, Hofmann P, Majda S, Dahms E, Dröge J, Fiedler J, Lesker TR, Belmann P, DeMaere MZ, Darling AE, Sczyrba A, Bremges A, McHardy AC. CAMISIM: simulating metagenomes and microbial communities. Microbiome 2019;7:17. [PMID: 30736849 PMCID: PMC6368784 DOI: 10.1186/s40168-019-0633-6] [Citation(s) in RCA: 109] [Impact Index Per Article: 18.2] [Received: 04/18/2018] [Accepted: 01/21/2019] [Indexed: 05/11/2023]

Cited by Other Article(s)

Plaza-Díaz J, Fernández MF, García F, Chueca N, Fontana L, Álvarez-Mercado AI. Comparison of Three DNA Isolation Methods and Two Sequencing Techniques for the Study of the Human Microbiota. Life (Basel) 2025;15:599. [PMID: 40283154 PMCID: PMC12028492 DOI: 10.3390/life15040599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/27/2025] [Revised: 03/21/2025] [Accepted: 04/02/2025] [Indexed: 04/07/2025] Open

Abstract

Breast cancer is the most commonly diagnosed cancer in women and the second leading cause of female death. Altered interactions between the host and the gut microbiota appear to play an influential role in carcinogenesis. Several studies have shown different signatures of the gut microbiota in patients with breast cancer compared to healthy women. Currently, there is disagreement regarding the different DNA isolation and sequencing methodologies for studies on the human microbiota, given that they can influence the interpretation of the results obtained. The goal of this work was to compare (1) three different DNA extraction strategies to minimize the impact of human DNA, and (2) two sequencing strategies (16S rRNA and shotgun) to identify discrepancies in microbiome results. We made use of breast tissue and fecal samples from both healthy women and breast cancer patients who participated in the MICROMA study (reference NCT03885648). DNA was isolated by means of mechanical lysis, trypsin, or saponin. The amount of eukaryotic DNA isolated using the trypsin and saponin methods was lower compared to the mechanical lysis method (mechanical lysis, 89.11 ± 2.32%; trypsin method, 82.63 ± 1.23%; saponin method, 80.53 ± 4.09%). In samples with a predominance of prokaryotic cells, such as feces, 16S rRNA sequencing was the most advantageous approach. For other tissues, which are expected to have a more complex microbial composition, the need for an in-depth evaluation of the multifactorial interaction between the various components of the microbiota makes shotgun sequencing the most appropriate method. As for the three extraction methods evaluated, when sequencing samples other than stool, the trypsin method is the most convenient. For fecal samples, where contamination by host DNA is low, no prior treatment is necessary.

Affiliation(s)

Julio Plaza-Díaz Institute of Biosanitary Research (ibs.GRANADA), San Cecilio University Clinical Hospital, 18012 Granada, Spain; School of Health Sciences, International University of La Rioja, 26001 Logroño, Spain
Mariana F. Fernández Institute of Biosanitary Research (ibs.GRANADA), San Cecilio University Clinical Hospital, 18012 Granada, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), 28029 Madrid, Spain; Department of Radiology and Physical Medicine, School of Medicine, University of Granada, 18016 Granada, Spain
Federico García Institute of Biosanitary Research (ibs.GRANADA), San Cecilio University Clinical Hospital, 18012 Granada, Spain; Microbiology Unit, San Cecilio University Clinical Hospital, 18016 Granada, Spain; Spanish Consortium for Research on Infectious Diseases (CIBERINFEC), 28029 Madrid, Spain
Natalia Chueca Institute of Biosanitary Research (ibs.GRANADA), San Cecilio University Clinical Hospital, 18012 Granada, Spain; Microbiology Unit, San Cecilio University Clinical Hospital, 18016 Granada, Spain; Spanish Consortium for Research on Infectious Diseases (CIBERINFEC), 28029 Madrid, Spain
Luis Fontana Institute of Biosanitary Research (ibs.GRANADA), San Cecilio University Clinical Hospital, 18012 Granada, Spain; Department of Biochemistry and Molecular Biology II, School of Pharmacy, University of Granada, 18071 Granada, Spain; Institute of Nutrition and Food Technology “José Matáix”, Centre of Biomedical Research, University of Granada, 18016 Granada, Spain
Ana I. Álvarez-Mercado Institute of Biosanitary Research (ibs.GRANADA), San Cecilio University Clinical Hospital, 18012 Granada, Spain; Institute of Nutrition and Food Technology “José Matáix”, Centre of Biomedical Research, University of Granada, 18016 Granada, Spain; Department of Pharmacology, School of Pharmacy, 18071 Granada, Spain

Herazo-Álvarez J, Mora M, Cuadros-Orellana S, Vilches-Ponce K, Hernández-García R. A review of neural networks for metagenomic binning. Brief Bioinform 2025;26:bbaf065. [PMID: 40131312 PMCID: PMC11934572 DOI: 10.1093/bib/bbaf065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/17/2024] [Revised: 01/02/2025] [Accepted: 03/07/2025] [Indexed: 03/26/2025] Open

Ramos Lopez D, Flores FJ, Espindola AS. MeStanG-Resource for High-Throughput Sequencing Standard Data Sets Generation for Bioinformatic Methods Evaluation and Validation. Biology (Basel) 2025;14:69. [PMID: 39857299 PMCID: PMC11762867 DOI: 10.3390/biology14010069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/12/2024] [Revised: 01/10/2025] [Accepted: 01/11/2025] [Indexed: 01/27/2025]

Kohnert E, Kreutz C. Computational Study Protocol: Leveraging Synthetic Data to Validate a Benchmark Study for Differential Abundance Tests for 16S Microbiome Sequencing Data. F1000Res 2025;13:1180. [PMID: 39866725 PMCID: PMC11757917 DOI: 10.12688/f1000research.155230.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 12/19/2024] [Indexed: 01/28/2025] Open

Abstract

Background

Synthetic data's utility in benchmark studies depends on its ability to closely mimic real-world conditions and reproduce results obtained from experimental data. Building on Nearing et al.'s study (1), who assessed 14 differential abundance tests using 38 experimental 16S rRNA datasets in a case-control design, we are generating synthetic datasets that mimic the experimental data to verify their findings. We will employ statistical tests to rigorously assess the similarity between synthetic and experimental data and to validate the conclusions on the performance of these tests drawn by Nearing et al. (1). This protocol adheres to the SPIRIT guidelines, demonstrating how established reporting frameworks can support robust, transparent, and unbiased study planning.

Methods

We replicate Nearing et al.'s (1) methodology, incorporating synthetic data simulated using two distinct tools, mirroring the 38 experimental datasets. Equivalence tests will be conducted on a non-redundant subset of 46 data characteristics comparing synthetic and experimental data, complemented by principal component analysis for overall similarity assessment. The 14 differential abundance tests will be applied to synthetic and experimental datasets, evaluating the consistency of significant feature identification and the number of significant features per tool. Correlation analysis and multiple regression will explore how differences between synthetic and experimental data characteristics may affect the results.

Conclusions

Synthetic data enables the validation of findings through controlled experiments. We assess how well synthetic data replicates experimental data, try to validate previous findings with the most recent versions of the DA methods and delineate the strengths and limitations of synthetic data in benchmark studies. Moreover, to our knowledge this is the first computational benchmark study to systematically incorporate synthetic data for validating differential abundance methods while strictly adhering to a pre-specified study protocol following SPIRIT guidelines, contributing to transparency, reproducibility, and unbiased research.

Amaro-da-Cruz A, Rubio-Tomás T, Álvarez-Mercado AI. Specific microbiome patterns and their association with breast cancer: the intestinal microbiota as a potential biomarker and therapeutic strategy. Clin Transl Oncol 2025;27:15-41. [PMID: 38890244 PMCID: PMC11735593 DOI: 10.1007/s12094-024-03554-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/13/2024] [Accepted: 06/04/2024] [Indexed: 06/20/2024]

Abstract

Breast cancer (BC) is one of the most diagnosed cancers in women. Based on histological characteristics, they are classified as non-invasive, or in situ (tumors located within the milk ducts or milk lobules) and invasive. BC may develop from in situ carcinomas over time. Determining prognosis and predicting response to treatment are essential tools to manage this disease and reduce its incidence and mortality, as well as to promote personalized therapy for patients. However, over half of the cases are not associated with known risk factors. In addition, some patients develop resistance to treatment and relapse. Therefore, it is necessary to identify new biomarkers and treatment strategies that improve existing therapies. In this regard, the role of the microbiome is being researched as it could play a role in carcinogenesis and the efficacy of BC therapies. This review aims to describe specific microbiome patterns associated with BC. For this, a literature search was carried out in PubMed database using the MeSH terms "Breast Neoplasms" and "Gastrointestinal Microbiome", including 29 publications. Most of the studies have focused on characterizing the gut or breast tissue microbiome of the patients. Likewise, studies in animal models and in vitro that investigated the impact of gut microbiota (GM) on BC treatments and the effects of the microbiome on tumor cells were included. Based on the results of the included articles, BC could be associated with an imbalance in the GM. This imbalance varied depending on molecular type, stage and grade of cancer, menopause, menarche, body mass index, and physical activity. However, a specific microbial profile could not be identified as a biomarker. On the other hand, some studies suggest that the GM may influence the efficacy of BC therapies. In addition, some microorganisms and bacterial metabolites could improve the effects of therapies or influence tumor development.

Puller V, Plaza Oñate F, Prifti E, de Lahondès R. Impact of simulation and reference catalogues on the evaluation of taxonomic profiling pipelines. Microb Genom 2025;11:001330. [PMID: 39804694 PMCID: PMC11728698 DOI: 10.1099/mgen.0.001330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/13/2024] [Accepted: 11/06/2024] [Indexed: 01/16/2025] Open

Chaabane F, Pillonel T, Bertelli C. MeSS and assembly_finder: a toolkit for in silico metagenomic sample generation. Bioinformatics 2024;41:btae760. [PMID: 39739308 PMCID: PMC11755095 DOI: 10.1093/bioinformatics/btae760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/26/2024] [Revised: 11/17/2024] [Accepted: 12/30/2024] [Indexed: 01/02/2025] Open

Sena F, Ingervo E, Khan S, Prjibelski A, Schmidt S, Tomescu A. Flowtigs: Safety in flow decompositions for assembly graphs. iScience 2024;27:111208. [PMID: 39759024 PMCID: PMC11700653 DOI: 10.1016/j.isci.2024.111208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/22/2024] [Revised: 09/30/2024] [Accepted: 10/15/2024] [Indexed: 01/07/2025] Open

Liu Y, Li Y, Chen E, Xu J, Zhang W, Zeng X, Luo X. Repeat and haplotype aware error correction in nanopore sequencing reads with DeChat. Commun Biol 2024;7:1678. [PMID: 39702496 DOI: 10.1038/s42003-024-07376-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/28/2024] [Accepted: 12/05/2024] [Indexed: 12/21/2024] Open

Sankaran K, Kodikara S, Li JJ, Cao KAL. Semisynthetic simulation for microbiome data analysis. Brief Bioinform 2024;26:bbaf051. [PMID: 39927858 PMCID: PMC11808806 DOI: 10.1093/bib/bbaf051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/15/2024] [Revised: 12/19/2024] [Accepted: 01/23/2025] [Indexed: 02/11/2025] Open

Nickols WA, McIver LJ, Walsh A, Zhang Y, Nearing JT, Asnicar F, Punčochář M, Segata N, Nguyen LH, Hartmann EM, Franzosa EA, Huttenhower C, Thompson KN. Evaluating metagenomic analyses for undercharacterized environments: what's needed to light up the microbial dark matter? bioRxiv 2024:2024.11.08.622677. [PMID: 39574575 PMCID: PMC11580994 DOI: 10.1101/2024.11.08.622677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/29/2024]

Affiliation(s)

William A. Nickols Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
Lauren J. McIver Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
Aaron Walsh Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Yancong Zhang Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Jacob T. Nearing Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Francesco Asnicar Department of Cellular, Computational and Integrative Biology (CIBIO), University of Trento, Trento, Italy
Michal Punčochář Department of Cellular, Computational and Integrative Biology (CIBIO), University of Trento, Trento, Italy
Nicola Segata Department of Cellular, Computational and Integrative Biology (CIBIO), University of Trento, Trento, Italy
Long H. Nguyen Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA Division of Gastroenterology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
Erica M. Hartmann Department of Civil and Environmental Engineering, McCormick School of Engineering, Northwestern University, Evanston, IL, USA Center for Synthetic Biology, Northwestern University, Evanston, IL, USA Department of Medicine/Division of Pulmonary Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
Eric A. Franzosa Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Curtis Huttenhower Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA Division of Gastroenterology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA Department of Immunology and Infectious Diseases, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
Kelsey N. Thompson Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA

Gulyás G, Kakuk B, Dörmő Á, Járay T, Prazsák I, Csabai Z, Henkrich MM, Boldogkői Z, Tombácz D. Cross-comparison of gut metagenomic profiling strategies. Commun Biol 2024;7:1445. [PMID: 39505993 PMCID: PMC11541596 DOI: 10.1038/s42003-024-07158-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/02/2024] [Accepted: 10/28/2024] [Indexed: 11/08/2024] Open

Kang X, Zhang W, Li Y, Luo X, Schönhuth A. HyLight: Strain aware assembly of low coverage metagenomes. Nat Commun 2024;15:8665. [PMID: 39375348 PMCID: PMC11458758 DOI: 10.1038/s41467-024-52907-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/21/2023] [Accepted: 09/23/2024] [Indexed: 10/09/2024] Open

Yang Z, Shan Y, Liu X, Chen G, Pan Y, Gou Q, Zou J, Chang Z, Zeng Q, Yang C, Kong J, Sun Y, Li S, Zhang X, Wu WC, Li C, Peng H, Holmes EC, Guo D, Shi M. VirID: Beyond Virus Discovery-An Integrated Platform for Comprehensive RNA Virus Characterization. Mol Biol Evol 2024;41:msae202. [PMID: 39331699 PMCID: PMC11523140 DOI: 10.1093/molbev/msae202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/11/2024] [Revised: 09/10/2024] [Accepted: 09/24/2024] [Indexed: 09/29/2024] Open

Affiliation(s)

Ziyue Yang State Key Laboratory for Biocontrol, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Sun Yat-sen University, Shenzhen, China Shenzhen Key Laboratory for Systems Medicine in Inflammatory Diseases, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China
Yongtao Shan State Key Laboratory for Biocontrol, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Sun Yat-sen University, Shenzhen, China Shenzhen Key Laboratory for Systems Medicine in Inflammatory Diseases, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China
Xue Liu State Key Laboratory for Biocontrol, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Sun Yat-sen University, Shenzhen, China Shenzhen Key Laboratory for Systems Medicine in Inflammatory Diseases, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China
Guowei Chen Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong (SAR), China
Yuanfei Pan Ministry of Education Key Laboratory of Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, China
Qinyu Gou State Key Laboratory for Biocontrol, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Sun Yat-sen University, Shenzhen, China Shenzhen Key Laboratory for Systems Medicine in Inflammatory Diseases, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China
Jie Zou State Key Laboratory for Biocontrol, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Sun Yat-sen University, Shenzhen, China Shenzhen Key Laboratory for Systems Medicine in Inflammatory Diseases, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China
Zilong Chang State Key Laboratory for Biocontrol, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Sun Yat-sen University, Shenzhen, China Shenzhen Key Laboratory for Systems Medicine in Inflammatory Diseases, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China
Qiang Zeng State Key Laboratory for Biocontrol, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Sun Yat-sen University, Shenzhen, China Shenzhen Key Laboratory for Systems Medicine in Inflammatory Diseases, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China
Chunhui Yang State Key Laboratory for Biocontrol, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Sun Yat-sen University, Shenzhen, China Shenzhen Key Laboratory for Systems Medicine in Inflammatory Diseases, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China
Jianbin Kong State Key Laboratory for Biocontrol, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Sun Yat-sen University, Shenzhen, China Shenzhen Key Laboratory for Systems Medicine in Inflammatory Diseases, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China
Yanni Sun Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong (SAR), China
Shaochuan Li Goodwill Institute of Life Sciences, Guangzhou, China
Xu Zhang Goodwill Institute of Life Sciences, Guangzhou, China
Wei-chen Wu State Key Laboratory for Biocontrol, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Sun Yat-sen University, Shenzhen, China Shenzhen Key Laboratory for Systems Medicine in Inflammatory Diseases, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China
Chunmei Li State Key Laboratory for Biocontrol, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Sun Yat-sen University, Shenzhen, China Shenzhen Key Laboratory for Systems Medicine in Inflammatory Diseases, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China
Hong Peng State Key Laboratory for Biocontrol, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Sun Yat-sen University, Shenzhen, China Shenzhen Key Laboratory for Systems Medicine in Inflammatory Diseases, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China
Edward C Holmes School of Medical Sciences, The University of Sydney, Sydney, New South Wales, Australia Laboratory of Data Discovery for Health Limited, Hong Kong (SAR), China
Deyin Guo Guangzhou National Laboratory, Guangzhou International Bio-Island, Guangzhou, China State Key Laboratory of Respiratory Disease, National Clinical Research Center for Respiratory Disease, Guangzhou Institute of Respiratory Health, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, Guangdong, China
Mang Shi State Key Laboratory for Biocontrol, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Sun Yat-sen University, Shenzhen, China Shenzhen Key Laboratory for Systems Medicine in Inflammatory Diseases, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, China Guangdong Provincial Center for Disease Control and Prevention, Guangzhou, China

Ciuchcinski K, Stokke R, Steen IH, Dziewit L. Landscape of the metaplasmidome of deep-sea hydrothermal vents located at Arctic Mid-Ocean Ridges in the Norwegian-Greenland Sea: ecological insights from comparative analysis of plasmid identification tools. FEMS Microbiol Ecol 2024;100:fiae124. [PMID: 39271469 PMCID: PMC11451466 DOI: 10.1093/femsec/fiae124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/26/2024] [Revised: 09/04/2024] [Accepted: 09/12/2024] [Indexed: 09/15/2024] Open

Espindola AS. Simulated High Throughput Sequencing Datasets: A Crucial Tool for Validating Bioinformatic Pathogen Detection Pipelines. Biology (Basel) 2024;13:700. [PMID: 39336128 PMCID: PMC11428249 DOI: 10.3390/biology13090700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/23/2024] [Revised: 09/03/2024] [Accepted: 09/03/2024] [Indexed: 09/30/2024]

Hera MR, Liu S, Wei W, Rodriguez JS, Ma C, Koslicki D. Metagenomic functional profiling: to sketch or not to sketch? Bioinformatics 2024;40:ii165-ii173. [PMID: 39230701 PMCID: PMC11373326 DOI: 10.1093/bioinformatics/btae397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/05/2024] Open

Sanguineti D, Zampieri G, Treu L, Campanaro S. Metapresence: a tool for accurate species detection in metagenomics based on the genome-wide distribution of mapping reads. mSystems 2024;9:e0021324. [PMID: 38980053 PMCID: PMC11338496 DOI: 10.1128/msystems.00213-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/14/2024] [Accepted: 06/15/2024] [Indexed: 07/10/2024] Open

Abstract

Shotgun metagenomics allows comprehensive sampling of the genomic information of microbes in a given environment and is a tool of choice for studying complex microbial systems. Mapping sequencing reads against a set of reference or metagenome-assembled genomes is in principle a simple and powerful approach to define the species-level composition of the microbial community under investigation. However, despite the widespread use of this approach, there is no established way to properly interpret the alignment results, with arbitrary relative abundance thresholds being routinely used to discriminate between present and absent species. Such an approach can be affected by significant biases, especially in the identification of rare species. Therefore, it is important to develop new metrics to overcome these biases. Here, we present Metapresence, a new tool to perform reliable identification of the species in metagenomic samples based on the distribution of mapped reads on the reference genomes. The analysis is based on two metrics describing the breadth of coverage and the genomic distance between consecutive reads. We demonstrate the high precision and wide applicability of the tool using data from various synthetic communities, a real mock community, and the gut microbiome of healthy individuals and antibiotic-associated-diarrhea patients. Overall, our results suggest that the proposed approach has a robust performance in hard-to-analyze microbial communities containing contaminated or closely related genomes in low abundance.

Importance

Despite the prevalent use of genome-centric alignment-based methods to characterize microbial community composition, there lacks a standardized approach for accurately identifying the species within a sample. Currently, arbitrary relative abundance thresholds are commonly employed for this purpose. However, due to the inherent complexity of genome structure and biases associated with genome-centric approaches, this practice tends to be imprecise. Notably, it introduces significant biases, particularly in the identification of rare species. The method presented here addresses these limitations and contributes significantly to overcoming inaccuracies in precisely defining community composition, especially when dealing with rare members.
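
The abstract above names two quantities, breadth of coverage and the genomic distance between consecutive reads. As a rough illustration only, and not Metapresence's own implementation or exact metric definitions, the following Python sketch computes generic versions of both from a list of mapped read intervals. The function names, the half-open `(start, end)` interval convention, and the use of a simple mean gap are all assumptions made for this example.

```python
from typing import List, Tuple

Read = Tuple[int, int]  # assumed convention: half-open [start, end), 0-based


def breadth_of_coverage(reads: List[Read], genome_length: int) -> float:
    """Fraction of genome positions covered by at least one read."""
    covered = 0
    current_end = 0  # rightmost position already counted as covered
    for start, end in sorted(reads):
        start = max(start, current_end)  # skip the part already counted
        end = min(end, genome_length)    # clip reads that overhang the genome
        if end > start:
            covered += end - start
            current_end = end
    return covered / genome_length


def mean_consecutive_distance(reads: List[Read], genome_length: int) -> float:
    """Mean gap between start positions of consecutive reads.

    With fewer than two reads there is no gap to measure, so this
    illustrative version falls back to the genome length.
    """
    starts = sorted(start for start, _ in reads)
    if len(starts) < 2:
        return float(genome_length)
    gaps = [b - a for a, b in zip(starts, starts[1:])]
    return sum(gaps) / len(gaps)


# Hypothetical toy data: three reads on a 1000 bp genome.
example_reads = [(0, 100), (50, 150), (400, 500)]
example_breadth = breadth_of_coverage(example_reads, 1000)
example_distance = mean_consecutive_distance(example_reads, 1000)
```

In this toy case the reads cover 250 of 1000 positions, and the read starts are 50 and 350 bases apart. Both values are simple summaries of how mapped reads are spread along a genome; how such values are turned into a presence or absence call is not specified in the abstract and is not attempted here.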

Mallawaarachchi V, Wickramarachchi A, Xue H, Papudeshi B, Grigson SR, Bouras G, Prahl RE, Kaphle A, Verich A, Talamantes-Becerra B, Dinsdale EA, Edwards RA. Solving genomic puzzles: computational methods for metagenomic binning. Brief Bioinform 2024;25:bbae372. [PMID: 39082646 PMCID: PMC11289683 DOI: 10.1093/bib/bbae372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/18/2024] [Revised: 06/05/2024] [Accepted: 07/15/2024] [Indexed: 08/03/2024] Open

Affiliation(s)

Vijini Mallawaarachchi Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
Anuradha Wickramarachchi Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
Hansheng Xue School of Computing, National University of Singapore, Singapore 119077, Singapore
Bhavya Papudeshi Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
Susanna R Grigson Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
George Bouras Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, SA 5005, Australia The Department of Surgery—Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, Adelaide, SA 5011, Australia
Rosa E Prahl Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
Anubhav Kaphle Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
Andrey Verich Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia The Kirby Institute, The University of New South Wales, Randwick, Sydney, NSW 2052, Australia
Berenice Talamantes-Becerra Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
Elizabeth A Dinsdale Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
Robert A Edwards Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia

Zhang Z, Xiao J, Wang H, Yang C, Huang Y, Yue Z, Chen Y, Han L, Yin K, Lyu A, Fang X, Zhang L. Exploring high-quality microbial genomes by assembling short-reads with long-range connectivity. Nat Commun 2024;15:4631. [PMID: 38821971 PMCID: PMC11143213 DOI: 10.1038/s41467-024-49060-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/20/2023] [Accepted: 05/17/2024] [Indexed: 06/02/2024] Open

Yu R, Huang Z, Lam TYC, Sun Y. Utilizing profile hidden Markov model databases for discovering viruses from metagenomic data: a comprehensive review. Brief Bioinform 2024;25:bbae292. [PMID: 39003531 PMCID: PMC11246558 DOI: 10.1093/bib/bbae292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/01/2024] [Revised: 05/19/2024] [Accepted: 06/04/2024] [Indexed: 07/15/2024] Open

Pinto Y, Chakraborty M, Jain N, Bhatt AS. Phage-inclusive profiling of human gut microbiomes with Phanta. Nat Biotechnol 2024;42:651-662. [PMID: 37231259 DOI: 10.1038/s41587-023-01799-4] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Received: 08/04/2022] [Accepted: 04/20/2023] [Indexed: 05/27/2023]

Sepich-Poore GD, McDonald D, Kopylova E, Guccione C, Zhu Q, Austin G, Carpenter C, Fraraccio S, Wandro S, Kosciolek T, Janssen S, Metcalf JL, Song SJ, Kanbar J, Miller-Montgomery S, Heaton R, Mckay R, Patel SP, Swafford AD, Korem T, Knight R. Robustness of cancer microbiome signals over a broad range of methodological variation. Oncogene 2024;43:1127-1148. [PMID: 38396294 PMCID: PMC10997506 DOI: 10.1038/s41388-024-02974-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/23/2023] [Revised: 02/03/2024] [Accepted: 02/07/2024] [Indexed: 02/25/2024]

Affiliation(s)

Gregory D Sepich-Poore Department of Bioengineering, University of California San Diego, La Jolla, CA, USA Micronoma, San Diego, CA, USA Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
Daniel McDonald Department of Pediatrics, University of California San Diego, La Jolla, CA, USA
Evguenia Kopylova Department of Pediatrics, University of California San Diego, La Jolla, CA, USA Clarity Genomics, Antwerp, Belgium
Caitlin Guccione Department of Pediatrics, University of California San Diego, La Jolla, CA, USA
Qiyun Zhu Department of Pediatrics, University of California San Diego, La Jolla, CA, USA School of Life Sciences, Arizona State University, Tempe, AZ, USA
George Austin Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA Program for Mathematical Genomics, Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
Carolina Carpenter Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA
Serena Fraraccio Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA Micronoma, San Diego, CA, USA
Stephen Wandro Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA Micronoma, San Diego, CA, USA
Tomasz Kosciolek Department of Pediatrics, University of California San Diego, La Jolla, CA, USA Malopolska Centre of Biotechnology, Jagiellonian University in Kraków, Kraków, Poland
Stefan Janssen Department of Pediatrics, University of California San Diego, La Jolla, CA, USA Algorithmic Bioinformatics, Department of Biology and Chemistry, Justus Liebig University Gießen, Gießen, Germany
Jessica L Metcalf Department of Animal Sciences, Colorado State University, Fort Collins, CO, USA
Se Jin Song Department of Pediatrics, University of California San Diego, La Jolla, CA, USA Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA
Jad Kanbar Department of Medicine, University of California San Diego, La Jolla, CA, USA
Sandrine Miller-Montgomery Department of Bioengineering, University of California San Diego, La Jolla, CA, USA Micronoma, San Diego, CA, USA
Robert Heaton Department of Psychiatry, University of California San Diego, La Jolla, CA, USA
Rana Mckay Moores Cancer Center, University of California San Diego Health, La Jolla, CA, USA
Sandip Pravin Patel Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA Moores Cancer Center, University of California San Diego Health, La Jolla, CA, USA
Austin D Swafford Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA
Tal Korem Program for Mathematical Genomics, Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA Department of Obstetrics and Gynecology, Columbia University Irving Medical Center, New York, NY, USA
Rob Knight Department of Bioengineering, University of California San Diego, La Jolla, CA, USA. Department of Pediatrics, University of California San Diego, La Jolla, CA, USA. Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA. Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA.

Qiu Z, Yuan L, Lian CA, Lin B, Chen J, Mu R, Qiao X, Zhang L, Xu Z, Fan L, Zhang Y, Wang S, Li J, Cao H, Li B, Chen B, Song C, Liu Y, Shi L, Tian Y, Ni J, Zhang T, Zhou J, Zhuang WQ, Yu K. BASALT refines binning from metagenomic data and increases resolution of genome-resolved metagenomic analysis. Nat Commun 2024;15:2179. [PMID: 38467684 PMCID: PMC10928208 DOI: 10.1038/s41467-024-46539-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/10/2023] [Accepted: 03/01/2024] [Indexed: 03/13/2024] Open

Affiliation(s)

Zhiguang Qiu Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
Li Yuan AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China School of Electronic and Computer Engineering, Peking University, Shenzhen, China Peng Cheng Laboratory, Shenzhen, China
Chun-Ang Lian Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
Bin Lin School of Electronic and Computer Engineering, Peking University, Shenzhen, China
Jie Chen AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China School of Electronic and Computer Engineering, Peking University, Shenzhen, China Peng Cheng Laboratory, Shenzhen, China
Rong Mu Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
Xuejiao Qiao Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
Liyu Zhang Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
Zheng Xu Southern University of Sciences and Technology Yantian Hospital, Shenzhen, China Institute of Biomedicine and Biotechnology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong, China
Lu Fan Department of Ocean Science and Engineering, Southern University of Science and Technology (SUSTech), Shenzhen, China
Yunzeng Zhang Joint International Research Laboratory of Agriculture and Agri-Product Safety, the Ministry of Education of China, Yangzhou University, Yangzhou, China
Shanquan Wang Environmental Microbiomics Research Center, School of Environmental Science and Engineering, Sun Yat-Sen University, Guangzhou, China
Junyi Li School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, Guangdong, China
Huiluo Cao Department of Microbiology, University of Hong Kong, Hong Kong, China
Bing Li Shenzhen International Graduate School, Tsinghua University, Shenzhen, China
Baowei Chen Guangdong Provincial Key Laboratory of Marine Resources and Coastal Engineering, School of Marine Sciences, Sun Yat-sen University, Zhuhai, China
Chi Song Institute of Herbgenomics, Chengdu University of Traditional Chinese Medicine, Chengdu, China Wuhan Benagen Technology Co., Ltd, Wuhan, China
Yongxin Liu Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
Lili Shi AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China State Key Laboratory of Chemical Oncogenomics, School of Chemical Biology and Biotechnology, Peking University Shenzhen Graduate School, Shenzhen, China
Yonghong Tian AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China School of Electronic and Computer Engineering, Peking University, Shenzhen, China Peng Cheng Laboratory, Shenzhen, China
Jinren Ni Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China College of Environmental Sciences and Engineering, Key Laboratory of Water and Sediment Sciences, Ministry of Education, Peking University, Beijing, China
Tong Zhang Department of Civil Engineering, University of Hong Kong, Hong Kong, China
Jizhong Zhou Institute for Environmental Genomics, University of Oklahoma, Norman, OK, USA
Wei-Qin Zhuang Department of Civil and Environmental Engineering, Faculty of Engineering, University of Auckland, Auckland, New Zealand
Ke Yu Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China. AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China.

Hui X, Yang J, Sun J, Liu F, Pan W. MCSS: microbial community simulator based on structure. Front Microbiol 2024;15:1358257. [PMID: 38516019 PMCID: PMC10956353 DOI: 10.3389/fmicb.2024.1358257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/20/2023] [Accepted: 02/20/2024] [Indexed: 03/23/2024] Open

Matchado MS, Rühlemann M, Reitmeier S, Kacprowski T, Frost F, Haller D, Baumbach J, List M. On the limits of 16S rRNA gene-based metagenome prediction and functional profiling. Microb Genom 2024;10:001203. [PMID: 38421266 PMCID: PMC10926695 DOI: 10.1099/mgen.0.001203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/24/2023] [Accepted: 02/05/2024] [Indexed: 03/02/2024] Open

Valencia EM, Maki KA, Dootz JN, Barb JJ. Mock community taxonomic classification performance of publicly available shotgun metagenomics pipelines. Sci Data 2024;11:81. [PMID: 38233447 PMCID: PMC10794705 DOI: 10.1038/s41597-023-02877-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/26/2023] [Accepted: 12/22/2023] [Indexed: 01/19/2024] Open

Steinke K, Pamp SJ, Munk P. MAGICIAN: MAG simulation for investigating criteria for bioinformatic analysis. BMC Genomics 2024;25:55. [PMID: 38216924 PMCID: PMC10785454 DOI: 10.1186/s12864-023-09912-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/28/2023] [Accepted: 12/15/2023] [Indexed: 01/14/2024] Open

Abstract

BACKGROUND

The possibility of recovering metagenome-assembled genomes (MAGs) from sequence reads allows for further insights into microbial communities and their members, possibly even analyzing such sequences with tools designed for single-isolate genomes. As result quality depends on sequence quality, performance of tools for single-isolate genomes on MAGs should be tested beforehand. Bioinformatics can be leveraged to quickly create varied synthetic test sets with known composition for this purpose.

RESULTS

We present MAGICIAN, a flexible, user-friendly pipeline for the simulation of MAGs. MAGICIAN combines a synthetic metagenome simulator with a metagenomic assembly and binning pipeline to simulate MAGs based on user-supplied input genomes, allowing users to test performance of tools on MAGs while having a ground truth to compare results to. Using MAGICIAN, we found that even very slight (1%) changes in depth of coverage can drastically affect whether a genome can be recovered. We also demonstrate the use of simulated MAGs by evaluating the suitability of such genomes obtained with MAGICIAN's current default pipeline for analysis with the antimicrobial resistance gene identification tool ResFinder.

CONCLUSIONS

Using MAGICIAN, it is possible to simulate MAGs which, while generally high in quality, reflect issues encountered with real-world data, thus providing realistic best-case data. Evaluating the results of ResFinder analysis of these genomes revealed a risk for plausible-looking false positives, which underlines the need for pipeline validation so that researchers are aware of the potential issues when interpreting real-world data. Furthermore, the effects of fluctuations in depth of coverage on genome recovery in our simulated "random sequencing" warrant further investigation and indicate random subsampling of reads may affect discovery of more genomes.

Baud A, Kennedy SP. Targeted Metagenomic Databases Provide Improved Analysis of Microbiota Samples. Microorganisms 2024;12:135. [PMID: 38257962 PMCID: PMC10819777 DOI: 10.3390/microorganisms12010135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/14/2023] [Revised: 12/15/2023] [Accepted: 12/28/2023] [Indexed: 01/24/2024] Open

Kim N, Kim CY, Ma J, Yang S, Park DJ, Ha SJ, Belenky P, Lee I. MRGM: an enhanced catalog of mouse gut microbial genomes substantially broadening taxonomic and functional landscapes. Gut Microbes 2024;16:2393791. [PMID: 39230075 PMCID: PMC11376411 DOI: 10.1080/19490976.2024.2393791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/10/2024] [Revised: 08/12/2024] [Accepted: 08/13/2024] [Indexed: 09/05/2024] Open

Kang X, Xu J, Luo X, Schönhuth A. Hybrid-hybrid correction of errors in long reads with HERO. Genome Biol 2023;24:275. [PMID: 38041098 PMCID: PMC10690975 DOI: 10.1186/s13059-023-03112-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/03/2023] [Accepted: 11/16/2023] [Indexed: 12/03/2023] Open

Walsh LH, Coakley M, Walsh AM, O'Toole PW, Cotter PD. Bioinformatic approaches for studying the microbiome of fermented food. Crit Rev Microbiol 2023;49:693-725. [PMID: 36287644 DOI: 10.1080/1040841x.2022.2132850] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 05/11/2022] [Revised: 08/11/2022] [Accepted: 09/28/2022] [Indexed: 11/03/2022]

Huttenhower C, Finn RD, McHardy AC. Challenges and opportunities in sharing microbiome data and analyses. Nat Microbiol 2023;8:1960-1970. [PMID: 37783751 DOI: 10.1038/s41564-023-01484-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Received: 10/16/2021] [Accepted: 08/28/2023] [Indexed: 10/04/2023]

Park H, Lim SJ, Cosme J, O'Connell K, Sandeep J, Gayanilo F, Cutter Jr. GR, Montes E, Nitikitpaiboon C, Fisher S, Moustahfid H, Thompson LR. Investigation of machine learning algorithms for taxonomic classification of marine metagenomes. Microbiol Spectr 2023;11:e0523722. [PMID: 37695074 PMCID: PMC10580933 DOI: 10.1128/spectrum.05237-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/21/2022] [Accepted: 06/30/2023] [Indexed: 09/12/2023] Open

Affiliation(s)

Helen Park Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua-Peking Center for Life Sciences, Tsinghua University, Beijing, China EPSRC/BBSRC Future Biomanufacturing Research Hub, EPSRC Synthetic Biology Research Centre SYNBIOCHEM Manchester Institute of Biotechnology and School of Chemistry, The University of Manchester, Manchester, United Kingdom
Shen Jean Lim Cooperative Institute for Marine and Atmospheric Studies, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, Florida, USA Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, Florida, USA College of Marine Science, University of South Florida, St Petersburg, Florida, USA
Jonathan Cosme Run:AI, Office of the CTO, Tel Aviv, Israel
Kyle O'Connell Deloitte Consulting LLP, Biomedical Data Science Team, Arlington, Virginia, USA Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Northwest, Washington, DC, USA
Jilla Sandeep Harte Research Institute, Texas A&M University-Corpus Christi, Corpus Christi, Texas, USA
Felimon Gayanilo Harte Research Institute, Texas A&M University-Corpus Christi, Corpus Christi, Texas, USA
George R. Cutter Jr. Southwest Fisheries Science Center, Antarctic Ecosystem Research Division, National Oceanic and Atmospheric Administration, La Jolla, California, USA
Enrique Montes Cooperative Institute for Marine and Atmospheric Studies, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, Florida, USA Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, Florida, USA
Chotinan Nitikitpaiboon Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan
Sam Fisher Deloitte Consulting LLP, Biomedical Data Science Team, Arlington, Virginia, USA
Hassan Moustahfid NOAA/US Integrated Ocean Observing System (IOOS), Silver Spring, Maryland, USA
Luke R. Thompson Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, Florida, USA Northern Gulf Institute, Mississippi State University, Mississippi, USA

Trinh P, Clausen DS, Willis AD. happi: a hierarchical approach to pangenomics inference. Genome Biol 2023;24:214. [PMID: 37773075 PMCID: PMC10540326 DOI: 10.1186/s13059-023-03040-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/04/2022] [Accepted: 08/16/2023] [Indexed: 09/30/2023] Open

Price C, Russell JA. AMAnD: an automated metagenome anomaly detection methodology utilizing DeepSVDD neural networks. Front Public Health 2023;11:1181911. [PMID: 37497030 PMCID: PMC10368493 DOI: 10.3389/fpubh.2023.1181911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/08/2023] [Accepted: 06/12/2023] [Indexed: 07/28/2023] Open

Abstract

The composition of metagenomic communities within the human body often reflects localized medical conditions such as upper respiratory diseases and gastrointestinal diseases. Fast and accurate computational tools to flag anomalous metagenomic samples from typical samples are desirable to understand different phenotypes, especially in contexts where repeated, long-duration temporal sampling is done. Here, we present Automated Metagenome Anomaly Detection (AMAnD), which utilizes two types of Deep Support Vector Data Description (DeepSVDD) models; one trained on taxonomic feature space output by the Pan-Genomics for Infectious Agents (PanGIA) taxonomy classifier and one trained on kmer frequency counts. AMAnD's semi-supervised one-class approach makes no assumptions about what an anomaly may look like, allowing the flagging of potentially novel anomaly types. Three diverse datasets are profiled. The first dataset is hosted on the National Center for Biotechnology Information's (NCBI) Sequence Read Archive (SRA) and contains nasopharyngeal swabs from healthy and COVID-19-positive patients. The second dataset is also hosted on SRA and contains gut microbiome samples from normal controls and from patients with slow transit constipation (STC). AMAnD can learn a typical healthy nasopharyngeal or gut microbiome profile and reliably flag the anomalous COVID+ or STC samples in both feature spaces. The final dataset is a synthetic metagenome created by the Critical Assessment of Metagenome Annotation Simulator (CAMISIM). A control dataset of 50 well-characterized organisms was submitted to CAMISIM to generate 100 synthetic control class samples. The experimental conditions included 12 different spiked-in contaminants that are taxonomically similar to organisms present in the laboratory blank sample ranging from one strain tree branch taxonomic distance away to one family tree branch taxonomic distance away. This experiment was repeated in triplicate at three different coverage levels to probe the dependence on sample coverage. AMAnD was again able to flag the contaminant inserts as anomalous. AMAnD's assumption-free flagging of metagenomic anomalies, the real-time model training update potential of the deep learning approach, and the strong performance even with lightweight models of low sample cardinality would make AMAnD well-suited to a wide array of applied metagenomics biosurveillance use-cases, from environmental to clinical utility.
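One of the two feature spaces mentioned in the abstract above is kmer frequency counts. The Python sketch below shows one simple way such a feature vector could be computed from a nucleotide sequence. It is an illustration only: the function name, the default k of 3, the normalization to relative frequencies, and the skipping of k-mers containing non-ACGT characters are assumptions, since the abstract does not describe AMAnD's actual featurization.

```python
from collections import Counter
from itertools import product
from typing import Dict


def kmer_frequencies(sequence: str, k: int = 3) -> Dict[str, float]:
    """Relative frequency of every possible DNA k-mer in a sequence.

    Hypothetical helper: windows containing characters other than
    A, C, G, T (for example N) are skipped.
    """
    sequence = sequence.upper()
    alphabet = set("ACGT")
    counts = Counter(
        sequence[i:i + k]
        for i in range(len(sequence) - k + 1)
        if set(sequence[i:i + k]) <= alphabet
    )
    total = sum(counts.values())
    # Emit all 4**k k-mers so every sample yields a vector of equal length.
    return {
        "".join(kmer): (counts["".join(kmer)] / total if total else 0.0)
        for kmer in product("ACGT", repeat=k)
    }


if __name__ == "__main__":
    features = kmer_frequencies("ACGTACGT", k=2)
    print(len(features))   # number of possible 2-mers
    print(features["AC"])  # share of windows equal to "AC"
```

Returning a value for every possible k-mer, including those with zero count, keeps the vector length fixed across samples, which is what a model expecting a fixed-size input would need.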

Zhou B, Li H. STEMSIM: a simulator of within-strain short-term evolutionary mutations for longitudinal metagenomic data. Bioinformatics 2023;39:btad302. [PMID: 37154701 PMCID: PMC10188296 DOI: 10.1093/bioinformatics/btad302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/21/2022] [Revised: 03/29/2023] [Accepted: 04/29/2023] [Indexed: 05/10/2023] Open

Mineeva O, Danciu D, Schölkopf B, Ley RE, Rätsch G, Youngblut ND. ResMiCo: Increasing the quality of metagenome-assembled genomes with deep learning. PLoS Comput Biol 2023;19:e1011001. [PMID: 37126495 PMCID: PMC10174551 DOI: 10.1371/journal.pcbi.1011001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 06/15/2022] [Revised: 05/11/2023] [Accepted: 03/06/2023] [Indexed: 05/02/2023] Open

García Mendez D, Sanabria J, Wist J, Holmes E. Effect of Operational Parameters on the Cultivation of the Gut Microbiome in Continuous Bioreactors Inoculated with Feces: A Systematic Review. J Agric Food Chem 2023;71:6213-6225. [PMID: 37070710 PMCID: PMC10143624 DOI: 10.1021/acs.jafc.2c08146] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 11/20/2022] [Revised: 01/27/2023] [Accepted: 01/27/2023] [Indexed: 05/03/2023]

Yang C, Lo T, Nip KM, Hafezqorani S, Warren RL, Birol I. Characterization and simulation of metagenomic nanopore sequencing data with Meta-NanoSim. Gigascience 2023;12:giad013. [PMID: 36939007 PMCID: PMC10025935 DOI: 10.1093/gigascience/giad013] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 10/06/2022] [Revised: 01/19/2023] [Accepted: 02/17/2023] [Indexed: 03/21/2023] Open

Gabrielli M, Dai Z, Delafont V, Timmers PHA, van der Wielen PWJJ, Antonelli M, Pinto AJ. Identifying Eukaryotes and Factors Influencing Their Biogeography in Drinking Water Metagenomes. Environ Sci Technol 2023;57:3645-3660. [PMID: 36827617 PMCID: PMC9996835 DOI: 10.1021/acs.est.2c09010] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 11/29/2022] [Revised: 02/13/2023] [Accepted: 02/13/2023] [Indexed: 06/18/2023]

Jurado-Rueda F, Alonso-Guirado L, Perea-Chamblee TE, Elliott OT, Filip I, Rabadán R, Malats N. Benchmarking of microbiome detection tools on RNA-seq synthetic databases according to diverse conditions. Bioinform Adv 2023;3:vbad014. [PMID: 36874954 PMCID: PMC9976984 DOI: 10.1093/bioadv/vbad014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/15/2022] [Revised: 11/15/2022] [Accepted: 02/03/2023] [Indexed: 02/24/2023]

Metagenomic Antimicrobial Susceptibility Testing from Simulated Native Patient Samples. Antibiotics (Basel) 2023;12:antibiotics12020366. [PMID: 36830277 PMCID: PMC9952719 DOI: 10.3390/antibiotics12020366] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 01/17/2023] [Revised: 02/06/2023] [Accepted: 02/08/2023] [Indexed: 02/12/2023] Open

Martin S, Ayling M, Patrono L, Caccamo M, Murcia P, Leggett RM. Capturing variation in metagenomic assembly graphs with MetaCortex. Bioinformatics 2023;39:6986127. [PMID: 36722204 PMCID: PMC9889960 DOI: 10.1093/bioinformatics/btad020] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 06/20/2022] [Revised: 11/10/2022] [Accepted: 01/11/2023] [Indexed: 01/13/2023] Open

Salazar VW, Shaban B, Quiroga MDM, Turnbull R, Tescari E, Rossetto Marcelino V, Verbruggen H, Lê Cao KA. Metaphor-A workflow for streamlined assembly and binning of metagenomes. Gigascience 2022;12:giad055. [PMID: 37522759 PMCID: PMC10388702 DOI: 10.1093/gigascience/giad055] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 03/20/2023] [Revised: 06/05/2023] [Accepted: 07/04/2023] [Indexed: 08/01/2023] Open

Mendes CI, Vila-Cerqueira P, Motro Y, Moran-Gilad J, Carriço JA, Ramirez M. LMAS: evaluating metagenomic short de novo assembly methods through defined communities. Gigascience 2022;12:giac122. [PMID: 36576131 PMCID: PMC9795473 DOI: 10.1093/gigascience/giac122] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 05/04/2022] [Revised: 09/26/2022] [Accepted: 11/16/2022] [Indexed: 12/29/2022] Open

Zhu Y, Shang J, Peng C, Sun Y. Phage family classification under Caudoviricetes: A review of current tools using the latest ICTV classification framework. Front Microbiol 2022;13:1032186. [PMID: 36590402 PMCID: PMC9800612 DOI: 10.3389/fmicb.2022.1032186] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Received: 08/30/2022] [Accepted: 11/29/2022] [Indexed: 12/23/2022] Open

Mining of novel secondary metabolite biosynthetic gene clusters from acid mine drainage. Sci Data 2022;9:760. [PMID: 36494363 PMCID: PMC9734747 DOI: 10.1038/s41597-022-01866-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/19/2021] [Accepted: 11/23/2022] [Indexed: 12/13/2022] Open

Liu Y, Elworth RAL, Jochum MD, Aagaard KM, Treangen TJ. De novo identification of microbial contaminants in low microbial biomass microbiomes with Squeegee. Nat Commun 2022;13:6799. [PMID: 36357382 PMCID: PMC9649624 DOI: 10.1038/s41467-022-34409-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 05/08/2021] [Accepted: 10/25/2022] [Indexed: 11/12/2022] Open

VeChat: correcting errors in long reads using variation graphs. Nat Commun 2022;13:6657. [PMID: 36333324 PMCID: PMC9636371 DOI: 10.1038/s41467-022-34381-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 03/04/2022] [Accepted: 10/24/2022] [Indexed: 11/06/2022] Open