1
|
Pujolassos M, Susín A, Calle M. Microbiome compositional data analysis for survival studies. NAR Genom Bioinform 2024; 6:lqae038. [PMID: 38666212 PMCID: PMC11044448 DOI: 10.1093/nargab/lqae038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 01/23/2024] [Accepted: 04/08/2024] [Indexed: 04/28/2024] Open
Abstract
The growing interest in studying the relationship between the human microbiome and our health has also extended to time-to-event studies where researchers explore the connection between the microbiome and the occurrence of a specific event of interest. The analysis of microbiome obtained through high throughput sequencing techniques requires the use of specialized Compositional Data Analysis (CoDA) methods designed to accommodate its compositional nature. There is a limited availability of statistical tools for microbiome analysis that incorporate CoDA, and this is even more pronounced in the context of survival analysis. To fill this methodological gap, we present coda4microbiome for survival studies, a new methodology for the identification of microbial signatures in time-to-event studies. The algorithm implements an elastic-net penalized Cox regression model adapted to compositional covariates. We illustrate coda4microbiome algorithm for survival studies with a case study about the time to develop type 1 diabetes for non-obese diabetic mice. Our algorithm identified a bacterial signature composed of 21 genera associated with diabetes development. coda4microbiome for survival studies is integrated in the R package coda4microbiome as an extension of the existing functions for cross-sectional and longitudinal studies.
Collapse
Affiliation(s)
- Meritxell Pujolassos
- Bioscience Department, Faculty of Sciences, Technology and Engineering, University of Vic – Central University of Catalunya, Vic 08500, Spain
| | - Antoni Susín
- Mathematical Department, UPC-Barcelona Tech, Barcelona 08034, Spain
| | - M.Luz Calle
- Bioscience Department, Faculty of Sciences, Technology and Engineering, University of Vic – Central University of Catalunya, Vic 08500, Spain
- Institut de Recerca i Innovació en Ciències de la Vida i de la Salut a la Catalunya Central (IRIS-CC), Vic 08500, Spain
| |
Collapse
|
2
|
Muñoz-Fernandez SS, Garcez FB, Alencar JCG, Bastos AA, Morley JE, Cederholm T, Aprahamian I, de Souza HP, Avelino-Silva TJ, Bindels LB, Ribeiro SML. Gut microbiota disturbances in hospitalized older adults with malnutrition and clinical outcomes. Nutrition 2024; 122:112369. [PMID: 38422755 DOI: 10.1016/j.nut.2024.112369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 01/15/2024] [Accepted: 01/23/2024] [Indexed: 03/02/2024]
Abstract
OBJECTIVE Malnutrition is one of the most threatening conditions in geriatric populations. The gut microbiota has an important role in the host's metabolic and muscular health: however, its interplay with disease-related malnutrition is not well understood. We aimed to identify the association of malnutrition with the gut microbiota and predict clinical outcomes in hospitalized acutely ill older adults. METHODS We performed a secondary longitudinal analysis in 108 geriatric patients from a prospective cohort evaluated at admission and 72 h of hospitalization. We collected clinical, demographic, nutritional, and 16S rRNA gene-sequenced gut microbiota data. Microbiota diversity, overall composition, and differential abundance were calculated and compared between patients with and without malnutrition. Microbiota features associated with malnutrition were used to predict clinical outcomes. RESULTS Patients with malnutrition (51%) had a different microbiota composition compared to those who were well-nourished during hospitalization (ANOSIM R = 0.079, P = 0.003). Patients with severe malnutrition showed poorer α-diversity at admission (Shannon P = 0.012, Simpson P = 0.018) and follow-up (Shannon P = 0.023, Chao1 P = 0.008). Differential abundance of Lachnospiraceae NK4A136 group, Subdoligranulum, and Faecalibacterium prausnitzii were significantly lower and inversely associated with malnutrition, while Corynebacterium, Ruminococcaceae Incertae Sedis, and Fusobacterium were significantly increased and positively associated with malnutrition. Corynebacterium, Ruminococcaceae Incertae Sedis, and the overall composition were important predictors of critical care in patients with malnutrition during hospitalization. CONCLUSION Older adults with malnutrition, especially in a severe stage, may be subject to substantial gut microbial disturbances during hospitalization. The gut microbiota profile of patients with malnutrition might help us to predict worse clinical outcomes.
Collapse
Affiliation(s)
- Shirley S Muñoz-Fernandez
- Nutrition Department, School of Public Health, University of São Paulo, São Paulo, Sao Paulo, Brazil.
| | - Flavia B Garcez
- Laboratorio de Investigacao Medica em Envelhecimento (LIM 66), Servico de Geriatria, Hospital das Clinicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Sao Paulo, Brazil; Departamento de Medicina, Hospital Universitario, Universidade Federal de Sergipe, Aracaju, Sergipe, Brazil
| | - Julio C G Alencar
- Disciplina de Emergencias Clínicas, Departamento de Clínica Medica, Faculty of Medicine, University of São Paulo, São Paulo, Sao Paulo, Brazil
| | - Amália A Bastos
- Nutrition Department, School of Public Health, University of São Paulo, São Paulo, Sao Paulo, Brazil
| | - John E Morley
- Division of Geriatric Medicine, School of Medicine, Saint Louis University, St. Louis, Missouri, USA
| | - Tommy Cederholm
- Department of Public Health and Caring Sciences, Clinical Nutrition and Metabolism, Uppsala University, Uppsala, Sweden; Karolinska University Hospital, Stockholm, Sweden
| | - Ivan Aprahamian
- Division of Geriatrics, Department of Internal Medicine, Jundiaí Medical School, Group of Investigation on Multimorbidity and Mental Health in Aging (GIMMA), Jundiaí, Sao Paulo, Brazil
| | - Heraldo P de Souza
- Disciplina de Emergencias Clínicas, Departamento de Clínica Medica, Faculty of Medicine, University of São Paulo, São Paulo, Sao Paulo, Brazil
| | - Thiago J Avelino-Silva
- Laboratorio de Investigacao Medica em Envelhecimento (LIM 66), Servico de Geriatria, Hospital das Clinicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Sao Paulo, Brazil
| | - Laure B Bindels
- Metabolism and Nutrition Research Group, Louvain Drug Research Institute, Université catholique de Louvain, Brussels, Belgium
| | - Sandra M L Ribeiro
- Nutrition Department, School of Public Health, University of São Paulo, São Paulo, Sao Paulo, Brazil; School of Arts, Science, and Humanity, University of São Paulo, São Paulo, Sao Paulo, Brazil
| |
Collapse
|
3
|
Genius P, Calle ML, Rodríguez-Fernández B, Minguillon C, Cacciaglia R, Garrido-Martin D, Esteller M, Navarro A, Gispert JD, Vilor-Tejedor N. Compositional structural brain signatures capture Alzheimer's genetic risk on brain structure along the disease continuum. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.05.08.24307046. [PMID: 38766190 PMCID: PMC11100942 DOI: 10.1101/2024.05.08.24307046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]
Abstract
INTRODUCTION Traditional brain imaging genetics studies have primarily focused on how genetic factors influence the volume of specific brain regions, often neglecting the overall complexity of brain architecture and its genetic underpinnings. METHODS This study analyzed data from participants across the Alzheimer's disease (AD) continuum from the ALFA and ADNI studies. We exploited compositional data analysis to examine relative brain volumetric variations that (i) differentiate cognitively unimpaired (CU) individuals, defined as amyloid-negative (A-) based on CSF profiling, from those at different AD stages, and (ii) associated with increased genetic susceptibility to AD, assessed using polygenic risk scores. RESULTS Distinct brain signatures differentiated CU A-individuals from amyloid-positive MCI and AD. Moreover, disease stage-specific signatures were associated with higher genetic risk of AD. DISCUSSION The findings underscore the complex interplay between genetics and disease stages in shaping brain structure, which could inform targeted preventive strategies and interventions in preclinical AD.
Collapse
|
4
|
Aqueel R, Badar A, Roy N, Mushtaq Q, Ali AF, Bashir A, Ijaz UZ, Malik KA. Cotton microbiome profiling and Cotton Leaf Curl Disease (CLCuD) suppression through microbial consortia associated with Gossypium arboreum. NPJ Biofilms Microbiomes 2023; 9:100. [PMID: 38097579 PMCID: PMC10721634 DOI: 10.1038/s41522-023-00470-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Accepted: 11/28/2023] [Indexed: 12/17/2023] Open
Abstract
The failure of breeding strategies has caused scientists to shift to other means where the new approach involves exploring the microbiome to modulate plant defense mechanisms against Cotton Leaf Curl Disease (CLCuD). The cotton microbiome of CLCuD-resistant varieties may harbor a multitude of bacterial genera that significantly contribute to disease resistance and provide information on metabolic pathways that differ between the susceptible and resistant varieties. The current study explores the microbiome of CLCuD-susceptible Gossypium hirsutum and CLCuD-resistant Gossypium arboreum using 16 S rRNA gene amplification for the leaf endophyte, leaf epiphyte, rhizosphere, and root endophyte of the two cotton species. This revealed that Pseudomonas inhabited the rhizosphere while Bacillus was predominantly found in the phyllosphere of CLCuV-resistant G. arboreum. Using salicylic acid-producing Serratia spp. and Fictibacillus spp. isolated from CLCuD-resistant G. arboreum, and guided by our analyses, we have successfully suppressed CLCuD in the susceptible G. hirsutum through pot assays. The applied strains exhibited less than 10% CLCuD incidence as compared to control group where it was 40% at 40 days post viral inoculation. Through detailed analytics, we have successfully demonstrated that the applied microbes serve as a biocontrol agent to suppress viral disease in Cotton.
Collapse
Affiliation(s)
- Rhea Aqueel
- Kauser Abdulla Malik School of Life Sciences, Forman Christian College (A Chartered University), Ferozepur Road, Lahore, 54600, Pakistan
- Water & Environment Research Group, University of Glasgow, Mazumdar-Shaw Advanced Research Centre, Glasgow, G11 6EW, UK
| | - Ayesha Badar
- Kauser Abdulla Malik School of Life Sciences, Forman Christian College (A Chartered University), Ferozepur Road, Lahore, 54600, Pakistan
| | - Nazish Roy
- Kauser Abdulla Malik School of Life Sciences, Forman Christian College (A Chartered University), Ferozepur Road, Lahore, 54600, Pakistan
| | - Qandeel Mushtaq
- Kauser Abdulla Malik School of Life Sciences, Forman Christian College (A Chartered University), Ferozepur Road, Lahore, 54600, Pakistan
| | - Aimen Fatima Ali
- Kauser Abdulla Malik School of Life Sciences, Forman Christian College (A Chartered University), Ferozepur Road, Lahore, 54600, Pakistan
| | - Aftab Bashir
- Kauser Abdulla Malik School of Life Sciences, Forman Christian College (A Chartered University), Ferozepur Road, Lahore, 54600, Pakistan
| | - Umer Zeeshan Ijaz
- Water & Environment Research Group, University of Glasgow, Mazumdar-Shaw Advanced Research Centre, Glasgow, G11 6EW, UK.
- National University of Ireland, Galway, University Road, Galway, H91 TK33, Ireland.
- Department of Molecular and Clinical Cancer Medicine, University of Liverpool, Liverpool, L69 7BE, UK.
| | - Kauser Abdulla Malik
- Kauser Abdulla Malik School of Life Sciences, Forman Christian College (A Chartered University), Ferozepur Road, Lahore, 54600, Pakistan.
- Pakistan Academy of Sciences, Islamabad, Pakistan.
| |
Collapse
|
5
|
Swarte JC, Knobbe TJ, Björk JR, Gacesa R, Nieuwenhuis LM, Zhang S, Vila AV, Kremer D, Douwes RM, Post A, Quint EE, Pol RA, Jansen BH, de Borst MH, de Meijer VE, Blokzijl H, Berger SP, Festen EAM, Zhernakova A, Fu J, Harmsen HJM, Bakker SJL, Weersma RK. Health-related quality of life is linked to the gut microbiome in kidney transplant recipients. Nat Commun 2023; 14:7968. [PMID: 38042820 PMCID: PMC10693618 DOI: 10.1038/s41467-023-43431-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2023] [Accepted: 11/08/2023] [Indexed: 12/04/2023] Open
Abstract
Kidney transplant recipients (KTR) have impaired health-related quality of life (HRQoL) and suffer from intestinal dysbiosis. Increasing evidence shows that gut health and HRQoL are tightly related in the general population. Here, we investigate the association between the gut microbiome and HRQoL in KTR, using metagenomic sequencing data from fecal samples collected from 507 KTR. Multiple bacterial species are associated with lower HRQoL, many of which have previously been associated with adverse health conditions. Gut microbiome distance to the general population is highest among KTR with an impaired physical HRQoL (R = -0.20, P = 2.3 × 10-65) and mental HRQoL (R = -0.14, P = 1.3 × 10-3). Physical and mental HRQoL explain a significant part of variance in the gut microbiome (R2 = 0.58%, FDR = 5.43 × 10-4 and R2 = 0.37%, FDR = 1.38 × 10-3, respectively). Additionally, multiple metabolic and neuroactive pathways (gut brain modules) are associated with lower HRQoL. While the observational design of our study does not allow us to analyze causality, we provide a comprehensive overview of the associations between the gut microbiome and HRQoL while controlling for confounders.
Collapse
Affiliation(s)
- J Casper Swarte
- Department of Gastroenterology and Hepatology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
- Department of Internal Medicine, Division of Nephrology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Tim J Knobbe
- Department of Internal Medicine, Division of Nephrology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Johannes R Björk
- Department of Gastroenterology and Hepatology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Ranko Gacesa
- Department of Gastroenterology and Hepatology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Lianne M Nieuwenhuis
- Department of Gastroenterology and Hepatology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Shuyan Zhang
- Department of Gastroenterology and Hepatology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Arnau Vich Vila
- Department of Gastroenterology and Hepatology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Daan Kremer
- Department of Internal Medicine, Division of Nephrology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Rianne M Douwes
- Department of Gastroenterology and Hepatology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
- Department of Internal Medicine, Division of Nephrology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Adrian Post
- Department of Internal Medicine, Division of Nephrology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Evelien E Quint
- Department of Surgery, division of Transplantation Surgery, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Robert A Pol
- Department of Surgery, division of Transplantation Surgery, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Bernadien H Jansen
- Department of Gastroenterology and Hepatology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Martin H de Borst
- Department of Internal Medicine, Division of Nephrology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Vincent E de Meijer
- Department of Surgery, section of Hepatobiliary Surgery and Liver Transplantation, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Hans Blokzijl
- Department of Gastroenterology and Hepatology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Stefan P Berger
- Department of Internal Medicine, Division of Nephrology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Eleonora A M Festen
- Department of Gastroenterology and Hepatology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Alexandra Zhernakova
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Jingyuan Fu
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
- Department of Pediatrics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Hermie J M Harmsen
- Department of Medical Microbiology and Infection prevention, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Stephan J L Bakker
- Department of Internal Medicine, Division of Nephrology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Rinse K Weersma
- Department of Gastroenterology and Hepatology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands.
| |
Collapse
|
6
|
Thomsen M, Künstner A, Wohlers I, Olbrich M, Lenfers T, Osumi T, Shimazaki Y, Nishifuji K, Ibrahim SM, Watson A, Busch H, Hirose M. A comprehensive analysis of gut and skin microbiota in canine atopic dermatitis in Shiba Inu dogs. MICROBIOME 2023; 11:232. [PMID: 37864204 PMCID: PMC10590023 DOI: 10.1186/s40168-023-01671-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 09/14/2023] [Indexed: 10/22/2023]
Abstract
BACKGROUND Like its human counterpart, canine atopic dermatitis (cAD) is a chronic relapsing condition; thus, most cAD-affected dogs will require lifelong treatment to maintain an acceptable quality of life. A potential intervention is modulation of the composition of gut microbiota, and in fact, probiotic treatment has been proposed and tried in human atopic dermatitis (AD) patients. Since dogs are currently receiving intensive medical care, this will be the same option for dogs, while evidence of gut dysbiosis in cAD is still missing, although skin microbial profiling in cAD has been conducted in several studies. Therefore, we conducted a comprehensive analysis of both gut and skin microbiota in cAD in one specific cAD-predisposed breed, Shiba Inu. Additionally, we evaluated the impact of commonly used medical management on cAD (Janus kinase; JAK inhibitor, oclacitinib) on the gut and skin microbiota. Furthermore, we genotyped the Shiba Inu dogs according to the mitochondrial DNA haplogroup and assessed its association with the composition of the gut microbiota. RESULTS Staphylococcus was the most predominant bacterial genus observed in the skin; Escherichia/Shigella and Clostridium sensu stricto were highly abundant in the gut of cAD-affected dogs. In the gut microbiota, Fusobacteria and Megamonas were highly abundant in healthy dogs but significantly reduced in cAD-affected dogs. The abundance of these bacterial taxa was positively correlated with the effect of the treatment and state of the disease. Oclacitinib treatment on cAD-affected dogs shifted the composition of microbiota towards that in healthy dogs, and the latter brought it much closer to healthy microbiota, particularly in the gut. Additionally, even within the same dog breed, the mtDNA haplogroup varied, and there was an association between the mtDNA haplogroup and microbial composition in the gut and skin. CONCLUSIONS Dysbiosis of both the skin and the gut was observed in cAD in Shiba Inu dogs. Our findings provide a basis for the potential treatment of cAD by manipulating the gut microbiota as well as the skin microbiota. Video Abstract.
Collapse
Affiliation(s)
- Mirja Thomsen
- Lübeck Institute of Experimental Dermatology, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
- Institute of Neurogenetics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
| | - Axel Künstner
- Lübeck Institute of Experimental Dermatology, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
- Institute of Cardiogenetics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
| | - Inken Wohlers
- Lübeck Institute of Experimental Dermatology, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
- Institute of Cardiogenetics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
- Biomolecular Data Science in Pneumology, Research Center Borstel, Parkallee 1-40, 23845, Borstel, Germany
| | - Michael Olbrich
- Lübeck Institute of Experimental Dermatology, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
- Institute of Cardiogenetics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
- Center for Biotechnology, Khalifa University, Abu Dhabi, UAE
| | - Tim Lenfers
- Lübeck Institute of Experimental Dermatology, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
| | - Takafumi Osumi
- Animal Medical Center, Faculty of Agriculture, Tokyo University of Agriculture and Technology, 3-5-8 Saiwai-cho, Fuchu-shi, Tokyo, 183-8509, Japan
| | - Yotaro Shimazaki
- Animal Medical Center, Faculty of Agriculture, Tokyo University of Agriculture and Technology, 3-5-8 Saiwai-cho, Fuchu-shi, Tokyo, 183-8509, Japan
| | - Koji Nishifuji
- Division of Animal Life Science, Graduate School of Agriculture, Tokyo University of Agriculture and Technology, 3-5-8 Saiwai-cho, Fuchu-shi, Tokyo, 183-8509, Japan
| | - Saleh M Ibrahim
- Lübeck Institute of Experimental Dermatology, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
- College of Medical and Health Sciences, Khalifa University of Science and Technology, Shakhbout Bin Sultan Street, Abu Dhabi, UAE
| | - Adrian Watson
- Royal Canin SAS, 650 avenue de la Petite Camargue, 30470, Aimargues, France
| | - Hauke Busch
- Lübeck Institute of Experimental Dermatology, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
- Institute of Cardiogenetics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
| | - Misa Hirose
- Lübeck Institute of Experimental Dermatology, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany.
| |
Collapse
|
7
|
Lecomte M, Tomassi D, Rizzoli R, Tenon M, Berton T, Harney S, Fança-Berthon P. Effect of a Hop Extract Standardized in 8-Prenylnaringenin on Bone Health and Gut Microbiome in Postmenopausal Women with Osteopenia: A One-Year Randomized, Double-Blind, Placebo-Controlled Trial. Nutrients 2023; 15:2688. [PMID: 37375599 DOI: 10.3390/nu15122688] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 05/21/2023] [Accepted: 05/25/2023] [Indexed: 06/29/2023] Open
Abstract
Estrogen deficiency increases the risk of osteoporosis and fracture. The aim of this study was to investigate whether a hop extract standardized in 8-prenylnaringenin (8-PN), a potent phytoestrogen, could improve bone status of osteopenic women and to explore the gut microbiome roles in this effect. In this double-blind, placebo-controlled, randomized trial, 100 postmenopausal, osteopenic women were supplemented with calcium and vitamin D3 (CaD) tablets and either a hop extract (HE) standardized in 8-PN (n = 50) or a placebo (n = 50) for 48 weeks. Bone mineral density (BMD) and bone metabolism were assessed by DXA measurements and plasma bone biomarkers, respectively. Participant's quality of life (SF-36), gut microbiome composition, and short-chain fatty acid (SCFA) levels were also investigated. In addition to the CaD supplements, 48 weeks of HE supplementation increased total body BMD (1.8 ± 0.4% vs. baseline, p < 0.0001; 1.0 ± 0.6% vs. placebo, p = 0.08), with a higher proportion of women experiencing an increase ≥1% compared to placebo (odds ratio: 2.41 ± 1.07, p < 0.05). An increase in the SF-36 physical functioning score was observed with HE versus placebo (p = 0.05). Gut microbiome α-diversity and SCFA levels did not differ between groups. However, a higher abundance of genera Turicibacter and Shigella was observed in the HE group; both genera have been previously identified as associated with total body BMD. These results suggest that an 8-PN standardized hop extract could beneficially impact bone health of postmenopausal women with osteopenia.
Collapse
Affiliation(s)
| | | | - René Rizzoli
- Service of Bone Disease, Geneva University Hospitals and Faculty of Medicine, 1211 Geneva, Switzerland
| | | | | | - Sinead Harney
- Rheumatology Department, Cork University Hospital, T12 DFK4 Cork, Ireland
| | | |
Collapse
|
8
|
Singh S, Keating C, Ijaz UZ, Hassard F. Molecular insights informing factors affecting low temperature anaerobic applications: Diversity, collated core microbiomes and complexity stability relationships in LCFA-fed systems. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 874:162420. [PMID: 36842571 DOI: 10.1016/j.scitotenv.2023.162420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 01/31/2023] [Accepted: 02/19/2023] [Indexed: 06/18/2023]
Abstract
Fats, oil and grease, and their hydrolyzed counterparts-long chain fatty acids (LCFA) make up a large fraction of numerous wastewaters and are challenging to degrade anaerobically, more so, in low temperature anaerobic digestion (LtAD) systems. Herein, we perform a comparative analysis of publicly available Illumina 16S rRNA datasets generated from LCFA-degrading anaerobic microbiomes at low temperatures (10 and 20 °C) to comprehend the factors affecting microbial community dynamics. The various factors considered were the inoculum, substrate and operational characteristics, the reactor operation mode and reactor configuration, and the type of nucleic acid sequenced. We found that LCFA-degrading anaerobic microbiomes were differentiated primarily by inoculum characteristics (inoculum source and morphology) in comparison to the other factors tested. Inoculum characteristics prominently shaped the species richness, species evenness and beta-diversity patterns in the microbiomes even after long term operation of continuous reactors up to 150 days, implying the choice of inoculum needs careful consideration. The generalised additive models represented through beta diversity contour plots revealed that psychrophilic bacteria RBG-13-54-9 from family Anaerolineae, and taxa WCHB1-41 and Williamwhitmania were highly abundant in LCFA-fed microbial niches, suggesting their role in anaerobic treatment of LCFAs at low temperatures of 10-20 °C. Overall, we showed that the following bacterial genera: uncultured Propionibacteriaceae, Longilinea, Christensenellaceae R7 group, Lactivibrio, candidatus Caldatribacterium, Aminicenantales, Syntrophus, Syntrophomonas, Smithella, RBG-13-54-9, WCHB1-41, Trichococcus, Proteiniclasticum, SBR1031, Lutibacter and Lentimicrobium have prominent roles in LtAD of LCFA-rich wastewaters at 10-20 °C. This study provides molecular insights of anaerobic LCFA degradation under low temperatures from collated datasets and will aid in improving LtAD systems for treating LCFA-rich wastewaters.
Collapse
Affiliation(s)
- Suniti Singh
- Cranfield Water Science Institute, Cranfield University, College Way, Bedfordshire MK43 0AL, UK.
| | - Ciara Keating
- School of Biodiversity, One Health and Veterinary Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, UK.
| | - Umer Zeeshan Ijaz
- Infrastructure and Environment Research Division, James Watt School of Engineering, University of Glasgow, UK; Department of Molecular and Clinical Cancer Medicine, University of Liverpool, UK; College of Science and Engineering, NUI Galway, Ireland.
| | - Francis Hassard
- Cranfield Water Science Institute, Cranfield University, College Way, Bedfordshire MK43 0AL, UK; Institute for Nanotechnology and Water Sustainability, University of South Africa, UNISA Science Campus, 1710 Roodepoort, Johannesburg, South Africa.
| |
Collapse
|
9
|
Wang Y, Lê Cao KA. PLSDA-batch: a multivariate framework to correct for batch effects in microbiome data. Brief Bioinform 2023; 24:6991121. [PMID: 36653900 PMCID: PMC10025448 DOI: 10.1093/bib/bbac622] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 12/14/2022] [Accepted: 12/17/2022] [Indexed: 01/20/2023] Open
Abstract
Microbial communities are highly dynamic and sensitive to changes in the environment. Thus, microbiome data are highly susceptible to batch effects, defined as sources of unwanted variation that are not related to and obscure any factors of interest. Existing batch effect correction methods have been primarily developed for gene expression data. As such, they do not consider the inherent characteristics of microbiome data, including zero inflation, overdispersion and correlation between variables. We introduce new multivariate and non-parametric batch effect correction methods based on Partial Least Squares Discriminant Analysis (PLSDA). PLSDA-batch first estimates treatment and batch variation with latent components, then subtracts batch-associated components from the data. The resulting batch-effect-corrected data can then be input in any downstream statistical analysis. Two variants are proposed to handle unbalanced batch x treatment designs and to avoid overfitting when estimating the components via variable selection. We compare our approaches with popular methods managing batch effects, namely, removeBatchEffect, ComBat and Surrogate Variable Analysis, in simulated and three case studies using various visual and numerical assessments. We show that our three methods lead to competitive performance in removing batch variation while preserving treatment variation, especially for unbalanced batch $\times $ treatment designs. Our downstream analyses show selections of biologically relevant taxa. This work demonstrates that batch effect correction methods can improve microbiome research outputs. Reproducible code and vignettes are available on GitHub.
Collapse
Affiliation(s)
- Yiwen Wang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, 97 Buxin Rd, Shenzhen, 518000, Guangdong, China
- Melbourne Integrative Genomics, School of Mathematics and Statistics, The University of Melbourne, 30 Royal Parade, Melbourne, 3052, VIC, Australia
| | - Kim-Anh Lê Cao
- Melbourne Integrative Genomics, School of Mathematics and Statistics, The University of Melbourne, 30 Royal Parade, Melbourne, 3052, VIC, Australia
| |
Collapse
|
10
|
Calle ML, Pujolassos M, Susin A. coda4microbiome: compositional data analysis for microbiome cross-sectional and longitudinal studies. BMC Bioinformatics 2023; 24:82. [PMID: 36879227 PMCID: PMC9990256 DOI: 10.1186/s12859-023-05205-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Accepted: 02/22/2023] [Indexed: 03/08/2023] Open
Abstract
BACKGROUND One of the main challenges of microbiome analysis is its compositional nature that if ignored can lead to spurious results. Addressing the compositional structure of microbiome data is particularly critical in longitudinal studies where abundances measured at different times can correspond to different sub-compositions. RESULTS We developed coda4microbiome, a new R package for analyzing microbiome data within the Compositional Data Analysis (CoDA) framework in both, cross-sectional and longitudinal studies. The aim of coda4microbiome is prediction, more specifically, the method is designed to identify a model (microbial signature) containing the minimum number of features with the maximum predictive power. The algorithm relies on the analysis of log-ratios between pairs of components and variable selection is addressed through penalized regression on the "all-pairs log-ratio model", the model containing all possible pairwise log-ratios. For longitudinal data, the algorithm infers dynamic microbial signatures by performing penalized regression over the summary of the log-ratio trajectories (the area under these trajectories). In both, cross-sectional and longitudinal studies, the inferred microbial signature is expressed as the (weighted) balance between two groups of taxa, those that contribute positively to the microbial signature and those that contribute negatively. The package provides several graphical representations that facilitate the interpretation of the analysis and the identified microbial signatures. We illustrate the new method with data from a Crohn's disease study (cross-sectional data) and on the developing microbiome of infants (longitudinal data). CONCLUSIONS coda4microbiome is a new algorithm for identification of microbial signatures in both, cross-sectional and longitudinal studies. The algorithm is implemented as an R package that is available at CRAN ( https://cran.r-project.org/web/packages/coda4microbiome/ ) and is accompanied with a vignette with a detailed description of the functions. The website of the project contains several tutorials: https://malucalle.github.io/coda4microbiome/.
Collapse
Affiliation(s)
- M Luz Calle
- Biosciences Department, Faculty of Sciences, Technology and Engineering, University of Vic - Central University of Catalonia, Carrer de La Laura, 13, 08500, Vic, Spain.
| | - Meritxell Pujolassos
- Biosciences Department, Faculty of Sciences, Technology and Engineering, University of Vic - Central University of Catalonia, Carrer de La Laura, 13, 08500, Vic, Spain
| | - Antoni Susin
- Mathematical Department, UPC-Barcelona Tech, Barcelona, Spain
| |
Collapse
|
11
|
Small Spatial Scale Drivers of Secondary Metabolite Biosynthetic Diversity in Environmental Microbiomes. mSystems 2023; 8:e0072422. [PMID: 36790187 PMCID: PMC10134846 DOI: 10.1128/msystems.00724-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/16/2023] Open
Abstract
In the search for novel drug candidates, diverse environmental microbiomes have been surveyed for their secondary metabolite biosynthesis potential, yet little is known about the biosynthetic diversity encoded by divergent microbiomes from different ecosystems, and the environmental parameters driving this diversity. Here, we used targeted amplicon sequencing of adenylation (AD) and ketosynthase (KS) domains along with 16S sequencing to delineate the unique biosynthetic potential of microbiomes from three separate habitats (soil, water, and sediments) exhibiting unique small spatial scale physicochemical gradients. The estimated richness of AD domains was highest in marine sediments with 656 ± 58 operational biosynthetic units (OBUs), while the KS domain richness was highest in soil microbiomes with 388 ± 67 OBUs. Microbiomes with rich and diverse bacterial communities displayed the highest PK potential across all ecosystems, and on a small spatial scale, pH and salinity were significantly, positively correlated to KS domain richness in soil and aquatic systems, respectively. Integrating our findings, we were able to predict the KS domain richness with a RMSE of 31 OBUs and a R2 of 0.91, and by the use of publicly available information on bacterial richness and diversity, we identified grassland biomes as being particularly promising sites for the discovery of novel polyketides. Furthermore, a focus on acidobacterial taxa is likely to be fruitful, as these were responsible for most of the variation in biosynthetic diversity. Overall, our results highlight the importance of sampling diverse environments with high taxonomic diversity in the pursuit for novel secondary metabolites. IMPORTANCE To counteract the antibiotic resistance crisis, novel anti-infective agents need to be discovered and brought to market. Microbial secondary metabolites have been important sources of inspiration for small-molecule therapeutics. However, the isolation of novel antibiotics is difficult, and the risk of rediscovery is high. With the overarching purpose of identifying promising microbiomes for discovery of novel bioactivity, we mapped out the most significant drivers of biosynthetic diversity across divergent microbiomes. We found the biosynthetic potential to be unique to individual ecosystems, and to depend on bacterial taxonomic diversity. Within systems, and on small spatial scales, pH and salinity correlated positively to the biosynthetic richness of the microbiomes, Acidobacteria representing the taxa most highly associated with biosynthetic diversity. Ultimately, understanding the key drivers of the biosynthesis potential of environmental microbiomes will allow us to focus bioprospecting efforts and facilitate the discovery of novel therapeutics.
Collapse
|
12
|
Lee S, Jung S, Lourenco J, Pringle D, Ahn J. Resampling-based inferences for compositional regression with application to beef cattle microbiomes. Stat Methods Med Res 2023; 32:151-164. [PMID: 36267026 DOI: 10.1177/09622802221133550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Gut microbiomes are increasingly found to be associated with many health-related characteristics of humans as well as animals. Regression with compositional microbiomes covariates is commonly used to identify important bacterial taxa that are related to various phenotype responses. Often the dimension of microbiome taxa easily exceeds the number of available samples, which creates a serious challenge in the estimation and inference of the model. The sparse log-contrast regression method is useful for such cases as it can yield a model estimate that depends on only a small number of taxa. However, a formal statistical inference procedure for individual regression coefficients has not been properly established yet. We propose a new estimation and inference procedure for linear regression models with extremely low-sample-sized compositional predictors. Under the compositional log-contrast regression framework, the proposed approach consists of two steps. The first step is to screen relevant predictors by fitting a log-contrast model with a sparse penalty. The screened-in variables are used as predictors in the non-sparse log-contrast model in the second step, where each of the regression coefficients is tested using nonparametric, resampling-based methods such as permutation and bootstrap. The performances of the proposed methods are evaluated by a simulation study, which shows they outperform traditional approaches based on normal assumptions or large sample asymptotics. Application to steer microbiome data successfully identifies key bacterial taxa that are related to important cattle quality measures.
Collapse
Affiliation(s)
- Sujin Lee
- Department of Statistics, 26725Seoul National University, Seoul, Republic of Korea
| | - Sungkyu Jung
- Department of Statistics, 26725Seoul National University, Seoul, Republic of Korea
| | - Jeferson Lourenco
- Department of Animal and Dairy Science, 1355University of Georgia, Athens, GA, USA
| | - Dean Pringle
- Department of Animal and Dairy Science, 1355University of Georgia, Athens, GA, USA
| | - Jeongyoun Ahn
- Department of Industrial and Systems Engineering, 34968KAIST, Daejeon, Republic of Korea
| |
Collapse
|
13
|
Acharjee A, Singh U, Choudhury SP, Gkoutos GV. The diagnostic potential and barriers of microbiome based therapeutics. Diagnosis (Berl) 2022; 9:411-420. [PMID: 36000189 DOI: 10.1515/dx-2022-0052] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2022] [Accepted: 08/03/2022] [Indexed: 02/07/2023]
Abstract
High throughput technological innovations in the past decade have accelerated research into the trillions of commensal microbes in the gut. The 'omics' technologies used for microbiome analysis are constantly evolving, and large-scale datasets are being produced. Despite of the fact that much of the research is still in its early stages, specific microbial signatures have been associated with the promotion of cancer, as well as other diseases such as inflammatory bowel disease, neurogenerative diareses etc. It has been also reported that the diversity of the gut microbiome influences the safety and efficacy of medicines. The availability and declining sequencing costs has rendered the employment of RNA-based diagnostics more common in the microbiome field necessitating improved data-analytical techniques so as to fully exploit all the resulting rich biological datasets, while accounting for their unique characteristics, such as their compositional nature as well their heterogeneity and sparsity. As a result, the gut microbiome is increasingly being demonstrating as an important component of personalised medicine since it not only plays a role in inter-individual variability in health and disease, but it also represents a potentially modifiable entity or feature that may be addressed by treatments in a personalised way. In this context, machine learning and artificial intelligence-based methods may be able to unveil new insights into biomedical analyses through the generation of models that may be used to predict category labels, and continuous values. Furthermore, diagnostic aspects will add value in the identification of the non invasive markers in the critical diseases like cancer.
Collapse
Affiliation(s)
- Animesh Acharjee
- Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham, UK.,Institute of Translational Medicine, University of Birmingham, Birmingham, UK.,NIHR Surgical Reconstruction and Microbiology Research Centre, University Hospital Birmingham, Birmingham, UK.,MRC Health Data Research UK (HDR UK), Birmingham, UK
| | - Utpreksha Singh
- Department of Health and Life Sciences, Coventry University, Coventry, UK
| | | | - Georgios V Gkoutos
- Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham, UK.,Institute of Translational Medicine, University of Birmingham, Birmingham, UK.,NIHR Surgical Reconstruction and Microbiology Research Centre, University Hospital Birmingham, Birmingham, UK.,MRC Health Data Research UK (HDR UK), Birmingham, UK.,NIHR Experimental Cancer Medicine Centre, Birmingham, UK
| |
Collapse
|
14
|
Beyond Basic Diversity Estimates-Analytical Tools for Mechanistic Interpretations of Amplicon Sequencing Data. Microorganisms 2022; 10:microorganisms10101961. [PMID: 36296237 PMCID: PMC9609705 DOI: 10.3390/microorganisms10101961] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Revised: 09/29/2022] [Accepted: 09/30/2022] [Indexed: 11/07/2022] Open
Abstract
Understanding microbial ecology through amplifying short read regions, typically 16S rRNA for prokaryotic species or 18S rRNA for eukaryotic species, remains a popular, economical choice. These methods provide relative abundances of key microbial taxa, which, depending on the experimental design, can be used to infer mechanistic ecological underpinnings. In this review, we discuss recent advancements in in situ analytical tools that have the power to elucidate ecological phenomena, unveil the metabolic potential of microbial communities, identify complex multidimensional interactions between species, and compare stability and complexity under different conditions. Additionally, we highlight methods that incorporate various modalities and additional information, which in combination with abundance data, can help us understand how microbial communities respond to change in a typical ecosystem. Whilst the field of microbial informatics continues to progress substantially, our emphasis is on popular methods that are applicable to a broad range of study designs. The application of these methods can increase our mechanistic understanding of the ongoing dynamics of complex microbial communities.
Collapse
|
15
|
Identification of microbial features in multivariate regression under false discovery rate control. Comput Stat Data Anal 2022. [DOI: 10.1016/j.csda.2022.107621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
16
|
Boyraz A, Pawlowsky-Glahn V, Egozcue JJ, Acar AC. Principal microbial groups: compositional alternative to phylogenetic grouping of microbiome data. Brief Bioinform 2022; 23:6675749. [PMID: 36007229 DOI: 10.1093/bib/bbac328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 07/19/2022] [Accepted: 07/20/2022] [Indexed: 11/13/2022] Open
Abstract
Statistical and machine learning techniques based on relative abundances have been used to predict health conditions and to identify microbial biomarkers. However, high dimensionality, sparsity and the compositional nature of microbiome data represent statistical challenges. On the other hand, the taxon grouping allows summarizing microbiome abundance with a coarser resolution in a lower dimension, but it presents new challenges when correlating taxa with a disease. In this work, we present a novel approach that groups Operational Taxonomical Units (OTUs) based only on relative abundances as an alternative to taxon grouping. The proposed procedure acknowledges the compositional data making use of principal balances. The identified groups are called Principal Microbial Groups (PMGs). The procedure reduces the need for user-defined aggregation of $\textrm{OTU}$s and offers the possibility of working with coarse group of $\textrm{OTU}$s, which are not present in a phylogenetic tree. PMGs can be used for two different goals: (1) as a dimensionality reduction method for compositional data, (2) as an aggregation procedure that provides an alternative to taxon grouping for construction of microbial balances afterward used for disease prediction. We illustrate the procedure with a cirrhosis study data. PMGs provide a coherent data analysis for the search of biomarkers in human microbiota. The source code and demo data for PMGs are available at: https://github.com/asliboyraz/PMGs.
Collapse
Affiliation(s)
- Aslı Boyraz
- Department of Computer Programming, Recep Tayyip Erdoğan University, Ardeşen Vocational School, Rize, 53400, Turkey
| | - Vera Pawlowsky-Glahn
- Department of Computer Sciences, Applied Mathematics and Statistics, University of Girona, Campus Montilivi, 17003 Girona, Spain
| | - Juan José Egozcue
- Department of Civil and Environmental Engineering, Universitat Politécnica de Catalunya, Barcelona, 08034, Spain
| | - Aybar Can Acar
- Department of Medical Informatics, Middle East Technical University, Ankara Turkey
| |
Collapse
|
17
|
Coenders G, Greenacre M. Three approaches to supervised learning for compositional data with pairwise logratios. J Appl Stat 2022; 50:3272-3293. [PMID: 37969895 PMCID: PMC10637191 DOI: 10.1080/02664763.2022.2108007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Accepted: 07/25/2022] [Indexed: 10/15/2022]
Abstract
Logratios between pairs of compositional parts (pairwise logratios) are the easiest to interpret in compositional data analysis, and include the well-known additive logratios as particular cases. When the number of parts is large (sometimes even larger than the number of cases), some form of logratio selection is needed. In this article, we present three alternative stepwise supervised learning methods to select the pairwise logratios that best explain a dependent variable in a generalized linear model, each geared for a specific problem. The first method features unrestricted search, where any pairwise logratio can be selected. This method has a complex interpretation if some pairs of parts in the logratios overlap, but it leads to the most accurate predictions. The second method restricts parts to occur only once, which makes the corresponding logratios intuitively interpretable. The third method uses additive logratios, so that K-1 selected logratios involve a K-part subcomposition. Our approach allows logratios or non-compositional covariates to be forced into the models based on theoretical knowledge, and various stopping criteria are available based on information measures or statistical significance with the Bonferroni correction. We present an application on a dataset from a study predicting Crohn's disease.
Collapse
Affiliation(s)
- Germà Coenders
- Department of Economics, Universitat de Girona, Girona, Spain
| | - Michael Greenacre
- Department of Economics and Business and Barcelona School of Management, Universitat Pompeu Fabra, Barcelona, Spain
| |
Collapse
|
18
|
Liang H, Jo JH, Zhang Z, MacGibeny MA, Han J, Proctor DM, Taylor ME, Che Y, Juneau P, Apolo AB, McCulloch JA, Davar D, Zarour HM, Dzutsev AK, Brownell I, Trinchieri G, Gulley JL, Kong HH. Predicting cancer immunotherapy response from gut microbiomes using machine learning models. Oncotarget 2022; 13:876-889. [PMID: 35875611 PMCID: PMC9295706 DOI: 10.18632/oncotarget.28252] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 06/20/2022] [Indexed: 01/04/2023] Open
Abstract
Cancer immunotherapy has significantly improved patient survival. Yet, half of patients do not respond to immunotherapy. Gut microbiomes have been linked to clinical responsiveness of melanoma patients on immunotherapies; however, different taxa have been associated with response status with implicated taxa inconsistent between studies. We used a tumor-agnostic approach to find common gut microbiome features of response among immunotherapy patients with different advanced stage cancers. A combined meta-analysis of 16S rRNA gene sequencing data from our mixed tumor cohort and three published immunotherapy gut microbiome datasets from different melanoma patient cohorts found certain gut bacterial taxa correlated with immunotherapy response status regardless of tumor type. Using multivariate selbal analysis, we identified two separate groups of bacterial genera associated with responders versus non-responders. Statistical models of gut microbiome community features showed robust prediction accuracy of immunotherapy response in amplicon sequencing datasets and in cross-sequencing platform validation with shotgun metagenomic datasets. Results suggest baseline gut microbiome features may be predictive of clinical outcomes in oncology patients on immunotherapies, and some of these features may be generalizable across different tumor types, patient cohorts, and sequencing platforms. Findings demonstrate how machine learning models can reveal microbiome-immunotherapy interactions that may ultimately improve cancer patient outcomes.
Collapse
Affiliation(s)
- Hai Liang
- Dermatology Branch, National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Jay-Hyun Jo
- Dermatology Branch, National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Zhiwei Zhang
- Biostatistics Branch, Division of Cancer Treatment and Diagnostics, National Cancer Institute, NIH, Bethesda, MD 20892, USA
| | - Margaret A. MacGibeny
- Dermatology Branch, National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD 20892, USA
- Department of Medical Education, West Virginia University, Morgantown, WV 26506, USA
| | - Jungmin Han
- Dermatology Branch, National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Diana M. Proctor
- Translational and Functional Genomics Branch, National Human Genome Research Institute, NIH, Bethesda, MD 20892, USA
| | - Monica E. Taylor
- Dermatology Branch, National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - You Che
- Dermatology Branch, National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Paul Juneau
- NIH Library, Division of Library Services, Office of Research Services, NIH, Bethesda, MD 20892, USA
- Zimmerman Associates Inc., Fairfax, VA 22030, USA
| | - Andrea B. Apolo
- Genitourinary Malignancies Branch, Center for Cancer Research, NCI, NIH, Bethesda, MD 20892, USA
| | - John A. McCulloch
- Genetics and Microbiome Core, Laboratory of Integrative Cancer Immunology, Center for Cancer Research, NCI, NIH, Bethesda, MD 20892, USA
| | - Diwakar Davar
- Department of Medicine and UPMC Hillman Cancer Center University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Hassane M. Zarour
- Department of Medicine and UPMC Hillman Cancer Center University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Amiran K. Dzutsev
- Laboratory of Integrative Cancer Immunology, Center for Cancer Research, NCI, NIH, Bethesda, MD 20892, USA
| | - Isaac Brownell
- Dermatology Branch, National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD 20892, USA
- Center for Immuno-Oncology, Center for Cancer Research, NCI, NIH, Bethesda, MD 20892, USA
| | - Giorgio Trinchieri
- Laboratory of Integrative Cancer Immunology, Center for Cancer Research, NCI, NIH, Bethesda, MD 20892, USA
| | - James L. Gulley
- Center for Immuno-Oncology, Center for Cancer Research, NCI, NIH, Bethesda, MD 20892, USA
| | - Heidi H. Kong
- Dermatology Branch, National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD 20892, USA
- Laboratory of Integrative Cancer Immunology, Center for Cancer Research, NCI, NIH, Bethesda, MD 20892, USA
| |
Collapse
|
19
|
Principal Amalgamation Analysis for Microbiome Data. Genes (Basel) 2022; 13:genes13071139. [PMID: 35885922 PMCID: PMC9318429 DOI: 10.3390/genes13071139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 06/14/2022] [Accepted: 06/21/2022] [Indexed: 12/02/2022] Open
Abstract
In recent years microbiome studies have become increasingly prevalent and large-scale. Through high-throughput sequencing technologies and well-established analytical pipelines, relative abundance data of operational taxonomic units and their associated taxonomic structures are routinely produced. Since such data can be extremely sparse and high dimensional, there is often a genuine need for dimension reduction to facilitate data visualization and downstream statistical analysis. We propose Principal Amalgamation Analysis (PAA), a novel amalgamation-based and taxonomy-guided dimension reduction paradigm for microbiome data. Our approach aims to aggregate the compositions into a smaller number of principal compositions, guided by the available taxonomic structure, by minimizing a properly measured loss of information. The choice of the loss function is flexible and can be based on familiar diversity indices for preserving either within-sample or between-sample diversity in the data. To enable scalable computation, we develop a hierarchical PAA algorithm to trace the entire trajectory of successive simple amalgamations. Visualization tools including dendrogram, scree plot, and ordination plot are developed. The effectiveness of PAA is demonstrated using gut microbiome data from a preterm infant study and an HIV infection study.
Collapse
|
20
|
A potential oral microbiome signature associated with coronary artery disease in Tunisia. Biosci Rep 2022; 42:231418. [PMID: 35695679 PMCID: PMC9251586 DOI: 10.1042/bsr20220583] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 05/17/2022] [Accepted: 05/30/2022] [Indexed: 11/17/2022] Open
Abstract
The coronary artery disease is a chronic inflammatory disease involving genetic as well as environmental factors. Recent evidence suggests that the oral microbiome has a significant role in triggering atherosclerosis. The present study assessed the oral microbiome composition variation between coronary patients and healthy subjects in order to identify a potential pathogenic signature associated with coronary artery disease (CAD). We performed metagenomic profiling of salivary microbiomes by 16S rRNA next-generation sequencing. Oral microbiota profiling was performed for 30 individuals including 20 patients with CAD and 10 healthy individuals without carotid plaques or previous stroke or myocardial infarction.We found that oral microbial communities in patients and healthy controls are represented by similar global core oral microbiome. The predominant taxa belonged to Firmicutes (genus Streptococcus, Veillonella, Granulicatella, Selenomonas), Proteobacteria (genus Neisseria, Haemophilus), Actinobacteria (genus Rothia), Bacteroidetes (genus Prevotella, Porphyromonas) and Fusobacteria (genus Fusobacterium, Leptotrichia). More than 60% relative abundance of each sample for both CAD patients and controls is represented by three major genera including Streptococcus (24.97% and 26.33%), Veillonella (21.43% and 19.91%) and Neisseria (14.23% and 15.33%). Using penalized regression analysis, the bacterial genus Eikenella was involved as the major discriminant genus for both status and Syntax score of CAD. We also reported a significant negative correlation between Syntax score and Eikenella abundance in coronary patients' group (Spearman rho =-0.68, p= 0.00094). In conclusion, the abundance of Eikenella in oral coronary patient samples compared to controls could be a prominent pathological indicator for the development of CAD.
Collapse
|
21
|
Hinton AL, Mucha PJ. A Simultaneous Feature Selection and Compositional Association Test for Detecting Sparse Associations in High-Dimensional Metagenomic Data. Front Microbiol 2022; 13:837396. [PMID: 35387076 PMCID: PMC8978828 DOI: 10.3389/fmicb.2022.837396] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Accepted: 02/15/2022] [Indexed: 12/14/2022] Open
Abstract
Numerous metagenomic studies aim to discover associations between the microbial composition of an environment (e.g., gut, skin, oral) and a phenotype of interest. Multivariate analysis is often performed in these studies without critical a priori knowledge of which taxa are associated with the phenotype being studied. This approach typically reduces statistical power in settings where the true associations among only a few taxa are obscured by high dimensionality (i.e., sparse association signals). At the same time, low sample size and compositional sample space constraints may reduce beyond-study generalizability if not properly accounted for. To address these difficulties, we developed the Selection-Energy-Permutation (SelEnergyPerm) method, a nonparametric group association test with embedded feature selection that directly accounts for compositional constraints using parsimonious logratio signatures between taxonomic features, for characterizing and understanding alterations in microbial community structure. Simulation results show SelEnergyPerm selects small independent sets of logratios that capture strong associations in a range of scenarios. Additionally, our simulation results demonstrate SelEnergyPerm consistently detects/rejects associations in synthetic data with sparse, dense, or no association signals. We demonstrate the novel benefits of our method in four case studies utilizing publicly available 16S amplicon and whole-genome sequencing datasets. Our R implementation of Selection-Energy-Permutation, including an example demonstration and the code to generate all of the scenarios used here, is available at https://www.github.com/andrew84830813/selEnergyPermR.
Collapse
Affiliation(s)
- Andrew L Hinton
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, NC, United States.,School of Medicine, University of North Carolina at Chapel Hill Food Allergy Initiative, Chapel Hill, NC, United States
| | - Peter J Mucha
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, NC, United States.,Departments of Mathematics and Applied Physical Sciences, University of North Carolina, Chapel Hill, NC, United States.,Department of Mathematics, Dartmouth College, Hanover, NH, United States
| |
Collapse
|
22
|
Banerjee K, Chen J, Zhan X. Adaptive and powerful microbiome multivariate association analysis via feature selection. NAR Genom Bioinform 2022; 4:lqab120. [PMID: 35047812 PMCID: PMC8759573 DOI: 10.1093/nargab/lqab120] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2021] [Revised: 11/13/2021] [Accepted: 12/24/2021] [Indexed: 02/06/2023] Open
Abstract
The important role of human microbiome is being increasingly recognized in health and disease conditions. Since microbiome data is typically high dimensional, one popular mode of statistical association analysis for microbiome data is to pool individual microbial features into a group, and then conduct group-based multivariate association analysis. A corresponding challenge within this approach is to achieve adequate power to detect an association signal between a group of microbial features and the outcome of interest across a wide range of scenarios. Recognizing some existing methods' susceptibility to the adverse effects of noise accumulation, we introduce the Adaptive Microbiome Association Test (AMAT), a novel and powerful tool for multivariate microbiome association analysis, which unifies both blessings of feature selection in high-dimensional inference and robustness of adaptive statistical association testing. AMAT first alleviates the burden of noise accumulation via distance correlation learning, and then conducts a data-adaptive association test under the flexible generalized linear model framework. Extensive simulation studies and real data applications demonstrate that AMAT is highly robust and often more powerful than several existing methods, while preserving the correct type I error rate. A free implementation of AMAT in R computing environment is available at https://github.com/kzb193/AMAT.
Collapse
Affiliation(s)
| | | | - Xiang Zhan
- To whom correspondence should be addressed. Tel: +86 10 62744132; Fax: +86 10 62744134;
| |
Collapse
|
23
|
Novel application of survival models for predicting microbial community transitions with variable selection for eDNA. Appl Environ Microbiol 2022; 88:e0214621. [PMID: 35138931 DOI: 10.1128/aem.02146-21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Survival analysis is a prolific statistical tool in medicine for inferring risk and time to disease-related events. However, it is under-utilized in microbiome research to predict microbial community mediated events, partly due to the sparsity and high dimensional nature of the data. We advance the application of Cox proportional hazards (Cox PH) survival models to environmental DNA (eDNA) data with feature selection suitable for filtering irrelevant and redundant taxonomic variables. Selection methods are compared in terms of false positives, sensitivity, and survival estimation accuracy in simulation and in a real data setting to forecast harmful cyanobacterial blooms. A novel extension of a method for selecting microbial biomarkers with survival data (SuRFCox) reliably outperforms other methods. We determine Cox PH models with SuRFCox selected predictors are more robust to varied signal, noise, and data correlation structure. SuRFCox also yields the most accurate and consistent prediction of blooms according to cross-validated testing by year over eight different bloom seasons. Identification of common biomarkers among validated survival forecasts over changing conditions has clear biological significance. Survival models with such biomarkers inform risk assessment and provide insight into the causes of critical community transitions. Importance In this paper, we report on a novel approach of selecting microorganisms for model-based prediction of the time to critical microbially-modulated events (e.g., harmful algal blooms, clinical outcomes, community shifts, etc.). Our novel method for identifying biomarkers from large, dynamic communities of microbes has broad utility to environmental and ecological impact risk assessment and public health. Results will also promote theoretical and practical advancements relevant to the biology of specific organisms. To address the unique challenge posed by diverse environmental conditions and sparse microbes, we developed a novel method of selecting predictors for modelling time-to-event data. Competing methods for selecting predictors are rigorously compared to determine which is the most accurate and generalizable. Model forecasts are applied to show suitable predictors can precisely quantify the risk over time of biological events like harmful cyanobacterial blooms.
Collapse
|
24
|
Briscoe L, Balliu B, Sankararaman S, Halperin E, Garud NR. Evaluating supervised and unsupervised background noise correction in human gut microbiome data. PLoS Comput Biol 2022; 18:e1009838. [PMID: 35130266 PMCID: PMC8853548 DOI: 10.1371/journal.pcbi.1009838] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Revised: 02/17/2022] [Accepted: 01/15/2022] [Indexed: 12/13/2022] Open
Abstract
The ability to predict human phenotypes and identify biomarkers of disease from metagenomic data is crucial for the development of therapeutics for microbiome-associated diseases. However, metagenomic data is commonly affected by technical variables unrelated to the phenotype of interest, such as sequencing protocol, which can make it difficult to predict phenotype and find biomarkers of disease. Supervised methods to correct for background noise, originally designed for gene expression and RNA-seq data, are commonly applied to microbiome data but may be limited because they cannot account for unmeasured sources of variation. Unsupervised approaches address this issue, but current methods are limited because they are ill-equipped to deal with the unique aspects of microbiome data, which is compositional, highly skewed, and sparse. We perform a comparative analysis of the ability of different denoising transformations in combination with supervised correction methods as well as an unsupervised principal component correction approach that is presently used in other domains but has not been applied to microbiome data to date. We find that the unsupervised principal component correction approach has comparable ability in reducing false discovery of biomarkers as the supervised approaches, with the added benefit of not needing to know the sources of variation apriori. However, in prediction tasks, it appears to only improve prediction when technical variables contribute to the majority of variance in the data. As new and larger metagenomic datasets become increasingly available, background noise correction will become essential for generating reproducible microbiome analyses. The human gut microbiome is known to play a major role in health and is associated with many diseases including colorectal cancer, obesity, and diabetes. The prediction of host phenotypes and identification of biomarkers of disease is essential for harnessing the therapeutic potential of the microbiome. However, many metagenomic datasets are affected by technical variables that introduce unwanted variation that can confound the ability to predict phenotypes and identify biomarkers. Currently, supervised methods originally designed for gene expression and RNA-seq data are commonly applied to microbiome data for correction of background noise, but they are limited in that they cannot correct for unmeasured sources of variation. Unsupervised approaches address this issue, but current methods are limited because they are ill-equipped to deal with the unique aspects of microbiome data, which is compositional, highly skewed, and sparse. We perform a comparative analysis of the ability of different denoising transformations in combination with supervised correction methods as well as an unsupervised principal component correction approach and find that all correction approaches reduce false positives for biomarker discovery. In the task of predicting phenotypes, different approaches have varying success where the unsupervised correction can improve prediction when technical variables contribute to the majority of variance in the data. As new and larger metagenomic datasets become increasingly available, background noise correction will become essential for generating reproducible microbiome analyses.
Collapse
Affiliation(s)
- Leah Briscoe
- Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, California, United States of America
- * E-mail: (LB); (EH); (NRG)
| | - Brunilda Balliu
- Department of Computational Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America
| | - Sriram Sankararaman
- Department of Computer Science, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Computational Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America
| | - Eran Halperin
- Department of Computer Science, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Computational Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Anesthesiology and Perioperative Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America
- Institute of Precision Health, University of California Los Angeles, Los Angeles, California, United States of America
- * E-mail: (LB); (EH); (NRG)
| | - Nandita R. Garud
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, California, United States of America
- * E-mail: (LB); (EH); (NRG)
| |
Collapse
|
25
|
Changes in Serum N-Glycome for Risk Drinkers: A Comparison with Standard Markers for Alcohol Abuse in Men and Women. Biomolecules 2022; 12:biom12020241. [PMID: 35204742 PMCID: PMC8961540 DOI: 10.3390/biom12020241] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Revised: 01/26/2022] [Accepted: 01/27/2022] [Indexed: 01/11/2023] Open
Abstract
Background and aim: Glycomic alterations serve as biomarker tools for different diseases. The present study aims to evaluate the diagnostic capability of serum N-glycosylation to identify alcohol risk drinking in comparison with standard markers. Methods: We included 1516 adult individuals (age range 18–91 years; 55.3% women), randomly selected from a general population. A total of 143 (21.0%) men and 50 (5.9%) women were classified as risk drinkers after quantification of daily alcohol consumption and the Alcohol Use Disorders Identification Test (AUDIT). Hydrophilic interaction ultra-performance liquid chromatography (HILIC-UPLC) was used for the quantification of 46 serum N-glycan peaks. Serum gamma-glutamyltransferase (GGT), carbohydrate-deficient transferrin (CDT), and red blood cell mean corpuscular volume (MCV) were measured by standard clinical laboratory methods. Results: Variations in serum N-glycome associated risk drinking were more prominent in men compared to women. A unique combination of N-glycan peaks selected by the selbal algorithm shows good discrimination between risk-drinkers and non-risk drinkers for men and women. Receiver operating characteristics (ROC) curves show accuracy for the diagnosis of risk drinking, which is comparable to that of the golden standards, GGT, MCV and CDT markers for men and women. Additionally, the inclusion of N-glycan peaks improves the diagnostic accuracy of the standard markers, although it remains relatively low, due to low sensitivity. For men, the area under the ROC curve using N-glycome data is 0.75, 0.76, and 0.77 when combined with GGT, MCV, and CDT, respectively. In women, the areas were 0.76, 0.73, and 0.73, respectively. Conclusion: Risk drinking is associated with significant variations in the serum N-glycome, which highlights its potential diagnostic utility.
Collapse
|
26
|
Alenazi A. A review of compositional data analysis and recent advances. COMMUN STAT-THEOR M 2021. [DOI: 10.1080/03610926.2021.2014890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Abdulaziz Alenazi
- Department of Mathematics, College of Science, Northern Border University, Arar, Saudi Arabia
| |
Collapse
|
27
|
Microbiome Analysis of Mucosal Ileoanal Pouch in Ulcerative Colitis Patients Revealed Impairment of the Pouches Immunometabolites. Cells 2021; 10:cells10113243. [PMID: 34831464 PMCID: PMC8624401 DOI: 10.3390/cells10113243] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 11/12/2021] [Accepted: 11/16/2021] [Indexed: 12/30/2022] Open
Abstract
The pathogenesis of ulcerative colitis (UC) is unknown, although genetic loci and altered gut microbiota have been implicated. Up to a third of patients with moderate to severe UC require proctocolectomy with ileal pouch ano-anastomosis (IPAA). We aimed to explore the mucosal microbiota of UC patients who underwent IPAA. METHODS For microbiome analysis, mucosal specimens were collected from 34 IPAA individuals. Endoscopic and histological examinations of IPAA were normal in 21 cases, while pouchitis was in 13 patients. 19 specimens from the healthy control (10 from colonic and 9 from ileum) were also analyzed. Data were analyzed using an ensemble of software packages: QIIME2, coda-lasso, clr-lasso, PICRUSt2, and ALDEx2. RESULTS IPAA specimens had significantly lower bacterial diversity as compared to normal. The microbial composition of the normal pouch was also decreased also when compared to pouchitis. Faecalibacterium prausnitzii, Gemmiger formicilis, Blautia obeum, Ruminococcus torques, Dorea formicigenerans, and an unknown species from Roseburia were the most uncommon in pouch/pouchitis, while an unknown species from Enterobacteriaceae was over-represented. Propionibacterium acnes and Enterobacteriaceae were the species most abundant in the pouchitis and in the normal pouch, respectively. Predicted metabolic pathways among the IPAA bacterial communities revealed an important role of immunometabolites such as SCFA, butyrate, and amino acids. CONCLUSIONS Our findings showed specific bacterial signature hallmarks of dysbiosis and could represent bacterial biomarkers in IPAA patients useful to develop novel treatments in the future by modulating the gut microbiota through the administration of probiotic immunometabolites-producing bacterial strains and the addition of specific prebiotics and the faecal microbiota transplantation.
Collapse
|
28
|
Ishiya K, Aburatani S. Multivariate statistical monitoring system for microbial population dynamics. Phys Biol 2021; 19. [PMID: 34788744 DOI: 10.1088/1478-3975/ac3ad6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 11/17/2021] [Indexed: 11/12/2022]
Abstract
Microbiomes in their natural environments vary dynamically with changing environmental conditions. The detection of these dynamic changes in microbial populations is critical for understanding the impact of environmental changes on the microbial community. Here, we propose a novel method to detect time-series changes in the microbiome, based on multivariate statistical process control. By focusing on the interspecies structures, this approach enables the robust detection of time-series changes in a microbiome composed of a large number of microbial species. Applying this approach to empirical human gut microbiome data, we accurately traced time-series changes in microbiota composition induced by a dietary intervention trial. This method was also excellent for tracking the recovery process after the intervention. Our approach can be useful for monitoring dynamic changes in complex microbial communities.
Collapse
Affiliation(s)
- Koji Ishiya
- Bioproduction Research Institute, National Institute of Advance Industrial Science and Technology, 2-17-2-1 Tsukisamu-Higashi, Toyohira-ku, Sapporo, Hokkaido, 062-8517, JAPAN
| | - Sachiyo Aburatani
- Computational Bio Big-Data Open Innovation Laboratory, National Institute of Advanced Industrial Science and Technology, 2-4-7 Aomi, Koto-ku,, Tokyo, Tokyo, 135-0064, JAPAN
| |
Collapse
|
29
|
Ramakodi MP. A comprehensive evaluation of single-end sequencing data analyses for environmental microbiome research. Arch Microbiol 2021; 203:6295-6302. [PMID: 34654941 DOI: 10.1007/s00203-021-02597-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 09/17/2021] [Accepted: 09/28/2021] [Indexed: 01/04/2023]
Abstract
Illumina sequencing platforms have been widely used for amplicon-based environmental microbiome research. Analyses of amplicon data of environmental samples, generated from Illumina MiSeq platform illustrate the reverse (R2) reads in the PE datasets to have low quality towards the 3' end of the reads which affect the sequencing depth of samples and ultimately impact the sample size which may possibly lead to an altered outcome. This study evaluates the usefulness of single-end (SE) sequencing data in microbiome research when the Illumina MiSeq PE dataset shows significantly high number of low-quality reverse reads. In this study, the amplicon data (V1V3, V3V4, V4V5 and V6V8) from 128 environmental (soil) samples, downloaded from SRA, demonstrate the efficiency of single-end (SE) sequencing data analyses in microbiome research. The SE datasets were found to infer the core microbiome structure as comparable to the PE dataset. Conspicuously, the forward (R1) datasets inferred a higher number of taxa as compared to PE datasets for most of the amplicon regions, except V3V4. Thus, analyses of SE sequencing data, especially R1 reads, in environmental microbiome studies could ameliorate the problems arising on sample size of the study due to low quality reverse reads in the dataset. However, care must be taken while interpreting the microbiome structure as few taxa observed in the PE datasets were absent in the SE datasets. In conclusion, this study demonstrates the availability of choices in analyzing the amplicon data without having the need to remove samples with low quality reverse reads.
Collapse
Affiliation(s)
- Meganathan P Ramakodi
- CSIR-National Environmental Engineering Research Institute (NEERI), Hyderabad Zonal Centre, IICT Campus, Tarnaka, Hyderabad, Telangana, 500007, India.
| |
Collapse
|
30
|
Pineda S, López de Maturana E, Yu K, Ravoor A, Wood I, Malats N, Sirota M. Tumor-Infiltrating B- and T-Cell Repertoire in Pancreatic Cancer Associated With Host and Tumor Features. Front Immunol 2021; 12:730746. [PMID: 34630409 PMCID: PMC8495220 DOI: 10.3389/fimmu.2021.730746] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Accepted: 09/02/2021] [Indexed: 12/28/2022] Open
Abstract
Background Infiltrating B and T cells have been observed in several tumor tissues, including pancreatic ductal adenocarcinoma (PDAC). The majority known PDAC risk factors point to a chronic inflammatory process leading to different forms of immunological infiltration. Understanding pancreatic tumor infiltration may lead to improved knowledge of this devastating disease. Methods We extracted the immunoglobulins (IGs) and T cell receptors (TCRs) from RNA-sequencing of 144 PDAC from TCGA and 180 pancreatic normal tissue from GTEx. We used Shannon entropy to find differences in IG/TCR diversity. We performed a clonotype analysis considering the IG clone definition (same V and J segments, same CDR3 length, and 90% nucleotide identity between CDR3s) to study differences among the tumor samples. Finally, we performed an association analysis to find host and tumor factors associated with the IG/TCR. Results PDAC presented a richer and more diverse IG and TCR infiltration than normal pancreatic tissue. A higher IG infiltration was present in heavy smokers and females and it was associated with better overall survival. In addition, specific IG clonotypes classified samples with better prognosis explaining 24% of the prognosis phenotypic variance. On the other hand, a larger TCR infiltration was present in patients with previous history of diabetes and was associated with lower nonantigen load. Conclusions Our findings support PDAC subtyping according to its immune repertoire landscape with a potential impact on the understanding of the inflammatory basis of PDAC risk factors as well as the design of treatment options and prognosis monitoring.
Collapse
Affiliation(s)
- Silvia Pineda
- Genetic and Molecular Epidemiology Group, Spanish National Cancer Research Centre (CNIO), and Centro de Investigación Biomédica en Red Cáncer (CIBERONC), Madrid, Spain.,Bakar Computational Health Sciences Institute, University of San Francisco, California (UCSF), San Francisco, CA, United States
| | - Evangelina López de Maturana
- Genetic and Molecular Epidemiology Group, Spanish National Cancer Research Centre (CNIO), and Centro de Investigación Biomédica en Red Cáncer (CIBERONC), Madrid, Spain
| | - Katharine Yu
- Bakar Computational Health Sciences Institute, University of San Francisco, California (UCSF), San Francisco, CA, United States.,Department of Pediatrics, University of San Francisco, California (UCSF), San Francisco, CA, United States
| | - Akshay Ravoor
- Bakar Computational Health Sciences Institute, University of San Francisco, California (UCSF), San Francisco, CA, United States
| | - Inés Wood
- Genetic and Molecular Epidemiology Group, Spanish National Cancer Research Centre (CNIO), and Centro de Investigación Biomédica en Red Cáncer (CIBERONC), Madrid, Spain
| | - Núria Malats
- Genetic and Molecular Epidemiology Group, Spanish National Cancer Research Centre (CNIO), and Centro de Investigación Biomédica en Red Cáncer (CIBERONC), Madrid, Spain
| | - Marina Sirota
- Bakar Computational Health Sciences Institute, University of San Francisco, California (UCSF), San Francisco, CA, United States.,Department of Pediatrics, University of San Francisco, California (UCSF), San Francisco, CA, United States
| |
Collapse
|
31
|
Gordon-Rodriguez E, Quinn TP, Cunningham JP. Learning sparse log-ratios for high-throughput sequencing data. Bioinformatics 2021; 38:157-163. [PMID: 34498030 PMCID: PMC8696089 DOI: 10.1093/bioinformatics/btab645] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2021] [Revised: 08/09/2021] [Accepted: 09/03/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION The automatic discovery of sparse biomarkers that are associated with an outcome of interest is a central goal of bioinformatics. In the context of high-throughput sequencing (HTS) data, and compositional data (CoDa) more generally, an important class of biomarkers are the log-ratios between the input variables. However, identifying predictive log-ratio biomarkers from HTS data is a combinatorial optimization problem, which is computationally challenging. Existing methods are slow to run and scale poorly with the dimension of the input, which has limited their application to low- and moderate-dimensional metagenomic datasets. RESULTS Building on recent advances from the field of deep learning, we present CoDaCoRe, a novel learning algorithm that identifies sparse, interpretable and predictive log-ratio biomarkers. Our algorithm exploits a continuous relaxation to approximate the underlying combinatorial optimization problem. This relaxation can then be optimized efficiently using the modern ML toolbox, in particular, gradient descent. As a result, CoDaCoRe runs several orders of magnitude faster than competing methods, all while achieving state-of-the-art performance in terms of predictive accuracy and sparsity. We verify the outperformance of CoDaCoRe across a wide range of microbiome, metabolite and microRNA benchmark datasets, as well as a particularly high-dimensional dataset that is outright computationally intractable for existing sparse log-ratio selection methods. AVAILABILITY AND IMPLEMENTATION The CoDaCoRe package is available at https://github.com/egr95/R-codacore. Code and instructions for reproducing our results are available at https://github.com/cunningham-lab/codacore. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Thomas P Quinn
- Applied Artificial Intelligence Institute, Deakin University, Geelong, VIC 3126, Australia
| | - John P Cunningham
- Department of Statistics, Columbia University, New York, NY 10025, USA
| |
Collapse
|
32
|
Pérez-Enciso M, Zingaretti LM, Ramayo-Caldas Y, de Los Campos G. Opportunities and limits of combining microbiome and genome data for complex trait prediction. Genet Sel Evol 2021; 53:65. [PMID: 34362312 PMCID: PMC8344190 DOI: 10.1186/s12711-021-00658-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Accepted: 07/20/2021] [Indexed: 12/12/2022] Open
Abstract
Background Analysis and prediction of complex traits using microbiome data combined with host genomic information is a topic of utmost interest. However, numerous questions remain to be answered: how useful can the microbiome be for complex trait prediction? Are estimates of microbiability reliable? Can the underlying biological links between the host’s genome, microbiome, and phenome be recovered? Methods Here, we address these issues by (i) developing a novel simulation strategy that uses real microbiome and genotype data as inputs, and (ii) using variance-component approaches (Bayesian Reproducing Kernel Hilbert Space (RKHS) and Bayesian variable selection methods (Bayes C)) to quantify the proportion of phenotypic variance explained by the genome and the microbiome. The proposed simulation approach can mimic genetic links between the microbiome and genotype data by a permutation procedure that retains the distributional properties of the data. Results Using real genotype and rumen microbiota abundances from dairy cattle, simulation results suggest that microbiome data can significantly improve the accuracy of phenotype predictions, regardless of whether some microbiota abundances are under direct genetic control by the host or not. This improvement depends logically on the microbiome being stable over time. Overall, random-effects linear methods appear robust for variance components estimation, in spite of the typically highly leptokurtic distribution of microbiota abundances. The predictive performance of Bayes C was higher but more sensitive to the number of causative effects than RKHS. Accuracy with Bayes C depended, in part, on the number of microorganisms’ taxa that influence the phenotype. Conclusions While we conclude that, overall, genome-microbiome-links can be characterized using variance component estimates, we are less optimistic about the possibility of identifying the causative host genetic effects that affect microbiota abundances, which would require much larger sample sizes than are typically available for genome-microbiome-phenome studies. The R code to replicate the analyses is in https://github.com/miguelperezenciso/simubiome. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-021-00658-7.
Collapse
Affiliation(s)
- Miguel Pérez-Enciso
- ICREA, Passeig de Lluís Companys 23, 08010, Barcelona, Spain. .,Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, 08193, Bellaterra, Barcelona, Spain. .,Dept. of Epidemiology & Biostatistics, and Dept. of Statistics & Probability, Michigan State University, East Lansing, MI, 48824, USA.
| | - Laura M Zingaretti
- Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, 08193, Bellaterra, Barcelona, Spain.,Dept. of Epidemiology & Biostatistics, and Dept. of Statistics & Probability, Michigan State University, East Lansing, MI, 48824, USA
| | - Yuliaxis Ramayo-Caldas
- Animal Breeding and Genetics Program, Institute for Research and Technology in Food and Agriculture (IRTA), Torre Marimon, 08140, Caldes de Montbui, Barcelona, Spain
| | - Gustavo de Los Campos
- Dept. of Epidemiology & Biostatistics, and Dept. of Statistics & Probability, Michigan State University, East Lansing, MI, 48824, USA
| |
Collapse
|
33
|
Adler CJ, Cao KAL, Hughes T, Kumar P, Austin C. How does the early life environment influence the oral microbiome and determine oral health outcomes in childhood? Bioessays 2021; 43:e2000314. [PMID: 34151446 DOI: 10.1002/bies.202000314] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 05/24/2021] [Accepted: 05/25/2021] [Indexed: 11/11/2022]
Abstract
The first 1000 days of life, from conception to 2 years, are a critical window for the influence of environmental exposures on the assembly of the oral microbiome, which is the precursor to dental caries (decay), one of the most prevalent microbially induced disorders worldwide. While it is known that the human microbiome is susceptible to environmental exposures, there is limited understanding of the impact of prenatal and early childhood exposures on the oral microbiome trajectory and oral health. A barrier has been the lack of technology to directly measure the foetal "exposome", which includes nutritional and toxic exposures crossing the placenta. Another barrier has been the lack of statistical methods to account for the high dimensional data generated by-omic assays. Through identifying which early life exposures influence the oral microbiome and modify oral health, these findings can be translated into interventions to reduce dental decay prevalence.
Collapse
Affiliation(s)
- Christina Jane Adler
- School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, Sydney, New South Wales, Australia.,Charles Perkins Centre, The University of Sydney, Sydney, New South Wales, Australia
| | - Kim-Anh Lê Cao
- Melbourne Integrative Genomics, School of Mathematics and Statistics, The University of Melbourne, Parkville, Victoria, Australia
| | - Toby Hughes
- Adelaide Dental School, The University of Adelaide, Adelaide, South Australia, Australia
| | - Piyush Kumar
- Department of Environmental Medicine and Public Health, Mount Sinai School of Medicine, New York, New York, USA
| | - Christine Austin
- Department of Environmental Medicine and Public Health, Mount Sinai School of Medicine, New York, New York, USA
| |
Collapse
|
34
|
Ketchum RN, Smith EG, Vaughan GO, McParland D, Al-Mansoori N, Burt JA, Reitzel AM. Unraveling the predictive role of temperature in the gut microbiota of the sea urchin Echinometra sp. EZ across spatial and temporal gradients. Mol Ecol 2021; 30:3869-3881. [PMID: 34008895 DOI: 10.1111/mec.15990] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 05/06/2021] [Accepted: 05/11/2021] [Indexed: 01/03/2023]
Abstract
Shifts in microbial communities represent a rapid response mechanism for host organisms to respond to changes in environmental conditions. Therefore, they are likely to be important in assisting the acclimatization of hosts to seasonal temperature changes as well as to variation in temperatures across a species' range. The Persian/Arabian Gulf is the world's warmest sea, with large seasonal fluctuations in temperature (20℃ - 37℃) and is connected to the Gulf of Oman which experiences more typical oceanic conditions (<32℃ in the summer). This system is an informative model for understanding how symbiotic microbial assemblages respond to thermal variation across temporal and spatial scales. Here, we elucidate the role of temperature on the microbial gut community of the sea urchin Echinometra sp. EZ and identify microbial taxa that are tightly correlated with the thermal environment. We generated two independent datasets with a high degree of geographic and temporal resolution. The results show that microbial communities vary across thermally variable habitats, display temporal shifts that correlate with temperature, and can become more disperse as temperatures rise. The relative abundances of several ASVs significantly correlate with temperature in both independent datasets despite the >300 km distance between the furthest sites and the extreme seasonal variations. Notably, over 50% of the temperature predictive ASVs identified from the two datasets belonged to the family Vibrionaceae. Together, our results identify temperature as a robust predictor of community-level variation and highlight specific microbial taxa putatively involved in the response to thermal environment.
Collapse
Affiliation(s)
- Remi N Ketchum
- Department of Biological Sciences, University of North Carolina at Charlotte, Charlotte, NC, USA
| | - Edward G Smith
- Department of Biological Sciences, University of North Carolina at Charlotte, Charlotte, NC, USA.,Water Research Center & Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, UAE
| | - Grace O Vaughan
- Water Research Center & Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, UAE
| | - Dain McParland
- Water Research Center & Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, UAE
| | - Noura Al-Mansoori
- Water Research Center & Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, UAE
| | - John A Burt
- Water Research Center & Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, UAE
| | - Adam M Reitzel
- Department of Biological Sciences, University of North Carolina at Charlotte, Charlotte, NC, USA
| |
Collapse
|
35
|
Yang F, Zou Q. DisBalance: a platform to automatically build balance-based disease prediction models and discover microbial biomarkers from microbiome data. Brief Bioinform 2021; 22:6217721. [PMID: 33834198 DOI: 10.1093/bib/bbab094] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 02/22/2021] [Accepted: 03/03/2021] [Indexed: 12/23/2022] Open
Abstract
How best to utilize the microbial taxonomic abundances in regard to the prediction and explanation of human diseases remains appealing and challenging, and the relative nature of microbiome data necessitates a proper feature selection method to resolve the compositional problem. In this study, we developed an all-in-one platform to address a series of issues in microbiome-based human disease prediction and taxonomic biomarkers discovery. We prioritize the interpretation, runtime and classification accuracy of the distal discriminative balances analysis (DBA-distal) method in selecting a set of distal discriminative balances, and develop DisBalance, a comprehensive platform, to integrate and streamline the workflows of disease model building, disease risk prediction and disease-related biomarker discovery for microbiome-based binary classifications. DisBalance allows the de novo model-building and disease risk prediction in a very fast and convenient way. To facilitate the model-driven and knowledge-driven discoveries, DisBalance dedicates multiple strategies for the mining of microbial biomarkers. The independent validation of the models constructed by the DisBalance pipeline is performed on seven microbiome datasets from the original article of DBA-distal. The implementation of the DisBalance platform is demonstrated by a complete analysis of a shotgun metagenomic dataset of Ulcerative Colitis (UC). As a free and open-source, DisBlance can be accessed at http://lab.malab.cn/soft/DisBalance. The source code and demo data for Disbalance are available at https://github.com/yangfenglong/DisBalance.
Collapse
Affiliation(s)
- Fenglong Yang
- University of Electronic Science and Technology of China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China
| |
Collapse
|
36
|
Erb I, Gloor GB, Quinn TP. Editorial: Compositional data analysis and related methods applied to genomics-a first special issue from NAR Genomics and Bioinformatics. NAR Genom Bioinform 2020; 2:lqaa103. [PMID: 33575646 PMCID: PMC7724639 DOI: 10.1093/nargab/lqaa103] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Affiliation(s)
- Ionas Erb
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, 08003 Barcelona, Spain
| | - Gregory B Gloor
- Department of Biochemistry, Schulich School of Medicine & Dentistry, Western University, London, ON N6A 5C1, Canada
| | - Thomas P Quinn
- Applied Artificial Intelligence Institute, Deakin University, Waurn Ponds, VIC 3216, Australia
| |
Collapse
|