1
|
Abegaz F, Chaichoompu K, Génin E, Fardo DW, König IR, Mahachie John JM, Van Steen K. Principals about principal components in statistical genetics. Brief Bioinform 2020; 20:2200-2216. [PMID: 30219892 DOI: 10.1093/bib/bby081] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2018] [Revised: 07/21/2018] [Accepted: 08/12/2018] [Indexed: 12/13/2022] Open
Abstract
Principal components (PCs) are widely used in statistics and refer to a relatively small number of uncorrelated variables derived from an initial pool of variables, while explaining as much of the total variance as possible. Also in statistical genetics, principal component analysis (PCA) is a popular technique. To achieve optimal results, a thorough understanding about the different implementations of PCA is required and their impact on study results, compared to alternative approaches. In this review, we focus on the possibilities, limitations and role of PCs in ancestry prediction, genome-wide association studies, rare variants analyses, imputation strategies, meta-analysis and epistasis detection. We also describe several variations of classic PCA that deserve increased attention in statistical genetics applications.
Collapse
|
2
|
Ryan FJ, Ahern AM, Fitzgerald RS, Laserna-Mendieta EJ, Power EM, Clooney AG, O'Donoghue KW, McMurdie PJ, Iwai S, Crits-Christoph A, Sheehan D, Moran C, Flemer B, Zomer AL, Fanning A, O'Callaghan J, Walton J, Temko A, Stack W, Jackson L, Joyce SA, Melgar S, DeSantis TZ, Bell JT, Shanahan F, Claesson MJ. Colonic microbiota is associated with inflammation and host epigenomic alterations in inflammatory bowel disease. Nat Commun. 2020;11:1512. [PMID: 32251296 PMCID: PMC7089947 DOI: 10.1038/s41467-020-15342-5] [Citation(s) in RCA: 138] [Impact Index Per Article: 34.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Accepted: 02/27/2020] [Indexed: 12/13/2022] Open
Abstract
Studies of inflammatory bowel disease (IBD) have been inconclusive in relating microbiota with distribution of inflammation. We report microbiota, host transcriptomics, epigenomics and genetics from matched inflamed and non-inflamed colonic mucosa [50 Crohn's disease (CD); 80 ulcerative colitis (UC); 31 controls]. Changes in community-wide and within-patient microbiota are linked with inflammation, but we find no evidence for a distinct microbial diagnostic signature, probably due to heterogeneous host-microbe interactions, and show only marginal microbiota associations with habitual diet. Epithelial DNA methylation improves disease classification and is associated with both inflammation and microbiota composition. Microbiota sub-groups are driven by dominant Enterbacteriaceae and Bacteroides species, representative strains of which are pro-inflammatory in vitro, are also associated with immune-related epigenetic markers. In conclusion, inflamed and non-inflamed colonic segments in both CD and UC differ in microbiota composition and epigenetic profiles.
Collapse
|
3
|
Duroux D, Climente-gonzález H, Wienbrandt L, Van Steen K. Network Aggregation to Enhance Results Derived from Multiple Analytics. IFIP Advances in Information and Communication Technology 2020. [DOI: 10.1007/978-3-030-49161-1_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The more complex data are, the higher the number of possibilities to extract partial information from those data. These possibilities arise by adopting different analytic approaches. The heterogeneity among these approaches and in particular the heterogeneity in results they produce are challenging for follow-up studies, including replication, validation and translational studies. Furthermore, they complicate the interpretation of findings with wide-spread relevance. Here, we take the example of statistical epistasis networks derived from genome-wide association studies with single nucleotide polymorphisms as nodes. Even though we are only dealing with a single data type, the epistasis detection problem suffers from many pitfalls, such as the wide variety of analytic tools to detect them, each highlighting different aspects of epistasis and exhibiting different properties in maintaining false positive control. To reconcile different network views to the same problem, we considered 3 network aggregation methods and discussed their performance in the context of epistasis network aggregation. We furthermore applied a latent class method as best performer to real-life data on inflammatory bowel disease (IBD) and highlighted its benefits to increase our understanding about IBD underlying genetic architectures.
Collapse
|
4
|
Zhang Z, Huang K, Gu C, Zhao L, Wang N, Wang X, Zhao D, Zhang C, Lu Y, Meng Y. Molecular Subtyping of Serous Ovarian Cancer Based on Multi-omics Data. Sci Rep 2016; 6:26001. [PMID: 27184229 PMCID: PMC4868982 DOI: 10.1038/srep26001] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2016] [Accepted: 04/25/2016] [Indexed: 01/22/2023] Open
Abstract
Classification of ovarian cancer by morphologic features has a limited effect on serous ovarian cancer (SOC) treatment and prognosis. Here, we proposed a new system for SOC subtyping based on the molecular categories from the Cancer Genome Atlas project. We analyzed the DNA methylation, protein, microRNA, and gene expression of 1203 samples from 599 serous ovarian cancer patients. These samples were divided into nine subtypes based on RNA-seq data, and each subtype was found to be associated with the activation and/or suppression of the following four biological processes: immunoactivity, hormone metabolic, mesenchymal development and the MAPK signaling pathway. We also identified four DNA methylation, two protein expression, six microRNA sequencing and four pathway subtypes. By integrating the subtyping results across different omics platforms, we found that most RNA-seq subtypes overlapped with one or two subtypes from other omics data. Our study sheds light on the molecular mechanisms of SOC and provides a new perspective for the more accurate stratification of its subtypes.
Collapse
Affiliation(s)
- Zhe Zhang
- Department of Gynecologic Oncology, Chinese PLA General Hospital, Beijing 100853, China
| | - Ke Huang
- Department of Gynecologic Oncology, Chinese PLA General Hospital, Beijing 100853, China
| | - Chenglei Gu
- Department of Gynecologic Oncology, Chinese PLA General Hospital, Beijing 100853, China
| | - Luyang Zhao
- Department of Gynecologic Oncology, Chinese PLA General Hospital, Beijing 100853, China
| | - Nan Wang
- Department of Gynecologic Oncology, Chinese PLA General Hospital, Beijing 100853, China
| | - Xiaolei Wang
- Beijing Institute of Health Service and Medical Information, Beijing 100850, China
| | - Dongsheng Zhao
- Beijing Institute of Health Service and Medical Information, Beijing 100850, China
| | - Chenggang Zhang
- Beijing Institute of Radiation Medicine, State Key Laboratory of Proteomics, Cognitive and Mental Health Research Center, Beijing 100850, China
| | - Yiming Lu
- Beijing Institute of Radiation Medicine, State Key Laboratory of Proteomics, Cognitive and Mental Health Research Center, Beijing 100850, China
| | - Yuanguang Meng
- Department of Gynecologic Oncology, Chinese PLA General Hospital, Beijing 100853, China
| |
Collapse
|
5
|
Abstract
Disease risk and treatment response are determined, at the individual level, by a complex history of genetic and environmental interactions, including those with our endogenous microbiomes. Personalized health care requires a deep understanding of patient biology that can now be measured using a range of '-omics' technologies. Patient stratification involves the identification of genetic and/or phenotypic disease subclasses that require different therapeutic strategies. Stratified medicine approaches to disease diagnosis, prognosis and therapeutic response monitoring herald a new dimension in patient care. Here, we explore the potential value of metabolic profiling as applied to unmet clinical needs in gastroenterology and hepatology. We describe potential applications in a number of diseases, with emphasis on large-scale population studies as well as metabolic profiling on the individual level, using spectrometric and imaging technologies that will leverage the discovery of mechanistic information and deliver novel health care solutions to improve clinical pathway management.
Collapse
Affiliation(s)
- Elaine Holmes
- Division of Computational and Systems Medicine, Department of Surgery and Cancer, Faculty of Medicine, Imperial College London, London SW7 2AZ, UK
| | - Anisha Wijeyesekera
- Division of Computational and Systems Medicine, Department of Surgery and Cancer, Faculty of Medicine, Imperial College London, London SW7 2AZ, UK
| | | | - Jeremy K Nicholson
- MRC-NIHR National Phenome Centre, Department of Surgery and Cancer, Faculty of Medicine, Imperial College London, London SW7 2AZ, UK
| |
Collapse
|