1
|
Osborne AJ, Bierzynska A, Colby E, Andag U, Kalra PA, Radresa O, Skroblin P, Taal MW, Welsh GI, Saleem MA, Campbell C. Multivariate canonical correlation analysis identifies additional genetic variants for chronic kidney disease. NPJ Syst Biol Appl 2024; 10:28. [PMID: 38459044 PMCID: PMC10924093 DOI: 10.1038/s41540-024-00350-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Accepted: 02/20/2024] [Indexed: 03/10/2024] Open
Abstract
Chronic kidney diseases (CKD) have genetic associations with kidney function. Univariate genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) associated with estimated glomerular filtration rate (eGFR) and blood urea nitrogen (BUN), two complementary kidney function markers. However, it is unknown whether additional SNPs for kidney function can be identified by multivariate statistical analysis. To address this, we applied canonical correlation analysis (CCA), a multivariate method, to two individual-level CKD genotype datasets, and metaCCA to two published GWAS summary statistics datasets. We identified SNPs previously associated with kidney function by published univariate GWASs with high replication rates, validating the metaCCA method. We then extended discovery and identified previously unreported lead SNPs for both kidney function markers, jointly. These showed expression quantitative trait loci (eQTL) colocalisation with genes having significant differential expression between CKD and healthy individuals. Several of these identified lead missense SNPs were predicted to have a functional impact, including in SLC14A2. We also identified previously unreported lead SNPs that showed significant correlation with both kidney function markers, jointly, in the European ancestry CKDGen, National Unified Renal Translational Research Enterprise (NURTuRE)-CKD and Salford Kidney Study (SKS) datasets. Of these, rs3094060 colocalised with FLOT1 gene expression and was significantly more common in CKD cases in both NURTURE-CKD and SKS, than in the general population. Overall, by using multivariate analysis by CCA, we identified additional SNPs and genes for both kidney function and CKD, that can be prioritised for further CKD analyses.
Collapse
Affiliation(s)
- Amy J Osborne
- Intelligent Systems Laboratory, University of Bristol, Bristol, BS8 1TW, UK.
| | - Agnieszka Bierzynska
- Bristol Renal, University of Bristol and Bristol Royal Hospital for Children, Bristol, BS1 3NY, UK
| | - Elizabeth Colby
- Bristol Renal, University of Bristol and Bristol Royal Hospital for Children, Bristol, BS1 3NY, UK
| | - Uwe Andag
- Department of Metabolic and Renal Diseases, Evotec International GmbH, Marie-Curie-Strasse 7, 37079, Göttingen, Germany
| | - Philip A Kalra
- Department of Renal Medicine, Salford Royal Hospital, Northern Care Alliance NHS Foundation Trust, Stott Lane, Salford, M6 8HD, UK
| | - Olivier Radresa
- Department of Metabolic and Renal Diseases, Evotec International GmbH, Marie-Curie-Strasse 7, 37079, Göttingen, Germany
| | - Philipp Skroblin
- Department of Metabolic and Renal Diseases, Evotec International GmbH, Marie-Curie-Strasse 7, 37079, Göttingen, Germany
| | - Maarten W Taal
- Centre for Kidney Research and Innovation, University of Nottingham, Derby, UK
| | - Gavin I Welsh
- Bristol Renal, University of Bristol and Bristol Royal Hospital for Children, Bristol, BS1 3NY, UK
| | - Moin A Saleem
- Bristol Renal, University of Bristol and Bristol Royal Hospital for Children, Bristol, BS1 3NY, UK
| | - Colin Campbell
- Intelligent Systems Laboratory, University of Bristol, Bristol, BS8 1TW, UK.
| |
Collapse
|
2
|
Yang B, Meng T, Wang X, Li J, Zhao S, Wang Y, Yi S, Zhou Y, Zhang Y, Li L, Guo L. CAT Bridge: an efficient toolkit for gene-metabolite association mining from multiomics data. Gigascience 2024; 13:giae083. [PMID: 39517109 PMCID: PMC11548955 DOI: 10.1093/gigascience/giae083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Revised: 08/08/2024] [Accepted: 10/04/2024] [Indexed: 11/16/2024] Open
Abstract
BACKGROUND With advancements in sequencing and mass spectrometry technologies, multiomics data can now be easily acquired for understanding complex biological systems. Nevertheless, substantial challenges remain in determining the association between gene-metabolite pairs due to the nonlinear and multifactorial interactions within cellular networks. The complexity arises from the interplay of multiple genes and metabolites, often involving feedback loops and time-dependent regulatory mechanisms that are not easily captured by traditional analysis methods. FINDINGS Here, we introduce Compounds And Transcripts Bridge (abbreviated as CAT Bridge, available at https://catbridge.work), a free user-friendly platform for longitudinal multiomics analysis to efficiently identify transcripts associated with metabolites using time-series omics data. To evaluate the association of gene-metabolite pairs, CAT Bridge is a pioneering work benchmarking a set of statistical methods spanning causality estimation and correlation coefficient calculation for multiomics analysis. Additionally, CAT Bridge features an artificial intelligence agent to assist users interpreting the association results. CONCLUSIONS We applied CAT Bridge to experimentally obtained Capsicum chinense (chili pepper) and public human and Escherichia coli time-series transcriptome and metabolome datasets. CAT Bridge successfully identified genes involved in the biosynthesis of capsaicin in C. chinense. Furthermore, case study results showed that the convergent cross-mapping method outperforms traditional approaches in longitudinal multiomics analyses. CAT Bridge simplifies access to various established methods for longitudinal multiomics analysis and enables researchers to swiftly identify associated gene-metabolite pairs for further validation.
Collapse
Affiliation(s)
- Bowen Yang
- Shandong Key Laboratory of Precision Molecular Crop Design and Breeding, Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang 261325, China
- Department of Chemistry, University of Alberta, Edmonton, AB T6G 2G2, Canada
| | - Tan Meng
- Shandong Key Laboratory of Precision Molecular Crop Design and Breeding, Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang 261325, China
| | - Xinrui Wang
- Shandong Key Laboratory of Precision Molecular Crop Design and Breeding, Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang 261325, China
| | - Jun Li
- Shandong Key Laboratory of Precision Molecular Crop Design and Breeding, Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang 261325, China
| | - Shuang Zhao
- The Metabolomics Innovation Centre, University of Alberta, Edmonton, AB T6G 1C9, Canada
| | - Yingheng Wang
- Department of Computer Science, Cornell University, Ithaca, NY 14853, USA
| | - Shu Yi
- Shandong Key Laboratory of Precision Molecular Crop Design and Breeding, Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang 261325, China
| | - Yi Zhou
- Shandong Key Laboratory of Precision Molecular Crop Design and Breeding, Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang 261325, China
| | - Yi Zhang
- Shandong Key Laboratory of Precision Molecular Crop Design and Breeding, Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang 261325, China
| | - Liang Li
- Department of Chemistry, University of Alberta, Edmonton, AB T6G 2G2, Canada
- The Metabolomics Innovation Centre, University of Alberta, Edmonton, AB T6G 1C9, Canada
| | - Li Guo
- Shandong Key Laboratory of Precision Molecular Crop Design and Breeding, Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang 261325, China
| |
Collapse
|
3
|
Guo H, Li T, Wang Z. Pleiotropic genetic association analysis with multiple phenotypes using multivariate response best-subset selection. BMC Genomics 2023; 24:759. [PMID: 38082214 PMCID: PMC10712198 DOI: 10.1186/s12864-023-09820-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 11/20/2023] [Indexed: 12/18/2023] Open
Abstract
Genetic pleiotropy refers to the simultaneous association of a gene with multiple phenotypes. It is widely distributed in the whole genome and can help to understand the common genetic mechanism of diseases or traits. In this study, a multivariate response best-subset selection (MRBSS) model based pleiotropic association analysis method is proposed. Different from the traditional genetic association model, the high-dimensional genotypic data are viewed as response variables while the multiple phenotypic data as predictor variables. Moreover, the response best-subset selection procedure is converted into an 0-1 integer optimization problem by introducing a separation parameter and a tuning parameter. Furthermore, the model parameters are estimated by using the curve search under the modified Bayesian information criterion. Simulation experiments show that the proposed method MRBSS remarkably reduces the computational time, obtains higher statistical power under most of the considered scenarios, and controls the type I error rate at a low level. The application studies in the datasets of maize yield traits and pig lipid traits further verifies the effectiveness.
Collapse
Affiliation(s)
- Hongping Guo
- School of Mathematics and Statistics, Hubei Normal University, Huangshi, 435002, People's Republic of China.
| | - Tong Li
- School of Mathematics and Statistics, Hubei Normal University, Huangshi, 435002, People's Republic of China
| | - Zixuan Wang
- School of Mathematics and Statistics, South-Central Minzu University, Wuhan, 430074, People's Republic of China
| |
Collapse
|
4
|
Zhou Z, Tarzanagh DA, Hou B, Tong B, Xu J, Feng Y, Long Q, Shen L. Fair Canonical Correlation Analysis. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 2023; 36:3675-3705. [PMID: 38665178 PMCID: PMC11040228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 04/28/2024]
Abstract
This paper investigates fairness and bias in Canonical Correlation Analysis (CCA), a widely used statistical technique for examining the relationship between two sets of variables. We present a framework that alleviates unfairness by minimizing the correlation disparity error associated with protected attributes. Our approach enables CCA to learn global projection matrices from all data points while ensuring that these matrices yield comparable correlation levels to group-specific projection matrices. Experimental evaluation on both synthetic and real-world datasets demonstrates the efficacy of our method in reducing correlation disparity error without compromising CCA accuracy.
Collapse
Affiliation(s)
| | | | | | | | - Jia Xu
- University of Pennsylvania
| | | | | | | |
Collapse
|
5
|
St-Pierre J, Oualkacha K. A copula-based set-variant association test for bivariate continuous, binary or mixed phenotypes. Int J Biostat 2023; 19:369-387. [PMID: 36279152 PMCID: PMC10644254 DOI: 10.1515/ijb-2022-0010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 05/26/2022] [Accepted: 08/23/2022] [Indexed: 11/15/2022]
Abstract
In genome wide association studies (GWAS), researchers are often dealing with dichotomous and non-normally distributed traits, or a mixture of discrete-continuous traits. However, most of the current region-based methods rely on multivariate linear mixed models (mvLMMs) and assume a multivariate normal distribution for the phenotypes of interest. Hence, these methods are not applicable to disease or non-normally distributed traits. Therefore, there is a need to develop unified and flexible methods to study association between a set of (possibly rare) genetic variants and non-normal multivariate phenotypes. Copulas are multivariate distribution functions with uniform margins on the [0, 1] interval and they provide suitable models to deal with non-normality of errors in multivariate association studies. We propose a novel unified and flexible copula-based multivariate association test (CBMAT) for discovering association between a genetic region and a bivariate continuous, binary or mixed phenotype. We also derive a data-driven analytic p-value procedure of the proposed region-based score-type test. Through simulation studies, we demonstrate that CBMAT has well controlled type I error rates and higher power to detect associations compared with other existing methods, for discrete and non-normally distributed traits. At last, we apply CBMAT to detect the association between two genes located on chromosome 11 and several lipid levels measured on 1477 subjects from the ASLPAC study.
Collapse
Affiliation(s)
- Julien St-Pierre
- Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, QC, Canada
| | - Karim Oualkacha
- Département de Mathématiques, Université du Québec à Montréal, Montreal, QC, Canada
| |
Collapse
|
6
|
Mazaya M, Kwon YK. In Silico Pleiotropy Analysis in KEGG Signaling Networks Using a Boolean Network Model. Biomolecules 2022; 12:biom12081139. [PMID: 36009032 PMCID: PMC9406064 DOI: 10.3390/biom12081139] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 08/10/2022] [Accepted: 08/15/2022] [Indexed: 11/16/2022] Open
Abstract
Pleiotropy, which refers to the ability of different mutations on the same gene to cause different pathological effects in human genetic diseases, is important in understanding system-level biological diseases. Although some biological experiments have been proposed, still little is known about pleiotropy on gene–gene dynamics, since most previous studies have been based on correlation analysis. Therefore, a new perspective is needed to investigate pleiotropy in terms of gene–gene dynamical characteristics. To quantify pleiotropy in terms of network dynamics, we propose a measure called in silico Pleiotropic Scores (sPS), which represents how much a gene is affected against a pair of different types of mutations on a Boolean network model. We found that our model can identify more candidate pleiotropic genes that are not known to be pleiotropic than the experimental database. In addition, we found that many types of functionally important genes tend to have higher sPS values than other genes; in other words, they are more pleiotropic. We investigated the relations of sPS with the structural properties in the signaling network and found that there are highly positive relations to degree, feedback loops, and centrality measures. This implies that the structural characteristics are principles to identify new pleiotropic genes. Finally, we found some biological evidence showing that sPS analysis is relevant to the real pleiotropic data and can be considered a novel candidate for pleiotropic gene research. Taken together, our results can be used to understand the dynamics pleiotropic characteristics in complex biological systems in terms of gene–phenotype relations.
Collapse
Affiliation(s)
- Maulida Mazaya
- Research Center for Computing, National Research and Innovation Agency (BRIN), Cibinong Science Center, Jl. Raya Jakarta-Bogor KM 46, Cibinong 16911, West Java, Indonesia
| | - Yung-Keun Kwon
- School of IT Convergence, University of Ulsan, 93 Daehak-ro, Nam-gu, Ulsan 44610, Korea
- Correspondence:
| |
Collapse
|
7
|
Exploring the Pleiotropic Genes and Therapeutic Targets Associated with Heart Failure and Chronic Kidney Disease by Integrating metaCCA and SGLT2 Inhibitors' Target Prediction. BIOMED RESEARCH INTERNATIONAL 2021; 2021:4229194. [PMID: 34540994 PMCID: PMC8443964 DOI: 10.1155/2021/4229194] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Revised: 08/08/2021] [Accepted: 08/11/2021] [Indexed: 11/18/2022]
Abstract
Background Previous studies have shown that heart failure (HF) and chronic kidney disease (CKD) have common genetic mechanisms, overlapping pathophysiological pathways, and therapeutic drug—sodium-glucose cotransporter 2 (SGLT2) inhibitors. Methods The genetic pleiotropy metaCCA method was applied on summary statistics data from two independent meta-analyses of GWAS comprising more than 1 million people to identify shared variants and pleiotropic effects between HF and CKD. Targets of SGLT2 inhibitors were predicted by SwissTargetPrediction and DrugBank databases. To refine all genes, we performed using versatile gene-based association study 2 (VEGAS2) and transcriptome-wide association studies (TWAS) for HF and CKD, respectively. Gene enrichment and KEGG pathway analyses were used to explore the potential functional significance of the identified genes and targets. Results After metaCCA analysis, 4,624 SNPs and 1,745 genes were identified to be potentially pleiotropic in the univariate and multivariate SNP-multivariate phenotype analyses, respectively. 21 common genes were detected in both metaCCA and SGLT2 inhibitors' target prediction. In addition, 169 putative pleiotropic genes were identified, which met the significance threshold both in metaCCA analysis and in the VEGAS2 or TWAS analysis for at least one disease. Conclusion We identified novel variants associated with HF and CKD using effectively incorporating information from different GWAS datasets. Our analysis may provide new insights into HF and CKD therapeutic approaches based on the pleiotropic genes, common targets, and mechanisms by integrating the metaCCA method, TWAS and VEGAS2 analyses, and target prediction of SGLT2 inhibitors.
Collapse
|
8
|
Guo B, Wu B. Integrate multiple traits to detect novel trait-gene association using GWAS summary data with an adaptive test approach. Bioinformatics 2020; 35:2251-2257. [PMID: 30476000 DOI: 10.1093/bioinformatics/bty961] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2018] [Revised: 10/30/2018] [Accepted: 11/22/2018] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Genetics hold great promise to precision medicine by tailoring treatment to the individual patient based on their genetic profiles. Toward this goal, many large-scale genome-wide association studies (GWAS) have been performed in the last decade to identify genetic variants associated with various traits and diseases. They have successfully identified tens of thousands of disease-related variants. However they have explained only a small proportion of the overall trait heritability for most traits and are of very limited clinical use. This is partly owing to the small effect sizes of most genetic variants, and the common practice of testing association between one trait and one genetic variant at a time in most GWAS, even when multiple related traits are often measured for each individual. Increasing evidence suggests that many genetic variants can influence multiple traits simultaneously, and we can gain more power by testing association of multiple traits simultaneously. It is appealing to develop novel multi-trait association test methods that need only GWAS summary data, since it is generally very hard to access the individual-level GWAS phenotype and genotype data. RESULTS Many existing GWAS summary data-based association test methods have relied on ad hoc approach or crude Monte Carlo approximation. In this article, we develop rigorous statistical methods for efficient and powerful multi-trait association test. We develop robust and efficient methods to accurately estimate the marginal trait correlation matrix using only GWAS summary data. We construct the principal component (PC)-based association test from the summary statistics. PC-based test has optimal power when the underlying multi-trait signal can be captured by the first PC, and otherwise it will have suboptimal performance. We develop an adaptive test by optimally weighting the PC-based test and the omnibus chi-square test to achieve robust performance under various scenarios. We develop efficient numerical algorithms to compute the analytical P-values for all the proposed tests without the need of Monte Carlo sampling. We illustrate the utility of proposed methods through application to the GWAS meta-analysis summary data for multiple lipids and glycemic traits. We identify multiple novel loci that were missed by individual trait-based association test. AVAILABILITY AND IMPLEMENTATION All the proposed methods are implemented in an R package available at http://www.github.com/baolinwu/MTAR. The developed R programs are extremely efficient: it takes less than 2 min to compute the list of genome-wide significant single nucleotide polymorphisms (SNPs) for all proposed multi-trait tests for the lipids GWAS summary data with 2.5 million SNPs on a single Linux desktop. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
9
|
Guo B, Wu B. Powerful and efficient SNP-set association tests across multiple phenotypes using GWAS summary data. Bioinformatics 2020; 35:1366-1372. [PMID: 30239606 DOI: 10.1093/bioinformatics/bty811] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2018] [Revised: 08/29/2018] [Accepted: 09/18/2018] [Indexed: 01/09/2023] Open
Abstract
MOTIVATION Many GWAS conducted in the past decade have identified tens of thousands of disease related variants, which in total explained only part of the heritability for most traits. There remain many more genetics variants with small effect sizes to be discovered. This has motivated the development of sequencing studies with larger sample sizes and increased resolution of genotyped variants, e.g., the ongoing NHLBI Trans-Omics for Precision Medicine (TOPMed) whole genome sequencing project. An alternative approach is the development of novel and more powerful statistical methods. The current dominating approach in the field of GWAS analysis is the "single trait single variant" association test, despite the fact that most GWAS are conducted in deeply-phenotyped cohorts with many correlated traits measured. In this paper, we aim to develop rigorous methods that integrate multiple correlated traits and multiple variants to improve the power to detect novel variants. In recognition of the difficulty of accessing raw genotype and phenotype data due to privacy and logistic concerns, we develop methods that are applicable to publicly available GWAS summary data. RESULTS We build rigorous statistical models for GWAS summary statistics to motivate novel multi-trait SNP-set association tests, including variance component test, burden test and their adaptive test, and develop efficient numerical algorithms to quickly compute their analytical P-values. We implement the proposed methods in an open source R package. We conduct thorough simulation studies to verify the proposed methods rigorously control type I errors at the genome-wide significance level, and further demonstrate their utility via comprehensive analysis of GWAS summary data for multiple lipids traits and glycemic traits. We identified many novel loci that were not detected by the individual trait based GWAS analysis. AVAILABILITY AND IMPLEMENTATION We have implemented the proposed methods in an R package freely available at http://www.github.com/baolinwu/MSKAT. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Bin Guo
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| | - Baolin Wu
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| |
Collapse
|
10
|
Jia X, Shi N, Feng Y, Li Y, Tan J, Xu F, Wang W, Sun C, Deng H, Yang Y, Shi X. Identification of 67 Pleiotropic Genes Associated With Seven Autoimmune/Autoinflammatory Diseases Using Multivariate Statistical Analysis. Front Immunol 2020; 11:30. [PMID: 32117227 PMCID: PMC7008725 DOI: 10.3389/fimmu.2020.00030] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Accepted: 01/08/2020] [Indexed: 12/19/2022] Open
Abstract
Although genome-wide association studies (GWAS) have a dramatic impact on susceptibility locus discovery, this univariate approach has limitations in detecting complex genotype-phenotype correlations. Multivariate analysis is essential to identify shared genetic risk factors acting through common biological mechanisms of autoimmune/autoinflammatory diseases. In this study, GWAS summary statistics, including 41,274 single nucleotide polymorphisms (SNPs) located in 11,516 gene regions, were analyzed to identify shared variants of seven autoimmune/autoinflammatory diseases using the metaCCA method. Gene-based association analysis was used to refine the pleiotropic genes. In addition, GO term enrichment analysis and protein-protein interaction network analysis were applied to explore the potential biological functions of the identified genes. A total of 4,962 SNPs (P < 1.21 × 10-6) and 1,044 pleotropic genes (P < 4.34 × 10-6) were identified by metaCCA analysis. By screening the results of gene-based P-values, we identified the existence of 27 confirmed pleiotropic genes and highlighted 40 novel pleiotropic genes that achieved statistical significance in the metaCCA analysis and were also associated with at least one autoimmune/autoinflammatory in the VEGAS2 analysis. Using the metaCCA method, we identified novel variants associated with complex diseases incorporating different GWAS datasets. Our analysis may provide insights for the development of common therapeutic approaches for autoimmune/autoinflammatory diseases based on the pleiotropic genes and common mechanisms identified.
Collapse
Affiliation(s)
- Xiaocan Jia
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Nian Shi
- Department of Physical Diagnosis, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Yu Feng
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Yifan Li
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Jiebing Tan
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Fei Xu
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Wei Wang
- Department of Occupational and Environmental Health, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Changqing Sun
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Hongwen Deng
- Center for Bioinformatics and Genomics, School of Public Health and Tropical Medicine, Tulane University, New Orleans, LA, United States
| | - Yongli Yang
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Xuezhong Shi
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| |
Collapse
|
11
|
Masotti M, Guo B, Wu B. Pleiotropy informed adaptive association test of multiple traits using genome-wide association study summary data. Biometrics 2019; 75:1076-1085. [PMID: 31021400 DOI: 10.1111/biom.13076] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Accepted: 04/16/2019] [Indexed: 12/17/2022]
Abstract
Genetic variants associated with disease outcomes can be used to develop personalized treatment. To reach this precision medicine goal, hundreds of large-scale genome-wide association studies (GWAS) have been conducted in the past decade to search for promising genetic variants associated with various traits. They have successfully identified tens of thousands of disease-related variants. However, in total these identified variants explain only part of the variation for most complex traits. There remain many genetic variants with small effect sizes to be discovered, which calls for the development of (a) GWAS with more samples and more comprehensively genotyped variants, for example, the NHLBI Trans-Omics for Precision Medicine (TOPMed) Program is planning to conduct whole genome sequencing on over 100 000 individuals; and (b) novel and more powerful statistical analysis methods. The current dominating GWAS analysis approach is the "single trait" association test, despite the fact that many GWAS are conducted in deeply phenotyped cohorts including many correlated and well-characterized outcomes, which can help improve the power to detect novel variants if properly analyzed, as suggested by increasing evidence that pleiotropy, where a genetic variant affects multiple traits, is the norm in genome-phenome associations. We aim to develop pleiotropy informed powerful association test methods across multiple traits for GWAS. Since it is generally very hard to access individual-level GWAS phenotype and genotype data for those existing GWAS, due to privacy concerns and various logistical considerations, we develop rigorous statistical methods for pleiotropy informed adaptive multitrait association test methods that need only summary association statistics publicly available from most GWAS. We first develop a pleiotropy test, which has powerful performance for truly pleiotropic variants but is sensitive to the pleiotropy assumption. We then develop a pleiotropy informed adaptive test that has robust and powerful performance under various genetic models. We develop accurate and efficient numerical algorithms to compute the analytical P-value for the proposed adaptive test without the need of resampling or permutation. We illustrate the performance of proposed methods through application to joint association test of GWAS meta-analysis summary data for several glycemic traits. Our proposed adaptive test identified several novel loci missed by individual trait based GWAS meta-analysis. All the proposed methods are implemented in a publicly available R package.
Collapse
Affiliation(s)
- Maria Masotti
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota
| | - Bin Guo
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota
| | - Baolin Wu
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota
| |
Collapse
|
12
|
Jia X, Yang Y, Chen Y, Xia Z, Zhang W, Feng Y, Li Y, Tan J, Xu C, Zhang Q, Deng H, Shi X. Multivariate analysis of genome-wide data to identify potential pleiotropic genes for type 2 diabetes, obesity and coronary artery disease using MetaCCA. Int J Cardiol 2019; 283:144-150. [DOI: 10.1016/j.ijcard.2018.10.102] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/09/2018] [Revised: 10/17/2018] [Accepted: 10/29/2018] [Indexed: 01/26/2023]
|
13
|
Jia X, Yang Y, Chen Y, Cheng Z, Du Y, Xia Z, Zhang W, Xu C, Zhang Q, Xia X, Deng H, Shi X. Multivariate analysis of genome-wide data to identify potential pleiotropic genes for five major psychiatric disorders using MetaCCA. J Affect Disord 2019; 242:234-243. [PMID: 30212762 PMCID: PMC6343670 DOI: 10.1016/j.jad.2018.07.046] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Revised: 06/06/2018] [Accepted: 07/16/2018] [Indexed: 12/14/2022]
Abstract
BACKGROUND Genome-wide association studies have been extensively applied in identifying SNP associated with major psychiatric disorders. However, the SNPs identified by the prevailing univariate approach only explain a small percentage of the genetic variance of traits, and the extensive data have shown the major psychiatric disorders have common biological mechanisms and the overlapping pathophysiological pathways. METHODS We applied the genetic pleiotropy-informed metaCCA method on summary statistics data from the Psychiatric Genomics Consortium Cross-Disorder Group to examine the overlapping genetic relations between the five major psychiatric disorders. Furthermore, to refine all genes, we performed gene-based association analyses for the five disorders respectively using VEGAS2. Gene enrichment analysis was applied to explore the potential functional significance of the identified genes. RESULTS After metaCCA analysis, 1147 SNPs reached the Bonferroni corrected threshold (p < 1.06 × 10-6) in the univariate SNP-multivariate phenotype analysis, and 246 genes with a significance threshold (p < 3.85 × 10-6) were identified as potentially pleiotropic genes in the multivariate SNP-multivariate phenotype analysis. By screening the results of gene-based p-values, we identified 37 putative pleiotropic genes which achieved significance threshold in metaCCA analyses and were also associated with at least one disorder in the VEGAS2 analyses. LIMITATIONS Alternative approaches and experimental studies may be applied to check whether novel genes could still be identified/substantiated with these methods. CONCLUSIONS The metaCCA method identified novel variants associated with psychiatric disorders by effectively incorporating information from different GWAS datasets. Our analyses may provide insights for some common therapeutic approaches of these five major psychiatric disorders based on the pleiotropic genes and common mechanisms identified.
Collapse
Affiliation(s)
- XiaoCan Jia
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, China
| | - YongLi Yang
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, China
| | - YuanCheng Chen
- Department of Endocrinology and Metabolism, The Third Affiliated Hospital of Southern Medical University, Guang Zhou, Guangdong, China
| | - ZhiWei Cheng
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, China
| | - Yuhui Du
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, China
| | - Zhenhua Xia
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, China
| | - Weiping Zhang
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, China
| | - Chao Xu
- Center for Bioinformatics and Genomics, School of Public Health and Tropical Medicine, Tulane University, New Orleans, LA, USA
| | - Qiang Zhang
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, China
| | - Xin Xia
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, China
| | - HongWen Deng
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, China.
| | - XueZhong Shi
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, China.
| |
Collapse
|
14
|
Hackinger S, Zeggini E. Statistical methods to detect pleiotropy in human complex traits. Open Biol 2018; 7:rsob.170125. [PMID: 29093210 PMCID: PMC5717338 DOI: 10.1098/rsob.170125] [Citation(s) in RCA: 89] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2017] [Accepted: 09/29/2017] [Indexed: 12/13/2022] Open
Abstract
In recent years pleiotropy, the phenomenon of one genetic locus influencing several traits, has become a widely researched field in human genetics. With the increasing availability of genome-wide association study summary statistics, as well as the establishment of deeply phenotyped sample collections, it is now possible to systematically assess the genetic overlap between multiple traits and diseases. In addition to increasing power to detect associated variants, multi-trait methods can also aid our understanding of how different disorders are aetiologically linked by highlighting relevant biological pathways. A plethora of available tools to perform such analyses exists, each with their own advantages and limitations. In this review, we outline some of the currently available methods to conduct multi-trait analyses. First, we briefly introduce the concept of pleiotropy and outline the current landscape of pleiotropy research in human genetics; second, we describe analytical considerations and analysis methods; finally, we discuss future directions for the field.
Collapse
|
15
|
Fast and Accurate Genome-Wide Association Test of Multiple Quantitative Traits. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2018; 2018:2564531. [PMID: 29743933 PMCID: PMC5878919 DOI: 10.1155/2018/2564531] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2017] [Accepted: 01/24/2018] [Indexed: 02/03/2023]
Abstract
Multiple correlated traits are often collected in genetic studies. By jointly analyzing multiple traits, we can increase power by aggregating multiple weak effects and reveal additional insights into the genetic architecture of complex human diseases. In this article, we propose a multivariate linear regression-based method to test the joint association of multiple quantitative traits. It is flexible to accommodate any covariates, has very accurate control of type I errors, and offers very competitive performance. We also discuss fast and accurate significance p value computation especially for genome-wide association studies with small-to-medium sample sizes. We demonstrate through extensive numerical studies that the proposed method has competitive performance. Its usefulness is further illustrated with application to genome-wide association analysis of diabetes-related traits in the Atherosclerosis Risk in Communities (ARIC) study. We found some very interesting associations with diabetes traits which have not been reported before. We implemented the proposed methods in a publicly available R package.
Collapse
|
16
|
Lin N, Zhu Y, Fan R, Xiong M. A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data. PLoS Comput Biol 2017; 13:e1005788. [PMID: 29040274 PMCID: PMC5659802 DOI: 10.1371/journal.pcbi.1005788] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2016] [Revised: 10/27/2017] [Accepted: 09/21/2017] [Indexed: 01/12/2023] Open
Abstract
Investigating the pleiotropic effects of genetic variants can increase statistical power, provide important information to achieve deep understanding of the complex genetic structures of disease, and offer powerful tools for designing effective treatments with fewer side effects. However, the current multiple phenotype association analysis paradigm lacks breadth (number of phenotypes and genetic variants jointly analyzed at the same time) and depth (hierarchical structure of phenotype and genotypes). A key issue for high dimensional pleiotropic analysis is to effectively extract informative internal representation and features from high dimensional genotype and phenotype data. To explore correlation information of genetic variants, effectively reduce data dimensions, and overcome critical barriers in advancing the development of novel statistical methods and computational algorithms for genetic pleiotropic analysis, we proposed a new statistic method referred to as a quadratically regularized functional CCA (QRFCCA) for association analysis which combines three approaches: (1) quadratically regularized matrix factorization, (2) functional data analysis and (3) canonical correlation analysis (CCA). Large-scale simulations show that the QRFCCA has a much higher power than that of the ten competing statistics while retaining the appropriate type 1 errors. To further evaluate performance, the QRFCCA and ten other statistics are applied to the whole genome sequencing dataset from the TwinsUK study. We identify a total of 79 genes with rare variants and 67 genes with common variants significantly associated with the 46 traits using QRFCCA. The results show that the QRFCCA substantially outperforms the ten other statistics.
Collapse
Affiliation(s)
- Nan Lin
- Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, United States of America
| | - Yun Zhu
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, United States of America
| | - Ruzong Fan
- Biostatistics and Bioinformatics Branch (BBB), Division of Intramural Population Health Research (DIPHR), Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health (NIH), Bethesda, MD, United States of America
| | - Momiao Xiong
- Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, United States of America
| |
Collapse
|
17
|
Gordon D, Londono D, Patel P, Kim W, Finch SJ, Heiman GA. An Analytic Solution to the Computation of Power and Sample Size for Genetic Association Studies under a Pleiotropic Mode of Inheritance. Hum Hered 2017; 81:194-209. [PMID: 28315880 DOI: 10.1159/000457135] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Accepted: 01/20/2017] [Indexed: 01/14/2023] Open
Abstract
Our motivation here is to calculate the power of 3 statistical tests used when there are genetic traits that operate under a pleiotropic mode of inheritance and when qualitative phenotypes are defined by use of thresholds for the multiple quantitative phenotypes. Specifically, we formulate a multivariate function that provides the probability that an individual has a vector of specific quantitative trait values conditional on having a risk locus genotype, and we apply thresholds to define qualitative phenotypes (affected, unaffected) and compute penetrances and conditional genotype frequencies based on the multivariate function. We extend the analytic power and minimum-sample-size-necessary (MSSN) formulas for 2 categorical data-based tests (genotype, linear trend test [LTT]) of genetic association to the pleiotropic model. We further compare the MSSN of the genotype test and the LTT with that of a multivariate ANOVA (Pillai). We approximate the MSSN for statistics by linear models using a factorial design and ANOVA. With ANOVA decomposition, we determine which factors most significantly change the power/MSSN for all statistics. Finally, we determine which test statistics have the smallest MSSN. In this work, MSSN calculations are for 2 traits (bivariate distributions) only (for illustrative purposes). We note that the calculations may be extended to address any number of traits. Our key findings are that the genotype test usually has lower MSSN requirements than the LTT. More inclusive thresholds (top/bottom 25% vs. top/bottom 10%) have higher sample size requirements. The Pillai test has a much larger MSSN than both the genotype test and the LTT, as a result of sample selection. With these formulas, researchers can specify how many subjects they must collect to localize genes for pleiotropic phenotypes.
Collapse
Affiliation(s)
- Derek Gordon
- Department of Genetics, The State University of New Jersey, Piscataway, NJ, USA
| | | | | | | | | | | |
Collapse
|
18
|
Adeoye AM, Ovbiagele B, Kolo P, Appiah L, Aje A, Adebayo O, Sarfo F, Akinyemi J, Adekunle G, Agyekum F, Shidali V, Ogah O, Lackland D, Gebregziabher M, Arnett D, Tiwari HK, Akinyemi R, Olagoke OO, Oguntade AS, Olunuga T, Uwanruochi K, Jenkins C, Adadey P, Iheonye H, Owolabi L, Obiako R, Akinjopo S, Armstrong K, Akpalu A, Fakunle A, Saulson R, Aridegbe M, Olowoyo P, Osaigbovo G, Akpalu J, Fawale B, Adebayo P, Arulogun O, Ibinaiye P, Agunloye A, Ishaq N, Wahab K, Akpa O, Adeleye O, Bock-Oruma A, Ogbole G, Melikam S, Yaria J, Ogunjimi L, Salaam A, Sunmonu T, Makanjuola A, Farombi T, Laryea R, Uvere E, Kehinde S, Chukwuonye I, Azuh P, Komolafe M, Akintunde A, Obiabo O, Areo O, Kehinde I, Amusa AG, Owolabi M. Exploring Overlaps Between the Genomic and Environmental Determinants of LVH and Stroke: A Multicenter Study in West Africa. Glob Heart 2017; 12:107-113.e5. [PMID: 28302552 DOI: 10.1016/j.gheart.2017.01.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2017] [Accepted: 01/05/2017] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Whether left ventricular hypertrophy (LVH) is determined by similar genomic and environmental risk factors with stroke, or is simply an intermediate stroke marker, is unknown. OBJECTIVES We present a research plan and preliminary findings to explore the overlap in the genomic and environmental determinants of LVH and stroke among Africans participating in the SIREN (Stroke Investigative Research and Education Network) study. METHODS SIREN is a transnational, multicenter study involving acute stroke patients and age-, ethnicity-, and sex-matched control subjects recruited from 9 sites in Ghana and Nigeria. Genomic and environmental risk factors and other relevant phenotypes for stroke and LVH are being collected and compared using standard techniques. RESULTS This preliminary analysis included only 725 stroke patients (mean age 59.1 ± 13.2 years; 54.3% male). Fifty-five percent of the stroke subjects had LVH with greater proportion among women (51.6% vs. 48.4%; p < 0.001). Those with LVH were younger (57.9 ± 12.8 vs. 60.6 ± 13.4; p = 0.006) and had higher mean systolic and diastolic blood pressure (167.1/99.5 mm Hg vs 151.7/90.6 mm Hg; p < 0.001). Uncontrolled blood pressure at presentation was prevalent in subjects with LVH (76.2% vs. 57.7%; p < 0.001). Significant independent predictors of LVH were age <45 years (adjusted odds ratio [AOR]: 1.91; 95% confidence interval [CI]: 1.14 to 3.19), female sex (AOR: 2.01; 95% CI: 1.44 to 2.81), and diastolic blood pressure > 90 mm Hg (AOR: 2.10; 95% CI: 1.39 to 3.19; p < 0.001). CONCLUSIONS The prevalence of LVH was high among stroke patients especially the younger ones, suggesting a genetic component to LVH. Hypertension was a major modifiable risk factor for stroke as well as LVH. It is envisaged that the SIREN project will elucidate polygenic overlap (if present) between LVH and stroke among Africans, thereby defining the role of LVH as a putative intermediate cardiovascular phenotype and therapeutic target to inform interventions to reduce stroke risk in populations of African ancestry.
Collapse
Affiliation(s)
| | | | - Philip Kolo
- University of Ilorin Teaching Hospital, Ilorin, Nigeria
| | | | | | | | - Fred Sarfo
- Komfo Anokye Teaching Hospital, Kumasi, Ghana
| | | | | | | | | | | | - Dan Lackland
- Medical University of South Carolina, Charleston, SC, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Raelle Saulson
- Medical University of South Carolina, Charleston, SC, USA
| | | | - Paul Olowoyo
- Federal University Teaching Hospital, Ido-Ekiti, Nigeria
| | | | | | - Bimbo Fawale
- Obafemi Awolowo University Teaching Hospital, Ile-Ife, Nigeria
| | - Philip Adebayo
- Ladoke Akintola University of Technology, Ogbomoso, Nigeria
| | | | | | | | | | | | | | - Omisore Adeleye
- Obafemi Awolowo University Teaching Hospital, Ile-Ife, Nigeria
| | | | | | | | | | | | | | | | | | | | - Ruth Laryea
- University of Ghana Medical School, Accra, Ghana
| | | | | | | | | | | | | | - Olugbo Obiabo
- Delta State University Teaching Hospital, Ogara, Nigeria
| | - Olusegun Areo
- Federal University Teaching Hospital, Ido-Ekiti, Nigeria
| | | | | | | | | |
Collapse
|
19
|
Wu B, Pankow JS. Genome-wide association test of multiple continuous traits using imputed SNPs. STATISTICS AND ITS INTERFACE 2017; 10:379-386. [PMID: 28217245 PMCID: PMC5310616 DOI: 10.4310/sii.2017.v10.n3.a2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
More and more large cohort studies have conducted or are conducting genome-wide association studies (GWAS) to reveal the genetic components of many complex human diseases. These large cohort studies often collected a broad array of correlated phenotypes that reflect common physiological processes. By jointly analyzing these correlated traits, we can gain more power by aggregating multiple weak effects and shed light on the mechanisms underlying complex human diseases. The majority of existing multi-trait association test methods are based on jointly modeling the multivariate traits conditional on the genotype as covariate, and can readily accommodate the imputed SNPs by using their imputed dosage as a covariate. An alternative class of multi-trait association tests is based on the inverted regression, which models the distribution of genotypes conditional on the covariate and multivariate traits, and has been shown to have competitive performance. To our knowledge, all existing inverted regression approaches have implicitly used the "best-guess" genotypes, which is not efficient and known to lead to dramatic power loss, and there have not been any proposed methods of incorporating imputation uncertainty into inverted regressions. In this work, we propose a general and efficient framework that can account for the imputation uncertainty to further improve the association test power of inverted regression models for imputed SNPs. We demonstrate through extensive numerical studies that the proposed method has competitive performance. We further illustrate its usefulness by application to association test of diabetes-related glycemic traits in the Atherosclerosis Risk in Communities (ARIC) Study.
Collapse
Affiliation(s)
- Baolin Wu
- Division of Biostatistics, University of Minnesota
| | - James S. Pankow
- Division of Epidemiology and Community Health School of Public Health, University of Minnesota
| |
Collapse
|
20
|
Tyler AL, Crawford DC, Pendergrass SA. The detection and characterization of pleiotropy: discovery, progress, and promise. Brief Bioinform 2015. [PMID: 26223525 DOI: 10.1093/bib/bbv050] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
The impact of a single genetic locus on multiple phenotypes, or pleiotropy, is an important area of research. Biological systems are dynamic complex networks, and these networks exist within and between cells. In humans, the consideration of multiple phenotypes such as physiological traits, clinical outcomes and drug response, in the context of genetic variation, can provide ways of developing a more complete understanding of the complex relationships between genetic architecture and how biological systems function in health and disease. In this article, we describe recent studies exploring the relationships between genetic loci and more than one phenotype. We also cover methodological developments incorporating pleiotropy applied to model organisms as well as humans, and discuss how stepping beyond the analysis of a single phenotype leads to a deeper understanding of complex genetic architecture.
Collapse
|