1
|
Rathee S, Ojha A, Sagar P, Upadhyay A, Rather IA, Shukla S. Decoration of Fe 3O 4-vitamin C nanoparticles on alginate-chitosan nanocomplex: Characterization, safety, bioacessibility boost and Iron Nanofortification in A2 goat milk gels. Food Chem 2025; 470:142711. [PMID: 39756086 DOI: 10.1016/j.foodchem.2024.142711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Revised: 09/14/2024] [Accepted: 12/28/2024] [Indexed: 01/07/2025]
Abstract
In this study, an alginate-chitosan (AL-CS) nanocomplex decorated with vitamin C coated iron oxide nanoparticles (Fe3O4-vit C NPs) was investigated as a novel nanoiron fortification agent. The Fe3O4-vit C NPs decorated on AL-CS nanocomplex underwent comprehensive characterization, including zeta potential, fourier transform infrared spectroscopy, X-ray diffraction, and UV-vis spectroscopy. The transmission electron microscopy (TEM) analysis confirmed the decoration of Fe3O4-vit C NPs on AL-CS nanocomplex. The dynamic light scattering and thermogravimetric analysis showed enhanced thermal properties of decorated nanocomplex than the undecorated control. Biocompatibility testing on HepG2 cell lines revealed improved compatibility, while intestinal Caco2 cell lines showed approximately 51 % greater bioacessibility than controls. Further, 8 mg of Fe3O4-vit C NPs decorated AL-CS nanocomplex nanofortified 80 g of A2 goat milk gels (GMGs) which provided 0.072 mg/g of nanoiron without showing significant changes in texture and color compared to the control A2 GMGs. The PCA analysis helped to identify the impact of various factors for the preparation of decorated nanocomplex.
Collapse
Affiliation(s)
- Shweta Rathee
- Department of Food Science and Technology, National Institute of Food Science Technology Entrepreneurship and Management, Kundli (NIFTEM-K), Sonipat 131028, Haryana, India.
| | - Ankur Ojha
- Department of Food Science and Technology, National Institute of Food Science Technology Entrepreneurship and Management, Kundli (NIFTEM-K), Sonipat 131028, Haryana, India.
| | - Poonam Sagar
- National Agri-Food Biotechnology Institute (NABI), S.A.S. Nagar, Mohali, Punjab 140208, India
| | - Ashutosh Upadhyay
- Department of Food Science and Technology, National Institute of Food Science Technology Entrepreneurship and Management, Kundli (NIFTEM-K), Sonipat 131028, Haryana, India
| | - Irfan A Rather
- Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia.
| | - Shruti Shukla
- Department of Nanotechnology, North-Eastern Hill University (NEHU), Shillong 793022, Meghalaya, India.
| |
Collapse
|
2
|
Guo X, Feng Y, Ji X, Jia N, Maimaiti A, Lai J, Wang Z, Yang S, Hu S. Shared genetic architecture and bidirectional clinical risks within the psycho-metabolic nexus. EBioMedicine 2025; 111:105530. [PMID: 39731856 PMCID: PMC11743124 DOI: 10.1016/j.ebiom.2024.105530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2024] [Revised: 12/12/2024] [Accepted: 12/12/2024] [Indexed: 12/30/2024] Open
Abstract
BACKGROUND Increasing evidence suggests a complex interplay between psychiatric disorders and metabolic dysregulations. However, most research has been limited to specific disorder pairs, leaving a significant gap in our understanding of the broader psycho-metabolic nexus. METHODS This study leveraged large-scale cohort data and genome-wide association study (GWAS) summary statistics, covering 8 common psychiatric disorders and 43 metabolic traits. We introduced a comprehensive analytical strategy to identify shared genetic bases sequentially, from key genetic correlation regions to local pleiotropy and pleiotropic genes. Finally, we developed polygenic risk score (PRS) models to translate these findings into clinical applications. FINDINGS We identified significant bidirectional clinical risks between psychiatric disorders and metabolic dysregulations among 310,848 participants from the UK Biobank. Genetic correlation analysis confirmed 104 robust trait pairs, revealing 1088 key genomic regions, including critical hotspots such as chr3: 47588462-50387742. Cross-trait meta-analysis uncovered 388 pleiotropic single nucleotide variants (SNVs) and 126 shared causal variants. Among variants, 45 novel SNVs were associated with psychiatric disorders and 75 novel SNVs were associated with metabolic traits, shedding light on new targets to unravel the mechanism of comorbidity. Notably, RBM6, a gene involved in alternative splicing and cellular stress response regulation, emerged as a key pleiotropic gene. When psychiatric and metabolic genetic information were integrated, PRS models demonstrated enhanced predictive power. INTERPRETATION The study highlights the intertwined genetic and clinical relationships between psychiatric disorders and metabolic dysregulations, emphasising the need for integrated approaches in diagnosis and treatment. FUNDING The National Key Research and Development Program of China (2023YFC2506200, SHH). The National Natural Science Foundation of China (82273741, SY).
Collapse
Affiliation(s)
- Xiaonan Guo
- Department of Psychiatry, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Yu Feng
- Department of Psychiatry, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China; Department of Psychiatry, Melbourne Neuropsychiatry Centre, The University of Melbourne, Carlton South, VIC, Australia
| | - Xiaolong Ji
- Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China
| | - Ningning Jia
- Department of Epidemiology and Biostatistics, School of Public Health, Jilin University, Changchun, Jilin, China
| | - Aierpati Maimaiti
- Department of Neurosurgery, Xinjiang Medical University Affiliated First Hospital, Urumqi, Xinjiang, China
| | - Jianbo Lai
- Department of Psychiatry, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Zheng Wang
- Department of Psychiatry, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China.
| | - Sheng Yang
- Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China.
| | - Shaohua Hu
- Department of Psychiatry, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China; Nanhu Brain-Computer Interface Institute, Hangzhou, Zhejiang, China; Zhejiang Key Laboratory of Precision Psychiatry, Hangzhou, 310003, China; Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 311121, China; Brain Research Institute of Zhejiang University, Hangzhou, 310058, China; MOE Frontier Science Center for Brain Science and Brain-Machine Integration, Zhejiang University School of Medicine, Hangzhou, 310058, China; Department of Psychology and Behavioral Sciences, Graduate School, Zhejiang University, Hangzhou, 310058, China.
| |
Collapse
|
3
|
Kojima K, Tadaka S, Okamura Y, Kinoshita K. Two-stage strategy using denoising autoencoders for robust reference-free genotype imputation with missing input genotypes. J Hum Genet 2024; 69:511-518. [PMID: 38918526 PMCID: PMC11422160 DOI: 10.1038/s10038-024-01261-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 04/16/2024] [Accepted: 05/13/2024] [Indexed: 06/27/2024]
Abstract
Widely used genotype imputation methods are based on the Li and Stephens model, which assumes that new haplotypes can be represented by modifying existing haplotypes in a reference panel through mutations and recombinations. These methods use genotypes from SNP arrays as inputs to estimate haplotypes that align with the input genotypes by analyzing recombination patterns within a reference panel, and then infer unobserved variants. While these methods require reference panels in an identifiable form, their public use is limited due to privacy and consent concerns. One strategy to overcome these limitations is to use de-identified haplotype information, such as summary statistics or model parameters. Advances in deep learning (DL) offer the potential to develop imputation methods that use haplotype information in a reference-free manner by handling it as model parameters, while maintaining comparable imputation accuracy to methods based on the Li and Stephens model. Here, we provide a brief introduction to DL-based reference-free genotype imputation methods, including RNN-IMP, developed by our research group. We then evaluate the performance of RNN-IMP against widely-used Li and Stephens model-based imputation methods in terms of accuracy (R2), using the 1000 Genomes Project Phase 3 dataset and corresponding simulated Omni2.5 SNP genotype data. Although RNN-IMP is sensitive to missing values in input genotypes, we propose a two-stage imputation strategy: missing genotypes are first imputed using denoising autoencoders; RNN-IMP then processes these imputed genotypes. This approach restores the imputation accuracy that is degraded by missing values, enhancing the practical use of RNN-IMP.
Collapse
Affiliation(s)
- Kaname Kojima
- Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.
| | - Shu Tadaka
- Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan
| | - Yasunobu Okamura
- Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan
- Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-0873, Japan
| | - Kengo Kinoshita
- Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.
- Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-0873, Japan.
- Graduate School of Information Sciences, Tohoku University, 6-3-09 Aza-Aoba, Aramaki, Aoba-ku, Sendai, Miyagi, 980-8579, Japan.
- Institute of Development, Aging and Cancer, Tohoku University, 4-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8575, Japan.
| |
Collapse
|
4
|
Kontou PI, Bagos PG. The goldmine of GWAS summary statistics: a systematic review of methods and tools. BioData Min 2024; 17:31. [PMID: 39238044 PMCID: PMC11375927 DOI: 10.1186/s13040-024-00385-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 08/27/2024] [Indexed: 09/07/2024] Open
Abstract
Genome-wide association studies (GWAS) have revolutionized our understanding of the genetic architecture of complex traits and diseases. GWAS summary statistics have become essential tools for various genetic analyses, including meta-analysis, fine-mapping, and risk prediction. However, the increasing number of GWAS summary statistics and the diversity of software tools available for their analysis can make it challenging for researchers to select the most appropriate tools for their specific needs. This systematic review aims to provide a comprehensive overview of the currently available software tools and databases for GWAS summary statistics analysis. We conducted a comprehensive literature search to identify relevant software tools and databases. We categorized the tools and databases by their functionality, including data management, quality control, single-trait analysis, and multiple-trait analysis. We also compared the tools and databases based on their features, limitations, and user-friendliness. Our review identified a total of 305 functioning software tools and databases dedicated to GWAS summary statistics, each with unique strengths and limitations. We provide descriptions of the key features of each tool and database, including their input/output formats, data types, and computational requirements. We also discuss the overall usability and applicability of each tool for different research scenarios. This comprehensive review will serve as a valuable resource for researchers who are interested in using GWAS summary statistics to investigate the genetic basis of complex traits and diseases. By providing a detailed overview of the available tools and databases, we aim to facilitate informed tool selection and maximize the effectiveness of GWAS summary statistics analysis.
Collapse
Affiliation(s)
| | - Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, 35131, Lamia, Greece.
| |
Collapse
|
5
|
Cao R, Schladt DP, Dorr C, Matas AJ, Oetting WS, Jacobson PA, Israni A, Chen J, Guan W. Polygenic risk score for acute rejection based on donor-recipient non-HLA genotype mismatch. PLoS One 2024; 19:e0303446. [PMID: 38820342 PMCID: PMC11142483 DOI: 10.1371/journal.pone.0303446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Accepted: 04/24/2024] [Indexed: 06/02/2024] Open
Abstract
BACKGROUND Acute rejection (AR) after kidney transplantation is an important allograft complication. To reduce the risk of post-transplant AR, determination of kidney transplant donor-recipient mismatching focuses on blood type and human leukocyte antigens (HLA), while it remains unclear whether non-HLA genetic mismatching is related to post-transplant complications. METHODS We carried out a genome-wide scan (HLA and non-HLA regions) on AR with a large kidney transplant cohort of 784 living donor-recipient pairs of European ancestry. An AR polygenic risk score (PRS) was constructed with the non-HLA single nucleotide polymorphisms (SNPs) filtered by independence (r2 < 0.2) and P-value (< 1×10-3) criteria. The PRS was validated in an independent cohort of 352 living donor-recipient pairs. RESULTS By the genome-wide scan, we identified one significant SNP rs6749137 with HR = 2.49 and P-value = 2.15×10-8. 1,307 non-HLA PRS SNPs passed the clumping plus thresholding and the PRS exhibited significant association with the AR in the validation cohort (HR = 1.54, 95% CI = (1.07, 2.22), p = 0.019). Further pathway analysis attributed the PRS genes into 13 categories, and the over-representation test identified 42 significant biological processes, the most significant of which is the cell morphogenesis (GO:0000902), with 4.08 fold of the percentage from homo species reference and FDR-adjusted P-value = 8.6×10-4. CONCLUSIONS Our results show the importance of donor-recipient mismatching in non-HLA regions. Additional work will be needed to understand the role of SNPs included in the PRS and to further improve donor-recipient genetic matching algorithms. Trial registry: Deterioration of Kidney Allograft Function Genomics (NCT00270712) and Genomics of Kidney Transplantation (NCT01714440) are registered on ClinicalTrials.gov.
Collapse
Affiliation(s)
- Rui Cao
- Division of Biostatistics and Health Data Science, School of Public Health, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - David P. Schladt
- Hennepin Healthcare Research Institute, Minneapolis, Minnesota, United States of America
| | - Casey Dorr
- Hennepin Healthcare Research Institute, Minneapolis, Minnesota, United States of America
- Department of Medicine, University of Minnesota Medical School, Minneapolis, Minnesota, United States of America
| | - Arthur J. Matas
- Department of Surgery, University of Minnesota Medical School, Minneapolis, Minnesota, United States of America
| | - William S. Oetting
- Department of Experimental and Clinical Pharmacology, College of Pharmacy, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Pamala A. Jacobson
- Department of Experimental and Clinical Pharmacology, College of Pharmacy, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Ajay Israni
- Hennepin Healthcare Research Institute, Minneapolis, Minnesota, United States of America
- Department of Medicine, University of Minnesota Medical School, Minneapolis, Minnesota, United States of America
| | - Jinbo Chen
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Weihua Guan
- Division of Biostatistics and Health Data Science, School of Public Health, University of Minnesota, Minneapolis, Minnesota, United States of America
| |
Collapse
|
6
|
Zhang QX, Liu T, Guo X, Zhen J, Yang MY, Khederzadeh S, Zhou F, Han X, Zheng Q, Jia P, Ding X, He M, Zou X, Liao JK, Zhang H, He J, Zhu X, Lu D, Chen H, Zeng C, Liu F, Zheng HF, Liu S, Xu HM, Chen GB. Searching across-cohort relatives in 54,092 GWAS samples via encrypted genotype regression. PLoS Genet 2024; 20:e1011037. [PMID: 38206971 PMCID: PMC10783776 DOI: 10.1371/journal.pgen.1011037] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 12/13/2023] [Indexed: 01/13/2024] Open
Abstract
Explicitly sharing individual level data in genomics studies has many merits comparing to sharing summary statistics, including more strict QCs, common statistical analyses, relative identification and improved statistical power in GWAS, but it is hampered by privacy or ethical constraints. In this study, we developed encG-reg, a regression approach that can detect relatives of various degrees based on encrypted genomic data, which is immune of ethical constraints. The encryption properties of encG-reg are based on the random matrix theory by masking the original genotypic matrix without sacrificing precision of individual-level genotype data. We established a connection between the dimension of a random matrix, which masked genotype matrices, and the required precision of a study for encrypted genotype data. encG-reg has false positive and false negative rates equivalent to sharing original individual level data, and is computationally efficient when searching relatives. We split the UK Biobank into their respective centers, and then encrypted the genotype data. We observed that the relatives estimated using encG-reg was equivalently accurate with the estimation by KING, which is a widely used software but requires original genotype data. In a more complex application, we launched a finely devised multi-center collaboration across 5 research institutes in China, covering 9 cohorts of 54,092 GWAS samples. encG-reg again identified true relatives existing across the cohorts with even different ethnic backgrounds and genotypic qualities. Our study clearly demonstrates that encrypted genomic data can be used for data sharing without loss of information or data sharing barrier.
Collapse
Affiliation(s)
- Qi-Xin Zhang
- Institute of Bioinformatics, Zhejiang University, Hangzhou, Zhejiang, China
- Center for Reproductive Medicine, Department of Genetic and Genomic Medicine, and Clinical Research Institute, Zhejiang Provincial People’s Hospital, People’s Hospital of Hangzhou Medical College, Hangzhou, Zhejiang, China
| | - Tianzi Liu
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, Shanghai, China
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Xinxin Guo
- School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, China
| | - Jianxin Zhen
- Central Laboratory, Shenzhen Baoan Women’s and Children’s Hospital, Shenzhen, Guangdong, China
| | - Meng-yuan Yang
- Diseases & Population (DaP) Geninfo Lab, School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China
| | - Saber Khederzadeh
- Diseases & Population (DaP) Geninfo Lab, School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China
| | - Fang Zhou
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Xiaotong Han
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou, Guangdong, China
| | - Qiwen Zheng
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Peilin Jia
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Xiaohu Ding
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou, Guangdong, China
| | - Mingguang He
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou, Guangdong, China
- Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, Melbourne, Victoria, Australia
- Ophthalmology, Department of Surgery, University of Melbourne, Melbourne, Victoria, Australia
| | - Xin Zou
- State Key Laboratory of CAD & GC, Zhejiang University, Hangzhou, Zhejiang, China
| | - Jia-Kai Liao
- School of Mathematics and Statistics and Research Institute of Mathematical Sciences (RIMS), Jiangsu Provincial Key Laboratory of Educational Big Data Science and Engineering, Jiangsu Normal University, Xuzhou, Jiangsu, China
- Ningbo Institute of Life and Health Industry, University of Chinese Academy of Sciences, Ningbo, Zhejiang, China
| | - Hongxin Zhang
- State Key Laboratory of CAD & GC, Zhejiang University, Hangzhou, Zhejiang, China
| | - Ji He
- Department of Neurology, Peking University Third Hospital, Beijing, China
| | - Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Daru Lu
- State Key Laboratory of Genetic Engineering and MOE Engineering Research Center of Gene Technology, School of Life Sciences and Zhongshan Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Birth Defects and Reproductive Health, Chongqing Population and Family Planning Science and Technology Research Institute, Chongqing, China
| | - Hongyan Chen
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Changqing Zeng
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
- Henan Academy of Sciences, Zhengzhou, Henan, China
| | - Fan Liu
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
- Department of Forensic Sciences, College of Criminal Justice, Naif Arab University of Security Sciences, Riyadh, Kingdom of Saudi Arabia
| | - Hou-Feng Zheng
- Diseases & Population (DaP) Geninfo Lab, School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China
| | - Siyang Liu
- School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, China
| | - Hai-Ming Xu
- Institute of Bioinformatics, Zhejiang University, Hangzhou, Zhejiang, China
| | - Guo-Bo Chen
- Center for Reproductive Medicine, Department of Genetic and Genomic Medicine, and Clinical Research Institute, Zhejiang Provincial People’s Hospital, People’s Hospital of Hangzhou Medical College, Hangzhou, Zhejiang, China
- Key Laboratory of Endocrine Gland Diseases of Zhejiang Province, Hangzhou, Zhejiang, China
| |
Collapse
|
7
|
Khunsriraksakul C, Li Q, Markus H, Patrick MT, Sauteraud R, McGuire D, Wang X, Wang C, Wang L, Chen S, Shenoy G, Li B, Zhong X, Olsen NJ, Carrel L, Tsoi LC, Jiang B, Liu DJ. Multi-ancestry and multi-trait genome-wide association meta-analyses inform clinical risk prediction for systemic lupus erythematosus. Nat Commun 2023; 14:668. [PMID: 36750564 PMCID: PMC9905560 DOI: 10.1038/s41467-023-36306-5] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Accepted: 01/25/2023] [Indexed: 02/09/2023] Open
Abstract
Systemic lupus erythematosus is a heritable autoimmune disease that predominantly affects young women. To improve our understanding of genetic etiology, we conduct multi-ancestry and multi-trait meta-analysis of genome-wide association studies, encompassing 12 systemic lupus erythematosus cohorts from 3 different ancestries and 10 genetically correlated autoimmune diseases, and identify 16 novel loci. We also perform transcriptome-wide association studies, computational drug repurposing analysis, and cell type enrichment analysis. We discover putative drug classes, including a histone deacetylase inhibitor that could be repurposed to treat lupus. We also identify multiple cell types enriched with putative target genes, such as non-classical monocytes and B cells, which may be targeted for future therapeutics. Using this newly assembled result, we further construct polygenic risk score models and demonstrate that integrating polygenic risk score with clinical lab biomarkers improves the diagnostic accuracy of systemic lupus erythematosus using the Vanderbilt BioVU and Michigan Genomics Initiative biobanks.
Collapse
Affiliation(s)
- Chachrit Khunsriraksakul
- Program in Bioinformatics and Genomics, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
- Institute for Personalized Medicine, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
| | - Qinmengge Li
- Department of Dermatology, University of Michigan Medical School, Ann Arbor, MI, 48109, USA
| | - Havell Markus
- Program in Bioinformatics and Genomics, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
- Institute for Personalized Medicine, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
| | - Matthew T Patrick
- Department of Dermatology, University of Michigan Medical School, Ann Arbor, MI, 48109, USA
| | - Renan Sauteraud
- Department of Public Health Sciences, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
| | - Daniel McGuire
- Department of Public Health Sciences, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
| | - Xingyan Wang
- Department of Public Health Sciences, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
| | - Chen Wang
- Program in Bioinformatics and Genomics, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
| | - Lida Wang
- Department of Public Health Sciences, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
| | - Siyuan Chen
- Department of Public Health Sciences, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
| | - Ganesh Shenoy
- Department of Neurosurgery, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
| | - Bingshan Li
- Department of Molecular Physiology & Biophysics, Vanderbilt University, Nashville, TN, 37235, USA
| | - Xue Zhong
- Department of Medicine, Division of Genetic Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Nancy J Olsen
- Department of Medicine, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
| | - Laura Carrel
- Department of Biochemistry and Molecular Biology, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
| | - Lam C Tsoi
- Department of Dermatology, University of Michigan Medical School, Ann Arbor, MI, 48109, USA
| | - Bibo Jiang
- Department of Public Health Sciences, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
| | - Dajiang J Liu
- Program in Bioinformatics and Genomics, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA.
- Institute for Personalized Medicine, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA.
- Department of Public Health Sciences, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA.
| |
Collapse
|
8
|
Elhaik E. Principal Component Analyses (PCA)-based findings in population genetic studies are highly biased and must be reevaluated. Sci Rep 2022; 12:14683. [PMID: 36038559 PMCID: PMC9424212 DOI: 10.1038/s41598-022-14395-4] [Citation(s) in RCA: 76] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 06/06/2022] [Indexed: 12/29/2022] Open
Abstract
Principal Component Analysis (PCA) is a multivariate analysis that reduces the complexity of datasets while preserving data covariance. The outcome can be visualized on colorful scatterplots, ideally with only a minimal loss of information. PCA applications, implemented in well-cited packages like EIGENSOFT and PLINK, are extensively used as the foremost analyses in population genetics and related fields (e.g., animal and plant or medical genetics). PCA outcomes are used to shape study design, identify, and characterize individuals and populations, and draw historical and ethnobiological conclusions on origins, evolution, dispersion, and relatedness. The replicability crisis in science has prompted us to evaluate whether PCA results are reliable, robust, and replicable. We analyzed twelve common test cases using an intuitive color-based model alongside human population data. We demonstrate that PCA results can be artifacts of the data and can be easily manipulated to generate desired outcomes. PCA adjustment also yielded unfavorable outcomes in association studies. PCA results may not be reliable, robust, or replicable as the field assumes. Our findings raise concerns about the validity of results reported in the population genetics literature and related fields that place a disproportionate reliance upon PCA outcomes and the insights derived from them. We conclude that PCA may have a biasing role in genetic investigations and that 32,000-216,000 genetic studies should be reevaluated. An alternative mixed-admixture population genetic model is discussed.
Collapse
Affiliation(s)
- Eran Elhaik
- Department of Biology, Lund University, 22362, Lund, Sweden.
| |
Collapse
|
9
|
Zhang F, Cao H, Baranova A. Shared Genetic Liability and Causal Associations Between Major Depressive Disorder and Cardiovascular Diseases. Front Cardiovasc Med 2021; 8:735136. [PMID: 34859065 PMCID: PMC8631916 DOI: 10.3389/fcvm.2021.735136] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Accepted: 10/08/2021] [Indexed: 12/20/2022] Open
Abstract
Major depressive disorder (MDD) is phenotypically associated with cardiovascular diseases (CVD). We aim to investigate mechanisms underlying relationships between MDD and CVD in the context of shared genetic variations. Polygenic overlap analysis was used to test genetic correlation and to analyze shared genetic variations between MDD and seven cardiovascular outcomes (coronary artery disease (CAD), heart failure, atrial fibrillation, stroke, systolic blood pressure, diastolic blood pressure, and pulse pressure measurement). Mendelian randomization analysis was used to uncover causal relationships between MDD and cardiovascular traits. By cross-trait meta-analysis, we identified a set of genomic loci shared between the traits of MDD and stroke. Putative causal genes for MDD and stroke were prioritized by fine-mapping of transcriptome-wide associations. Polygenic overlap analysis pointed toward substantial genetic variation overlap between MDD and CVD. Mendelian randomization analysis indicated that genetic liability to MDD has a causal effect on CAD and stroke. Comparison of genome-wide genes shared by MDD and CVD suggests 20q12 as a pleiotropic region conferring risk for both MDD and CVD. Cross-trait meta-analyses and fine-mapping of transcriptome-wide association signals identified novel risk genes for MDD and stroke, including RPL31P12, BORSC7, PNPT11, and PGF. Many genetic variations associated with MDD and CVD outcomes are shared, thus, pointing that genetic liability to MDD may also confer risk for stroke and CAD. Presented results shed light on mechanistic connections between MDD and CVD phenotypes.
Collapse
Affiliation(s)
- Fuquan Zhang
- Wuxi Mental Health Center of Nanjing Medical University, Wuxi, China.,Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China.,Institute of Neuropsychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
| | - Hongbao Cao
- School of Systems Biology, George Mason University, Fairfax, VA, United States
| | - Ancha Baranova
- School of Systems Biology, George Mason University, Fairfax, VA, United States.,Research Centre for Medical Genetics, Moscow, Russia
| |
Collapse
|
10
|
Zhang F, Rao S, Cao H, Zhang X, Wang Q, Xu Y, Sun J, Wang C, Chen J, Xu X, Zhang N, Tian L, Yuan J, Wang G, Cai L, Xu M, Baranova A. Genetic evidence suggests posttraumatic stress disorder as a subtype of major depressive disorder. J Clin Invest 2021; 132:145942. [PMID: 33905376 PMCID: PMC8803333 DOI: 10.1172/jci145942] [Citation(s) in RCA: 61] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Accepted: 04/22/2021] [Indexed: 02/05/2023] Open
Abstract
BACKGROUND Major depressive disorder (MDD) and posttraumatic stress disorder (PTSD) are highly comorbid and exhibit strong correlations with one another. We aimed to investigate mechanisms of underlying relationships between PTSD and three kinds of depressive phenotypes, namely, MDD, depressed affect (DAF), and depression (DEP, including both MDD and the broad definition of depression). METHODS Genetic correlations between PTSD and the depressive phenotypes were tested using linkage disequilibrium score regression. Polygenic overlap analysis was used to estimate shared and trait-specific causal variants across a pair of traits. Causal relationships between PTSD and the depressive phenotypes were investigated using Mendelian randomization. Shared genomic loci between PTSD and MDD were identified using cross-trait meta-analysis. RESULTS Genetic correlations of PTSD with the depressive phenotypes were in the range of 0.71~0.80. The estimated numbers of causal variants were 14,565, 12,965, 10,565, and 4,986 for MDD, DEP, DAF, and PTSD, respectively. In each case, causal variants contributing to PTSD were completely or largely covered by causal variants defining each of the depressive phenotypes. Mendelian randomization analysis indicates that the genetically determined depressive phenotypes confer a causal effect on PTSD (b = 0.21~0.31). Notably, genetically determined PTSD confers a causal effect on DEP (b = 0.14) and DAF (b = 0.15), but not MDD. Cross-trait meta-analysis of MDD and PTSD identifies 47 genomic loci, including 29 loci shared between PTSD and MDD. CONCLUSION Evidence from shared genetics suggests that PTSD is a subtype of MDD. This study provides support to the efforts in reducing diagnostic heterogeneity in psychiatric nosology. FUNDING The National Key Research and Development Program of China (2018YFC1314300) and the National Natural Science Foundation of China (81471364 and 81971255).
Collapse
Affiliation(s)
- Fuquan Zhang
- Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
| | - Shuquan Rao
- State Key Laboratory of Experimental Hematology, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin, China
| | - Hongbao Cao
- School of Systems Biology, George Mason University, Fairfax, United States of America
| | - Xiangrong Zhang
- Department of Geriatric Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
| | - Qiang Wang
- Mental Health Center and Psychiatric Laboratory, West China Hospital of Sichuan University, Chengdu, China
| | - Yong Xu
- Department of Psychiatry, First Hospital/First Clinical Medical College of Shanxi Medical University, Taiyuan, China
| | - Jing Sun
- Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
| | - Chun Wang
- Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
| | - Jiu Chen
- Institute of Neuropsychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
| | - Xijia Xu
- Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
| | - Ning Zhang
- Institute of Neuropsychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
| | - Lin Tian
- Wuxi Mental Health Center of Nanjing Medical University, Wuxi, China
| | - Jianmin Yuan
- Wuxi Mental Health Center of Nanjing Medical University, Wuxi, China
| | - Guoqiang Wang
- Department of Psychiatry, Wuxi Mental Health Center of Nanjing Medical University, Wuxi, China
| | - Lei Cai
- Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disor, Shanghai Jiao Tong University, Shanghai, China
| | - Mingqing Xu
- Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disor, Shanghai Jiao Tong University, Shanghai, China
| | - Ancha Baranova
- School of Systems Biology, George Mason University, Fairfax, United States of America
| |
Collapse
|
11
|
Mušo M, Dumbell R, Pulit S, Sinnott-Armstrong N, Laber S, Zolkiewski L, Bentley L, Claussnitzer M, Cox RD. A lead candidate functional single nucleotide polymorphism within the WARS2 gene associated with waist-hip-ratio does not alter RNA stability. BIOCHIMICA ET BIOPHYSICA ACTA. GENE REGULATORY MECHANISMS 2020; 1863:194640. [PMID: 33007465 PMCID: PMC7695619 DOI: 10.1016/j.bbagrm.2020.194640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2020] [Revised: 09/22/2020] [Accepted: 09/22/2020] [Indexed: 11/06/2022]
Abstract
We have prioritised a single nucleotide polymorphism (SNP) rs2645294 as one candidate functional SNP in the TBX15-WARS2 waist-hip-ratio locus using posterior probability analysis. This SNP is located in the 3' untranslated region of the WARS2 (tryptophanyl tRNA synthetase 2, mitochondrial) gene with which it has an expression quantitative trait in subcutaneous white adipose tissue. We show that transcripts of the WARS2 gene in a human white adipose cell line, heterozygous for the rs2645294 SNP, showed allelic imbalance. We tested whether the rs2645294 SNP altered WARS2 RNA stability using three different methods: actinomycin-D inhibition and RNA decay, mature and nascent RNA analysis and luciferase reporter assays. We found no evidence of a difference in RNA stability between the rs2645294 alleles indicating that the allelic expression imbalance was likely due to transcriptional regulation.
Collapse
Affiliation(s)
- Milan Mušo
- MRC Harwell Institute, Mammalian Genetics Unit, Harwell Campus, Oxfordshire OX11 0RD, UK
| | - Rebecca Dumbell
- MRC Harwell Institute, Mammalian Genetics Unit, Harwell Campus, Oxfordshire OX11 0RD, UK
| | - Sara Pulit
- Department of Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht, the Netherlands; Big Data Institute, Li Ka Shing Center for Health Information and Discovery, Oxford University, Oxford, UK; Program in Medical Population Genetics, Broad Institute, Cambridge, MA, USA
| | | | - Samantha Laber
- MRC Harwell Institute, Mammalian Genetics Unit, Harwell Campus, Oxfordshire OX11 0RD, UK
| | - Louisa Zolkiewski
- MRC Harwell Institute, Mammalian Genetics Unit, Harwell Campus, Oxfordshire OX11 0RD, UK
| | - Liz Bentley
- MRC Harwell Institute, Mammalian Genetics Unit, Harwell Campus, Oxfordshire OX11 0RD, UK
| | - Melina Claussnitzer
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA; Gerontology Division, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA; Institute of Nutritional Science, University of Hohenheim, Stuttgart, Germany
| | - Roger D Cox
- MRC Harwell Institute, Mammalian Genetics Unit, Harwell Campus, Oxfordshire OX11 0RD, UK.
| |
Collapse
|
12
|
Kojima K, Tadaka S, Katsuoka F, Tamiya G, Yamamoto M, Kinoshita K. A genotype imputation method for de-identified haplotype reference information by using recurrent neural network. PLoS Comput Biol 2020; 16:e1008207. [PMID: 33001993 PMCID: PMC7529210 DOI: 10.1371/journal.pcbi.1008207] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2020] [Accepted: 07/30/2020] [Indexed: 02/02/2023] Open
Abstract
Genotype imputation estimates the genotypes of unobserved variants using the genotype data of other observed variants based on a collection of haplotypes for thousands of individuals, which is known as a haplotype reference panel. In general, more accurate imputation results were obtained using a larger size of haplotype reference panel. Most of the existing genotype imputation methods explicitly require the haplotype reference panel in precise form, but the accessibility of haplotype data is often limited, due to the requirement of agreements from the donors. Since de-identified information such as summary statistics or model parameters can be used publicly, imputation methods using de-identified haplotype reference information might be useful to enhance the quality of imputation results under the condition where the access of the haplotype data is limited. In this study, we proposed a novel imputation method that handles the reference panel as its model parameters by using bidirectional recurrent neural network (RNN). The model parameters are presented in the form of de-identified information from which the restoration of the genotype data at the individual-level is almost impossible. We demonstrated that the proposed method provides comparable imputation accuracy when compared with the existing imputation methods using haplotype datasets from the 1000 Genomes Project (1KGP) and the Haplotype Reference Consortium. We also considered a scenario where a subset of haplotypes is made available only in de-identified form for the haplotype reference panel. In the evaluation using the 1KGP dataset under the scenario, the imputation accuracy of the proposed method is much higher than that of the existing imputation methods. We therefore conclude that our RNN-based method is quite promising to further promote the data-sharing of sensitive genome data under the recent movement for the protection of individuals’ privacy. Genotype imputation estimates the genotypes of unobserved variants using the genotype data of other observed variants based on a collection of genome data of a large number of individuals called a reference panel. In general, more accurate imputation results are obtained using a larger size of the reference panel. Although most of the existing imputation methods use the reference panel in an explicit form, the accessibility of genome data is often limited due to the requirement of agreements from the donors. We thus proposed a new imputation method that handles the reference panel as its model parameters by using bidirectional recurrent neural network. Since it is almost impossible to restore genome data at the individual-level from the model parameters, they can be shared publicly as the de-identified information even when the accessibility of the original reference panel is limited. We demonstrate that the proposed method provides comparable imputation accuracy with the existing methods. We also considered a scenario where a part of the genome data is made available only in de-identified form for the reference panel and have shown that the imputation accuracy of the proposed method is much higher than that of the existing methods under the scenario.
Collapse
Affiliation(s)
- Kaname Kojima
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Miyagi, Japan
- RIKEN Center for Advanced Intelligence Project, Chuo-ku, Tokyo, Japan
| | - Shu Tadaka
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Miyagi, Japan
| | - Fumiki Katsuoka
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Miyagi, Japan
| | - Gen Tamiya
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Miyagi, Japan
- RIKEN Center for Advanced Intelligence Project, Chuo-ku, Tokyo, Japan
| | - Masayuki Yamamoto
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Miyagi, Japan
- School of Medicine, Tohoku University, Sendai, Miyagi, Japan
- Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, Sendai, Miyagi, Japan
| | - Kengo Kinoshita
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Miyagi, Japan
- Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, Sendai, Miyagi, Japan
- Graduate School of Information Sciences, Tohoku University, Sendai, Miyagi, Japan
- Institute of Development, Aging and Cancer, Tohoku University, Sendai, Miyagi, Japan
- * E-mail:
| |
Collapse
|
13
|
Linder RA, Majumder A, Chakraborty M, Long A. Two Synthetic 18-Way Outcrossed Populations of Diploid Budding Yeast with Utility for Complex Trait Dissection. Genetics 2020; 215:323-342. [PMID: 32241804 PMCID: PMC7268983 DOI: 10.1534/genetics.120.303202] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2020] [Accepted: 03/31/2020] [Indexed: 02/07/2023] Open
Abstract
Advanced-generation multiparent populations (MPPs) are a valuable tool for dissecting complex traits, having more power than genome-wide association studies to detect rare variants and higher resolution than F2 linkage mapping. To extend the advantages of MPPs in budding yeast, we describe the creation and characterization of two outbred MPPs derived from 18 genetically diverse founding strains. We carried out de novo assemblies of the genomes of the 18 founder strains, such that virtually all variation segregating between these strains is known, and represented those assemblies as Santa Cruz Genome Browser tracks. We discovered complex patterns of structural variation segregating among the founders, including a large deletion within the vacuolar ATPase VMA1, several different deletions within the osmosensor MSB2, a series of deletions and insertions at PRM7 and the adjacent BSC1, as well as copy number variation at the dehydrogenase ALD2 Resequenced haploid recombinant clones from the two MPPs have a median unrecombined block size of 66 kb, demonstrating that the population is highly recombined. We pool-sequenced the two MPPs to 3270× and 2226× coverage and demonstrated that we can accurately estimate local haplotype frequencies using pooled data. We further downsampled the pool-sequenced data to ∼20-40× and showed that local haplotype frequency estimates remained accurate, with median error rates 0.8 and 0.6% at 20× and 40×, respectively. Haplotypes frequencies are estimated much more accurately than SNP frequencies obtained directly from the same data. Deep sequencing of the two populations revealed that 10 or more founders are present at a detectable frequency for > 98% of the genome, validating the utility of this resource for the exploration of the role of standing variation in the architecture of complex traits.
Collapse
Affiliation(s)
- Robert A Linder
- Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine, California 92697-2525
| | - Arundhati Majumder
- Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine, California 92697-2525
| | - Mahul Chakraborty
- Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine, California 92697-2525
| | - Anthony Long
- Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine, California 92697-2525
| |
Collapse
|
14
|
Gamazon ER, Zwinderman AH, Cox NJ, Denys D, Derks EM. Multi-tissue transcriptome analyses identify genetic mechanisms underlying neuropsychiatric traits. Nat Genet 2019; 51:933-940. [PMID: 31086352 PMCID: PMC6590703 DOI: 10.1038/s41588-019-0409-8] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Accepted: 04/03/2019] [Indexed: 01/02/2023]
Abstract
The genetic architecture of psychiatric disorders is characterized by a large number of small-effect variants1 located primarily in non-coding regions, suggesting that the underlying causal effects may influence disease risk by modulating gene expression2-4. We provide comprehensive analyses using transcriptome data from an unprecedented collection of tissues to gain pathophysiological insights into the role of the brain, neuroendocrine factors (adrenal gland) and gastrointestinal systems (colon) in psychiatric disorders. In each tissue, we perform PrediXcan analysis and identify trait-associated genes for schizophrenia (n associations = 499; n unique genes = 275), bipolar disorder (n associations = 17; n unique genes = 13), attention deficit hyperactivity disorder (n associations = 19; n unique genes = 12) and broad depression (n associations = 41; n unique genes = 31). Importantly, both PrediXcan and summary-data-based Mendelian randomization/heterogeneity in dependent instruments analyses suggest potentially causal genes in non-brain tissues, showing the utility of these tissues for mapping psychiatric disease genetic predisposition. Our analyses further highlight the importance of joint tissue approaches as 76% of the genes were detected only in difficult-to-acquire tissues.
Collapse
Affiliation(s)
- Eric R Gamazon
- Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands.
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University, Nashville, TN, USA.
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA.
- Data Science Institute, Vanderbilt University, Nashville, TN, USA.
- Clare Hall, University of Cambridge, Cambridge, UK.
| | - Aeilko H Zwinderman
- Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands
| | - Nancy J Cox
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University, Nashville, TN, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Damiaan Denys
- Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands
| | - Eske M Derks
- Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands.
- QIMR Berghofer, Translational Neurogenomics Group, Brisbane, Queensland, Australia.
| |
Collapse
|
15
|
Saad M, Wijsman EM. Association score testing for rare variants and binary traits in family data with shared controls. Brief Bioinform 2019; 20:245-253. [PMID: 28968627 DOI: 10.1093/bib/bbx107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2017] [Indexed: 11/12/2022] Open
Abstract
Genome-wide association studies have been an important approach used to localize trait loci, with primary focus on common variants. The multiple rare variant-common disease hypothesis may explain the missing heritability remaining after accounting for identified common variants. Advances of sequencing technologies with their decreasing costs, coupled with methodological advances in the context of association studies in large samples, now make the study of rare variants at a genome-wide scale feasible. The resurgence of family-based association designs because of their advantage in studying rare variants has also stimulated more methods development, mainly based on linear mixed models (LMMs). Other tests such as score tests can have advantages over the LMMs, but to date have mainly been proposed for single-marker association tests. In this article, we extend several score tests (χcorrected2, WQLS, and SKAT) to the multiple variant association framework. We evaluate and compare their statistical performances relative with the LMM. Moreover, we show that three tests can be cast as the difference between marker allele frequencies (AFs) estimated in each of the group of affected and unaffected subjects. We show that these tests are flexible, as they can be based on related, unrelated or both related and unrelated subjects. They also make feasible an increasingly common design that only sequences a subset of affected subjects (related or unrelated) and uses for comparison publicly available AFs estimated in a group of healthy subjects. Finally, we show the great impact of linkage disequilibrium on the performance of all these tests.
Collapse
Affiliation(s)
- Mohamad Saad
- Department of Biostatistics, University of Washington, Seattle, USA.,Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, USA.,Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Ellen M Wijsman
- Department of Biostatistics, University of Washington, Seattle, USA.,Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, USA
| |
Collapse
|
16
|
Ni G, Gratten J, Wray NR, Lee SH. Age at first birth in women is genetically associated with increased risk of schizophrenia. Sci Rep 2018; 8:10168. [PMID: 29977057 PMCID: PMC6033923 DOI: 10.1038/s41598-018-28160-z] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2017] [Accepted: 06/14/2018] [Indexed: 11/10/2022] Open
Abstract
Previous studies have shown an increased risk for mental health problems in children born to both younger and older parents compared to children of average-aged parents. We previously used a novel design to reveal a latent mechanism of genetic association between schizophrenia and age at first birth in women (AFB). Here, we use independent data from the UK Biobank (N = 38,892) to replicate the finding of an association between predicted genetic risk of schizophrenia and AFB in women, and to estimate the genetic correlation between schizophrenia and AFB in women stratified into younger and older groups. We find evidence for an association between predicted genetic risk of schizophrenia and AFB in women (P-value = 1.12E-05), and we show genetic heterogeneity between younger and older AFB groups (P-value = 3.45E-03). The genetic correlation between schizophrenia and AFB in the younger AFB group is -0.16 (SE = 0.04) while that between schizophrenia and AFB in the older AFB group is 0.14 (SE = 0.08). Our results suggest that early, and perhaps also late, age at first birth in women is associated with increased genetic risk for schizophrenia in the UK Biobank sample. These findings contribute new insights into factors contributing to the complex bio-social risk architecture underpinning the association between parental age and offspring mental health.
Collapse
Affiliation(s)
- Guiyan Ni
- Australian Center for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, 5000, Australia
- School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| | - Jacob Gratten
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, 4072, Australia
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, 4072, Australia
| | - Naomi R Wray
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, 4072, Australia
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, 4072, Australia
| | - Sang Hong Lee
- Australian Center for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, 5000, Australia.
- School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia.
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, 4072, Australia.
| |
Collapse
|
17
|
LeBlanc M, Zuber V, Thompson WK, Andreassen OA, Schizophrenia and Bipolar Disorder Working Groups of the Psychiatric Genomics Consortium, Frigessi A, Andreassen BK. A correction for sample overlap in genome-wide association studies in a polygenic pleiotropy-informed framework. BMC Genomics 2018; 19:494. [PMID: 29940862 PMCID: PMC6019513 DOI: 10.1186/s12864-018-4859-7] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2017] [Accepted: 06/06/2018] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND There is considerable evidence that many complex traits have a partially shared genetic basis, termed pleiotropy. It is therefore useful to consider integrating genome-wide association study (GWAS) data across several traits, usually at the summary statistic level. A major practical challenge arises when these GWAS have overlapping subjects. This is particularly an issue when estimating pleiotropy using methods that condition the significance of one trait on the signficance of a second, such as the covariate-modulated false discovery rate (cmfdr). RESULTS We propose a method for correcting for sample overlap at the summary statistic level. We quantify the expected amount of spurious correlation between the summary statistics from two GWAS due to sample overlap, and use this estimated correlation in a simple linear correction that adjusts the joint distribution of test statistics from the two GWAS. The correction is appropriate for GWAS with case-control or quantitative outcomes. Our simulations and data example show that without correcting for sample overlap, the cmfdr is not properly controlled, leading to an excessive number of false discoveries and an excessive false discovery proportion. Our correction for sample overlap is effective in that it restores proper control of the false discovery rate, at very little loss in power. CONCLUSIONS With our proposed correction, it is possible to integrate GWAS summary statistics with overlapping samples in a statistical framework that is dependent on the joint distribution of the two GWAS.
Collapse
Affiliation(s)
- Marissa LeBlanc
- Oslo Centre for Biostatistics and Epidemiology, Oslo University Hospital, Oslo universitetssykehus HF, Sogn Arena, PB 4950 Nydalen, Oslo, 0424 Norway
| | - Verena Zuber
- MRC Biostatistics Unit, University of Cambridge, MRC Biostatistics Unit, Cambridge Institute of Public Health, Robinson Way, Cambridge, CB2 0SR United Kingdom
| | - Wesley K. Thompson
- Department of Psychiatry, University of California, San Diego, 9500 Gilman Drive, MC 0603, La Jolla, CA, 92093-0603 USA
| | - Ole A. Andreassen
- NORMENT-KG Jebsen Centre for Psychosis Research, Institute of Clinical Medicine, University of Oslo, P.O. Box 1039 Blindern, Oslo, N-0315 Norway
- Division of Mental Health and Addiction, Oslo University Hospital HF, Ullevaal Hospital, building 49,P.O. Box 4956 Nydalen, Oslo, N-0424 Norway
| | - Schizophrenia and Bipolar Disorder Working Groups of the Psychiatric Genomics Consortium
- Oslo Centre for Biostatistics and Epidemiology, Oslo University Hospital, Oslo universitetssykehus HF, Sogn Arena, PB 4950 Nydalen, Oslo, 0424 Norway
- MRC Biostatistics Unit, University of Cambridge, MRC Biostatistics Unit, Cambridge Institute of Public Health, Robinson Way, Cambridge, CB2 0SR United Kingdom
- Department of Psychiatry, University of California, San Diego, 9500 Gilman Drive, MC 0603, La Jolla, CA, 92093-0603 USA
- NORMENT-KG Jebsen Centre for Psychosis Research, Institute of Clinical Medicine, University of Oslo, P.O. Box 1039 Blindern, Oslo, N-0315 Norway
- Division of Mental Health and Addiction, Oslo University Hospital HF, Ullevaal Hospital, building 49,P.O. Box 4956 Nydalen, Oslo, N-0424 Norway
- Oslo Centre for Biostatistics and Epidemiology, University of Oslo and Oslo University Hospital, Oslo universitetssykehus HF, Sogn Arena, PB 4950 Nydalen, Oslo, 0424 Norway
- Department of Research, Cancer Registry of Norway, P.O. box 5313 Majorstuen, Oslo, N-0304 Norway
| | - Arnoldo Frigessi
- Oslo Centre for Biostatistics and Epidemiology, University of Oslo and Oslo University Hospital, Oslo universitetssykehus HF, Sogn Arena, PB 4950 Nydalen, Oslo, 0424 Norway
| | - Bettina Kulle Andreassen
- Department of Research, Cancer Registry of Norway, P.O. box 5313 Majorstuen, Oslo, N-0304 Norway
| |
Collapse
|
18
|
Andersen AM, Pietrzak RH, Kranzler HR, Ma L, Zhou H, Liu X, Kramer J, Kuperman S, Edenberg HJ, Nurnberger JI, Rice JP, Tischfield JA, Goate A, Foroud TM, Meyers JL, Porjesz B, Dick DM, Hesselbrock V, Boerwinkle E, Southwick SM, Krystal JH, Weissman MM, Levinson DF, Potash JB, Gelernter J, Han S. Polygenic Scores for Major Depressive Disorder and Risk of Alcohol Dependence. JAMA Psychiatry 2017; 74:1153-1160. [PMID: 28813562 PMCID: PMC5710224 DOI: 10.1001/jamapsychiatry.2017.2269] [Citation(s) in RCA: 65] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/15/2017] [Accepted: 06/13/2017] [Indexed: 01/06/2023]
Abstract
Importance Major depressive disorder (MDD) and alcohol dependence (AD) are heritable disorders with significant public health burdens, and they are frequently comorbid. Common genetic factors that influence the co-occurrence of MDD and AD have been sought in family, twin, and adoption studies, and results to date have been promising but inconclusive. Objective To examine whether AD and MDD overlap genetically, using a polygenic score approach. Design, Settings, and Participants Association analyses were conducted between MDD polygenic risk score (PRS) and AD case-control status in European ancestry samples from 4 independent genome-wide association study (GWAS) data sets: the Collaborative Study on the Genetics of Alcoholism (COGA); the Study of Addiction, Genetics, and Environment (SAGE); the Yale-Penn genetic study of substance dependence; and the National Health and Resilience in Veterans Study (NHRVS). Results from a meta-analysis of MDD (9240 patients with MDD and 9519 controls) from the Psychiatric Genomics Consortium were applied to calculate PRS at thresholds from P < .05 to P ≤ .99 in each AD GWAS data set. Main Outcomes and Measures Association between MDD PRS and AD. Results Participants analyzed included 788 cases (548 [69.5%] men; mean [SD] age, 38.2 [10.8] years) and 522 controls (151 [28.9.%] men; age [SD], 43.9 [11.6] years) from COGA; 631 cases (333 [52.8%] men; age [SD], 35.0 [7.7] years) and 756 controls (260 [34.4%] male; age [SD] 36.1 [7.7] years) from SAGE; 2135 cases (1375 [64.4%] men; age [SD], 39.4 [11.5] years) and 350 controls (126 [36.0%] men; age [SD], 43.5 [13.9] years) from Yale-Penn; and 317 cases (295 [93.1%] men; age [SD], 59.1 [13.1] years) and 1719 controls (1545 [89.9%] men; age [SD], 64.5 [13.3] years) from NHRVS. Higher MDD PRS was associated with a significantly increased risk of AD in all samples (COGA: best P = 1.7 × 10-6, R2 = 0.026; SAGE: best P = .001, R2 = 0.01; Yale-Penn: best P = .035, R2 = 0.0018; and NHRVS: best P = .004, R2 = 0.0074), with stronger evidence for association after meta-analysis of the 4 samples (best P = 3.3 × 10-9). In analyses adjusted for MDD status in 3 AD GWAS data sets, similar patterns of association were observed (COGA: best P = 7.6 × 10-6, R2 = 0.023; Yale-Penn: best P = .08, R2 = 0.0013; and NHRVS: best P = .006, R2 = 0.0072). After recalculating MDD PRS using MDD GWAS data sets without comorbid MDD-AD cases, significant evidence was observed for an association between the MDD PRS and AD in the meta-analysis of 3 GWAS AD samples without MDD cases (best P = .007). Conclusions and Relevance These results suggest that shared genetic susceptibility contributes modestly to MDD and AD comorbidity. Individuals with elevated polygenic risk for MDD may also be at risk for AD.
Collapse
Affiliation(s)
- Allan M. Andersen
- Department of Psychiatry, Carver College of Medicine, University of Iowa, Iowa City
| | - Robert H. Pietrzak
- US Department of Veterans Affairs (VA) National Center for Posttraumatic Stress Disorder, Clinical Neurosciences Division, VA Connecticut Healthcare System, West Haven
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut
| | - Henry R. Kranzler
- Department of Psychiatry, University of Pennsylvania Perelman School of Medicine, Philadelphia
- Mental Illness, Research, Education and Clinical Center of Veterans Integrated Service Network 4, Corporal Michael J. Crescenz VA Medical Center, Philadelphia, Pennsylvania
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park
| | - Hang Zhou
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut
| | - Xiaoming Liu
- Human Genetics Center, University of Texas Health Science Center at Houston
| | - John Kramer
- Department of Psychiatry, Carver College of Medicine, University of Iowa, Iowa City
| | - Samuel Kuperman
- Department of Psychiatry, Carver College of Medicine, University of Iowa, Iowa City
| | - Howard J. Edenberg
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis
| | - John I. Nurnberger
- Department of Psychiatry, Indiana University School of Medicine, Indianapolis
| | - John P. Rice
- Department of Psychiatry, Washington University School of Medicine, St Louis, Missouri
| | - Jay A. Tischfield
- Department of Genetics and the Human Genetics Institute of New Jersey, Rutgers University, Piscataway
| | - Alison Goate
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Tatiana M. Foroud
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis
| | - Jacquelyn L. Meyers
- Department of Psychiatry and Behavioral Sciences, State University of New York Downstate Medical Center, Brooklyn
| | - Bernice Porjesz
- Department of Psychiatry and Behavioral Sciences, State University of New York Downstate Medical Center, Brooklyn
| | - Danielle M. Dick
- Departments of Psychology and Human and Molecular Genetics, Virginia Commonwealth University, Richmond
| | - Victor Hesselbrock
- Department of Psychiatry, University of Connecticut School of Medicine, Farmington
| | - Eric Boerwinkle
- Human Genetics Center, University of Texas Health Science Center at Houston
| | - Steven M. Southwick
- US Department of Veterans Affairs (VA) National Center for Posttraumatic Stress Disorder, Clinical Neurosciences Division, VA Connecticut Healthcare System, West Haven
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut
| | - John H. Krystal
- US Department of Veterans Affairs (VA) National Center for Posttraumatic Stress Disorder, Clinical Neurosciences Division, VA Connecticut Healthcare System, West Haven
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut
| | - Myrna M. Weissman
- Division of Epidemiology, New York State Psychiatric Institute, New York
- Department of Psychiatry, Columbia University, College of Physicians and Surgeons, New York, New York
- Columbia University, Mailman School of Public Health, New York, New York
| | - Douglas F. Levinson
- Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, California
| | - James B. Potash
- Department of Psychiatry, Carver College of Medicine, University of Iowa, Iowa City
- Interdisciplinary Graduate Program in Genetics, University of Iowa, Iowa City
- now with the Department of Psychiatry and Behavioral Sciences, Johns Hopkins School of Medicine, Baltimore, Maryland
| | - Joel Gelernter
- US Department of Veterans Affairs (VA) National Center for Posttraumatic Stress Disorder, Clinical Neurosciences Division, VA Connecticut Healthcare System, West Haven
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut
| | - Shizhong Han
- Department of Psychiatry, Carver College of Medicine, University of Iowa, Iowa City
- now with the Department of Psychiatry and Behavioral Sciences, Johns Hopkins School of Medicine, Baltimore, Maryland
| |
Collapse
|
19
|
Niu YF, Ye C, He J, Han F, Guo LB, Zheng HF, Chen GB. Reproduction and In-Depth Evaluation of Genome-Wide Association Studies and Genome-Wide Meta-analyses Using Summary Statistics. G3 (BETHESDA, MD.) 2017; 7:943-952. [PMID: 28122950 PMCID: PMC5345724 DOI: 10.1534/g3.116.038877] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/16/2016] [Accepted: 01/16/2017] [Indexed: 11/21/2022]
Abstract
In line with open-source genetics, we report a novel linear regression technique for genome-wide association studies (GWAS), called Open GWAS algoriTHm (OATH). When individual-level data are not available, OATH can not only completely reproduce reported results from an experimental model, but also recover underreported results from other alternative models with a different combination of nuisance parameters using naïve summary statistics (NSS). OATH can also reliably evaluate all reported results in-depth (e.g., p-value variance analysis), as demonstrated for 42 Arabidopsis phenotypes under three magnesium (Mg) conditions. In addition, OATH can be used for consortium-driven genome-wide association meta-analyses (GWAMA), and can greatly improve the flexibility of GWAMA. A prototype of OATH is available in the Genetic Analysis Repository (https://github.com/gc5k/GEAR).
Collapse
Affiliation(s)
- Yao-Fang Niu
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, 310006 Zhejiang, China
| | - Chengyin Ye
- Department of Health Management, College of Medicine, Hangzhou Normal University, Hangzhou, 310021 Zhejiang, China
| | - Ji He
- Department of Neurology, Peking University Third Hospital, Beijing 100191, China
| | - Fang Han
- Department of Pulmonary and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China
| | - Long-Biao Guo
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, 310006 Zhejiang, China
| | - Hou-Feng Zheng
- Institute of Aging Research, College of Medicine, Hangzhou Normal University, Hangzhou, 310021 Zhejiang, China
- The Affiliated Hospital of Hangzhou Normal University, Hangzhou, 310015 Zhejiang, China
| | - Guo-Bo Chen
- Evergreen Landscape and Architecture Studio, Hangzhou, 310026 Zhejiang, China
| |
Collapse
|