1
|
Hai Y, Zhao W, Meng Q, Liu L, Wen Y. Bayesian linear mixed model with multiple random effects for family-based genetic studies. Front Genet 2023; 14:1267704. [PMID: 37928242 PMCID: PMC10620972 DOI: 10.3389/fgene.2023.1267704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Accepted: 09/25/2023] [Indexed: 11/07/2023] Open
Abstract
Motivation: Family-based study design is one of the popular designs used in genetic research, and the whole-genome sequencing data obtained from family-based studies offer many unique features for risk prediction studies. They can not only provide a more comprehensive view of many complex diseases, but also utilize information in the design to further improve the prediction accuracy. While promising, existing analytical methods often ignore the information embedded in the study design and overlook the predictive effects of rare variants, leading to a prediction model with sub-optimal performance. Results: We proposed a Bayesian linear mixed model for the prediction analysis of sequencing data obtained from family-based studies. Our method can not only capture predictive effects from both common and rare variants, but also easily accommodate various disease model assumptions. It uses information embedded in the study design to form surrogates, where the predictive effects from unmeasured/unknown genetic and environmental risk factors can be modelled. Through extensive simulation studies and the analysis of sequencing data obtained from the Michigan State University Twin Registry study, we have demonstrated that the proposed method outperforms commonly adopted techniques. Availability: R package is available at https://github.com/yhai943/FBLMM.
Collapse
Affiliation(s)
- Yang Hai
- Department of Statistics, University of Auckland, Auckland, New Zealand
| | - Wenxuan Zhao
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
| | - Qingyu Meng
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
| | - Long Liu
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
| | - Yalu Wen
- Department of Statistics, University of Auckland, Auckland, New Zealand
| |
Collapse
|
2
|
Mazo GE, Kasyanov ED, Nikolishin AE, Rukavishnikov GV, Shmukler AB, Golimbet VE, Neznanov NG, Kibitov AO. [Family history of affective disorders, the gender factor and clinical characteristics of depression]. Zh Nevrol Psikhiatr Im S S Korsakova 2021; 121:75-83. [PMID: 34405661 DOI: 10.17116/jnevro202112105275] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
OBJECTIVE Analysis of clinical features of development and course of depression in patients with FH of mood disorders taking into account sex differences. MATERIAL AND METHODS This multicenter cross-sectional study included patients over 18 years of age with depressive episode/recurrent depressive disorder. Clinical characteristics of depression, presence of comorbid mental illness and family history (FH) information were obtained in a structured clinical interview. RESULTS One hundred and seventy-one patients (mean age (M (SD)) 40.87 (15.86) y.o.), including 64.5% of women, were enrolled in the study. FH was revealed in 30.2% of patients. The proportion of FH did not differ in men and women (p=0.375). Generalized anxiety disorder (GAD) was more frequent in FH positive patients (p=0.016). Logistic regression also revealed that FH is a risk factor for concomitant GAD (p=0.019, OR=2.4). The GLM demonstrated a significant joint effect of FH and sex on the maximum duration of a depressive episode (p=0.044), as well on the number of suicide attempts (p=0.055) and the number of depressive episodes as a trend (p=0.072). CONCLUSION We have demonstrated the specific interaction of FH of mood disorders with sex on clinical course of depression. Thus, the manifestation of a genetic influence on the clinical phenotype of depression can be significantly moderated by sex.
Collapse
Affiliation(s)
- G E Mazo
- Bekhterev National Medical Research Center For Psychiatry And Neurology, St Petersburg, Russia
| | - E D Kasyanov
- Bekhterev National Medical Research Center For Psychiatry And Neurology, St Petersburg, Russia.,Saint-Petersburg State University Pirogov Clinic of High Medical Technologies, St. Petersburg, Russia
| | - A E Nikolishin
- Serbsky National Medical Research Center on Psychiatry and Addictions, Moscow, Russia
| | - G V Rukavishnikov
- Bekhterev National Medical Research Center For Psychiatry And Neurology, St Petersburg, Russia
| | - A B Shmukler
- Bekhterev National Medical Research Center For Psychiatry And Neurology, St Petersburg, Russia.,Serbsky National Medical Research Center on Psychiatry and Addictions, Moscow, Russia
| | - V E Golimbet
- Bekhterev National Medical Research Center For Psychiatry And Neurology, St Petersburg, Russia.,Mental Health Research Center, Moscow, Russia
| | - N G Neznanov
- Bekhterev National Medical Research Center For Psychiatry And Neurology, St Petersburg, Russia.,Pavlov First Saint-Petersburg State Medical University, St. Petersburg, Russia
| | - A O Kibitov
- Bekhterev National Medical Research Center For Psychiatry And Neurology, St Petersburg, Russia.,Serbsky National Medical Research Center on Psychiatry and Addictions, Moscow, Russia
| |
Collapse
|
3
|
Xiang X, Wang S, Liu T, Wang M, Li J, Jiang J, Wu T, Hu Y. Exploring gene-gene interaction in family-based data with an unsupervised machine learning method: EPISFA. Genet Epidemiol 2020; 44:811-824. [PMID: 32869348 DOI: 10.1002/gepi.22342] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2019] [Revised: 06/06/2020] [Accepted: 06/21/2020] [Indexed: 11/06/2022]
Abstract
Gene-gene interaction (G × G) is thought to fill the gap between the estimated heritability of complex diseases and the limited genetic proportion explained by identified single-nucleotide polymorphisms. The current tools for exploring G × G were often developed for case-control designs with less considerations for their applications in families. Family-based studies are robust against bias led from population stratification in genetic studies and helpful in understanding G × G. We proposed a new algorithm epistasis sparse factor analysis (EPISFA) and epistasis sparse factor analysis for linkage disequilibrium (EPISFA-LD) based on unsupervised machine learning to screen G × G. Extensive simulations were performed to compare EPISFA/EPISFA-LD with a classical family-based algorithm FAM-MDR (family-based multifactor dimensionality reduction). The results showed that EPISFA/EPISFA-LD is a tool of both high power and computational efficiency that could be applied in family designs and is applicable within high-dimensionality datasets. Finally, we applied EPISFA/EPISFA-LD to a real dataset drawn from the Fangshan/family-based Ischemic Stroke Study in China. Five pairs of G × G were discovered by EPISFA/EPISFA-LD, including three pairs verified by other algorithms (FAM-MDR and logistic), and an additional two pairs uniquely identified by EPISFA/EPISFA-LD only. The results from EPISFA might offer new insights for understanding the genetic etiology of complex diseases. EPISFA/EPISFA-LD was implemented in R. All relevant source code as well as simulated data could be freely downloaded from https://github.com/doublexism/episfa.
Collapse
Affiliation(s)
- Xiao Xiang
- Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
| | - Siyue Wang
- Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
| | - Tianyi Liu
- Department of Epidemiology and Biostatistics, School of Public Health, Capital Medical University, Beijing, China
| | - Mengying Wang
- Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
| | - Jiawen Li
- Department of Clinical Medicine, School of Medicine, Peking University, Beijing, China
| | - Jin Jiang
- Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
| | - Tao Wu
- Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
| | - Yonghua Hu
- Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
| |
Collapse
|
4
|
Kasyanov ED, Merkulova TV, Kibitov AO, Mazo GE. Genetics of Bipolar Spectrum Disorders: Focus on Family Studies Using Whole Exome Sequencing. RUSS J GENET+ 2020. [DOI: 10.1134/s1022795420070054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
5
|
Fang H, Yang Y, Chen L. Weighted Transmission Disequilibrium Test for Family Trio Association Design. Hum Hered 2019; 83:196-209. [PMID: 30865952 DOI: 10.1159/000494353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Accepted: 10/09/2018] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Family-based design is one of the most popular designs in genetic studies. Transmission disequilibrium test (TDT) for family trio design is optimal only under the additive trait model and may lose power under the other trait models. The TDT-type tests are powerful only when the underlying trait model is correctly specified. Usually, the true trait model is unknown, and the selection of the TDT-type test is problematic. Several methods, which are robust against the mis-specification of the trait model, have been proposed. In this paper, we propose a new efficiency robust procedure for family trio design, namely, the weighted TDT (WTDT) test. METHODS We combine information of the largest two TDT-type tests by using weights related to the three TDT-type tests and take the weighted sum as the test statistic. RESULTS Simulation results demonstrate that WTDT has power close to, but much more robust than, the optimal TDT-type test based on a single trait model. WTDT also outperforms other efficiency robust methods in terms of power. Applications to real and simulated data from Genetic Analysis Workshop (GAW15) illustrate the practical application of the WTDT method. CONCLUSION WTDT is not only efficiency robust to model mis-specifications but also efficiency robust against mis-specifications of risk allele.
Collapse
Affiliation(s)
- Hongyan Fang
- School of Mathematical Sciences, Anhui University, Hefei, China
| | - Yaning Yang
- Department of Statistics and Finance, University of Science and Technology of China, Hefei, China,
| | - Ling Chen
- School of Mathematical Sciences, Anhui University, Hefei, China
| |
Collapse
|
6
|
Kasyanov ED, Mazo GE, Kibitov AO. The role of family studies in research of neurobiological basis of depressive disorders. Zh Nevrol Psikhiatr Im S S Korsakova 2019; 119:87-93. [DOI: 10.17116/jnevro201911902187] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
7
|
An H, Wei CS, Wang O, Wang DH, Xu LW, Lu Q, Ye CY. An ensemble-based likelihood ratio approach for family-based genomic risk prediction. J Zhejiang Univ Sci B 2018; 19:935-947. [PMID: 30507077 PMCID: PMC6305257 DOI: 10.1631/jzus.b1800162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2018] [Revised: 07/12/2018] [Accepted: 07/12/2018] [Indexed: 11/11/2022]
Abstract
OBJECTIVE As one of the most popular designs used in genetic research, family-based design has been well recognized for its advantages, such as robustness against population stratification and admixture. With vast amounts of genetic data collected from family-based studies, there is a great interest in studying the role of genetic markers from the aspect of risk prediction. This study aims to develop a new statistical approach for family-based risk prediction analysis with an improved prediction accuracy compared with existing methods based on family history. METHODS In this study, we propose an ensemble-based likelihood ratio (ELR) approach, Fam-ELR, for family-based genomic risk prediction. Fam-ELR incorporates a clustered receiver operating characteristic (ROC) curve method to consider correlations among family samples, and uses a computationally efficient tree-assembling procedure for variable selection and model building. RESULTS Through simulations, Fam-ELR shows its robustness in various underlying disease models and pedigree structures, and attains better performance than two existing family-based risk prediction methods. In a real-data application to a family-based genome-wide dataset of conduct disorder, Fam-ELR demonstrates its ability to integrate potential risk predictors and interactions into the model for improved accuracy, especially on a genome-wide level. CONCLUSIONS By comparing existing approaches, such as genetic risk-score approach, Fam-ELR has the capacity of incorporating genetic variants with small or moderate marginal effects and their interactions into an improved risk prediction model. Therefore, it is a robust and useful approach for high-dimensional family-based risk prediction, especially on complex disease with unknown or less known disease etiology.
Collapse
Affiliation(s)
- Hui An
- Department of Health Management, School of Medicine, Hangzhou Normal University, Hangzhou 310036, China
| | - Chang-shuai Wei
- Department of Biostatistics and Epidemiology, University of North Texas Health Science Center, Fort Worth, TX 76107, USA
| | | | - Da-hui Wang
- Department of Health Management, School of Medicine, Hangzhou Normal University, Hangzhou 310036, China
| | - Liang-wen Xu
- Department of Preventive Medicine, School of Medicine, Hangzhou Normal University, Hangzhou 310036, China
| | - Qing Lu
- Department of Epidemiology and Biostatistics, College of Human Medicine, Michigan State University, East Lansing, MI 48824, USA
| | - Cheng-yin Ye
- Department of Health Management, School of Medicine, Hangzhou Normal University, Hangzhou 310036, China
| |
Collapse
|