1
|
Du J, Wang C, Wang L, Mao S, Zhu B, Li Z, Fan X. Automatic block-wise genotype-phenotype association detection based on hidden Markov model. BMC Bioinformatics 2023; 24:138. [PMID: 37029361 PMCID: PMC10082540 DOI: 10.1186/s12859-023-05265-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Accepted: 03/31/2023] [Indexed: 04/09/2023] Open
Abstract
BACKGROUND For detecting genotype-phenotype association from case-control single nucleotide polymorphism (SNP) data, one class of methods relies on testing each genomic variant site individually. However, this approach ignores the tendency for associated variant sites to be spatially clustered instead of uniformly distributed along the genome. Therefore, a more recent class of methods looks for blocks of influential variant sites. Unfortunately, existing such methods either assume prior knowledge of the blocks, or rely on ad hoc moving windows. A principled method is needed to automatically detect genomic variant blocks which are associated with the phenotype. RESULTS In this paper, we introduce an automatic block-wise Genome-Wide Association Study (GWAS) method based on Hidden Markov model. Using case-control SNP data as input, our method detects the number of blocks associated with the phenotype and the locations of the blocks. Correspondingly, the minor allele of each variate site will be classified as having negative influence, no influence or positive influence on the phenotype. We evaluated our method using both datasets simulated from our model and datasets from a block model different from ours, and compared the performance with other methods. These included both simple methods based on the Fisher's exact test, applied site-by-site, as well as more complex methods built into the recent Zoom-Focus Algorithm. Across all simulations, our method consistently outperformed the comparisons. CONCLUSIONS With its demonstrated better performance, we expect our algorithm for detecting influential variant sites may help find more accurate signals across a wide range of case-control GWAS.
Collapse
Affiliation(s)
- Jin Du
- Department of Statistics, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong.
| | - Chaojie Wang
- School of Mathematical Science, Jiangsu University, Zhenjiang, Jiangsu Province, China
| | - Lijun Wang
- Department of Statistics, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Shanjun Mao
- College of Finance and Statistics, Hunan University, Changsha, Hunan Province, China
| | - Bencong Zhu
- Department of Statistics, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Zheng Li
- Department of Surgery, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Xiaodan Fan
- Department of Statistics, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong.
| |
Collapse
|
2
|
Sun R, Weng H, Wang MH. W-Test for Genetic Epistasis Testing. Methods Mol Biol 2021; 2212:45-53. [PMID: 33733349 DOI: 10.1007/978-1-0716-0947-7_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The genetic epistasis effect has been widely acknowledged as an essential contributor to genetic variation in complex diseases. In this chapter, we introduce a powerful and efficient statistical method, called W-test, for genetic epistasis testing. A wtest R package is developed for the implementation of the W-test method, which provides various functions to measure the main effect, pairwise interaction, higher-order interaction, and cis-regulation of SNP-CpG pairs in genetic and epigenetic data. It allows flexible stagewise and exhaustive association testing as well as diagnostic checking on the probability distributions in a user-friendly interface. The wtest package is available in CRAN at https://CRAN.R-project.org/package=wtest .
Collapse
Affiliation(s)
- Rui Sun
- The Chinese University of Hong Kong, Hong Kong, China
| | - Haoyi Weng
- The Chinese University of Hong Kong, Hong Kong, China
| | | |
Collapse
|
3
|
Sun R, Xia X, Chong KC, Zee BCY, Wu WKK, Wang MH. wtest: an integrated R package for genetic epistasis testing. BMC Med Genomics 2019; 12:180. [PMID: 31874630 PMCID: PMC6929460 DOI: 10.1186/s12920-019-0638-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Accepted: 11/26/2019] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND With the increasing amount of high-throughput genomic sequencing data, there is a growing demand for a robust and flexible tool to perform interaction analysis. The identification of SNP-SNP, SNP-CpG, and higher order interactions helps explain the genetic etiology of human diseases, yet genome-wide analysis for interactions has been very challenging, due to the computational burden and a lack of statistical power in most datasets. RESULTS The wtest R package performs association testing for main effects, pairwise and high order interactions in genome-wide association study data, and cis-regulation of SNP and CpG sites in genome-wide and epigenome-wide data. The software includes a number of post-test diagnostic and analysis functions and offers an integrated toolset for genetic epistasis testing. CONCLUSIONS The wtest is an efficient and powerful statistical tool for integrated genetic epistasis testing. The package is available in CRAN: https://CRAN.R-project.org/package=wtest.
Collapse
Affiliation(s)
- Rui Sun
- Division of Biostatistics and Centre for Clinical Research and Biostatistics(CCRB), JC School of Public Health and Primary Care, the Chinese University of Hong Kong, Sha Tin, Hong Kong SAR, China.,Centre for Clinical Trials and Biostatistics, CUHK Shenzhen Research Institute, Shenzhen, China
| | - Xiaoxuan Xia
- Division of Biostatistics and Centre for Clinical Research and Biostatistics(CCRB), JC School of Public Health and Primary Care, the Chinese University of Hong Kong, Sha Tin, Hong Kong SAR, China.,Centre for Clinical Trials and Biostatistics, CUHK Shenzhen Research Institute, Shenzhen, China
| | - Ka Chun Chong
- Division of Biostatistics and Centre for Clinical Research and Biostatistics(CCRB), JC School of Public Health and Primary Care, the Chinese University of Hong Kong, Sha Tin, Hong Kong SAR, China.,Centre for Clinical Trials and Biostatistics, CUHK Shenzhen Research Institute, Shenzhen, China
| | - Benny Chung-Ying Zee
- Division of Biostatistics and Centre for Clinical Research and Biostatistics(CCRB), JC School of Public Health and Primary Care, the Chinese University of Hong Kong, Sha Tin, Hong Kong SAR, China.,Centre for Clinical Trials and Biostatistics, CUHK Shenzhen Research Institute, Shenzhen, China
| | - William Ka Kei Wu
- Institute of Digestive Diseases and Department of Medicine & Therapeutics, State Key Laboratory of Digestive Diseases, LKS Institute of Health Sciences, CUHK Shenzhen Research Institute, Shenzhen, China.,Department of Anesthesia, the Chinese University of Hong Kong, Sha Tin, Hong Kong SAR, China
| | - Maggie Haitian Wang
- Division of Biostatistics and Centre for Clinical Research and Biostatistics(CCRB), JC School of Public Health and Primary Care, the Chinese University of Hong Kong, Sha Tin, Hong Kong SAR, China. .,Centre for Clinical Trials and Biostatistics, CUHK Shenzhen Research Institute, Shenzhen, China.
| |
Collapse
|
4
|
Wang MH, Weng H. Genetic Test, Risk Prediction, and Counseling. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2017; 1005:21-46. [DOI: 10.1007/978-981-10-5717-5_2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|