1
|
Alarcón-Granados MC, Camargo-Villalba GE, Forero-Castro M. Exploring Genetic Interactions in Colombian Women with Polycystic Ovarian Syndrome: A Study on SNP-SNP Associations. Int J Mol Sci 2024; 25:9212. [PMID: 39273163 PMCID: PMC11395444 DOI: 10.3390/ijms25179212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2024] [Revised: 08/05/2024] [Accepted: 08/06/2024] [Indexed: 09/15/2024] Open
Abstract
Polycystic ovary syndrome (PCOS) is an endocrine and metabolic disorder with high prevalence in women around the world. The identification of single-nucleotide polymorphisms (SNPs) through genome-wide association studies has classified it as a polygenic disease. Most studies have independently evaluated the contribution of each SNP to the risk of PCOS. Few studies have assessed the effect of epistasis among the identified SNPs. Therefore, this exploratory study aimed to evaluate the interaction of 27 SNPs identified as risk candidates and their contribution to the pathogenesis of PCOS. The study population included 49 control women and 49 women with PCOS with a normal BMI. Genotyping was carried out through the MassARRAY iPLEX single-nucleotide polymorphism typing platform. Using the multifactor dimensionality reduction (MDR) method, the interaction between SNPs was evaluated. The analysis showed that the best interaction model (p < 0.0001) was composed of three loci (rs11692782-FSHR, rs2268361-FSHR, and rs4784165-TOX3). Furthermore, a tendency towards synergy was evident between rs2268361 and the SNPs rs7371084-rs11692782-rs4784165, as well as a redundancy in rs7371084-rs11692782-rs4784165. This pilot study suggests that epistasis may influence PCOS pathophysiology. Large-scale analysis is needed to deepen our understanding of its impact on this complex syndrome affecting thousands of women.
Collapse
Affiliation(s)
| | | | - Maribel Forero-Castro
- Faculty of Sciences, Universidad Pedagógica y Tecnológica de Colombia, Tunja 150003, Colombia
| |
Collapse
|
2
|
Lin HY, Mazumder H, Sarkar I, Huang PY, Eeles RA, Kote-Jarai Z, Muir KR, Schleutker J, Pashayan N, Batra J, Neal DE, Nielsen SF, Nordestgaard BG, Grönberg H, Wiklund F, MacInnis RJ, Haiman CA, Travis RC, Stanford JL, Kibel AS, Cybulski C, Khaw KT, Maier C, Thibodeau SN, Teixeira MR, Cannon-Albright L, Brenner H, Kaneva R, Pandha H, Park JY. Cluster effect for SNP-SNP interaction pairs for predicting complex traits. Sci Rep 2024; 14:18677. [PMID: 39134575 PMCID: PMC11319716 DOI: 10.1038/s41598-024-66311-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Accepted: 07/01/2024] [Indexed: 08/15/2024] Open
Abstract
Single nucleotide polymorphism (SNP) interactions are the key to improving polygenic risk scores. Previous studies reported several significant SNP-SNP interaction pairs that shared a common SNP to form a cluster, but some identified pairs might be false positives. This study aims to identify factors associated with the cluster effect of false positivity and develop strategies to enhance the accuracy of SNP-SNP interactions. The results showed the cluster effect is a major cause of false-positive findings of SNP-SNP interactions. This cluster effect is due to high correlations between a causal pair and null pairs in a cluster. The clusters with a hub SNP with a significant main effect and a large minor allele frequency (MAF) tended to have a higher false-positive rate. In addition, peripheral null SNPs in a cluster with a small MAF tended to enhance false positivity. We also demonstrated that using the modified significance criterion based on the 3 p-value rules and the bootstrap approach (3pRule + bootstrap) can reduce false positivity and maintain high true positivity. In addition, our results also showed that a pair without a significant main effect tends to have weak or no interaction. This study identified the cluster effect and suggested using the 3pRule + bootstrap approach to enhance SNP-SNP interaction detection accuracy.
Collapse
Affiliation(s)
- Hui-Yi Lin
- Biostatistics and Data Science Program, School of Public Health, Louisiana State University Health Sciences Center, New Orleans, LA, 70112, USA.
| | - Harun Mazumder
- Biostatistics and Data Science Program, School of Public Health, Louisiana State University Health Sciences Center, New Orleans, LA, 70112, USA
| | - Indrani Sarkar
- Biostatistics and Data Science Program, School of Public Health, Louisiana State University Health Sciences Center, New Orleans, LA, 70112, USA
| | - Po-Yu Huang
- Information and Communications Research Laboratories, Industrial Technology Research Institute, Hsinchu, Taiwan
| | - Rosalind A Eeles
- The Institute of Cancer Research, London, SM2 5NG, UK
- Royal Marsden NHS Foundation Trust, London, SW3 6JJ, UK
| | | | - Kenneth R Muir
- Division of Population Health, Health Services Research and Primary Care, University of Manchester, Oxford Road, Manchester, M13 9PL, UK
| | - Johanna Schleutker
- Institute of Biomedicine, University of Turku, Turku, Finland
- Department of Medical Genetics, Genomics, Laboratory Division, Turku University Hospital, PO Box 52, 20521, Turku, Finland
| | - Nora Pashayan
- Department of Applied Health Research, University College London, London, WC1E 7HB, UK
- Centre for Cancer Genetic Epidemiology, Department of Oncology, University of Cambridge, Strangeways Laboratory, Worts Causeway, Cambridge, CB1 8RN, UK
| | - Jyotsna Batra
- Australian Prostate Cancer Research Centre-Qld, Institute of Health and Biomedical Innovation and School of Biomedical Science, Queensland University of Technology, Brisbane, QLD, 4059, Australia
- Translational Research Institute, Brisbane, QLD, 4102, Australia
| | - David E Neal
- Nuffield Department of Surgical Sciences, University of Oxford, John Radcliffe Hospital, Room 6603, Level 6, Headley Way, Headington, Oxford, OX3 9DU, UK
- Department of Oncology, University of Cambridge, Addenbrooke's Hospital, Hills Road, Box 279, Cambridge, CB2 0QQ, UK
- Cancer Research UK, Cambridge Research Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, UK
| | - Sune F Nielsen
- Faculty of Health and Medical Sciences, University of Copenhagen, 2200, Copenhagen, Denmark
- Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, 2200, Copenhagen, Denmark
| | - Børge G Nordestgaard
- Faculty of Health and Medical Sciences, University of Copenhagen, 2200, Copenhagen, Denmark
- Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, 2200, Copenhagen, Denmark
| | - Henrik Grönberg
- Department of Medical Epidemiology and Biostatistics, Karolinska Institute, 171 77, Stockholm, Sweden
| | - Fredrik Wiklund
- Department of Medical Epidemiology and Biostatistics, Karolinska Institute, 171 77, Stockholm, Sweden
| | - Robert J MacInnis
- Cancer Epidemiology Division, Cancer Council Victoria, 200 Victoria Parade, East Melbourne, 3002, Australia
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Grattan Street, Parkville, VIC, 3010, Australia
| | - Christopher A Haiman
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California/Norris Comprehensive Cancer Center, Los Angeles, CA, 90015, USA
| | - Ruth C Travis
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, OX3 7LF, UK
| | - Janet L Stanford
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, 98109-1024, USA
- Department of Epidemiology, School of Public Health, University of Washington, Seattle, WA, 98195, USA
| | - Adam S Kibel
- Division of Urologic Surgery, Brigham and Womens Hospital, 75 Francis Street, Boston, MA, 02115, USA
| | - Cezary Cybulski
- International Hereditary Cancer Center, Department of Genetics and Pathology, Pomeranian Medical University, 70-115, Szczecin, Poland
| | - Kay-Tee Khaw
- Clinical Gerontology Unit, University of Cambridge, Cambridge, CB2 2QQ, UK
| | - Christiane Maier
- Humangenetik Tuebingen, Paul-Ehrlich-Str 23, 72076, Tuebingen, Germany
| | - Stephen N Thibodeau
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Manuel R Teixeira
- Department of Laboratory Genetics, Portuguese Oncology Institute of Porto (IPO Porto)/Porto Comprehensive Cancer Center, Porto, Portugal
- Cancer Genetics Group, IPO Porto Research Center (CI-IPOP)/RISE@CI-IPOP (Health Research Network), Portuguese Oncology Institute of Porto (IPO Porto)/Porto Comprehensive Cancer Center, Porto, Portugal
- School of Medicine and Biomedical Sciences (ICBAS), University of Porto, Porto, Portugal
| | - Lisa Cannon-Albright
- Division of Epidemiology, Department of Internal Medicine, University of Utah School of Medicine, Salt Lake City, UT, 84132, USA
- George E. Wahlen Department of Veterans Affairs Medical Center, Salt Lake City, UT, 84148, USA
| | - Hermann Brenner
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), 69120, Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), 69120, Heidelberg, Germany
- Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Im Neuenheimer Feld 460, 69120, Heidelberg, Germany
| | - Radka Kaneva
- Molecular Medicine Center, Department of Medical Chemistry and Biochemistry, Medical University of Sofia, Sofia, 2 Zdrave Str., 1431, Sofia, Bulgaria
| | - Hardev Pandha
- The University of Surrey, Guildford, Surrey, GU2 7XH, UK
| | - Jong Y Park
- Department of Cancer Epidemiology, Moffitt Cancer Center, 12902 Magnolia Drive, Tampa, FL, 33612, USA
| |
Collapse
|
3
|
Cheng Y, Zhang C, Li Q, Yang X, Chen W, He K, Chen M. MTF1 genetic variants are associated with lung cancer risk in the Chinese Han population. BMC Cancer 2024; 24:778. [PMID: 38943058 PMCID: PMC11212402 DOI: 10.1186/s12885-024-12516-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 06/13/2024] [Indexed: 07/01/2024] Open
Abstract
BACKGROUND Metal-regulatory transcription factor 1 (MTF1), a conserved metal-binding transcription factor in eukaryotes, regulates the proliferation of cancer cells by activating downstream target genes and then participates in the formation and progression of tumors, including lung cancer (LC). The expression level of MTF1 is down-regulated in LC, and high expression of MTF1 is associated with a good prognosis of LC. However, the association between MTF1 polymorphism and LC risk has not been explored. METHODS The genotyping of MTF1 Single nucleotide polymorphisms (SNPs) including rs473279, rs28411034, rs28411352, and rs3748682 was identified by the Agena MassARRAY system among 670 healthy controls and 670 patients with LC. The odds ratio (OR) and 95% confidence intervals (CI) were calculated by logistics regression to assess the association of these SNPs with LC risk. RESULTS MTF1 rs28411034 (OR 1.22, 95% CI 1.03-1.45, p = 0.024) and rs3748682 (OR 1.24, 95% CI 1.04-1.47, p = 0.014) were associated with higher LC susceptibility overall. Moreover, the effect of rs28411034 and rs3748682 on LC susceptibility was observed in males, subjects with body mass index (BMI) ≥ 24 kg/m2, smokers, drinkers, and patients with lung squamous carcinoma (OR and 95% CI > 1, p < 0.05). Besides, rs28411352 (OR 0.73, 95% CI 0.55-0.97, p = 0.028,) showed protective effect for reduced LC risk in drinkers. CONCLUSIONS We were first who reported that rs28411034 and rs3748682 tended to be relevant to increased LC susceptibility among the Chinese Han population. These results of this study could help to recognize the pathogenic mechanisms of the MTF1 gene in LC progress.
Collapse
Affiliation(s)
- Yujing Cheng
- Department of Respiratory Medicine, The First Affiliated Hospital of School of Medicine of Xi'an Jiaotong University, Yanta District, No. 277, Yanta West Road, Xi'an, 710061, Shaanxi, China
- Department of Blood Transfusion, The First People's Hospital of Yunnan Province, The Afiliated Hospital of Kunming University of Science and Technology, Kunming, 650032, Yunnan, China
| | - Chan Zhang
- Department of Blood Transfusion, The First People's Hospital of Yunnan Province, The Afiliated Hospital of Kunming University of Science and Technology, Kunming, 650032, Yunnan, China
| | - Qi Li
- Department of Blood Transfusion, The First People's Hospital of Yunnan Province, The Afiliated Hospital of Kunming University of Science and Technology, Kunming, 650032, Yunnan, China
| | - Xin Yang
- Department of Blood Transfusion, The First People's Hospital of Yunnan Province, The Afiliated Hospital of Kunming University of Science and Technology, Kunming, 650032, Yunnan, China
| | - Wanlu Chen
- Department of Blood Transfusion, The First People's Hospital of Yunnan Province, The Afiliated Hospital of Kunming University of Science and Technology, Kunming, 650032, Yunnan, China
| | - KunHua He
- Department of Blood Transfusion, The First People's Hospital of Qujing City, Qujing, 655099, Yunnan, China
| | - Mingwei Chen
- Department of Respiratory Medicine, The First Affiliated Hospital of School of Medicine of Xi'an Jiaotong University, Yanta District, No. 277, Yanta West Road, Xi'an, 710061, Shaanxi, China.
| |
Collapse
|
4
|
Prieto-Fernández A, Sánchez-Barroso G, González-Domínguez J, García-Sanz-Calcedo J. Interaction between maintenance variables of medical ultrasound scanners through multifactor dimensionality reduction. Expert Rev Med Devices 2023; 20:851-864. [PMID: 37522639 DOI: 10.1080/17434440.2023.2243208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Revised: 06/14/2023] [Accepted: 06/22/2023] [Indexed: 08/01/2023]
Abstract
BACKGROUND Proper maintenance of electro-medical devices is crucial for the quality of care to patients and the economic performance of healthcare organizations. This research aims to identify the interaction between Ultrasound scanners (US) maintenance variables as a function of maintenance indicators: US in service or decommissioned, excessive number of failures, and failure rate. Knowing those interactions, specific maintenance measures will be developed to improve the reliability of the US. RESEARCH DESIGN AND METHODS Multifactor Dimensionality Reduction (MDR) method was eployed to analyze data from 222 US and their four-year maintenance history. Models were developed based on the variables with the greatest influence on maintenance indicators, where US were classified according to the associated risk. RESULTS US with more than one major failure or at least one major component replacement had up to 496.4% more failures than the average. Failure rate increased by up to 188.7% over the average for those US with more than three moderate failures, three replacements, or both. CONCLUSIONS This study identifies and quantifies the causes of risk to establish a specific maintenance plan for US. It helps to better understand the degradation of US to optimize their operation and maintenance.
Collapse
Affiliation(s)
| | - Gonzalo Sánchez-Barroso
- Engineering Projects Area, School of Industrial Engineering, University of Extremadura, Badajoz, Spain
| | - Jaime González-Domínguez
- Engineering Projects Area, School of Industrial Engineering, University of Extremadura, Badajoz, Spain
| | - Justo García-Sanz-Calcedo
- Engineering Projects Area, School of Industrial Engineering, University of Extremadura, Badajoz, Spain
| |
Collapse
|
5
|
Yang CH, Hou MF, Chuang LY, Yang CS, Lin YD. Dimensionality reduction approach for many-objective epistasis analysis. Brief Bioinform 2023; 24:6858949. [PMID: 36458451 DOI: 10.1093/bib/bbac512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 10/07/2022] [Accepted: 10/26/2022] [Indexed: 12/04/2022] Open
Abstract
In epistasis analysis, single-nucleotide polymorphism-single-nucleotide polymorphism interactions (SSIs) among genes may, alongside other environmental factors, influence the risk of multifactorial diseases. To identify SSI between cases and controls (i.e. binary traits), the score for model quality is affected by different objective functions (i.e. measurements) because of potential disease model preferences and disease complexities. Our previous study proposed a multiobjective approach-based multifactor dimensionality reduction (MOMDR), with the results indicating that two objective functions could enhance SSI identification with weak marginal effects. However, SSI identification using MOMDR remains a challenge because the optimal measure combination of objective functions has yet to be investigated. This study extended MOMDR to the many-objective version (i.e. many-objective MDR, MaODR) by integrating various disease probability measures based on a two-way contingency table to improve the identification of SSI between cases and controls. We introduced an objective function selection approach to determine the optimal measure combination in MaODR among 10 well-known measures. In total, 6 disease models with and 40 disease models without marginal effects were used to evaluate the general algorithms, namely those based on multifactor dimensionality reduction, MOMDR and MaODR. Our results revealed that the MaODR-based three objective function model, correct classification rate, likelihood ratio and normalized mutual information (MaODR-CLN) exhibited the higher 6.47% detection success rates (Accuracy) than MOMDR and higher 17.23% detection success rates than MDR through the application of an objective function selection approach. In a Wellcome Trust Case Control Consortium, MaODR-CLN successfully identified the significant SSIs (P < 0.001) associated with coronary artery disease. We performed a systematic analysis to identify the optimal measure combination in MaODR among 10 objective functions. Our combination detected SSIs-based binary traits with weak marginal effects and thus reduced spurious variables in the score model. MOAI is freely available at https://sites.google.com/view/maodr/home.
Collapse
Affiliation(s)
- Cheng-Hong Yang
- Department of Information Management at the Tainan University of Technology, and at the Department of Electronic Engineering at National Kaohsiung of Science and Technology, Taiwan.,Biomedical Engineering, Kaohsiung Medical University, Taiwan
| | - Ming-Feng Hou
- Kaohsiung Medical University Hospital, and Professor at the Department of Surgery, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Li-Yeh Chuang
- Department of Chemical Engineering & Institute of Biotechnology and Chemical Engineering at I-Shou University, Taiwan
| | - Cheng-San Yang
- Department of Plastic Surgery, and serves as the Medical Matters Secretary of Chia-Yi Christian Hospital, Taiwan
| | - Yu-Da Lin
- Department of Computer Science and Information Engineering, and at the National Penghu University of Science and Technology, Taiwan
| |
Collapse
|
6
|
Yang CH, Huang HC, Hou MF, Chuang LY, Lin YD. Fuzzy-Based Multiobjective Multifactor Dimensionality Reduction for Epistasis Analysis. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:378-387. [PMID: 35061588 DOI: 10.1109/tcbb.2022.3144303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Epistasis detection is vital for understanding disease susceptibility in genetics. Multiobjective multifactor dimensionality reduction (MOMDR) was previously proposed to detect epistasis. MOMDR was performed using binary classification to distinguish the high-risk (H) and low-risk (L) groups to reduce multifactor dimensionality. However, the binary classification does not reflect the uncertainty of the H and L classification. In this study, we proposed an empirical fuzzy MOMDR (EFMOMDR) to address the limitations of binary classification using the degree of membership through an empirical fuzzy approach. The EFMOMDR can simultaneously consider two incorporated fuzzy-based measures, including correct classification rate and likelihood rate, and does not require parameter tuning. Simulation studies revealed that EFMOMDR has higher 7.14% detection success rates than MOMDR, indicating that the limitations of binary classification of MOMDR have been successfully improved by empirical fuzzy. Moreover, EFMOMDR was used to analyze coronary artery disease in the Wellcome Trust Case Control Consortium dataset.
Collapse
|
7
|
Curtis A, Yu Y, Carey M, Parfrey P, Yilmaz YE, Savas S. Examining SNP-SNP interactions and risk of clinical outcomes in colorectal cancer using multifactor dimensionality reduction based methods. Front Genet 2022; 13:902217. [PMID: 35991579 PMCID: PMC9385108 DOI: 10.3389/fgene.2022.902217] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 06/30/2022] [Indexed: 12/24/2022] Open
Abstract
Background: SNP interactions may explain the variable outcome risk among colorectal cancer patients. Examining SNP interactions is challenging, especially with large datasets. Multifactor Dimensionality Reduction (MDR)-based programs may address this problem.Objectives: 1) To compare two MDR-based programs for their utility; and 2) to apply these programs to sets of MMP and VEGF-family gene SNPs in order to examine their interactions in relation to colorectal cancer survival outcomes.Methods: This study applied two data reduction methods, Cox-MDR and GMDR 0.9, to study one to three way SNP interactions. Both programs were run using a 5-fold cross validation step and the top models were verified by permutation testing. Prognostic associations of the SNP interactions were verified using multivariable regression methods. Eight datasets, including SNPs from MMP family genes (n = 201) and seven sets of VEGF-family interaction networks (n = 1,517 SNPs) were examined.Results: ∼90 million potential interactions were examined. Analyses in the MMP and VEGF gene family datasets found several novel 1- to 3-way SNP interactions. These interactions were able to distinguish between the patients with different outcome risks (regression p-values 0.03–2.2E-09). The strongest association was detected for a 3-way interaction including CHRM3.rs665159_EPN1.rs6509955_PTGER3.rs1327460 variants.Conclusion: Our work demonstrates the utility of data reduction methods while identifying potential prognostic markers in colorectal cancer.
Collapse
Affiliation(s)
- Aaron Curtis
- Discipline of Genetics, Faculty of Medicine, Memorial University, St. John’s, NL, Canada
- Division of Biomedical Sciences, Faculty of Medicine, Memorial University, St. John’s, NL, Canada
| | - Yajun Yu
- Discipline of Genetics, Faculty of Medicine, Memorial University, St. John’s, NL, Canada
- Division of Biomedical Sciences, Faculty of Medicine, Memorial University, St. John’s, NL, Canada
| | - Megan Carey
- Discipline of Genetics, Faculty of Medicine, Memorial University, St. John’s, NL, Canada
| | - Patrick Parfrey
- Discipline of Medicine, Faculty of Medicine, Memorial University, St. John’s, NL, Canada
| | - Yildiz E. Yilmaz
- Discipline of Genetics, Faculty of Medicine, Memorial University, St. John’s, NL, Canada
- Discipline of Medicine, Faculty of Medicine, Memorial University, St. John’s, NL, Canada
- Department of Mathematics and Statistics, Faculty of Science, Memorial University, St. John’s, NL, Canada
| | - Sevtap Savas
- Discipline of Genetics, Faculty of Medicine, Memorial University, St. John’s, NL, Canada
- Division of Biomedical Sciences, Faculty of Medicine, Memorial University, St. John’s, NL, Canada
- Discipline of Oncology, Faculty of Medicine, Memorial University, St. John’s, NL, Canada
- *Correspondence: Sevtap Savas,
| |
Collapse
|
8
|
Yang CH, Lin YD, Chuang LY. Class Balanced Multifactor Dimensionality Reduction to Detect Gene-Gene Interactions. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:71-81. [PMID: 30040653 DOI: 10.1109/tcbb.2018.2858776] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Detecting gene-gene interactions in single-nucleotide polymorphism data is vital for understanding disease susceptibility. However, existing approaches may be limited by the sample size in case-control studies. Herein, we propose a balance approach for the multifactor dimensionality reduction (BMDR) method to increase the accuracy of estimates of the prediction error rate in small samples. BMDR explicitly selects the best model by evaluating the average of prediction error rates over k-fold cross-validation without cross-validation consistency selection. In this study, we used several epistatic models with and without marginal effects under different parameter settings (heritability and minor allele frequencies) to evaluate the performance of existing approaches. Using simulated data sets, BMDR successfully detected gene-gene interactions, particularly for data sets with small sample sizes. A large data set was obtained from the Wellcome Trust Case Control Consortium, and results indicated that BMDR could effectively detect significant gene-gene interactions.
Collapse
|
9
|
Laengsri V, Shoombuatong W, Adirojananon W, Nantasenamat C, Prachayasittikul V, Nuchnoi P. ThalPred: a web-based prediction tool for discriminating thalassemia trait and iron deficiency anemia. BMC Med Inform Decis Mak 2019; 19:212. [PMID: 31699079 PMCID: PMC6836478 DOI: 10.1186/s12911-019-0929-2] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2017] [Accepted: 10/14/2019] [Indexed: 01/07/2023] Open
Abstract
Background The hypochromic microcytic anemia (HMA) commonly found in Thailand are iron deficiency anemia (IDA) and thalassemia trait (TT). Accurate discrimination between IDA and TT is an important issue and better methods are urgently needed. Although considerable RBC formulas and indices with various optimal cut-off values have been developed, distinguishing between IDA and TT is still a challenging problem due to the diversity of various anemic populations. To address this problem, it is desirable to develop an improved and automated prediction model for discriminating IDA from TT. Methods We retrospectively collected laboratory data of HMA found in Thai adults. Five machine learnings, including k-nearest neighbor (k-NN), decision tree, random forest (RF), artificial neural network (ANN) and support vector machine (SVM), were applied to construct a discriminant model. Performance was assessed and compared with thirteen existing discriminant formulas and indices. Results The data of 186 patients (146 patients with TT and 40 with IDA) were enrolled. The interpretable rules derived from the RF model were proposed to demonstrate the combination of RBC indices for discriminating IDA from TT. A web-based tool ‘ThalPred’ was implemented using an SVM model based on seven RBC parameters. ThalPred achieved prediction results with an external accuracy, MCC and AUC of 95.59, 0.87 and 0.98, respectively. Conclusion ThalPred and an interpretable rule were provided for distinguishing IDA from TT. For the convenience of health care team experimental scientists, a web-based tool has been established at http://codes.bio/thalpred/ by which users can easily get their desired screening test result without the need to go through the underlying mathematical and computational details.
Collapse
Affiliation(s)
- V Laengsri
- Center for Research and Innovation, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand.,Department of Clinical Microscopy, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - W Shoombuatong
- Center of Data Mining and Medical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - W Adirojananon
- Department of Clinical Microscopy, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - C Nantasenamat
- Center of Data Mining and Medical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - V Prachayasittikul
- Department of Clinical Microbiology and Applied Technology, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - P Nuchnoi
- Center for Research and Innovation, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand. .,Department of Clinical Microscopy, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand.
| |
Collapse
|
10
|
Fernández-Santiago R, Martín-Flores N, Antonelli F, Cerquera C, Moreno V, Bandres-Ciga S, Manduchi E, Tolosa E, Singleton AB, Moore JH, The International Parkinson’s Disease Genomics Consortium, Martí MJ, Ezquerra M, Malagelada C. SNCA and mTOR Pathway Single Nucleotide Polymorphisms Interact to Modulate the Age at Onset of Parkinson's Disease. Mov Disord 2019; 34:1333-1344. [PMID: 31234232 PMCID: PMC7322732 DOI: 10.1002/mds.27770] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Revised: 04/25/2019] [Accepted: 05/27/2019] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Single nucleotide polymorphisms (SNPs) in the α-synuclein (SNCA) gene are associated with differential risk and age at onset (AAO) of both idiopathic and Leucine-rich repeat kinase 2 (LRRK2)-associated Parkinson's disease (PD). Yet potential combinatory or synergistic effects among several modulatory SNPs for PD risk or AAO remain largely underexplored. OBJECTIVES The mechanistic target of rapamycin (mTOR) signaling pathway is functionally impaired in PD. Here we explored whether SNPs in the mTOR pathway, alone or by epistatic interaction with known susceptibility factors, can modulate PD risk and AAO. METHODS Based on functional relevance, we selected a total of 64 SNPs mapping to a total of 57 genes from the mTOR pathway and genotyped a discovery series cohort encompassing 898 PD patients and 921 controls. As a replication series, we screened 4170 PD and 3014 controls available from the International Parkinson's Disease Genomics Consortium. RESULTS In the discovery series cohort, we found a 4-loci interaction involving STK11 rs8111699, FCHSD1 rs456998, GSK3B rs1732170, and SNCA rs356219, which was associated with an increased risk of PD (odds ratio = 2.59, P < .001). In addition, we also found a 3-loci epistatic combination of RPTOR rs11868112 and RPS6KA2 rs6456121 with SNCA rs356219, which was associated (odds ratio = 2.89; P < .0001) with differential AAO. The latter was further validated (odds ratio = 1.56; P = 0.046-0.047) in the International Parkinson's Disease Genomics Consortium cohort. CONCLUSIONS These findings indicate that genetic variability in the mTOR pathway contributes to SNCA effects in a nonlinear epistatic manner to modulate differential AAO in PD, unraveling the contribution of this cascade in the pathogenesis of the disease. © 2019 International Parkinson and Movement Disorder Society.
Collapse
Affiliation(s)
- Rubén Fernández-Santiago
- Lab of Parkinson Disease and Other Neurodegenerative Movement Disorders, institut d’Investigacions Biomédiques August Pi i Sunyer, Barcelona, Catalonia, Spain
- Neurology Service, Hospital Clinic de Barcelona, Barcelona, Catalonia, Spain
- Networked Centre for Biomedical Research of Neurodegenerative Diseases, Madrid, Spain
| | - Núria Martín-Flores
- Department of Biomedicine, Unit of Biochemistry, Universitat de Barcelona, Barcelona, Catalonia, Spain
- institute of Neurosciences, University of Barcelona, Barcelona, Catalonia, Spain
| | - Francesca Antonelli
- Neurology Service, Hospital Clinic de Barcelona, Barcelona, Catalonia, Spain
| | - Catalina Cerquera
- Neurology Service, Hospital Clinic de Barcelona, Barcelona, Catalonia, Spain
| | - Verónica Moreno
- Neurology Service, Hospital Clinic de Barcelona, Barcelona, Catalonia, Spain
| | - Sara Bandres-Ciga
- Laboratory of Neurogenetics, National institute on Aging, National institutes of Health, Bethesda, Maryland, USA
- instituto de investigación Biosanitaria de Granada (ibs. GRANADA), Granada, Spain
| | - Elisabetta Manduchi
- The Perelman School of Medicine University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Eduard Tolosa
- Lab of Parkinson Disease and Other Neurodegenerative Movement Disorders, institut d’Investigacions Biomédiques August Pi i Sunyer, Barcelona, Catalonia, Spain
- Neurology Service, Hospital Clinic de Barcelona, Barcelona, Catalonia, Spain
- Networked Centre for Biomedical Research of Neurodegenerative Diseases, Madrid, Spain
| | - Andrew B. Singleton
- Laboratory of Neurogenetics, National institute on Aging, National institutes of Health, Bethesda, Maryland, USA
| | - Jason H. Moore
- The Perelman School of Medicine University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | | | - María-Josep Martí
- Lab of Parkinson Disease and Other Neurodegenerative Movement Disorders, institut d’Investigacions Biomédiques August Pi i Sunyer, Barcelona, Catalonia, Spain
- Neurology Service, Hospital Clinic de Barcelona, Barcelona, Catalonia, Spain
- Networked Centre for Biomedical Research of Neurodegenerative Diseases, Madrid, Spain
| | - Mario Ezquerra
- Lab of Parkinson Disease and Other Neurodegenerative Movement Disorders, institut d’Investigacions Biomédiques August Pi i Sunyer, Barcelona, Catalonia, Spain
- Neurology Service, Hospital Clinic de Barcelona, Barcelona, Catalonia, Spain
- Networked Centre for Biomedical Research of Neurodegenerative Diseases, Madrid, Spain
| | - Cristina Malagelada
- Department of Biomedicine, Unit of Biochemistry, Universitat de Barcelona, Barcelona, Catalonia, Spain
- institute of Neurosciences, University of Barcelona, Barcelona, Catalonia, Spain
| |
Collapse
|
11
|
Yang CH, Chuang LY, Lin YD. Multiobjective multifactor dimensionality reduction to detect SNP-SNP interactions. Bioinformatics 2019; 34:2228-2236. [PMID: 29471406 DOI: 10.1093/bioinformatics/bty076] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2017] [Accepted: 02/16/2018] [Indexed: 11/12/2022] Open
Abstract
Motivation Single-nucleotide polymorphism (SNP)-SNP interactions (SSIs) are popular markers for understanding disease susceptibility. Multifactor dimensionality reduction (MDR) can successfully detect considerable SSIs. Currently, MDR-based methods mainly adopt a single-objective function (a single measure based on contingency tables) to detect SSIs. However, generally, a single-measure function might not yield favorable results due to potential model preferences and disease complexities. Approach This study proposes a multiobjective MDR (MOMDR) method that is based on a contingency table of MDR as an objective function. MOMDR considers the incorporated measures, including correct classification and likelihood rates, to detect SSIs and adopts set theory to predict the most favorable SSIs with cross-validation consistency. MOMDR enables simultaneously using multiple measures to determine potential SSIs. Results Three simulation studies were conducted to compare the detection success rates of MOMDR and single-objective MDR (SOMDR), revealing that MOMDR had higher detection success rates than SOMDR. Furthermore, the Wellcome Trust Case Control Consortium dataset was analyzed by MOMDR to detect SSIs associated with coronary artery disease. Availability and implementation: MOMDR is freely available at https://goo.gl/M8dpDg. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Cheng-Hong Yang
- Department of Electronic Engineering, National Kaohsiung University of Applied Sciences, Kaohsiung, Taiwan.,Graduate Institute of Clinical Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Li-Yeh Chuang
- Department of Chemical Engineering and Institute of Biotechnology and Chemical Engineering, I-Shou University, Kaohsiung, Taiwan
| | - Yu-Da Lin
- Department of Electronic Engineering, National Kaohsiung University of Applied Sciences, Kaohsiung, Taiwan
| |
Collapse
|
12
|
Yang CH, Lin YD, Chuang LY. Multiple-Criteria Decision Analysis-Based Multifactor Dimensionality Reduction for Detecting Gene-Gene Interactions. IEEE J Biomed Health Inform 2018; 23:416-426. [PMID: 29993963 DOI: 10.1109/jbhi.2018.2790951] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Gene-gene interactions (GGIs) are important markers for determining susceptibility to a disease. Multifactor dimensionality reduction (MDR) is a popular algorithm for detecting GGIs and primarily adopts the correct classification rate (CCR) to assess the quality of a GGI. However, CCR measurement alone may not successfully detect certain GGIs because of potential model preferences and disease complexities. In this study, multiple-criteria decision analysis (MCDA) based on MDR was named MCDA-MDR and proposed for detecting GGIs. MCDA facilitates MDR to simultaneously adopt multiple measures within the two-way contingency table of MDR to assess GGIs; the CCR and rule utility measure were employed. Cross-validation consistency was adopted to determine the most favorable GGIs among the Pareto sets. Simulation studies were conducted to compare the detection success rates of the MDR-only-based measure and MCDA-MDR, revealing that MCDA-MDR had superior detection success rates. The Wellcome Trust Case Control Consortium dataset was analyzed using MCDA-MDR to detect GGIs associated with coronary artery disease, and MCDA-MDR successfully detected numerous significant GGIs (p < 0.001). MCDA-MDR performance assessment revealed that the applied MCDA successfully enhanced the GGI detection success rate of the MDR-based method compared with MDR alone.
Collapse
|
13
|
MTOR Pathway-Based Discovery of Genetic Susceptibility to L-DOPA-Induced Dyskinesia in Parkinson's Disease Patients. Mol Neurobiol 2018; 56:2092-2100. [PMID: 29992529 DOI: 10.1007/s12035-018-1219-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2018] [Accepted: 06/29/2018] [Indexed: 12/31/2022]
Abstract
Dyskinesia induced by L-DOPA administration (LID) is one of the most invalidating adverse effects of the gold standard treatment restoring dopamine transmission in Parkinson's disease (PD). However, LID manifestation in parkinsonian patients is variable and heterogeneous. Here, we performed a candidate genetic pathway analysis of the mTOR signaling cascade to elucidate a potential genetic contribution to LID susceptibility, since mTOR inhibition ameliorates LID in PD animal models. We screened 64 single nucleotide polymorphisms (SNPs) mapping to 57 genes of the mTOR pathway in a retrospective cohort of 401 PD cases treated with L-DOPA (70 PD with moderate/severe LID and 331 with no/mild LID). We performed classic allelic, genotypic, and epistatic analyses to evaluate the association of individual or combinations of SNPs with LID onset and with LID severity after initiation of L-DOPA treatment. As for the time to LID onset, we found significant associations with SNP rs1043098 in the EIF4EBP2 gene and also with an epistatic interaction involving EIF4EBP2 rs1043098, RICTOR rs2043112, and PRKCA rs4790904. For LID severity, we found significant association with HRAS rs12628 and PRKN rs1801582 and also with a four-loci epistatic combination involving RPS6KB1 rs1292034, HRAS rs12628, RPS6KA2 rs6456121, and FCHSD1 rs456998. These findings indicate that the mTOR pathway contributes genetically to LID susceptibility. Our study could help to identify the most susceptible PD patients to L-DOPA in order to prevent the appearance of early and/or severe LID in a future. This information could also be used to stratify PD patients in clinical trials in a more accurate way.
Collapse
|
14
|
Novel genetic associations and gene-gene interactions of chemokine receptor and chemokine genetic polymorphisms in HIV/AIDS. AIDS 2017; 31:1235-1243. [PMID: 28358741 DOI: 10.1097/qad.0000000000001491] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
OBJECTIVE To investigate the influence of candidate polymorphisms on chemokine receptor/ligand genes on HIV infection and AIDS progression (HIV/AIDS). DESIGN Fifteen polymorphisms of the CCR3, CCR4, CCR5, CCR6, CCR8, CXCR3, CXCR6, CCL20, CCL22 and CXCL10 genes were analysed in 206 HIV-positive patients classified as rapid progressors (n = 40), or nonrapid progressors (n = 166), and in 294 HIV-seronegative patients. METHODS The polymorphisms were genotyped using minisequencing. Genetic models were tested using binomial logistic regression; nonparametric multifactor dimensionality reduction (MDR) was used to detect gene-gene interactions. RESULTS The CCR3 rs3091250 [TT, adjusted odds ratio (AOR): 2.147, 95% confidence interval (CI) 1.076-4.287, P = 0.030], CCR8 rs2853699 (GC/CC, AOR: 1.577, 95% CI 1.049-2.371, P = 0.029), CXCL10 rs56061981 (CT/TT, AOR: 1.819, 95% CI 1.074-3.081, P = 0.026) and CCL22 rs4359426 (CA/AA, AOR: 1.887, 95% CI 1.021-3.487, P = 0.043) polymorphisms were associated with susceptibility to HIV infection. The CCL20 rs13034664 (CC, OR: 0.214, 95% CI 0.063-0.730, P = 0.014) and CCL22 rs4359426 (CA/AA, OR: 2.685, 95% CI 1.128-6.392, P = 0.026) variants were associated with rapid progression to AIDS. In MDR analyses revealed that the CXCL10 rs56061981 and CCL22 rs4359426 combination was the best model, with 57% accuracy (P = 0.008) for predicting susceptibility to HIV infection. CONCLUSION Our results provide new insights into the influence of candidate chemokine receptor/ligand polymorphisms and significant evidence for gene-gene interactions on HIV/AIDS susceptibility.
Collapse
|
15
|
Marcus MW, Raji OY, Duffy SW, Young RP, Hopkins RJ, Field JK. Incorporating epistasis interaction of genetic susceptibility single nucleotide polymorphisms in a lung cancer risk prediction model. Int J Oncol 2016; 49:361-70. [PMID: 27121382 PMCID: PMC4902078 DOI: 10.3892/ijo.2016.3499] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2016] [Accepted: 02/17/2016] [Indexed: 02/06/2023] Open
Abstract
Incorporation of genetic variants such as single nucleotide polymorphisms (SNPs) into risk prediction models may account for a substantial fraction of attributable disease risk. Genetic data, from 2385 subjects recruited into the Liverpool Lung Project (LLP) between 2000 and 2008, consisting of 20 SNPs independently validated in a candidate-gene discovery study was used. Multifactor dimensionality reduction (MDR) and random forest (RF) were used to explore evidence of epistasis among 20 replicated SNPs. Multivariable logistic regression was used to identify similar risk predictors for lung cancer in the LLP risk model for the epidemiological model and extended model with SNPs. Both models were internally validated using the bootstrap method and model performance was assessed using area under the curve (AUC) and net reclassification improvement (NRI). Using MDR and RF, the overall best classifier of lung cancer status were SNPs rs1799732 (DRD2), rs5744256 (IL-18), rs2306022 (ITGA11) with training accuracy of 0.6592 and a testing accuracy of 0.6572 and a cross-validation consistency of 10/10 with permutation testing P<0.0001. The apparent AUC of the epidemiological model was 0.75 (95% CI 0.73–0.77). When epistatic data were incorporated in the extended model, the AUC increased to 0.81 (95% CI 0.79–0.83) which corresponds to 8% increase in AUC (DeLong's test P=2.2e-16); 17.5% by NRI. After correction for optimism, the AUC was 0.73 for the epidemiological model and 0.79 for the extended model. Our results showed modest improvement in lung cancer risk prediction when the SNP epistasis factor was added.
Collapse
Affiliation(s)
- Michael W Marcus
- Roy Castle Lung Cancer Research Programme, The University of Liverpool, Department of Molecular and Clinical Cancer Medicine, Institute of Translational Medicine, Liverpool L7 8TX, UK
| | - Olaide Y Raji
- Roy Castle Lung Cancer Research Programme, The University of Liverpool, Department of Molecular and Clinical Cancer Medicine, Institute of Translational Medicine, Liverpool L7 8TX, UK
| | - Stephen W Duffy
- Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London EC1M 6BQ, UK
| | - Robert P Young
- School of Biological Sciences, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
| | - Raewyn J Hopkins
- School of Biological Sciences, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
| | - John K Field
- Roy Castle Lung Cancer Research Programme, The University of Liverpool, Department of Molecular and Clinical Cancer Medicine, Institute of Translational Medicine, Liverpool L7 8TX, UK
| |
Collapse
|
16
|
Xu J, Qian HX, Hu SP, Liu LY, Zhou M, Feng M, Su J, Ji LD. Gender-Specific Association of ATP2B1 Variants with Susceptibility to Essential Hypertension in the Han Chinese Population. BIOMED RESEARCH INTERNATIONAL 2016; 2016:1910565. [PMID: 26933664 PMCID: PMC4737061 DOI: 10.1155/2016/1910565] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/04/2015] [Revised: 12/10/2015] [Accepted: 12/20/2015] [Indexed: 12/28/2022]
Abstract
Previous genome-wide association studies (GWASs) found that several ATP2B1 variants are associated with essential hypertension (EHT). But the "genome-wide significant" ATP2B1 SNPs (rs2681472, rs2681492, rs17249754, and rs1105378) are in strong linkage disequilibrium (LD) and are located in the same LD block in Chinese populations. We asked whether there are other SNPs within the ATP2B1 gene associated with susceptibility to EHT in the Han Chinese population. Therefore, we performed a case-control study to investigate the association of seven tagSNPs within the ATP2B1 gene and EHT in the Han Chinese population, and we then analyzed the interaction among different SNPs and nongenetic risk factors for EHT. A total of 902 essential hypertensive cases and 902 normotensive controls were involved in the study. All 7 tagSNPs within the ATP2B1 gene were retrieved from HapMap, and genotyping was performed using the Tm-shift genotyping method. Chi-squared test, logistic regression, and propensity score analysis showed that rs17249754 was associated with EHT, particularly in females. The MDR analysis demonstrated that the interaction of rs2070759, rs17249754, TC, TG, and BMI increased the susceptibility to hypertension. Crossover analysis and stratified analysis indicated that BMI has a major effect on the development of hypertension, while ATP2B1 variants have a minor effect.
Collapse
Affiliation(s)
- Jin Xu
- Department of Preventive Medicine, School of Medicine, Ningbo University, Ningbo 315211, China
| | - Hai-xia Qian
- Department of Preventive Medicine, School of Medicine, Ningbo University, Ningbo 315211, China
| | - Su-pei Hu
- Department of Research and Teaching, Ningbo No. 2 Hospital, Ningbo 315010, China
| | - Li-ya Liu
- Department of Preventive Medicine, School of Medicine, Ningbo University, Ningbo 315211, China
| | - Mi Zhou
- Department of Preventive Medicine, School of Medicine, Ningbo University, Ningbo 315211, China
| | - Mei Feng
- Department of Preventive Medicine, School of Medicine, Ningbo University, Ningbo 315211, China
| | - Jia Su
- Department of Gerontology, Ningbo No. 1 Hospital, Ningbo 315010, China
| | - Lin-dan Ji
- Department of Biochemistry, School of Medicine, Ningbo University, Ningbo 315211, China
| |
Collapse
|
17
|
Gola D, Mahachie John JM, van Steen K, König IR. A roadmap to multifactor dimensionality reduction methods. Brief Bioinform 2015; 17:293-308. [PMID: 26108231 PMCID: PMC4793893 DOI: 10.1093/bib/bbv038] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Indexed: 02/02/2023] Open
Abstract
Complex diseases are defined to be determined by multiple genetic and environmental factors alone as well as in interactions. To analyze interactions in genetic data, many statistical methods have been suggested, with most of them relying on statistical regression models. Given the known limitations of classical methods, approaches from the machine-learning community have also become attractive. From this latter family, a fast-growing collection of methods emerged that are based on the Multifactor Dimensionality Reduction (MDR) approach. Since its first introduction, MDR has enjoyed great popularity in applications and has been extended and modified multiple times. Based on a literature search, we here provide a systematic and comprehensive overview of these suggested methods. The methods are described in detail, and the availability of implementations is listed. Most recent approaches offer to deal with large-scale data sets and rare variants, which is why we expect these methods to even gain in popularity.
Collapse
|
18
|
Association of P2Y12 gene promoter DNA methylation with the risk of clopidogrel resistance in coronary artery disease patients. BIOMED RESEARCH INTERNATIONAL 2014; 2014:450814. [PMID: 24745016 PMCID: PMC3976931 DOI: 10.1155/2014/450814] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/04/2014] [Revised: 02/10/2014] [Accepted: 02/10/2014] [Indexed: 02/06/2023]
Abstract
Background. Clopidogrel inhibits the ADP receptor P2Y12 to keep down the platelet aggregation. The goal of our study is to investigate the contribution of P2Y12 promoter DNA methylation to the risk of clopidogrel resistance (CR). Methods. The platelet functions were measured by the VerifyNow P2Y12 assay. Applying the bisulfite pyrosequencing technology, DNA methylation levels of two CpG dinucleotides on P2Y12 promoter were tested among 49 CR cases and 57 non-CR controls. We also investigated the association among P2Y12 DNA methylation, various biochemical characteristics, and CR. Result. Lower methylation of two CpGs indicated the poorer clopidogrel response (CpG1, P = 0.009; CpG2, P = 0.022) in alcohol abusing status. Meanwhile CpG1 methylation was inversely correlated with CR in smoking patients (P = 0.026) and in subgroup of Albumin < 35 (P = 0.002). We observed that the level of DNA methylation might be affected by some clinical markers, such as TBIL, LEVF, Albumin, AST. The results also showed that the quantity of stent, fasting blood-glucose, and lower HbAC1 were the predictors of CR. Conclusions. The evidence from our study indicates that P2Y12 methylation may bring new hints to elaborate the pathogenesis of CR.
Collapse
|
19
|
Son WY, Lee HJ, Yoon HK, Kang SG, Park YM, Yang HJ, Choi JE, An H, Seo HK, Kim L. Gaba transporter SLC6A11 gene polymorphism associated with tardive dyskinesia. Nord J Psychiatry 2014; 68:123-8. [PMID: 23795861 DOI: 10.3109/08039488.2013.780260] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
BACKGROUND Gamma-aminobutyric acid (GABA) insufficiency has been reported to be related to the tardive dyskinesia (TD) susceptibility. Inada et al. (Pharmacogenet Genomics 2008;18:317-23) identified eight genes belonging to GABA receptor signaling pathway that may be involved in TD susceptibility by genome-wide screening and they replicated associations in an independent sample for polymorphisms in SLC6A11 (GABA transporter 3), GABRG3 (c-3 subunit of GABA-A receptor) and GABRB2 (β-2 subunit of GABA-A receptor). In this study, we tried to replicate their finding in a larger Korean sample and find if any of the genes was associated with the susceptibility to TD. METHODS We selected three polymorphisms in SLC6A11 (rs4684742), GABRG3 (rs2061051) and GABRB2 (rs918528) from the previous study. We carried out a case-control study (105 TD and 175 non-TD schizophrenic patients) to identify the association between the three candidate polymorphisms and susceptibility to TD and their epistatic interactions by using the multifactor dimensionality reduction (MDR) algorithm. RESULTS Among the three variants, SCL6A11 genotypes distribution showed a significant difference between the TD and non-TD patients (P = 0.049). However, GABRG3 and GABRB2 genotype distributions were not associated with TD (P = 0.268 and P = 0.976, respectively). Further, our analyses provided significant evidence for gene-gene interactions (SCL6A11, GABRG3 and GABRB2) in the development of TD. The odds ratio increased to 2.53 (CI = 1.515-4.217, P = 0.0003) when the genetic susceptibility to TD was analyzed with the three genes considered altogether through MDR approach. CONCLUSION These results suggest that GABA receptor signaling pathway was associated with the increased susceptibility to TD in Korean schizophrenic patients.
Collapse
Affiliation(s)
- Woo-Young Son
- Woo-Young Son, Department of Psychiatry, Korea University College of Medicine , Seoul , South Korea , and Department of Biology, Cornell University College of Arts and Sciences , NY 14850 , USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Chen GB, Liu N, Klimentidis YC, Zhu X, Zhi D, Wang X, Lou XY. A unified GMDR method for detecting gene-gene interactions in family and unrelated samples with application to nicotine dependence. Hum Genet 2014; 133:139-50. [PMID: 24057800 PMCID: PMC3947150 DOI: 10.1007/s00439-013-1361-9] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2013] [Accepted: 09/05/2013] [Indexed: 11/26/2022]
Abstract
Gene-gene and gene-environment interactions govern a substantial portion of the variation in complex traits and diseases. In convention, a set of either unrelated or family samples are used in detection of such interactions; even when both kinds of data are available, the unrelated and the family samples are analyzed separately, potentially leading to loss in statistical power. In this report, to detect gene-gene interactions we propose a generalized multifactor dimensionality reduction method that unifies analyses of nuclear families and unrelated subjects within the same statistical framework. We used principal components as genetic background controls against population stratification, and when sibling data are included, within-family control were used to correct for potential spurious association at the tested loci. Through comprehensive simulations, we demonstrate that the proposed method can remarkably increase power by pooling unrelated and offspring's samples together as compared with individual analysis strategies and the Fisher's combining p value method while it retains a controlled type I error rate in the presence of population structure. In application to a real dataset, we detected one significant tetragenic interaction among CHRNA4, CHRNB2, BDNF, and NTRK2 associated with nicotine dependence in the Study of Addiction: Genetics and Environment sample, suggesting the biological role of these genes in nicotine dependence development.
Collapse
Affiliation(s)
- Guo-Bo Chen
- Section on Statistical Genetics, Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | - Nianjun Liu
- Section on Statistical Genetics, Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | - Yann C. Klimentidis
- Section on Statistical Genetics, Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | - Xiaofeng Zhu
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Degui Zhi
- Section on Statistical Genetics, Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | - Xujing Wang
- Department of Physics and the Comprehensive Diabetes Center, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | - Xiang-Yang Lou
- Section on Statistical Genetics, Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| |
Collapse
|
21
|
Nizamutdinov II, Andreeva TV, Stepanov VA, Marusin AV, Rogaev EI, Zasedatelev AS, Nasedkina TV. Biochip for determination of genetic markers of sporadic Alzheimer’s disease risk in the Russian Slavic population. Mol Biol 2013. [DOI: 10.1134/s0026893313060101] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
22
|
Ji L, Cai X, Zhang L, Fei L, Wang L, Su J, Lazar L, Xu J, Zhang Y. Association between polymorphisms in the renin-angiotensin-aldosterone system genes and essential hypertension in the Han Chinese population. PLoS One 2013; 8:e72701. [PMID: 24015270 PMCID: PMC3756014 DOI: 10.1371/journal.pone.0072701] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2013] [Accepted: 07/12/2013] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Renin-angiotensin-aldosterone system (RAAS) is the most important endocrine blood pressure control mechanism in our body, genes encoding components of this system have been strong candidates for the investigation of the genetic basis of hypertension. However, previous studies mainly focused on limited polymorphisms, thus we carried out a case-control study in the Han Chinese population to systemically investigate the association between polymorphisms in the RAAS genes and essential hypertension. METHODS 905 essential hypertensive cases and 905 normotensive controls were recruited based on stringent inclusion and exclusion criteria. All 41 tagSNPs within RAAS genes were retrieved from HapMap, and the genotyping was performed using the GenomeLab SNPstream Genotyping System. Logistic regression analysis, Multifactor dimensionality reduction (MDR), stratified analysis and crossover analysis were used to identify and characterize interactions among the SNPs and the non-genetic factors. RESULTS Serum levels of total cholesterol (TC) and triglyceride (TG), and body mass index (BMI) were significantly higher in the hypertensive group than in the control group. Of 41 SNPs genotyped, rs3789678 and rs2493132 within AGT, rs4305 within ACE, rs275645 within AGTR1, rs3802230 and rs10086846 within CYP11B2 were shown to associate with hypertension. The MDR analysis demonstrated that the interaction between BMI and rs4305 increased the susceptibility to hypertension. Crossover analysis and stratified analysis further indicated that BMI has a major effect, and rs4305 has a minor effect. CONCLUSION These novel findings indicated that together with non-genetic factors, these genetic variants in the RAAS may play an important role in determining an individual's susceptibility to hypertension in the Han Chinese.
Collapse
Affiliation(s)
- Lindan Ji
- Department of Biochemistry, School of Medicine, Ningbo University, Ningbo, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
| | - Xiaobo Cai
- Department of Biochemistry, School of Medicine, Ningbo University, Ningbo, China
| | - Lina Zhang
- Department of Preventive Medicine, School of Medicine, Ningbo University, Ningbo, China
| | - Lijuan Fei
- Department of Preventive Medicine, School of Medicine, Ningbo University, Ningbo, China
| | - Lin Wang
- Department of Pathology, School of Medicine, Ningbo University, Ningbo, China
| | - Jia Su
- Department of Cardiology, The Affiliated Ningbo No.1 Hospital, School of Medicine, Ningbo University, Ningbo, China
| | - Lissy Lazar
- Department of Preventive Medicine, School of Medicine, Ningbo University, Ningbo, China
| | - Jin Xu
- Department of Preventive Medicine, School of Medicine, Ningbo University, Ningbo, China
- * E-mail: (JX); (YZ)
| | - Yaping Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
- * E-mail: (JX); (YZ)
| |
Collapse
|
23
|
Association between polymorphisms of alpha-adducin gene and essential hypertension in Chinese population. BIOMED RESEARCH INTERNATIONAL 2013. [PMID: 23509723 DOI: 10.1155/2013/451094.] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The association between polymorphisms of α-adducin (ADD1) gene and essential hypertension is still unclear. Thus, we carried out a case-control study and an interaction analysis to test whether ADD1 is a common candidate gene for hypertension in the Chinese population. Blood samples and information including body mass index (BMI), smoking habit, and alcohol abuse were collected. Meanwhile, total cholesterol, high density lipoprotein, triglyceride were measured by automatic biochemistry analyzer. All 6 tag single nucleotide polymorphisms (tagSNPs) within ADD1 gene were genotyped by SNPstream genotyping system. Multifactor dimensionality reduction (MDR) was used to identify the interactions among the SNPs and the non-genetic factors. Results showed that plasma triglyceride, total cholesterol, and BMI were significantly higher in the hypertensive group than in the control group. Result from genotyping indicated that rs4963 was significantly associated with essential hypertension. After stratification by gender, rs4963 was associated with essential hypertension only in males. MDR analysis indicated that interaction among BMI, rs4963, and rs16843452 were involved in susceptibility of hypertension. The present study indicated that rs4963 within ADD1 gene was associated with essential hypertension in Chinese population, which might be related to altered exonic splicing and disrupted gene regulation.
Collapse
|
24
|
Dai H, Charnigo RJ, Becker ML, Leeder JS, Motsinger-Reif AA. Risk score modeling of multiple gene to gene interactions using aggregated-multifactor dimensionality reduction. BioData Min 2013; 6:1. [PMID: 23294634 PMCID: PMC3560267 DOI: 10.1186/1756-0381-6-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2012] [Accepted: 12/21/2012] [Indexed: 01/27/2023] Open
Abstract
UNLABELLED BACKGROUND Multifactor Dimensionality Reduction (MDR) has been widely applied to detect gene-gene (GxG) interactions associated with complex diseases. Existing MDR methods summarize disease risk by a dichotomous predisposing model (high-risk/low-risk) from one optimal GxG interaction, which does not take the accumulated effects from multiple GxG interactions into account. RESULTS We propose an Aggregated-Multifactor Dimensionality Reduction (A-MDR) method that exhaustively searches for and detects significant GxG interactions to generate an epistasis enriched gene network. An aggregated epistasis enriched risk score, which takes into account multiple GxG interactions simultaneously, replaces the dichotomous predisposing risk variable and provides higher resolution in the quantification of disease susceptibility. We evaluate this new A-MDR approach in a broad range of simulations. Also, we present the results of an application of the A-MDR method to a data set derived from Juvenile Idiopathic Arthritis patients treated with methotrexate (MTX) that revealed several GxG interactions in the folate pathway that were associated with treatment response. The epistasis enriched risk score that pooled information from 82 significant GxG interactions distinguished MTX responders from non-responders with 82% accuracy. CONCLUSIONS The proposed A-MDR is innovative in the MDR framework to investigate aggregated effects among GxG interactions. New measures (pOR, pRR and pChi) are proposed to detect multiple GxG interactions.
Collapse
Affiliation(s)
- Hongying Dai
- Research Development and Clinical Investigation, Children's Mercy Hospital, Kansas City, MO, 64108, USA.
| | | | | | | | | |
Collapse
|
25
|
Applications of multifactor dimensionality reduction to genome-wide data using the R package 'MDR'. Methods Mol Biol 2013; 1019:479-98. [PMID: 23756907 DOI: 10.1007/978-1-62703-447-0_23] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
This chapter describes how to use the R package 'MDR' to search and identify gene-gene interactions in high-dimensional data and illustrates applications for exploratory analysis of multi-locus models by providing specific examples.
Collapse
|
26
|
Association between polymorphisms of alpha-adducin gene and essential hypertension in Chinese population. BIOMED RESEARCH INTERNATIONAL 2012; 2013:451094. [PMID: 23509723 PMCID: PMC3591139 DOI: 10.1155/2013/451094] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/06/2012] [Revised: 10/24/2012] [Accepted: 10/25/2012] [Indexed: 11/21/2022]
Abstract
The association between polymorphisms of α-adducin (ADD1) gene and essential hypertension is still unclear. Thus, we carried out a case-control study and an interaction analysis to test whether ADD1 is a common candidate gene for hypertension in the Chinese population. Blood samples and information including body mass index (BMI), smoking habit, and alcohol abuse were collected. Meanwhile, total cholesterol, high density lipoprotein, triglyceride were measured by automatic biochemistry analyzer. All 6 tag single nucleotide polymorphisms (tagSNPs) within ADD1 gene were genotyped by SNPstream genotyping system. Multifactor dimensionality reduction (MDR) was used to identify the interactions among the SNPs and the non-genetic factors. Results showed that plasma triglyceride, total cholesterol, and BMI were significantly higher in the hypertensive group than in the control group. Result from genotyping indicated that rs4963 was significantly associated with essential hypertension. After stratification by gender, rs4963 was associated with essential hypertension only in males. MDR analysis indicated that interaction among BMI, rs4963, and rs16843452 were involved in susceptibility of hypertension. The present study indicated that rs4963 within ADD1 gene was associated with essential hypertension in Chinese population, which might be related to altered exonic splicing and disrupted gene regulation.
Collapse
|
27
|
Lack of association between STK39 and hypertension in the Chinese population. J Hum Hypertens 2012; 27:294-7. [DOI: 10.1038/jhh.2012.46] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
|
28
|
Gory JJ, Sweeney HC, Reif DM, Motsinger-Reif AA. A comparison of internal model validation methods for multifactor dimensionality reduction in the case of genetic heterogeneity. BMC Res Notes 2012; 5:623. [PMID: 23126544 PMCID: PMC3599301 DOI: 10.1186/1756-0500-5-623] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2012] [Accepted: 10/29/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Determining the genes responsible for certain human traits can be challenging when the underlying genetic model takes a complicated form such as heterogeneity (in which different genetic models can result in the same trait) or epistasis (in which genes interact with other genes and the environment). Multifactor Dimensionality Reduction (MDR) is a widely used method that effectively detects epistasis; however, it does not perform well in the presence of heterogeneity partly due to its reliance on cross-validation for internal model validation. Cross-validation allows for only one "best" model and is therefore inadequate when more than one model could cause the same trait. We hypothesize that another internal model validation method known as a three-way split will be better at detecting heterogeneity models. RESULTS In this study, we test this hypothesis by performing a simulation study to compare the performance of MDR to detect models of heterogeneity with the two different internal model validation techniques. We simulated a range of disease models with both main effects and gene-gene interactions with a range of effect sizes. We assessed the performance of each method using a range of definitions of power. CONCLUSIONS Overall, the power of MDR to detect heterogeneity models was relatively poor, especially under more conservative (strict) definitions of power. While the overall power was low, our results show that the cross-validation approach greatly outperformed the three-way split approach in detecting heterogeneity. This would motivate using cross-validation with MDR in studies where heterogeneity might be present. These results also emphasize the challenge of detecting heterogeneity models and the need for further methods development.
Collapse
Affiliation(s)
- Jeffrey J Gory
- Bioinformatics Research Center, Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA
| | | | | | | |
Collapse
|
29
|
Usacheva MA, Nasedkina TV, Ikonnikova AY, Kulikov AV, Chudinov AV, Lysov YP, Bondarenko EV, Slominskii PA, Shamalov NA, Shetova IM, Limborskaya SA, Zasedatelev AS, Skvortsova VI. Association of polymophisms of renin-angiotensin and hemostasis system genes with ischemic stroke in Russians from central Russia. Mol Biol 2012. [DOI: 10.1134/s0026893312010232] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
30
|
Gene-gene and gene-environmental interactions of childhood asthma: a multifactor dimension reduction approach. PLoS One 2012; 7:e30694. [PMID: 22355322 PMCID: PMC3280263 DOI: 10.1371/journal.pone.0030694] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2011] [Accepted: 12/22/2011] [Indexed: 01/24/2023] Open
Abstract
Background The importance of gene-gene and gene-environment interactions on asthma is well documented in literature, but a systematic analysis on the interaction between various genetic and environmental factors is still lacking. Methodology/Principal Findings We conducted a population-based, case-control study comprised of seventh-grade children from 14 Taiwanese communities. A total of 235 asthmatic cases and 1,310 non-asthmatic controls were selected for DNA collection and genotyping. We examined the gene-gene and gene-environment interactions between 17 single-nucleotide polymorphisms in antioxidative, inflammatory and obesity-related genes, and childhood asthma. Environmental exposures and disease status were obtained from parental questionnaires. The model-free and non-parametrical multifactor dimensionality reduction (MDR) method was used for the analysis. A three-way gene-gene interaction was elucidated between the gene coding glutathione S-transferase P (GSTP1), the gene coding interleukin-4 receptor alpha chain (IL4Ra) and the gene coding insulin induced gene 2 (INSIG2) on the risk of lifetime asthma. The testing-balanced accuracy on asthma was 57.83% with a cross-validation consistency of 10 out of 10. The interaction of preterm birth and indoor dampness had the highest training-balanced accuracy at 59.09%. Indoor dampness also interacted with many genes, including IL13, beta-2 adrenergic receptor (ADRB2), signal transducer and activator of transcription 6 (STAT6). We also used likelihood ratio tests for interaction and chi-square tests to validate our results and all tests showed statistical significance. Conclusions/Significance The results of this study suggest that GSTP1, INSIG2 and IL4Ra may influence the lifetime asthma susceptibility through gene-gene interactions in schoolchildren. Home dampness combined with each one of the genes STAT6, IL13 and ADRB2 could raise the asthma risk.
Collapse
|
31
|
Oki NO, Motsinger-Reif AA. Multifactor dimensionality reduction as a filter-based approach for genome wide association studies. Front Genet 2011; 2:80. [PMID: 22303374 PMCID: PMC3268633 DOI: 10.3389/fgene.2011.00080] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2011] [Accepted: 10/26/2011] [Indexed: 11/13/2022] Open
Abstract
Advances in genotyping technology and the multitude of genetic data available now provide a vast amount of data that is proving to be useful in the quest for a better understanding of human genetic diseases through the study of genetic variation. This has led to the development of approaches such as genome wide association studies (GWAS) designed specifically for interrogating variants across the genome for association with disease, typically by testing single locus, univariate associations. More recently it has been accepted that epistatic (interaction) effects may also be great contributors to these genetic effects, and GWAS methods are now being applied to find epistatic effects. The challenge for these methods still remain in prioritization and interpretation of results, as it has also become standard for initial findings to be independently investigated in replication cohorts or functional studies. This is motivating the development and implementation of filter-based approaches to prioritize variants found to be significant in a discovery stage for follow-up for replication. Such filters must be able to detect both univariate and interactive effects. In the current study we present and evaluate the use of multifactor dimensionality reduction (MDR) as such a filter, with simulated data and a wide range of effect sizes. Additionally, we compare the performance of the MDR filter to a similar filter approach using logistic regression (LR), the more traditional approach used in GWAS analysis, as well as evaporative cooling (EC)-another prominent machine learning filtering method. The results of our simulation study show that MDR is an effective method for such prioritization, and that it can detect main effects, and interactions with or without marginal effects. Importantly, it performed as well as EC and LR for main effect models. It also significantly outperforms LR for various two-locus epistatic models, while it has equivalent results as EC for the epistatic models. The results of this study demonstrate the potential of MDR as a filter to detect gene-gene interactions in GWAS studies.
Collapse
Affiliation(s)
- Noffisat O. Oki
- Bioinformatics Research Center, North Carolina State UniversityRaleigh, NC, USA
| | - Alison A. Motsinger-Reif
- Bioinformatics Research Center, North Carolina State UniversityRaleigh, NC, USA
- Department of Statistics, North Carolina State UniversityRaleigh, NC, USA
| |
Collapse
|
32
|
Winham SJ, Motsinger-Reif AA. An R package implementation of multifactor dimensionality reduction. BioData Min 2011; 4:24. [PMID: 21846375 PMCID: PMC3177775 DOI: 10.1186/1756-0381-4-24] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2011] [Accepted: 08/16/2011] [Indexed: 12/27/2022] Open
Abstract
Background A breadth of high-dimensional data is now available with unprecedented numbers of genetic markers and data-mining approaches to variable selection are increasingly being utilized to uncover associations, including potential gene-gene and gene-environment interactions. One of the most commonly used data-mining methods for case-control data is Multifactor Dimensionality Reduction (MDR), which has displayed success in both simulations and real data applications. Additional software applications in alternative programming languages can improve the availability and usefulness of the method for a broader range of users. Results We introduce a package for the R statistical language to implement the Multifactor Dimensionality Reduction (MDR) method for nonparametric variable selection of interactions. This package is designed to provide an alternative implementation for R users, with great flexibility and utility for both data analysis and research. The 'MDR' package is freely available online at http://www.r-project.org/. We also provide data examples to illustrate the use and functionality of the package. Conclusions MDR is a frequently-used data-mining method to identify potential gene-gene interactions, and alternative implementations will further increase this usage. We introduce a flexible software package for R users.
Collapse
Affiliation(s)
- Stacey J Winham
- Department of Statistics, North Carolina State University, Raleigh NC 27695, USA.
| | | |
Collapse
|
33
|
Practical and theoretical considerations in study design for detecting gene-gene interactions using MDR and GMDR approaches. PLoS One 2011; 6:e16981. [PMID: 21386969 PMCID: PMC3046176 DOI: 10.1371/journal.pone.0016981] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2010] [Accepted: 01/19/2011] [Indexed: 12/25/2022] Open
Abstract
Detection of interacting risk factors for complex traits is challenging. The choice of an appropriate method, sample size, and allocation of cases and controls are serious concerns. To provide empirical guidelines for planning such studies and data analyses, we investigated the performance of the multifactor dimensionality reduction (MDR) and generalized MDR (GMDR) methods under various experimental scenarios. We developed the mathematical expectation of accuracy and used it as an indicator parameter to perform a gene-gene interaction study. We then examined the statistical power of GMDR and MDR within the plausible range of accuracy (0.50∼0.65) reported in the literature. The GMDR with covariate adjustment had a power of>80% in a case-control design with a sample size of≥2000, with theoretical accuracy ranging from 0.56 to 0.62. However, when the accuracy was<0.56, a sample size of≥4000 was required to have sufficient power. In our simulations, the GMDR outperformed the MDR under all models with accuracy ranging from 0.56∼0.62 for a sample size of 1000–2000. However, the two methods performed similarly when the accuracy was outside this range or the sample was significantly larger. We conclude that with adjustment of a covariate, GMDR performs better than MDR and a sample size of 1000∼2000 is reasonably large for detecting gene-gene interactions in the range of effect size reported by the current literature; whereas larger sample size is required for more subtle interactions with accuracy<0.56.
Collapse
|
34
|
A comparison of multifactor dimensionality reduction and L1-penalized regression to identify gene-gene interactions in genetic association studies. Stat Appl Genet Mol Biol 2011; 10:Article 4. [PMID: 21291414 DOI: 10.2202/1544-6115.1613] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Recently, the amount of high-dimensional data has exploded, creating new analytical challenges for human genetics. Furthermore, much evidence suggests that common complex diseases may be due to complex etiologies such as gene-gene interactions, which are difficult to identify in high-dimensional data using traditional statistical approaches. Data-mining approaches are gaining popularity for variable selection in association studies, and one of the most commonly used methods to evaluate potential gene-gene interactions is Multifactor Dimensionality Reduction (MDR). Additionally, a number of penalized regression techniques, such as Lasso, are gaining popularity within the statistical community and are now being applied to association studies, including extensions for interactions. In this study, we compare the performance of MDR, the traditional lasso with L1 penalty (TL1), and the group lasso for categorical data with group-wise L1 penalty (GL1) to detect gene-gene interactions through a broad range of simulations. We find that each method has both advantages and disadvantages, and relative performance is context dependent. TL1 frequently over-fits, identifying false positive as well as true positive loci. MDR has higher power for epistatic models that exhibit independent main effects; for both Lasso methods, main effects tend to dominate. For purely epistatic models, GL1 has the best performance for lower minor allele frequencies, but MDR performs best for higher frequencies. These results provide guidance of when each approach might be best suited for detecting and characterizing interactions with different mechanisms.
Collapse
|
35
|
Winham SJ, Motsinger-Reif AA. The effect of retrospective sampling on estimates of prediction error for multifactor dimensionality reduction. Ann Hum Genet 2011; 75:46-61. [PMID: 20560921 PMCID: PMC2955770 DOI: 10.1111/j.1469-1809.2010.00587.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
The standard in genetic association studies of complex diseases is replication and validation of positive results, with an emphasis on assessing the predictive value of associations. In response to this need, a number of analytical approaches have been developed to identify predictive models that account for complex genetic etiologies. Multifactor Dimensionality Reduction (MDR) is a commonly used, highly successful method designed to evaluate potential gene-gene interactions. MDR relies on classification error in a cross-validation framework to rank and evaluate potentially predictive models. Previous work has demonstrated the high power of MDR, but has not considered the accuracy and variance of the MDR prediction error estimate. Currently, we evaluate the bias and variance of the MDR error estimate as both a retrospective and prospective estimator and show that MDR can both underestimate and overestimate error. We argue that a prospective error estimate is necessary if MDR models are used for prediction, and propose a bootstrap resampling estimate, integrating population prevalence, to accurately estimate prospective error. We demonstrate that this bootstrap estimate is preferable for prediction to the error estimate currently produced by MDR. While demonstrated with MDR, the proposed estimation is applicable to all data-mining methods that use similar estimates.
Collapse
Affiliation(s)
- Stacey J Winham
- Department of Statistics, North Carolina State University, Raleigh, 27695, USA
| | | |
Collapse
|
36
|
Winham SJ, Slater AJ, Motsinger-Reif AA. A comparison of internal validation techniques for multifactor dimensionality reduction. BMC Bioinformatics 2010; 11:394. [PMID: 20650002 PMCID: PMC2920275 DOI: 10.1186/1471-2105-11-394] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2009] [Accepted: 07/22/2010] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND It is hypothesized that common, complex diseases may be due to complex interactions between genetic and environmental factors, which are difficult to detect in high-dimensional data using traditional statistical approaches. Multifactor Dimensionality Reduction (MDR) is the most commonly used data-mining method to detect epistatic interactions. In all data-mining methods, it is important to consider internal validation procedures to obtain prediction estimates to prevent model over-fitting and reduce potential false positive findings. Currently, MDR utilizes cross-validation for internal validation. In this study, we incorporate the use of a three-way split (3WS) of the data in combination with a post-hoc pruning procedure as an alternative to cross-validation for internal model validation to reduce computation time without impairing performance. We compare the power to detect true disease causing loci using MDR with both 5- and 10-fold cross-validation to MDR with 3WS for a range of single-locus and epistatic disease models. Additionally, we analyze a dataset in HIV immunogenetics to demonstrate the results of the two strategies on real data. RESULTS MDR with 3WS is computationally approximately five times faster than 5-fold cross-validation. The power to find the exact true disease loci without detecting false positive loci is higher with 5-fold cross-validation than with 3WS before pruning. However, the power to find the true disease causing loci in addition to false positive loci is equivalent to the 3WS. With the incorporation of a pruning procedure after the 3WS, the power of the 3WS approach to detect only the exact disease loci is equivalent to that of MDR with cross-validation. In the real data application, the cross-validation and 3WS analyses indicate the same two-locus model. CONCLUSIONS Our results reveal that the performance of the two internal validation methods is equivalent with the use of pruning procedures. The specific pruning procedure should be chosen understanding the trade-off between identifying all relevant genetic effects but including false positives and missing important genetic factors. This implies 3WS may be a powerful and computationally efficient approach to screen for epistatic effects, and could be used to identify candidate interactions in large-scale genetic studies.
Collapse
Affiliation(s)
- Stacey J Winham
- Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA
| | - Andrew J Slater
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC 27695, USA
- Department of Genetics, North Carolina State University, Raleigh, NC 27695, USA
| | - Alison A Motsinger-Reif
- Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC 27695, USA
| |
Collapse
|
37
|
Park YM, Lee HJ, Kang SG, Choi JE, Cho JH, Kim L. Lack of Association between Glutathione S-Transferase-M1, -T1, and -P1 Polymorphisms and Olanzapine-Induced Weight Gain in Korean Schizophrenic Patients. Psychiatry Investig 2010; 7:147-52. [PMID: 20577625 PMCID: PMC2890870 DOI: 10.4306/pi.2010.7.2.147] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/23/2009] [Revised: 09/30/2009] [Accepted: 05/03/2010] [Indexed: 11/19/2022] Open
Abstract
OBJECTIVE Oxidative stress may be an important pathogenic mechanism in the obesity and metabolic syndrome. The aims of this study was to assess the possible association between the oxidative stress related Glutathione S-Transferase genes (GST-M1, GST-T1, and GST-P1) variants and the olanzapine-induced weight gain in Korean schizophrenic patients. METHODS We categorized 78 schizophrenic patients into two groups the more than 7% weight gain from baseline (weight gain >/=7%) and the less weight gain (weight gain <7%) groups according to weight change between before and after long-term olanzapine treatment (440+/-288 days). All participants were genotyped for the GST-M1, GST-T1 and GST-P1 genes. Differences in allele frequencies between cohorts with different body weight changes were evaluated by a chi-square analysis and Fisher's exact test. The multifactor dimensionality reduction (MDR) approach was used to analyze gene-gene interactions. RESULTS Mean body weight gain was 5.42 kg. There was no difference in the null genotype distribution of GST-M1 and -T1 between subjects with body weight gain >/=7% compared to subjects with body weight gain <7% (p>0.05). No significant difference in GST-P1 genotype and allele frequencies were observed between the groups (p>0.05). MDR analysis did not show a significant interaction between the three GST gene variants and susceptibility to weight gain (p>0.05). CONCLUSION These findings do not support a relationship between the genetic variants of three GST genes (GST-M1, -T1 and -P1) and weight gain in Korean schizophrenic patients receiving olanzapine treatment.
Collapse
Affiliation(s)
- Young-Min Park
- Department of Neuropsychiatry, Inje University College of Medicine, Ilsan Paik Hospital, Goyang, Korea
| | - Heon-Jeong Lee
- Department of Psychiatry, Korea University College of Medicine, Seoul, Korea
- Division of Brain Korea 21 Biomedical Science, Korea University College of Medicine, Seoul, Korea
| | - Seung-Gul Kang
- Department of Psychiatry, Korea University College of Medicine, Seoul, Korea
| | - Jung-Eun Choi
- Department of Psychiatry, Korea University College of Medicine, Seoul, Korea
- Division of Brain Korea 21 Biomedical Science, Korea University College of Medicine, Seoul, Korea
| | - Jae-Hyuck Cho
- Department of Neuropsychiatry, Inje University College of Medicine, Ilsan Paik Hospital, Goyang, Korea
| | - Leen Kim
- Department of Psychiatry, Korea University College of Medicine, Seoul, Korea
| |
Collapse
|
38
|
Edwards TL, Torstensen E, Dudek S, Martin ER, Ritchie MD. A cross-validation procedure for general pedigrees and matched odds ratio fitness metric implemented for the multifactor dimensionality reduction pedigree disequilibrium test. Genet Epidemiol 2010; 34:194-9. [PMID: 19697353 DOI: 10.1002/gepi.20447] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
As genetic epidemiology looks beyond mapping single disease susceptibility loci, interest in detecting epistatic interactions between genes has grown. The dimensionality and comparisons required to search the epistatic space and the inference for a significant result pose challenges for testing epistatic disease models. The multifactor dimensionality reduction-pedigree disequilibrium test (MDR-PDT) was developed to test for multilocus models in pedigree data. In the present study we rigorously tested MDR-PDT with new cross-validation (CV) (both 5- and 10-fold) and omnibus model selection algorithms by simulating a range of heritabilities, odds ratios, minor allele frequencies, sample sizes, and numbers of interacting loci. Power was evaluated using 100, 500, and 1,000 families, with minor allele frequencies 0.2 and 0.4 and broad-sense heritabilities of 0.005, 0.01, 0.03, 0.05, and 0.1 for 2- and 3-locus purely epistatic penetrance models. We also compared the prediction error (PE) measure of effect with a predicted matched odds ratio (MOR) for final model selection and testing. We report that the CV procedure is valid with the permutation test, MDR-PDT performs similarly with 5- and 10-fold CV, and that the MOR is more powerful than PE as the fitness metric for MDR-PDT.
Collapse
Affiliation(s)
- Todd L Edwards
- Center for Genetic Epidemiology and Statistical Genetics, Miami Institute for Human Genomics, University of Miami Miller School of Medicine, Florida, USA
| | | | | | | | | |
Collapse
|
39
|
Motsinger-Reif AA, Antas PRZ, Oki NO, Levy S, Holland SM, Sterling TR. Polymorphisms in IL-1beta, vitamin D receptor Fok1, and Toll-like receptor 2 are associated with extrapulmonary tuberculosis. BMC MEDICAL GENETICS 2010; 11:37. [PMID: 20196868 PMCID: PMC2837863 DOI: 10.1186/1471-2350-11-37] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/26/2009] [Accepted: 03/02/2010] [Indexed: 11/17/2022]
Abstract
Background Human genetic variants may affect tuberculosis susceptibility, but the immunologic correlates of the genetic variants identified are often unclear. Methods We conducted a pilot case-control study to identify genetic variants associated with extrapulmonary tuberculosis in patients with previously characterized immune defects: low CD4+ lymphocytes and low unstimulated cytokine production. Two genetic association approaches were used: 1) variants previously associated with tuberculosis risk; 2) single nucleotide polymorphisms (SNPs) in candidate genes involved in tuberculosis pathogenesis. Single locus association tests and multifactor dimensionality reduction (MDR) assessed main effects and multi-locus interactions. Results There were 24 extrapulmonary tuberculosis cases (18 black), 24 pulmonary tuberculosis controls (19 black) and 57 PPD+ controls (49 black). In approach 1, 22 SNPs and 3 microsatellites were assessed. In single locus association tests, interleukin (IL)-1β +3953 C/T was associated with extrapulmonary tuberculosis compared to PPD+ controls (P = 0.049). Among the sub-set of patients who were black, genotype frequencies of the vitamin D receptor (VDR) Fok1 A/G SNP were significantly different in extrapulmonary vs. pulmonary TB patients (P = 0.018). In MDR analysis, the toll-like receptor (TLR) 2 microsatellite had 76% prediction accuracy for extrapulmonary tuberculosis in blacks (P = 0.002). In approach 2, 613 SNPs in 26 genes were assessed. None were associated with extrapulmonary tuberculosis. Conclusions In this pilot study among extrapulmonary tuberculosis patients with well-characterized immune defects, genetic variants in IL-1β, VDR Fok1, and TLR2 were associated with an increased risk of extrapulmonary disease. Additional studies of the underlying mechanism of these genetic variants are warranted.
Collapse
Affiliation(s)
- Alison A Motsinger-Reif
- Division of Infectious Diseases, Department of Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA
| | | | | | | | | | | |
Collapse
|
40
|
Edwards TL, Turner SD, Torstenson ES, Dudek SM, Martin ER, Ritchie MD. A general framework for formal tests of interaction after exhaustive search methods with applications to MDR and MDR-PDT. PLoS One 2010; 5:e9363. [PMID: 20186329 PMCID: PMC2826406 DOI: 10.1371/journal.pone.0009363] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2009] [Accepted: 12/16/2009] [Indexed: 02/03/2023] Open
Abstract
The initial presentation of multifactor dimensionality reduction (MDR) featured cross-validation to mitigate over-fitting, computationally efficient searches of the epistatic model space, and variable construction with constructive induction to alleviate the curse of dimensionality. However, the method was unable to differentiate association signals arising from true interactions from those due to independent main effects at individual loci. This issue leads to problems in inference and interpretability for the results from MDR and the family-based compliment the MDR-pedigree disequilibrium test (PDT). A suggestion from previous work was to fit regression models post hoc to specifically evaluate the null hypothesis of no interaction for MDR or MDR-PDT models. We demonstrate with simulation that fitting a regression model on the same data as that analyzed by MDR or MDR-PDT is not a valid test of interaction. This is likely to be true for any other procedure that searches for models, and then performs an uncorrected test for interaction. We also show with simulation that when strong main effects are present and the null hypothesis of no interaction is true, that MDR and MDR-PDT reject at far greater than the nominal rate. We also provide a valid regression-based permutation test procedure that specifically tests the null hypothesis of no interaction, and does not reject the null when only main effects are present. The regression-based permutation test implemented here conducts a valid test of interaction after a search for multilocus models, and can be applied to any method that conducts a search to find a multilocus model representing an interaction.
Collapse
Affiliation(s)
- Todd L. Edwards
- Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
- Center for Genetic Epidemiology and Statistical Genetics, John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, Florida, United States of America
| | - Stephen D. Turner
- Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Eric S. Torstenson
- Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Scott M. Dudek
- Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Eden R. Martin
- Center for Genetic Epidemiology and Statistical Genetics, John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, Florida, United States of America
| | - Marylyn D. Ritchie
- Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
- * E-mail:
| |
Collapse
|
41
|
Kang SG, Lee HJ, Choi JE, An H, Rhee M, Kim L. Association study between glutathione S-transferase GST-M1, GST-T1, and GST-P1 polymorphisms and tardive dyskinesia. Hum Psychopharmacol 2009; 24:55-60. [PMID: 19051221 DOI: 10.1002/hup.988] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
OBJECTIVES Data from several studies suggest that oxidative stress may play a role in the pathophysiology of tardive dyskinesia (TD). Glutathione S-transferase (GST) enzymes play important roles in protecting cells against oxidative stress. In the present study, we investigated the hypothesis that polymorphisms in genes for these detoxifying enzymes can influence susceptibility to TD in patients with schizophrenia. METHODS The GST-M1, GST-T1, and GST-P1 loci were analyzed by polymerase chain reaction (PCR)-based methods in 83 schizophrenic patients with TD and 126 schizophrenic without TD who were matched for antipsychotic drug exposure and other relevant variables. The multifactor dimensionality reduction (MDR) approach was used to analyze gene-gene interactions. RESULTS There were no significant differences in the distributions of the GST-M1, GST-T1, and GST-P1 genotypes between the TD and non-TD groups (p > 0.05). However, in comparison of the severity of TD among genotypes using Poisson regression showed that Ile/Ile genotype of GST-P1 had higher AIMS score compared to Ile/Val + Val/Val genotypes (X(2) = 7.13, p = 0.008). MDR analysis did not show a significant interaction between the three GST gene variants and susceptibility to TD (p > 0.05). CONCLUSIONS These results suggest that GST gene polymorphisms do not confer increased susceptibility to TD in patients with schizophrenia but TD severity might be related with GST-P1 variants.
Collapse
Affiliation(s)
- Seung-Gul Kang
- Department of Psychiatry, Korea University College of Medicine, Seoul, South Korea
| | | | | | | | | | | |
Collapse
|
42
|
Edwards TL, Lewis K, Velez DR, Dudek S, Ritchie MD. Exploring the performance of Multifactor Dimensionality Reduction in large scale SNP studies and in the presence of genetic heterogeneity among epistatic disease models. Hum Hered 2008; 67:183-92. [PMID: 19077437 DOI: 10.1159/000181157] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2008] [Accepted: 07/01/2008] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND/AIMS In genetic studies of complex disease a consideration for the investigator is detection of joint effects. The Multifactor Dimensionality Reduction (MDR) algorithm searches for these effects with an exhaustive approach. Previously unknown aspects of MDR performance were the power to detect interactive effects given large numbers of non-model loci or varying degrees of heterogeneity among multiple epistatic disease models. METHODS To address the performance with many non-model loci, datasets of 500 cases and 500 controls with 100 to 10,000 SNPs were simulated for two-locus models, and one hundred 500-case/500-control datasets with 100 and 500 SNPs were simulated for three-locus models. Multiple levels of locus heterogeneity were simulated in several sample sizes. RESULTS These results show MDR is robust to locus heterogeneity when the definition of power is not as conservative as in previous simulation studies where all model loci were required to be found by the method. The results also indicate that MDR performance is related more strongly to broad-sense heritability than sample size and is not greatly affected by non-model loci. CONCLUSIONS A study in which a population with high heritability estimates is sampled predisposes the MDR study to success more than a larger ascertainment in a population with smaller estimates.
Collapse
Affiliation(s)
- Todd L Edwards
- Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, Tenn., USA
| | | | | | | | | |
Collapse
|
43
|
Motsinger-Reif AA, Reif DM, Fanelli TJ, Ritchie MD. A comparison of analytical methods for genetic association studies. Genet Epidemiol 2008; 32:767-78. [DOI: 10.1002/gepi.20345] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
44
|
Haasl RJ, Ahmadi MR, Meethal SV, Gleason CE, Johnson SC, Asthana S, Bowen RL, Atwood CS. A luteinizing hormone receptor intronic variant is significantly associated with decreased risk of Alzheimer's disease in males carrying an apolipoprotein E epsilon4 allele. BMC MEDICAL GENETICS 2008; 9:37. [PMID: 18439297 PMCID: PMC2396156 DOI: 10.1186/1471-2350-9-37] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/26/2007] [Accepted: 04/25/2008] [Indexed: 01/04/2023]
Abstract
Genetic and biochemical studies support the apolipoprotein E (APOE) ε4 allele as a major risk factor for late-onset Alzheimer's disease (AD), though ~50% of AD patients do not carry the allele. APOE transports cholesterol for luteinizing hormone (LH)-regulated steroidogenesis, and both LH and neurosteroids have been implicated in the etiology of AD. Since polymorphisms of LH beta-subunit (LHB) and its receptor (LHCGR) have not been tested for their association with AD, we scored AD and age-matched control samples for APOE genotype and 14 polymorphisms of LHB and LHCGR. Thirteen gene-gene interactions between the loci of LHB, LHCGR, and APOE were associated with AD. The most strongly supported of these interactions was between an LHCGR intronic polymorphism (rs4073366; lhcgr2) and APOE in males, which was detected using all three interaction analyses: linkage disequilibrium, multi-dimensionality reduction, and logistic regression. While the APOE ε4 allele carried significant risk of AD in males [p = 0.007, odds ratio (OR) = 3.08(95%confidence interval: 1.37, 6.91)], ε4-positive males carrying 1 or 2 C-alleles at lhcgr2 exhibited significantly decreased risk of AD [OR = 0.06(0.01, 0.38); p = 0.003]. This suggests that the lhcgr2 C-allele or a closely linked locus greatly reduces the risk of AD in males carrying an APOE ε4 allele. The reversal of risk embodied in this interaction powerfully supports the importance of considering the role gene-gene interactions play in the etiology of complex biological diseases and demonstrates the importance of using multiple analytic methods to detect well-supported gene-gene interactions.
Collapse
Affiliation(s)
- Ryan J Haasl
- Section of Geriatrics and Gerontology, Department of Medicine, University of Wisconsin School of Medicine and Public Health, Madison, WI 53705, USA.
| | | | | | | | | | | | | | | |
Collapse
|
45
|
Velez DR, White BC, Motsinger AA, Bush WS, Ritchie MD, Williams SM, Moore JH. A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction. Genet Epidemiol 2007; 31:306-15. [PMID: 17323372 DOI: 10.1002/gepi.20211] [Citation(s) in RCA: 224] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Multifactor dimensionality reduction (MDR) was developed as a method for detecting statistical patterns of epistasis. The overall goal of MDR is to change the representation space of the data to make interactions easier to detect. It is well known that machine learning methods may not provide robust models when the class variable (e.g. case-control status) is imbalanced and accuracy is used as the fitness measure. This is because most methods learn patterns that are relevant for the larger of the two classes. The goal of this study was to evaluate three different strategies for improving the power of MDR to detect epistasis in imbalanced datasets. The methods evaluated were: (1) over-sampling that resamples with replacement the smaller class until the data are balanced, (2) under-sampling that randomly removes subjects from the larger class until the data are balanced, and (3) balanced accuracy [(sensitivity+specificity)/2] as the fitness function with and without an adjusted threshold. These three methods were compared using simulated data with two-locus epistatic interactions of varying heritability (0.01, 0.025, 0.05, 0.1, 0.2, 0.3, 0.4) and minor allele frequency (0.2, 0.4) that were embedded in 100 replicate datasets of varying sample sizes (400, 800, 1600). Each dataset was generated with different ratios of cases to controls (1 : 1, 1 : 2, 1 : 4). We found that the balanced accuracy function with an adjusted threshold significantly outperformed both over-sampling and under-sampling and fully recovered the power. These results suggest that balanced accuracy should be used instead of accuracy for the MDR analysis of epistasis in imbalanced datasets.
Collapse
Affiliation(s)
- Digna R Velez
- Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, Tennessee
| | | | | | | | | | | | | |
Collapse
|
46
|
Musani SK, Shriner D, Liu N, Feng R, Coffey CS, Yi N, Tiwari HK, Allison DB. Detection of gene x gene interactions in genome-wide association studies of human population data. Hum Hered 2007; 63:67-84. [PMID: 17283436 DOI: 10.1159/000099179] [Citation(s) in RCA: 138] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Empirical evidence supporting the commonality of gene x gene interactions, coupled with frequent failure to replicate results from previous association studies, has prompted statisticians to develop methods to handle this important subject. Nonparametric methods have generated intense interest because of their capacity to handle high-dimensional data. Genome-wide association analysis of large-scale SNP data is challenging mathematically and computationally. In this paper, we describe major issues and questions arising from this challenge, along with methodological implications. Data reduction and pattern recognition methods seem to be the new frontiers in efforts to detect gene x gene interactions comprehensively. Currently, there is no single method that is recognized as the 'best' for detecting, characterizing, and interpreting gene x gene interactions. Instead, a combination of approaches with the aim of balancing their specific strengths may be the optimal approach to investigate gene x gene interactions in human data.
Collapse
Affiliation(s)
- Solomon K Musani
- Section on Statistical Genetics, Department of Biostatistics, University of Alabama at Birmingham, Birmingham, AL 35294, USA.
| | | | | | | | | | | | | | | |
Collapse
|