1
|
Wolf G, Leippe P, Onstein S, Goldmann U, Frommelt F, Teoh ST, Girardi E, Wiedmer T, Superti-Furga G. The genetic interaction map of the human solute carrier superfamily. Mol Syst Biol 2025:10.1038/s44320-025-00105-5. [PMID: 40355755 DOI: 10.1038/s44320-025-00105-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2024] [Revised: 03/28/2025] [Accepted: 03/31/2025] [Indexed: 05/15/2025] Open
Abstract
Solute carriers (SLCs), the largest superfamily of transporter proteins in humans with about 450 members, control the movement of molecules across membranes. A typical human cell expresses over 200 different SLCs, yet their collective influence on cell phenotypes is not well understood due to overlapping substrate specificities and expression patterns. To address this, we performed systematic pairwise gene double knockouts using CRISPR-Cas12a and -Cas9 in human colon carcinoma cells. A total of 1,088,605 guide combinations were used to interrogate 35,421 SLC-SLC and SLC-enzyme double knockout combinations across multiple growth conditions, uncovering 1236 genetic interactions with a growth phenotype. Further exploration of an interaction between the mitochondrial citrate/malate exchanger SLC25A1 and the zinc transporter SLC39A1 revealed an unexpected role for SLC39A1 in metabolic reprogramming and anti-apoptotic signaling. This full-scale genetic interaction map of human SLC transporters is the backbone for understanding the intricate functional network of SLCs in cellular systems and generates hypotheses for pharmacological target exploitation in cancer and other diseases. The results are available at https://re-solute.eu/resources/dashboards/genomics/ .
Collapse
Affiliation(s)
- Gernot Wolf
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, 1090, Vienna, Austria
| | - Philipp Leippe
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, 1090, Vienna, Austria
| | - Svenja Onstein
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, 1090, Vienna, Austria
| | - Ulrich Goldmann
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, 1090, Vienna, Austria
| | - Fabian Frommelt
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, 1090, Vienna, Austria
| | - Shao Thing Teoh
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, 1090, Vienna, Austria
| | - Enrico Girardi
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, 1090, Vienna, Austria
- Solgate GmbH, IST Park Building, 3400, Klosterneuburg, Austria
| | - Tabea Wiedmer
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, 1090, Vienna, Austria
| | - Giulio Superti-Furga
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, 1090, Vienna, Austria.
- Center for Physiology and Pharmacology, Medical University of Vienna, 1090, Vienna, Austria.
- Fondazione Ri.MED, Palermo, Italy.
| |
Collapse
|
2
|
Liu S, Liu Y, Li M, Shang S, Cao Y, Shen X, Huang C. Artificial intelligence in autoimmune diseases: a bibliometric exploration of the past two decades. Front Immunol 2025; 16:1525462. [PMID: 40330462 PMCID: PMC12052778 DOI: 10.3389/fimmu.2025.1525462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2024] [Accepted: 03/27/2025] [Indexed: 05/08/2025] Open
Abstract
Objective Autoimmune diseases have long been recognized for their intricate nature and elusive mechanisms, presenting significant challenges in both diagnosis and treatment. The advent of artificial intelligence technology has opened up new possibilities for understanding, diagnosing, predicting, and managing autoimmune disorders. This study aims to explore the current state and emerging trends in the field through bibliometric analysis, providing guidance for future research directions. Methods The study employed the Web of Science Core Collection database for data acquisition and performed bibliometric analysis using CiteSpace, HistCite Pro, and VOSviewer. Results Over the past two decades, 1,695 publications emerged in this research field, including 1,409 research articles and 286 reviews. This investigation unveils the global development landscape predominantly led by the United States and China. The research identifies key institutions, such as Brigham & Women's Hospital, influential journals like the Annals of the Rheumatic Diseases, distinguished authors including Katherine P. Liao, and pivotal articles. It visually maps out the research clusters' evolutionary path over time and explores their applications in patient identification, risk factors, prognosis assessment, diagnosis, classification of disease subtypes, monitoring and decision support, and drug discovery. Conclusion AI is increasingly recognized for its potential in the field of autoimmune diseases, yet it continues to face numerous challenges, including insufficient model validation and difficulties in data integration and computational power. Significant advancements have been demanded to enhance diagnostic precision, improve treatment methodologies, and establish robust frameworks for data protection, thereby facilitating more effective management of these complex conditions.
Collapse
Affiliation(s)
- Sidi Liu
- Department of Rheumatology and Immunology, The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
- Center for Xin’an Medicine and Modernization of Traditional Chinese Medicine of Institute of Health and Medicine (IHM), The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
| | - Yang Liu
- Department of Orthopedics, The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
| | - Ming Li
- Department of Rheumatology and Immunology, The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
- Center for Xin’an Medicine and Modernization of Traditional Chinese Medicine of Institute of Health and Medicine (IHM), The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
| | - Shuangshuang Shang
- Department of Rheumatology and Immunology, The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
- Center for Xin’an Medicine and Modernization of Traditional Chinese Medicine of Institute of Health and Medicine (IHM), The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
| | - Yunxiang Cao
- Department of Rheumatology and Immunology, The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
- Center for Xin’an Medicine and Modernization of Traditional Chinese Medicine of Institute of Health and Medicine (IHM), The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
| | - Xi Shen
- Department of Rheumatology and Immunology, The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
- Center for Xin’an Medicine and Modernization of Traditional Chinese Medicine of Institute of Health and Medicine (IHM), The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
| | - Chuanbing Huang
- Department of Rheumatology and Immunology, The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
- Center for Xin’an Medicine and Modernization of Traditional Chinese Medicine of Institute of Health and Medicine (IHM), The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
| |
Collapse
|
3
|
Lee YH, Song GG. Associations of RANKL levels and polymorphisms with rheumatoid arthritis: A meta-analysis. PLoS One 2025; 20:e0317517. [PMID: 39804865 PMCID: PMC11729962 DOI: 10.1371/journal.pone.0317517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2024] [Accepted: 12/29/2024] [Indexed: 01/16/2025] Open
Abstract
OBJECTIVES This study examined the correlation between circulating receptor activator for nuclear factor-κB ligand (RANKL) levels and rheumatoid arthritis (RA), and investigated the association between polymorphisms in the RANKL gene and susceptibility to RA. METHOD We searched the Medline, Embase, and Cochrane databases for relevant publications up to September 2024. A meta-analysis was conducted to assess serum/plasma RANKL levels in patients with RA and controls, and to explore the relationship between RANKL rs9533156 and rs2277438 polymorphisms and RA susceptibility. RESULTS Ten studies encompassing 1,682 RA patients and 1,288 controls were analyzed. RANKL levels were significantly higher in RA patients compared to controls (SMD = 0.665, 95% CI = 0.290-1.040, P = 0.001). Subgroup analysis affirmed these findings' consistency across different sample sizes and publication years. RANKL levels were positively associated with rheumatoid factor (RF) and Disease Activity Score-28 (DAS28) (RF correlation coefficient = 0.157, 95% CI = 0.028-0.282, P = 0.018; DAS28 correlation coefficient = 0.151, 95% CI = 0.125-0.370, P < 0.001). Additionally, the meta-analysis revealed significant associations between the susceptibility to RA and the RANKL rs9533156 C allele (OR = 0.609, 95% CI = 0.520-0.714, P < 0.010) as well as the rs2277438 G allele (OR = 1.206, 95% CI = 1.003-1.451, P = 0.047). These associations were consistent across homozygote comparisons and different genetic models. CONCLUSIONS This meta-analysis underscores the elevated circulating RANKL levels in RA patients and their significant correlation with RF and DAS28. Additionally, the RANKL rs9533156 and rs2277438 polymorphisms were significantly associated with RA susceptibility.
Collapse
Affiliation(s)
- Young Ho Lee
- Department of Rheumatology, Korea University College of Medicine, Seoul, Korea
| | - Gwan Gyu Song
- Department of Rheumatology, Korea University College of Medicine, Seoul, Korea
| |
Collapse
|
4
|
Lyu C, Joehanes R, Huan T, Levy D, Li Y, Wang M, Liu X, Liu C, Ma J. Enhancing selection of alcohol consumption-associated genes by random forest. Br J Nutr 2024; 131:2058-2067. [PMID: 38606596 PMCID: PMC11216877 DOI: 10.1017/s0007114524000795] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2024]
Abstract
Machine learning methods have been used in identifying omics markers for a variety of phenotypes. We aimed to examine whether a supervised machine learning algorithm can improve identification of alcohol-associated transcriptomic markers. In this study, we analysed array-based, whole-blood derived expression data for 17 873 gene transcripts in 5508 Framingham Heart Study participants. By using the Boruta algorithm, a supervised random forest (RF)-based feature selection method, we selected twenty-five alcohol-associated transcripts. In a testing set (30 % of entire study participants), AUC (area under the receiver operating characteristics curve) of these twenty-five transcripts were 0·73, 0·69 and 0·66 for non-drinkers v. moderate drinkers, non-drinkers v. heavy drinkers and moderate drinkers v. heavy drinkers, respectively. The AUC of the selected transcripts by the Boruta method were comparable to those identified using conventional linear regression models, for example, AUC of 1958 transcripts identified by conventional linear regression models (false discovery rate < 0·2) were 0·74, 0·66 and 0·65, respectively. With Bonferroni correction for the twenty-five Boruta method-selected transcripts and three CVD risk factors (i.e. at P < 6·7e-4), we observed thirteen transcripts were associated with obesity, three transcripts with type 2 diabetes and one transcript with hypertension. For example, we observed that alcohol consumption was inversely associated with the expression of DOCK4, IL4R, and SORT1, and DOCK4 and SORT1 were positively associated with obesity, and IL4R was inversely associated with hypertension. In conclusion, using a supervised machine learning method, the RF-based Boruta algorithm, we identified novel alcohol-associated gene transcripts.
Collapse
Affiliation(s)
- Chenglin Lyu
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
- Department of Anatomy and Neurobiology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA
| | - Roby Joehanes
- Framingham Heart Study and Population Sciences Branch, NHLBI, Framingham, MA
| | - Tianxiao Huan
- Framingham Heart Study and Population Sciences Branch, NHLBI, Framingham, MA
| | - Daniel Levy
- Framingham Heart Study and Population Sciences Branch, NHLBI, Framingham, MA
| | - Yi Li
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
| | - Mengyao Wang
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
| | - Xue Liu
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
| | - Chunyu Liu
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
| | - Jiantao Ma
- Nutrition Epidemiology and Data Science, Friedman School of Nutrition Science and Policy, Tufts University, Boston, MA
| |
Collapse
|
5
|
Tang YY, Xu WD, Fu L, Liu XY, Huang AF. Synergistic effects of BTN3A1, SHP2, CD274, and STAT3 gene polymorphisms on the risk of systemic lupus erythematosus: a multifactorial dimensional reduction analysis. Clin Rheumatol 2024; 43:489-499. [PMID: 37688767 DOI: 10.1007/s10067-023-06765-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 08/23/2023] [Accepted: 09/01/2023] [Indexed: 09/11/2023]
Abstract
OBJECTIVE Systemic lupus erythematosus is a complex autoimmune disorder, and evidence supports the significance of genetic polymorphisms in SLE genetic susceptibility. The aim of this study was to assess the effects of BTN3A1 (butyrophilin 3A1), SHP2 (Src homology-2 containing protein tyrosine phosphatase), CD274 (programmed cell death 1 ligand 1), and STAT3 (signal transducer-activator of transcription 3) gene interactions on SLE risk. MATERIALS AND METHODS Two hundred and ninety patients diagnosed with SLE and 370 healthy controls were recruited. A multifactor dimensionality reduction (MDR) approach was used to determine the epistasis among single nucleotide polymorphisms (SNPs) on the BTN3A1 (rs742090), SHP2 (rs58116261), CD174 (rs702275), and STAT3 (rs8078731) genes. The best risk prediction model was identified in terms of precision and cross-validation consistency. RESULTS Allele A and genotype AA were negatively related to genetic susceptibility of SLE for BTN3A1 rs742090 (OR = 0.788 (0.625-0.993), P = 0.044; OR = 0.604 (0.372-0.981), P = 0.040). For STAT3 rs8078731, allele A and genotype AA were positively related to the risk of SLE (OR = 1.307 (1.032-1.654), P = 0.026; OR = 1.752 (1.020-3.010), P = 0.041). MDR analysis revealed the most significant interaction between BTN3A1 rs742090 and SHP2 rs58116261. The best risk prediction model was a combination of BTN3A1 rs742090, SHP2 rs58116261, and STAT3 rs8078731 (accuracy = 0.5866, consistency = 10/10, OR = 1.9870 (1.5964-2.4731), P = 0.001). CONCLUSION These data indicate that risk prediction models formed by gene interactions (BTN3A1, SHP2, STAT3) can identify susceptible populations of SLE. Key Points • BTN3A1 rs742090 polymorphism was a protective factor for systemic lupus erythematosus, while STAT3 rs8078731 polymorphism was a risk factor. • There was a strong synergistic effect of BTN3A1 rs742090 and SHP2 rs58116261, and interaction among BTN3A1 rs742090, SHP2 rs58116261, and STAT3 rs8078731 constructed the best model to show association with SLE risk.
Collapse
Affiliation(s)
- Yang-Yang Tang
- Department of Evidence-Based Medicine, Southwest Medical University, Luzhou, Sichuan, China
| | - Wang-Dong Xu
- Department of Evidence-Based Medicine, Southwest Medical University, Luzhou, Sichuan, China.
| | - Lu Fu
- Laboratory Animal Center, Southwest Medical University, Luzhou, Sichuan, China
| | - Xiao-Yan Liu
- Department of Evidence-Based Medicine, Southwest Medical University, Luzhou, Sichuan, China
| | - An-Fang Huang
- Department of Rheumatology and Immunology, Affiliated Hospital of Southwest Medical University, Luzhou, Sichuan, China.
| |
Collapse
|
6
|
Lau CF, Malek S, Gunalan R, Saw A, Milow P, Song C. Paediatric orthopaedic expert system. Health Informatics J 2023; 29:14604582231218530. [PMID: 38019888 DOI: 10.1177/14604582231218530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2023]
Abstract
The paediatric orthopaedic expert system analyses and predicts the healing time of limb fractures in children using machine learning. As far we know, no published research on the paediatric orthopaedic expert system that predicts paediatric fracture healing time using machine learning has been published. The University Malaya Medical Centre (UMMC) offers paediatric orthopaedic data, comprises children under the age of 12 radiographs limb fractures with ages recorded from the date and time of initial trauma. SVR algorithms are used to predict and discover variables associated with fracture healing time. This study developed an expert system capable of predicting healing time, which can assist general practitioners and healthcare practitioners during treatment and follow-up. The system is available online at https://kidsfractureexpert.com/.
Collapse
Affiliation(s)
- Chia Fong Lau
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
| | - Sorayya Malek
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
| | - Roshan Gunalan
- Department of Orthopaedics / NOCERAL, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
| | - Aik Saw
- Department of Orthopaedics / NOCERAL, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
| | - Pozi Milow
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
| | - Cheen Song
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
| |
Collapse
|
7
|
Zhang C, Qin Q, Li Y, Zheng X, Chen W, Zhen Q, Li B, Wang W, Sun L. Multifactor dimensionality reduction reveals the effect of interaction between ERAP1 and IFIH1 polymorphisms in psoriasis susceptibility genes. Front Genet 2022; 13:1009589. [PMID: 36425068 PMCID: PMC9679141 DOI: 10.3389/fgene.2022.1009589] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Accepted: 10/18/2022] [Indexed: 09/18/2023] Open
Abstract
Background: Psoriasis is a common immune-mediated hyperproliferative skin dysfunction with known genetic predisposition. Gene-gene interaction (e.g., between HLA-C and ERAP1) in the psoriasis context has been reported in various populations. As ERAP1 has been recognized as a psoriasis susceptibility gene and plays a critical role in antigen presentation, we performed this study to identify interactions between ERAP1 and other psoriasis susceptibility gene variants. Methods: We validated psoriasis susceptibility gene variants in an independent cohort of 5,414 patients with psoriasis and 5,556 controls. Multifactor dimensionality reduction (MDR) analysis was performed to identify the interaction between variants significantly associated with psoriasis in the validation cohort and ERAP1 variants. We then conducted a meta-analysis of those variants with datasets from exome sequencing, target sequencing, and validation analyses and used MDR to identify the best gene-gene interaction model, including variants that were significant in the meta-analysis and ERAP1 variants. Results: We found that 19 of the replicated variants were identified with p < 0.05 and detected six single-nucleotide polymorphisms of psoriasis susceptibility genes in the meta-analysis. MDR analysis revealed that the best predictive model was that between the rs27044 polymorphism of ERAP1 and the rs7590692 polymorphism of IFIH1 (cross-validation consistency = 9/10, test accuracy = 0.53, odds ratio = 1.32 (95% CI, 1.09-1.59), p < 0.01). Conclusion: Our findings suggest that the interaction between ERAP1 and IFIH1 affects the development of psoriasis. This hypothesis needs to be tested in basic biological studies.
Collapse
Affiliation(s)
- Chang Zhang
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Institute of Dermatology, Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
| | - Qin Qin
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Institute of Dermatology, Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
| | - Yuanyuan Li
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Institute of Dermatology, Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
| | - Xiaodong Zheng
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Institute of Dermatology, Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
| | - Weiwei Chen
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Institute of Dermatology, Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
| | - Qi Zhen
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Institute of Dermatology, Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
| | - Bao Li
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Institute of Dermatology, Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
| | - Wenjun Wang
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Institute of Dermatology, Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
| | - Liangdan Sun
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Institute of Dermatology, Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
| |
Collapse
|
8
|
Momtazmanesh S, Nowroozi A, Rezaei N. Artificial Intelligence in Rheumatoid Arthritis: Current Status and Future Perspectives: A State-of-the-Art Review. Rheumatol Ther 2022; 9:1249-1304. [PMID: 35849321 PMCID: PMC9510088 DOI: 10.1007/s40744-022-00475-4] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 06/24/2022] [Indexed: 11/23/2022] Open
Abstract
Investigation of the potential applications of artificial intelligence (AI), including machine learning (ML) and deep learning (DL) techniques, is an exponentially growing field in medicine and healthcare. These methods can be critical in providing high-quality care to patients with chronic rheumatological diseases lacking an optimal treatment, like rheumatoid arthritis (RA), which is the second most prevalent autoimmune disease. Herein, following reviewing the basic concepts of AI, we summarize the advances in its applications in RA clinical practice and research. We provide directions for future investigations in this field after reviewing the current knowledge gaps and technical and ethical challenges in applying AI. Automated models have been largely used to improve RA diagnosis since the early 2000s, and they have used a wide variety of techniques, e.g., support vector machine, random forest, and artificial neural networks. AI algorithms can facilitate screening and identification of susceptible groups, diagnosis using omics, imaging, clinical, and sensor data, patient detection within electronic health record (EHR), i.e., phenotyping, treatment response assessment, monitoring disease course, determining prognosis, novel drug discovery, and enhancing basic science research. They can also aid in risk assessment for incidence of comorbidities, e.g., cardiovascular diseases, in patients with RA. However, the proposed models may vary significantly in their performance and reliability. Despite the promising results achieved by AI models in enhancing early diagnosis and management of patients with RA, they are not fully ready to be incorporated into clinical practice. Future investigations are required to ensure development of reliable and generalizable algorithms while they carefully look for any potential source of bias or misconduct. We showed that a growing body of evidence supports the potential role of AI in revolutionizing screening, diagnosis, and management of patients with RA. However, multiple obstacles hinder clinical applications of AI models. Incorporating the machine and/or deep learning algorithms into real-world settings would be a key step in the progress of AI in medicine.
Collapse
Affiliation(s)
- Sara Momtazmanesh
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- Network of Immunity in Infection, Malignancy and Autoimmunity (NIIMA), Universal Scientific Education and Research Network (USERN), Tehran, Iran
- Research Center for Immunodeficiencies, Pediatrics Center of Excellence, Children's Medical Center, Tehran University of Medical Sciences, Dr. Gharib St, Keshavarz Blvd, Tehran, Iran
| | - Ali Nowroozi
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- Network of Immunity in Infection, Malignancy and Autoimmunity (NIIMA), Universal Scientific Education and Research Network (USERN), Tehran, Iran
| | - Nima Rezaei
- Network of Immunity in Infection, Malignancy and Autoimmunity (NIIMA), Universal Scientific Education and Research Network (USERN), Tehran, Iran.
- Research Center for Immunodeficiencies, Pediatrics Center of Excellence, Children's Medical Center, Tehran University of Medical Sciences, Dr. Gharib St, Keshavarz Blvd, Tehran, Iran.
- Department of Immunology, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran.
| |
Collapse
|
9
|
Ning W, Acharya A, Li S, Schmalz G, Huang S. Identification of Key Pyroptosis-Related Genes and Distinct Pyroptosis-Related Clusters in Periodontitis. Front Immunol 2022; 13:862049. [PMID: 35844512 PMCID: PMC9281553 DOI: 10.3389/fimmu.2022.862049] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Accepted: 05/23/2022] [Indexed: 12/02/2022] Open
Abstract
Aim This study aims to identify pyroptosis-related genes (PRGs), their functional immune characteristics, and distinct pyroptosis-related clusters in periodontitis. Methods Differentially expressed (DE)-PRGs were determined by merging the expression profiles of GSE10334, GSE16134, and PRGs obtained from previous literatures and Molecular Signatures Database (MSigDB). Least absolute shrinkage and selection operator (LASSO) regression was applied to screen the prognostic PRGs and develop a prognostic model. Consensus clustering was applied to determine the pyroptosis-related clusters. Functional analysis and single-sample gene set enrichment analysis (ssGSEA) were performed to explore the biological characteristics and immune activities of the clusters. The hub pyroptosis-related modules were defined using weighted correlation network analysis (WGCNA). Results Of the 26 periodontitis-related DE-PRGs, the highest positive relevance was for High-Mobility Group Box 1 (HMGB1) and SR-Related CTD Associated Factor 11 (SCAF11). A 14-PRG-based signature was developed through the LASSO model. In addition, three pyroptosis-related clusters were obtained based on the 14 prognostic PRGs. Caspase 3 (CASP3), Granzyme B (GZMB), Interleukin 1 Alpha (IL1A), IL1Beta (B), IL6, Phospholipase C Gamma 1 (PLCG1) and PYD And CARD Domain Containing (PYCARD) were dysregulated in the three clusters. Distinct biological functions and immune activities, including human leukocyte antigen (HLA) gene expression, immune cell infiltration, and immune pathway activities, were identified in the three pyroptosis-related clusters of periodontitis. Furthermore, the pink module associated with endoplasmic stress-related functions was found to be correlated with cluster 2 and was suggested as the hub pyroptosis-related module. Conclusion The study identified 14 key pyroptosis-related genes, three distinct pyroptosis-related clusters, and one pyroptosis-related gene module describing several molecular aspects of pyroptosis in the pathogenesis and immune micro-environment regulation of periodontitis and also highlighted functional heterogeneity in pyroptosis-related mechanisms.
Collapse
Affiliation(s)
- Wanchen Ning
- Stomatological Hospital, Southern Medical University, Guangzhou, China
| | - Aneesha Acharya
- Dr. D. Y. Patil Dental College and Hospital, Dr. D. Y. Patil Vidyapeeth, Pune, India
| | - Simin Li
- Stomatological Hospital, Southern Medical University, Guangzhou, China
| | - Gerhard Schmalz
- Department of Cariology, Endodontology and Periodontology, University Leipzig, Leipzig, Germany
| | - Shaohong Huang
- Stomatological Hospital, Southern Medical University, Guangzhou, China
| |
Collapse
|
10
|
Xu Q, Zheng X, Mao Y, Chen W, Chen S, Zhang H, Zhen Q, Li B, Yong L, Ge H, Yu Y, Zhang R, Cao L, Cheng H, Wang W, Sun L. Gene interaction analysis of psoriasis in Chinese Han population. Mol Genet Genomic Med 2022; 10:e1858. [PMID: 35352505 PMCID: PMC9034666 DOI: 10.1002/mgg3.1858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 10/10/2021] [Accepted: 12/17/2021] [Indexed: 11/21/2022] Open
Abstract
Background/aims Psoriasis is a chronic immune‐mediated inflammatory skin disease characterized by excessive proliferation of keratinocytes. It has a strong genetic predisposition; gene‐gene interactions are important genetic models for common diseases. In this study, we explore pair‐wise interactions among SNPs contributing to psoriasis susceptibility. Methods We first performed gene interactions with exome‐sequencing, next, we analyzed gene interactions combining the exome sequencing data with the targeted sequencing data. After we sequenced HLA region, we analyzed gene interactions including HLA regions and non‐HLA regions. Results We found interactions between HLA regions were significant. We observed significant interactions between HLA‐C*06:02 and rs118179173 (snp31443520; p = 8.21 × 10−20, OR = 0.22) and between HLA‐C*06:02 and HLA‐B:AA67 (p = 1.22 × 10−12, OR = 0.45). Conclusion This study provides evidence that HLA is the most important susceptibility region on the risk of psoriasis and interactions that occur in this region are still significant.
Collapse
Affiliation(s)
- Qiongqiong Xu
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Xiaodong Zheng
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Yiwen Mao
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Weiwei Chen
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Shirui Chen
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Hui Zhang
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Qi Zhen
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Bao Li
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Liang Yong
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Huiyao Ge
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Yafen Yu
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Ruixue Zhang
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Lu Cao
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Hui Cheng
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Wenjun Wang
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| | - Liangdan Sun
- Department of Dermatology, First Affiliated Hospital, Anhui Medical University, Hefei, China.,Institute of Dermatology, Anhui Medical University, Hefei, China.,Key Laboratory of Dermatology, Anhui Medical University, Ministry of Education, Hefei, China.,Anhui Medical University, Anhui Provincial Institute of Translational Medicine, Hefei, China.,Anhui Medical University, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, China
| |
Collapse
|
11
|
Gola D, König IR. Empowering individual trait prediction using interactions for precision medicine. BMC Bioinformatics 2021; 22:74. [PMID: 33602124 PMCID: PMC7890638 DOI: 10.1186/s12859-021-04011-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Accepted: 02/08/2021] [Indexed: 11/11/2022] Open
Abstract
Background One component of precision medicine is to construct prediction models with their predicitve ability as high as possible, e.g. to enable individual risk prediction. In genetic epidemiology, complex diseases like coronary artery disease, rheumatoid arthritis, and type 2 diabetes, have a polygenic basis and a common assumption is that biological and genetic features affect the outcome under consideration via interactions. In the case of omics data, the use of standard approaches such as generalized linear models may be suboptimal and machine learning methods are appealing to make individual predictions. However, most of these algorithms focus mostly on main or marginal effects of the single features in a dataset. On the other hand, the detection of interacting features is an active area of research in the realm of genetic epidemiology. One big class of algorithms to detect interacting features is based on the multifactor dimensionality reduction (MDR). Here, we further develop the model-based MDR (MB-MDR), a powerful extension of the original MDR algorithm, to enable interaction empowered individual prediction. Results Using a comprehensive simulation study we show that our new algorithm (median AUC: 0.66) can use information hidden in interactions and outperforms two other state-of-the-art algorithms, namely the Random Forest (median AUC: 0.54) and Elastic Net (median AUC: 0.50), if interactions are present in a scenario of two pairs of two features having small effects. The performance of these algorithms is comparable if no interactions are present. Further, we show that our new algorithm is applicable to real data by comparing the performance of the three algorithms on a dataset of rheumatoid arthritis cases and healthy controls. As our new algorithm is not only applicable to biological/genetic data but to all datasets with discrete features, it may have practical implications in other research fields where interactions between features have to be considered as well, and we made our method available as an R package (https://github.com/imbs-hl/MBMDRClassifieR). Conclusions The explicit use of interactions between features can improve the prediction performance and thus should be included in further attempts to move precision medicine forward.
Collapse
Affiliation(s)
- Damian Gola
- Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Lübeck, Germany
| | - Inke R König
- Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Lübeck, Germany.
| |
Collapse
|
12
|
Tian H, Cao S, Hu M, Wang Y, Fu Q, Pan Y, Qin T. Identification of predictive factors in hepatocellular carcinoma outcome: A longitudinal study. Oncol Lett 2020; 20:765-773. [PMID: 32566003 PMCID: PMC7285798 DOI: 10.3892/ol.2020.11581] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Accepted: 02/19/2020] [Indexed: 12/24/2022] Open
Abstract
Various surgical methods impact the prognosis of patients with hepatocellular carcinoma (HCC) differently. However, clinical guidelines remain inconsistent and the relative importance of predictors of survival outcomes requires further evaluation. The present study aimed to rank the importance of predictive factors that impact the survival outcomes of patients with HCC and to compare the prognosis associated with different surgical methods based on data obtained from the Surveillance, Epidemiology and End Results database. To achieve these aims, the present study used a random forest (RF) model to detect important predictive factors associated with survival outcomes in patients with HCC. Cox regression analysis was used to compare different surgery methods. The variables included in the Cox regression model were selected based on the Gini index calculated by the RF model. Using the RF model, the present study demonstrated that surgery method, tumor size and age were the first, second and third most important factors associated with HCC prognosis, respectively. Overall, patients who underwent local tumor destruction [(hazard ratio (HR)=0.48; 95% confidence interval (CI), 0.45–0.51; P<0.001)], wedge or segmental resection (HR, 0.31; 95% CI, 0.29–0.33; P<0.001), lobectomy (HR, 0.29, 95% CI, 0.27–0.31; P<0.001) or liver transplantation (HR, 0.16; 95% CI, 0.14–0.17; P<0.001) demonstrated improved overall survival time compared with those treated with surgery, with a gradual decreasing trend observed in HRs. The present study demonstrated that the surgical method used is the most important predictor of the survival outcomes of patients with HCC. Liver transplantation resulted in the best prognosis for patients with HCC, except for those with undifferentiated tumors or distant metastasis.
Collapse
Affiliation(s)
- Huiyuan Tian
- Department of Research and Discipline Development, Henan Provincial People's Hospital, Zhengzhou University People's Hospital, Henan University People's Hospital, Zhengzhou, Henan 450003, P.R. China
| | - Shaofeng Cao
- Department of Gastroenterology, The Fifth Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan 450000, P.R. China
| | - Mingxing Hu
- Department of Hepatobiliary and Pancreatic Surgery, Henan Provincial People's Hospital, Zhengzhou University People's Hospital, Henan University People's Hospital, Zhengzhou, Henan 450003, P.R. China
| | - Yuzhu Wang
- Department of Hepatobiliary and Pancreatic Surgery, Henan Provincial People's Hospital, Zhengzhou University People's Hospital, Henan University People's Hospital, Zhengzhou, Henan 450003, P.R. China
| | - Qiang Fu
- Department of Hepatobiliary and Pancreatic Surgery, Henan Provincial People's Hospital, Zhengzhou University People's Hospital, Henan University People's Hospital, Zhengzhou, Henan 450003, P.R. China
| | - Yanfeng Pan
- Department of Infectious Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan 450000, P.R. China
| | - Tao Qin
- Department of Hepatobiliary and Pancreatic Surgery, Henan Provincial People's Hospital, Zhengzhou University People's Hospital, Henan University People's Hospital, Zhengzhou, Henan 450003, P.R. China
| |
Collapse
|
13
|
Malten J, König IR. Modified entropy-based procedure detects gene-gene-interactions in unconventional genetic models. BMC Med Genomics 2020; 13:65. [PMID: 32326960 PMCID: PMC7181579 DOI: 10.1186/s12920-020-0703-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Accepted: 03/13/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Since it is assumed that genetic interactions play an important role in understanding the mechanisms of complex diseases, different statistical approaches have been suggested in recent years for this task. One interesting approach is the entropy-based IGENT method by Kwon et al. that promises an efficient detection of main effects and interaction effects simultaneously. However, a modification is required if the aim is to only detect interaction effects. METHODS Based on the IGENT method, we present a modification that leads to a conditional mutual information based approach under the condition of linkage equilibrium. The modified estimator is investigated in a comprehensive simulation based on five genetic interaction models and applied to real data from the genome-wide association study by the North American Rheumatoid Arthritis Consortium (NARAC). RESULTS The presented modification of IGENT controls the type I error in all simulated constellations. Furthermore, it provides high power for detecting pure interactions specifically on unconventional genetic models both in simulation and real data. CONCLUSIONS The proposed method uses the IGENT software, which is free available, simple and fast, and detects pure interactions on unconventional genetic models. Our results demonstrate that this modification is an attractive complement to established analysis methods.
Collapse
Affiliation(s)
- Jörg Malten
- Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Ratzeburger Allee 160, 23562 Lübeck, Germany
| | - Inke R König
- Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Ratzeburger Allee 160, 23562 Lübeck, Germany.
| |
Collapse
|
14
|
Stafford IS, Kellermann M, Mossotto E, Beattie RM, MacArthur BD, Ennis S. A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases. NPJ Digit Med 2020; 3:30. [PMID: 32195365 PMCID: PMC7062883 DOI: 10.1038/s41746-020-0229-3] [Citation(s) in RCA: 120] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Accepted: 01/17/2020] [Indexed: 02/07/2023] Open
Abstract
Autoimmune diseases are chronic, multifactorial conditions. Through machine learning (ML), a branch of the wider field of artificial intelligence, it is possible to extract patterns within patient data, and exploit these patterns to predict patient outcomes for improved clinical management. Here, we surveyed the use of ML methods to address clinical problems in autoimmune disease. A systematic review was conducted using MEDLINE, embase and computers and applied sciences complete databases. Relevant papers included "machine learning" or "artificial intelligence" and the autoimmune diseases search term(s) in their title, abstract or key words. Exclusion criteria: studies not written in English, no real human patient data included, publication prior to 2001, studies that were not peer reviewed, non-autoimmune disease comorbidity research and review papers. 169 (of 702) studies met the criteria for inclusion. Support vector machines and random forests were the most popular ML methods used. ML models using data on multiple sclerosis, rheumatoid arthritis and inflammatory bowel disease were most common. A small proportion of studies (7.7% or 13/169) combined different data types in the modelling process. Cross-validation, combined with a separate testing set for more robust model evaluation occurred in 8.3% of papers (14/169). The field may benefit from adopting a best practice of validation, cross-validation and independent testing of ML models. Many models achieved good predictive results in simple scenarios (e.g. classification of cases and controls). Progression to more complex predictive models may be achievable in future through integration of multiple data types.
Collapse
Affiliation(s)
- I. S. Stafford
- Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK
- Institute for Life Sciences, University of Southampton, Southampton, UK
| | - M. Kellermann
- Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK
| | - E. Mossotto
- Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK
- Institute for Life Sciences, University of Southampton, Southampton, UK
| | - R. M. Beattie
- Department of Paediatric Gastroenterology, Southampton Children’s Hospital, Southampton, UK
| | - B. D. MacArthur
- Institute for Life Sciences, University of Southampton, Southampton, UK
| | - S. Ennis
- Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK
| |
Collapse
|
15
|
Saad MN, Mabrouk MS, Eldeib AM, Shaker OG. Comparative study for haplotype block partitioning methods - Evidence from chromosome 6 of the North American Rheumatoid Arthritis Consortium (NARAC) dataset. PLoS One 2018; 13:e0209603. [PMID: 30596705 PMCID: PMC6312333 DOI: 10.1371/journal.pone.0209603] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Accepted: 12/07/2018] [Indexed: 11/19/2022] Open
Abstract
Haplotype-based methods compete with "one-SNP-at-a-time" approaches on being preferred for association studies. Chromosome 6 contains most of the known genetic biomarkers for rheumatoid arthritis (RA) disease. Therefore, chromosome 6 serves as a benchmark for the haplotype methods testing. The aim of this study is to test the North American Rheumatoid Arthritis Consortium (NARAC) dataset to find out if haplotype block methods or single-locus approaches alone can sufficiently provide the significant single nucleotide polymorphisms (SNPs) associated with RA. In addition, could we be satisfied with only one method of the haplotype block methods for partitioning chromosome 6 of the NARAC dataset? In the NARAC dataset, chromosome 6 comprises 35,574 SNPs for 2,062 individuals (868 cases, 1,194 controls). Individual SNP approach and three haplotype block methods were applied to the NARAC dataset to identify the RA biomarkers. We employed three haplotype partitioning methods which are confidence interval test (CIT), four gamete test (FGT), and solid spine of linkage disequilibrium (SSLD). P-values after stringent Bonferroni correction for multiple testing were measured to assess the strength of association between the genetic variants and RA susceptibility. Moreover, the block size (in base pairs (bp) and number of SNPs included), number of blocks, percentage of uncovered SNPs by the block method, percentage of significant blocks from the total number of blocks, number of significant haplotypes and SNPs were used to compare among the three haplotype block methods. Individual SNP, CIT, FGT, and SSLD methods detected 432, 1,086, 1,099, and 1,322 associated SNPs, respectively. Each method identified significant SNPs that were not detected by any other method (Individual SNP: 12, FGT: 37, CIT: 55, and SSLD: 189 SNPs). 916 SNPs were discovered by all the three haplotype block methods. 367 SNPs were discovered by the haplotype block methods and the individual SNP approach. The P-values of these 367 SNPs were lower than those of the SNPs uniquely detected by only one method. The 367 SNPs detected by all the methods represent promising candidates for RA susceptibility. They should be further investigated for the European population. A hybrid technique including the four methods should be applied to detect the significant SNPs associated with RA for chromosome 6 of the NARAC dataset. Moreover, SSLD method may be preferred for its favored benefits in case of selecting only one method.
Collapse
Affiliation(s)
- Mohamed N. Saad
- Biomedical Engineering Department, Faculty of Engineering, Minia University, Minia, Egypt
| | - Mai S. Mabrouk
- Biomedical Engineering Department, Faculty of Engineering, Misr University for Science and Technology (MUST), 6th of October City, Egypt
| | - Ayman M. Eldeib
- Systems and Biomedical Engineering Department, Faculty of Engineering, Cairo University, Giza, Egypt
| | - Olfat G. Shaker
- Medical Biochemistry and Molecular Biology Department, Faculty of Medicine, Cairo University, Cairo, Egypt
| |
Collapse
|
16
|
Uppu S, Krishna A, Gopalan RP. A Review on Methods for Detecting SNP Interactions in High-Dimensional Genomic Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:599-612. [PMID: 28060710 DOI: 10.1109/tcbb.2016.2635125] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
In this era of genome-wide association studies (GWAS), the quest for understanding the genetic architecture of complex diseases is rapidly increasing more than ever before. The development of high throughput genotyping and next generation sequencing technologies enables genetic epidemiological analysis of large scale data. These advances have led to the identification of a number of single nucleotide polymorphisms (SNPs) responsible for disease susceptibility. The interactions between SNPs associated with complex diseases are increasingly being explored in the current literature. These interaction studies are mathematically challenging and computationally complex. These challenges have been addressed by a number of data mining and machine learning approaches. This paper reviews the current methods and the related software packages to detect the SNP interactions that contribute to diseases. The issues that need to be considered when developing these models are addressed in this review. The paper also reviews the achievements in data simulation to evaluate the performance of these models. Further, it discusses the future of SNP interaction analysis.
Collapse
|
17
|
Malek S, Gunalan R, Kedija S, Lau C, Mosleh MA, Milow P, Lee S, Saw A. Random forest and Self Organizing Maps application for analysis of pediatric fracture healing time of the lower limb. Neurocomputing 2018. [DOI: 10.1016/j.neucom.2017.05.094] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
|
18
|
Vernerova L, Spoutil F, Vlcek M, Krskova K, Penesova A, Meskova M, Marko A, Raslova K, Vohnout B, Rovensky J, Killinger Z, Jochmanova I, Lazurova I, Steiner G, Smolen J, Imrich R. A Combination of CD28 (rs1980422) and IRF5 (rs10488631) Polymorphisms Is Associated with Seropositivity in Rheumatoid Arthritis: A Case Control Study. PLoS One 2016; 11:e0153316. [PMID: 27092776 PMCID: PMC4836711 DOI: 10.1371/journal.pone.0153316] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2015] [Accepted: 03/28/2016] [Indexed: 12/15/2022] Open
Abstract
Introduction The aim of the study was to analyse genetic architecture of RA by utilizing multiparametric statistical methods such as linear discriminant analysis (LDA) and redundancy analysis (RDA). Methods A total of 1393 volunteers, 499 patients with RA and 894 healthy controls were included in the study. The presence of shared epitope (SE) in HLA-DRB1 and 11 SNPs (PTPN22 C/T (rs2476601), STAT4 G/T (rs7574865), CTLA4 A/G (rs3087243), TRAF1/C5 A/G (rs3761847), IRF5 T/C (rs10488631), TNFAIP3 C/T (rs5029937), AFF3 A/T (rs11676922), PADI4 C/T (rs2240340), CD28 T/C (rs1980422), CSK G/A (rs34933034) and FCGR3A A/C (rs396991), rheumatoid factor (RF), anti–citrullinated protein antibodies (ACPA) and clinical status was analysed using the LDA and RDA. Results HLA-DRB1, PTPN22, STAT4, IRF5 and PADI4 significantly discriminated between RA patients and healthy controls in LDA. The correlation between RA diagnosis and the explanatory variables in the model was 0.328 (Trace = 0.107; F = 13.715; P = 0.0002). The risk variants of IRF5 and CD28 genes were found to be common determinants for seropositivity in RDA, while positivity of RF alone was associated with the CTLA4 risk variant in heterozygous form. The correlation between serologic status and genetic determinants on the 1st ordinal axis was 0.468, and 0.145 on the 2nd one (Trace = 0.179; F = 6.135; P = 0.001). The risk alleles in AFF3 gene together with the presence of ACPA were associated with higher clinical severity of RA. Conclusions The association among multiple risk variants related to T cell receptor signalling with seropositivity may play an important role in distinct clinical phenotypes of RA. Our study demonstrates that multiparametric analyses represent a powerful tool for investigation of mutual relationships of potential risk factors in complex diseases such as RA.
Collapse
Affiliation(s)
- Lucia Vernerova
- Institute of Clinical and Translational Research, Biomedical Centre, Slovak Academy of Sciences, Bratislava, Slovakia
| | - Frantisek Spoutil
- Institute of Molecular Genetics, Czech Academy of Sciences, Prague, Czech Republic.,Institute of Experimental Medicine, Czech Academy of Sciences, Prague, Czech Republic
| | - Miroslav Vlcek
- Institute of Clinical and Translational Research, Biomedical Centre, Slovak Academy of Sciences, Bratislava, Slovakia
| | - Katarina Krskova
- Institute of Clinical and Translational Research, Biomedical Centre, Slovak Academy of Sciences, Bratislava, Slovakia
| | - Adela Penesova
- Institute of Clinical and Translational Research, Biomedical Centre, Slovak Academy of Sciences, Bratislava, Slovakia
| | - Milada Meskova
- Institute of Clinical and Translational Research, Biomedical Centre, Slovak Academy of Sciences, Bratislava, Slovakia
| | - Andrea Marko
- Institute of Clinical and Translational Research, Biomedical Centre, Slovak Academy of Sciences, Bratislava, Slovakia
| | | | | | - Jozef Rovensky
- National Institute of Rheumatic Diseases, Piešťany, Slovakia
| | - Zdenko Killinger
- 5th Department of Internal Medicine, Medical Faculty of Comenius University, Bratislava, Slovakia
| | - Ivana Jochmanova
- 1st Department of Internal Medicine, Faculty of Medicine, Pavol Jozef Šafárik University in Košice, Košice, Slovakia
| | - Ivica Lazurova
- 1st Department of Internal Medicine, Faculty of Medicine, Pavol Jozef Šafárik University in Košice, Košice, Slovakia
| | - Guenter Steiner
- Department of Internal Medicine III, Division of Rheumatology, Medical University of Vienna, Vienna, Austria
| | - Josef Smolen
- Department of Internal Medicine III, Division of Rheumatology, Medical University of Vienna, Vienna, Austria
| | - Richard Imrich
- Institute of Clinical and Translational Research, Biomedical Centre, Slovak Academy of Sciences, Bratislava, Slovakia
| |
Collapse
|
19
|
|
20
|
Wright MN, Ziegler A, König IR. Do little interactions get lost in dark random forests? BMC Bioinformatics 2016; 17:145. [PMID: 27029549 PMCID: PMC4815164 DOI: 10.1186/s12859-016-0995-8] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2016] [Accepted: 03/21/2016] [Indexed: 12/16/2022] Open
Abstract
Background Random forests have often been claimed to uncover interaction effects. However, if and how interaction effects can be differentiated from marginal effects remains unclear. In extensive simulation studies, we investigate whether random forest variable importance measures capture or detect gene-gene interactions. With capturing interactions, we define the ability to identify a variable that acts through an interaction with another one, while detection is the ability to identify an interaction effect as such. Results Of the single importance measures, the Gini importance captured interaction effects in most of the simulated scenarios, however, they were masked by marginal effects in other variables. With the permutation importance, the proportion of captured interactions was lower in all cases. Pairwise importance measures performed about equal, with a slight advantage for the joint variable importance method. However, the overall fraction of detected interactions was low. In almost all scenarios the detection fraction in a model with only marginal effects was larger than in a model with an interaction effect only. Conclusions Random forests are generally capable of capturing gene-gene interactions, but current variable importance measures are unable to detect them as interactions. In most of the cases, interactions are masked by marginal effects and interactions cannot be differentiated from marginal effects. Consequently, caution is warranted when claiming that random forests uncover interactions. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-0995-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Marvin N Wright
- Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Ratzeburger Allee 160, Lübeck, 23562, Germany
| | - Andreas Ziegler
- Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Ratzeburger Allee 160, Lübeck, 23562, Germany.,Zentrum für Klinische Studien, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Lübeck, Germany.,School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, South Africa
| | - Inke R König
- Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Ratzeburger Allee 160, Lübeck, 23562, Germany.
| |
Collapse
|
21
|
Wei WH, Loh CY, Worthington J, Eyre S. Immunochip Analyses of Epistasis in Rheumatoid Arthritis Confirm Multiple Interactions within MHC and Suggest Novel Non-MHC Epistatic Signals. J Rheumatol 2016; 43:839-45. [PMID: 26879349 DOI: 10.3899/jrheum.150836] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/06/2016] [Indexed: 12/22/2022]
Abstract
OBJECTIVE Studying statistical gene-gene interactions (epistasis) has been limited by the difficulties in performance, both statistically and computationally, in large enough sample numbers to gain sufficient power. Three large Immunochip datasets from cohort samples recruited in the United Kingdom, United States, and Sweden with European ancestry were used to examine epistasis in rheumatoid arthritis (RA). METHODS A full pairwise search was conducted in the UK cohort using a high-throughput tool and the resultant significant epistatic signals were tested for replication in the United States and Swedish cohorts. A forward selection approach was applied to remove redundant signals, while conditioning on the preidentified additive effects. RESULTS We detected abundant genome-wide significant (p < 1.0e-13) epistatic signals, all within the MHC region. These signals were reduced substantially, but a proportion remained significant (p < 1.0e-03) in conditional tests. We identified 11 independent epistatic interactions across the entire MHC, each explaining on average 0.12% of the phenotypic variance, nearly all replicated in both replication cohorts. We also identified non-MHC epistatic interactions between RA susceptible loci LOC100506023 and IRF5 with Immunochip-wide significance (p < 1.1e-08) and between 2 neighboring single-nucleotide polymorphism near PTPN22 that were in low linkage disequilibrium with independent interaction (p < 1.0e-05). Both non-MHC epistatic interactions were statistically replicated with a similar interaction pattern in the US cohort only. CONCLUSION There are multiple but relatively weak interactions independent of the additive effects in RA and a larger sample number is required to confidently assign additional non-MHC epistasis.
Collapse
Affiliation(s)
- Wen-Hua Wei
- From the Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester; National Institute for Health Research (NIHR) Manchester Musculoskeletal Biomedical Research Unit, Central Manchester National Health Service (NHS) Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK.W.H. Wei*, PhD, Lecturer in Statistical Genetics, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester; C.Y. Loh*, MRes, PhD Student, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester; J. Worthington, PhD, Professor of Chronic Disease Genetics, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester, and NIHR Manchester Musculoskeletal Biomedical Research Unit, Central Manchester NHS Foundation Trust, Manchester Academic Health Science Centre; S. Eyre, PhD, Senior Research Fellow on Rheumatological Disorders, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester, and NIHR Manchester Musculoskeletal Biomedical Research Unit, Central Manchester NHS Foundation Trust, Manchester Academic Health Science Centre.
| | - Chia-Yin Loh
- From the Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester; National Institute for Health Research (NIHR) Manchester Musculoskeletal Biomedical Research Unit, Central Manchester National Health Service (NHS) Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK.W.H. Wei*, PhD, Lecturer in Statistical Genetics, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester; C.Y. Loh*, MRes, PhD Student, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester; J. Worthington, PhD, Professor of Chronic Disease Genetics, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester, and NIHR Manchester Musculoskeletal Biomedical Research Unit, Central Manchester NHS Foundation Trust, Manchester Academic Health Science Centre; S. Eyre, PhD, Senior Research Fellow on Rheumatological Disorders, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester, and NIHR Manchester Musculoskeletal Biomedical Research Unit, Central Manchester NHS Foundation Trust, Manchester Academic Health Science Centre
| | - Jane Worthington
- From the Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester; National Institute for Health Research (NIHR) Manchester Musculoskeletal Biomedical Research Unit, Central Manchester National Health Service (NHS) Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK.W.H. Wei*, PhD, Lecturer in Statistical Genetics, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester; C.Y. Loh*, MRes, PhD Student, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester; J. Worthington, PhD, Professor of Chronic Disease Genetics, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester, and NIHR Manchester Musculoskeletal Biomedical Research Unit, Central Manchester NHS Foundation Trust, Manchester Academic Health Science Centre; S. Eyre, PhD, Senior Research Fellow on Rheumatological Disorders, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester, and NIHR Manchester Musculoskeletal Biomedical Research Unit, Central Manchester NHS Foundation Trust, Manchester Academic Health Science Centre
| | - Stephen Eyre
- From the Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester; National Institute for Health Research (NIHR) Manchester Musculoskeletal Biomedical Research Unit, Central Manchester National Health Service (NHS) Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK.W.H. Wei*, PhD, Lecturer in Statistical Genetics, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester; C.Y. Loh*, MRes, PhD Student, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester; J. Worthington, PhD, Professor of Chronic Disease Genetics, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester, and NIHR Manchester Musculoskeletal Biomedical Research Unit, Central Manchester NHS Foundation Trust, Manchester Academic Health Science Centre; S. Eyre, PhD, Senior Research Fellow on Rheumatological Disorders, Arthritis Research UK Centre for Genetics and Genomics, Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, Manchester Academic Health Science Centre, University of Manchester, and NIHR Manchester Musculoskeletal Biomedical Research Unit, Central Manchester NHS Foundation Trust, Manchester Academic Health Science Centre
| |
Collapse
|
22
|
A review for detecting gene-gene interactions using machine learning methods in genetic epidemiology. BIOMED RESEARCH INTERNATIONAL 2013; 2013:432375. [PMID: 24228248 PMCID: PMC3818807 DOI: 10.1155/2013/432375] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Revised: 08/26/2013] [Accepted: 08/27/2013] [Indexed: 01/04/2023]
Abstract
Recently, the greatest statistical computational challenge in genetic epidemiology is to identify and characterize the genes that interact with other genes and environment factors that bring the effect on complex multifactorial disease. These gene-gene interactions are also denoted as epitasis in which this phenomenon cannot be solved by traditional statistical method due to the high dimensionality of the data and the occurrence of multiple polymorphism. Hence, there are several machine learning methods to solve such problems by identifying such susceptibility gene which are neural networks (NNs), support vector machine (SVM), and random forests (RFs) in such common and multifactorial disease. This paper gives an overview on machine learning methods, describing the methodology of each machine learning methods and its application in detecting gene-gene and gene-environment interactions. Lastly, this paper discussed each machine learning method and presents the strengths and weaknesses of each machine learning method in detecting gene-gene interactions in complex human disease.
Collapse
|
23
|
Impact of natural genetic variation on gene expression dynamics. PLoS Genet 2013; 9:e1003514. [PMID: 23754949 PMCID: PMC3674999 DOI: 10.1371/journal.pgen.1003514] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2012] [Accepted: 04/04/2013] [Indexed: 01/03/2023] Open
Abstract
DNA sequence variation causes changes in gene expression, which in turn has profound effects on cellular states. These variations affect tissue development and may ultimately lead to pathological phenotypes. A genetic locus containing a sequence variation that affects gene expression is called an “expression quantitative trait locus” (eQTL). Whereas the impact of cellular context on expression levels in general is well established, a lot less is known about the cell-state specificity of eQTL. Previous studies differed with respect to how “dynamic eQTL” were defined. Here, we propose a unified framework distinguishing static, conditional and dynamic eQTL and suggest strategies for mapping these eQTL classes. Further, we introduce a new approach to simultaneously infer eQTL from different cell types. By using murine mRNA expression data from four stages of hematopoiesis and 14 related cellular traits, we demonstrate that static, conditional and dynamic eQTL, although derived from the same expression data, represent functionally distinct types of eQTL. While static eQTL affect generic cellular processes, non-static eQTL are more often involved in hematopoiesis and immune response. Our analysis revealed substantial effects of individual genetic variation on cell type-specific expression regulation. Among a total number of 3,941 eQTL we detected 2,729 static eQTL, 1,187 eQTL were conditionally active in one or several cell types, and 70 eQTL affected expression changes during cell type transitions. We also found evidence for feedback control mechanisms reverting the effect of an eQTL specifically in certain cell types. Loci correlated with hematological traits were enriched for conditional eQTL, thus, demonstrating the importance of conditional eQTL for understanding molecular mechanisms underlying physiological trait variation. The classification proposed here has the potential to streamline and unify future analysis of conditional and dynamic eQTL as well as many other kinds of QTL data. Complex physiological traits are affected through subtle changes of molecular traits like gene expression in the relevant tissues, which in turn are caused by genetic variation. A genetic locus containing a sequence variation affecting gene expression is called an expression quantitative trait locus (eQTL). Understanding the tissue and cell type specificity of eQTL effects is essential for revealing the molecular mechanisms underlying disease phenotypes. However, so far the cell-state dependence of eQTL is poorly understood. In order to systematically assess the importance of cell state-specific eQTL, we propose to distinguish static, conditional and dynamic eQTL and suggest strategies for mapping these eQTL classes. We applied our framework to mouse gene expression data from four hematopoietic stages and related cellular traits. The different eQTL classes, although derived from the same expression data, represent functionally distinct types of eQTL. Importantly, conditional eQTL are well correlated with relevant hematological traits. These findings emphasize the condition specificity of many regulatory relationships, even if the conditions under study are related. This calls for due caution when transferring conclusions about regulatory mechanisms across cell types or tissues. The proposed classification will also help to unravel dynamic behaviors in many other kinds of QTL data.
Collapse
|
24
|
Janitza S, Strobl C, Boulesteix AL. An AUC-based permutation variable importance measure for random forests. BMC Bioinformatics 2013; 14:119. [PMID: 23560875 PMCID: PMC3626572 DOI: 10.1186/1471-2105-14-119] [Citation(s) in RCA: 99] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2012] [Accepted: 03/21/2013] [Indexed: 11/30/2022] Open
Abstract
Background The random forest (RF) method is a commonly used tool for classification with
high dimensional data as well as for ranking candidate predictors based on
the so-called random forest variable importance measures (VIMs). However the
classification performance of RF is known to be suboptimal in case of
strongly unbalanced data, i.e. data where response class sizes differ
considerably. Suggestions were made to obtain better classification
performance based either on sampling procedures or on cost sensitivity
analyses. However to our knowledge the performance of the VIMs has not yet
been examined in the case of unbalanced response classes. In this paper we
explore the performance of the permutation VIM for unbalanced data settings
and introduce an alternative permutation VIM based on the area under the
curve (AUC) that is expected to be more robust towards class imbalance. Results We investigated the performance of the standard permutation VIM and of our
novel AUC-based permutation VIM for different class imbalance levels using
simulated data and real data. The results suggest that the new AUC-based
permutation VIM outperforms the standard permutation VIM for unbalanced data
settings while both permutation VIMs have equal performance for balanced
data settings. Conclusions The standard permutation VIM loses its ability to discriminate between
associated predictors and predictors not associated with the response for
increasing class imbalance. It is outperformed by our new AUC-based
permutation VIM for unbalanced data settings, while the performance of both
VIMs is very similar in the case of balanced classes. The new AUC-based VIM
is implemented in the R package party for the unbiased RF variant based on
conditional inference trees. The codes implementing our study are available
from the companion website:
http://www.ibe.med.uni-muenchen.de/organisation/mitarbeiter/070_drittmittel/janitza/index.html.
Collapse
Affiliation(s)
- Silke Janitza
- Department of Medical Informatics, Biometry and Epidemiology, University of Munich, Marchioninistr. 15, D-81377, Munich, Germany.
| | | | | |
Collapse
|
25
|
Kang C, Yu H, Yi GS. Finding type 2 diabetes causal single nucleotide polymorphism combinations and functional modules from genome-wide association data. BMC Med Inform Decis Mak 2013; 13 Suppl 1:S3. [PMID: 23566118 PMCID: PMC3618247 DOI: 10.1186/1472-6947-13-s1-s3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Background Due to the low statistical power of individual markers from a genome-wide association study (GWAS), detecting causal single nucleotide polymorphisms (SNPs) for complex diseases is a challenge. SNP combinations are suggested to compensate for the low statistical power of individual markers, but SNP combinations from GWAS generate high computational complexity. Methods We aim to detect type 2 diabetes (T2D) causal SNP combinations from a GWAS dataset with optimal filtration and to discover the biological meaning of the detected SNP combinations. Optimal filtration can enhance the statistical power of SNP combinations by comparing the error rates of SNP combinations from various Bonferroni thresholds and p-value range-based thresholds combined with linkage disequilibrium (LD) pruning. T2D causal SNP combinations are selected using random forests with variable selection from an optimal SNP dataset. T2D causal SNP combinations and genome-wide SNPs are mapped into functional modules using expanded gene set enrichment analysis (GSEA) considering pathway, transcription factor (TF)-target, miRNA-target, gene ontology, and protein complex functional modules. The prediction error rates are measured for SNP sets from functional module-based filtration that selects SNPs within functional modules from genome-wide SNPs based expanded GSEA. Results A T2D causal SNP combination containing 101 SNPs from the Wellcome Trust Case Control Consortium (WTCCC) GWAS dataset are selected using optimal filtration criteria, with an error rate of 10.25%. Matching 101 SNPs with known T2D genes and functional modules reveals the relationships between T2D and SNP combinations. The prediction error rates of SNP sets from functional module-based filtration record no significance compared to the prediction error rates of randomly selected SNP sets and T2D causal SNP combinations from optimal filtration. Conclusions We propose a detection method for complex disease causal SNP combinations from an optimal SNP dataset by using random forests with variable selection. Mapping the biological meanings of detected SNP combinations can help uncover complex disease mechanisms.
Collapse
Affiliation(s)
- Chiyong Kang
- Department of Bio and Brain Engineering, KAIST, Daejeon 305-701, South Korea
| | | | | |
Collapse
|
26
|
You CG, Li XJ, Li YM, Wang LP, Li FF, Guo XL, Gao LN. Association analysis of single nucleotide polymorphisms of proinflammatory cytokine and their receptors genes with rheumatoid arthritis in northwest Chinese Han population. Cytokine 2013; 61:133-8. [DOI: 10.1016/j.cyto.2012.09.007] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2012] [Revised: 09/04/2012] [Accepted: 09/17/2012] [Indexed: 12/17/2022]
|
27
|
Kruppa J, Ziegler A, König IR. Risk estimation and risk prediction using machine-learning methods. Hum Genet 2012; 131:1639-54. [PMID: 22752090 PMCID: PMC3432206 DOI: 10.1007/s00439-012-1194-y] [Citation(s) in RCA: 90] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2012] [Accepted: 06/14/2012] [Indexed: 01/02/2023]
Abstract
After an association between genetic variants and a phenotype has been established, further study goals comprise the classification of patients according to disease risk or the estimation of disease probability. To accomplish this, different statistical methods are required, and specifically machine-learning approaches may offer advantages over classical techniques. In this paper, we describe methods for the construction and evaluation of classification and probability estimation rules. We review the use of machine-learning approaches in this context and explain some of the machine-learning algorithms in detail. Finally, we illustrate the methodology through application to a genome-wide association analysis on rheumatoid arthritis.
Collapse
Affiliation(s)
- Jochen Kruppa
- Institut für Medizininsche Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Maria-Goeppert-Str. 1, 23562 Lübeck, Germany
| | | | | |
Collapse
|
28
|
Abstract
Autoimmunity and allergy involving the digestive system may be considered as paradigmatic for numerous common themes of complex diseases secondary to tolerance breakdown. Among gastrointestinal autoimmune diseases, for example, we encounter diseases in which a clear environmental trigger is identified (i.e., celiac disease), serum autoantibodies are most specific (i.e., primary biliary cirrhosis), or in which the disease pathophysiology is clearly understood (i.e., autoimmune gastritis). Similarly, it is intriguing that the gastrointestinal tract and the liver circulation represent the crucial environment for the development of immune tolerance. This issue is dedicated to the discussion of recent concepts while identifying two major common issues, i.e., the need for serum biomarkers and the role of vitamin D. Other common themes characterize the etiology and effector mechanisms of these and other autoimmune diseases and are discussed in each cutting-edge overview.
Collapse
|
29
|
Lu Q, Wei C, Ye C, Li M, Elston RC. A likelihood ratio-based Mann-Whitney approach finds novel replicable joint gene action for type 2 diabetes. Genet Epidemiol 2012; 36:583-93. [PMID: 22760990 PMCID: PMC3634342 DOI: 10.1002/gepi.21651] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2012] [Revised: 04/09/2012] [Accepted: 05/09/2012] [Indexed: 12/29/2022]
Abstract
The potential importance of the joint action of genes, whether modeled with or without a statistical interaction term, has long been recognized. However, identifying such action has been a great challenge, especially when millions of genetic markers are involved. We propose a likelihood ratio-based Mann-Whitney test to search for joint gene action either among candidate genes or genome-wide. It extends the traditional univariate Mann-Whitney test to assess the joint association of genotypes at multiple loci with disease, allowing for high-order statistical interactions. Because only one overall significance test is conducted for the entire analysis, it avoids the issue of multiple testing. Moreover, the approach adopts a computationally efficient algorithm, making a genome-wide search feasible in a reasonable amount of time on a high performance personal computer. We evaluated the approach using both theoretical and real data. By applying the approach to 40 type 2 diabetes (T2D) susceptibility single-nucleotide polymorphisms (SNPs), we identified a four-locus model strongly associated with T2D in the Wellcome Trust (WT) study (permutation P-value < 0.001), and replicated the same finding in the Nurses' Health Study/Health Professionals Follow-Up Study (NHS/HPFS) (P-value = 3.03×10-11). We also conducted a genome-wide search on 385,598 SNPs in the WT study. The analysis took approximately 55 hr on a personal computer, identifying the same first two loci, but overall a different set of four SNPs, jointly associated with T2D (P-value = 1.29×10-5). The nominal significance of this same association reached 4.01×10-6 in the NHS/HPFS.
Collapse
Affiliation(s)
- Qing Lu
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, Michigan
| | - Changshuai Wei
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, Michigan
| | - Chengyin Ye
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, Michigan
| | - Ming Li
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, Michigan
| | - Robert C. Elston
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, Ohio
| |
Collapse
|
30
|
Upstill-Goddard R, Eccles D, Fliege J, Collins A. Machine learning approaches for the discovery of gene-gene interactions in disease data. Brief Bioinform 2012; 14:251-60. [DOI: 10.1093/bib/bbs024] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
|
31
|
Boulesteix AL, Bender A, Lorenzo Bermejo J, Strobl C. Random forest Gini importance favours SNPs with large minor allele frequency: impact, sources and recommendations. Brief Bioinform 2011; 13:292-304. [DOI: 10.1093/bib/bbr053] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
|
32
|
Behrens M, Smart M, Luckey D, Luthra H, Taneja V. To B or not to B: role of B cells in pathogenesis of arthritis in HLA transgenic mice. J Autoimmun 2011; 37:95-103. [PMID: 21665435 DOI: 10.1016/j.jaut.2011.05.002] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2011] [Accepted: 05/02/2011] [Indexed: 10/18/2022]
Abstract
Population studies have shown that amongst all the genetic factors linked with autoimmune disease development, MHC class II genes are the most significant. Experimental autoimmune arthritis resembling human rheumatoid arthritis (RA) can be induced in susceptible strains of mice following immunization with type II collagen (CIA). We generated transgenic mice lacking endogenous class II molecules and expressing various HLA genes including RA-associated, HLA-DRB1*0401 and HLA-DQ8, and RA-resistant, DRB1*0402, genes. The HLA molecules in these mice are expressed on the cell surface and can positively select CD4+ T cells expressing various Vβ T cell receptors. Endogenous class II invariant chain is required for proper functioning of the class II transgene. Arthritis development in transgenic mice is CD4+ and B cells dependent. Studies in humanized mice showed that B cells are required as antigen presenting cells in addition to antibody producing cells for the development of CIA. The transgenic mice expressing *0401 and *0401/DQ8 genes developed sex-biased arthritis with predominantly females being affected, similar to that of human RA. Further, the transgenic mice produced autoantibodies like rheumatoid factor and anti-cyclic antibodies. Antigen presentation by B cells leads to a sex-specific immune response in DRB1*0401 mice suggesting a role of B cells and HLA-DR in rendering susceptibility to develop arthritis in females.
Collapse
Affiliation(s)
- Marshall Behrens
- Department of Immunology, Mayo Clinic College of Medicine, Rochester, MN 55905, United States
| | | | | | | | | |
Collapse
|
33
|
Beyerlein A, von Kries R, Ness AR, Ong KK. Genetic markers of obesity risk: stronger associations with body composition in overweight compared to normal-weight children. PLoS One 2011; 6:e19057. [PMID: 21526213 PMCID: PMC3078148 DOI: 10.1371/journal.pone.0019057] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2010] [Accepted: 03/26/2011] [Indexed: 11/19/2022] Open
Abstract
Background Genetic factors are important determinants of overweight. We examined whether there are differential effect sizes depending on children's body composition. Methods We analysed data of n = 4,837 children recorded in the Avon Longitudinal Study of Parents and Children (ALSPAC), applying quantile regression with sex- and age-specific standard deviation scores (SDS) of body mass index (BMI) or with body fat mass index and fat-free mass index at 9 years as outcome variables and an “obesity-risk-allele score” based on eight genetic variants known to be associated with childhood BMI as the explanatory variable. Results The quantile regression coefficients increased with increasing child's BMI-SDS and fat mass index percentiles, indicating larger effects of the genetic factors at higher percentiles. While the associations with BMI-SDS were of similar size in medium and high BMI quantiles (40th percentile and above), effect sizes with fat mass index increased over the whole fat mass index distribution. For example, the fat mass index of a normal-weight (50th percentile) child was increased by 0.13 kg/m2 (95% confidence interval (CI): 0.09, 0.16) per additional allele, compared to 0.24 kg/m2 per allele (95% CI: 0.15, 0.32) in children at the 90th percentile. The genetic associations with fat-free mass index were weaker and the quantile regression effects less pronounced than those on fat mass index. Conclusions Genetic risk factors for childhood overweight appear to have greater effects on fatter children. Interaction of known genetic factors with environmental or unknown genetic factors might provide a potential explanation of these findings.
Collapse
Affiliation(s)
- Andreas Beyerlein
- Institute of Social Paediatrics and Adolescent Medicine, Ludwig-Maximilians University of Munich, Munich, Germany.
| | | | | | | |
Collapse
|