1
|
Tremmel R, Honore A, Park Y, Zhou Y, Xiao M, Lauschke VM. Machine learning models for pharmacogenomic variant effect predictions - recent developments and future frontiers. Pharmacogenomics 2025:1-12. [PMID: 40401639 DOI: 10.1080/14622416.2025.2504863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2025] [Accepted: 05/08/2025] [Indexed: 05/23/2025] Open
Abstract
Pharmacogenomic variations in genes involved in drug disposition and in drug targets is a major determinant of inter-individual differences in drug response and toxicity. While the effects of common variants are well established, millions of rare variations remain functionally uncharacterized, posing a challenge for the implementation of precision medicine. Recent advances in machine learning (ML) have significantly enhanced the prediction of variant effects by considering DNA as well as protein sequences, as well as their evolutionary conservation and haplotype structures. Emerging deep learning models utilize techniques to capture evolutionary conservation and biophysical properties, and ensemble approaches that integrate multiple predictive models exhibit increased accuracy, robustness, and interpretability. This review explores the current landscape of ML-based variant effect predictors. We discuss key methodological differences and highlight their strengths and limitations for pharmacogenomic applications. We furthermore discuss emerging methodologies for the prediction of substrate-specificity and for consideration of variant epistasis. Combined, these tools improve the functional effect prediction of drug-related variants and offer a viable strategy that could in the foreseeable future translate comprehensive genomic information into pharmacogenetic recommendations.
Collapse
Affiliation(s)
- Roman Tremmel
- Dr Margarete Fischer-Bosch Institute of Clinical Pharmacology, Stuttgart, Germany
- University of Tübingen, Tübingen, Germany
| | - Antoine Honore
- Division of Information Science and Engineering, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Yoomi Park
- Department of Physiology and Pharmacology and Center for Molecular Medicine, Karolinska Institutet and University Hospital, Stockholm, Sweden
- Medical Research Center, Seoul National University College of Medicine, Seoul, South Korea
| | - Yitian Zhou
- Department of Physiology and Pharmacology and Center for Molecular Medicine, Karolinska Institutet and University Hospital, Stockholm, Sweden
| | - Ming Xiao
- Division of Information Science and Engineering, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Volker M Lauschke
- Dr Margarete Fischer-Bosch Institute of Clinical Pharmacology, Stuttgart, Germany
- University of Tübingen, Tübingen, Germany
- Department of Physiology and Pharmacology and Center for Molecular Medicine, Karolinska Institutet and University Hospital, Stockholm, Sweden
- Department of Pharmacy, The Second Xiangya Hospital, Central South University, Changsha, China
| |
Collapse
|
2
|
Sadowski M, Thompson M, Mefford J, Haldar T, Oni-Orisan A, Border R, Pazokitoroudi A, Cai N, Ayroles JF, Sankararaman S, Dahl AW, Zaitlen N. Characterizing the genetic architecture of drug response using gene-context interaction methods. CELL GENOMICS 2024; 4:100722. [PMID: 39637863 DOI: 10.1016/j.xgen.2024.100722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 06/24/2024] [Accepted: 11/15/2024] [Indexed: 12/07/2024]
Abstract
Identifying factors that affect treatment response is a central objective of clinical research, yet the role of common genetic variation remains largely unknown. Here, we develop a framework to study the genetic architecture of response to commonly prescribed drugs in large biobanks. We quantify treatment response heritability for statins, metformin, warfarin, and methotrexate in the UK Biobank. We find that genetic variation modifies the primary effect of statins on LDL cholesterol (9% heritable) as well as their side effects on hemoglobin A1c and blood glucose (10% and 11% heritable, respectively). We identify dozens of genes that modify drug response, which we replicate in a retrospective pharmacogenomic study. Finally, we find that polygenic score (PGS) accuracy varies up to 2-fold depending on treatment status, showing that standard PGSs are likely to underperform in clinical contexts.
Collapse
Affiliation(s)
- Michal Sadowski
- Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, CA 90095, USA.
| | - Mike Thompson
- Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Joel Mefford
- Department of Neurology, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Tanushree Haldar
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA 94143, USA; Department of Clinical Pharmacy, University of California San Francisco, San Francisco, CA 94143, USA
| | - Akinyemi Oni-Orisan
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA 94143, USA; Department of Clinical Pharmacy, University of California San Francisco, San Francisco, CA 94143, USA
| | - Richard Border
- Department of Neurology, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Computer Science, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Computational Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Ali Pazokitoroudi
- Department of Computer Science, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Na Cai
- Helmholtz Pioneer Campus, Helmholtz Munich, 85764 Neuherberg, Germany; Computational Health Centre, Helmholtz Munich, 85764 Neuherberg, Germany; School of Medicine and Health, Technical University of Munich, 80333 Munich, Germany
| | - Julien F Ayroles
- Department of Ecology and Evolution, Princeton University, Princeton, NJ 08544, USA; Lewis Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Sriram Sankararaman
- Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Computer Science, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Computational Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Andy W Dahl
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Noah Zaitlen
- Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Neurology, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Computational Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA.
| |
Collapse
|
3
|
Shi L, Zhang X, Mao L, Zhang Y. Anti-neoplastic effect of heterophyllin B on ovarian cancer via the regulation of NRF2/HO-1 in vitro and in vivo. Tissue Cell 2024; 91:102566. [PMID: 39341007 DOI: 10.1016/j.tice.2024.102566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2024] [Revised: 09/11/2024] [Accepted: 09/11/2024] [Indexed: 09/30/2024]
Abstract
BACKGROUND Heterophyllin B (HB) is a cyclic peptide with anti-neoplastic effect on many cancers. However, its effect and mechanism of action in ovarian cancer cells are still unknown. PURPOSE The primary objective of this study was to assess the impact of HB on the proliferation of ovarian cancer (OC) cells and delve into the underlying mechanisms involved. METHODS We performed CCK-8 assays, HE staining, KI67 staining, clonogenic formation assays, Annexin V-FITC/PI staining, tumor invasion assays, and migration assays to detect the effects of HB on cell viability, proliferation, apoptosis, migration, and invasion in ovarian cancer cells. Additionally, real-time fluorescent quantitative PCR (qPCR) and Western blotting were utilized for verification. The expression of NF-E2-related factor 2 (NRF2) and heme oxygenase 1 (HMOX1/HO-1) signaling molecules was detected using qPCR and Western blotting. A specific inducer, Hemin, was used to activate HO-1 and Nrf2 overexpression, in order to verify the pharmacological mechanism of HB on ovarian cancer cells. The binding relationship between HB and NRF2 was investigated through molecular docking. RESULTS HB treatment inhibited the viability of OC cells, meanwhile it showed suppressive effect on the proliferation, migration, and invasion of OC cells, Meanwhile, HB could promote the apoptosis of tumor cells. For the mechanisms, we found that HB treatment could significantly down-regulate the levels of NRF2/HO-1. Consistent with the results of in vitro experiments, administration of HB significantly delayed tumor growth in OVCAR8 xenografted nude mice, and inhibited the expression of Ki67, Nrf2 and HO-1. CONCLUSION This study demonstrated that HB had anti-neoplastic effect on OC by inhibiting Nrf2/HO-1 signaling pathway and may be a potential drug for the treatment of OC.
Collapse
Affiliation(s)
- Linyu Shi
- Department of Obstetrics and Gynecology, Affiliated Hospital of Nantong University, No.20, Xisi Road, Nantong, Jiangsu Province 226001, China
| | - Xiaoyu Zhang
- Department of Gastroenterology and Hepatology, Affiliated Hospital of Nantong University, Nantong, Jiangsu Province 226001, China
| | - Liming Mao
- Department of Immunology, School of Medicine, Nantong University, 19 Qixiu Road, Nantong, Jiangsu 226001, China; Basic Medical Research Center, School of Medicine, Nantong University, Nantong 226019, China
| | - Yuquan Zhang
- Department of Obstetrics and Gynecology, Affiliated Hospital of Nantong University, No.20, Xisi Road, Nantong, Jiangsu Province 226001, China.
| |
Collapse
|
4
|
Herrera-Luis E, Martin-Almeida M, Pino-Yanes M. Asthma-Genomic Advances Toward Risk Prediction. Clin Chest Med 2024; 45:599-610. [PMID: 39069324 PMCID: PMC11284279 DOI: 10.1016/j.ccm.2024.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Asthma is a common complex airway disease whose prediction of disease risk and most severe outcomes is crucial in clinical practice for adequate clinical management. This review discusses the latest findings in asthma genomics and current obstacles faced in moving forward to translational medicine. While genome-wide association studies have provided valuable insights into the genetic basis of asthma, there are challenges that must be addressed to improve disease prediction, such as the need for diverse representation, the functional characterization of genetic variants identified, variant selection for genetic testing, and refining prediction models using polygenic risk scores.
Collapse
Affiliation(s)
- Esther Herrera-Luis
- Department of Epidemiology, Bloomberg School of Public Health, Johns Hopkins University, 615 N Wolfe Street, Baltimore, MD 21205, USA.
| | - Mario Martin-Almeida
- Genomics and Health Group, Department of Biochemistry, Microbiology, Cell Biology and Genetics, Universidad de La Laguna (ULL), Avenida Astrofísico Francisco Sánchez, s/n. Facultad de Ciencias, San Cristóbal de La Laguna, S/C de Tenerife La Laguna 38200, Tenerife, Spain
| | - Maria Pino-Yanes
- Genomics and Health Group, Department of Biochemistry, Microbiology, Cell Biology and Genetics, Universidad de La Laguna (ULL), Avenida Astrofísico Francisco Sánchez, s/n. Facultad de Ciencias, San Cristóbal de La Laguna, S/C de Tenerife La Laguna 38200, Tenerife, Spain; CIBER de Enfermedades Respiratorias, Instituto de Salud Carlos III, Madrid 28029, Spain; Instituto de Tecnologías Biomédicas (ITB), Universidad de La Laguna (ULL), San Cristóbal de La Laguna 38200, Tenerife, Spain
| |
Collapse
|
5
|
Boye C, Nirmalan S, Ranjbaran A, Luca F. Genotype × environment interactions in gene regulation and complex traits. Nat Genet 2024; 56:1057-1068. [PMID: 38858456 PMCID: PMC11492161 DOI: 10.1038/s41588-024-01776-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Accepted: 04/25/2024] [Indexed: 06/12/2024]
Abstract
Genotype × environment interactions (GxE) have long been recognized as a key mechanism underlying human phenotypic variation. Technological developments over the past 15 years have dramatically expanded our appreciation of the role of GxE in both gene regulation and complex traits. The richness and complexity of these datasets also required parallel efforts to develop robust and sensitive statistical and computational approaches. Although our understanding of the genetic architecture of molecular and complex traits has been maturing, a large proportion of complex trait heritability remains unexplained. Furthermore, there are increasing efforts to characterize the effect of environmental exposure on human health. We therefore review GxE in human gene regulation and complex traits, advocating for a comprehensive approach that jointly considers genetic and environmental factors in human health and disease.
Collapse
Affiliation(s)
- Carly Boye
- Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI, US
| | - Shreya Nirmalan
- Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI, US
| | - Ali Ranjbaran
- Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI, US
| | - Francesca Luca
- Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI, US.
- Department of Obstetrics and Gynecology, Wayne State University, Detroit, MI, US.
- Department of Biology, University of Rome "Tor Vergata", Rome, Italy.
| |
Collapse
|
6
|
Zhai S, Mehrotra DV, Shen J. Applying polygenic risk score methods to pharmacogenomics GWAS: challenges and opportunities. Brief Bioinform 2023; 25:bbad470. [PMID: 38152980 PMCID: PMC10782924 DOI: 10.1093/bib/bbad470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 11/20/2023] [Accepted: 11/28/2023] [Indexed: 12/29/2023] Open
Abstract
Polygenic risk scores (PRSs) have emerged as promising tools for the prediction of human diseases and complex traits in disease genome-wide association studies (GWAS). Applying PRSs to pharmacogenomics (PGx) studies has begun to show great potential for improving patient stratification and drug response prediction. However, there are unique challenges that arise when applying PRSs to PGx GWAS beyond those typically encountered in disease GWAS (e.g. Eurocentric or trans-ethnic bias). These challenges include: (i) the lack of knowledge about whether PGx or disease GWAS/variants should be used in the base cohort (BC); (ii) the small sample sizes in PGx GWAS with corresponding low power and (iii) the more complex PRS statistical modeling required for handling both prognostic and predictive effects simultaneously. To gain insights in this landscape about the general trends, challenges and possible solutions, we first conduct a systematic review of both PRS applications and PRS method development in PGx GWAS. To further address the challenges, we propose (i) a novel PRS application strategy by leveraging both PGx and disease GWAS summary statistics in the BC for PRS construction and (ii) a new Bayesian method (PRS-PGx-Bayesx) to reduce Eurocentric or cross-population PRS prediction bias. Extensive simulations are conducted to demonstrate their advantages over existing PRS methods applied in PGx GWAS. Our systematic review and methodology research work not only highlights current gaps and key considerations while applying PRS methods to PGx GWAS, but also provides possible solutions for better PGx PRS applications and future research.
Collapse
Affiliation(s)
- Song Zhai
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., Rahway, NJ 07065, USA
| | - Devan V Mehrotra
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., North Wales, PA 19454, USA
| | - Judong Shen
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., Rahway, NJ 07065, USA
| |
Collapse
|
7
|
Liu X, Morelli D, Littlejohns TJ, Clifton DA, Clifton L. Combining machine learning with Cox models to identify predictors for incident post-menopausal breast cancer in the UK Biobank. Sci Rep 2023; 13:9221. [PMID: 37286615 DOI: 10.1038/s41598-023-36214-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Accepted: 05/31/2023] [Indexed: 06/09/2023] Open
Abstract
We aimed to identify potential novel predictors for breast cancer among post-menopausal women, with pre-specified interest in the role of polygenic risk scores (PRS) for risk prediction. We utilised an analysis pipeline where machine learning was used for feature selection, prior to risk prediction by classical statistical models. An "extreme gradient boosting" (XGBoost) machine with Shapley feature-importance measures were used for feature selection among [Formula: see text] 1.7 k features in 104,313 post-menopausal women from the UK Biobank. We constructed and compared the "augmented" Cox model (incorporating the two PRS, known and novel predictors) with a "baseline" Cox model (incorporating the two PRS and known predictors) for risk prediction. Both of the two PRS were significant in the augmented Cox model ([Formula: see text]). XGBoost identified 10 novel features, among which five showed significant associations with post-menopausal breast cancer: plasma urea (HR = 0.95, 95% CI 0.92-0.98, [Formula: see text]), plasma phosphate (HR = 0.68, 95% CI 0.53-0.88, [Formula: see text]), basal metabolic rate (HR = 1.17, 95% CI 1.11-1.24, [Formula: see text]), red blood cell count (HR = 1.21, 95% CI 1.08-1.35, [Formula: see text]), and creatinine in urine (HR = 1.05, 95% CI 1.01-1.09, [Formula: see text]). Risk discrimination was maintained in the augmented Cox model, yielding C-index 0.673 vs 0.667 (baseline Cox model) with the training data and 0.665 vs 0.664 with the test data. We identified blood/urine biomarkers as potential novel predictors for post-menopausal breast cancer. Our findings provide new insights to breast cancer risk. Future research should validate novel predictors, investigate using multiple PRS and more precise anthropometry measures for better breast cancer risk prediction.
Collapse
Affiliation(s)
- Xiaonan Liu
- Nuffield Department of Population Health, University of Oxford, Oxford, UK.
- Big Data Institute, University of Oxford, Old Road Campus, Oxford, OX3 7LF, UK.
| | - Davide Morelli
- Department of Engineering Science, University of Oxford, Oxford, UK
| | - Thomas J Littlejohns
- Nuffield Department of Population Health, University of Oxford, Oxford, UK
- Big Data Institute, University of Oxford, Old Road Campus, Oxford, OX3 7LF, UK
| | - David A Clifton
- Department of Engineering Science, University of Oxford, Oxford, UK
| | - Lei Clifton
- Nuffield Department of Population Health, University of Oxford, Oxford, UK
- Big Data Institute, University of Oxford, Old Road Campus, Oxford, OX3 7LF, UK
| |
Collapse
|