Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Levy JJ, Titus AJ, Petersen CL, Chen Y, Salas LA, Christensen BC. MethylNet: an automated and modular deep learning approach for DNA methylation analysis. BMC Bioinformatics 2020;21:108. [PMID: 32183722 PMCID: PMC7076991 DOI: 10.1186/s12859-020-3443-8] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Accepted: 03/04/2020] [Indexed: 12/13/2022] Open

For:	Levy JJ, Titus AJ, Petersen CL, Chen Y, Salas LA, Christensen BC. MethylNet: an automated and modular deep learning approach for DNA methylation analysis. BMC Bioinformatics 2020;21:108. [PMID: 32183722 PMCID: PMC7076991 DOI: 10.1186/s12859-020-3443-8] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Accepted: 03/04/2020] [Indexed: 12/13/2022] Open

Number

Cited by Other Article(s)

Yap CX, Vo DD, Heffel MG, Bhattacharya A, Wen C, Yang Y, Kemper KE, Zeng J, Zheng Z, Zhu Z, Hannon E, Vellame DS, Franklin A, Caggiano C, Wamsley B, Geschwind DH, Zaitlen N, Gusev A, Pasaniuc B, Mill J, Luo C, Gandal MJ. Brain cell-type shifts in Alzheimer's disease, autism, and schizophrenia interrogated using methylomics and genetics. SCIENCE ADVANCES 2024;10:eadn7655. [PMID: 38781333 PMCID: PMC11114225 DOI: 10.1126/sciadv.adn7655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Accepted: 03/14/2024] [Indexed: 05/25/2024]

Affiliation(s)

Chloe X. Yap Mater Research Institute, University of Queensland, Brisbane, Queensland, Australia Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia Department of Psychiatry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Program in Neurobehavioral Genetics, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
Daniel D. Vo Department of Psychiatry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Program in Neurobehavioral Genetics, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Lifespan Brain Institute at Penn Medicine and The Children’s Hospital of Philadelphia, Department of Psychiatry, University of Pennsylvania, Philadelphia, PA, USA Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Matthew G. Heffel Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, CA, USA
Arjun Bhattacharya Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, CA, USA Institute for Quantitative and Computational Biosciences, David Geffen School of Medicine, University of California, Los Angeles, CA, USA Department of Epidemiology, University of Texas MD Anderson Cancer Center, Houston, TX, USA Institute for Data Science in Oncology, University of Texas MD Anderson Cancer Center, Houston, TX, USA
Cindy Wen Department of Psychiatry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Program in Neurobehavioral Genetics, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
Yuanhao Yang Mater Research Institute, University of Queensland, Brisbane, Queensland, Australia Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia
Kathryn E. Kemper Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia
Jian Zeng Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia
Zhili Zheng Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia
Zhihong Zhu Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia The National Centre for Register-based Research, Aarhus University, Denmark
Eilis Hannon Department of Clinical and Biomedical Sciences, University of Exeter Medical School, University of Exeter, Exeter, UK
Dorothea Seiler Vellame Department of Clinical and Biomedical Sciences, University of Exeter Medical School, University of Exeter, Exeter, UK
Alice Franklin Department of Clinical and Biomedical Sciences, University of Exeter Medical School, University of Exeter, Exeter, UK
Christa Caggiano Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, CA, USA Department of Neurology, University of California Los Angeles, Los Angeles, CA, USA
Brie Wamsley Department of Psychiatry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Program in Neurobehavioral Genetics, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Department of Neurology, University of California Los Angeles, Los Angeles, CA, USA Center for Autism Research and Treatment, Semel Institute, University of California, Los Angeles, Los Angeles, CA, USA
Daniel H. Geschwind Department of Psychiatry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Program in Neurobehavioral Genetics, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Department of Neurology, University of California Los Angeles, Los Angeles, CA, USA Center for Autism Research and Treatment, Semel Institute, University of California, Los Angeles, Los Angeles, CA, USA
Noah Zaitlen Department of Neurology, University of California Los Angeles, Los Angeles, CA, USA Department of Computational Medicine, University of California Los Angeles, Los Angeles, CA, USA
Alexander Gusev Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA, USA Division of Genetics, Brigham & Women’s Hospital, Boston, MA, USA Medical and Population Genetics, Broad Institute, Cambridge, MA, USA
Bogdan Pasaniuc Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, CA, USA Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, CA, USA Department of Computational Medicine, University of California Los Angeles, Los Angeles, CA, USA Institute for Precision Health, University of California, Los Angeles, Los Angeles, CA, USA
Jonathan Mill Department of Clinical and Biomedical Sciences, University of Exeter Medical School, University of Exeter, Exeter, UK
Chongyuan Luo Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
Michael J. Gandal Department of Psychiatry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Program in Neurobehavioral Genetics, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA Lifespan Brain Institute at Penn Medicine and The Children’s Hospital of Philadelphia, Department of Psychiatry, University of Pennsylvania, Philadelphia, PA, USA Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA

Collapse

Tao X, Zhu Z, Wang L, Li C, Sun L, Wang W, Gong W. Biomarkers of Aging and Relevant Evaluation Techniques: A Comprehensive Review. Aging Dis 2024;15:977-1005. [PMID: 37611906 PMCID: PMC11081160 DOI: 10.14336/ad.2023.00808-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2023] [Accepted: 08/08/2023] [Indexed: 08/25/2023] Open

Zhao J, Li H, Qu J, Zong X, Liu Y, Kuang Z, Wang H. A multi-organization epigenetic age prediction based on a channel attention perceptron networks. Front Genet 2024;15:1393856. [PMID: 38725481 PMCID: PMC11080615 DOI: 10.3389/fgene.2024.1393856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Accepted: 04/09/2024] [Indexed: 05/12/2024] Open

Meng D, Zhang S, Huang Y, Mao K, Han JDJ. Application of AI in biological age prediction. Curr Opin Struct Biol 2024;85:102777. [PMID: 38310737 DOI: 10.1016/j.sbi.2024.102777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 12/12/2023] [Accepted: 01/15/2024] [Indexed: 02/06/2024]

Klauschen F, Dippel J, Keyl P, Jurmeister P, Bockmayr M, Mock A, Buchstab O, Alber M, Ruff L, Montavon G, Müller KR. Toward Explainable Artificial Intelligence for Precision Pathology. ANNUAL REVIEW OF PATHOLOGY 2024;19:541-570. [PMID: 37871132 DOI: 10.1146/annurev-pathmechdis-051222-113147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]

Affiliation(s)

Frederick Klauschen Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany; Institute of Pathology, Charité Universitätsmedizin Berlin, Berlin, Germany Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Germany German Cancer Consortium, German Cancer Research Center (DKTK/DKFZ), Munich Partner Site, Munich, Germany
Jonas Dippel Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Germany Machine Learning Group, Department of Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, Germany;
Philipp Keyl Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany;
Philipp Jurmeister Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany; German Cancer Consortium, German Cancer Research Center (DKTK/DKFZ), Munich Partner Site, Munich, Germany
Michael Bockmayr Institute of Pathology, Charité Universitätsmedizin Berlin, Berlin, Germany Department of Pediatric Hematology and Oncology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany Research Institute Children's Cancer Center Hamburg, Hamburg, Germany
Andreas Mock Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany; German Cancer Consortium, German Cancer Research Center (DKTK/DKFZ), Munich Partner Site, Munich, Germany
Oliver Buchstab Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany;
Maximilian Alber Institute of Pathology, Charité Universitätsmedizin Berlin, Berlin, Germany Aignostics, Berlin, Germany
Lukas Ruff Aignostics, Berlin, Germany
Grégoire Montavon Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Germany Machine Learning Group, Department of Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, Germany; Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
Klaus-Robert Müller Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Germany Machine Learning Group, Department of Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, Germany; Department of Artificial Intelligence, Korea University, Seoul, Korea Max Planck Institute for Informatics, Saarbrücken, Germany

Collapse

Prosz A, Pipek O, Börcsök J, Palla G, Szallasi Z, Spisak S, Csabai I. Biologically informed deep learning for explainable epigenetic clocks. Sci Rep 2024;14:1306. [PMID: 38225268 PMCID: PMC10789766 DOI: 10.1038/s41598-023-50495-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 12/20/2023] [Indexed: 01/17/2024] Open

Kalyakulina A, Yusipov I, Moskalev A, Franceschi C, Ivanchenko M. eXplainable Artificial Intelligence (XAI) in aging clock models. Ageing Res Rev 2024;93:102144. [PMID: 38030090 DOI: 10.1016/j.arr.2023.102144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 11/07/2023] [Accepted: 11/23/2023] [Indexed: 12/01/2023]

Shi L, Hai B, Kuang Z, Wang H, Zhao J. ResnetAge: A Resnet-Based DNA Methylation Age Prediction Method. Bioengineering (Basel) 2023;11:34. [PMID: 38247911 PMCID: PMC10813502 DOI: 10.3390/bioengineering11010034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 12/13/2023] [Accepted: 12/26/2023] [Indexed: 01/23/2024] Open

Hughes BK, Wallis R, Bishop CL. Yearning for machine learning: applications for the classification and characterisation of senescence. Cell Tissue Res 2023;394:1-16. [PMID: 37016180 PMCID: PMC10558380 DOI: 10.1007/s00441-023-03768-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Accepted: 03/05/2023] [Indexed: 04/06/2023]

Yassi M, Chatterjee A, Parry M. Application of deep learning in cancer epigenetics through DNA methylation analysis. Brief Bioinform 2023;24:bbad411. [PMID: 37985455 PMCID: PMC10661960 DOI: 10.1093/bib/bbad411] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 10/08/2023] [Accepted: 10/25/2023] [Indexed: 11/22/2023] Open

Martínez-Enguita D, Dwivedi SK, Jörnsten R, Gustafsson M. NCAE: data-driven representations using a deep network-coherent DNA methylation autoencoder identify robust disease and risk factor signatures. Brief Bioinform 2023;24:bbad293. [PMID: 37587790 PMCID: PMC10516364 DOI: 10.1093/bib/bbad293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 07/25/2023] [Accepted: 07/29/2023] [Indexed: 08/18/2023] Open

Yuan T, Edelmann D, Fan Z, Alwers E, Kather JN, Brenner H, Hoffmeister M. Machine learning in the identification of prognostic DNA methylation biomarkers among patients with cancer: A systematic review of epigenome-wide studies. Artif Intell Med 2023;143:102589. [PMID: 37673571 DOI: 10.1016/j.artmed.2023.102589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 04/19/2023] [Accepted: 04/30/2023] [Indexed: 09/08/2023]

Abstract

BACKGROUND

DNA methylation biomarkers have great potential in improving prognostic classification systems for patients with cancer. Machine learning (ML)-based analytic techniques might help overcome the challenges of analyzing high-dimensional data in relatively small sample sizes. This systematic review summarizes the current use of ML-based methods in epigenome-wide studies for the identification of DNA methylation signatures associated with cancer prognosis.

METHODS

We searched three electronic databases including PubMed, EMBASE, and Web of Science for articles published until 2 January 2023. ML-based methods and workflows used to identify DNA methylation signatures associated with cancer prognosis were extracted and summarized. Two authors independently assessed the methodological quality of included studies by a seven-item checklist adapted from 'A Tool to Assess Risk of Bias and Applicability of Prediction Model Studies (PROBAST)' and from the 'Reporting Recommendations for Tumor Marker Prognostic Studies (REMARK). Different ML methods and workflows used in included studies were summarized and visualized by a sunburst chart, a bubble chart, and Sankey diagrams, respectively.

RESULTS

Eighty-three studies were included in this review. Three major types of ML-based workflows were identified. 1) unsupervised clustering, 2) supervised feature selection, and 3) deep learning-based feature transformation. For the three workflows, the most frequently used ML techniques were consensus clustering, least absolute shrinkage and selection operator (LASSO), and autoencoder, respectively. The systematic review revealed that the performance of these approaches has not been adequately evaluated yet and that methodological and reporting flaws were common in the identified studies using ML techniques.

CONCLUSIONS

There is great heterogeneity in ML-based methodological strategies used by epigenome-wide studies to identify DNA methylation markers associated with cancer prognosis. In theory, most existing workflows could not handle the high multi-collinearity and potentially non-linearity interactions in epigenome-wide DNA methylation data. Benchmarking studies are needed to compare the relative performance of various approaches for specific cancer types. Adherence to relevant methodological and reporting guidelines are urgently needed.

Collapse

Park S, Rehman MU, Ullah F, Tayara H, Chong KT. iCpG-Pos: an accurate computational approach for identification of CpG sites using positional features on single-cell whole genome sequence data. Bioinformatics 2023;39:btad474. [PMID: 37555812 PMCID: PMC10444964 DOI: 10.1093/bioinformatics/btad474] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Revised: 05/11/2023] [Accepted: 08/08/2023] [Indexed: 08/10/2023] Open

Abstract

MOTIVATION

The investigation of DNA methylation can shed light on the processes underlying human well-being and help determine overall human health. However, insufficient coverage makes it challenging to implement single-stranded DNA methylation sequencing technologies, highlighting the need for an efficient prediction model. Models are required to create an understanding of the underlying biological systems and to project single-cell (methylated) data accurately.

RESULTS

In this study, we developed positional features for predicting CpG sites. Positional characteristics of the sequence are derived using data from CpG regions and the separation between nearby CpG sites. Multiple optimized classifiers and different ensemble learning approaches are evaluated. The OPTUNA framework is used to optimize the algorithms. The CatBoost algorithm followed by the stacking algorithm outperformed existing DNA methylation identifiers.

AVAILABILITY AND IMPLEMENTATION

The data and methodologies used in this study are openly accessible to the research community. Researchers can access the positional features and algorithms used for predicting CpG site methylation patterns. To achieve superior performance, we employed the CatBoost algorithm followed by the stacking algorithm, which outperformed existing DNA methylation identifiers. The proposed iCpG-Pos approach utilizes only positional features, resulting in a substantial reduction in computational complexity compared to other known approaches for detecting CpG site methylation patterns. In conclusion, our study introduces a novel approach, iCpG-Pos, for predicting CpG site methylation patterns. By focusing on positional features, our model offers both accuracy and efficiency, making it a promising tool for advancing DNA methylation research and its applications in human health and well-being.

Collapse

Gedefaw L, Liu CF, Ip RKL, Tse HF, Yeung MHY, Yip SP, Huang CL. Artificial Intelligence-Assisted Diagnostic Cytology and Genomic Testing for Hematologic Disorders. Cells 2023;12:1755. [PMID: 37443789 PMCID: PMC10340428 DOI: 10.3390/cells12131755] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 06/21/2023] [Accepted: 06/28/2023] [Indexed: 07/15/2023] Open

Beaude A, Rafiee Vahid M, Augé F, Zehraoui F, Hanczar B. AttOmics: attention-based architecture for diagnosis and prognosis from omics data. Bioinformatics 2023;39:i94-i102. [PMID: 37387182 DOI: 10.1093/bioinformatics/btad232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023] Open

Li T, Li Y, Zhu X, He Y, Wu Y, Ying T, Xie Z. Artificial intelligence in cancer immunotherapy: Applications in neoantigen recognition, antibody design and immunotherapy response prediction. Semin Cancer Biol 2023;91:50-69. [PMID: 36870459 DOI: 10.1016/j.semcancer.2023.02.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 02/13/2023] [Accepted: 02/28/2023] [Indexed: 03/06/2023]

Sugino RP, Ohira M, Mansai SP, Kamijo T. Comparative epigenomics by machine learning approach for neuroblastoma. BMC Genomics 2022;23:852. [PMID: 36572864 PMCID: PMC9793522 DOI: 10.1186/s12864-022-09061-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Accepted: 12/02/2022] [Indexed: 12/28/2022] Open

Fryett JJ, Morris AP, Cordell HJ. Investigating the prediction of CpG methylation levels from SNP genotype data to help elucidate relationships between methylation, gene expression and complex traits. Genet Epidemiol 2022;46:629-643. [PMID: 35930604 PMCID: PMC9804820 DOI: 10.1002/gepi.22496] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 06/27/2022] [Accepted: 07/19/2022] [Indexed: 01/09/2023]

de Lima Camillo LP, Lapierre LR, Singh R. A pan-tissue DNA-methylation epigenetic clock based on deep learning. NPJ AGING 2022. [PMCID: PMC9158789 DOI: 10.1038/s41514-022-00085-y] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Abstract AbstractSeveral age predictors based on DNA methylation, dubbed epigenetic clocks, have been created in recent years, with the vast majority based on regularized linear regression. This study explores the improvement in the performance and interpretation of epigenetic clocks using deep learning. First, we gathered 142 publicly available data sets from several human tissues to develop AltumAge, a neural network framework that is a highly accurate and precise age predictor. Compared to ElasticNet, AltumAge performs better for within-data set and cross-data set age prediction, being particularly more generalizable in older ages and new tissue types. We then used deep learning interpretation methods to learn which methylation sites contributed to the final model predictions. We observe that while most important CpG sites are linearly related to age, some highly-interacting CpG sites can influence the relevance of such relationships. Using chromatin annotations, we show that the CpG sites with the highest contribution to the model predictions were related to gene regulatory regions in the genome, including proximity to CTCF binding sites. We also found age-related KEGG pathways for genes containing these CpG sites. Lastly, we performed downstream analyses of AltumAge to explore its applicability and compare its age acceleration with Horvath’s 2013 model. We show that our neural network approach predicts higher age acceleration for tumors, for cells that exhibit age-related changes in vitro, such as immune and mitochondrial dysfunction, and for samples from patients with multiple sclerosis, type 2 diabetes, and HIV, among other conditions. Altogether, our neural network approach provides significant improvement and flexibility compared to current epigenetic clocks for both performance and model interpretability. Collapse

Li A, Koch Z, Ideker T. Epigenetic aging: Biological age prediction and informing a mechanistic theory of aging. J Intern Med 2022;292:733-744. [PMID: 35726002 DOI: 10.1111/joim.13533] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]

Chen L, Saykin AJ, Yao B, Zhao F. Multi-task deep autoencoder to predict Alzheimer's disease progression using temporal DNA methylation data in peripheral blood. Comput Struct Biotechnol J 2022;20:5761-5774. [PMID: 36756173 PMCID: PMC9619306 DOI: 10.1016/j.csbj.2022.10.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 10/10/2022] [Accepted: 10/11/2022] [Indexed: 11/03/2022] Open

Kalyakulina A, Yusipov I, Bacalini MG, Franceschi C, Vedunova M, Ivanchenko M. Disease classification for whole-blood DNA methylation: Meta-analysis, missing values imputation, and XAI. Gigascience 2022;11:giac097. [PMID: 36259657 PMCID: PMC9718659 DOI: 10.1093/gigascience/giac097] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 08/01/2022] [Accepted: 09/15/2022] [Indexed: 07/25/2023] Open

Deep Learning in High Voltage Engineering: A Literature Review. ENERGIES 2022. [DOI: 10.3390/en15145005] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Jeong Y, de Andrade E Sousa LB, Thalmeier D, Toth R, Ganslmeier M, Breuer K, Plass C, Lutsik P. Systematic evaluation of cell-type deconvolution pipelines for sequencing-based bulk DNA methylomes. Brief Bioinform 2022;23:6632618. [PMID: 35794707 PMCID: PMC9294431 DOI: 10.1093/bib/bbac248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 05/18/2022] [Accepted: 05/26/2022] [Indexed: 11/18/2022] Open

Crawford J, Christensen BC, Chikina M, Greene CS. Widespread redundancy in -omics profiles of cancer mutation states. Genome Biol 2022;23:137. [PMID: 35761387 PMCID: PMC9238138 DOI: 10.1186/s13059-022-02705-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 06/14/2022] [Indexed: 02/04/2023] Open

Lin Y, Li H, Xiao X, Zhang L, Wang K, Zhao J, Wang M, Zheng F, Zhang M, Yang W, Han J, Yu R. DAISM-DNN^XMBD: Highly accurate cell type proportion estimation with in silico data augmentation and deep neural networks. PATTERNS (NEW YORK, N.Y.) 2022;3:100440. [PMID: 35510186 PMCID: PMC9058910 DOI: 10.1016/j.patter.2022.100440] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Revised: 09/29/2021] [Accepted: 01/06/2022] [Indexed: 12/31/2022]

Abstract

Understanding the immune cell abundance of cancer and other disease-related tissues has an important role in guiding disease treatments. Computational cell type proportion estimation methods have been previously developed to derive such information from bulk RNA sequencing data. Unfortunately, our results show that the performance of these methods can be seriously plagued by the mismatch between training data and real-world data. To tackle this issue, we propose the DAISM-DNN^XMBD (XMBD: Xiamen Big Data, a biomedical open software initiative in the National Institute for Data Science in Health and Medicine, Xiamen University, China.) (denoted as DAISM-DNN) pipeline that trains a deep neural network (DNN) with dataset-specific training data populated from a certain amount of calibrated samples using DAISM, a novel data augmentation method with an in silico mixing strategy. The evaluation results demonstrate that the DAISM-DNN pipeline outperforms other existing methods consistently and substantially for all the cell types under evaluation in real-world datasets.

•

We propose a data augmentation method (DAISM) for DNN-based cell type deconvolution

•

DAISM-DNN enables accurate cell type deconvolution with dataset-specific training data

•

DAISM-DNN is robust to random errors in calibration samples

•

Trained DAISM-DNN model is reusable across biomedical experiments following same SOP

Computational cell type deconvolution methods were developed to understand the cellular heterogeneity in disease-related tissues from bulk RNA-seq data. Due to the presence of strong batch effects, the performance of existing methods could fluctuate greatly when applied to different datasets even with the latest development in batch normalization or platform-agnostic signature designs. To tackle this issue, we proposed a DNN-based cell abundance estimation method with dataset-specific training data populated from a certain number of calibrated samples from a target dataset using DAISM, a data augmentation method using an in silico mixing strategy. DAISM-DNN enables accurate cell type proportions prediction and is robust to random errors in the ground truth cell type proportions of calibration samples. Importantly, we showed that with strict SOPs, it is possible to create a “train once, reuse many times” DAISM-DNN model for multiple biomedical experiments without the need for retraining.

Collapse

Cristoferi I, Giacon TA, Boer K, van Baardwijk M, Neri F, Campisi M, Kimenai HJAN, Clahsen-van Groningen MC, Pavanello S, Furian L, Minnee RC. The applications of DNA methylation as a biomarker in kidney transplantation: a systematic review. Clin Epigenetics 2022;14:20. [PMID: 35130936 PMCID: PMC8822833 DOI: 10.1186/s13148-022-01241-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Accepted: 01/27/2022] [Indexed: 12/27/2022] Open

Abstract

Background

Although kidney transplantation improves patient survival and quality of life, long-term results are hampered by both immune- and non-immune-mediated complications. Current biomarkers of post-transplant complications, such as allograft rejection, chronic renal allograft dysfunction, and cutaneous squamous cell carcinoma, have a suboptimal predictive value. DNA methylation is an epigenetic modification that directly affects gene expression and plays an important role in processes such as ischemia/reperfusion injury, fibrosis, and alloreactive immune response. Novel techniques can quickly assess the DNA methylation status of multiple loci in different cell types, allowing a deep and interesting study of cells’ activity and function. Therefore, DNA methylation has the potential to become an important biomarker for prediction and monitoring in kidney transplantation.

Purpose of the study

The aim of this study was to evaluate the role of DNA methylation as a potential biomarker of graft survival and complications development in kidney transplantation.

Material and Methods

A systematic review of several databases has been conducted. The Newcastle–Ottawa scale and the Jadad scale have been used to assess the risk of bias for observational and randomized studies, respectively.

Results

Twenty articles reporting on DNA methylation as a biomarker for kidney transplantation were included, all using DNA methylation for prediction and monitoring. DNA methylation pattern alterations in cells isolated from different tissues, such as kidney biopsies, urine, and blood, have been associated with ischemia–reperfusion injury and chronic renal allograft dysfunction. These alterations occurred in different and specific loci. DNA methylation status has also proved to be important for immune response modulation, having a crucial role in regulatory T cell definition and activity. Research also focused on a better understanding of the role of this epigenetic modification assessment for regulatory T cells isolation and expansion for future tolerance induction-oriented therapies.

Conclusions

Studies included in this review are heterogeneous in study design, biological samples, and outcome. More coordinated investigations are needed to affirm DNA methylation as a clinically relevant biomarker important for prevention, monitoring, and intervention.

Supplementary Information

The online version contains supplementary material available at 10.1186/s13148-022-01241-7.

Collapse

Affiliation(s)

Iacopo Cristoferi Division of HPB and Transplant Surgery, Department of Surgery, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, the Netherlands. .,Department of Pathology and Clinical Bioinformatics, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, the Netherlands. .,Erasmus MC Transplant Institute, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, the Netherlands.
Tommaso Antonio Giacon Kidney and Pancreas Transplantation Unit, Department of Surgical, Oncological and Gastroenterological Sciences, Padua University Hospital, Via Giustiniani 2, 35128, Padua, Italy.,Occupational Medicine, Department of Cardiac, Thoracic, Vascular Sciences and Public Health, Padua University, Via Giustiniani 2, 35128, Padua, Italy.,Environmental and Respiratory Physiology Laboratory, Department of Biomedical Sciences, Padua University, Via Marzolo 3, 35131, Padua, Italy.,Institute of Anaesthesia and Intensive Care, Department of Medicine - DIMED, Padua University Hospital, Via Cesare Battisti 267, 35128, Padua, Italy
Karin Boer Erasmus MC Transplant Institute, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, the Netherlands.,Division of Nephrology and Transplantation, Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, The Netherlands
Myrthe van Baardwijk Division of HPB and Transplant Surgery, Department of Surgery, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, the Netherlands.,Department of Pathology and Clinical Bioinformatics, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, the Netherlands.,Erasmus MC Transplant Institute, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, the Netherlands
Flavia Neri Kidney and Pancreas Transplantation Unit, Department of Surgical, Oncological and Gastroenterological Sciences, Padua University Hospital, Via Giustiniani 2, 35128, Padua, Italy
Manuela Campisi Occupational Medicine, Department of Cardiac, Thoracic, Vascular Sciences and Public Health, Padua University, Via Giustiniani 2, 35128, Padua, Italy
Hendrikus J A N Kimenai Division of HPB and Transplant Surgery, Department of Surgery, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, the Netherlands.,Erasmus MC Transplant Institute, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, the Netherlands
Marian C Clahsen-van Groningen Department of Pathology and Clinical Bioinformatics, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, the Netherlands.,Erasmus MC Transplant Institute, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, the Netherlands.,Institute of Experimental Medicine and Systems Biology, RWTH Aachen University, Pauwelsstraße 30, 52074, Aachen, Germany
Sofia Pavanello Occupational Medicine, Department of Cardiac, Thoracic, Vascular Sciences and Public Health, Padua University, Via Giustiniani 2, 35128, Padua, Italy
Lucrezia Furian Kidney and Pancreas Transplantation Unit, Department of Surgical, Oncological and Gastroenterological Sciences, Padua University Hospital, Via Giustiniani 2, 35128, Padua, Italy
Robert C Minnee Division of HPB and Transplant Surgery, Department of Surgery, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, the Netherlands.,Erasmus MC Transplant Institute, Erasmus MC, University Medical Center Rotterdam, Doctor Molewaterplein 40, 3015GD, Rotterdam, the Netherlands

Collapse

Chow YL, Singh S, Carpenter AE, Way GP. Predicting drug polypharmacology from cell morphology readouts using variational autoencoder latent space arithmetic. PLoS Comput Biol 2022;18:e1009888. [PMID: 35213530 PMCID: PMC8906577 DOI: 10.1371/journal.pcbi.1009888] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 03/09/2022] [Accepted: 02/01/2022] [Indexed: 01/13/2023] Open

DNA Methylation Biomarkers-Based Human Age Prediction Using Machine Learning. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022;2022:8393498. [PMID: 35111213 PMCID: PMC8803417 DOI: 10.1155/2022/8393498] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Revised: 11/20/2021] [Accepted: 12/22/2021] [Indexed: 12/28/2022]

Kanapeckaitė A, Burokienė N, Mažeikienė A, Cottrell GS, Widera D. Biophysics is reshaping our perception of the epigenome: from DNA-level to high-throughput studies. BIOPHYSICAL REPORTS 2021;1:100028. [PMID: 36425454 PMCID: PMC9680810 DOI: 10.1016/j.bpr.2021.100028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Accepted: 09/24/2021] [Indexed: 06/16/2023]

Deep Learning for Human Disease Detection, Subtype Classification, and Treatment Response Prediction Using Epigenomic Data. Biomedicines 2021;9:biomedicines9111733. [PMID: 34829962 PMCID: PMC8615388 DOI: 10.3390/biomedicines9111733] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Revised: 10/26/2021] [Accepted: 11/17/2021] [Indexed: 12/25/2022] Open

De Waele G, Clauwaert J, Menschaert G, Waegeman W. CpG Transformer for imputation of single-cell methylomes. Bioinformatics 2021;38:597-603. [PMID: 34718418 PMCID: PMC8756163 DOI: 10.1093/bioinformatics/btab746] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Revised: 10/19/2021] [Accepted: 10/25/2021] [Indexed: 02/03/2023] Open

Watson DS. Interpretable machine learning for genomics. Hum Genet 2021;141:1499-1513. [PMID: 34669035 PMCID: PMC8527313 DOI: 10.1007/s00439-021-02387-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Accepted: 10/08/2021] [Indexed: 12/19/2022]

Caudai C, Galizia A, Geraci F, Le Pera L, Morea V, Salerno E, Via A, Colombo T. AI applications in functional genomics. Comput Struct Biotechnol J 2021;19:5762-5790. [PMID: 34765093 PMCID: PMC8566780 DOI: 10.1016/j.csbj.2021.10.009] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 10/05/2021] [Accepted: 10/05/2021] [Indexed: 12/13/2022] Open

Huang K, Xiao C, Glass LM, Critchlow CW, Gibson G, Sun J. Machine learning applications for therapeutic tasks with genomics data. PATTERNS (NEW YORK, N.Y.) 2021;2:100328. [PMID: 34693370 PMCID: PMC8515011 DOI: 10.1016/j.patter.2021.100328] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]

Tran KA, Kondrashova O, Bradley A, Williams ED, Pearson JV, Waddell N. Deep learning in cancer diagnosis, prognosis and treatment selection. Genome Med 2021;13:152. [PMID: 34579788 PMCID: PMC8477474 DOI: 10.1186/s13073-021-00968-x] [Citation(s) in RCA: 190] [Impact Index Per Article: 63.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2020] [Accepted: 09/12/2021] [Indexed: 12/13/2022] Open

Simpson DJ, Chandra T. Epigenetic age prediction. Aging Cell 2021;20:e13452. [PMID: 34415665 PMCID: PMC8441394 DOI: 10.1111/acel.13452] [Citation(s) in RCA: 56] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Revised: 07/21/2021] [Accepted: 07/27/2021] [Indexed: 12/14/2022] Open

Levy JJ, Chen Y, Azizgolshani N, Petersen CL, Titus AJ, Moen EL, Vaickus LJ, Salas LA, Christensen BC. MethylSPWNet and MethylCapsNet: Biologically Motivated Organization of DNAm Neural Networks, Inspired by Capsule Networks. NPJ Syst Biol Appl 2021;7:33. [PMID: 34417465 PMCID: PMC8379254 DOI: 10.1038/s41540-021-00193-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Accepted: 07/01/2021] [Indexed: 02/07/2023] Open

Barefoot ME, Loyfer N, Kiliti AJ, McDeed AP, Kaplan T, Wellstein A. Detection of Cell Types Contributing to Cancer From Circulating, Cell-Free Methylated DNA. Front Genet 2021;12:671057. [PMID: 34386036 PMCID: PMC8353442 DOI: 10.3389/fgene.2021.671057] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Accepted: 05/17/2021] [Indexed: 12/24/2022] Open

Chaudhari M, Thapa N, Roy K, Newman RH, Saigo H, B K C D. DeepRMethylSite: a deep learning based approach for prediction of arginine methylation sites in proteins. Mol Omics 2021;16:448-454. [PMID: 32555810 DOI: 10.1039/d0mo00025f] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Verifying explainability of a deep learning tissue classifier trained on RNA-seq data. Sci Rep 2021;11:2641. [PMID: 33514769 PMCID: PMC7846764 DOI: 10.1038/s41598-021-81773-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Accepted: 01/11/2021] [Indexed: 12/16/2022] Open

Santoni D, Pignotti D, Vergni D. A genome-wide study on differential methylation in different cancers using TCGA database. INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2021.100542] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open

Smyth LJ, Patterson CC, Swan EJ, Maxwell AP, McKnight AJ. DNA Methylation Associated With Diabetic Kidney Disease in Blood-Derived DNA. Front Cell Dev Biol 2020;8:561907. [PMID: 33178681 PMCID: PMC7593403 DOI: 10.3389/fcell.2020.561907] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Accepted: 09/15/2020] [Indexed: 12/23/2022] Open

Abstract

A subset of individuals with type 1 diabetes will develop diabetic kidney disease (DKD). DKD is heritable and large-scale genome-wide association studies have begun to identify genetic factors that influence DKD. Complementary to genetic factors, we know that a person’s epigenetic profile is also altered with DKD. This study reports analysis of DNA methylation, a major epigenetic feature, evaluating methylome-wide loci for association with DKD. Unique features (n = 485,577; 482,421 CpG probes) were evaluated in blood-derived DNA from carefully phenotyped White European individuals diagnosed with type 1 diabetes with (cases) or without (controls) DKD (n = 677 samples). Explicitly, 150 cases were compared to 100 controls using the 450K array, with subsequent analysis using data previously generated for a further 96 cases and 96 controls on the 27K array, and de novo methylation data generated for replication in 139 cases and 96 controls. Following stringent quality control, raw data were quantile normalized and beta values calculated to reflect the methylation status at each site. The difference in methylation status was evaluated between cases and controls; resultant P-values for array-based data were adjusted for multiple testing. Genes with significantly increased (hypermethylated) and/or decreased (hypomethylated) levels of DNA methylation were considered for biological relevance by functional enrichment analysis using KEGG pathways. Twenty-two loci demonstrated statistically significant fold changes associated with DKD and additional support for these associated loci was sought using independent samples derived from patients recruited with similar inclusion criteria. Markers associated with CCNL1 and ZNF187 genes are supported as differentially regulated loci (P < 10^–8), with evidence also presented for AFF3, which has been identified from a meta-analysis and subsequent replication of genome-wide association studies. Further supporting evidence for differential gene expression in CCNL1 and ZNF187 is presented from kidney biopsy and blood-derived RNA in people with and without kidney disease from NephroSeq. Evidence confirming that methylation sites influence the development of DKD may aid risk prediction tools and stimulate research to identify epigenomic therapies which might be clinically useful for this disease.

Collapse

Levy JJ, O'Malley AJ. Don't dismiss logistic regression: the case for sensible extraction of interactions in the era of machine learning. BMC Med Res Methodol 2020;20:171. [PMID: 32600277 PMCID: PMC7325087 DOI: 10.1186/s12874-020-01046-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 06/10/2020] [Indexed: 01/08/2023] Open

Abstract

BACKGROUND

Machine learning approaches have become increasingly popular modeling techniques, relying on data-driven heuristics to arrive at its solutions. Recent comparisons between these algorithms and traditional statistical modeling techniques have largely ignored the superiority gained by the former approaches due to involvement of model-building search algorithms. This has led to alignment of statistical and machine learning approaches with different types of problems and the under-development of procedures that combine their attributes. In this context, we hoped to understand the domains of applicability for each approach and to identify areas where a marriage between the two approaches is warranted. We then sought to develop a hybrid statistical-machine learning procedure with the best attributes of each.

METHODS

We present three simple examples to illustrate when to use each modeling approach and posit a general framework for combining them into an enhanced logistic regression model building procedure that aids interpretation. We study 556 benchmark machine learning datasets to uncover when machine learning techniques outperformed rudimentary logistic regression models and so are potentially well-equipped to enhance them. We illustrate a software package, InteractionTransformer, which embeds logistic regression with advanced model building capacity by using machine learning algorithms to extract candidate interaction features from a random forest model for inclusion in the model. Finally, we apply our enhanced logistic regression analysis to two real-word biomedical examples, one where predictors vary linearly with the outcome and another with extensive second-order interactions.

RESULTS

Preliminary statistical analysis demonstrated that across 556 benchmark datasets, the random forest approach significantly outperformed the logistic regression approach. We found a statistically significant increase in predictive performance when using hybrid procedures and greater clarity in the association with the outcome of terms acquired compared to directly interpreting the random forest output.

CONCLUSIONS

When a random forest model is closer to the true model, hybrid statistical-machine learning procedures can substantially enhance the performance of statistical procedures in an automated manner while preserving easy interpretation of the results. Such hybrid methods may help facilitate widespread adoption of machine learning techniques in the biomedical setting.

Collapse