Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Durham TJ, Libbrecht MW, Howbert JJ, Bilmes J, Noble WS. PREDICTD PaRallel Epigenomics Data Imputation with Cloud-based Tensor Decomposition. Nat Commun 2018;9:1402. [PMID: 29643364 DOI: 10.1038/s41467-018-03635-9] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2017] [Accepted: 03/02/2018] [Indexed: 11/24/2022] Open

For:	Durham TJ, Libbrecht MW, Howbert JJ, Bilmes J, Noble WS. PREDICTD PaRallel Epigenomics Data Imputation with Cloud-based Tensor Decomposition. Nat Commun 2018;9:1402. [PMID: 29643364 DOI: 10.1038/s41467-018-03635-9] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2017] [Accepted: 03/02/2018] [Indexed: 11/24/2022] Open

Number

Cited by Other Article(s)

Maciejewski E, Horvath S, Ernst J. CMImpute: cross-species and tissue imputation of species-level DNA methylation samples across mammalian species. Genome Biol 2025;26:133. [PMID: 40394556 PMCID: PMC12090574 DOI: 10.1186/s13059-025-03561-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 03/26/2025] [Indexed: 05/22/2025] Open

López-Hernández L, Toolan-Kerr P, Bannister AJ, Millán-Zambrano G. Dynamic histone modification patterns coordinating DNA processes. Mol Cell 2025;85:225-237. [PMID: 39824165 DOI: 10.1016/j.molcel.2024.10.034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2024] [Revised: 10/02/2024] [Accepted: 10/25/2024] [Indexed: 01/20/2025]

Murphy AE, Beardall W, Rei M, Phuycharoen M, Skene NG. Predicting cell type-specific epigenomic profiles accounting for distal genetic effects. Nat Commun 2024;15:9951. [PMID: 39550354 PMCID: PMC11569248 DOI: 10.1038/s41467-024-54441-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Accepted: 11/06/2024] [Indexed: 11/18/2024] Open

Wen W, Zhong J, Zhang Z, Jia L, Chu T, Wang N, Danko CG, Wang Z. dHICA: a deep transformer-based model enables accurate histone imputation from chromatin accessibility. Brief Bioinform 2024;25:bbae459. [PMID: 39316943 PMCID: PMC11421843 DOI: 10.1093/bib/bbae459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Revised: 07/13/2024] [Accepted: 09/04/2024] [Indexed: 09/26/2024] Open

Tan ZC, Meyer AS. The structure is the message: Preserving experimental context through tensor decomposition. Cell Syst 2024;15:679-693. [PMID: 39173584 PMCID: PMC11366223 DOI: 10.1016/j.cels.2024.07.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2024] [Revised: 06/25/2024] [Accepted: 07/22/2024] [Indexed: 08/24/2024]

Min A, Schreiber J, Kundaje A, Noble WS. Predicting chromatin conformation contact maps. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.12.589240. [PMID: 38645064 PMCID: PMC11030330 DOI: 10.1101/2024.04.12.589240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]

Xiang G, Guo Y, Bumcrot D, Sigova A. JMnorm: a novel joint multi-feature normalization method for integrative and comparative epigenomics. Nucleic Acids Res 2024;52:e11. [PMID: 38055833 PMCID: PMC10810286 DOI: 10.1093/nar/gkad1146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 10/25/2023] [Accepted: 11/14/2023] [Indexed: 12/08/2023] Open

Yu X, Zhao H, Wang R, Chen Y, Ouyang X, Li W, Sun Y, Peng A. Cancer epigenetics: from laboratory studies and clinical trials to precision medicine. Cell Death Discov 2024;10:28. [PMID: 38225241 PMCID: PMC10789753 DOI: 10.1038/s41420-024-01803-z] [Citation(s) in RCA: 72] [Impact Index Per Article: 72.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 12/23/2023] [Accepted: 01/04/2024] [Indexed: 01/17/2024] Open

Fadri MTM, Lee JB, Keung AJ. Summary of ChIP-Seq Methods and Description of an Optimized ChIP-Seq Protocol. Methods Mol Biol 2024;2842:419-447. [PMID: 39012609 DOI: 10.1007/978-1-0716-4051-7_22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/17/2024]

Hawkins-Hooker A, Visonà G, Narendra T, Rojas-Carulla M, Schölkopf B, Schweikert G. Getting personal with epigenetics: towards individual-specific epigenomic imputation with machine learning. Nat Commun 2023;14:4750. [PMID: 37550323 PMCID: PMC10406842 DOI: 10.1038/s41467-023-40211-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Accepted: 07/18/2023] [Indexed: 08/09/2023] Open

Schreiber JM, Boix CA, Wook Lee J, Li H, Guan Y, Chang CC, Chang JC, Hawkins-Hooker A, Schölkopf B, Schweikert G, Carulla MR, Canakoglu A, Guzzo F, Nanni L, Masseroli M, Carman MJ, Pinoli P, Hong C, Yip KY, Spence JP, Batra SS, Song YS, Mahony S, Zhang Z, Tan W, Shen Y, Sun Y, Shi M, Adrian J, Sandstrom RS, Farrell NP, Halow JM, Lee K, Jiang L, Yang X, Epstein CB, Strattan JS, Bernstein BE, Snyder MP, Kellis M, Noble WS, Kundaje AB. The ENCODE Imputation Challenge: a critical assessment of methods for cross-cell type imputation of epigenomic profiles. Genome Biol 2023;24:79. [PMID: 37072822 PMCID: PMC10111747 DOI: 10.1186/s13059-023-02915-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Accepted: 03/24/2023] [Indexed: 04/20/2023] Open

Affiliation(s)

Jacob Matthew Schreiber Department of Genetics, Stanford University, Stanford, CA, USA.
Carles A Boix Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA
Jin Wook Lee Department of Genetics, Stanford University, Stanford, CA, USA
Hongyang Li Department of computational medicine and bioinformatics, University of Michigan, Ann Arbor, MI, USA
Yuanfang Guan Department of computational medicine and bioinformatics, University of Michigan, Ann Arbor, MI, USA
Chun-Chieh Chang Department of Research and Development, DeepSeq.AI, San Francisco, CA, USA
Jen-Chien Chang RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
Alex Hawkins-Hooker Department of Empirical Inference, Max Planck Institute for Intelligent Systems, Stuttgart, Germany
Bernhard Schölkopf Department of Empirical Inference, Max Planck Institute for Intelligent Systems, Stuttgart, Germany
Gabriele Schweikert School of Life Sciences, University of Dundee, Dundee, UK
Mateo Rojas Carulla Department of Empirical Inference, Max Planck Institute for Intelligent Systems, Stuttgart, Germany
Arif Canakoglu Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milano, Italy
Francesco Guzzo Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milano, Italy
Luca Nanni Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
Marco Masseroli Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milano, Italy
Mark James Carman Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milano, Italy
Pietro Pinoli Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milano, Italy
Chenyang Hong Department of Computer Science and Engineering, The Chinese University of Hong Kong, Sha Tin, Hong Kong
Kevin Y Yip Sanford Burnham Prebys Medical Discovery Institute, San Diego, CA, USA
Jefrey P Spence Department of Genetics, Stanford University, Stanford, CA, USA
Sanjit Singh Batra Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA, USA
Yun S Song Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA, USA Department of Statistics, University of California, Berkeley, Berkeley, CA, USA
Shaun Mahony Department of Biochemistry & Molecular Biology, Center for Eukaryotic Gene Regulation, Pennsylvania State University, University Park, PA, USA
Zheng Zhang Department of Statistics, Pennsylvania State University, University Park, PA, USA
Wuwei Tan Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, USA
Yang Shen Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, USA
Yuanfei Sun Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, USA
Minyi Shi Department of Genetics, Stanford University, Stanford, CA, USA
Jessika Adrian Department of Genetics, Stanford University, Stanford, CA, USA
Richard S Sandstrom Altius Institute, Seattle, WA, USA
Nina P Farrell Epigenomics Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Jessica M Halow Altius Institute, Seattle, WA, USA
Kristen Lee Altius Institute, Seattle, WA, USA
Lixia Jiang Department of Genetics, Stanford University, Stanford, CA, USA
Xinqiong Yang Department of Genetics, Stanford University, Stanford, CA, USA
Charles B Epstein Epigenomics Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
J Seth Strattan Department of Genetics, Stanford University, Stanford, CA, USA
Bradley E Bernstein Epigenomics Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Michael P Snyder Department of Genetics, Stanford University, Stanford, CA, USA
Manolis Kellis Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA
William S Noble Department of Genome Sciences, University of Washington, Seattle, WA, USA
Anshul Bharat Kundaje Department of Genetics, Stanford University, Stanford, CA, USA Department of Computer Science, Stanford University, Stanford, CA, USA

Collapse

Llera A, Brammer M, Oakley B, Tillmann J, Zabihi M, Amelink JS, Mei T, Charman T, Ecker C, Dell'Acqua F, Banaschewski T, Moessnang C, Baron-Cohen S, Holt R, Durston S, Murphy D, Loth E, Buitelaar JK, Floris DL, Beckmann CF. Evaluation of data imputation strategies in complex, deeply-phenotyped data sets: the case of the EU-AIMS Longitudinal European Autism Project. BMC Med Res Methodol 2022;22:229. [PMID: 35971088 PMCID: PMC9380301 DOI: 10.1186/s12874-022-01656-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Accepted: 06/02/2022] [Indexed: 12/19/2022] Open

Abstract

An increasing number of large-scale multi-modal research initiatives has been conducted in the typically developing population, e.g. Dev. Cogn. Neur. 32:43-54, 2018; PLoS Med. 12(3):e1001779, 2015; Elam and Van Essen, Enc. Comp. Neur., 2013, as well as in psychiatric cohorts, e.g. Trans. Psych. 10(1):100, 2020; Mol. Psych. 19:659–667, 2014; Mol. Aut. 8:24, 2017; Eur. Child and Adol. Psych. 24(3):265–281, 2015. Missing data is a common problem in such datasets due to the difficulty of assessing multiple measures on a large number of participants. The consequences of missing data accumulate when researchers aim to integrate relationships across multiple measures. Here we aim to evaluate different imputation strategies to fill in missing values in clinical data from a large (total N = 764) and deeply phenotyped (i.e. range of clinical and cognitive instruments administered) sample of N = 453 autistic individuals and N = 311 control individuals recruited as part of the EU-AIMS Longitudinal European Autism Project (LEAP) consortium. In particular, we consider a total of 160 clinical measures divided in 15 overlapping subsets of participants. We use two simple but common univariate strategies—mean and median imputation—as well as a Round Robin regression approach involving four independent multivariate regression models including Bayesian Ridge regression, as well as several non-linear models: Decision Trees (Extra Trees., and Nearest Neighbours regression. We evaluate the models using the traditional mean square error towards removed available data, and also consider the Kullback–Leibler divergence between the observed and the imputed distributions. We show that all of the multivariate approaches tested provide a substantial improvement compared to typical univariate approaches. Further, our analyses reveal that across all 15 data-subsets tested, an Extra Trees regression approach provided the best global results. This not only allows the selection of a unique model to impute missing data for the LEAP project and delivers a fixed set of imputed clinical data to be used by researchers working with the LEAP dataset in the future, but provides more general guidelines for data imputation in large scale epidemiological studies.

Collapse

Affiliation(s)

A Llera Donders Institute for Brain, Cognition and Behaviour, Centre for Cognitive Neuroimaging, Nijmegen, The Netherlands. .,Department of Cognitive Neuroscience, Radboud University Medical Centre, Nijmegen, The Netherlands. .,LIS Data Solutions, Machine Learning Group, Santander, Spain.
M Brammer Institute of Psychiatry, Psychology, and Neuroscience, Sackler Institute for Translational Neurodevelopment, King's College London, London, UK
B Oakley Department of Forensic and Neurodevelopmental Sciences, Institute of Psychiatry, Psychology, and Neuroscience, King's College London, London, UK
J Tillmann Roche Pharma Research and Early Development, Neuroscience and Rare Diseases, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Basel, Switzerland
M Zabihi Donders Institute for Brain, Cognition and Behaviour, Centre for Cognitive Neuroimaging, Nijmegen, The Netherlands.,Department of Cognitive Neuroscience, Radboud University Medical Centre, Nijmegen, The Netherlands
J S Amelink Donders Institute for Brain, Cognition and Behaviour, Centre for Cognitive Neuroimaging, Nijmegen, The Netherlands.,Max Planck Institute for Psycholinguistics, Language & Genetics Department, Nijmegen, The Netherlands
T Mei Donders Institute for Brain, Cognition and Behaviour, Centre for Cognitive Neuroimaging, Nijmegen, The Netherlands.,Department of Cognitive Neuroscience, Radboud University Medical Centre, Nijmegen, The Netherlands
T Charman Department of Psychology, Institute of Psychiatry, Psychology, and Neuroscience, King's College London, London, UK
C Ecker Institute of Psychiatry, Psychology, and Neuroscience, Sackler Institute for Translational Neurodevelopment, King's College London, London, UK.,Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, University Hospital Frankfurt Am Main, Goethe University, Frankfurt, Germany
F Dell'Acqua Institute of Psychiatry, Psychology, and Neuroscience, Sackler Institute for Translational Neurodevelopment, King's College London, London, UK
T Banaschewski Child and Adolescent Psychiatry, Central Institute of Mental Health, University of Heidelberg, Mannheim, Germany
C Moessnang Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, University Hospital Frankfurt Am Main, Goethe University, Frankfurt, Germany.,Department of Applied Psychology, SRH University, Heidelberg, Germany
S Baron-Cohen Autism Research Centre, Department of Psychiatry, University of Cambridge, Cambridge, UK
R Holt Autism Research Centre, Department of Psychiatry, University of Cambridge, Cambridge, UK
S Durston Department of Psychiatry, Brain Center Rudolf Magnus, University Medical Center Utrecht, Utrecht, The Netherlands
D Murphy Institute of Psychiatry, Psychology, and Neuroscience, Sackler Institute for Translational Neurodevelopment, King's College London, London, UK.,Department of Forensic and Neurodevelopmental Sciences, Institute of Psychiatry, Psychology, and Neuroscience, King's College London, London, UK
E Loth Institute of Psychiatry, Psychology, and Neuroscience, Sackler Institute for Translational Neurodevelopment, King's College London, London, UK.,Department of Forensic and Neurodevelopmental Sciences, Institute of Psychiatry, Psychology, and Neuroscience, King's College London, London, UK
J K Buitelaar Donders Institute for Brain, Cognition and Behaviour, Centre for Cognitive Neuroimaging, Nijmegen, The Netherlands.,Department of Cognitive Neuroscience, Radboud University Medical Centre, Nijmegen, The Netherlands.,Karakter Child and Adolescent Psychiatry University Centre, Nijmegen, The Netherlands
D L Floris Donders Institute for Brain, Cognition and Behaviour, Centre for Cognitive Neuroimaging, Nijmegen, The Netherlands.,Department of Cognitive Neuroscience, Radboud University Medical Centre, Nijmegen, The Netherlands.,Methods of Plasticity Research, Department of Psychology, University of Zurich, Zurich, Switzerland
C F Beckmann Donders Institute for Brain, Cognition and Behaviour, Centre for Cognitive Neuroimaging, Nijmegen, The Netherlands.,Department of Cognitive Neuroscience, Radboud University Medical Centre, Nijmegen, The Netherlands.,Wellcome Centre for Integrative Neuroimaging - Centre for Functional MRI of the Brain (WIN FMRIB), University of Oxford, Oxford, UK

Collapse

Dsouza KB, Li AY, Bhargava VK, Libbrecht MW. Latent Representation of the Human Pan-Celltype Epigenome Through a Deep Recurrent Neural Network. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022;19:2313-2323. [PMID: 34043510 DOI: 10.1109/tcbb.2021.3084147] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Albrecht S, Andreani T, Andrade-Navarro MA, Fontaine JF. Single-cell specific and interpretable machine learning models for sparse scChIP-seq data imputation. PLoS One 2022;17:e0270043. [PMID: 35776722 PMCID: PMC9249201 DOI: 10.1371/journal.pone.0270043] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 06/02/2022] [Indexed: 11/19/2022] Open

Abstract

MOTIVATION

Single-cell Chromatin ImmunoPrecipitation DNA-Sequencing (scChIP-seq) analysis is challenging due to data sparsity. High degree of sparsity in biological high-throughput single-cell data is generally handled with imputation methods that complete the data, but specific methods for scChIP-seq are lacking. We present SIMPA, a scChIP-seq data imputation method leveraging predictive information within bulk data from the ENCODE project to impute missing protein-DNA interacting regions of target histone marks or transcription factors.

RESULTS

Imputations using machine learning models trained for each single cell, each ChIP protein target, and each genomic region accurately preserve cell type clustering and improve pathway-related gene identification on real human data. Results on bulk data simulating single cells show that the imputations are single-cell specific as the imputed profiles are closer to the simulated cell than to other cells related to the same ChIP protein target and the same cell type. Simulations also show that 100 input genomic regions are already enough to train single-cell specific models for the imputation of thousands of undetected regions. Furthermore, SIMPA enables the interpretation of machine learning models by revealing interaction sites of a given single cell that are most important for the imputation model trained for a specific genomic region. The corresponding feature importance values derived from promoter-interaction profiles of H3K4me3, an activating histone mark, highly correlate with co-expression of genes that are present within the cell-type specific pathways in 2 real human and mouse datasets. The SIMPA's interpretable imputation method allows users to gain a deep understanding of individual cells and, consequently, of sparse scChIP-seq datasets.

AVAILABILITY AND IMPLEMENTATION

Our interpretable imputation algorithm was implemented in Python and is available at https://github.com/salbrec/SIMPA.

Collapse

Luo K, Zhong J, Safi A, Hong LK, Tewari AK, Song L, Reddy TE, Ma L, Crawford GE, Hartemink AJ. Profiling the quantitative occupancy of myriad transcription factors across conditions by modeling chromatin accessibility data. Genome Res 2022;32:1183-1198. [PMID: 35609992 PMCID: PMC9248881 DOI: 10.1101/gr.272203.120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 05/06/2022] [Indexed: 11/24/2022]

Affiliation(s)

Kaixuan Luo Computational Biology & Bioinformatics Graduate Program, Duke University, Durham, North Carolina 27708, USA Center for Genomic and Computational Biology, Duke University, Durham, North Carolina 27708, USA Department of Computer Science, Duke University, Durham, North Carolina 27708, USA Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
Jianling Zhong Computational Biology & Bioinformatics Graduate Program, Duke University, Durham, North Carolina 27708, USA Center for Genomic and Computational Biology, Duke University, Durham, North Carolina 27708, USA Department of Computer Science, Duke University, Durham, North Carolina 27708, USA
Alexias Safi Center for Genomic and Computational Biology, Duke University, Durham, North Carolina 27708, USA Department of Pediatrics, Duke University Medical Center, Durham, North Carolina 27710, USA
Linda K Hong Center for Genomic and Computational Biology, Duke University, Durham, North Carolina 27708, USA Department of Pediatrics, Duke University Medical Center, Durham, North Carolina 27710, USA
Alok K Tewari Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
Lingyun Song Center for Genomic and Computational Biology, Duke University, Durham, North Carolina 27708, USA Department of Pediatrics, Duke University Medical Center, Durham, North Carolina 27710, USA
Timothy E Reddy Computational Biology & Bioinformatics Graduate Program, Duke University, Durham, North Carolina 27708, USA Center for Genomic and Computational Biology, Duke University, Durham, North Carolina 27708, USA Department of Biostatistics and Bioinformatics, Durham, North Carolina 27710, USA Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina 27710, USA Department of Biomedical Engineering, Duke University, Durham, North Carolina 27708, USA
Li Ma Computational Biology & Bioinformatics Graduate Program, Duke University, Durham, North Carolina 27708, USA Department of Statistical Science, Duke University, Durham, North Carolina 27708, USA
Gregory E Crawford Computational Biology & Bioinformatics Graduate Program, Duke University, Durham, North Carolina 27708, USA Center for Genomic and Computational Biology, Duke University, Durham, North Carolina 27708, USA Department of Pediatrics, Duke University Medical Center, Durham, North Carolina 27710, USA
Alexander J Hartemink Computational Biology & Bioinformatics Graduate Program, Duke University, Durham, North Carolina 27708, USA Center for Genomic and Computational Biology, Duke University, Durham, North Carolina 27708, USA Department of Computer Science, Duke University, Durham, North Carolina 27708, USA Department of Biology, Duke University, Durham, North Carolina 27708, USA

Collapse

Hesami M, Alizadeh M, Jones AMP, Torkamaneh D. Machine learning: its challenges and opportunities in plant system biology. Appl Microbiol Biotechnol 2022;106:3507-3530. [PMID: 35575915 DOI: 10.1007/s00253-022-11963-6] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 03/14/2022] [Accepted: 05/07/2022] [Indexed: 12/25/2022]

Daneshpajouh H, Chen B, Shokraneh N, Masoumi S, Wiese KC, Libbrecht MW. Continuous chromatin state feature annotation of the human epigenome. Bioinformatics 2022;38:3029-3036. [PMID: 35451453 PMCID: PMC9154241 DOI: 10.1093/bioinformatics/btac283] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 02/18/2022] [Accepted: 04/18/2022] [Indexed: 12/02/2022] Open

Abstract

Motivation

Segmentation and genome annotation (SAGA) algorithms are widely used to understand genome activity and gene regulation. These methods take as input a set of sequencing-based assays of epigenomic activity, such as ChIP-seq measurements of histone modification and transcription factor binding. They output an annotation of the genome that assigns a chromatin state label to each genomic position. Existing SAGA methods have several limitations caused by the discrete annotation framework: such annotations cannot easily represent varying strengths of genomic elements, and they cannot easily represent combinatorial elements that simultaneously exhibit multiple types of activity. To remedy these limitations, we propose an annotation strategy that instead outputs a vector of chromatin state features at each position rather than a single discrete label. Continuous modeling is common in other fields, such as in topic modeling of text documents. We propose a method, epigenome-ssm-nonneg, that uses a non-negative state space model to efficiently annotate the genome with chromatin state features. We also propose several measures of the quality of a chromatin state feature annotation and we compare the performance of several alternative methods according to these quality measures.

Results

We show that chromatin state features from epigenome-ssm-nonneg are more useful for several downstream applications than both continuous and discrete alternatives, including their ability to identify expressed genes and enhancers. Therefore, we expect that these continuous chromatin state features will be valuable reference annotations to be used in visualization and downstream analysis.

Availability and implementation

Source code for epigenome-ssm is available at https://github.com/habibdanesh/epigenome-ssm and Zenodo (DOI: 10.5281/zenodo.6507585).

Supplementary information

Supplementary data are available at Bioinformatics online.

Collapse

Li H, Guan Y. Asymmetric predictive relationships across histone modifications. NAT MACH INTELL 2022;4:288-299. [DOI: 10.1038/s42256-022-00455-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Wang Z, Chivu AG, Choate LA, Rice EJ, Miller DC, Chu T, Chou SP, Kingsley NB, Petersen JL, Finno CJ, Bellone RR, Antczak DF, Lis JT, Danko CG. Prediction of histone post-translational modification patterns based on nascent transcription data. Nat Genet 2022;54:295-305. [PMID: 35273399 PMCID: PMC9444190 DOI: 10.1038/s41588-022-01026-x] [Citation(s) in RCA: 64] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Accepted: 01/24/2022] [Indexed: 01/01/2023]

Affiliation(s)

Zhong Wang Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA School of Software Technology, Dalian University of Technology, Dalian, China
Alexandra G Chivu Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
Lauren A Choate Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
Edward J Rice Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
Donald C Miller Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
Tinyi Chu Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
Shao-Pei Chou Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
Nicole B Kingsley Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, Davis, CA, USA
Jessica L Petersen Department of Animal Science, University of Nebraska-Lincoln, Lincoln, NE, USA
Carrie J Finno Department of Population Health and Reproduction, University of California, Davis, Davis, CA, USA
Rebecca R Bellone Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, Davis, CA, USA
Douglas F Antczak Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
John T Lis Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
Charles G Danko Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA. Department of Biomedical Sciences, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA.

Collapse

Ni Z, Zheng X, Zheng X, Zou X. scLRTD : A Novel Low Rank Tensor Decomposition Method for Imputing Missing Values in Single-Cell Multi-Omics Sequencing Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022;19:1144-1153. [PMID: 32960767 DOI: 10.1109/tcbb.2020.3025804] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Arslan E, Schulz J, Rai K. Machine Learning in Epigenomics: Insights into Cancer Biology and Medicine. Biochim Biophys Acta Rev Cancer 2021;1876:188588. [PMID: 34245839 PMCID: PMC8595561 DOI: 10.1016/j.bbcan.2021.188588] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Revised: 05/29/2021] [Accepted: 07/02/2021] [Indexed: 02/01/2023]

Morrow A, Hughes J, Singh J, Joseph A, Yosef N. Epitome: predicting epigenetic events in novel cell types with multi-cell deep ensemble learning. Nucleic Acids Res 2021;49:e110. [PMID: 34379786 PMCID: PMC8565335 DOI: 10.1093/nar/gkab676] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 07/19/2021] [Accepted: 07/25/2021] [Indexed: 01/04/2023] Open

Libbrecht MW, Chan RCW, Hoffman MM. Segmentation and genome annotation algorithms for identifying chromatin state and other genomic patterns. PLoS Comput Biol 2021;17:e1009423. [PMID: 34648491 PMCID: PMC8516206 DOI: 10.1371/journal.pcbi.1009423] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open

Zhou W, Hongkai J. Genome-wide Prediction of Chromatin Accessibility Based on Gene Expression. WILEY INTERDISCIPLINARY REVIEWS. COMPUTATIONAL STATISTICS 2021;13:e1544. [PMID: 39391743 PMCID: PMC11466374 DOI: 10.1002/wics.1544] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Accepted: 11/28/2020] [Indexed: 10/12/2024]

Nieboer MM, Nguyen L, de Ridder J. Predicting pathogenic non-coding SVs disrupting the 3D genome in 1646 whole cancer genomes using multiple instance learning. Sci Rep 2021;11:14411. [PMID: 34257393 PMCID: PMC8277903 DOI: 10.1038/s41598-021-93917-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Accepted: 07/01/2021] [Indexed: 11/21/2022] Open

Bayat F, Libbrecht M. VSS: Variance-stabilized signals for sequencing-based genomic signals. Bioinformatics 2021;37:4383-4391. [PMID: 34165492 PMCID: PMC8652025 DOI: 10.1093/bioinformatics/btab457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 04/28/2021] [Accepted: 06/17/2021] [Indexed: 11/12/2022] Open

Schreiber J, Singh R. Machine learning for profile prediction in genomics. Curr Opin Chem Biol 2021;65:35-41. [PMID: 34107341 DOI: 10.1016/j.cbpa.2021.04.008] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 04/21/2021] [Accepted: 04/24/2021] [Indexed: 02/08/2023]

Schreiber J, Bilmes J, Noble WS. Prioritizing transcriptomic and epigenomic experiments using an optimization strategy that leverages imputed data. Bioinformatics 2021;37:439-447. [PMID: 32966546 PMCID: PMC8088321 DOI: 10.1093/bioinformatics/btaa830] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2019] [Revised: 07/28/2020] [Accepted: 09/09/2020] [Indexed: 12/03/2022] Open

Abstract

Motivation

Successful science often involves not only performing experiments well, but also choosing well among many possible experiments. In a hypothesis generation setting, choosing an experiment well means choosing an experiment whose results are interesting or novel. In this work, we formalize this selection procedure in the context of genomics and epigenomics data generation. Specifically, we consider the task faced by a scientific consortium such as the National Institutes of Health ENCODE Consortium, whose goal is to characterize all of the functional elements in the human genome. Given a list of possible cell types or tissue types (‘biosamples’) and a list of possible high-throughput sequencing assays, where at least one experiment has been performed in each biosample and for each assay, we ask ‘Which experiments should ENCODE perform next?’

Results

We demonstrate how to represent this task as a submodular optimization problem, where the goal is to choose a panel of experiments that maximize the facility location function. A key aspect of our approach is that we use imputed data, rather than experimental data, to directly answer the posed question. We find that, across several evaluations, our method chooses a panel of experiments that span a diversity of biochemical activity. Finally, we propose two modifications of the facility location function, including a novel submodular–supermodular function, that allow incorporation of domain knowledge or constraints into the optimization procedure.

Availability and implementation

Our method is available as a Python package at https://github.com/jmschrei/kiwano and can be installed using the command pip install kiwano. The source code used here and the similarity matrix can be found at http://doi.org/10.5281/zenodo.3708538.

Supplementary information

Supplementary data are available at Bioinformatics online.

Collapse

Nakato R, Sakata T. Methods for ChIP-seq analysis: A practical workflow and advanced applications. Methods 2021;187:44-53. [PMID: 32240773 DOI: 10.1016/j.ymeth.2020.03.005] [Citation(s) in RCA: 120] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Revised: 03/17/2020] [Accepted: 03/18/2020] [Indexed: 12/13/2022] Open

Pei G, Hu R, Dai Y, Manuel AM, Zhao Z, Jia P. Predicting regulatory variants using a dense epigenomic mapped CNN model elucidated the molecular basis of trait-tissue associations. Nucleic Acids Res 2021;49:53-66. [PMID: 33300042 PMCID: PMC7797043 DOI: 10.1093/nar/gkaa1137] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 10/22/2020] [Accepted: 12/08/2020] [Indexed: 02/06/2023] Open

Pei G, Wang YY, Simon LM, Dai Y, Zhao Z, Jia P. Gene expression imputation and cell-type deconvolution in human brain with spatiotemporal precision and its implications for brain-related disorders. Genome Res 2020;31:146-158. [PMID: 33272935 PMCID: PMC7849392 DOI: 10.1101/gr.265769.120] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Accepted: 11/25/2020] [Indexed: 12/30/2022]

Schreiber J, Singh R, Bilmes J, Noble WS. A pitfall for machine learning methods aiming to predict across cell types. Genome Biol 2020;21:282. [PMID: 33213499 PMCID: PMC7678316 DOI: 10.1186/s13059-020-02177-y] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2020] [Accepted: 10/07/2020] [Indexed: 01/19/2023] Open

Song M, Greenbaum J, Luttrell J, Zhou W, Wu C, Shen H, Gong P, Zhang C, Deng HW. A Review of Integrative Imputation for Multi-Omics Datasets. Front Genet 2020;11:570255. [PMID: 33193667 PMCID: PMC7594632 DOI: 10.3389/fgene.2020.570255] [Citation(s) in RCA: 60] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2020] [Accepted: 09/16/2020] [Indexed: 01/05/2023] Open

Sahu A, Li N, Dunkel I, Chung HR. EPIGENE: genome-wide transcription unit annotation using a multivariate probabilistic model of histone modifications. Epigenetics Chromatin 2020;13:20. [PMID: 32264931 PMCID: PMC7137282 DOI: 10.1186/s13072-020-00341-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2019] [Accepted: 03/28/2020] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Understanding the transcriptome is critical for explaining the functional as well as regulatory roles of genomic regions. Current methods for the identification of transcription units (TUs) use RNA-seq that, however, require large quantities of mRNA rendering the identification of inherently unstable TUs, e.g. miRNA precursors, difficult. This problem can be alleviated by chromatin-based approaches due to a correlation between histone modifications and transcription.

RESULTS

Here, we introduce EPIGENE, a novel chromatin segmentation method for the identification of active TUs using transcription-associated histone modifications. Unlike the existing chromatin segmentation approaches, EPIGENE uses a constrained, semi-supervised multivariate hidden Markov model (HMM) that models the observed combination of histone modifications using a product of independent Bernoulli random variables, to identify active TUs. Our results show that EPIGENE can identify genome-wide TUs in an unbiased manner. EPIGENE-predicted TUs show an enrichment of RNA Polymerase II at the transcription start site and in gene body indicating that they are indeed transcribed. Comprehensive validation using existing annotations revealed that 93% of EPIGENE TUs can be explained by existing gene annotations and 5% of EPIGENE TUs in HepG2 can be explained by microRNA annotations. EPIGENE outperformed the existing RNA-seq-based approaches in TU prediction precision across human cell lines. Finally, we identified 232 novel TUs in K562 and 43 novel cell-specific TUs all of which were supported by RNA Polymerase II ChIP-seq and Nascent RNA-seq data.

CONCLUSION

We demonstrate the applicability of EPIGENE to identify genome-wide active TUs and to provide valuable information about unannotated TUs. EPIGENE is an open-source method and is freely available at: https://github.com/imbbLab/EPIGENE.

Collapse

Schreiber J, Bilmes J, Noble WS. Completing the ENCODE3 compendium yields accurate imputations across a variety of assays and human biosamples. Genome Biol 2020;21:82. [PMID: 32228713 PMCID: PMC7104481 DOI: 10.1186/s13059-020-01978-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Accepted: 02/26/2020] [Indexed: 12/16/2022] Open

Schreiber J, Durham T, Bilmes J, Noble WS. Avocado: a multi-scale deep tensor factorization method learns a latent representation of the human epigenome. Genome Biol 2020;21:81. [PMID: 32228704 PMCID: PMC7104480 DOI: 10.1186/s13059-020-01977-6] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2019] [Accepted: 02/26/2020] [Indexed: 02/08/2023] Open

Zhang S, Chasman D, Knaack S, Roy S. In silico prediction of high-resolution Hi-C interaction matrices. Nat Commun 2019;10:5449. [PMID: 31811132 PMCID: PMC6898380 DOI: 10.1038/s41467-019-13423-8] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Accepted: 11/07/2019] [Indexed: 11/28/2022] Open

Zhang Y, Mahony S. Direct prediction of regulatory elements from partial data without imputation. PLoS Comput Biol 2019;15:e1007399. [PMID: 31682602 PMCID: PMC6855516 DOI: 10.1371/journal.pcbi.1007399] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Revised: 11/14/2019] [Accepted: 09/12/2019] [Indexed: 01/07/2023] Open

Abstract

Genome segmentation approaches allow us to characterize regulatory states in a given cell type using combinatorial patterns of histone modifications and other regulatory signals. In order to analyze regulatory state differences across cell types, current genome segmentation approaches typically require that the same regulatory genomics assays have been performed in all analyzed cell types. This necessarily limits both the numbers of cell types that can be analyzed and the complexity of the resulting regulatory states, as only a small number of histone modifications have been profiled across many cell types. Data imputation approaches that aim to estimate missing regulatory signals have been applied before genome segmentation. However, this approach is computationally costly and propagates any errors in imputation to produce incorrect genome segmentation results downstream. We present an extension to the IDEAS genome segmentation platform which can perform genome segmentation on incomplete regulatory genomics dataset collections without using imputation. Instead of relying on imputed data, we use an expectation-maximization approach to estimate marginal density functions within each regulatory state. We demonstrate that our genome segmentation results compare favorably with approaches based on imputation or other strategies for handling missing data. We further show that our approach can accurately impute missing data after genome segmentation, reversing the typical order of imputation/genome segmentation pipelines. Finally, we present a new 2D genome segmentation analysis of 127 human cell types studied by the Roadmap Epigenomics Consortium. By using an expanded set of chromatin marks that have been profiled in subsets of these cell types, our new segmentation results capture a more complex picture of combinatorial regulatory patterns that appear on the human genome.

Collapse

Zitnik M, Nguyen F, Wang B, Leskovec J, Goldenberg A, Hoffman MM. Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities. AN INTERNATIONAL JOURNAL ON INFORMATION FUSION 2019;50:71-91. [PMID: 30467459 PMCID: PMC6242341 DOI: 10.1016/j.inffus.2018.09.012] [Citation(s) in RCA: 262] [Impact Index Per Article: 43.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]

Nair S, Kim DS, Perricone J, Kundaje A. Integrating regulatory DNA sequence and gene expression to predict genome-wide chromatin accessibility across cellular contexts. Bioinformatics 2019;35:i108-i116. [PMID: 31510655 PMCID: PMC6612838 DOI: 10.1093/bioinformatics/btz352] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open

Keilwagen J, Posch S, Grau J. Accurate prediction of cell type-specific transcription factor binding. Genome Biol 2019;20:9. [PMID: 30630522 PMCID: PMC6327544 DOI: 10.1186/s13059-018-1614-y] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2018] [Accepted: 12/18/2018] [Indexed: 01/11/2023] Open

Stein-O'Brien GL, Arora R, Culhane AC, Favorov AV, Garmire LX, Greene CS, Goff LA, Li Y, Ngom A, Ochs MF, Xu Y, Fertig EJ. Enter the Matrix: Factorization Uncovers Knowledge from Omics. Trends Genet 2018;34:790-805. [PMID: 30143323 PMCID: PMC6309559 DOI: 10.1016/j.tig.2018.07.003] [Citation(s) in RCA: 132] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2018] [Revised: 06/01/2018] [Accepted: 07/16/2018] [Indexed: 12/20/2022]

Affiliation(s)

Genevieve L Stein-O'Brien Department of Oncology, Division of Biostatistics and Bioinformatics, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD, USA; Department of Neuroscience, Johns Hopkins School of Medicine, Baltimore, MD, USA; McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
Raman Arora Department of Computer Science, Institute for Data Intensive Engineering and Science, Johns Hopkins University, Baltimore, MD, USA
Aedin C Culhane Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA; Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, MA, USA
Alexander V Favorov Department of Oncology, Division of Biostatistics and Bioinformatics, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD, USA; Vavilov Institute of General Genetics, Moscow, Russia
Lana X Garmire University of Hawaii Cancer Center, Honolulu, HI, USA
Casey S Greene Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, PA, USA; Childhood Cancer Data Lab, Alex's Lemonade Stand Foundation, PA, USA
Loyal A Goff Department of Neuroscience, Johns Hopkins School of Medicine, Baltimore, MD, USA; McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
Yifeng Li Digital Technologies Research Centre, National Research Council of Canada, Ottawa, ON, Canada
Aloune Ngom School of Computer Science, University of Windsor, Windsor, ON, Canada
Michael F Ochs Department of Mathematics and Statistics, The College of New Jersey, Ewing, NJ, USA
Yanxun Xu Department of Applied Mathematics and Statistics, Whiting School of Engineering, Johns Hopkins University, Baltimore, MD, USA
Elana J Fertig Department of Oncology, Division of Biostatistics and Bioinformatics, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD, USA.

Collapse