Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Trieu T, Martinez-Fundichely A, Khurana E. DeepMILO: a deep learning approach to predict the impact of non-coding sequence variants on 3D chromatin structure. Genome Biol 2020;21:79. [PMID: 32216817 PMCID: PMC7098089 DOI: 10.1186/s13059-020-01987-4] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Accepted: 03/06/2020] [Indexed: 12/17/2022] Open

For:	Trieu T, Martinez-Fundichely A, Khurana E. DeepMILO: a deep learning approach to predict the impact of non-coding sequence variants on 3D chromatin structure. Genome Biol 2020;21:79. [PMID: 32216817 PMCID: PMC7098089 DOI: 10.1186/s13059-020-01987-4] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Accepted: 03/06/2020] [Indexed: 12/17/2022] Open

Number

Cited by Other Article(s)

Lee J, Mo HL, Ha Y, Nam DY, Lim G, Park JW, Park S, Choi WY, Lee HJ, Rhee JK. Unraveling the three-dimensional genome structure using machine learning. BMB Rep 2025;58:203-208. [PMID: 40058875 PMCID: PMC12123201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 03/07/2024] [Accepted: 09/06/2024] [Indexed: 05/29/2025] Open

Zeng Y, You Z, Guo J, Zhao J, Zhou Y, Huang J, Lyu X, Chen L, Li Q. Chrombus-XMBD: a graph convolution model predicting 3D-genome from chromatin features. Brief Bioinform 2025;26:bbaf183. [PMID: 40315432 PMCID: PMC12047703 DOI: 10.1093/bib/bbaf183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2024] [Revised: 03/11/2025] [Accepted: 03/26/2025] [Indexed: 05/04/2025] Open

Kumar Halder A, Agarwal A, Jodkowska K, Plewczynski D. A systematic analyses of different bioinformatics pipelines for genomic data and its impact on deep learning models for chromatin loop prediction. Brief Funct Genomics 2024;23:538-548. [PMID: 38555493 DOI: 10.1093/bfgp/elae009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 02/07/2024] [Accepted: 03/04/2024] [Indexed: 04/02/2024] Open

Woo BJ, Moussavi-Baygi R, Karner H, Karimzadeh M, Yousefi H, Lee S, Garcia K, Joshi T, Yin K, Navickas A, Gilbert LA, Wang B, Asgharian H, Feng FY, Goodarzi H. Integrative identification of non-coding regulatory regions driving metastatic prostate cancer. Cell Rep 2024;43:114764. [PMID: 39276353 PMCID: PMC11466230 DOI: 10.1016/j.celrep.2024.114764] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 07/08/2024] [Accepted: 08/29/2024] [Indexed: 09/17/2024] Open

Affiliation(s)

Brian J Woo Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Arc Institute, Palo Alto, CA 94305, USA
Ruhollah Moussavi-Baygi Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA
Heather Karner Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Arc Institute, Palo Alto, CA 94305, USA
Mehran Karimzadeh Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Vector Institute, Toronto, ON, Canada; Peter Munk Cardiac Centre, University Health Network, Toronto, ON, Canada; Arc Institute, Palo Alto, CA 94305, USA
Hassan Yousefi Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Arc Institute, Palo Alto, CA 94305, USA
Sean Lee Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Arc Institute, Palo Alto, CA 94305, USA
Kristle Garcia Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA
Tanvi Joshi Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA
Keyi Yin Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA
Albertas Navickas Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA
Luke A Gilbert Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Arc Institute, Palo Alto, CA 94305, USA
Bo Wang Vector Institute, Toronto, ON, Canada; Peter Munk Cardiac Centre, University Health Network, Toronto, ON, Canada
Hosseinali Asgharian Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA, USA.
Felix Y Feng Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Department of Radiation Oncology, University of California, San Francisco, San Francisco, CA, USA.
Hani Goodarzi Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Arc Institute, Palo Alto, CA 94305, USA; Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA, USA.

Collapse

Schlegel L, Bhardwaj R, Shahryary Y, Demirtürk D, Marand A, Schmitz R, Johannes F. GenomicLinks: deep learning predictions of 3D chromatin interactions in the maize genome. NAR Genom Bioinform 2024;6:lqae123. [PMID: 39318505 PMCID: PMC11420838 DOI: 10.1093/nargab/lqae123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2024] [Revised: 07/25/2024] [Accepted: 08/30/2024] [Indexed: 09/26/2024] Open

Sokolova K, Chen KM, Hao Y, Zhou J, Troyanskaya OG. Deep Learning Sequence Models for Transcriptional Regulation. Annu Rev Genomics Hum Genet 2024;25:105-122. [PMID: 38594933 DOI: 10.1146/annurev-genom-021623-024727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/11/2024]

Alagarswamy K, Shi W, Boini A, Messaoudi N, Grasso V, Cattabiani T, Turner B, Croner R, Kahlert UD, Gumbs A. Should AI-Powered Whole-Genome Sequencing Be Used Routinely for Personalized Decision Support in Surgical Oncology—A Scoping Review. BIOMEDINFORMATICS 2024;4:1757-1772. [DOI: 10.3390/biomedinformatics4030096] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2025]

Shen J, Wang Y, Luo J. CD-Loop: a chromatin loop detection method based on the diffusion model. Front Genet 2024;15:1393406. [PMID: 38770419 PMCID: PMC11102972 DOI: 10.3389/fgene.2024.1393406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 04/11/2024] [Indexed: 05/22/2024] Open

Wall BPG, Nguyen M, Harrell JC, Dozmorov MG. Machine and deep learning methods for predicting 3D genome organization. ARXIV 2024:arXiv:2403.03231v1. [PMID: 38495565 PMCID: PMC10942493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]

Wang Y, Guo X, Niu Z, Huang X, Wang B, Gao L. DeepCBS: shedding light on the impact of mutations occurring at CTCF binding sites. Front Genet 2024;15:1354208. [PMID: 38463168 PMCID: PMC10920299 DOI: 10.3389/fgene.2024.1354208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Accepted: 01/30/2024] [Indexed: 03/12/2024] Open

Abstract

CTCF-mediated chromatin loops create insulated neighborhoods that constrain promoter-enhancer interactions, serving as a unit of gene regulation. Disruption of the CTCF binding sites (CBS) will lead to the destruction of insulated neighborhoods, which in turn can cause dysregulation of the contained genes. In a recent study, it is found that CTCF/cohesin binding sites are a major mutational hotspot in the cancer genome. Mutations can affect CTCF binding, causing the disruption of insulated neighborhoods. And our analysis reveals a significant enrichment of well-known proto-oncogenes in insulated neighborhoods with mutations specifically occurring in anchor regions. It can be assumed that some mutations disrupt CTCF binding, leading to the disruption of insulated neighborhoods and subsequent activation of proto-oncogenes within these insulated neighborhoods. To explore the consequences of such mutations, we develop DeepCBS, a computational tool capable of analyzing mutations at CTCF binding sites, predicting their influence on insulated neighborhoods, and investigating the potential activation of proto-oncogenes. Futhermore, DeepCBS is applied to somatic mutation data of liver cancer. As a result, 87 mutations that disrupt CTCF binding sites are identified, which leads to the identification of 237 disrupted insulated neighborhoods containing a total of 135 genes. Integrative analysis of gene expression differences in liver cancer further highlights three genes: ARHGEF39, UBE2C and DQX1. Among them, ARHGEF39 and UBE2C have been reported in the literature as potential oncogenes involved in the development of liver cancer. The results indicate that DQX1 may be a potential oncogene in liver cancer and may contribute to tumor immune escape. In conclusion, DeepCBS is a promising method to analyze impacts of mutations occurring at CTCF binding sites on the insulator function of CTCF, with potential extensions to shed light on the effects of mutations on other functions of CTCF.

Collapse

Tan W, Shen Y. Multimodal learning of noncoding variant effects using genome sequence and chromatin structure. Bioinformatics 2023;39:btad541. [PMID: 37669132 PMCID: PMC10502240 DOI: 10.1093/bioinformatics/btad541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 08/28/2023] [Accepted: 09/04/2023] [Indexed: 09/07/2023] Open

Li Z, Portillo-Ledesma S, Schlick T. Techniques for and challenges in reconstructing 3D genome structures from 2D chromosome conformation capture data. Curr Opin Cell Biol 2023;83:102209. [PMID: 37506571 PMCID: PMC10529954 DOI: 10.1016/j.ceb.2023.102209] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 06/07/2023] [Accepted: 06/26/2023] [Indexed: 07/30/2023]

Ma W, Fu Y, Bao Y, Wang Z, Lei B, Zheng W, Wang C, Liu Y. DeepSATA: A Deep Learning-Based Sequence Analyzer Incorporating the Transcription Factor Binding Affinity to Dissect the Effects of Non-Coding Genetic Variants. Int J Mol Sci 2023;24:12023. [PMID: 37569400 PMCID: PMC10418434 DOI: 10.3390/ijms241512023] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 07/13/2023] [Accepted: 07/24/2023] [Indexed: 08/13/2023] Open

Affiliation(s)

Wenlong Ma Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China; (W.M.); (Y.F.); (Y.B.); (Z.W.); (B.L.); (W.Z.); (C.W.) Innovation Group of Pig Genome Design and Breeding, Research Centre for Animal Genome, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
Yang Fu Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China; (W.M.); (Y.F.); (Y.B.); (Z.W.); (B.L.); (W.Z.); (C.W.) Innovation Group of Pig Genome Design and Breeding, Research Centre for Animal Genome, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
Yongzhou Bao Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China; (W.M.); (Y.F.); (Y.B.); (Z.W.); (B.L.); (W.Z.); (C.W.) Innovation Group of Pig Genome Design and Breeding, Research Centre for Animal Genome, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China School of Life Sciences, Henan University, Kaifeng 475004, China
Zhen Wang Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China; (W.M.); (Y.F.); (Y.B.); (Z.W.); (B.L.); (W.Z.); (C.W.) Innovation Group of Pig Genome Design and Breeding, Research Centre for Animal Genome, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China School of Life Sciences, Henan University, Kaifeng 475004, China
Bowen Lei Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China; (W.M.); (Y.F.); (Y.B.); (Z.W.); (B.L.); (W.Z.); (C.W.) Innovation Group of Pig Genome Design and Breeding, Research Centre for Animal Genome, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan 430070, China
Weigang Zheng Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China; (W.M.); (Y.F.); (Y.B.); (Z.W.); (B.L.); (W.Z.); (C.W.) Innovation Group of Pig Genome Design and Breeding, Research Centre for Animal Genome, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan 430070, China
Chao Wang Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China; (W.M.); (Y.F.); (Y.B.); (Z.W.); (B.L.); (W.Z.); (C.W.) Innovation Group of Pig Genome Design and Breeding, Research Centre for Animal Genome, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan 430070, China
Yuwen Liu Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China; (W.M.); (Y.F.); (Y.B.); (Z.W.); (B.L.); (W.Z.); (C.W.) Innovation Group of Pig Genome Design and Breeding, Research Centre for Animal Genome, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China Kunpeng Institute of Modern Agriculture at Foshan, Chinese Academy of Agricultural Sciences, Foshan 528226, China

Collapse

Yang R, Das A, Gao VR, Karbalayghareh A, Noble WS, Bilmes JA, Leslie CS. Epiphany: predicting Hi-C contact maps from 1D epigenomic signals. Genome Biol 2023;24:134. [PMID: 37280678 PMCID: PMC10242996 DOI: 10.1186/s13059-023-02934-9] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Accepted: 04/06/2023] [Indexed: 06/08/2023] Open

Woo BJ, Moussavi-Baygi R, Karner H, Karimzadeh M, Garcia K, Joshi T, Yin K, Navickas A, Gilbert LA, Wang B, Asgharian H, Feng FY, Goodarzi H. Integrative identification of non-coding regulatory regions driving metastatic prostate cancer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.14.535921. [PMID: 37398273 PMCID: PMC10312451 DOI: 10.1101/2023.04.14.535921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]

Affiliation(s)

Brian J Woo Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA Department of Urology, University of California, San Francisco, San Francisco, California, USA Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
Ruhollah Moussavi-Baygi Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA Department of Urology, University of California, San Francisco, San Francisco, California, USA Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
Heather Karner Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA Department of Urology, University of California, San Francisco, San Francisco, California, USA Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
Mehran Karimzadeh Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA Department of Urology, University of California, San Francisco, San Francisco, California, USA Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA Vector Institute, Toronto, ON, Canada Peter Munk Cardiac Centre, University Health Network, Toronto, ON, Canada Arc Institute, Palo Alto 94305, USA
Kristle Garcia Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA Department of Urology, University of California, San Francisco, San Francisco, California, USA Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
Tanvi Joshi Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA Department of Urology, University of California, San Francisco, San Francisco, California, USA Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
Keyi Yin Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA Department of Urology, University of California, San Francisco, San Francisco, California, USA Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
Albertas Navickas Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA Department of Urology, University of California, San Francisco, San Francisco, California, USA Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
Luke A. Gilbert Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA Department of Urology, University of California, San Francisco, San Francisco, California, USA Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA Arc Institute, Palo Alto 94305, USA
Bo Wang Vector Institute, Toronto, ON, Canada Peter Munk Cardiac Centre, University Health Network, Toronto, ON, Canada
Hosseinali Asgharian Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA Department of Urology, University of California, San Francisco, San Francisco, California, USA Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, US
Felix Y. Feng Department of Urology, University of California, San Francisco, San Francisco, California, USA Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA Department of Radiation Oncology, University of California, San Francisco, San Francisco, California, USA
Hani Goodarzi Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA Department of Urology, University of California, San Francisco, San Francisco, California, USA Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, US

Collapse

Hamamoto R, Takasawa K, Shinkai N, Machino H, Kouno N, Asada K, Komatsu M, Kaneko S. Analysis of super-enhancer using machine learning and its application to medical biology. Brief Bioinform 2023;24:bbad107. [PMID: 36960780 PMCID: PMC10199775 DOI: 10.1093/bib/bbad107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 02/11/2023] [Accepted: 03/01/2023] [Indexed: 03/25/2023] Open

Agarwal A, Chen L. DeepPHiC: predicting promoter-centered chromatin interactions using a novel deep learning approach. Bioinformatics 2023;39:6887158. [PMID: 36495179 PMCID: PMC9825766 DOI: 10.1093/bioinformatics/btac801] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 11/23/2022] [Accepted: 12/09/2022] [Indexed: 12/14/2022] Open

Abstract

MOTIVATION

Promoter-centered chromatin interactions, which include promoter-enhancer (PE) and promoter-promoter (PP) interactions, are important to decipher gene regulation and disease mechanisms. The development of next-generation sequencing technologies such as promoter capture Hi-C (pcHi-C) leads to the discovery of promoter-centered chromatin interactions. However, pcHi-C experiments are expensive and thus may be unavailable for tissues/cell types of interest. In addition, these experiments may be underpowered due to insufficient sequencing depth or various artifacts, which results in a limited finding of interactions. Most existing computational methods for predicting chromatin interactions are based on in situ Hi-C and can detect chromatin interactions across the entire genome. However, they may not be optimal for predicting promoter-centered chromatin interactions.

RESULTS

We develop a supervised multi-modal deep learning model, which utilizes a comprehensive set of features such as genomic sequence, epigenetic signal, anchor distance, evolutionary features and DNA structural features to predict tissue/cell type-specific PE and PP interactions. We further extend the deep learning model in a multi-task learning and a transfer learning framework and demonstrate that the proposed approach outperforms state-of-the-art deep learning methods. Moreover, the proposed approach can achieve comparable prediction performance using predefined biologically relevant tissues/cell types compared to using all tissues/cell types in the pretraining especially for predicting PE interactions. The prediction performance can be further improved by using computationally inferred biologically relevant tissues/cell types in the pretraining, which are defined based on the common genes in the proximity of two anchors in the chromatin interactions.

AVAILABILITY AND IMPLEMENTATION

https://github.com/lichen-lab/DeepPHiC.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Lan AY, Corces MR. Deep learning approaches for noncoding variant prioritization in neurodegenerative diseases. Front Aging Neurosci 2022;14:1027224. [PMID: 36466610 PMCID: PMC9716280 DOI: 10.3389/fnagi.2022.1027224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 10/24/2022] [Indexed: 11/19/2022] Open

DLoopCaller: A deep learning approach for predicting genome-wide chromatin loops by integrating accessible chromatin landscapes. PLoS Comput Biol 2022;18:e1010572. [PMID: 36206320 PMCID: PMC9581407 DOI: 10.1371/journal.pcbi.1010572] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 10/19/2022] [Accepted: 09/14/2022] [Indexed: 11/20/2022] Open

Yang M, Ma J. Machine Learning Methods for Exploring Sequence Determinants of 3D Genome Organization. J Mol Biol 2022;434:167666. [PMID: 35659533 DOI: 10.1016/j.jmb.2022.167666] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Revised: 05/23/2022] [Accepted: 05/27/2022] [Indexed: 01/25/2023]

Alharbi WS, Rashid M. A review of deep learning applications in human genomics using next-generation sequencing data. Hum Genomics 2022;16:26. [PMID: 35879805 PMCID: PMC9317091 DOI: 10.1186/s40246-022-00396-x] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 07/12/2022] [Indexed: 12/02/2022] Open

Piecyk RS, Schlegel L, Johannes F. Predicting 3D chromatin interactions from DNA sequence using Deep Learning. Comput Struct Biotechnol J 2022;20:3439-3448. [PMID: 35832620 PMCID: PMC9271978 DOI: 10.1016/j.csbj.2022.06.047] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 06/21/2022] [Accepted: 06/21/2022] [Indexed: 11/22/2022] Open

Yang D, Chung T, Kim D. DeepLUCIA: predicting tissue-specific chromatin loops using Deep Learning-based Universal Chromatin Interaction Annotator. Bioinformatics 2022;38:3501-3512. [PMID: 35640981 DOI: 10.1093/bioinformatics/btac373] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 04/17/2022] [Accepted: 05/27/2022] [Indexed: 11/13/2022] Open

Avdeyev P, Zhou J. Computational Approaches for Understanding Sequence Variation Effects on the 3D Genome Architecture. Annu Rev Biomed Data Sci 2022;5:183-204. [PMID: 35537461 DOI: 10.1146/annurev-biodatasci-102521-012018] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Sefer E. ProbC: joint modeling of epigenome and transcriptome effects in 3D genome. BMC Genomics 2022;23:287. [PMID: 35397520 PMCID: PMC8994916 DOI: 10.1186/s12864-022-08498-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Accepted: 03/23/2022] [Indexed: 11/30/2022] Open

Abstract

BACKGROUND

Hi-C and its high nucleosome resolution variant Micro-C provide a window into the spatial packing of a genome in 3D within the cell. Even though both techniques do not directly depend on the binding of specific antibodies, previous work has revealed enriched interactions and domain structures around multiple chromatin marks; epigenetic modifications and transcription factor binding sites. However, the joint impact of chromatin marks in Hi-C and Micro-C interactions have not been globally characterized, which limits our understanding of 3D genome characteristics. An emerging question is whether it is possible to deduce 3D genome characteristics and interactions by integrative analysis of multiple chromatin marks and associate interactions to functionality of the interacting loci.

RESULT

We come up with a probabilistic method PROBC to decompose Hi-C and Micro-C interactions by known chromatin marks. PROBC is based on convex likelihood optimization, which can directly take into account both interaction existence and nonexistence. Through PROBC, we discover histone modifications (H3K27ac, H3K9me3, H3K4me3, H3K4me1) and CTCF as particularly predictive of Hi-C and Micro-C contacts across cell types and species. Moreover, histone modifications are more effective than transcription factor binding sites in explaining the genome's 3D shape through these interactions. PROBC can successfully predict Hi-C and Micro-C interactions in given species, while it is trained on different cell types or species. For instance, it can predict missing nucleosome resolution Micro-C interactions in human ES cells trained on mouse ES cells only from these 5 chromatin marks with above 0.75 AUC. Additionally, PROBC outperforms the existing methods in predicting interactions across almost all chromosomes.

CONCLUSION

Via our proposed method, we optimally decompose Hi-C interactions in terms of these chromatin marks at genome and chromosome levels. We find a subset of histone modifications and transcription factor binding sites to be predictive of both Hi-C and Micro-C interactions and TADs across human, mouse, and different cell types. Through learned models, we can predict interactions on species just from chromatin marks for which Hi-C data may be limited.

Collapse

InsuLock: A Weakly Supervised Learning Approach for Accurate Insulator Prediction, and Variant Impact Quantification. Genes (Basel) 2022;13:genes13040621. [PMID: 35456427 PMCID: PMC9026820 DOI: 10.3390/genes13040621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Revised: 03/24/2022] [Accepted: 03/25/2022] [Indexed: 02/01/2023] Open

Shen Y, Zhong Q, Liu T, Wen Z, Shen W, Li L. CharID: a two-step model for universal prediction of interactions between chromatin accessible regions. Brief Bioinform 2022;23:6514800. [PMID: 35077535 DOI: 10.1093/bib/bbab602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Revised: 12/23/2021] [Accepted: 12/24/2021] [Indexed: 11/14/2022] Open

Abstract

Open chromatin regions (OCRs) allow direct interaction between cis-regulatory elements and trans-acting factors. Therefore, predicting all potential OCR-mediated loops is essential for deciphering the regulation mechanism of gene expression. However, existing loop prediction tools are restricted to specific anchor types. Here, we present CharID (Chromatin Accessible Region Interaction Detector), a two-step model that combines neural network and ensemble learning to predict OCR-mediated loops. In the first step, CharID-Anchor, an attention-based hybrid CNN-BiGRU network is constructed to discriminate between the anchor and nonanchor OCRs. In the second step, CharID-Loop uses gradient boosting decision tree with chromosome-split strategy to predict the interactions between anchor OCRs. The performance was assessed in three human cell lines, and CharID showed superior prediction performance compared with other algorithms. In contrast to the methods designed to predict a particular type of loops, CharID can detect varieties of chromatin loops not limited to enhancer-promoter loops or architectural protein-mediated loops. We constructed the OCR-mediated interaction network using the predicted loops and identified hub anchors, which are highlighted by their proximity to housekeeping genes. By analyzing loops containing SNPs associated with cardiovascular disease, we identified an SNP-gene loop indicating the regulation mechanism of the GFOD1. Taken together, CharID universally predicts diverse chromatin loops beyond other state-of-the-art methods, which are limited by anchor types, and experimental techniques, which are limited by sensitivities drastically decaying with the genomic distance of anchors. Finally, we hosted Peaksniffer, a user-friendly web server that provides online prediction, query and visualization of OCRs and associated loops.

Collapse

Kodali S, Meyer-Nava S, Landry S, Chakraborty A, Rivera-Mulia JC, Feng W. Epigenomic signatures associated with spontaneous and replication stress-induced DNA double strand breaks. Front Genet 2022;13:907547. [PMID: 36506300 PMCID: PMC9730818 DOI: 10.3389/fgene.2022.907547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 11/07/2022] [Indexed: 11/25/2022] Open

Abstract

Common fragile sites (CFSs) are specific regions of all individuals' genome that are predisposed to DNA double strand breaks (DSBs) and undergo subsequent rearrangements. CFS formation can be induced in vitro by mild level of DNA replication stress, such as DNA polymerase inhibition or nucleotide pool disturbance. The mechanisms of CFS formation have been linked to DNA replication timing control, transcription activities, as well as chromatin organization. However, it is unclear what specific cis- or trans-factors regulate the interplay between replication and transcription that determine CFS formation. We recently reported genome-wide mapping of DNA DSBs under replication stress induced by aphidicolin in human lymphoblastoids for the first time. Here, we systematically compared these DSBs with regards to nearby epigenomic features mapped in the same cell line from published studies. We demonstrate that aphidicolin-induced DSBs are strongly correlated with histone 3 lysine 36 trimethylation, a marker for active transcription. We further demonstrate that this DSB signature is a composite effect by the dual treatment of aphidicolin and its solvent, dimethylsulfoxide, the latter of which potently induces transcription on its own. We also present complementing evidence for the association between DSBs and 3D chromosome architectural domains with high density gene cluster and active transcription. Additionally, we show that while DSBs were detected at all but one of the fourteen finely mapped CFSs, they were not enriched in the CFS core sequences and rather demarcated the CFS core region. Related to this point, DSB density was not higher in large genes of greater than 300 kb, contrary to reported enrichment of CFS sites at these large genes. Finally, replication timing analyses demonstrate that the CFS core region contain initiation events, suggesting that altered replication dynamics are responsible for CFS formation in relatively higher level of replication stress.

Collapse

Li R, Li L, Xu Y, Yang J. Machine learning meets omics: applications and perspectives. Brief Bioinform 2021;23:6425809. [PMID: 34791021 DOI: 10.1093/bib/bbab460] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Revised: 09/29/2021] [Accepted: 10/07/2021] [Indexed: 02/07/2023] Open

Belokopytova P, Fishman V. Predicting Genome Architecture: Challenges and Solutions. Front Genet 2021;11:617202. [PMID: 33552135 PMCID: PMC7862721 DOI: 10.3389/fgene.2020.617202] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Accepted: 12/15/2020] [Indexed: 12/22/2022] Open

Tao H, Li H, Xu K, Hong H, Jiang S, Du G, Wang J, Sun Y, Huang X, Ding Y, Li F, Zheng X, Chen H, Bo X. Computational methods for the prediction of chromatin interaction and organization using sequence and epigenomic profiles. Brief Bioinform 2021;22:6102668. [PMID: 33454752 PMCID: PMC8424394 DOI: 10.1093/bib/bbaa405] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 11/26/2020] [Accepted: 12/10/2020] [Indexed: 12/14/2022] Open

Kuang S, Wang L. Deep Learning of Sequence Patterns for CCCTC-Binding Factor-Mediated Chromatin Loop Formation. J Comput Biol 2020;28:133-145. [PMID: 33232622 DOI: 10.1089/cmb.2020.0225] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open

Fudenberg G, Kelley DR, Pollard KS. Predicting 3D genome folding from DNA sequence with Akita. Nat Methods 2020;17:1111-1117. [PMID: 33046897 PMCID: PMC8211359 DOI: 10.1038/s41592-020-0958-x] [Citation(s) in RCA: 169] [Impact Index Per Article: 33.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Accepted: 08/20/2020] [Indexed: 02/07/2023]

Fudenberg G, Kelley DR, Pollard KS. Predicting 3D genome folding from DNA sequence with Akita. Nat Methods 2020;17:1111-1117. [PMID: 33046897 DOI: 10.1101/800060] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Accepted: 08/20/2020] [Indexed: 05/20/2023]

Trieu T, Martinez-Fundichely A, Khurana E. DeepMILO: a deep learning approach to predict the impact of non-coding sequence variants on 3D chromatin structure. Genome Biol 2020;21:79. [PMID: 32216817 PMCID: PMC7098089 DOI: 10.1186/s13059-020-01987-4] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Accepted: 03/06/2020] [Indexed: 12/17/2022] Open