1
|
Reyna J, Fetter K, Ignacio R, Ali Marandi CC, Ma A, Rao N, Jiang Z, Figueroa DS, Bhattacharyya S, Ay F. Loop Catalog: a comprehensive HiChIP database of human and mouse samples. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.04.26.591349. [PMID: 38746164 PMCID: PMC11092438 DOI: 10.1101/2024.04.26.591349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
HiChIP enables cost-effective and high-resolution profiling of chromatin loops. To leverage the increasing number of HiChIP datasets, we developed Loop Catalog (https://loopcatalog.lji.org), a web-based database featuring loop calls from 1000+ distinct human and mouse HiChIP samples from 152 studies plus 44 high-resolution Hi-C samples. We demonstrate its utility for interpreting GWAS and eQTL variants through SNP-to-gene linking, identifying enriched sequence motifs and motif pairs, and generating regulatory networks and 2D representations of chromatin structure. Our catalog spans over 4.19M unique loops, and with embedded analysis modules, constitutes an important resource for the field.
Collapse
Affiliation(s)
- Joaquin Reyna
- Centers for Cancer Immunotherapy and Autoimmunity, La Jolla Institute for Immunology, La Jolla, CA 92037 USA
- Bioinformatics and Systems Biology Graduate Program University of California, San Diego, La Jolla, CA 92093 USA
| | - Kyra Fetter
- Centers for Cancer Immunotherapy and Autoimmunity, La Jolla Institute for Immunology, La Jolla, CA 92037 USA
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093 USA
| | - Romeo Ignacio
- Centers for Cancer Immunotherapy and Autoimmunity, La Jolla Institute for Immunology, La Jolla, CA 92037 USA
- Department of Mathematics, University of California San Diego, La Jolla, CA 92093 USA
| | - Cemil Can Ali Marandi
- Centers for Cancer Immunotherapy and Autoimmunity, La Jolla Institute for Immunology, La Jolla, CA 92037 USA
- Bioinformatics and Systems Biology Graduate Program University of California, San Diego, La Jolla, CA 92093 USA
| | - Astoria Ma
- Centers for Cancer Immunotherapy and Autoimmunity, La Jolla Institute for Immunology, La Jolla, CA 92037 USA
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093 USA
| | - Nikhil Rao
- Centers for Cancer Immunotherapy and Autoimmunity, La Jolla Institute for Immunology, La Jolla, CA 92037 USA
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093 USA
| | - Zichen Jiang
- Centers for Cancer Immunotherapy and Autoimmunity, La Jolla Institute for Immunology, La Jolla, CA 92037 USA
- School of Biological Sciences, University of California San Diego, La Jolla, CA 92093 USA
| | - Daniela Salgado Figueroa
- Centers for Cancer Immunotherapy and Autoimmunity, La Jolla Institute for Immunology, La Jolla, CA 92037 USA
- Bioinformatics and Systems Biology Graduate Program University of California, San Diego, La Jolla, CA 92093 USA
| | - Sourya Bhattacharyya
- Centers for Cancer Immunotherapy and Autoimmunity, La Jolla Institute for Immunology, La Jolla, CA 92037 USA
| | - Ferhat Ay
- Centers for Cancer Immunotherapy and Autoimmunity, La Jolla Institute for Immunology, La Jolla, CA 92037 USA
- Bioinformatics and Systems Biology Graduate Program University of California, San Diego, La Jolla, CA 92093 USA
- Department of Pediatrics, University of California San Diego, La Jolla, CA 92093 USA
| |
Collapse
|
2
|
Wang J, Nakato R. Churros: a Docker-based pipeline for large-scale epigenomic analysis. DNA Res 2024; 31:dsad026. [PMID: 38102723 PMCID: PMC11389749 DOI: 10.1093/dnares/dsad026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 11/23/2023] [Accepted: 12/13/2023] [Indexed: 12/17/2023] Open
Abstract
The epigenome, which reflects the modifications on chromatin or DNA sequences, provides crucial insight into gene expression regulation and cellular activity. With the continuous accumulation of epigenomic datasets such as chromatin immunoprecipitation followed by sequencing (ChIP-seq) data, there is a great demand for a streamlined pipeline to consistently process them, especially for large-dataset comparisons involving hundreds of samples. Here, we present Churros, an end-to-end epigenomic analysis pipeline that is environmentally independent and optimized for handling large-scale data. We successfully demonstrated the effectiveness of Churros by analyzing large-scale ChIP-seq datasets with the hg38 or Telomere-to-Telomere (T2T) human reference genome. We found that applying T2T to the typical analysis workflow has important impacts on read mapping, quality checks, and peak calling. We also introduced a useful feature to study context-specific epigenomic landscapes. Churros will contribute a comprehensive and unified resource for analyzing large-scale epigenomic data.
Collapse
Affiliation(s)
- Jiankang Wang
- School of Biomedical Sciences, Hunan University, Changsha, Hunan, China
- Institute for Quantitative Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Ryuichiro Nakato
- Institute for Quantitative Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| |
Collapse
|
3
|
Agarwal A, Korsak S, Choudhury A, Plewczynski D. The dynamic role of cohesin in maintaining human genome architecture. Bioessays 2023; 45:e2200240. [PMID: 37603403 DOI: 10.1002/bies.202200240] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 08/03/2023] [Accepted: 08/07/2023] [Indexed: 08/22/2023]
Abstract
Recent advances in genomic and imaging techniques have revealed the complex manner of organizing billions of base pairs of DNA necessary for maintaining their functionality and ensuring the proper expression of genetic information. The SMC proteins and cohesin complex primarily contribute to forming higher-order chromatin structures, such as chromosomal territories, compartments, topologically associating domains (TADs) and chromatin loops anchored by CCCTC-binding factor (CTCF) protein or other genome organizers. Cohesin plays a fundamental role in chromatin organization, gene expression and regulation. This review aims to describe the current understanding of the dynamic nature of the cohesin-DNA complex and its dependence on cohesin for genome maintenance. We discuss the current 3C technique and numerous bioinformatics pipelines used to comprehend structural genomics and epigenetics focusing on the analysis of Cohesin-centred interactions. We also incorporate our present comprehension of Loop Extrusion (LE) and insights from stochastic modelling.
Collapse
Affiliation(s)
- Abhishek Agarwal
- Centre of New Technologies, University of Warsaw, Warsaw, Poland
| | - Sevastianos Korsak
- Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
| | | | - Dariusz Plewczynski
- Centre of New Technologies, University of Warsaw, Warsaw, Poland
- Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
| |
Collapse
|
4
|
Wang J, Nakato R. Comprehensive multiomics analyses reveal pervasive involvement of aberrant cohesin binding in transcriptional and chromosomal disorder of cancer cells. iScience 2023; 26:106908. [PMID: 37283809 PMCID: PMC10239702 DOI: 10.1016/j.isci.2023.106908] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 02/27/2023] [Accepted: 05/12/2023] [Indexed: 06/08/2023] Open
Abstract
Chromatin organization, whose malfunction causes various diseases including cancer, is fundamentally controlled by cohesin. While cancer cells have been found with mutated or misexpressed cohesin genes, there is no comprehensive survey about the presence and role of abnormal cohesin binding in cancer cells. Here, we systematically identified ∼1% of cohesin-binding sites (701-2,633) as cancer-aberrant binding sites of cohesin (CASs). We integrated CASs with large-scale transcriptomics, epigenomics, 3D genomics, and clinical information. CASs represent tissue-specific epigenomic signatures enriched for cancer-dysregulated genes with functional and clinical significance. CASs exhibited alterations in chromatin compartments, loops within topologically associated domains, and cis-regulatory elements, indicating that CASs induce dysregulated genes through misguided chromatin structure. Cohesin depletion data suggested that cohesin binding at CASs actively regulates cancer-dysregulated genes. Overall, our comprehensive investigation suggests that aberrant cohesin binding is an essential epigenomic signature responsible for dysregulated chromatin structure and transcription in cancer cells.
Collapse
Affiliation(s)
- Jiankang Wang
- School of Biomedical Sciences, Hunan University, Changsha, China
- Institute for Quantitative Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Ryuichiro Nakato
- Institute for Quantitative Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| |
Collapse
|